As many people are unaware, email addresses require a library to parse. Simple regexes, like @(.*)
, are not sufficient. Email addresses can even contain comments, which can contain characters like @
, breaking simple regexes.
There is a Node.js library that parses RFC 2822 addresses:
var address = addresses[0];
console.log("Email address: " + address.address);
console.log("Email name: " + address.name());
console.log("Reformatted: " + address.format());
console.log("User part: " + address.user());
console.log("Host part: " + address.host());
which is an almost direct port of the perl module Mail::Address
.
This is something that I would expect to exist in Java's InternetAddress
class, but it doesn't break things down any further than the full address, which can include e.g. [email protected]
. But I'm trying to extract the gmail.com
part, which it doesn't include a method to do.
I'm surprised I can't find a common library that solves this, but presumably many people have this problem. How can this be solved using a library or no?
If you need to just get domain part from email address (be aware of mailing groups since they do not have @) you can do like this:
I used lastIndexOf here since by RFC2822 email address might contain more than one @ symbols (if it is escaped). If you want to skip mailing groups there is method in InternetAddress class isGroup()
PS also it could be that address contains routing information:
Or address literals: