Creating java regex to get href link

21,590

You do not have to use replaceAll. Better use pattern groups like the following:

Pattern p = Pattern.compile("href=\"(.*?)\"");
Matcher m = p.matcher(html);
String url = null;
if (m.find()) {
    url = m.group(1); // this variable should contain the link URL
}

If you have several links into your HTML perform m.find() in loop.

Share:
21,590
user1070866
Author by

user1070866

Updated on October 30, 2020

Comments

  • user1070866
    user1070866 over 3 years

    Sorry if this has been asked before, but I couldn't find any answers on the web. I'm having a hard time figuring out the inverse to this regex:

    "\"[^>]*\">"

    I want to use replaceAll to replace everything except the link. So if I had a tag similar to this:

    <p><a href="http://www.google.com">Google</a></p>

    I need a regex that would satisfy this:

    s.replaceAll(regex, "");

    to give me this output:

    http://www.google.com

    I know there are better ways to do this, but I have to use a regex. Any help is really appreciated, thanks!