This regex parses an anchor link and grabs the href, hash, and the text.
\<a\shref\=\"(\w+)\.html(\#\w+)\"\>(.+)\<\/a\>