Subject: Mechanize seems to discard the first URL inside the first <p> tag in an HTML page
If you have a page with:
<p><a href="http://www.url1.com/gi1?a=1">test1</a><p>
<p><a href="http://www.url2.com/gi2?a=2">test2</a><p>
mech-dump -links
will return only:
http://www.url2.com/gi2?a=2
Or, with a page containing:
<p><a href="http://www.first.com/gi1?a=1">first</a><a
href="http://www.url1.com/gi1?a=1">test1</a><p>
<p><a href="http://www.url2.com/gi2?a=2">test2</a><p>
mech-dump -links
will return:
http://www.url1.com/gi1?a=1
http://www.url2.com/gi2?a=2
In both cases, the first link inside the first <p> of the page is discarded.
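To make the two cases easier to re-run, here is a minimal reproduction sketch. It writes the two pages from this report to temporary files and runs mech-dump against each. The file names case1.html and case2.html are made up for illustration, and the script assumes the installed mech-dump accepts a local file path and the long form --links; adjust if your version differs.

```shell
#!/bin/sh
# Write the two test pages from the report into a scratch directory.
# (case1.html / case2.html are hypothetical names, not from the report.)
workdir=$(mktemp -d)

cat > "$workdir/case1.html" <<'EOF'
<p><a href="http://www.url1.com/gi1?a=1">test1</a><p>
<p><a href="http://www.url2.com/gi2?a=2">test2</a><p>
EOF

cat > "$workdir/case2.html" <<'EOF'
<p><a href="http://www.first.com/gi1?a=1">first</a><a
href="http://www.url1.com/gi1?a=1">test1</a><p>
<p><a href="http://www.url2.com/gi2?a=2">test2</a><p>
EOF

# Dump the links of each page; skip quietly if mech-dump is not installed.
for page in "$workdir/case1.html" "$workdir/case2.html"; do
    echo "== $page"
    if command -v mech-dump >/dev/null 2>&1; then
        mech-dump --links "$page"
    else
        echo "mech-dump not found; skipping"
    fi
done
```

If the behaviour described above is present, the first page should list one link instead of two, and the second page two links instead of three.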