Bug #7811 for HTML-LinkExtractor: not the exact text extracted by using new HTML::LinkExtractor(undef, undef,1)

Wed Sep 29 06:04:43 2004 Guest - Ticket created

Subject:

not the exact text extracted by using new HTML::LinkExtractor(undef, undef,1)

Due to system migration, I have installed the latest 0.11 version on the server. However, I found out not the exact text is extracted. eg. #!/usr/bin/perl use HTML::LinkExtractor; use Data::Dumper; my $input = q{If <a href="http://perl.com/"> I am a LINK!!! </a>}; my $LX = new HTML::LinkExtractor(undef,undef,1); $LX->parse(\$input); ## print Dumper($LX->links); for my $Link(@{$LX->links}) { print "$$Link{href}\n"; print "$$Link{_TEXT}\n"; } _END_ ============================== Gives the results: a http://perl.com/ <a href="http://perl.com/"> I am a LINK!!! ====================== However, I had expected only "I am a LINK!!!" will be extracted. This did not happen when I used version 0.06 Please advise! Thanks!

Thu Sep 30 00:21:50 2004 PODMASTER [...] cpan.org - Taken

Thu Sep 30 01:39:24 2004 PODMASTER [...] cpan.org - Correspondence added

Until it hits the rest of cpan you can download it at http://pause.perl.org/incoming/HTML-LinkExtractor-0.121.tar.gz

Thu Sep 30 01:39:27 2004 PODMASTER [...] cpan.org - Status changed from 'new' to 'resolved'