Skip Menu |

This queue is for tickets about the HTML-LinkExtractor CPAN distribution.

Report information
The Basics
Id: 7811
Status: resolved
Priority: 0/
Queue: HTML-LinkExtractor

People
Owner: PODMASTER [...] cpan.org
Requestors: cindydp [...] hotmail.com
Cc:
AdminCc:

Bug Information
Severity: Important
Broken in: (no value)
Fixed in: (no value)



Subject: not the exact text extracted by using new HTML::LinkExtractor(undef, undef,1)
Due to system migration, I have installed the latest 0.11 version on the server. However, I found out not the exact text is extracted. eg. #!/usr/bin/perl use HTML::LinkExtractor; use Data::Dumper; my $input = q{If <a href="http://perl.com/"> I am a LINK!!! </a>}; my $LX = new HTML::LinkExtractor(undef,undef,1); $LX->parse(\$input); ## print Dumper($LX->links); for my $Link(@{$LX->links}) { print "$$Link{href}\n"; print "$$Link{_TEXT}\n"; } _END_ ============================== Gives the results: a http://perl.com/ <a href="http://perl.com/"> I am a LINK!!! ====================== However, I had expected only "I am a LINK!!!" will be extracted. This did not happen when I used version 0.06 Please advise! Thanks!
Until it hits the rest of cpan you can download it at http://pause.perl.org/incoming/HTML-LinkExtractor-0.121.tar.gz