Bug #5744 for HTML-TableExtract: keep_html breaks header search...

Sun Mar 21 18:41:40 2004 Guest - Ticket created

Subject:

keep_html breaks header search...

Running under perl 5.8.1-92 with HTML::Parser 3.35 on Fedora Core 1.
Just tried it with 1.07 and 1.08 -- same problem.

I have code that looks like this:

    use HTML::TableExtract;

    my $te = new HTML::TableExtract (
        headers   => [ qw(Company Action) ],
        keep_html => 0,
    );
    $te->parse($Mech->content);
    my($table) = $te->table_states or die "no such table";
    for my $row ($table->rows) {
        my %m;
        print "|".join(" # ", @$row), "|\n";
    }

Which spits out a bunch of stuff when pointed at the page (which, sadly, I can't show you); change keep_html to 1 and it does not find any tables.

I understand keep_html looks through the header HTML as well as the text, but I'm looking for single words here, so the search shouldn't be affected.

If that's not enough info, I should be able to work up an example page where it fails.

Thu Feb 24 21:58:35 2005 MSISK [...] cpan.org - Status changed from 'new' to 'resolved'

Thu Feb 24 21:58:35 2005 MSISK [...] cpan.org - Given to MSISK