Skip Menu |

This queue is for tickets about the HTML-TableExtract CPAN distribution.

Report information
The Basics
Id: 5744
Status: resolved
Priority: 0/
Queue: HTML-TableExtract

People
Owner: MSISK [...] cpan.org
Requestors: luke [...] daeron.com
Cc:
AdminCc:

Bug Information
Severity: Normal
Broken in:
  • 1.07
  • 1.08
Fixed in: (no value)



Subject: keep_html breaks header search...
Running under perl 5.8.1-92 with HTML::Parser 3.35 on Fedora Core 1 Just tried it with 1.07 and 1.08 -- same problem I have code that looks like this: use HTML::TableExtract; my $te = new HTML::TableExtract ( headers => [ qw(Company Action) ], keep_html => 0, ); $te->parse($Mech->content); my($table) = $te->table_states or die "no such table"; for my $row ($table->rows) { my %m; print "|".join(" # ", @$row), "|\n"; } Which spits out a bunch of stuff pointed at the page (which sadly, I can't show you); change keep_html to 1 and it does not find any tables. I understand keep_html looks through the header html as well as text but I'm looking for single words here, shouldn't be affected. If that's not enough info I should be able to work up an example page where it fails.