----- Цитат от Toby Inkster via RT (bug-HTML-HTML5-Parser@rt.cpan.org), на 18.08.2012 в 09:54 -----
Show quoted text>>On 2012-08-16T15:47:33+01:00, KARAVELOV wrote:
>>The parser dies when trying to parse broken xhtml with namespaced
>>attributes. This is around
>>line 2529. Putting the condition in 'eval' fixes the problem for me.
Show quoted text>Do you have an example document that triggers the failure? Can you attach
>it to this bug report?
Here is my test case:
perl -MURI -MHTML::HTML5::Parser -E ' my $uri = URI->new("
http://www.blitz.bg/news/article/151210"); my $parser = HTML::HTML5::Parser->new; my $doc=$parser->parse_html_file($uri);'
And here is the error in TagSoupParsers
NAMESPACE ERROR: Attribute without a prefix cannot be in a namespace at /usr/share/perl5/HTML/HTML5/Parser/TagSoupParser.pm line 2524
All the articles at www.blitz.bg are severely broken. The error is on the second line "html xmlns:fb=...."
Attached is a minimal test case document.
--
Luben Karavelov