Subject: Parser crash on invalid attributes
Date: Sun, 13 Mar 2016 13:38:39 +0100
To: bug-HTML-HTML5-Parser [...] rt.cpan.org
From: Max Maischein <corion [...] cpan.org>
Hello Toby,
thanks for writing one of the few HTML5 parsers!
While importing/cleaning some horrible HTML with some more horrible
regular expressions, I found a small bug. The parser crashes when it
encounters invalid attributes. I would expect the parser to be more
lenient and simply ignore such attributes.
<p "123px">
causes the parser to crash.
Attached is a patch against 0.301 which includes a test case. I didn't
find the module on GitHub, otherwise I'd have created a pull request. I
don't have Mercurial installed and am not sure whether you still
actively use Bitbucket.org.
-max