This queue is for tickets about the HTML-Tree CPAN distribution.

Report information
The Basics
Id: 18570
Status: resolved
Priority: 0/
Queue: HTML-Tree

People
Owner: Nobody in particular
Requestors: mjd [...] plover.com
Cc:
AdminCc:

Bug Information
Severity: Critical
Broken in: 3.13
Fixed in: 3.22



Subject: More entity problems
use HTML::TreeBuilder;
my $TB = HTML::TreeBuilder->new();
my $html = $TB->parse("This &sim; is a twiddle")->eof->elementify();
print $html->as_HTML("\0");

The input is "This &sim; is a twiddle"; the output should be the same. Instead, "&sim;" has been translated to "\x{e2}\x{c3c}". Unlike the way in which "&#383" is translated to "&#383", this translation is performed at input time, not at output time; you can verify this by dumping out the internals of $html.
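One way to check what the tree holds after parsing, before any output-time encoding, is to list the code points of the stored text. This is a minimal sketch, not the reporter's own procedure: the helper non_ascii_codepoints is hypothetical and not part of HTML-Tree, and the input is assumed to contain the "sim" entity. If the entity is decoded correctly at input time, the only non-ASCII character listed should be the tilde operator, U+223C.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of HTML-Tree): return the code points of all
# non-ASCII characters in a string, formatted as U+XXXX.
sub non_ascii_codepoints {
    my ($text) = @_;
    my @found;
    for my $ch (split //, $text) {
        push @found, sprintf("U+%04X", ord($ch)) if ord($ch) > 127;
    }
    return @found;
}

# Run the check from the report only if HTML::TreeBuilder is installed.
if (eval { require HTML::TreeBuilder; 1 }) {
    my $tree = HTML::TreeBuilder->new;
    $tree->parse("This &sim; is a twiddle");
    $tree->eof;

    # as_text returns the text as stored in the tree, so anything unexpected
    # here was introduced while parsing, not while serializing.
    print "$_\n" for non_ascii_codepoints($tree->as_text);

    $tree->delete;
}
```

Calling $tree->dump instead prints the whole element structure, which may be easier to read for larger inputs.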
Can't reproduce with 3.18 and up. Please resubmit with a test case if you are still having this issue. As an aside, I have added this case as a test in HTML-Tree 3.22, which will be released as part of the Chicago Hackathon this weekend.