Subject: More entity problems
use HTML::TreeBuilder;
my $TB = HTML::TreeBuilder->new();
my $html = $TB->parse("This ∼ is a twiddle")->eof->elementify();
print $html->as_HTML("\0");
The input is "This ∼ is a twiddle"; the output should be the same.
Instead, "∼" has been translated to "\x{e2}\x{c3c}".
Unlike the translation of "ſ" to "&#383", which happens at output
time, this translation is performed at input time, during parsing;
you can verify this by dumping out the internals of $html.