
This queue is for tickets about the HTML-Parser CPAN distribution.

Report information
The Basics
Id: 64407
Status: rejected
Priority: 0/
Queue: HTML-Parser

People
Owner: Nobody in particular
Requestors: njh [...] bandsman.co.uk
Cc:
AdminCc:

Bug Information
Severity: Important
Broken in: 3.68
Fixed in: (no value)



Subject: HTML::Entities doesn't handle Austria in Danish correctly
This program:

    #!/usr/bin/perl -wT

    use strict;
    use warnings;
    use diagnostics;
    use Locale::Country::Multilingual;
    use HTML::Entities;

    my $l = Locale::Country::Multilingual->new();
    my $foo = $l->country2code('Austria', 'en');
    print encode_entities($l->code2country($foo, 'da')) . "\n";

Should print Østrig, but it prints Østrig.
This appears to be an issue with Locale::Country::Multilingual. It outputs UTF-8 encoded bytes instead of Unicode strings.

In <http://search.cpan.org/dist/Locale-Country-Multilingual/lib/Locale/Country/Multilingual/Unicode.pod> you can read that it might help to add a use_io_layer => 1 argument to the constructor. I tried that and got &Oslash;strig as the output from your sample program.

------------------------

    #!/usr/bin/perl -wT

    use strict;
    use warnings;
    use diagnostics;
    use Locale::Country::Multilingual;
    use HTML::Entities;

    my $l = Locale::Country::Multilingual->new(use_io_layer => 1);
    my $foo = $l->country2code('Austria', 'en');
    print encode_entities($l->code2country($foo, 'da')) . "\n";
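
The difference between UTF-8 encoded bytes and a Unicode string can also be seen without Locale::Country::Multilingual. The following is only a minimal sketch: the literal byte string is an assumed stand-in for what the module returns without use_io_layer, and decoding it with Encode::decode is an illustration, not a fix proposed in this ticket.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Encode qw(decode);
use HTML::Entities qw(encode_entities);

# Assumed stand-in for the module's output: the UTF-8 encoded bytes of
# "Østrig". U+00D8 is encoded as the two bytes 0xC3 0x98.
my $bytes = "\xC3\x98strig";

# Decode the bytes into a Perl character (Unicode) string.
my $chars = decode('UTF-8', $bytes);

# On the byte string, encode_entities sees 0xC3 and 0x98 as two separate
# characters, so the result is not the entity for U+00D8.
my $entities_from_bytes = encode_entities($bytes);

# On the decoded string, it sees the single character U+00D8.
my $entities_from_chars = encode_entities($chars);

print "from bytes: $entities_from_bytes\n";
print "from chars: $entities_from_chars\n";   # &Oslash;strig
```

If the data cannot be obtained as a character string from the source, decoding it once at the point where it enters the program keeps later calls such as encode_entities working on characters rather than bytes.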