Subject: | PPI 0.906 chokes on embedded POD with umlauts |
The PPI tokenizer chokes on Perl modules containing umlauts in their embedded POD documentation. Example:
wget
http://search.cpan.org/src/MSCHILLI/Log-Log4perl-0.51/lib/Log/Log4perl.pm
#!/usr/bin/perl
use PPI::Document;
my $d = PPI::Document->load("Log4perl.pm");
$d or print PPI::Tokenizer::errstr(), "\n";
results "Source code contains unsupported characters (first one encountered was '�')" because of the line
Ceki Gülcü, "Short introduction to log4j",
somewhere in the POD part. Would be great if Latin-1 chars would be acceptable as well, perl allows them in strings, regexes and POD.
Anyway, thanks for this great module!