Dne út 04.bře.2008 16:47:08, MARKOV napsal(a):
Show quoted text> The character-set used when the file is opened by the parser is not
> documented in the man-page, and cannot be specified. UTF-8 would be
> a nice default, but probably Latin-1 is used. Failures are reported
for
Show quoted text> UTF-16 encoded (Windows) files.
>
> Could you add some statement to XML::LibXML::Parser?
The parser is XML 1.0 conformant. If your file does not contain a <?xml
version="1.0" encoding="...." ?> declaration, then UTF-8 or UTF-16 is
supposed (and BOM, if found, is taken into account). Note that your
filehandle should not have any Perl I/O layers on it, that is, you
should do
binmode $fh;
if not sure.
If you still see errors, then please attach a sample input file and
copy-paste the output you got. Also indicate the versions of
XML::LibXML and libxml2 you have installed.
-- Petr