Subject: XML::Parser::Expat::current_byte() reports negative offsets for large files
XML::Parser::Expat::current_byte() reports negative offsets for large files. I discovered this by piping data from the Wikipedia dump files into this script:
https://secure.mysociety.org/cvstrac/getfile/mysociety/placeopedia/bin/wikipediatitles?v=1.2
After 2**31 bytes (2 GiB) of input, it starts printing negative progress indications.
perl 5.8.6, FreeBSD 5.2.1-RELEASE-p13, x86
Presumably the issue here is that the module or the underlying library is using a signed 32-bit counter for the file offset. perl is compiled with -Duse64bitint, so I don't think this is a limitation of the perl interpreter itself.
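
To illustrate the suspected wraparound, here is a minimal C sketch of what a byte offset looks like once it has been squeezed into a signed 32-bit integer. This is not code from XML::Parser or Expat. The helper name as_signed_32 is hypothetical, and the assumption that the counter is a signed 32-bit type is only the guess made above.

```c
#include <stdint.h>

/* Hypothetical helper (not part of XML::Parser or Expat): returns the value
 * a 64-bit byte offset would take if it were stored in a signed 32-bit
 * counter, assuming two's-complement wraparound. */
static int32_t as_signed_32(uint64_t offset)
{
    /* Keep only the low 32 bits, as a 32-bit counter would. */
    uint32_t low = (uint32_t)(offset & 0xFFFFFFFFu);

    /* Values with the top bit set represent negative numbers. Compute the
     * result arithmetically to avoid an implementation-defined cast. */
    if (low >= 0x80000000u)
        return (int32_t)((int64_t)low - 4294967296LL);
    return (int32_t)low;
}
```

Under that assumption, an offset of 2147483647 (2**31 - 1) is still reported correctly, 2147483648 (2**31) comes out as -2147483648, and 3000000000 comes out as -1294967296. That would match the negative progress figures appearing right after the 2**31-byte mark.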