Skip Menu |

This queue is for tickets about the Text-PDF CPAN distribution.

Report information
The Basics
Id: 120334
Status: new
Priority: 0/
Queue: Text-PDF

People
Owner: Nobody in particular
Requestors: 'spro^^*%*^6ut# [...] &$%*c
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: (no value)



Subject: [PATCH] Allow whitespace before xref
PDFs generated by ABBYY FIneReader (and some old version of Acrobat Distiller) often have whitespace before the xref token; i.e., the xref token comes a couple of bytes after the offset specified by startxref. The attached patch makes Text::PDF tolerate such files.
Subject: open_DvZDcvGb.txt
--- a/lib/Text/PDF/File.pm 2016-08-16 08:01:48.000000000 -0700 +++ b/lib/Text/PDF/File.pm 2017-02-22 18:04:49.000000000 -0800 @@ -1063,9 +1063,9 @@ $fh = $self->{' INFILE'}; $fh->seek($xpos, 0); $fh->read($buf, 22); - if ($buf !~ m/^xref$cr/oi) + if ($buf !~ m/^$ws_char*xref$cr/oi) { die "Malformed xref in PDF file $self->{' fname'}"; } - $buf =~ s/^xref$cr//oi; + $buf =~ s///oi; $xlist = {}; while ($buf =~ m/^([0-9]+)$ws_char+([0-9]+)$cr(.*?)$/so)