Subject: | Malformed xref in PDF file |
Date: | Tue, 05 May 2009 17:27:36 +0200 |
To: | bug-pdf-api3 [...] rt.cpan.org |
From: | André Warnier <warnier [...] mira-consulting.net> |
Hi.
PDF::API3 v 3.001
We process a lot of PDF documents from a lot of sources, to extract the
contained meta-data (and if possible and allowed, text). We do not write
PDF documents.
Malformed xref in PDF file at
C:/Perl/site/lib/PDF/API3/Compat/API2/Basic/PDF/File.pm line 1198
is the kind of error we encounter often when processing PDF documents
these days. It happens when opening a document like so :
eval { $pdf = PDF::API3::Compat::API2::Basic::PDF::File->open($pdfpath) };
It seems to happen for a variety of PDF types :
PDF version > 1.5
"view optimised" PDFs
PDF "form" documents
(and of course a number of PDFs that are maybe just malformed).
We would welcome some general explanation of what we can expect
PDF::API3 to be able to open and allow meta-data and/or text extraction,
and what we should not expect.
Thanks in advance.
André Warnier
Mira Consulting GmbH
www.mira-consulting.net