Skip Menu |

This queue is for tickets about the PDF-API3 CPAN distribution.

Report information
The Basics
Id: 45743
Status: new
Priority: 0/
Queue: PDF-API3

People
Owner: Nobody in particular
Requestors: warnier [...] mira-consulting.net
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: (no value)



Subject: Malformed xref in PDF file
Date: Tue, 05 May 2009 17:27:36 +0200
To: bug-pdf-api3 [...] rt.cpan.org
From: André Warnier <warnier [...] mira-consulting.net>
Hi. PDF::API3 v 3.001 We process a lot of PDF documents from a lot of sources, to extract the contained meta-data (and if possible and allowed, text). We do not write PDF documents. Malformed xref in PDF file at C:/Perl/site/lib/PDF/API3/Compat/API2/Basic/PDF/File.pm line 1198 is the kind of error we encounter often when processing PDF documents these days. It happens when opening a document like so : eval { $pdf = PDF::API3::Compat::API2::Basic::PDF::File->open($pdfpath) }; It seems to happen for a variety of PDF types : PDF version > 1.5 "view optimised" PDFs PDF "form" documents (and of course a number of PDFs that are maybe just malformed). We would welcome some general explanation of what we can expect PDF::API3 to be able to open and allow meta-data and/or text extraction, and what we should not expect. Thanks in advance. André Warnier Mira Consulting GmbH www.mira-consulting.net