Yeah, that's a known issue with CAM::PDF: compressed objects are a
PDF 1.5 feature, and I've still got several PDF 1.4 features to
do... I started writing support for compressed objects over the
summer, but it's still very buggy, so I didn't release it. With any
luck, the next CAM::PDF release will help, but I can't predict when
that will be.

If you are financially motivated to get a solution quicker than
"someday", I would accept a contribution to rearrange priorities.
But otherwise, I would recommend looking for a setting in your PDF
generation software that says something like "save as PDF
1.4-compatible".
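
To check whether a given file is likely affected before re-exporting it,
a rough heuristic is to read the header version and scan for object-stream
dictionaries. The sketch below is Python, not Perl, and does not use
CAM::PDF; the function name `pdf_compat_report` and the dictionary keys
are made up for illustration. A raw byte scan like this is not a real
parser and can be fooled (for example, the header version can be
overridden elsewhere in the file).

```python
import re


def pdf_compat_report(data: bytes) -> dict:
    """Heuristic look at raw PDF bytes (hypothetical helper, not a parser)."""
    # The header normally sits at the very start, e.g. b"%PDF-1.5".
    m = re.search(rb"%PDF-(\d+)\.(\d+)", data[:1024])
    version = (int(m.group(1)), int(m.group(2))) if m else None

    # Object streams (the "compressed objects" container) and
    # cross-reference streams are both PDF 1.5 features.
    has_objstm = re.search(rb"/Type\s*/ObjStm\b", data) is not None
    has_xref_stream = re.search(rb"/Type\s*/XRef\b", data) is not None

    return {
        "header_version": version,
        "has_object_streams": has_objstm,
        "has_xref_streams": has_xref_stream,
    }
```

Calling it on the bytes of a file (for example
`pdf_compat_report(open(path, "rb").read())`, with `path` being whatever
file you want to inspect) gives a quick hint: if `has_object_streams` is
true, the file uses the PDF 1.5 feature discussed above.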

When I finish support for compressed objects, I'll be sure to handle
the generation number correctly -- I appreciate that part of your
feedback, because I probably would have stumbled on that same detail.
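
For illustration only, the lenient lookup described in the quoted Xpdf
note could be sketched like this. It is Python, not CAM::PDF code, and
says nothing about how CAM::PDF will actually implement it. The class
names, the `resolve` function, and the sample values other than object
1046 with generation 1 are hypothetical. The two entry kinds mirror the
in-use and compressed entries of a PDF 1.5 cross-reference stream.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Union


@dataclass(frozen=True)
class InUse:
    """Ordinary object: byte offset in the file plus a generation number."""
    offset: int
    generation: int


@dataclass(frozen=True)
class Compressed:
    """Object stored inside an object stream; generation is always 0."""
    stream_objnum: int
    index: int


XrefEntry = Union[InUse, Compressed]


def resolve(xref: Dict[int, XrefEntry], objnum: int, gen: int) -> Optional[XrefEntry]:
    """Look up a reference 'objnum gen R' in a cross-reference mapping."""
    entry = xref.get(objnum)
    if entry is None:
        return None
    if isinstance(entry, Compressed):
        # Lenient: ignore the generation in the reference, since a
        # compressed object can only ever be generation 0.
        return entry
    # Ordinary objects must match the generation exactly.
    return entry if entry.generation == gen else None
```

With `xref = {1046: Compressed(1040, 3)}` (stream number and index made
up), `resolve(xref, 1046, 1)` still returns the entry even though the
reference carries generation 1, which is the situation in the reported
file.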

Chris

On Sep 12, 2008, at 6:43 AM, Eric Harlow via RT wrote:
> Fri Sep 12 07:43:15 2008: Request 39214 was acted upon.
> Transaction: Ticket created by linuxgeek@yahoo.com
> Queue: CAM-PDF
> Subject: Problem parsing PDF files
> Broken in: (no value)
> Severity: (no value)
> Owner: Nobody
> Requestors: linuxgeek@yahoo.com
> Status: new
> Ticket <URL: http://rt.cpan.org/Ticket/Display.html?id=39214 >
>
>
>
> I'm trying to get the text out of a PDF file. When I try, I get
> the following error.
>
> Could not find PDF cross-ref table at location 1152553/-1/0
> 0 (empty)
>
> I am using Perl 5.8.8 on Linux Fedora 9.
>
> This same file had an issue with Xpdf/pdftotext. The developer of
> that product said
>
> "The issue is that there are references to object #1046, with
> generation 1 -- but object #1046 is a compressed object, and
> compressed objects are always generation 0, by definition. (Short
> version: there's a bug in the PDF generation software.) It looks like
> Acrobat ignores the generation number entirely for compressed objects,
> so I'll change Xpdf to do the same."
>
> I can provide a copy of the PDF file if you want to take a look.