Re: Cam::PDF question



Thanks a lot! I'll look into it.

Greger wrote:
LtCommander wrote:

Hi all,

I am using the CAM::PDF module to extract text from PDF files. (It's an
AMAZING module!!) You can pretty much do everything with it.

My snippet for extracting text from a PDF file is:

$pdf = CAM::PDF->new($Fil);
$page = $pdf->getPageText($Pg);

This works fine for all pages without any graphics. I am able to print
the contents of $page without any problems whatsoever. However, if a
particular page has some sort of an inline graphic, the $page returns
an empty value!

I was wondering:
- If somebody knows how to remove all the graphics from the $pdf object
before running the next line of code. I think that should fix it and no
longer return any empty string.

I've tried searching plenty but no luck so far!

Would be grateful for your help.

Vince
use PDF::API, available from cpan.

(I also tried CAM::PDF some while ago but pdf::api is better.)

--
Qx RSS Reader 1.2.6 released
RSS Reader for Linux.
http://www.gregerhaga.net/qxrssreader.php

.



Relevant Pages

  • Re: Cam::PDF question
    ... I am using the CAM::PDF module to extract text from PDF files. ... This works fine for all pages without any graphics. ... RSS Reader for Linux. ...
    (comp.lang.perl.misc)
  • Re: Cam::PDF question
    ... I am using the CAM::PDF module to extract text from PDF files. ... This works fine for all pages without any graphics. ... longer return any empty string. ...
    (comp.lang.perl.misc)
  • Cam::PDF question
    ... I am using the CAM::PDF module to extract text from PDF files. ... This works fine for all pages without any graphics. ... longer return any empty string. ...
    (comp.lang.perl.misc)
  • Re: Powerpoint 2004 vs. Powerpoint v.X-2004 is SLOOOOOOOOOOW1
    ... PPT 2004 can be slower than v.X, but if you have the latest updates installed it should not be that much slower. ... I think in most cases a G5 with a good graphics card should not feel slower. ... I have prepared a rather long Powerpoint slideshow on ... what is it with Microsoft applications and pdf files. ...
    (microsoft.public.mac.office.powerpoint)
  • Re: Powerpoint 2004 vs. Powerpoint v.X-2004 is SLOOOOOOOOOOW1
    ... PPT 2004 can be slower than v.X, but if you have the latest updates ... revA, 1 Gig, OSX.4, with an ATI 9600 graphics card) was painfully slow. ... PDF files only display a poster of the first page of the document. ... I am worried that Powerpoint, ...
    (microsoft.public.mac.office.powerpoint)