Re: Extracting comments from PDF files



Yes, I do know that PDF::Extract can the author, creation date and
other information but it is not capable of extracting the "comments"
section. By the "comments" section, I mean that part of the PDF file
where you see the hand like symbol and the note nearby popping up like
a tooltip? It is akin to the comments in a Microsoft Word document.

I also know that once I can get this information, I can write the
information to an XML file. So, my question is how I could extract the
comment information. I could use XML::Writer provided I can extract the
data.

Could you please think of anyway of extracting this 'comment' part?

Thanks,
Vince

.



Relevant Pages

  • Re: extract text from PDF file
    ... If the problem is a CIDFont, then you will get bigger garbage I'm ... I'm with Bugbear on this one, if you can't extract the 'text' from a PDF ... resulting from converting a PDF file. ...
    (comp.lang.postscript)
  • Extract Text Coordinates from PDF
    ... I was wondering if anyone could recommend a program which can extract ... the starting coordinates of each word in a PDF file ... Prev by Date: ...
    (comp.text.pdf)
  • Re: Does anyone know of a PDF-to-something else utility ?
    ... it's the site of the company that offers PDF-XChange and the Tools package ... that extracted the text and images from your pdf file. ... >>It would be a good test for the pdf program I brag about, PDF-XChange. ... > I can report that Don's software was able to extract the ascii in my ...
    (microsoft.public.windowsxp.general)
  • Re: Extract PDF content?
    ... Is there any gem or library which allows to extract text from a .PDF file?, any for Word or OpenOffice files? ...
    (comp.lang.ruby)
  • Re: Fw: PDF library for reading PDF files
    ... Andreas Lobinger wrote: ... >Peter Galfi schrieb: ... >> trying to extract from the PDF file is the text, ...
    (comp.lang.python)