Re: How to extract text from an PDF document

From: Nils Boedeker (info_at_nbsoft.de)
Date: 02/18/05


Date: Fri, 18 Feb 2005 12:55:39 +0100

Hi Girish,

I have do it and load the trial...

And found some problems (Chrashs during Extraction). I send you an eMail
about this to support@gnostice.com and prepare some examples PDFs as
download.

I like very much to use your application but the problems should be
solved.

Other questions... is the PDF Toolkit "Threadsave" so that I can use it
in ISAPI applications?

with best regards

Nils

"Girish Patil (Gnostice)" schrieb:
>
> Hi Nils,
>
> Gnostice PDFtoolkit can extract text from a PDF document, un-protected and even
> protected ones (you need to provide the password). You can even extract pages to
> a vector format. What's more, it's a 100% VCL component. Please take a look.
>
> --
> Girish Patil
> Gnostice Information Technologies www.gnostice.com
> ---------------------------------------------------------------------
> Gnostice eDocEngine (http://www.gnostice.com/edoc_engine.asp) -
> Electronic document creation, Report Export, PDF eForms creation...
>
> Gnostice PDFtoolkit (http://www.gnostice.com/pdftoolkit.asp) -
> View, Print, Convert, Modify, Enhance PDF docs, process PDF eForms...
> ---------------------------------------------------------------------
>
> "Nils Boedeker" <info@nbsoft.de> wrote in message
> news:42147FD4.6723E992@nbsoft.de...
> Hi,
>
> exist anywhere a component or libary that help to extract text from an
> "un"-protected PDF Dokument?
>
> Nils

-- 
_________________________________
 Verlag Eugen Ulmer
 Datenbanken und IT-Entwicklung
 Nils Bödeker
 Bürgerwohlsweg 7
 D-28215 Bremen
 Germany
 Tel:   +49 (0)421 - 3795020
 Fax:   +49 (0)421 - 3795021
 Mobil: +49 (0) 172 - 7468066
 nboedeker@ulmer.de
 www.ulmer.de
 yahoo ID: nilsboedeker
 Skype ID: nilsboedeker
 ICQ ID: 206474523


Relevant Pages

  • Re: ghostscript PDF page extraction, leaving text as text
    ... the PDF as downloaded from your site is OK. ... complaints you got must be due to a transfer error (probably some end of line ... On the linux system extract a single page with this command: ... xref table. ...
    (comp.lang.postscript)
  • Re: Extract Image From PDF
    ... I have a demo app that can execute Ghostscript with command line parameters, and at the moment I can only get the revision number and a thumbnail view of the first page based on the content I have found. ... Do you know the parameters I would need to extract the image on the first page to a TIFF please? ... Here are the args I found to generate a jpeg based on a pdf document: ...
    (microsoft.public.dotnet.languages.vb)
  • Re: Colored Text extraction from PDF
    ... is it possible to extract the colored text from pdf. ... There are 3 color texts in a pdf -- RED, ... using drawString. ...
    (comp.lang.java.programmer)
  • Re: document processing
    ... I have to work with filled forms, so I know what the fields are and I need to extract the info in the filled fields. ... I would like to build the user interface with some kind of script extracting info from the document and presentig to the user the necessary fields to fill in. ... I need to import documents in html, DOC and PDF formats and would like to parse them and automatically create fields to fill the documents. ...
    (comp.games.development.programming.algorithms)
  • Re: Extracting pages from PDF based on colour figures
    ... print each PDF file on the appropriate printer. ... There exist other "page" characteristics such as size and paper type ... I don't know if you can extract color pages directly from a pdf. ...
    (comp.text.tex)