Re: Parsing MS Word documents

From: dominix (dominix_at__@despammed.com)
Date: 02/24/04

  • Next message: Sisyphus: "Re: How to install Crypt::SSLeay using PPM on Wintel"
    Date: Tue, 24 Feb 2004 10:10:31 -1000
    
    

    Regent wrote:
    > Hi, friends
    >
    > I searched cpan in vain for a module that can read (parse) MS Word
    > .doc files. Can someone refer me to one somewhere or give me a hint
    > about the structure of such files? Thanks!
    >

    use OpenOffice for your platform and download the
    OpenOffice Perl Library http://ooolib.sourceforge.net/
    so you can have acces to your document whithin perl by the way of OpenOffice

    -- 
    dominix
    

  • Next message: Sisyphus: "Re: How to install Crypt::SSLeay using PPM on Wintel"

    Relevant Pages

    • Parsing Word Doc files
      ... I searched cpan in vain for a module that can read (parse) MS Word .doc files. ... Can someone refer me to one somewhere or give me a hint about the structure of such files? ...
      (comp.lang.perl.misc)
    • Parsing MS Word documents
      ... I searched cpan in vain for a module that can read (parse) MS Word .doc files. ... Can someone refer me to one somewhere or give me a hint about the structure of such files? ...
      (comp.lang.perl.modules)
    • Re: reading MS word files
      ... to just read properly .doc files that people email me thinking ... yet i keep using openoffice for professional reasons. ... Ron Johnson, Jr. ... Is "common sense" really valid? ...
      (Debian-User)
    • Re: need indexing PDF & DOC files
      ... And I need indexing PDF & DOC files. ... > Somebody known solutions for parse this files formats? ... Just use wvHtml and pdftotext to convert to text then parse the text. ...
      (comp.lang.java.programmer)
    • Re: remove words from mutiple files
      ... You said you had text files, now you tell us it is .doc files. ... strings are different lengths. ... OpenOffice and StarOffice differs from ms-office in the way they create the ...
      (alt.linux)