Re: Convert MS-Word to plain text



On Fri, 9 May 2008, backpack wrote:
Are there any perl modules that will allow you to convert MS-Word docs to plain text?

AFAIK there is no integrated Perl solution for this, but there are several Perl bridges to external software that can do it, such as:

- Win32::OLE, if you happen to be doing this on an MS Windows system.

- OpenOffice::UNO supposedly lets you control OpenOffice to do almost anything, including opening a Word document and saving it as a text file, or extracting the text directly. I have used OpenOffice UNO from Java before, but I am not sure how much of UNO is implemented in the Perl module.

- SWISH::Filters uses the external command catdoc to extract the text from MS Word documents.
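
For the catdoc route, a minimal sketch of calling the program directly from Perl might look like the following. It assumes catdoc is installed and on the PATH; the word_to_text helper and its optional command argument are made up for illustration and are not part of SWISH::Filters or any other module.

```perl
use strict;
use warnings;

# Hypothetical helper: run an external converter (catdoc by default)
# on a file and return whatever it prints to standard output.
sub word_to_text {
    my ($path, @cmd) = @_;
    @cmd = ('catdoc') unless @cmd;    # assumption: catdoc is on the PATH

    # List-form pipe open: no shell is involved, so odd file names are safe.
    open(my $fh, '-|', @cmd, $path)
        or die "Cannot run $cmd[0]: $!";

    local $/;                         # slurp mode: read all output at once
    my $text = <$fh>;

    # close() on a pipe returns false if the command exited non-zero.
    close($fh)
        or die "$cmd[0] failed on $path (wait status $?)";

    return defined $text ? $text : '';
}

# Command-line use: perl doc2txt.pl report.doc
print word_to_text($ARGV[0]) if @ARGV;
```

The converter is passed as a list rather than a single shell string, so swapping in a different tool (antiword, for example, if you have it) only means passing another command name as the second argument.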

--
(stephan paul) Arif Sahari Wibowo
_____ _____ _____ _____
/____ /____/ /____/ /____
_____/ / / / _____/ http://www.arifsaha.com/