Re: Convert MS-Word to plain text



In article <alpine.OSX.1.10.0805170909340.5094@xxxxxxxxxxxxxx>,
S P Arif Sahari Wibowo <arifsaha@xxxxxxxxx> wrote:
On Fri, 9 May 2008, backpack wrote:
Are there any Perl modules that will allow you to convert
MS Word documents to plain text?

AFAIK there is no integrated Perl solution for this, but there
are several Perl bridges to external software that can do it,
such as:

- Win32::OLE, if you happen to be doing this on an MS Windows
system (it automates an installed copy of Word).

- OpenOffice::UNO supposedly lets you control OpenOffice to do
almost anything, including opening a Word document and saving it
as a text file or extracting the text directly. I have used
OpenOffice UNO from Java before, but I am not sure how much of
UNO is implemented in the Perl module.

- SWISH::Filters uses the external command catdoc to extract the
text out of MS Word documents.
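
The external-command route in the last item can be sketched in a few lines of plain Perl. This is only a minimal illustration, not how SWISH::Filters itself is implemented: the helper names run_converter and word_to_text are made up for this example, and it assumes the chosen converter (catdoc by default) is installed, is on the PATH, and prints the extracted text to standard output when given a file name.

```perl
use strict;
use warnings;

# Hypothetical helper: run an external command and return everything
# it writes to standard output. @cmd is the program plus its arguments.
sub run_converter {
    my (@cmd) = @_;

    # List-form pipe open bypasses the shell, so file names containing
    # spaces or quotes are passed through untouched.
    open(my $fh, '-|', @cmd)
        or die "Cannot run $cmd[0]: $!";

    local $/;            # slurp mode: read all output in one go
    my $text = <$fh>;

    # close() on a pipe returns false if the command exited non-zero.
    close($fh)
        or die "$cmd[0] failed (exit status " . ($? >> 8) . ")";

    return $text;
}

# Hypothetical wrapper: extract text from a Word file with an external
# tool. The tool name is a parameter so another converter can be tried.
sub word_to_text {
    my ($file, $tool) = @_;
    $tool = 'catdoc' unless defined $tool;
    return run_converter($tool, $file);
}
```

Usage would look like `my $text = word_to_text('report.doc');`, where report.doc is a placeholder file name. Passing a different tool name as the second argument makes it easy to compare the output of two converters on the same file.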

--
(stephan paul) Arif Sahari Wibowo
_____ _____ _____ _____
/____ /____/ /____/ /____
_____/ / / / _____/ http://www.arifsaha.com/

As for these two, has anyone compared them to "antiword"?

Thanks

David

