Re: Convert MS-Word to plain text



backpack <curtyoung@xxxxxxxxx> wrote:
Are there any perl modules that will allow you to convert MS-Word docs
to plain text?

Opposite to earlier formats the DOCX format is an open XML format and
information about it is available on the Microsoft website. I don't know
if someone already wrote a parser for it, but at least it should be
possible now.

jue
.



Relevant Pages

  • Re: making a word document created on my mac viewable by windows users
    ... needs, from the Microsoft website. ... However, if you save the file in RTF format, and then zip the RTF file and ... email in MIME format, you have covered 99.99 per cent of the bases :-) ... And .docx is about a quarter the size of the old formats, ...
    (microsoft.public.mac.office.word)
  • Re: open access2003 files in access97 or access2000
    ... files in Access 2003 format. ... > aware of saving within access2003 as access97 format, ... >>> to verify before i proceed... ...
    (microsoft.public.access.externaldata)
  • Re: Choosing between Word Formats
    ... Word - for whatever reason. ... I'm not challenging your "rights", ... I further believe they are best to SEND that format to other people, ... Everyone else can find a way to open the .docx format. ...
    (microsoft.public.mac.office.word)
  • Re: Two questions, Default language and font
    ... and it enables you to work natively in .docx format which will ... choose Word 2007 running in .docx format. ... I have created some styles of my own to help me write my dissertation. ... "Automatically Update Styles" have been turned off since day one. ...
    (microsoft.public.mac.office.word)
  • Re: Word 2007 Document Format
    ... If I just click Save a document (even in Compatibility Mode), ... If you check the box it save in a .docx format NOT .doc format. ... "Suzanne S. Barnhill" wrote: ...
    (microsoft.public.word.docmanagement)