Re: parse mime

From: Edwin Martin (e.j.martin_at_chello.nl)
Date: 10/07/04


Date: Thu, 07 Oct 2004 09:30:49 GMT

Erik Rosenbach wrote:

> Thanks Edwin for the answer, unfortunately it will not be that simple of a
> process. What I have to work with is actually about 80K records coming out
> of an Oracle database. These records (message bodies) are from an Lyris
> News group server and I am porting them into something else. One of the key
> problems with the content and the embedded mime is that I have some messages
> with multipart headers... for text, html, and for attachments. I'm not
> worried about any of the attachments, I just want to scrape the content.
> Not every message is embedded with the mime either. Some of them are plain
> text which is fine. The really problematic messages are the messages in
> which people have posted the messaged and replied to. Some of these
> messages have long trail of "original message" headers in them as well.
>
> What I would ideally like to find, is a email parser I can passed the
> message body to, and have it return to me a only the plain text body.

Ah, I see, that's a little bit more complex.

This functionality might be part of some Mail software/API.

Maybe you can find a solution here:

http://java.sun.com/products/javamail/Third_Party.html

Edwin Martin.

-- 
http://www.bitstorm.org/


Relevant Pages

  • Re: parse mime
    ... > problems with the content and the embedded mime is that I have some messages ... > worried about any of the attachments, I just want to scrape the content. ... > message body to, and have it return to me a only the plain text body. ...
    (comp.lang.java)
  • Re: Is email scanning necessary?
    ... > Also, you can allows save any attachments and scan them manually, ... message body. ... I guess I'll go back to reading messages in Plain ...
    (microsoft.public.win2000.general)
  • Printing format change
    ... currently have my mail format set to plain text in order ... to print off messages with attachments. ... the message body? ...
    (microsoft.public.outlook.printing)
  • Re: Is Plain Text email 100% safe?
    ... There can be no virus in the body of a plain text email, but the attachments might still be infected, if they're in some format other than plain text. ... They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety. ...
    (microsoft.public.security.virus)
  • Re: Preview pane is blank
    ... "Robert Aldwinckle" wrote: ... when she called to say she couldn't open her attachments. ... Now imagine that there is no Plain Text component to that E-mail... ... and just open the E-mail in its own message window? ...
    (microsoft.public.windows.inetexplorer.ie6_outlookexpress)