Re: parse mime

From: Edwin Martin (e.j.martin_at_chello.nl)
Date: 10/07/04

  • Next message: Raymond DeCampo: "Re: THE SINGLETON PATTERN ?"
    Date: Thu, 07 Oct 2004 09:30:49 GMT
    
    

    Erik Rosenbach wrote:

    > Thanks for the answer, Edwin. Unfortunately, it will not be that simple a
    > process. What I have to work with is about 80K records coming out of an
    > Oracle database. These records (message bodies) are from a Lyris news
    > group server, and I am porting them into something else. One of the key
    > problems with the content and the embedded MIME is that some messages have
    > multipart headers for text, HTML, and attachments. I'm not worried about
    > the attachments; I just want to scrape the content. Not every message has
    > embedded MIME, either. Some are plain text, which is fine. The really
    > problematic messages are the ones that people have posted and then replied
    > to. Some of these also have a long trail of "original message" headers in
    > them.
    >
    > What I would ideally like to find is an email parser that I can pass the
    > message body to and have it return only the plain-text body.

    Ah, I see, that's a little bit more complex.

    This functionality might be part of some mail software or API.

    Maybe you can find a solution here:

    http://java.sun.com/products/javamail/Third_Party.html
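
    To make the task concrete, here is a rough sketch using only the standard
    library. It is not JavaMail and not a complete MIME parser: the class
    PlainTextExtractor and its methods are hypothetical names, and the
    "-----Original Message-----" marker is an assumption about what the reply
    trails look like. It assumes a single, non-nested multipart level and does
    not decode quoted-printable or base64 content, which a real mail library
    would handle.

```java
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper for illustration; not part of JavaMail or any library. */
final class PlainTextExtractor {

    // Matches boundary="xyz" or boundary=xyz in a Content-Type header.
    private static final Pattern BOUNDARY =
            Pattern.compile("boundary=\"?([^\";\\r\\n]+)\"?", Pattern.CASE_INSENSITIVE);

    // Assumed marker for a quoted reply trail; adjust to the actual data.
    private static final Pattern ORIGINAL_MESSAGE =
            Pattern.compile("(?im)^-+\\s*original message\\s*-+\\s*$");

    /** Returns the text/plain content of a raw body, minus any reply trail. */
    static String extractPlainText(String raw) {
        String text = raw.replace("\r\n", "\n");
        Matcher m = BOUNDARY.matcher(text);
        if (m.find()) {
            // MIME parts are separated by "--" followed by the boundary string.
            String plain = firstPlainPart(text, "--" + m.group(1));
            if (plain != null) {
                text = plain;
            }
        }
        // Messages without MIME structure fall through unchanged to here.
        return stripReplyTrail(text).trim();
    }

    /** Returns the content of the first text/plain part, or null if none. */
    private static String firstPlainPart(String text, String delimiter) {
        for (String part : text.split(Pattern.quote(delimiter))) {
            // A blank line separates a part's headers from its content.
            int blank = part.indexOf("\n\n");
            if (blank < 0) {
                continue;
            }
            String headers = part.substring(0, blank).toLowerCase(Locale.ROOT);
            if (headers.contains("content-type: text/plain")) {
                return part.substring(blank + 2);
            }
        }
        return null;
    }

    /** Cuts the text at the first "original message" marker line, if any. */
    static String stripReplyTrail(String text) {
        Matcher m = ORIGINAL_MESSAGE.matcher(text);
        return m.find() ? text.substring(0, m.start()) : text;
    }
}
```

    Calling PlainTextExtractor.extractPlainText(body) on each record would
    return plain-text messages as they are (apart from trimming and the trail
    cut) and pick the text/plain part out of simple multipart ones. Anything
    with nested parts or encoded content is better left to a proper mail API.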

    Edwin Martin.

    -- 
    http://www.bitstorm.org/
    
