Re: parse mime

From: Edwin Martin (e.j.martin_at_chello.nl)
Date: 10/07/04

  • Next message: Raymond DeCampo: "Re: THE SINGLETON PATTERN ?"
    Date: Thu, 07 Oct 2004 09:30:49 GMT
    
    

    Erik Rosenbach wrote:

    > Thanks Edwin for the answer, unfortunately it will not be that simple of a
    > process. What I have to work with is actually about 80K records coming out
    > of an Oracle database. These records (message bodies) are from an Lyris
    > News group server and I am porting them into something else. One of the key
    > problems with the content and the embedded mime is that I have some messages
    > with multipart headers... for text, html, and for attachments. I'm not
    > worried about any of the attachments, I just want to scrape the content.
    > Not every message is embedded with the mime either. Some of them are plain
    > text which is fine. The really problematic messages are the messages in
    > which people have posted the messaged and replied to. Some of these
    > messages have long trail of "original message" headers in them as well.
    >
    > What I would ideally like to find, is a email parser I can passed the
    > message body to, and have it return to me a only the plain text body.

    Ah, I see, that's a little bit more complex.

    This functionality might be part of some Mail software/API.

    Maybe you can find a solution here:

    http://java.sun.com/products/javamail/Third_Party.html

    Edwin Martin.

    -- 
    http://www.bitstorm.org/
    

  • Next message: Raymond DeCampo: "Re: THE SINGLETON PATTERN ?"

    Relevant Pages

    • Re: parse mime
      ... > problems with the content and the embedded mime is that I have some messages ... > worried about any of the attachments, I just want to scrape the content. ... > message body to, and have it return to me a only the plain text body. ...
      (comp.lang.java.developer)
    • Re: Is email scanning necessary?
      ... > Also, you can allows save any attachments and scan them manually, ... message body. ... I guess I'll go back to reading messages in Plain ...
      (microsoft.public.win2000.general)
    • Printing format change
      ... currently have my mail format set to plain text in order ... to print off messages with attachments. ... the message body? ...
      (microsoft.public.outlook.printing)
    • Re: Is Plain Text email 100% safe?
      ... There can be no virus in the body of a plain text email, but the attachments might still be infected, if they're in some format other than plain text. ... They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety. ...
      (microsoft.public.security.virus)
    • Re: Import and Insert data from email
      ... I will assume plain text, could we read either plain or HTML from ... Once in db journey is passed to mobile device for driver journey details ... Q. Are the messages in plain text or HTML? ... Q. Would there be any attachments? ...
      (microsoft.public.scripting.vbscript)