Re: HTML to text

I am looking for a already done object for Delphi 2005/2006 that converts
HTML formated e-mail bodies to the raw text, it just removes the HTML
formatting codes. Anybody have any idea? It can be shareware or commerical.
If you recommend something, please give me an idea why it is great and any
known drawbacks on it. Thanks in advance!

Hi Barton,


will do this easily, with full Unicode support (accented & foreign language
characters) as well.

DIHtmlParser is great because it is fast, uses minimal memory, processes all
HTML you throw at it, deals with international languages, and does all this
fully natively: You do not need to have IE nor additional character sets
installed on your client's system.

The DIHtmlParser_ExtractText demo project even shows you how to go about when
extracting text from HTML.



The Delphi Inspiration

Relevant Pages

  • Re: HTML to text
    ... HTML formated e-mail bodies to the raw text, ... It can be shareware or commerical. ... function HtmlToText(const Html: string): string; ...
  • Vilistextum 2.6.5 - fault tolerant HTML to text converter
    ... Vilistextum is a small and fast HTML to text converter. ... It is quite fault-tolerant and deals well with badly-formed HTML. ... It has full support for different character sets. ... BUGFIX: sometimes the last word in the document was not output ...
  • Re: Is it time for a new charset in the Digest? [telecom]
    ... is not related to character sets. ... What part of html is "transitional"? ... that HTML has various acceptable standards, some of them transitional, as ... Avant de repondre, jeter la poubelle, SVP. ...
  • Re: ASP CDO sending MS Word copied text
    ... email with this html outside of my ASP web application it displays fine. ... This is about character sets. ... I am after advice regarding character encoding. ...