Re: How to strip HTML tags and just get the text
- From: "John McTaggart" <john_at_compnet101_dot_com>
- Date: Tue, 28 Jun 2005 11:02:22 -0700
> That's one big todo! I don't think it is sensible to do HTML stripping
> without a state engine to tell where you are. If you know the HTML you are
> getting is very simple, then you may be able to get away with a simple
> thing like your code. Having written a spam filter, I can say that
> spammers do all sorts of fancy things to try to break simple parsers.
Amen.
There's more to it than first meets the eye.
Done right, it's not a trivial task.
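
To illustrate the "state engine" point from the quoted text, here is a minimal sketch in Python (chosen only for illustration; the function name strip_tags and the set of states are assumptions, not anything posted in this thread). It tracks whether the scanner is in plain text, inside a tag, inside a quoted attribute value, or inside a comment, so a ">" inside a quoted attribute does not end the tag early. It deliberately ignores script/style contents, character entities, and malformed markup, which is part of why the full job is not trivial.

```python
import re

def strip_tags(html: str) -> str:
    """Remove tags and comments with a small hand-written state machine."""
    TEXT, TAG, SQUOTE, DQUOTE, COMMENT = range(5)
    state = TEXT
    out = []
    i, n = 0, len(html)
    while i < n:
        ch = html[i]
        if state == TEXT:
            if html.startswith("<!--", i):
                state = COMMENT          # skip everything up to "-->"
                i += 4
                continue
            if ch == "<":
                state = TAG              # start of a tag
            else:
                out.append(ch)           # ordinary text is kept
        elif state == TAG:
            if ch == '"':
                state = DQUOTE           # quoted attribute value begins
            elif ch == "'":
                state = SQUOTE
            elif ch == ">":
                state = TEXT             # tag ends only outside quotes
        elif state == DQUOTE:
            if ch == '"':
                state = TAG
        elif state == SQUOTE:
            if ch == "'":
                state = TAG
        elif state == COMMENT:
            if html.startswith("-->", i):
                state = TEXT
                i += 3
                continue
        i += 1
    return "".join(out)

def naive_strip(html: str) -> str:
    """The simple regex approach, shown for comparison only."""
    return re.sub(r"<[^>]*>", "", html)

sample = '<a title="x > y">link</a> text'
print(repr(strip_tags(sample)))   # 'link text'
print(repr(naive_strip(sample)))  # ' y">link text'
```

The sample input shows the kind of trick that breaks a simple parser: the regex stops at the first ">" and leaks part of the attribute into the output, while the state machine does not. For real-world input, an existing HTML parser is usually a safer starting point than either approach.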
John McTaggart