Re: HTML/DOM parser
- From: Michael Ekstrand <michael@xxxxxxxxxxx>
- Date: Tue, 28 Feb 2006 07:33:47 -0600
On 28 Feb 2006 00:33:11 -0800
"Xah Lee" <xah@xxxxxxxxxx> wrote:
> is there a module that lets me parse validated html files and store it
> as a tree?
BeautifulSoup will parse valid HTML (not just XHTML), and crummy HTML
while it's at it, and it generates a tree structure. Warning: I haven't
actually used it myself; I've only looked at it.
http://www.crummy.com/software/BeautifulSoup/
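
To make "parse HTML and store it as a tree" concrete, here is a minimal
sketch using only Python's standard-library html.parser module. It is
not BeautifulSoup and does not use its API: the names Node, TreeBuilder,
parse_html, and VOID_TAGS are made up for this illustration, and the
sketch assumes reasonably well-formed input.

```python
from html.parser import HTMLParser

# Elements that never take a closing tag, so they must not stay open.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class Node:
    """One element: a tag name, its attributes, and its children."""

    def __init__(self, tag, attrs=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.children = []  # Node objects and text strings, in document order

    def find_all(self, tag):
        """Return all descendant elements named `tag`, in document order."""
        found = []
        for child in self.children:
            if isinstance(child, Node):
                if child.tag == tag:
                    found.append(child)
                found.extend(child.find_all(tag))
        return found

    def text(self):
        """Concatenate all text found below this element."""
        return "".join(c if isinstance(c, str) else c.text()
                       for c in self.children)


class TreeBuilder(HTMLParser):
    """Builds a Node tree by tracking the stack of currently open elements."""

    def __init__(self):
        super().__init__()
        self.root = Node("[document]")  # hypothetical name for the root
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs)
        self.stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Close the nearest open element with this name; ignore stray end tags.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        if data.strip():  # skip whitespace-only text between tags
            self.stack[-1].children.append(data)


def parse_html(source):
    """Parse an HTML string and return the root Node of the tree."""
    builder = TreeBuilder()
    builder.feed(source)
    builder.close()
    return builder.root


if __name__ == "__main__":
    tree = parse_html('<html><body><p class="a">Hello <b>world</b></p>'
                      '<br><p>Second</p></body></html>')
    for p in tree.find_all("p"):
        print(p.attrs, p.text())
```

Unlike what BeautifulSoup advertises, this sketch makes no attempt to
repair bad markup: an unclosed <p>, for example, simply ends up
containing whatever follows it.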
- Michael
--
mouse, n: a device for pointing at the xterm in which you want to type.
-- Fortune