Re: HTML Parsing



disappearedng@xxxxxxxxx wrote:
Hi everyone
I am trying to build my own web crawler for an experiement and I don't
know how to access HTTP protocol with python.

Also, Are there any Opensource Parsing engine for HTML documents
available in Python too? That would be great.


Check on Mechanize. It wraps Beautiful Soup inside of methods that aid in website crawling.

http://pypi.python.org/pypi/mechanize/0.1.7b

-Larry
.



Relevant Pages

  • Re: HTML Parsing
    ... I am trying to build my own web crawler for an experiement and I don't ... know how to access HTTP protocol with python. ...
    (comp.lang.python)
  • HTML Parsing
    ... I am trying to build my own web crawler for an experiement and I don't ... know how to access HTTP protocol with python. ...
    (comp.lang.python)
  • Re: HTML Parsing
    ... I am trying to build my own web crawler for an experiement and I don't ... know how to access HTTP protocol with python. ... Are there any Opensource Parsing engine for HTML documents ...
    (comp.lang.python)
  • Re: HTML Parsing
    ... know how to access HTTP protocol with python. ... Are there any Opensource Parsing engine for HTML documents ... Freedom is always the freedom of dissenters. ...
    (comp.lang.python)
  • Re: HTML Parsing
    ... know how to access HTTP protocol with python. ... Are there any Opensource Parsing engine for HTML documents ...
    (comp.lang.python)