Question: need to parse web pages to extract data

From: Troll (abuse_at_microsoft.com)
Date: 10/31/04


Date: Sun, 31 Oct 2004 09:12:29 GMT

Hi,

The site is:
www.homepriceguide.com.au

A sample page with data can be seen at:
http://www.homepriceguide.com.au/snapshot/price/index.cfm?action=view&suburbORpostcode=6153&source=apm

The only thing that changes is the postcode so the next page in line will
be:
http://www.homepriceguide.com.au/snapshot/price/index.cfm?action=view&suburbORpostcode=6154&source=apm

etc etc

What I'm trying to do is to extract price info and save it to a file where
each record has the postcode as its ID. Last year I wrote a script that went
through the site and gathered the data for me and dumped the results in a
file. Unfortunately it's gone walking somewhere. Can someone pls remind me
which module is best to be used here (I'm mainly concerned with the parsing
side right now)? I have not coded for <> 12mths so I'm a bit rusty now but
hopefully it will all come back.

Let me know if the above is not clear.

Thanks in advance.
Voitec



Relevant Pages