Question: need to parse web pages to extract data
From: Troll (abuse_at_microsoft.com)
Date: 10/31/04
- Next message: Jussi Mononen: "Re: IDEs"
- Previous message: Tom: "Which data structures to use? (Hash.. etc.,)"
- Next in thread: Peter Wyzl: "Re: Question: need to parse web pages to extract data"
- Reply: Peter Wyzl: "Re: Question: need to parse web pages to extract data"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Date: Sun, 31 Oct 2004 09:12:29 GMT
Hi,
The site is:
www.homepriceguide.com.au
A sample page with data can be seen at:
http://www.homepriceguide.com.au/snapshot/price/index.cfm?action=view&suburbORpostcode=6153&source=apm
The only thing that changes is the postcode so the next page in line will
be:
http://www.homepriceguide.com.au/snapshot/price/index.cfm?action=view&suburbORpostcode=6154&source=apm
etc etc
What I'm trying to do is to extract price info and save it to a file where
each record has the postcode as its ID. Last year I wrote a script that went
through the site and gathered the data for me and dumped the results in a
file. Unfortunately it's gone walking somewhere. Can someone pls remind me
which module is best to be used here (I'm mainly concerned with the parsing
side right now)? I have not coded for <> 12mths so I'm a bit rusty now but
hopefully it will all come back.
Let me know if the above is not clear.
Thanks in advance.
Voitec
- Next message: Jussi Mononen: "Re: IDEs"
- Previous message: Tom: "Which data structures to use? (Hash.. etc.,)"
- Next in thread: Peter Wyzl: "Re: Question: need to parse web pages to extract data"
- Reply: Peter Wyzl: "Re: Question: need to parse web pages to extract data"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Relevant Pages
|