Extracting nested tables from HTML
From: Terry (just_at_say.no)
Date: 12/31/04
- Next message: Alan J. Flavell: "Re: Multiple submit actions."
- Previous message: George Dishman: "Re: Is zero even or odd?"
- Next in thread: Paul Lalli: "Re: Extracting nested tables from HTML"
- Reply: Paul Lalli: "Re: Extracting nested tables from HTML"
- Reply: Tad McClellan: "Re: Extracting nested tables from HTML"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Date: Fri, 31 Dec 2004 10:05:50 GMT
Hi!
I have several very large HTML files from which I'd like to extract only
tables nested at the deepest level. I thought this would be quite easy
by extracting something like (<table.*?</table>) where I'd alter the
'.*?' to test for and reject any new occurrences of a starting table
tag, but I can't seem to get it. Any pointers?
I want to deal with the file at a text level until the tables are
extracted, after which I plan to use HTML::TableContentParser to extract
the needed content.
Thanks for your help.
Terry.
- Next message: Alan J. Flavell: "Re: Multiple submit actions."
- Previous message: George Dishman: "Re: Is zero even or odd?"
- Next in thread: Paul Lalli: "Re: Extracting nested tables from HTML"
- Reply: Paul Lalli: "Re: Extracting nested tables from HTML"
- Reply: Tad McClellan: "Re: Extracting nested tables from HTML"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Relevant Pages
|