Re: Pulling out data between <TD> tags using regular expressions
- From: Paul <none@none>
- Date: Fri, 27 May 2005 12:05:13 +1000
tdmailbox@xxxxxxxxx wrote:
<TD class=tblform3 id=L_listnum.*?>(.*?)<\/TD>
That works.. however it returns the whole <TD> tag.. I just want the value inside the tag. That is my core issue that I cant find the solution to. I can find plenty of expressions that will find the right <TD> tag but not one that will just give me the data between the tags
Read up on HTML::TableExtract.
Getting this sort of data using regex or similar is tricky and the page definition may change ( will change ).
If the tables are not well structured you may have to search by depth and count to get the right table. You will have to come to grips with the structure of the data you are dealing with - the tables and the form.
Start here "http://search.cpan.org/~msisk/HTML-TableExtract-1.08/lib/HTML/TableExtract.pm"
Happy reading. .
- References:
- Pulling out data between <TD> tags using regular expressions
- From: tdmailbox
- Re: Pulling out data between <TD> tags using regular expressions
- From: andrewflanders
- Re: Pulling out data between <TD> tags using regular expressions
- From: tdmailbox
- Pulling out data between <TD> tags using regular expressions
- Prev by Date: Re: Pulling out data between <TD> tags using regular expressions
- Next by Date: FAQ 8.20 How can I call my system's unique C functions from Perl?
- Previous by thread: Re: Pulling out data between <TD> tags using regular expressions
- Next by thread: Re: Pulling out data between <TD> tags using regular expressions
- Index(es):
Relevant Pages
|