Re: dereferencing hashes



Hunter Barrington wrote:

ok so im using HTML::TreeBuilder to pull some data off of a website
and parse it. If I'm understanding this right each tag of the tree is
put inside the tree as a hash reference with hash references inside
of it for each tag thats below it. Like: <html><body><p> =
$root_hash_ref->$body_hash_ref->$p_hash_ref

No, not at all. It's a linked structure of HTML::Element objects which
can be navigated by using object method calls. Take a look at

perldoc HTML::Element

for an idea of the sort of things you can do.

regardless if i understand the module or not, how do i dereference
the data i need? in other words, i need to be able to pull data out
of the hashes (i know where its kept) without iterating over it. i
tried my $piece = ${row}{3} where 3 would be they key number and that
didnt work. perlref was only so helpful and i only got some of the
basics. any thoughts?

You can use the address() method if the HTML element you need is always
in the same place in the HTML document, but this is a little risky if
the document can change at all. Or you can find it by context using
look_down(). If you give us an example of what it is you're trying to
extract then we can help you better,

Rob
.



Relevant Pages

  • Re: SQL for presentation
    ... It's yet common in web application to have an internal tree representing the "hierarchic tag scheme". ... Some web template systems also create this tree, then transform it to it's textual representation. ... So, yes, an HTML document is a structure. ... note easy to express in a SQL DBMS (BTW, if anyone has a usable SQL schema for a HTML document, I'd be very interested). ...
    (comp.databases.theory)
  • Re: How to get reference to an object
    ... DOM is a tree that can contain different nodes, ... Elements, Attr, Comments, DocumentFragment and Text. ... Are you searching in a HTML document or in an XML document (where the ...
    (microsoft.public.scripting.jscript)