Re: Delayed WEB Page Response



On Feb 7, 5:00 pm, Mark Clements <mark.clementsREMOVET...@xxxxxxxxxx>
wrote:
aage.gribs...@xxxxxxxxx wrote:
I wish to capture data from a Web page e.g.
"http://www.eppraisal.com/PropertyInfo.aspx?a=1215%20Jefferson
%20Ave&z=46201"

I am using the LWP modules.
The page responds in three steps and I have succeeded in capturing
only the first.

The page first paints up nicely with "Loading" text in the area of
interest.
After a delay the "Loading" text is replaced with "Calculating".
Shortly thereafter, sometimes apparently instantaniously, the data of
interest appears.

I have tried LWP:: UserAgent and LWP::Parallel::UserAgent and capture
only the initial response.
TimeOut parameters do not change the behavior.
The callback subroutine indicates the HTML comes in several chunks.

How can the other responses be captured?
The documentation mentions  LPW::Parallel::UserAgent::Entry objects
and follow up requests.
Will this be of help?
I have found no documentation of this feature.
Is there any additional documentation or examples?

It's using javascript - which neither LWP nor WWW::Mechanize will
execute -  to move between pages. You could try using
Win32::IE::Mechanize or Selenium, but both of these rely on controlling
a running browser.

Mark

There is an API to some of our data. What data elements are you
looking to pull?

Send me an email or to info (at) eppraisal.com. Scraping the front-end
is time consuming and prone to errors (when we push out updates).

Damian (from eppraisal.com)
.



Relevant Pages

  • Re: Teach me how to fish, regexp
    ... Capturing parentheses "save" whatever is matched between them, ... There is more information about this in the perlre documentation, ... If you only want to group some stuff together in a subpattern, ... perlop) than of regular expressions themselves. ...
    (comp.lang.perl.misc)
  • Re: POSIX.Memory_Mapping.Map_Memory
    ... "Adrian Hoe" writes: ... > Anders Gidenstam. ... According to latest V4L documentation, capturing with ...
    (comp.lang.ada)
  • Re: Delayed WEB Page Response
    ... I am using the LWP modules. ... The page responds in three steps and I have succeeded in capturing ... only the initial response. ... I have found no documentation of this feature. ...
    (comp.lang.perl.modules)
  • Re: Process does not contain any programs (???)
    ... Not sure what to put on that bug report. ... New feature didn't work. ... Wanted to set a breakpoint. ... have any representation in the documentation (Note to Microsoft: ...
    (microsoft.public.vc.mfc)
  • Re: Is WM5 worth trying on a Dell Axim x50v?
    ... I've done my google searches for this documentation, ... to come up with is people complaining about the feature removal. ... have been extra effort, but it would have made a sizeable portion of their ... and require a VPN connection to sync. ...
    (microsoft.public.pocketpc)