Re: Question about lwp-rget
- From: rob.dixon@xxxxxxx (Rob Dixon)
- Date: Wed, 30 Jul 2008 15:12:05 +0100
bdy wrote:
If I use lwp-rget to retrieve a Web site, will it retrieve new pages
added that may not be linked to?
For example, the site www.123.com is composed of 10 pages, each of
which is accessible through links on the site.
But, two pages are added in the span of three days. www.123.com/fourteen-five.jsp
and www.123.com/eight-nine.html.
fourteen-five.jsp isn't accessible through any links on the site, but
eight-nine.html is.
Will lwp-rget be able to find both pages in addition to the 10
original pages?
lwp-rget will fetch only the page you specify and any others that the pages it
has already read link to. Fetching a page without specifying a full URL usually
results in either a default web page for that address or a directory listing (or
an error 404) so
lwp-rget www.123.com
is the same as
lwp-rget http://www.rget.com/index.htm
and you will get the index.htm file, and all the resources that file links to,
and all that they in turn link to and so on.
HTH,
Rob
.
- References:
- Question about lwp-rget
- From: Bdy
- Question about lwp-rget
- Prev by Date: Re: How to get a computed string to act as a re in if statement
- Next by Date: Re: rand()
- Previous by thread: Question about lwp-rget
- Next by thread: Checking to see if file exists.
- Index(es):