Re: Downloading lots and lots and lots of files
- From: Abigail <abigail@xxxxxxxxxx>
- Date: 29 Jan 2007 15:26:01 GMT
coolneo (coolneo@xxxxxxxxx) wrote on MMMMDCCCXCIX September MCMXCIII in
<URL:news:1170081842.925051.117310@xxxxxxxxxxxxxxxxxxxxxxxxxxxx>:
== First, what I am doing is legit... I'm NOT trying to grab someone
== elses content. I work for a non-profit organization and we have
== something going on with Google where they are providing digitized
== versions of our material. They (Google) provided some information on
== howto write a script (shell) to download the digitized version using
== wget.
==
== There are about 50,000 items, raning in size from 15MB-600MB. My
== script downloads them fine, but it would be much faster if i could
== multi-thread(?) it. I'm running the wget using the sys command on a
== windows box (i know, i know, but the whole place is windows so I don't
== have much of a choice).
==
== Am I on the right track? Or should I be doing this differently?
Before you do anything, first check with google if they allow multiple
connection, and if they do, how many multiple connection you may start.
It won't do you much good to start 100 downloads in parallel if google
holds up 95 of them.
Of course, it's quite likely that the network is the bottleneck.
Starting up many simultaneous connections isn't going to help in
that case.
Finally, I wouldn't use threads. I'd either fork() or use a select()
loop, depending on the details of the work that needs to be done.
But then, I'm a Unix person.
Abigail
--
A perl rose: perl -e '@}-`-,-`-%-'
.
- Follow-Ups:
- Re: Downloading lots and lots and lots of files
- From: Ted Zlatanov
- Re: Downloading lots and lots and lots of files
- From: xhoster
- Re: Downloading lots and lots and lots of files
- References:
- Downloading lots and lots and lots of files
- From: coolneo
- Downloading lots and lots and lots of files
- Prev by Date: Re: Downloading lots and lots and lots of files
- Next by Date: Re: subpattern reference using vaiable subpattern index
- Previous by thread: Re: Downloading lots and lots and lots of files
- Next by thread: Re: Downloading lots and lots and lots of files
- Index(es):
Relevant Pages
|
|