Re: Googling



On Dec 25, 2007 3:50 PM, yitzle <yitzle@xxxxxxxxxxxxxxxxxxxxx> wrote:
> Hi
>
> Summary: Is there an easy method to search Google and get the top
> result (title & URL)?
>
> I'm trying to write a script that gets URLs based on a name. I figured
> using Google would be the simplest method. I need to search a specific
> site, and can construct a query so that the top result is often enough
> the correct result.
>
> However, Net::Google requires a SOAP API key, and Google's site says
> they are no longer providing new keys. I thought I might be able to
> use WWW::Mechanize, but the HTML that Google returns is fairly ugly
> and I haven't tried parsing that just yet.
>
> Is there an easy method to search Google and get the top result (title & URL)?

Using a script to scrape Google's result pages is against their Terms
of Service and their robots.txt.

from http://www.google.com/accounts/TOS
snip
5.3 You agree not to access (or attempt to access) any of the Services
by any means other than through the interface that is provided by
Google, unless you have been specifically allowed to do so in a
separate agreement with Google. You specifically agree not to access
(or attempt to access) any of the Services through any automated means
(including use of scripts or web crawlers) and shall ensure that you
comply with the instructions set out in any robots.txt file present on
the Services.
snip

from http://www.google.com/robots.txt
User-agent: *
Allow: /searchhistory/
Disallow: /news?output=xhtml&
Allow: /news?output=xhtml
Disallow: /search
snip
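
If you want to check a URL against those rules before fetching it, here is
a minimal sketch. It is in Python rather than Perl, purely for illustration,
and uses the standard urllib.robotparser module. The helper name is_allowed
and the sample URLs are my own assumptions, and the rules are only the
excerpt quoted in this message, not Google's complete robots.txt. Python's
parser applies the first matching rule in file order, which may not be
exactly how Google evaluates its own file.

```python
from urllib.robotparser import RobotFileParser

# Only the rules quoted in this message, not Google's complete robots.txt.
ROBOTS_EXCERPT = """\
User-agent: *
Allow: /searchhistory/
Disallow: /news?output=xhtml&
Allow: /news?output=xhtml
Disallow: /search
"""


def is_allowed(url, agent="*"):
    """Return True if the excerpted rules permit `agent` to fetch `url`.

    `is_allowed` is a hypothetical helper name used for this sketch.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(ROBOTS_EXCERPT.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    # Sample URLs chosen for illustration only.
    for url in (
        "http://www.google.com/search?q=perl",
        "http://www.google.com/searchhistory/",
    ):
        print(url, "allowed" if is_allowed(url) else "disallowed")
```

Passing the robots.txt check would still not settle the Terms of Service
question in section 5.3, which rules out automated access separately.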