Re: search engine challenge

From: Frank (frank.sonck_at_pandora.be)
Date: 01/26/04


Date: Mon, 26 Jan 2004 20:30:10 GMT

I don't think it's possible to have Google index an MySQL db? And the html
files on the server are not .html

"Philipp Lenssen" <info@outer-court.com> wrote in message
news:bv3682$m6l26$1@ID-203055.news.uni-berlin.de...
> Frank wrote:
>
> >
> > I'm running a site with +20.000 articles. The articles (html files)
> > are saved on the server as txt files. Alle other data (author, date,
> > category and so on) are in a MySQL db. Before we had the articles put
> > in the db also and then performed SQL queries for the search engine.
> > But this is no longer feasable since there are too many articles and
> > the db has gotten too big. The search engine does all of the db and
> > the server cpu goes max. I'm looking for a php type search engine
> > that automatically indexes the txt files, produces 1 index file with
> > all indexed words + the id's of articles having those words. Like
> > that the search script doesn't have to query all the articles (the
> > whole db) anymore but just this one index file. Would be nice also if
> > there would be possibility to have a blacklist of words (the, a,...)
> > and other admin things.
> >
>
> If the site is public, have you thought about letting Google do the
> hard work, and then either using the Google site search, or the Google
> Web API to display results? Google is getting _very_ fast in indexing
> large amounts of data on one's site. They picked up thousands of my
> pages recently while I was playing around with the htaccess... even too
> fast for my taste since I changed it again the next day...
>
> --
> Google Blogoscoped
> http://blog.outer-court.com
>



Relevant Pages

  • Risks Digest 24.70
    ... court case upended ... Search Engine Dispute Notifications: ... Extending Google Blacklists for Dispute Resolutions ...
    (comp.risks)
  • Re: search engine challenge
    ... > are saved on the server as txt files. ... Before we had the articles put ... > in the db also and then performed SQL queries for the search engine. ... hard work, and then either using the Google site search, or the Google ...
    (comp.lang.php)
  • Re: how to explain these logs?
    ... >Also, in some place of the log file, I see these 2 lines where the ... Someone who oriented privacy at 210.21.30.169 searched proxy server. ... Someone or search engine from 216.35.116.91 wanted to get robots.txt ... Google "robots.txt". ...
    (comp.os.linux.security)
  • Re: Google search for word
    ... Google search engine. ... doesn't seem to search within html "docs". ... into an Ops Guide. ... like" search engine to view content within this single doc/web view. ...
    (microsoft.public.word.vba.general)
  • Google Agrees to Censor Results in China
    ... SAN FRANCISCO - Online search engine leader Google Inc. has agreed to ... Because of government barriers set up to suppress information, ... Google officials characterized the censorship concessions in China as ...
    (soc.culture.vietnamese)