Re: search engine challenge
From: Philipp Lenssen (info_at_outer-court.com)
Date: 01/26/04
- Next message: Justin Koivisto: "Re: use php but not access the source"
- Previous message: Bob: "Re: Why php and not C?"
- In reply to: Frank: "search engine challenge"
- Next in thread: Frank: "Re: search engine challenge"
- Reply: Frank: "Re: search engine challenge"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Date: 26 Jan 2004 13:55:14 GMT
Frank wrote:
>
> I'm running a site with +20.000 articles. The articles (html files)
> are saved on the server as txt files. Alle other data (author, date,
> category and so on) are in a MySQL db. Before we had the articles put
> in the db also and then performed SQL queries for the search engine.
> But this is no longer feasable since there are too many articles and
> the db has gotten too big. The search engine does all of the db and
> the server cpu goes max. I'm looking for a php type search engine
> that automatically indexes the txt files, produces 1 index file with
> all indexed words + the id's of articles having those words. Like
> that the search script doesn't have to query all the articles (the
> whole db) anymore but just this one index file. Would be nice also if
> there would be possibility to have a blacklist of words (the, a,...)
> and other admin things.
>
If the site is public, have you thought about letting Google do the
hard work, and then either using the Google site search, or the Google
Web API to display results? Google is getting _very_ fast in indexing
large amounts of data on one's site. They picked up thousands of my
pages recently while I was playing around with the htaccess... even too
fast for my taste since I changed it again the next day...
-- Google Blogoscoped http://blog.outer-court.com
- Next message: Justin Koivisto: "Re: use php but not access the source"
- Previous message: Bob: "Re: Why php and not C?"
- In reply to: Frank: "search engine challenge"
- Next in thread: Frank: "Re: search engine challenge"
- Reply: Frank: "Re: search engine challenge"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Relevant Pages
|