Re: Boolean Query Algorithm



"Sherrie Laraurens" <sherrielaraurens@xxxxxxxxxxx> schrieb:
I can't imagine anyone storing the document id and the in-document
position of every occurrence of each unique word encountered during
the initial pre-processing of one's corpus, which could literally
contain hundreds of gigabytes of data. It simply sounds like a very
stupid way of going about it, as is my own suggestion.

But... how then could phrase search be feasible?

Some techniques are mentioned in this rather old article
by Sergey Brin and Lawrence Page:
"The Anatomy of a Large-Scale Hypertextual Web Search Engine"
http://www-db.stanford.edu/~backrub/google.html

I'm still trying to understand it.
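
To make the question concrete, here is a minimal sketch of why positions
matter for phrase search. It is not taken from the article; the names
build_index and phrase_search are made up for illustration, and it
assumes a small in-memory corpus with plain whitespace tokenization.
Each word maps to the documents it occurs in, together with the word
positions inside each document, and a phrase matches only where the
positions are consecutive.

```python
from collections import defaultdict


def build_index(docs):
    """Build a positional inverted index: word -> {doc_id: [positions]}."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for pos, word in enumerate(text.lower().split()):
            index[word][doc_id].append(pos)
    return index


def phrase_search(index, phrase):
    """Return sorted ids of documents containing the phrase's words consecutively."""
    words = phrase.lower().split()
    if not words or any(w not in index for w in words):
        return []

    # Boolean AND step: only documents containing every word can match.
    candidates = set(index[words[0]])
    for w in words[1:]:
        candidates &= set(index[w])

    hits = []
    for doc_id in candidates:
        # Position sets for the second, third, ... word of the phrase.
        later = [set(index[w][doc_id]) for w in words[1:]]
        # The phrase starts at `start` if word k+1 occurs at start + k + 1.
        if any(
            all(start + k + 1 in later[k] for k in range(len(later)))
            for start in index[words[0]][doc_id]
        ):
            hits.append(doc_id)
    return sorted(hits)
```

Without the stored positions, the AND step could still find documents
containing all the words, but each candidate would have to be re-read
to check whether the words are actually adjacent.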

Regards,
Joachim

