Re: Automatic text tagging



       I need to implement an automatic text tagging system. Any suggestions
for algorithms? I've used Bayesian classification with great success when the
categories are fixed and few in number, but in the case of tags I believe
it won't work very well (too few items per tag to train well). I'm also looking
for something more sophisticated than simply finding tags in text.

       Any pointers to papers, books or code are appreciated. Thanks a lot.

You mean Part-Of-Speech tags (Noun, Verb, etc.)?

        No, I mean tags as they're used in many websites nowadays,
describing what the text is about. For example, this message could be
tagged "text mining, tag, probabilities".

I see.

Spreading Activation is my favorite technique. Take a look at the paper
"Eigensearching the web..." Also, for "something more sophisticated
than simply finding tags in text," consider a postcoordination
approach: don't tag; search.
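
To make the idea concrete, here is a rough sketch of spreading activation
used for tag suggestion. It is not taken from the paper; it is just one
simple way to do it. The term graph, its edge weights, the decay factor and
the function names (spread_activation, suggest_tags) are all assumptions
made up for illustration. In practice the graph might be built from term
co-occurrence counts in your own corpus.

```python
from collections import defaultdict


def spread_activation(graph, seeds, decay=0.5, steps=2, threshold=0.01):
    """Propagate activation from seed nodes through a weighted graph.

    graph: {node: {neighbor: weight}} (hypothetical association graph)
    seeds: {node: initial activation}
    """
    activation = defaultdict(float, seeds)
    frontier = dict(seeds)
    for _ in range(steps):
        next_frontier = defaultdict(float)
        for node, energy in frontier.items():
            if energy < threshold:
                continue  # too weak to be worth propagating
            for neighbor, weight in graph.get(node, {}).items():
                # Each hop attenuates the energy by edge weight and decay.
                next_frontier[neighbor] += energy * weight * decay
        for node, energy in next_frontier.items():
            activation[node] += energy
        frontier = next_frontier
    return dict(activation)


def suggest_tags(graph, text_terms, tag_vocabulary, top_k=3):
    """Seed the graph with terms found in the text, rank known tags."""
    seeds = {term: 1.0 for term in text_terms if term in graph}
    activation = spread_activation(graph, seeds)
    candidates = [t for t in activation if t in tag_vocabulary]
    # Highest activation first; ties broken alphabetically.
    candidates.sort(key=lambda t: (-activation[t], t))
    return candidates[:top_k]


# Toy data, invented for this example only.
graph = {
    "bayesian": {"probabilities": 0.8, "classification": 0.9},
    "classification": {"text mining": 0.7},
    "tagging": {"tag": 0.9, "text mining": 0.5},
}
text_terms = ["bayesian", "classification", "tagging"]
tag_vocabulary = {"text mining", "tag", "probabilities"}

print(suggest_tags(graph, text_terms, tag_vocabulary))
```

Note that a tag can be suggested even if it never appears literally in the
text, as long as enough activation reaches it from terms that do. That is
the part that goes beyond simply finding tags in text.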
