Re: Automatic text tagging
- From: amado.alves@xxxxxxxxx
- Date: Wed, 9 Apr 2008 07:20:12 -0700 (PDT)
On Apr 8, 10:05 pm, Bruno Barberi Gnecco
<brunobgDELETET...@xxxxxxxxxxxxxxxxxxxxx> wrote:
> I need to implement an automatic text tagging system. Any suggestions
> of algorithms? I've used Bayesian classification with great success when the
> categories are fixed and in small number, but in the case of tags I believe
> it won't work very well (too few items per tag to train well). I'm also looking
> for something more sophisticated than simply finding tags in text.
> Any pointers to papers, books or code are appreciated. Thanks a lot.
You mean Part-Of-Speech tags (Noun, Verb, etc.)?
But these *are* a "fixed and small number of categories", aren't
they?
For a small training set, a very successful technique is to take the
context into account, namely the few words to the left and to the right
of the word being tagged. Work with probabilities. In a wider
context there are often choices with probability 1 (e.g. the words
"the", "at"). These "ground" choices help choose the others.
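A minimal sketch of the context idea in Python, under stated assumptions: the tiny corpus, the tag names, the smoothing constant ALPHA and the function names train / tag_sentence are all made up for illustration, and the scoring rule (word probability times left-neighbour and right-neighbour probabilities) is just one simple way to combine the evidence. Words seen with only one tag in training are fixed first ("grounded"), then the ambiguous words are resolved from their tagged neighbours.

```python
from collections import Counter, defaultdict

ALPHA = 0.1  # smoothing constant, an arbitrary choice for this sketch

# Hypothetical hand-tagged corpus, far too small for real use.
TRAIN = [
    [("the", "DET"), ("dog", "NOUN"), ("can", "VERB"), ("run", "VERB")],
    [("the", "DET"), ("can", "NOUN"), ("is", "VERB"), ("at", "PREP"),
     ("the", "DET"), ("door", "NOUN")],
    [("dogs", "NOUN"), ("run", "VERB"), ("at", "PREP"), ("the", "DET"),
     ("park", "NOUN")],
    [("the", "DET"), ("run", "NOUN"), ("is", "VERB"), ("long", "ADJ")],
]

def train(corpus):
    """Count tags per word and tags per left/right neighbouring tag."""
    word_tags = defaultdict(Counter)  # word -> counts of its tags
    after = defaultdict(Counter)      # left neighbour's tag -> counts of this tag
    before = defaultdict(Counter)     # right neighbour's tag -> counts of this tag
    for sent in corpus:
        for i, (word, tag) in enumerate(sent):
            word_tags[word][tag] += 1
            if i > 0:
                after[sent[i - 1][1]][tag] += 1
            if i + 1 < len(sent):
                before[sent[i + 1][1]][tag] += 1
    tagset = sorted({t for sent in corpus for _, t in sent})
    return dict(word_tags), dict(after), dict(before), tagset

def prob(counts, tag, n_tags):
    """Smoothed relative frequency of tag within counts."""
    return (counts[tag] + ALPHA) / (sum(counts.values()) + ALPHA * n_tags)

def tag_sentence(words, model):
    word_tags, after, before, tagset = model
    n = len(tagset)
    empty = Counter()
    tags = [None] * len(words)

    # Pass 1: ground the words that had exactly one tag in training.
    for i, w in enumerate(words):
        seen = word_tags.get(w, empty)
        if len(seen) == 1:
            tags[i] = next(iter(seen))

    # Pass 2: resolve the rest, left to right, from already tagged neighbours.
    for i, w in enumerate(words):
        if tags[i] is not None:
            continue
        seen = word_tags.get(w, empty)
        candidates = list(seen) or tagset  # unknown word: try every tag

        def score(t):
            s = prob(seen, t, n)
            if i > 0 and tags[i - 1] is not None:
                s *= prob(after.get(tags[i - 1], empty), t, n)
            if i + 1 < len(words) and tags[i + 1] is not None:
                s *= prob(before.get(tags[i + 1], empty), t, n)
            return s

        tags[i] = max(candidates, key=score)
    return tags

MODEL = train(TRAIN)
print(tag_sentence("the can is at the door".split(), MODEL))
print(tag_sentence("the dog can run".split(), MODEL))
```

On this toy data "can" is ambiguous by itself (seen once as a noun, once as a verb); the grounded neighbours decide it: a noun after "the", a verb after "dog". A real tagger would need a proper corpus and probably a sequence model, but the grounding step is the same idea.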