Re: best way to index numerical text data ?



On 31 Mar 2006 08:45:53 -0800, "Jack" <jack_posemsky@xxxxxxxxx> wrote
in comp.lang.c:

Hi I have a lot of data that is in a TEXT file which are numbers does
anyone have a good suggestion for indexing TEXT numbers (zip codes,
other codes, dollar amounts, quantities, etc). since Lucene and other
indexers are really optimized for Alpha character indexing. What
approaches are typically taken in computer science for example to index

text numbers..hash maps or something else ??

Thank you,

Jack

I don't see any question here about the C language. If you are
looking for an algorithm, I would suggest asking in
news:comp.programming.

Once you have selected an algorithm, if you have problems writing a
standard C program to implement it, then ask for help here.

--
Jack Klein
Home: http://JK-Technology.Com
FAQs for
comp.lang.c http://c-faq.com/
comp.lang.c++ http://www.parashift.com/c++-faq-lite/
alt.comp.lang.learn.c-c++
http://www.contrib.andrew.cmu.edu/~ajo/docs/FAQ-acllc.html
.



Relevant Pages

  • Re: Generating a large sequence of unique, random numbers
    ... > Generate n unique codes of length l so that they are non-predictable. ... > whatsoever which algorithm to use or how to start in general. ... Luby-Rackoff construct, using F1..F4, producing L' and R'. ... Use that to encrypt the ...
    (sci.crypt)
  • Re: sparse polynomial arithmetic
    ... Remaining within the boundaries of this representation (which are ... do arithmetics directly on codes ... perfect hash values in a hash set ... The algorithm first checks if the exponents of the polynomials to be ...
    (sci.math.symbolic)
  • Re: primality
    ... the point is that it is easier and faster to use an algorithm to generate a prime/semiprime/compound number that happens to be odd rather than generating the number randomly and testing for primality because if the primality test fails then a new random number has to be generated and tested. ... You just grab any random number that's about the size you need and set the first and last bit to "1" (the first bit makes it the size you need and the last bit makes it an odd number - it is very simple and very quick. ... This representation is called the code. ... Different things can be encoded using different codes. ...
    (sci.crypt)
  • Re: primality
    ... but using an algorithm will produce inear results once the initial ... That ciphertext is stored ... This representation is called the code. ... Different things can be encoded using different codes. ...
    (sci.crypt)
  • Re: Difficult query problem
    ... visits made by different employees, for the same visit codes (eg ... Given the algorithm you use in your VB code I think the following ... query will do the work. ...
    (comp.databases.oracle.misc)