combining millions of different regular expressions
- From: e2point@xxxxxxxxx
- Date: Sun, 16 Mar 2008 09:09:41 -0700 (PDT)
I have a large number of regular expressions (millions). I want to
match a given string against all of them somehow, but I can't match
the string against each one in turn because that will not give the
performance I need.
I know that any regular expression can be transformed into a state
machine. I'm wondering whether it is possible to transform all these
regexes into state machines and merge those state machines somehow
(ideally the merged state machine would have an optimal structure to
improve memory usage and performance), and then run the string once
through the merged machine. (Even though I would have only one state
machine, I need to know exactly which regexes matched the input string.)
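
To make the idea concrete, here is a minimal sketch in Python, not a
production design. It assumes a toy regex syntax (literal characters,
concatenation, |, * and parentheses, all well-formed) and whole-string
matching. Every name in it (NFA, compile_into, build, match_all) is
made up for illustration. Each regex is compiled into a Thompson-style
NFA fragment, all fragments hang off one shared start state, and each
fragment's accept state is tagged with the index of its regex, so a
single pass over the input reports every regex that matched.

```python
class NFA:
    """One shared automaton holding the fragments of every pattern."""
    def __init__(self):
        self.eps = []    # eps[s]: states reachable from s without input
        self.trans = []  # trans[s]: dict mapping a character to states
        self.tag = {}    # accept state -> index of the pattern it accepts

    def new_state(self):
        self.eps.append([])
        self.trans.append({})
        return len(self.eps) - 1


def compile_into(nfa, pattern):
    """Add a fragment for `pattern` to `nfa`; return (start, end).
    Toy grammar only; assumes the pattern is well-formed."""
    pos = 0

    def alternation():
        nonlocal pos
        start, end = concatenation()
        while pos < len(pattern) and pattern[pos] == '|':
            pos += 1
            s2, e2 = concatenation()
            s, e = nfa.new_state(), nfa.new_state()
            nfa.eps[s] += [start, s2]
            nfa.eps[end].append(e)
            nfa.eps[e2].append(e)
            start, end = s, e
        return start, end

    def concatenation():
        start = end = nfa.new_state()
        while pos < len(pattern) and pattern[pos] not in '|)':
            s2, e2 = repetition()
            nfa.eps[end].append(s2)
            end = e2
        return start, end

    def repetition():
        nonlocal pos
        start, end = atom()
        while pos < len(pattern) and pattern[pos] == '*':
            pos += 1
            s, e = nfa.new_state(), nfa.new_state()
            nfa.eps[s] += [start, e]     # enter the loop or skip it
            nfa.eps[end] += [start, e]   # repeat the loop or leave it
            start, end = s, e
        return start, end

    def atom():
        nonlocal pos
        ch = pattern[pos]
        pos += 1
        if ch == '(':
            start, end = alternation()
            pos += 1                     # skip the closing ')'
            return start, end
        s, e = nfa.new_state(), nfa.new_state()
        nfa.trans[s].setdefault(ch, []).append(e)
        return s, e

    return alternation()


def build(patterns):
    """Merge all patterns under one root; tag each accept state."""
    nfa = NFA()
    root = nfa.new_state()
    for pid, pat in enumerate(patterns):
        start, end = compile_into(nfa, pat)
        nfa.eps[root].append(start)
        nfa.tag[end] = pid
    return nfa, root


def closure(nfa, states):
    """All states reachable from `states` via epsilon moves."""
    stack, seen = list(states), set(states)
    while stack:
        for t in nfa.eps[stack.pop()]:
            if t not in seen:
                seen.add(t)
                stack.append(t)
    return seen


def match_all(nfa, root, text):
    """Read `text` once; return indices of all patterns matching it."""
    current = closure(nfa, {root})
    for ch in text:
        nxt = set()
        for s in current:
            nxt.update(nfa.trans[s].get(ch, ()))
        current = closure(nfa, nxt)
    return sorted(nfa.tag[s] for s in current if s in nfa.tag)


patterns = ["ab*", "a(b|c)*", "abc", "x*"]
nfa, root = build(patterns)
print(match_all(nfa, root, "abb"))  # [0, 1]
print(match_all(nfa, root, "abc"))  # [1, 2]
```

This sketch simulates the merged NFA directly and tracks a set of
active states. Determinizing it (subset construction) would give one
transition per input character, but the number of DFA states can grow
exponentially, so whether that is practical for millions of patterns
depends on the patterns themselves.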
Is this something that has already been done somewhere?
Or is it something that is still being researched?
Any advice/info is appreciated.
Thanks