combining millions of different regular expressions



i have large number of regular expressions. (millions). i want to
match a given string with all of them some how. but i cant iteratively
match the string with all of them because it will not give the
performance expected.
i know that any regular expression can be transformed to a state
machine. im wondering whether it is possible to transform all these
regex's to state machines and merge those state machines somehow(this
merged state machine will have an optimal structure to improve the
memory usage and performance). and then run the string once through
this state machine. (even though i only have one state machine, i need
to know exactly which regex's got matched with the input string.)
is this something that is done already somewhere?
or is this something that is still researched?
any advice/info is appreciated.

thanks




.



Relevant Pages

  • Re: combining millions of different regular expressions
    ... match a given string with all of them some how. ... merged state machine will have an optimal structure to improve the ... First, be careful with what you mean by matching regular expressions, ... (One of those cases where theory and practice mis-align.) ...
    (comp.theory)
  • Re: The state machine of no input
    ... >> I have been given a state machine, which takes no input, even though it ... >> only language accepted by a state machine taking no input must be the ... >> string and then only if the starting state is an accepting state. ... > How did your textbook define what it means for an automaton to acknowledge ...
    (sci.math)
  • Re: The state machine of no input
    ... > I have been given a state machine, which takes no input, even though it has ... > triggered by nothing i.e. they are triggered by the empty string. ... > only language accepted by a state machine taking no input must be the empty ... How did your textbook define what it means for an automaton to acknowledge ...
    (sci.math)
  • Re: combining millions of different regular expressions
    ... match a given string with all of them some how. ... i know that any regular expression can be transformed to a state ... merged state machine will have an optimal structure to improve the ... If you build a NFA instead of a DFA then space might be ...
    (comp.theory)
  • Re: splitting a string and put it into an array
    ... >> The C library has a totally worthless complement of string parsing ... > really not that hard to use strchrand your own state machine for ... > parsing if strtokdoesn't fit your needs. ... Paul Hsieh ...
    (comp.lang.cpp)