Re: Parser Generator?



On Aug 18, 11:37 pm, "Jack" <nos...@xxxxxxxxxxx> wrote:
Thanks for all the replies!

SPARK looks promising. Its documentation doesn't say whether it
handles Unicode (CJK in particular), though.

Yapps also looks powerful: http://theory.stanford.edu/~amitp/yapps/

There's also PyGgy: http://lava.net/~newsham/pyggy/

I may also give Antlr a try.

If anyone has experience using any of these parser generators with CJK
languages, I'd be very interested in hearing about it.

Jack

"Jack" <nos...@xxxxxxxxxxx> wrote in message

news:abKdnVoQMu2o7FrbnZ2dnUVZ_gqdnZ2d@xxxxxxxxxxxxxx



Hi all, I need to do syntax parsing of simple natural languages,
for example, "weather of London" or "what is the time", simple
things like these, with Unicode support in the syntax.

In Java, there are JavaCC, Antlr, etc. I wonder what people use
in Python? Antlr also has Python support but I'm not sure how good
it is. Comments/hints are welcome.

Jack -

Pyparsing was already mentioned once in this thread. Here is an
application that uses pyparsing to parse Chinese characters and
convert them to English Python.

http://pypi.python.org/pypi/zhpy/0.5
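
For queries as simple as the ones in the original post, here is a minimal sketch of Unicode-aware matching. Note that it uses only the standard library `re` module, not pyparsing or any of the generators named in this thread. The two-pattern grammar, the `parse_query` helper, and the Chinese phrasing are all made up for illustration, and it assumes Python 3, where `str` is Unicode and `\w` matches CJK characters as well as Latin letters.

```python
import re

# Hypothetical toy grammar, for illustration only:
#   query := ATTRIBUTE "of" PLACE     e.g. "weather of London"
#          | PLACE "的" ATTRIBUTE      e.g. "伦敦的天气"
ENGLISH = re.compile(r"^\s*(?P<attribute>\w+)\s+of\s+(?P<place>\w+)\s*$")

# "的" is itself a word character, so the place group is non-greedy
# and stops at the first "的" it can split on.
CHINESE = re.compile(r"^\s*(?P<place>\w+?)的(?P<attribute>\w+)\s*$")


def parse_query(text):
    """Return a dict with 'attribute' and 'place', or None if nothing matches."""
    for pattern in (ENGLISH, CHINESE):
        match = pattern.match(text)
        if match:
            return match.groupdict()
    return None


if __name__ == "__main__":
    for query in ("weather of London", "伦敦的天气", "what is the time"):
        print(query, "->", parse_query(query))
```

Anything beyond fixed phrase shapes like these (nesting, alternatives, optional words) is probably where a real parser generator or pyparsing starts to pay off.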

-- Paul
