Re: regex for chars 192 to 255




"Jürgen Exner" <jurgenex@xxxxxxxxxxx> wrote in message
news:65sfs3tvuhbcgggibo97325hh3hg9elv3f@xxxxxxxxxx
"John" <john1949@xxxxxxxxx> wrote:
Is there a character class like [a-zA-Z] for accented characters that lie
between ASCII 192 and 255 (excluding 215 and 247)?

No, because there are no such characters. ASCII is a 7-bit encoding and
the
largest possible code point is 127 (0x7f).

If not, what is the easiest way to match those, other that specifying them
indvidually?

In whatever encoding you are using (obviously not ASCII) have you tried
specifying ranges like e.g. [\0xC0-\0xFF]?
This will get you the code points although what characters are encoded in
those code points will be very different depending upon the actual
encoding
used.

jue

Many thanks, gentlemen. That's the solution I was after.
Regards
John


.



Relevant Pages

  • =?utf-8?B?UmU6IFN0cmluZyAiw6LigqzihKIiIHRyYW5zbGF0ZWQgdG8gYXBvc3Ryb3BoZS4gV2h5Pw==?=
    ... it works), though it seems to use mostly just Ascii characters, representing ... but the author is not making the best possible use of UTF-8. ... They don't map it to ASCII apostrophe, ... Latin 1 encoding. ...
    (alt.html)
  • Re: what does "serialization" mean?
    ... it's the most important piece of the ASCII ... ANSI recognized that 128 characters were ... ASCII committee hasn't met to discuss character encoding formats for many, ... Space Invaders or LEM games. ...
    (comp.programming)
  • Re: How do I get unicode support in python?
    ... unable to print any characters outside of ascii. ... What do I need to do to get python on the web server to have unicode ... For Python to be able to "print" unicode characters to the console, ... know the encoding of the console. ...
    (freebsd-questions)
  • Re: Help me!! Why java is so popular
    ... Well, Unicode is not a storage encoding system, or anything like that. ... Unicode is primarily a mapping from characters (in the linguistic conceptual ... French, Russian, Japanese and Korean songs. ...
    (comp.lang.java.programmer)
  • Re: Workable encryption in Tcl??
    ... abstract characters using the concrete UTF-8 encoding, ... character streams and octet streams when doing input and output. ... How does this relate to encryption? ...
    (comp.lang.tcl)