Re: how to convert the characters ASCII(0-255) to ASCII(0-127)



On Fri, 30 Dec 2005, Eric Bohlman wrote:

> Samwyse <samwyse@xxxxxxxxx> wrote in news:Pa1tf.38841$dO2.20814
> @newssvr29.news.prodigy.net:
>
> > Alan, I am in awe of your skills in pedantry. In the future, I
> > promise that I will *never* use the term "ASCII" to mean anything
> > other than whatever it was you just said.
>
> It's not pedantry. The subject of character encodings is one that
> simply can't be meaningfully discussed without using extremely
> precise language;
[...]

Thanks. It might be worth adding, since the original poster is in
..fr, that their data *might* be using the French MS-DOS code page
(this doesn't seem to be listed amongst the Unicode cross-mapping
tables - I'm sure it's listed in my old DOS manual in the office),
although one of my French colleagues, back in MS-DOS days, told me
that he preferred to use the French-Canadian code page instead - that
would be:
http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/PC/CP863.TXT

I already mentioned the possibility of CP850, the Latin1 Multinational
code page. The original poster used the term "OEM", but a search for
"OEM codepage" will easily reveal that there are *many* different
MS-DOS "OEM" codepages: http://www.google.co.uk/search?q=oem+codepage

See also http://www.unicode.org/Public/MAPPINGS/VENDORS/IBM/readme.txt
for some useful notes.

> "you know what I mean" simply won't cut it here because in fact
> different people will come up with *radically* different ideas of
> what you mean. "High ASCII" or "wide ASCII" mean different things
> to different people, because there is simply no common definition
> for them (which in turn comes from the fact that they're inherently
> contradictory).

Quite.

Things aren't helped by the fact that MS mischievously refer to their
proprietary Windows character encoding(s) as "ANSI". On finding
contradictory assertions about this, I researched further, and am
convinced that the (US-)American National Standards Inst. has never
published such a specification. After they had initially discussed a
US specification for an ASCII-based 8-bit character coding, they
wisely decided not to have one, and adopted the international
iso-8859-1 specification instead.

Not that it's directly relevant to the present question, but I
concluded that a conscientious author would avoid referring to
Windows-1252 (or to the Windows-125x family of codings) as "ANSI"
character coding(s).

best regards
.



Relevant Pages

  • Re: How many years dedicated to characters learning in China ?
    ... how many years of school do ... Each character is a morpheme. ... These are called "radicals". ... "gouvernement" on a news headline of a French newspaper means. ...
    (sci.lang)
  • Re: That German news server.. .
    ... A Brilliant Chinese pop singer. ... Formerly known as Lodgers Nelson ... It is full of what we call in French ... And each character, like everywhere, varies according to their ...
    (uk.misc)
  • Re: why cant my message be searched
    ... the concept of 12 stats rather than the usual 6; ... (sorry, I'm french, so I have to translate them into ... at example 2, if the character had 18 in toughness, that should give ...
    (rec.games.frp.dnd)
  • Re: Jimmy, jimmy, jimmy, the honky tonk blues
    ... One night in a bar a French guy was playing the guitar ... His guitar playing was good but his ... command of English frequently detracted from my enjoyment. ... "Character" allows me to slip into another amusing less than perfect ...
    (sci.lang)
  • Re: Pakistan vs England Summer 06
    ... Pedantry on top of your racist boorishness. ... facets to your character you'd like to share with us? ...
    (uk.sport.cricket)