Re: Brian Kernighan, maybe I'm not worthy, maybe I'm scum



Malcolm McLean said:


"spinoza1111" <spinoza1111@xxxxxxxxx> wrote in message
You have told me, for example, that C is fully aware of international
strings. But when I try to re-use this code in .Net, it can't be
called directly from C Sharp using a String object, because its
interface appears in C Sharp as consisting of sbyte arrays. I fear
that there is no way of converting sbyte arrays to and from two-byte
wide character arrays without an extra loop...although there "might"
be a single Pentium instruction to do so.

The identity of char and byte, with hindsight, was a mistake.

I'm not convinced, Malcolm. What makes you think it's a mistake?


However it has the good effect that it encourages the use of ASCII, which
is the one de facto universal standard for data representation.

No, it isn't. For one thing, EBCDIC is used to store a vast amount of data.
For another, how, precisely, do you represent (in the sense of
"symbolise") the Polish for "represent" (in the sense of "symbolise")
using ASCII? ASCII unaccountably lacks the c-acute character which is
necessary for this representation to be possible.

Non-English languages can be build on top of ASCII, as has happened
successfully with HTML.

In other words, ASCII is inadequate to the task. I agree. And non-English
languages could be built on top of any inadequate encoding; there's
nothing special about ASCII.

Non-ASCII representations certainly exist, but
they risk being unreadable on some platforms,

So does ASCII.

and in fact they all have
weaknesses which make them undesireable as replacements for ASCII.

But you have said yourself that ASCII needs to be replaced by something
that can represent non-English languages.

--
Richard Heathfield <http://www.cpax.org.uk>
Email: -http://www. +rjh@
Google users: <http://www.cpax.org.uk/prg/writings/googly.php>
"Usenet is a strange place" - dmr 29 July 1999
.



Relevant Pages

  • Re: Brian Kernighan, maybe Im not worthy, maybe Im scum
    ... called directly from C Sharp using a String object, ... interface appears in C Sharp as consisting of sbyte arrays. ... However it has the good effect that it encourages the use of ASCII, which is the one de facto universal standard for data representation. ...
    (comp.programming)
  • Re: Brian Kernighan, maybe Im not worthy, maybe Im scum
    ... interface appears in C Sharp as consisting of sbyte arrays. ... and ASCII today use 8 bits. ... ASCII unaccountably lacks the c-acute character which is ... modern strings on .Net use two bytes consistently? ...
    (comp.programming)
  • Re: Look what i made!
    ... than the rest..and very sharp. ... Oh okay thanks for the explanation on the ASCII ... Bit of a thread jack, have you done one of the nuts up too tight so the ...
    (rec.sport.unicycling)
  • Re: Brian Kernighan, maybe Im not worthy, maybe Im scum
    ... Malcolm McLean said: ... Well, ASCII is of course 7 bits, so it isn't even a coincidence. ... character sets, and 8 bits was sufficient to represent either of them. ... ASCII to EBCDIC conversion is commonly done ...
    (comp.programming)