Re: Brian Kernighan, maybe I'm not worthy, maybe I'm scum



"Richard Heathfield" <rjh@xxxxxxxxxxxxxxx> wrote in message
Malcolm McLean said:

The identity of char and byte, with hindsight, was a mistake.

I'm not convinced, Malcolm. What makes you think it's a mistake?

There's no logical connection between the number of bits used to represent a human-readable character and the smallest addressible unit of memory.
Typically bytes are 8 bits and chars are ASCII, so sizeof(char) equals 1 byte, but that's just a coincidence.

However it has the good effect that it encourages the use of ASCII,
which is the one de facto universal standard for data representation.

No, it isn't. For one thing, EBCDIC is used to store a vast amount of data. >
I doubt that there are many such machines without a heavily used ASCII to EBCDIC conversion utility. However if I have one on my PC I don't know where to find it. You can pretty much guarantee that an ASCII file will be human-readable on your machine.

For another, how, precisely, do you represent (in the sense of
"symbolise") the Polish for "represent" (in the sense of "symbolise")
using ASCII? ASCII unaccountably lacks the c-acute character which is
necessary for this representation to be possible.

&c_acute;
or something similar. Readable in English without special software, on in Polish with the right fonts installed.

But you have said yourself that ASCII needs to be replaced by something
that can represent non-English languages.

Yes. If we were inventing the computer from scratch we wouldn't use ASCII. Unfortunately the "cover every glyph" approach has the problem that the fonts are then too difficult to implement, also that keyboards typically don't have the characters. So ASCII has stuck.

--
Free games and programming goodies.
http://www.personal.leeds.ac.uk/~bgy1mm

.



Relevant Pages

  • Re: what does "serialization" mean?
    ... Sorry eddie, but you're dead wrong there as usual. ... >>How about ASCII character 0xB0, ... > Totalitarians and Fascists are often self-appointed language police. ...
    (comp.programming)
  • Re: what does "serialization" mean?
    ... > attempt to present myself as an authority on any and every topic I have ... >> survived and EBCDIC did not because ASCII properly sequenced letters. ... > How about ASCII character 0xB0, ... >> must assert negative facts, for all he knows is there is no knowledge ...
    (comp.programming)
  • Re: Cohens paper on byte order
    ... I think you're using "ASCII" in a notional sense. ... a good reason to teach the *opposite* convention, ... Computers should be as easy to understand as is possible _without_ ... arithmetic on character strings ...
    (sci.crypt)
  • Re: Reading a file.
    ... your program will interpret them as ASCII. ... Bruce.Eitman AT EuroTech DOT com ... buffer is character values, then in memory ASCII values are displayed. ... DWORD d = GetLastError; ...
    (microsoft.public.windowsce.app.development)
  • Re: Get ASCII values for PC arrow keys?
    ... those responsible for standards usually do attempt ... ASCII is a character set, ... ISO/IEC registry for character sets for them to receive identifying ...
    (alt.comp.lang.learn.c-cpp)