Re: To unicode or not to unicode



Ross Ridge (Sat, 21 Feb 2009 14:52:09 -0500)
Except that in practice, unlike Python, many newsreaders don't assume ASCII.

Thorsten Kampe <thorsten@xxxxxxxxxxxxxxxx> wrote:
They assume ASCII - unless you declare your charset (the exception being
Outlook Express and a few Windows newsreaders). Everything else is
"guessing".

No, it's an assumption like the way Python by default assumes ASCII.
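
To make the difference between the two assumptions concrete, here is a minimal sketch in Python 3 syntax. The sample text and the helper name decode_assuming are made up for illustration; they are not taken from the original article or from any newsreader.

```python
# Bytes as they might appear in an article body that declares no charset.
# The sample text is an assumption for illustration only.
raw = "Grüße".encode("iso-8859-1")

def decode_assuming(data, charset):
    """Decode data under an assumed charset; return None if the bytes are invalid for it."""
    try:
        return data.decode(charset)
    except UnicodeDecodeError:
        return None

# A strict ASCII assumption rejects any byte >= 0x80.
print(decode_assuming(raw, "ascii"))

# An ISO 8859-1 assumption maps every byte value to a character.
print(decode_assuming(raw, "iso-8859-1"))
```

Either call applies a fixed default rather than inspecting the bytes; the two defaults simply differ in which inputs they accept.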

The original article displayed fine for me. Google Groups displays it
correctly too:

http://groups.google.com/group/comp.lang.python/msg/828fefd7040238bc

Your understanding of the principles of Unicode is at least as
non-existent as the OP's.

The link demonstrates that Google Groups doesn't assume ASCII like
Python does. Since popular newsreaders like Google Groups and Outlook
Express can display the message correctly without the MIME headers,
but your obscure one can't, there's a much stronger case to be made that
it's your newsreader that's broken.

I could just as easily argue that assuming ISO 8859-1 is the de facto
standard, and that it's your newsreader that's broken.

There is no "standard" in regard to guessing (this is what you call
"assuming"). The need for explicit declaration of an encoding is exactly
the same in Python as in any Usenet article.

No, many newsreaders don't assume ASCII by default like Python.

The reality however is that RFC 1036 is the only standard for Usenet
messages, de facto or otherwise, and so there's nothing wrong with
anyone's newsreader.

The reality is that all non-broken newsreaders use MIME headers to
declare and interpret the charset being used.

Since RFC 1036 doesn't require MIME headers, a reader that doesn't generate
them is by definition not broken.
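
As a rough sketch of what a reader could do in both cases, the following uses Python 3's standard email package to look for a declared charset and to fall back to an assumed one when no MIME headers are present. The two sample articles, the example.invalid address, the helper names, and the ISO 8859-1 fallback are all assumptions for illustration, not the behaviour of any particular newsreader.

```python
from email import message_from_bytes

# Hypothetical article with MIME headers declaring its charset.
with_mime = (
    b"From: poster@example.invalid\r\n"
    b"Subject: test\r\n"
    b"MIME-Version: 1.0\r\n"
    b"Content-Type: text/plain; charset=ISO-8859-1\r\n"
    b"\r\n"
    b"Gr\xfc\xdfe\r\n"
)

# The same hypothetical article with RFC 1036 style headers only.
without_mime = (
    b"From: poster@example.invalid\r\n"
    b"Subject: test\r\n"
    b"\r\n"
    b"Gr\xfc\xdfe\r\n"
)

def declared_charset(raw_article):
    """Return the charset named in Content-Type, or None if none is declared."""
    return message_from_bytes(raw_article).get_content_charset()

def decode_body(raw_article, fallback="iso-8859-1"):
    """Decode the body with the declared charset, else with an assumed fallback."""
    msg = message_from_bytes(raw_article)
    charset = msg.get_content_charset() or fallback
    body = msg.get_payload(decode=True)  # raw body bytes
    return body.decode(charset, errors="replace")

print(declared_charset(with_mime))     # charset taken from the MIME header
print(declared_charset(without_mime))  # nothing declared
print(decode_body(without_mime))       # decoded under the assumed fallback
```

Whether the fallback should be ASCII, ISO 8859-1, or something else is exactly the point under dispute; the sketch only shows that the choice is a parameter of the reader, not something the header-less article dictates.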

Ross Ridge

--
l/ // Ross Ridge -- The Great HTMU
[oo][oo] rridge@xxxxxxxxxxxxxxxxxxx
-()-/()/ http://www.csclub.uwaterloo.ca/~rridge/
db //