Encoded file

pengz1_at_netzero.com
Date: 03/17/04


Date: Wed, 17 Mar 2004 20:33:41 GMT
To: python-list@python.org


Hi!
How can I tell whether a file is encoded as 'UTF-8', 'UTF-16', or something else? Thanks in advance.

Zhi
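
One way to approach the question above, offered only as a rough sketch: an encoding cannot in general be determined with certainty from the bytes alone, but a byte order mark (BOM) at the start of the file is a strong hint, and a strict UTF-8 decode that succeeds is a reasonable second check. The function name guess_encoding is made up for this illustration; the codecs.BOM_* constants are from the standard library. A result of None means only that these two checks were inconclusive, not that the file is invalid.

```python
import codecs

# BOM table. The 4-byte UTF-32 marks are listed before the 2-byte UTF-16
# marks because the UTF-32-LE BOM (FF FE 00 00) begins with the UTF-16-LE
# BOM (FF FE). The returned names are Python codec names that consume the
# BOM when decoding.
_BOMS = [
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def guess_encoding(data):
    """Return a likely codec name for the bytes in `data`, or None.

    Hypothetical helper: this is a heuristic, not a definitive detector.
    """
    # 1. A leading BOM is the most reliable signal available.
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    # 2. No BOM: see whether the bytes form valid UTF-8.
    #    Pure ASCII also passes, since ASCII is a subset of UTF-8.
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return None
    return "utf-8"
```

To use it on a file, open the file in binary mode and pass the bytes, for example guess_encoding(open(path, "rb").read()), where path is whatever file is being examined. Files without a BOM that are not valid UTF-8 (Latin-1, BOM-less UTF-16, and so on) come back as None and need outside knowledge or a statistical detector.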



