Re: Library function to detect UTF-8 streams without BOM
- From: "Remy Lebeau \(TeamB\)" <no.spam@xxxxxxxxxxx>
- Date: Fri, 14 Dec 2007 11:14:01 -0800
"Franz-Leo Chomse" <franz-leo.chomse@xxxxxxxx> wrote in message
news:3g05m392tko91h002uoteb281lcj9dldk1@xxxxxxxxxx
For XML files, UNICODE is the default character set,
any other one has to be declared.
If no "Encoding" attribute is present in the XML's prolog, and no Encoding
is specified externally (such as in a MIME header), then the XML has to be
encoded in either UTF-8 or UTF-16, depending on the presence of a BOM. If a
BOM is present, either UTF-8 or UTF-16 can be used, but if a BOM is not
present then UTF-8 must be used. This is clearly outlined in section 4.3.3
of the XML 1.0 spec.
Gambit
.
- References:
- Library function to detect UTF-8 streams without BOM
- From: marek jedlinski
- Re: Library function to detect UTF-8 streams without BOM
- From: Franz-Leo Chomse
- Library function to detect UTF-8 streams without BOM
- Prev by Date: Re: Looking for a cheque printing system, MICR fonts ready
- Next by Date: Re: Library function to detect UTF-8 streams without BOM
- Previous by thread: Re: Library function to detect UTF-8 streams without BOM
- Next by thread: Re: Library function to detect UTF-8 streams without BOM
- Index(es):
Relevant Pages
|
|