RE: forcing utf8 on selected values






On Wed, 29 Mar 2006, Mark Hedges wrote:

Hmm, for some reason this morning the Matrix has allowed the
selected characters to display correctly in the browser. This
doesn't make a whole lot of sense to me now.

The scalars still do not have the utf8 flag set, though.

This is really weird. All the characters display correctly in
the browser now EXCEPT what used to be a generic apostrophe,
which now comes across as this non-printable character:

U+0092 <control>

General Character Properties

Unicode category: Other, Control

Various Useful Representations

UTF-8: 0xC2 0x92
Octal escaped UTF-8: \302\222
Decimal entity reference: &#146;

Annotations and Cross References

Alias names:
PRIVATE USE TWO

But this displays fine in mysql console on my terminal.

Here I am talking to myself in public again.

Scratch that -- the original source database is corrupt and
actually has this character stored in these fields. Why they
display correctly as quotes in the terminal or when viewed in
western encoding is a mystery.

Mark
.



Relevant Pages

  • Thunderbird bugs [was: lots of other topics]
    ... Question marks are very, very specific thing and has very, very specific cause - written down in my previous e-mail - or in my Outlook Express instruction (same issue in Thunderbird and OE): ... interest in non-ASCII character sets comes partly from the fact that I ... It's a problem because the web browser designers ... specify a language at the sending end and a preferred language at the ...
    (alt.usage.english)
  • Re: overwriting characters
    ... character in the z dimension, or "overprint" a character, though ... However it does have a weird side-effect in my FireFox 2.0.0.11. ... As soon as I resize the browser window, ... The orderly method to do this would be to make a new font set ...
    (alt.html)
  • Re: Parsing large amounts of data (200,000 entries) with XML?
    ... > white space in all character sets. ... > and if the browser is set to another character set, ... I messed with the browser settings for making fonts ...
    (microsoft.public.vb.general.discussion)
  • Re: which quotation marks to use
    ... character. ... render the same in my browser. ... Note that if you use the curly quotes, you should - for stylistic uniformity - also use curly single quotes and apostrophes (instead of ASCII apostrophe, '), en or em dashes or minus sign when appropriate instead of ASCII hyphen, and maybe even the horizontal ellipsis character instead of "...". ...
    (comp.infosystems.www.authoring.html)
  • Setting the encoding in the basic auth header
    ... The user can use any unicode character in the username ... encoded by the browser before transmission. ... to encode the data as utf-8 before sending it over? ... any way I can get them to encode the data with utf-8 instead? ...
    (comp.lang.python)