Re: CLOB issue with JDBC, IBATIS and Oracle 10g



RedGrittyBrick wrote:
Roedy Green wrote:
On Sat, 22 Sep 2007 09:57:56 -0400, Lew <lew@xxxxxxxxxxxxx> wrote, quoted or indirectly quoted someone who said :


Small nit: the number of bytes per char is variable, depending on
the database's encoding.


In a pathological case UTF-8 can take 3 bytes per char.

UTF-8 takes *four* octets for a code point outside Unicode's base plane. For example the cuneiform numeric sign three sharu (U+1242E) is
F0 92 90 AE

How wonderfully ironic that there's a Unicode block (and a UTF-8 encoding!) devoted to the cuneiform script, a data storage format that remains readable almost five thousand years after it was invented.

How many of our databases will still be readable in 7000 AD?

Thanks for sharing that. It made my day :-)

David Harper
Cambridge, England
.



Relevant Pages

  • Re: lost mysql root password
    ... Not 02, but $20's, eg an ascii space char. ... this is encoding related. ... the database reserves multiple bytes ... To UNSUBSCRIBE, email to debian-user-REQUEST@xxxxxxxxxxxxxxxx ...
    (Debian-User)
  • Re: lost mysql root password
    ... On Friday 03 March 2006 04:38 pm, Gene Heskett wrote: ... Not 02, but $20's, eg an ascii space char. ... this is encoding related. ... the database reserves multiple bytes ...
    (Debian-User)
  • Re: Need help on string manipulation
    ... unlike characters using char, which may use a multibyte encoding. ... If any unicode char can be encoded in a single ...
    (comp.lang.c)
  • Re: WCHAR conversion problem
    ... and I'm using an API with WCHARs... ... The times where you could assume that a char* is just a char* in terms of an ASCII string are long over. ... The multibyte encoding from the mb* string functions is only one of many. ...
    (microsoft.public.vc.language)
  • Re: CLOB issue with JDBC, IBATIS and Oracle 10g
    ... the database's encoding. ... In a pathological case UTF-8 can take 3 bytes per char. ... For example the cuneiform numeric sign three sharu is ...
    (comp.lang.java.databases)