Re: not quite 1252
- From: "Fredrik Lundh" <fredrik@xxxxxxxxxxxxxx>
- Date: Wed, 26 Apr 2006 14:42:10 +0200
Anton Vredegoor wrote:
I'm trying to import text from an open office document (save as .sxw and
read the data from content.xml inside the sxw-archive using
elementtree and such tools).
The encoding that gives me the least problems seems to be cp1252,
however it's not completely perfect because there are still characters
in it like \93 or \94. Has anyone handled this before?
this might help:
http://effbot.org/zone/unicode-gremlins.htm
</F>
.
- Follow-Ups:
- Re: not quite 1252
- From: Anton Vredegoor
- Re: not quite 1252
- References:
- not quite 1252
- From: Anton Vredegoor
- not quite 1252
- Prev by Date: Re: Introspection Class/Instance Name
- Next by Date: Re: KeybordInterrupts and friends
- Previous by thread: not quite 1252
- Next by thread: Re: not quite 1252
- Index(es):
Relevant Pages
|