East european characters from LaTex to UTF8
Hi
With the module TeX::Encode and Encode, I convert characters from
LaTex to UTF8. It works great except for characters use in Slovacia,
for example c or z with caron: č ž
TeX::Encode use the followings modules
use Encode::Encoding;
use Pod::LaTeX;
use HTML::Entities
and from the comments in TeX::Encode "It uses the the mapping from
Pod::LaTeX, but we use HTML::Entities
to get the Unicode character".
Is there another module I should install to convert these east
european characters ?
Thanks for any advice !
Francois
.
Relevant Pages
- Re: unicode conversion
... breaks utf8 output of Chinese characters to an otherwise perfectly utf8- transparent console, see my XML::Simple and utf8 woe posting of ... As I explained in the other thread, what's probably happening is that, without -CS, your data is being read in by Perl as octets, then printed out as octets; however, under -CS your data is still read as octets yet printed to a UTF8-aware filehandle. ... my latest experience is with bulk quantities of utf8 data (latin, CJK material, _tons_ of characters with accents and diacritics in one soup). ... When I try to segment such a string with approx. ... (comp.lang.perl.misc) - Re: Reg multilanguage support by gnuplot
... So far as I know, Matsuda's postscript example used EUC encoding, not utf8. ... Your original query was about an EUC font. ... Unicode is an assignment of "all" characters to unique ... # Test of UTF-8 support by gnuplot terminals. ... (comp.graphics.apps.gnuplot) - Re: utf8 Problems
... I converted to utf8 in the hope that my non ASCII character problems ... use all sorts of special characters, limited only by the fonts you have ... encoding in a standardized way, for example in plain text files. ... $ locale | grep -v en_US ... (Debian-User) - Re: perl 5.6 multi byte
... GB1312 is in fact GB2312 and is used for Simplified Chinese. ... Both GB2312 and ShiftJIS are double byte character sets (DBCS). ... Some characters have on byte, ... they are very different from utf8. ... (comp.lang.perl.modules) - Re: perl 5.6 multi byte
... GB1312 is in fact GB2312 and is used for Simplified Chinese. ... Both GB2312 and ShiftJIS are double byte character sets (DBCS). ... Some characters have on byte, ... they are very different from utf8. ... (comp.lang.perl.misc) |
|