Re: East European characters from LaTeX to UTF-8



On Fri, 30 Nov 2007 08:11:12 -0800, Francois wrote:

Hi
With the modules TeX::Encode and Encode, I convert characters from LaTeX
to UTF-8. It works great except for characters used in Slovakia, for
example c or z with caron: č ž

Which encoding are your original LaTeX files in? Plain 7-bit ASCII, or
ISO-8859-1 with LaTeX markup for the special characters, or something else?

If it is something else, it may help to read the LaTeX files through the
matching "lower level" PerlIO encoding layer. For example, if the files
are in cp1250:

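use Encode; use TeX::Encode;
open my $fh, "<:encoding(cp1250)", "/my/latex/file.tex" or die $!;
my $tex = do { local $/; <$fh> };   # slurp the whole file; decode() takes a single string
binmode STDOUT, ":encoding(UTF-8)"; # emit the decoded characters as UTF-8
print decode('latex', $tex);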
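
To see what the encoding layer does on its own, here is a minimal sketch that uses only the core Encode module (no TeX::Encode). The sample bytes are made up for illustration: they are the cp1250 codes for "č ž", read through an in-memory filehandle instead of a real file.

```perl
use strict;
use warnings;
use Encode qw(encode);

# Assumed sample data: the cp1250 bytes for "č ž" (0xE8 and 0x9E).
my $bytes = "\xE8 \x9E\n";

# Read the bytes through the cp1250 layer, using an in-memory filehandle.
open my $fh, '<:encoding(cp1250)', \$bytes or die $!;
my $text = do { local $/; <$fh> };   # slurp everything into one string
close $fh;

# $text now holds characters, not bytes: list the non-space code points.
my @codepoints = map { sprintf 'U+%04X', ord } grep { /\S/ } split //, $text;

# Encode explicitly to UTF-8 octets and print them unchanged.
my $utf8 = encode('UTF-8', $text);
binmode STDOUT, ':raw';
print "@codepoints\n";   # U+010D U+017E
print $utf8;
```

If the code points come out differently for your own files, the layer name probably does not match the encoding the files were actually saved in.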

See also the manpages for perlio and Encode.

Joost.


