Re: RTF and UTF-8 files in Perl



Matt Garrish wrote:

>
> "V S Rawat" <VSRawat@xxxxxxxxxxxx> wrote in message
> news:xn0e93zst5r39c000@xxxxxxxxxxx
> > 1. How do I open a RTF file as input in Perl and read
> > formatted ASCII text from it?
> >
>
> You don't. There's no such thing as formatted ASCII text. You
> could look into a module such as RTF::Tokenizer if you want to
> parse apart the RTF file and extract the text from it. If you
> want to know what formatting has been applied you'll also need
> to check the formatting commands as you go.
>
> > 2. How do I open a UTF-8 (Unicode) file as output in Perl and
> > write Unicode text to it?
> >
>
> Assuming your data is not already utf8:

I am reading some unicode char codes, and converting them to
display the unicode chars.

Say, my code is ($cod = ) 0964 (hexadec chars) which should lead
to a unicode char. But, when I do $char = chr($cod), end up
getting "d" in $char.

How do I get 0964 to give the unicode char it represents.

>
> [untested]
>
> use Encode;
>
> open(my $out, '>:utf8', 'somefile.dat') or die "Could not open
> file: $!"; print $out Encode::encode('utf8', $mydata);
> close($out) or die "Could not close file: $!";
>
> Otherwise you could skip the encoding step.
>
> Matt

Thanks Matt.
--
Rawat
.