Re: parse a textfile containing japanese characters



tej wrote:
Hi Hiwa,
I have to parse all the Japanese characters in a text file.
I am not able to search for occurrences of any Japanese character in a given
string.
Can you help me?
tej
I asked what the problem is.
For instance, are you unable to write a Japanese search string in your
source code because you lack a proper IM (input method) or font set?
Could you post a small demo program that is compilable and runnable
as-is and reproduces your problem?
See: http://homepage1.nifty.com/algafield/sscce.html and
http://www.yoda.arachsys.com/java/newsgroups.html
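
For reference, here is a rough sketch of what such a demo might look like. It is not tej's code, only an illustration under some assumptions: the class name JapaneseSearch, the search string, and the default encoding UTF-8 are all made up for the example, and the file may well be in Shift_JIS or EUC-JP instead. It writes the search string with Unicode escapes, so no IM or Japanese font is needed to edit the source, and it treats only hiragana, katakana and the basic CJK ideograph block as "Japanese", which is a simplification.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.lang.Character.UnicodeBlock;
import java.nio.charset.Charset;
import java.nio.file.Files;
import java.nio.file.Paths;

public class JapaneseSearch {

    // True if the code point lies in a block commonly used for Japanese text.
    // Assumption: hiragana, katakana and basic CJK ideographs are enough here.
    public static boolean isJapanese(int codePoint) {
        UnicodeBlock block = UnicodeBlock.of(codePoint);
        return block == UnicodeBlock.HIRAGANA
            || block == UnicodeBlock.KATAKANA
            || block == UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS;
    }

    // Counts the Japanese characters in a string, one code point at a time.
    public static int countJapanese(String text) {
        int count = 0;
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (isJapanese(cp)) {
                count++;
            }
            i += Character.charCount(cp);
        }
        return count;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 1) {
            System.err.println("usage: java JapaneseSearch <file> [charset]");
            return;
        }
        // Hypothetical search string: U+65E5 U+672C U+8A9E, the word "nihongo".
        String needle = "\u65E5\u672C\u8A9E";
        // The charset must match how the file was actually saved.
        Charset charset = Charset.forName(args.length > 1 ? args[1] : "UTF-8");

        try (BufferedReader reader =
                 Files.newBufferedReader(Paths.get(args[0]), charset)) {
            String line;
            int lineNo = 0;
            while ((line = reader.readLine()) != null) {
                lineNo++;
                System.out.println(lineNo + ": " + countJapanese(line)
                    + " Japanese chars, search string at index "
                    + line.indexOf(needle));
            }
        }
    }
}
```

If a search like this finds nothing, the usual suspect is a mismatch between the charset used to read the file and the one it was saved in, so passing the encoding explicitly (for example Shift_JIS as the second argument) is worth trying first.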
