Wide character, portable function to parse words like O'Clock as one word?
In English, words like "O'Clock" contain an embedded character
which the C function iswpunct() classifies as punctuation. So
in order to tokenize a string of text containing this type of
word properly, one cannot simply use wcstok(); special
rules like "a quote immediately followed and preceded by an alphabetic
character is not treated as punctuation" must be added.
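
To make the rule concrete, here is a minimal sketch of the kind of
hand-written tokenizer I mean. The names is_inner_apostrophe() and
next_word() are made up for illustration and are not standard functions.
It only handles the ASCII apostrophe (U+0027), and it assumes that a
"word" is a run of iswalnum() characters plus any such embedded quote.

```c
#include <stddef.h>
#include <wchar.h>
#include <wctype.h>

/* Hypothetical helper: nonzero if s[i] is an apostrophe that has an
 * alphabetic character immediately before and immediately after it.
 * If s[i] is the last character, s[i + 1] is the terminating L'\0',
 * which iswalpha() rejects, so no read goes past the string. */
static int is_inner_apostrophe(const wchar_t *s, size_t i)
{
    return s[i] == L'\'' && i > 0 &&
           iswalpha((wint_t)s[i - 1]) && iswalpha((wint_t)s[i + 1]);
}

/* Hypothetical helper: find the next word at or after offset *pos.
 * Returns a pointer to the first character of the word and stores its
 * length in *len, or returns NULL when no word is left. *pos is
 * advanced past the word so the function can be called in a loop.
 * The input string is not modified (unlike wcstok()). */
static const wchar_t *next_word(const wchar_t *s, size_t *pos, size_t *len)
{
    size_t i = *pos;
    size_t start;

    /* Skip everything that cannot start a word. */
    while (s[i] != L'\0' && !iswalnum((wint_t)s[i]))
        i++;
    if (s[i] == L'\0') {
        *pos = i;
        return NULL;
    }

    /* Consume word characters, keeping embedded apostrophes. */
    start = i;
    while (s[i] != L'\0' &&
           (iswalnum((wint_t)s[i]) || is_inner_apostrophe(s, i)))
        i++;

    *pos = i;
    *len = i - start;
    return s + start;
}
```

Called repeatedly with pos starting at 0, this keeps "O'Clock" and
"It's" whole, while a quote at the start or end of a word, as in
'quoted', is still dropped as punctuation. The rule is hard-coded for
English, which is exactly the part I would like to get from the locale
instead.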
What I'm wondering is whether there is a standard function to do this
somewhere in the "w" set of functions which were added for multilingual
support. I mean, I know what the rules are for English, but the whole
point of the wide characters is to support other languages portably, and
it would seem that somewhere in the LC_CTYPE information set this
information should be present and accessible. That said, I have yet to
find anything in there which seems appropriate. Is there such a function?
Thanks,
David Mathog