Wide character, portable function to parse words like O'Clock as one word?



In English words like "O'Clock" contain an embedded character
which the C function iswpunct() classifies as punctuation. So
in order to tokenize a string of text containing this type of
word properly one cannot simply use wcstok(), special
rules like "a quote immediately followed and preceded by an alphabet character is not treated as punctuation" must be added.

What I'm wondering is if there is a standard function to do this somewhere in the "w" set of functions which were added for multilingual support? I mean, I know what the rules are for English, but the whole point of the wide characters is to support other languages portable, and it would seem the somewhere in the LC_CTYPE information set this information should be present and accessible. That said, I have yet to find anything in there which seems appropriate. Is there such a function?

Thanks,

David Mathog
.



Relevant Pages

  • Re: Wide character, portable function to parse words like OClock as one word?
    ... which the C function iswpunctclassifies as punctuation. ... character is not treated as punctuation" must be added. ... point of the wide characters is to support other languages portable, ... it would seem the somewhere in the LC_CTYPE information set this ...
    (comp.lang.c)
  • Re: Track Changes VBA Granular information needed
    ... MsgBox "Deleted Period" ... Add more ElseIf clauses for any other punctuation you want to catch. ... one character) will show up in the table column that contains .Range.Text. ... Since Track Changes creates a Delete Bubble on the document, ...
    (microsoft.public.word.vba.beginners)
  • Re: Strong Passwords Revisited
    ... >increase the password entropy if you enforce rules like "password must ... If your users were using English words before, ... the attacker is tasked with doing a brute-force attack. ... least one character to be a digit - you've actually removed a portion of the ...
    (comp.security.misc)
  • Re: Strong Passwords Revisited
    ... >increase the password entropy if you enforce rules like "password must ... If your users were using English words before, ... the attacker is tasked with doing a brute-force attack. ... least one character to be a digit - you've actually removed a portion of the ...
    (comp.os.ms-windows.nt.admin.security)
  • Re: Strong Passwords Revisited
    ... >increase the password entropy if you enforce rules like "password must ... If your users were using English words before, ... the attacker is tasked with doing a brute-force attack. ... least one character to be a digit - you've actually removed a portion of the ...
    (microsoft.public.win2000.security)