Re: Convert html diacritics to unicode



Running html-entities:decode HTML does the trick. Thanks!
-deech

On Oct 30, 4:19 pm, Tamas K Papp <tkp...@xxxxxxxxx> wrote:
On Thu, 30 Oct 2008 12:33:51 -0700, deech wrote:
Hi all,
I am trying to convert an HTML page that includes accent characters into
unicode. Is there a way to do this is Common Lisp?

Yes.  Unless you need to verify the correctness of the input or need some
output format other than HTML, a simple algorithm that replaces strings
using a table (eg "&eacute" -> "é") should suffice.

Search for the terms "replace string" in the c.l.l archives (eg using
Google groups).

HTH,

Tamas

.



Relevant Pages

  • Re: FAQ Topic - How do I access a frames content? (2009-10-25)
    ... Dr J R Stockton wrote: ... you are not reading any files "HTML or TXT, as it exists on disc" - ... "Given an array of strings with the minimum 2 and the maximum 1o ... find the shortest common word in these strings. ...
    (comp.lang.javascript)
  • Re: FAQ Topic - How do I access a frames content? (2009-10-25)
    ... you are not reading any files "HTML or TXT, as it exists on disc" - ... you were getting so far "by using innerHTML ..., ... "Given an array of strings with the minimum 2 and the maximum 1o ... find the shortest common word in these strings. ...
    (comp.lang.javascript)
  • Re: efficiency of JList setElementAt()
    ... If you modify your program to produce strings about 3 times as long, make the strings HTML with a font color tag, and increase the list size to about 130, I think you'll get the kind of results I cited in my first post. ... The Swing cell renderer design is based on assumptions that construction is expensive and updating values is cheap. ... So let's cache. ...
    (comp.lang.java.gui)
  • Re: Best way to dump HTML strings to a page in ASP.Net?
    ... of HTML as strings into a string builder. ... HTML as text into a string builder. ... longer work, this only happens with a high number of strings being ...
    (microsoft.public.dotnet.framework.aspnet)
  • Re: Another Drop List question
    ... If your database field is set to not allow zero-length strings, ... > ASP code and HTML and for me, it makes it easier to line up my quotes. ...
    (microsoft.public.inetserver.asp.general)