Need advice on spell-checking for city names

From: Marc Bissonnette (dragnet_at_internalysis.com)
Date: 12/30/03


Date: Tue, 30 Dec 2003 00:16:13 GMT

Hi all;

I'm hoping someone can point me in the direction of an FAQ or other
appropriate reading material in order to learn how to solve my problem.

I have users submitting multiple city names in a perl application and this
often results in multiple spellings, depending either on their local
dialect, mis-spellings, or copy-and-pasting from a micro$oft app.

For example, the city of Montréal has appeared as

Montréal
Montreal
MontrÈal

Or I'll get close-but-not-correct spellings, like

Ottawa (correct)
Otawa
Ottawaa
Autawa

etc.

How can I go about reducing the numbers of these incidences ? Providing
menus of all possible cities is not feasible, since there are multiple
areas within each province that each need a city list...

Many thanks in advance for insights.

-- 
Marc Bissonnette
CGI / Database / Web Management Tools: http://www.internalysis.com
Something To Sell? Looking To Buy? http://www.whitewaterclassifieds.ca
Looking for a new ISP? http://www.canadianisp.com


Relevant Pages

  • Re: Need advice on spell-checking for city names
    ... I have users submitting multiple city names in a perl application and this ... often results in multiple spellings, ... Ottawa ...
    (comp.lang.perl.misc)
  • Re: What is Pick anyway?
    ... CITY and STATE as a lookups into the POSTAL file. ... > LIST PEOPLE ID FIRSTNAME LASTNAME POSTALCODE ... > an attribute and multiple sub-values in a value and in some versions, ... In a simple SQL table design, you would need to reserve a certain number of ...
    (comp.databases.theory)
  • Re: Trying to wrap my head around splitting up & combining tables
    ... Address tables are a thorny problem in normalization terms. ... multiple times can introduce redundancy with all the problems that entails. ... Both Michigan and Minnesota have a city called Grand ... One step down from full normalization is to just have separate City, State, ...
    (microsoft.public.access.tablesdbdesign)
  • Re: Multiple names per row in table-How to print labels
    ... If companies have multiple contacts, ... Each contact should have a separate record in a "contact ... > I'm trying to set-up a mailing label for selected entries via a form with ... > City, St, Zip City, St, Zip City, St, Zip ...
    (microsoft.public.access.reports)
  • RE: merging information from partial duplicate rows
    ... "Todd" wrote: ... records once I've got them in a single row. ... each break in city, however (which you'd have to use some human logic to be ... I have multiple spellings of the same town, ...
    (microsoft.public.excel.misc)