Re: Loose comparison



In article <42e7a7e2@xxxxxxxxxxxxxxxxxxxxxx>, Stevewarby@xxxxxxxxxxxxxx
says...
> We collect vehicle registration numbers eg V5LTG N806FCH etc from tickets
> and manually enter them into the database. We are now automating the system
> whereby clients enter the details online or are sent via pocket pcs.
>
> The problem is we are ending up with duplicate entries because some
> intuprits a 5 as an S an 0 as O etc. eg W520HYT becomes WS20HYT and we end
> up with two entries for the same vehicle.
>
> The data actually arrives twice. The top hard copy is sent by the client the
> carbon copy is sent by our staff therefore we to check to see if the
> registration is already on the database.
>
> Is there a fuzzy compare routine / component that could bring back results
> from the existing database and the client could then make an informed
> decision about the data.
>

See this entry at code central:

http://cc.borland.com/Item.aspx?id=20126

"Strcomp2k is an enhanced string comparison function that provides
better results than Levenstein or other methods. It takes into account
common typing and phonetic errors. It is ideally suited for record
matching in data scrubbing and other lookup applications where the
quality of the input data can not be guaranteed. This is a port of the
algorithm described by Bill Winkler of the US Bureau of the Census."

I wrote it and use it for name and address scrubbing...

--
John
Life is complex. It has real and imaginary parts
.



Relevant Pages

  • Re: Loose comparison
    ... >> We collect vehicle registration numbers eg V5LTG N806FCH etc from ... >> and manually enter them into the database. ... The top hard copy is sent by the client ... > matching in data scrubbing and other lookup applications where the ...
    (borland.public.delphi.thirdpartytools.general)
  • Re: Loose comparison
    ... best regards ... > We collect vehicle registration numbers eg V5LTG N806FCH etc from tickets and manually enter them into the database. ... The top hard copy is sent by the client the carbon copy is sent by our staff ...
    (borland.public.delphi.thirdpartytools.general)
  • Loose comparison
    ... We collect vehicle registration numbers eg V5LTG N806FCH etc from tickets ... and manually enter them into the database. ... The problem is we are ending up with duplicate entries because some ... The top hard copy is sent by the client the ...
    (borland.public.delphi.thirdpartytools.general)
  • Re: Opinions needed about the best "Middleware suite" kbmMW vs. RODA
    ... kbmMW supports cross db in such way that all you need to do in your application is to set one property to switch to ... What one have to concentrate about is minimizing the amount of data moved from the app server to the client. ... C/S setup's usually have a quite active chatter going on between the client and the database, ...
    (borland.public.delphi.thirdpartytools.general)
  • Re: Access 2003 Macro Security Modification
    ... the desired way to prevent the macro security warning is to ... digitally sign your database. ... Your database fileon the client computer. ...
    (microsoft.public.access.modulesdaovba)