Re: Shorter checksum than MD5
From: Elbert Lev (elbertlev_at_hotmail.com)
Date: 09/10/04
- Next message: Jamey Saunders: "Re: Changing state of buttons."
- Previous message: Colin J. Williams: "Re: PEP 335: Overloadable Boolean Operators - Official Posting"
- In reply to: Jeff Epler: "Re: Shorter checksum than MD5"
- Next in thread: Paul Rubin: "Re: Shorter checksum than MD5"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Date: 10 Sep 2004 05:27:30 -0700
> For your application, you should consider the total number of records
> you ever expect to have, and use more than 2 * lg(records) bits of hash.
> Due to the so-called "birthday paradox", when you have N possible hash
> values, two will be identical with 50% probability with around sqrt(N)
> items. You'd probably prefer that the probability be much lower in your
> application, since a collision will result in incorrect results.
>
Wrong! "birthday paradox" is not applicable here.
If you want an analogy with this combinatorial problem, imagine 2 rows with N
objects in each, There exists a "measure" of each object.
Some objects can be modified with probability 1/2**32
the measure will not change after modification.
Objects in the SAME POSITION in each row are compared by comparing their measures.
After M objects are modified what is the probability that
at least one modification will be "missed" by the comparison process.
I don't think, that in the foreseen future (if M and N are not too high)
such collision will occur.
- Next message: Jamey Saunders: "Re: Changing state of buttons."
- Previous message: Colin J. Williams: "Re: PEP 335: Overloadable Boolean Operators - Official Posting"
- In reply to: Jeff Epler: "Re: Shorter checksum than MD5"
- Next in thread: Paul Rubin: "Re: Shorter checksum than MD5"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]