Re: removing duplication from a huge list.
- From: bearophileHUGS@xxxxxxxxx
- Date: Fri, 27 Feb 2009 00:58:39 -0800 (PST)
odeits:
How big of a list are we talking about? If the list is so big that the
entire list cannot fit in memory at the same time this approach wont
work e.g. removing duplicate lines from a very large file.
If the data are lines of a file, and keeping the original order isn't
important, then the first to try may be to use the unix (or cygwin on
Windows) commands sort and uniq.
Bye,
bearophile
.
- Follow-Ups:
- Re: removing duplication from a huge list.
- From: Stefan Behnel
- Re: removing duplication from a huge list.
- References:
- Re: removing duplication from a huge list.
- From: Chris Rebert
- Re: removing duplication from a huge list.
- From: odeits
- Re: removing duplication from a huge list.
- Prev by Date: Re: Multiple conditional expression
- Next by Date: Re: Proposed implementation for an Ordered Dictionary
- Previous by thread: Re: removing duplication from a huge list.
- Next by thread: Re: removing duplication from a huge list.
- Index(es):
Relevant Pages
|
Loading