Re: Computational complexity of cluster detection
- From: Chris Smith <cdsmith@xxxxxxx>
- Date: Sat, 28 Apr 2007 22:04:56 -0600
vincent64@xxxxxxxxx <datashaping@xxxxxxxxx> wrote:
It also suggests that the
computational complexity of computing whether a data set contains one
of more clusters is well below O(n), possibly O(n0.5), if one uses
sampling techniques.
My first inclination is to respond that it's misleading to claim an
asymptotic complexity for solving a problem before the problem has been
sufficiently specified. There are many, many problems for which solving
something close to the problem is much easier than solving the problem
itself. That's only valuable if one can say more about in what ways the
answer is "close", and what that means in terms of the problem one
intends to solve.
If I were you and wanted to proceed with your idea, I'd do the
following:
* Survey how statisticians define "clusters" of data points and find a
reasonable, common, and formalized mathematical characterization of
clusters.
* Investigate how your sample-based algorithms relate to the probability
of having identified a cluster for various kinds of random data
distributions for the complete data set.
* Be sure to pay attention to worst-case data; how would one design
a data set to fool your algorithm, and why is data that looks like
that going to be likely or unlikely in practical situations?
* State the results in a way that involves specific verifiable or
provable conclusions about well-defined situations. If you can make
a valid statement about even a very limited but well-specified model,
then you have something. If you can only say things that start with
"In general, ..." or similar, then you don't.
--
Chris Smith
.
- References:
- Computational complexity of cluster detection
- From: vincent64@xxxxxxxxx
- Computational complexity of cluster detection
- Prev by Date: Computational complexity of cluster detection
- Next by Date: What is the language of this grammar?
- Previous by thread: Computational complexity of cluster detection
- Next by thread: What is the language of this grammar?
- Index(es):
Relevant Pages
|
Loading