Abstract
Scalability of clustering algorithms is a critical issue facing the data mining community. One method to handle this issue is to use only a subset of all instances. This paper develops an optimization-based approach to the partitional clustering problem using an algorithm specifically designed for noisy performance, which is a problem that arises when using a subset of instances. Numerical results show that computation time can be dramatically reduced by using a partial set of instances without sacrificing solution quality. In addition, these results are more persuasive as the size of the problem is larger.Journal of the Operational Research Society (2009) 60, 1069-1084. doi:10.1057/jors.2008.195; published online 8 April 2009.
| Original language | English |
|---|---|
| Pages (from-to) | 1069-1084 |
| Number of pages | 16 |
| Journal | Journal of the Operational Research Society |
| Volume | 60 |
| Issue number | 8 |
| DOIs | |
| State | Published - 2009.08 |
Keywords
- Optimization-based partitional clustering
- Partitioning
- Scalability
Quacquarelli Symonds(QS) Subject Topics
- Business & Management Studies
- Mathematics
- Statistics & Operational Research
- Data Science
Fingerprint
Dive into the research topics of 'An optimization approach to partitional data clustering'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver