Abstract
Recently data mining applications require a large amount of high-dimensional data. However, most clustering methods for data mining do not work efficiently for dealing with large, high-dimensional data because of the so-called 'curse of dimensionality'[1] and the limitation of available memory. In this paper, we propose a new cell-based clustering method which is more efficient for large, high-dimensional data than the existing clustering methods. Our clustering method provides an efficient cell creation algorithm using a space-partitioning technique and uses a filtering-based index structure using an approximation technique. Finally, we compare the performance of our cell-based clustering method with the CLIQUE method in terms of cluster construction time, precision, and retrieval time. The experimental results show that our clustering method achieves better performance on cluster construction time and retrieval time.
| Original language | English |
|---|---|
| Pages | 503-507 |
| Number of pages | 5 |
| DOIs | |
| State | Published - 2002 |
| Event | Applied Computing 2002: Proceeedings of the 2002 ACM Symposium on Applied Computing - Madrid, Spain Duration: 2002.03.11 → 2002.03.14 |
Conference
| Conference | Applied Computing 2002: Proceeedings of the 2002 ACM Symposium on Applied Computing |
|---|---|
| Country/Territory | Spain |
| City | Madrid |
| Period | 02.03.11 → 02.03.14 |
Keywords
- Cell-based clustering
- Data mining
- Filtering-based index structure
- High dimensional data
Quacquarelli Symonds(QS) Subject Topics
- Computer Science & Information Systems
Fingerprint
Dive into the research topics of 'A new cell-based clustering method for large, high-dimensional data in data mining applications'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver