Incremental Cluster Detection using a Soft Computing Approach
TL;DR: An incremental algorithm, IPYRAMID: Incremental Parallel hYbrid clusteRing using genetic progrAmming and Multiobjective fItness with Density employs a combination of data parallelism, genetic programming, special operators, and multi-objective density-based incremental fitness function that helps to handle outliers.
read more
Abstract: Clustering is the process of locating patterns in large data sets. As databases continue to grow in size, efficient and effective clustering algorithms play a paramount role in data mining applications. Traditional clustering approaches usually analyze static datasets in which objects are kept unchanged after being processed, but many practical datasets are dynamically modified which means some previously learned patterns have to be updated accordingly. Re-clustering the whole dataset from scratch is not a good choice due to the frequent data modifications and the limited out-of-service time, so the development of incremental clustering approaches is highly desirable. In this paper, we propose an incremental algorithm, IPYRAMID: Incremental Parallel hYbrid clusteRing using genetic progrAmming and Multiobjective fItness with Density employs a combination of data parallelism, genetic programming (GP), special operators, and multi-objective density-based incremental fitness function. Although many incremental clustering algorithms have been proposed which can handle insertion of new record properly using incremental approach but cannot handle deletion of record properly. This issue is resolved in the proposed algorithm and density based incremental fitness function that helps to handle outliers. Use of parallelism increases the speed of execution as well as identifies clusters of arbitrary shapes. The incremental merge engine can dynamically determine the number of clusters. Preliminary experimental results show that it can increase the efficiency of clustering process.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Evolve systems using incremental clustering approach
Parag Kulkarni,Preeti Mulay +1 more
TL;DR: This work introduces a new paradigm for machine learning and propose new framework for incremental clustering and builds the platform for dynamic and incremental learning.
31
•Journal Article
Incremental generalization for mining in a data warehousing environment
Martin Ester,Rüdiger Wittmann +1 more
TL;DR: In this paper, the authors present algorithms for incremental attribute-oriented generalization with the conflicting goals of good efficiency and minimal overly generalization, which is a common method for the task of summarization.
14
Dynamic clustering with soft computing
Georg Peters,Richard Weber +1 more
TL;DR: In the past decades, very promising approaches to enrich and extend classic static clustering algorithms by dynamic derivatives are suggested; some selected ones will be introduced in this review.
14
References
Chameleon: hierarchical clustering using dynamic modeling
TL;DR: Chameleon's key feature is that it accounts for both interconnectivity and closeness in identifying the most similar pair of clusters, which is important for dealing with highly variable clusters.
2.3K
•Proceedings Article
Incremental Clustering for Mining in a Data Warehousing Environment
Martin Ester,Hans-Peter Kriegel,Jörg Sander,Michael Wimmer,Xiaowei Xu +4 more
- 24 Aug 1998
TL;DR: It can be proven that the incremental algorithm yields the same result as DBSCAN, which is applicable to any database containing data from a metric space, e.g., to a spatial database or to a WWW-log database.
Maintenance of Discovered Association Rules
Sau Dan Lee,David W. Cheung +1 more
- 01 Jan 2002
TL;DR: This chapter defines the problem of incrementally updating mined association rules and presents efficient technique that has been developed to solve this non-trivial problem.
90
•Journal Article
An Incremental Grid Density-Based Clustering Algorithm
TL;DR: A grid density-based clustering algorithm——GDCA is introduced, which discovers clusters with arbitrary shape in spatial databases, and first partitions the data space into a number of units, and then deals with units instead of points.
51
Related Papers (5)
Tao Li,Sarabjot Singh Anand +1 more
- 15 Dec 2008
Gregory James Hamerly,Charles P. Elkan +1 more
- 01 Jan 2003