Proceedings Article10.1109/ISISE.2009.115
GAKC: A New GA-Based k Clustering Algorithm
Li Xiaohong,Luo Min +1 more
- 26 Dec 2009
- pp 334-338
3
TL;DR: This work proposes an effective GAKC algorithm by using a genetic algorithm to search for the optimal cluster result of spectral clustering, which uses group number coding chromosome, a new uniform crossover operator and exponential mutation rate.
read more
Abstract: Clustering is an important, hard and active topic in data analysis and pattern recognition. K clustering is a branch of data clustering where the number of clusters is know in advance. Recently, spectral clustering (SC) becomes one of the most popular and appealing k clustering methods because of its generality, efficiency and its rich theoretical foundation. But the final results obtained from SCs depend on spectral relaxation which may have no guarantee on the quality of the solution. In order to overcome the SCs’ shortcoming, we propose an effective GAKC algorithm by using a genetic algorithm to search for the optimal cluster result of SCs. The algorithm uses group number coding chromosome, a new uniform crossover operator and exponential mutation rate. To verify the effectiveness of GAKC, a comparison among the experimental results of the proposed GAKC, a classical GA-based method by Ujjwal Malulik and the SC methods by SM and NJW on a real-life data set is presented. The conclusion comes that the proposed algorithm can gain much more accurate clustering result.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Incremental Cluster Detection using a Soft Computing Approach
TL;DR: An incremental algorithm, IPYRAMID: Incremental Parallel hYbrid clusteRing using genetic progrAmming and Multiobjective fItness with Density employs a combination of data parallelism, genetic programming, special operators, and multi-objective density-based incremental fitness function that helps to handle outliers.
Evolutionary computing based hybrid bisecting clustering algorithm for multidimensional data
TL;DR: An improved Canonical GA based Bisecting K-Means algorithm (CGABC) has been developed, which has exhibited optimal solution for highly accurate and efficient clustering with high dimensional data sets.
3
A pragmatic approach for multidimensional data clustering
K Aparna,Mydhili K. Nair +1 more
- 01 Jul 2017
TL;DR: By incorporating the optimized cluster center information, the Bisecting K-Means clustering has been performed, which has exhibited optimal solution for highly accurate and efficient clustering with high dimensional data sets.
2
References
Genetic algorithms in search, optimization and machine learning
David E. Goldberg
- 01 Jan 1989
TL;DR: This book brings together the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields.
58.6K
•Book
Genetic algorithms in search, optimization, and machine learning
David E. Goldberg
- 01 Sep 1988
TL;DR: In this article, the authors present the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields, including computer programming and mathematics.
•Book
Genetic Algorithms
David E. Goldberg,William Shakespeare +1 more
- 01 Jan 2002
TL;DR: The present work expresses the problem as a multi-objective optimization problem and a methodology has been proposed based on multi-objective genetic algo-rithm (MOGA) that exploits the effectiveness of MOGA for searching global optimal solutions in selecting an appropriate image enhancement operator.
17.1K
Normalized cuts and image segmentation
Jianbo Shi,Jitendra Malik +1 more
TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups as well as the total similarity within the groups.
Data clustering: a review
TL;DR: An overview of pattern clustering methods from a statistical pattern recognition perspective is presented, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners.