Optimal Clustering Algorithms for Data Mining
TL;DR: SVC is better than the k-mean, fu zzy c-mean and SOM, because; it doesn't depend on either number or shape of clusters, and it dealing with outlier and overlapping, where; the practical total time improvement support vector clustering (iSVC) labeling method isbetter than the other methods that improve SVC.
read more
Abstract: Data mining is the process used to analyze a large quantity of heterogeneous data to extract useful informat ion. Meanwhile, many data min ing techniques are used; clustering classified to be an important technique, used to divide data into several groups called, clusters. Those clusters contain, objects that are homogeneous in one cluster, and different fro m other clusters. As a reason of the dependence of many applications on clustering techniques, while there is no combined method for clustering; this study compares k- mean, Fu zzy c-mean, self-organizing map (SOM ), and support vector clustering (SVC); to show how those algorith ms solve clustering problems, and then; compares the new methods of clustering (SVC) with the traditional clustering methods (K-mean, fuzzy c-mean and SOM). The main findings show that SVC is better than the k-mean, fu zzy c-mean and SOM, because; it doesn't depend on either number or shape of clusters, and it dealing with outlier and overlapping. Finally; this paper show that; the enhancement using the gradient decent, and the proximity g raph, imp roves the support vector clustering time by decreasing its computational complexity to O(n logn) instead of O(n2d), where; the practical total time fo r improvement support vector clustering (iSVC) labeling method is better than the other methods that improve SVC.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Toward successful agile requirements change management process in global software development: a client–vendor analysis
TL;DR: The findings of this study provide a robust framework to assist GSD firms in implementing ARCM activities and reveal that human resource management is the most significant knowledge area amongst the investigated factors.
39
Data Cleaning In Data Warehouse: A Survey of Data Pre-processing Techniques and Tools
TL;DR: To determine which technique of preprocessing is best in what scenario to improve the performance of Data Warehouse is main goal of this paper and can help in data auditing and pattern detection in the data.
Clustering of Faculty by Evaluating their Appraisal Performance by using Feed Forward Neural Network Approach
TL;DR: An approach was given with an approach to solve the problem of inability in grouping of staff members by considering possible optimum soft computing technique that includes Feed Forward Neural Network approach.
E-Mail Spam Detection Using Refined MLP with Feature Selection
Harjot Kaur,Er. Prince Verma +1 more
TL;DR: The process of filtering the emails into spam and ham using various techniques is discussed, which involves learning and non-machine learning techniques.
Comparative Weka Analysis of Clustering Algorithm‘s
Harjot Kaur,Prince Verma +1 more
TL;DR: Clustering is an unsupervised technique that is fairly applicable on large datasets with a large number of attributes that gives a concise view of data.
References
Data clustering: a review
TL;DR: An overview of pattern clustering methods from a statistical pattern recognition perspective is presented, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners.
Data clustering: 50 years beyond K-means
Anil K. Jain
- 01 Jun 2010
TL;DR: A brief overview of clustering is provided, well known clustering methods are summarized, the major challenges and key issues in designing clustering algorithms are discussed, and some of the emerging and useful research directions are pointed out.
8.4K
Data Clustering: 50 Years Beyond K-means
Anil K. Jain
- 15 Sep 2008
TL;DR: Cluster analysis as mentioned in this paper is the formal study of algorithms and methods for grouping objects according to measured or perceived intrinsic characteristics, which is one of the most fundamental modes of understanding and learning.
Support vector clustering
TL;DR: In this paper, a Gaussian kernel based clustering method using support vector machines (SVM) is proposed to find the minimal enclosing sphere, which can separate into several components, each enclosing a separate cluster of points.