Analysing microarray expression data through effective clustering

doi:10.1016/J.INS.2013.12.003

Journal Article

Elio Masciari, +2 more

- 01 Mar 2014

- Information Sciences

- Vol. 262, pp 32-45

22

TL;DR: A clustering algorithm called M-CLUBS (Microarray data CLustering Using Binary Splitting) is proposed; it exhibits higher accuracy than the hierarchical algorithms proposed so far while computing faster than partition-based approaches.

Citations

•Journal Article•10.4137/BBI.S38316

Clustering Algorithms: Their Application to Gene Expression Data

Jelili Oyelade, +7 more

- 30 Nov 2016

- Bioinformatics and Biology Insights

TL;DR: This review examines the clustering algorithms applicable to gene expression data in order to identify the appropriate clustering technique, one that will guarantee stability and a high degree of accuracy in the analysis procedure.

211

Journal Article•10.1016/J.INS.2014.05.030

Feature subset selection by gravitational search algorithm optimization

Xiaohong Han, +6 more

- 01 Oct 2014

- Information Sciences

TL;DR: The experimental results show that the proposed FSS-MGSA selects the discriminating input features correctly and achieves high classification accuracy, comparable to or better than well-known similar classifier systems.

89

•Journal Article•10.1016/J.INS.2019.04.039

Multi-view cluster analysis with incomplete data to understand treatment effects

Guoqing Chao, +6 more

- 01 Aug 2019

- Information Sciences

TL;DR: This work proposes an enhanced formulation for a family of multi-view co-clustering methods to cope with the missing data problem by introducing an indicator matrix whose elements indicate which data entries are observed and assessing cluster validity only on observed entries.

68

Journal Article•10.1016/J.FUTURE.2019.07.077

Fast and effective Big Data exploration by clustering

Michele Ianni, +4 more

- 01 Jan 2020

- Future Generation Computer Systems

TL;DR: By using four stages of successive refinements, CLUBS+ delivers high-quality clusters of data grouped around their centroids, working in a totally unsupervised fashion.

54

Journal Article•10.1016/J.INS.2017.03.002

A fast and accurate algorithm for unsupervised clustering around centroids

Giuseppe M. Mazzeo, +2 more

- 01 Aug 2017

- Information Sciences

TL;DR: Results confirm that the new algorithm is fast, impervious to noise, and produces results of better quality than other algorithms, such as BOOL, BIRCH, and k-means++, even when the analyst can determine the correct number of clusters.

37

References

•Book

Data Mining: Concepts and Techniques

Jiawei Han, +2 more

- 08 Sep 2000

TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.

29.9K

Some methods for classification and analysis of multivariate observations

James B. MacQueen

- 01 Jan 1967

TL;DR: The k-means algorithm described in this paper partitions an N-dimensional population into k sets on the basis of a sample; it generalizes the ordinary sample mean and is shown to give partitions that are reasonably efficient in the sense of within-class variance.

28.1K

Biometry: The principles and practice of statistics in biological research

Robert R. Sokal, +3 more

- 01 Jan 1995

TL;DR: In this paper, the authors present a model for the analysis of variance in single-classification, two-way, and multiway analysis of variance with the assumption of correlation.

23.4K

•Book

Biometry: The Principles and Practice of Statistics in Biological Research

Robert R. Sokal, +1 more

- 01 Jan 1969

TL;DR: In this paper, the authors present a model for the analysis of variance in single-classification, two-way, and multiway analysis of variance with the assumption of correlation.

21.3K

•Proceedings Article

A density-based algorithm for discovering clusters in large spatial databases with noise

Martin Ester, +3 more

- 02 Aug 1996

TL;DR: In this paper, a density-based notion of clusters is proposed to discover clusters of arbitrary shape, which can be used for class identification in large spatial databases and is shown to be more efficient than the well-known algorithm CLARANS.

20.3K

Related Papers (5)

Parallel boosted clustering

Robust deep k-means: An effective and simple method for data clustering

Learning to Link

Scalable Support Vector Clustering Using Budget

A distance-type-insensitive clustering approach