Ward's method

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.1080/01621459.1963.10500845•

Hierarchical Grouping to Optimize an Objective Function

[...]

01 Mar 1963-Journal of the American Statistical Association

TL;DR: In this paper, a procedure for forming hierarchical groups of mutually exclusive subsets, each of which has members that are maximally similar with respect to specified characteristics, is suggested for use in large-scale (n > 100) studies when a precise optimal solution for a specified number of groups is not practical.

...read moreread less

Abstract: A procedure for forming hierarchical groups of mutually exclusive subsets, each of which has members that are maximally similar with respect to specified characteristics, is suggested for use in large-scale (n > 100) studies when a precise optimal solution for a specified number of groups is not practical. Given n sets, this procedure permits their reduction to n − 1 mutually exclusive sets by considering the union of all possible n(n − 1)/2 pairs and selecting a union having a maximal value for the functional relation, or objective function, that reflects the criterion chosen by the investigator. By repeating this process until only one group remains, the complete hierarchical structure and a quantitative estimate of the loss associated with each stage in the grouping can be obtained. A general flowchart helpful in computer programming and a numerical example are included.

...read moreread less

19,865 citations

Journal Article•10.1007/S00357-014-9161-Z•

Ward's Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward's Criterion?

[...]

Fionn Murtagh¹, Pierre Legendre²•Institutions (2)

De Montfort University¹, Université de Montréal²

01 Oct 2014-Journal of Classification

TL;DR: The survey work and case studies will be useful for all those involved in developing software for data analysis using Ward’s hierarchical clustering method.

...read moreread less

Abstract: The Ward error sum of squares hierarchical clustering method has been very widely used since its first description by Ward in a 1963 publication. It has also been generalized in various ways. Two algorithms are found in the literature and software, both announcing that they implement the Ward clustering method. When applied to the same distance matrix, they produce different results. One algorithm preserves Ward's criterion, the other does not. Our survey work and case studies will be useful for all those involved in developing software for data analysis using Ward's hierarchical clustering method.

...read moreread less

3,338 citations

Journal Article•10.1007/S00357-005-0012-9•

Hierarchical Clustering via Joint Between-Within Distances: extending Ward's Minimum Variance Method

[...]

Gábor J. Székely¹, Maria L. Rizzo²•Institutions (2)

Bowling Green State University¹, Ohio University²

01 Sep 2005-Journal of Classification

TL;DR: A hierarchical clustering method that minimizes a joint between-within measure of distance between clusters, by defining a cluster distance and objective function in terms of Euclidean distance, or any power of Euclidesan distance in the interval (0,2).

...read moreread less

Abstract: We propose a hierarchical clustering method that minimizes a joint between-within measure of distance between clusters. This method extends Ward's minimum variance method, by defining a cluster distance and objective function in terms of Euclidean distance, or any power of Euclidean distance in the interval (0,2]. Ward's method is obtained as the special case when the power is 2. The ability of the proposed extension to identify clusters with nearly equal centers is an important advantage over geometric or cluster center methods. The between-within distance statistic determines a clustering method that is ultrametric and space-dilating; and for powers strictly less than 2, determines a consistent test of homogeneity and a consistent clustering procedure. The clustering procedure is applied to three problems: classification of tumors by microarray gene expression data, classification of dermatology diseases by clinical and histopathological attributes, and classification of simulated multivariate normal data.

...read moreread less

725 citations

Journal Article•10.1371/JOURNAL.PONE.0168288•

Generalising Ward's Method for Use with Manhattan Distances.

[...]

Trudie Strauss¹, Michael von Maltitz¹•Institutions (1)

University of the Free State¹

13 Jan 2017-PLOS ONE

TL;DR: This paper argues that the generalisation of Ward’s linkage method to incorporate Manhattan distances is theoretically sound and provides an example of where this method outperforms the method using Euclidean distances.

...read moreread less

Abstract: The claim that Ward's linkage algorithm in hierarchical clustering is limited to use with Euclidean distances is investigated. In this paper, Ward's clustering algorithm is generalised to use with l1 norm or Manhattan distances. We argue that the generalisation of Ward's linkage method to incorporate Manhattan distances is theoretically sound and provide an example of where this method outperforms the method using Euclidean distances. As an application, we perform statistical analyses on languages using methods normally applied to biology and genetic classification. We aim to quantify differences in character traits between languages and use a statistical language signature based on relative bi-gram (sequence of two letters) frequencies to calculate a distance matrix between 32 Indo-European languages. We then use Ward's method of hierarchical clustering to classify the languages, using the Euclidean distance and the Manhattan distance. Results obtained from using the different distance metrics are compared to show that the Ward's algorithm characteristic of minimising intra-cluster variation and maximising inter-cluster variation is not violated when using the Manhattan metric.

...read moreread less

210 citations

Journal Article•10.1016/J.JHYDROL.2005.09.009•

Identification of homogeneous regions for regional frequency analysis using the self-organizing map

[...]

Gwo-Fong Lin¹, Lu-Hsien Chen¹•Institutions (1)

National Taiwan University¹

15 Jun 2006-Journal of Hydrology

TL;DR: The self-organizing map (SOM) is applied to identify the homogeneous regions for regional frequency analysis and shows that the SOM can identify thehomogeneous regions more accurately as compared to the other two clustering methods.

...read moreread less

208 citations

...

Expand

Year	Papers
2021	4
2020	4
2019	7
2018	5
2017	9
2016	6

Topic Tools

Papers published on a yearly basis

Papers

Hierarchical Grouping to Optimize an Objective Function

Ward's Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward's Criterion?

Hierarchical Clustering via Joint Between-Within Distances: extending Ward's Minimum Variance Method

Generalising Ward's Method for Use with Manhattan Distances.

Identification of homogeneous regions for regional frequency analysis using the self-organizing map

Related Topics (5)

Performance Metrics