Similitude

Topic Tools

Papers published on a yearly basis

1 / 2

Papers

Journal Article•10.1109/TIT.1962.1057692•

Visual pattern recognition by moment invariants

[...]

Ming-Kuei Hu¹•Institutions (1)

Syracuse University¹

01 Feb 1962-IEEE Transactions on Information Theory

TL;DR: It is shown that recognition of geometrical patterns and alphabetical characters independently of position, size and orientation can be accomplished and it is indicated that generalization is possible to include invariance with parallel projection.

...read moreread less

Abstract: In this paper a theory of two-dimensional moment invariants for planar geometric figures is presented. A fundamental theorem is established to relate such moment invariants to the well-known algebraic invariants. Complete systems of moment invariants under translation, similitude and orthogonal transformations are derived. Some moment invariants under general two-dimensional linear transformations are also included. Both theoretical formulation and practical models of visual pattern recognition based upon these moment invariants are discussed. A simple simulation program together with its performance are also presented. It is shown that recognition of geometrical patterns and alphabetical characters independently of position, size and orientation can be accomplished. It is also indicated that generalization is possible to include invariance with parallel projection.

...read moreread less

8,568 citations

Journal Article•10.1140/EPJB/E2009-00335-8•

Predicting missing links via local information

[...]

Tao Zhou¹, Tao Zhou², Linyuan Lü², Yi-Cheng Zhang¹, Yi-Cheng Zhang² - Show less +1 more•Institutions (2)

University of Science and Technology of China¹, University of Fribourg²

10 Oct 2009-European Physical Journal B

TL;DR: In this paper, the authors empirically investigate a simple framework of link prediction on the basis of node similarity and propose a new similarity measure, motivated by the resource allocation process taking place on networks, which can remarkably enhance the prediction accuracy.

...read moreread less

Abstract: Missing link prediction in networks is of both theoretical interest and practical significance in modern science. In this paper, we empirically investigate a simple framework of link prediction on the basis of node similarity. We compare nine well-known local similarity measures on six real networks. The results indicate that the simplest measure, namely Common Neighbours, has the best overall performance, and the Adamic-Adar index performs second best. A new similarity measure, motivated by the resource allocation process taking place on networks, is proposed and shown to have higher prediction accuracy than common neighbours. It is found that many links are assigned the same scores if only the information of the nearest neighbours is used. We therefore design another new measure exploiting information on the next nearest neighbours, which can remarkably enhance the prediction accuracy.

...read moreread less

1,688 citations

Journal Article•10.1109/TIT.2005.844059•

Clustering by compression

[...]

Ming Li, Xin Chen¹, Xin Li, Bin Ma, Paul M. B. Vitányi¹ - Show less +1 more•Institutions (1)

University of Amsterdam¹

15 Sep 2003

TL;DR: Evidence of successful application in areas as diverse as genomics, virology, languages, literature, music, handwritten digits, astronomy, and combinations of objects from completely different domains, using statistical, dictionary, and block sorting compressors is reported.

...read moreread less

Abstract: We present a new method for clustering based on compression. The method does not use subject-specific features or background knowledge, and works as follows: First, we determine a parameter-free, universal, similarity distance, the normalized compression distance or NCD, computed from the lengths of compressed data files (singly and in pairwise concatenation). Second, we apply a hierarchical clustering method. The NCD is not restricted to a specific application area, and works across application area boundaries. A theoretical precursor, the normalized information distance, co-developed by one of the authors, is provably optimal. However, the optimality comes at the price of using the noncomputable notion of Kolmogorov complexity. We propose axioms to capture the real-world setting, and show that the NCD approximates optimality. To extract a hierarchy of clusters from the distance matrix, we determine a dendrogram (ternary tree) by a new quartet method and a fast heuristic to implement it. The method is implemented and available as public software, and is robust under choice of different compressors. To substantiate our claims of universality and robustness, we report evidence of successful application in areas as diverse as genomics, virology, languages, literature, music, handwritten digits, astronomy, and combinations of objects from completely different domains, using statistical, dictionary, and block sorting compressors. In genomics, we presented new evidence for major questions in Mammalian evolution, based on whole-mitochondrial genomic analysis: the Eutherian orders and the Marsupionta hypothesis against the Theria hypothesis.

...read moreread less

1,324 citations

Proceedings Article•

Combination of multiple searches

[...]

Edward A. Fox, Joseph A. Shaw

1 Jan 1994

TL;DR: This paper describes one method that has been shown to increase performance by combining the similarity values from five different retrieval runs using both vector space and P-norm extended boolean retrieval methods.

...read moreread less

Abstract: The TREC-2 project at Virginai Tech focused on methods for combining the evidence from multiple retrieval runs to improve performance over any single retrieval method. This paper describes one such method that has been shown to increase performance by combining the similarity values from five different retrieval runs using both vector space and P-norm extended boolean retrieval methods

...read moreread less

1,192 citations

Journal Article•10.1109/TIT.2004.838101•

The similarity metric

[...]

Ming Li¹, Xin Chen², Xin Li³, Bin Ma³, Paul M. B. Vitányi⁴ - Show less +1 more•Institutions (4)

University of Waterloo¹, University of California, Santa Barbara², University of Western Ontario³, University of Amsterdam⁴

01 Dec 2004-IEEE Transactions on Information Theory

TL;DR: In this paper, the authors proposed a new normalized information distance based on the non-computable notion of Kolmogorov complexity, which minorizes every computable distance in the class (that is, it is universal in that it discovers all computable similarities).

...read moreread less

Abstract: A new class of distances appropriate for measuring similarity relations between sequences, say one type of similarity per distance, is studied. We propose a new "normalized information distance," based on the noncomputable notion of Kolmogorov complexity, and show that it is in this class and it minorizes every computable distance in the class (that is, it is universal in that it discovers all computable similarities). We demonstrate that it is a metric and call it the similarity metric . This theory forms the foundation for a new practical tool. To evidence generality and robustness, we give two distinctive applications in widely divergent areas using standard compression programs like gzip and GenCompress. First, we compare whole mitochondrial genomes and infer their evolutionary history. This results in a first completely automatic computed whole mitochondrial phylogeny tree. Secondly, we fully automatically compute the language tree of 52 different languages.

...read moreread less

1,136 citations

...

Expand

Year	Papers
2025	23
2024	111
2023	45
2022	67
2021	70
2020	56

Topic Tools

Papers published on a yearly basis

Papers

Visual pattern recognition by moment invariants

Predicting missing links via local information

Clustering by compression

Combination of multiple searches

The similarity metric

Related Topics (5)

Performance Metrics