A guided tour to approximate string matching

doi:10.1145/375360.375365

Journal Article10.1145/375360.375365

A guided tour to approximate string matching

Gonzalo Navarro

- 01 Mar 2001

- ACM Computing Surveys

- Vol. 33, Iss: 1, pp 31-88

3K

TL;DR: This work surveys the current techniques to cope with the problem of string matching that allows errors, and focuses on online searching and mostly on edit distance, explaining the problem and its relevance, its statistical behavior, its history and current developments, and the central ideas of the algorithms.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book

Supervised Sequence Labelling with Recurrent Neural Networks

Alex Graves

- 09 Feb 2012

TL;DR: A new type of output layer that allows recurrent networks to be trained directly for sequence labelling tasks where the alignment between the inputs and the labels is unknown, and an extension of the long short-term memory network architecture to multidimensional data, such as images and video sequences.

...read moreread less

3.1K

•Journal Article•10.1109/TKDE.2007.250581

Duplicate Record Detection: A Survey

Elmagarmid, +2 more

- 01 Jan 2007

- IEEE Transactions on Knowledge and Data ...

TL;DR: This paper presents an extensive set of duplicate detection algorithms that can detect approximately duplicate records in a database and covers similarity metrics that are commonly used to detect similar field entries.

...read moreread less

2.1K

•Journal Article•10.1109/TKDE.2007.9

Duplicate Record Detection: A Survey

Ahmed K. Elmagarmid, +2 more

- 01 Jan 2007

- IEEE Transactions on Knowledge and Data ...

TL;DR: This paper presents an extensive set of duplicate detection algorithms that can detect approximately duplicate records in a database and covers similarity metrics that are commonly used to detect similar field entries.

...read moreread less

1.6K

•Journal Article•10.1007/S00778-006-0004-3

Efficient query evaluation on probabilistic databases

Nilesh Dalvi, +1 more

- 31 Aug 2004

TL;DR: It is shown that the data complexity of some queries is #P-complete, which implies that these queries do not admit any efficient evaluation methods, and an optimization algorithm is described that can compute efficiently most queries.

...read moreread less

1.2K

[서평]「Algorithms on Strings, Trees, and Sequences」

김동규

- 01 Mar 2000

1.1K

...

Expand

References

Journal Article•10.1016/S0022-2836(05)80360-2

Basic Local Alignment Search Tool

Stephen F. Altschul, +4 more

- 01 Oct 1990

- Journal of Molecular Biology

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.

...read moreread less

98.8K

Lecture Notes in Computer Science 2382

Petrus Bollen

- 01 Jan 2002

36.7K

•Book

Introduction to Algorithms

Thomas H. Cormen, +2 more

- 01 Jan 1990

TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.

...read moreread less

24.8K

Journal Article•10.1007/BF02837777

Introduction to algorithms: 4. Turtle graphics

R. K. Shyamasundar

- 01 Sep 1996

- Resonance

TL;DR: In this article, a language similar to logo is used to draw geometric pictures using this language and programs are developed to draw geometrical pictures using it, which is similar to the one we use in this paper.

...read moreread less

15.4K

•Book

Introduction to Automata Theory, Languages, and Computation

John E. Hopcroft, +3 more

- 01 Jan 1979

TL;DR: This book is a rigorous exposition of formal languages and models of computation, with an introduction to computational complexity, appropriate for upper-level computer science undergraduates who are comfortable with mathematical arguments.

...read moreread less

14.5K

...

Expand

A guided tour to approximate string matching

Chat with Paper

AI Agents for this Paper

Citations

Supervised Sequence Labelling with Recurrent Neural Networks

Duplicate Record Detection: A Survey

Duplicate Record Detection: A Survey

Efficient query evaluation on probabilistic databases

[서평]「Algorithms on Strings, Trees, and Sequences」

References

Basic Local Alignment Search Tool

Lecture Notes in Computer Science 2382

Introduction to Algorithms

Introduction to algorithms: 4. Turtle graphics

Introduction to Automata Theory, Languages, and Computation

Related Papers (5)

Binary codes capable of correcting deletions, insertions, and reversals

The String-to-String Correction Problem

A general method applicable to the search for similarities in the amino acid sequence of two proteins

Basic Local Alignment Search Tool

Identification of common molecular subsequences.