Rabin–Karp algorithm

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.1137/0206024•

Fast Pattern Matching in Strings

[...]

Donald E. Knuth, James Morris, Vaughan R. Pratt

01 Jun 1977-SIAM Journal on Computing

TL;DR: An algorithm is presented which finds all occurrences of one given string within another, in running time proportional to the sum of the lengths of the strings, showing that the set of concatenations of even palindromes, i.e., the language $\{\alpha \alpha ^R\}^*$, can be recognized in linear time.

...read moreread less

Abstract: An algorithm is presented which finds all occurrences of one given string within another, in running time proportional to the sum of the lengths of the strings. The constant of proportionality is low enough to make this algorithm of practical use, and the procedure can also be extended to deal with some more general pattern-matching problems. A theoretical application of the algorithm shows that the set of concatenations of even palindromes, i.e., the language $\{\alpha \alpha ^R\}^*$, can be recognized in linear time. Other algorithms which run even faster on the average are also considered.

...read moreread less

3,461 citations

Journal Article•10.1145/360825.360855•

Efficient string matching: an aid to bibliographic search

[...]

Alfred V. Aho¹, Margaret J. Corasick¹•Institutions (1)

Bell Labs¹

01 Jun 1975-Communications of The ACM

TL;DR: A simple, efficient algorithm to locate all occurrences of any of a finite number of keywords in a string of text that has been used to improve the speed of a library bibliographic search program by a factor of 5 to 10.

...read moreread less

Abstract: This paper describes a simple, efficient algorithm to locate all occurrences of any of a finite number of keywords in a string of text. The algorithm consists of constructing a finite state pattern matching machine from the keywords and then using the pattern matching machine to process the text string in a single pass. Construction of the pattern matching machine takes time proportional to the sum of the lengths of the keywords. The number of state transitions made by the pattern matching machine in processing the text string is independent of the number of keywords. The algorithm has been used to improve the speed of a library bibliographic search program by a factor of 5 to 10.

...read moreread less

3,450 citations

Journal Article•10.1145/359842.359859•

A fast string searching algorithm

[...]

Robert S. Boyer¹, J. Strother Moore²•Institutions (2)

SRI International¹, PARC²

01 Oct 1977-Communications of The ACM

TL;DR: The algorithm has the unusual property that, in most cases, not all of the first i.” in another string, are inspected.

...read moreread less

Abstract: An algorithm is presented that searches for the location, “il” of the first occurrence of a character string, “pat,” in another string, “string.” During the search operation, the characters of pat are matched starting with the last character of pat. The information gained by starting the match at the end of the pattern often allows the algorithm to proceed in large jumps through the text being searched. Thus the algorithm has the unusual property that, in most cases, not all of the first i characters of string are inspected. The number of characters actually inspected (on the average) decreases as a function of the length of pat. For a random English pattern of length 5, the algorithm will typically inspect i/4 characters of string before finding a match at i. Furthermore, the algorithm has been implemented so that (on the average) fewer than i + patlen machine instructions are executed. These conclusions are supported with empirical evidence and a theoretical analysis of the average behavior of the algorithm. The worst case behavior of the algorithm is linear in i + patlen, assuming the availability of array space for tables linear in patlen plus the size of the alphabet.

...read moreread less

2,710 citations

Journal Article•10.1147/RD.312.0249•

Efficient randomized pattern-matching algorithms

[...]

Richard M. Karp¹, Michael O. Rabin²•Institutions (2)

University of California, Berkeley¹, Harvard University²

01 Mar 1987-Ibm Journal of Research and Development

TL;DR: In this article, the first occurrence of a string X as a consecutive block within a text Y is found by using a randomized algorithm. But the algorithm requires a constant number of storage locations, and essentially runs in real time.

...read moreread less

Abstract: We present randomized algorithms to solve the following string-matching problem and some of its generalizations: Given a string X of length n (the pattern) and a string Y (the text), find the first occurrence of X as a consecutive block within Y. The algorithms represent strings of length n by much shorter strings called fingerprints, and achieve their efficiency by manipulating fingerprints instead of longer strings. The algorithms require a constant number of storage locations, and essentially run in real time. They are conceptually simple and easy to implement. The method readily generalizes to higher-dimensional patternmatching problems.

...read moreread less

1,547 citations

Journal Article•10.1016/0022-0000(85)90014-5•

A linear-time algorithm for a special case of disjoint set union

[...]

Harold N. Gabow¹, Robert E. Tarjan²•Institutions (2)

University of Colorado Boulder¹, AT&T²

01 Apr 1985-Journal of Computer and System Sciences

TL;DR: A linear-time algorithm for the special case of the disjoint set union problem in which the structure of the unions (defined by a “union tree”) is known in advance that is useful in finding maximum cardinality matchings in nonbipartite graphs.

...read moreread less

764 citations

...

Expand

Year	Papers
2021	2
2020	8
2019	10
2018	12
2017	14
2016	14

Topic Tools

Papers published on a yearly basis

Papers

Fast Pattern Matching in Strings

Efficient string matching: an aid to bibliographic search

A fast string searching algorithm

Efficient randomized pattern-matching algorithms

A linear-time algorithm for a special case of disjoint set union

Related Topics (5)

Performance Metrics