Proceedings Article10.1109/ICCET.2009.199
An Efficient Multi-Patterns Parameterized String Matching Algorithm with Super Alphabet
Rajesh Prasad,Suneeta Agarwal +1 more
- 22 Jan 2009
- Vol. 1, pp 536-540
4
TL;DR: This paper extends single pattern exact shift-or string matching algorithm to find all parameterized occurrences of multiple patterns P0, P1, P2 …Pr-1, (r³1), each of equal size m, in the text T, and extends MPSO by using the concept of super alphabets.
read more
Abstract: In the parameterized string matching, a given pattern P is said to match with a sub-string t of the text T, if there exist a bijection from the symbols of P to the symbols of t. This problem has an important application in software maintenance where it is required to find equivalency between two sections of codes. Two sections of codes are said to be equivalent if one can be transformed into the other by renaming identifiers and variables only. In this paper, we extend single pattern exact shift-or string matching algorithm to find all parameterized occurrences of multiple patterns P0, P1, P2 …Pr-1, (r³1), each of equal size m , in the text T. The set of r multiple patterns is being handled by using the concept of classes of characters. The new algorithm is named as multi-pattern parameterized shift-or (MPSO) string matching algorithm. We further extend MPSO by using the concept of super alphabets. Implementation results show that by using a super alphabet of size s, the algorithm (MPSO) is speeded-up by a factor of s, where s is the size of the super alphabet (i.e. s is the number of characters processed simultaneously). By using multi-pattern parameterized string matching, the search time is lesser than individual pattern searching in the text. We also show the performance of super alphabet MPSO with respect to duplicity present in the code. However these algorithms are applicable only when pattern length (m) is less than or equal to word length (w) of computer used (i.e. m≤w).
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Patent
Interference filtration matching algorithm based on dynamic partitioning and semantic weighting
Zhu Yongqiang,Jiang Xue +1 more
- 02 Oct 2013
TL;DR: In this paper, an interference filtration matching algorithm based on dynamic partitioning and semantic weighting is proposed, which comprises an interference filtering module used for dynamically generating a corresponding interference filter according to a current keyword and character encoding mode.
6
An Efficient Bit-Parallel Multi-Patterns Word Searching Algorithm through Splitting the Text
Ishadutta Yadav,Bharat Singh,Suneeta Agarwal,Rajesh Prasad +3 more
- 27 Oct 2009
TL;DR: Experimental results show that MPWSA algorithm is much faster than the previously proposed WSA algorithm and applies the shift-or algorithm to find the words of same length in the text T.
3
An Advance Approach for Spam Document Detection Using QAP Rabin-Karp Algorithm
Nidhi Ruthia,Abhigyan Tiwary +1 more
- 01 Jan 2020
TL;DR: A novel QAP-based Rabin-Karp algorithm is proposed which is a combination of score computation using QAP functions and finally similarity measure computation using Rabin, Karp algorithm and shows the effectiveness of proposed approach.
Plagiarism detection on bigdata using modified map-reduced based SCAM algorithm
Jayshree Dwivedi,Abhigyan Tiwary +1 more
- 01 Feb 2017
TL;DR: A SCAM (Standard Copy Analysis Mechanism) plagiarism detection algorithm which calculates relative measure to detect overlap by making comparison on asset of words that are common between test document and registered document and implemented Map-Reduce based SCAM algorithm for processing big data using Hadoop and detect plagiarism in big data.
References
Efficient string matching: an aid to bibliographic search
TL;DR: A simple, efficient algorithm to locate all occurrences of any of a finite number of keywords in a string of text that has been used to improve the speed of a library bibliographic search program by a factor of 5 to 10.
A fast string searching algorithm
TL;DR: The algorithm has the unusual property that, in most cases, not all of the first i .” in another string, are inspected.
A new approach to text searching
TL;DR: A family of simple and fast algorithms for solving the classical string matching problem, string matching with don't care symbols and complement symbols, and multiple patterns are introduced.
689
A new approach to text searching
Ricardo Baeza-Yates,Gaston H. Gonnet +1 more
- 01 May 1989
TL;DR: A family of simple and fast algorithms for solving the classical string matching problem, string matching with don't care symbols and complement symbols, and multiple patterns are introduced.
398
Fast and flexible string matching by combining bit-parallelism and suffix automata
Gonzalo Navarro,Mathieu Raffinot +1 more
TL;DR: A new automaton to recognize suffixes of patterns with classes of characters is introduced, which seems very adequate for computational biology applications, since it is the fastest algorithm to search on DNA sequences and flexible searching is an important problem in that area.
195