1. What are the contributions in "Mining sequential patterns by pattern-growth: the prefixspan approach" ?
However, it is also a difficult problem since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences.. Most of the previously developed sequential pattern mining methods, such as GSP, explore a candidate generation-and-test approach [ 1 ] to reduce the number of candidates to be examined.. In this paper, the authors propose a projection-based, sequential pattern-growth approach for efficient mining of sequential patterns.. In this approach, a sequence database is recursively projected into a set of smaller projected databases, and sequential patterns are grown in each projected database by exploring only locally frequent fragments.. Based on an initial study of the pattern growth-based sequential pattern mining, FreeSpan [ 8 ], the authors propose a more efficient method, called PSP, which offers ordered growth and reduced projected databases.. To further improve the performance, a pseudoprojection technique is developed in PrefixSpan.. A comprehensive performance study shows that PrefixSpan, in most cases, outperforms the a priori-based algorithm GSP, FreeSpan, and SPADE [ 29 ] ( a sequential pattern mining algorithm that adopts vertical data format ), and PrefixSpan integrated with pseudoprojection is the fastest among all the tested algorithms.. Furthermore, this mining methodology can be extended to mining sequential patterns with user-specified constraints.. The high promise of the pattern-growth approach may lead to its further extension toward efficient mining of other kinds of frequent patterns, such as frequent substructures.
read more
2. What are the future works in "Mining sequential patterns by pattern-growth: the prefixspan approach" ?
Here, the authors illustrate a few problems that need further research.
read more
3. What is the a priori-based method for generating a large set of candidate?
Since theset of candidate sequences includes all the possiblepermutations of the elements and repetition of itemsin a sequence, the a priori-based method maygenerate a really large set of candidate sequenceseven for a moderate seed set.
read more
4. What is the basic evaluation framework for sequential patterns?
the basic evaluation framework for sequential patterns is still based on GSP [23], a typical candidate generation-andtest approach.
read more





