A Fast KNN Algorithm for Text Categorization

doi:10.1109/ICMLC.2007.4370742

Proceedings Article10.1109/ICMLC.2007.4370742

A Fast KNN Algorithm for Text Categorization

Yu Wang, +1 more

- 29 Oct 2007

- Vol. 6, pp 3436-3441

71

TL;DR: A method called TFKNN(Tree-Fast-K-Nearest-Neighbor) is presented, which can search the exact k nearest neighbors quickly and the time of similarity computing is decreased largely.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/J.ESWA.2011.08.040

An improved K-nearest-neighbor algorithm for text categorization

Shengyi Jiang, +3 more

- 01 Jan 2012

- Expert Systems With Applications

TL;DR: An improved KNN algorithm is proposed, which builds the classification model by combining constrained one pass clustering algorithm and KNN text categorization, which can reduce the text similarity computation substantially and outperform the-state-of-the-art KNN, Naive Bayes and Support Vector Machine classifiers.

...read moreread less

354

Journal Article•10.4304/JCP.4.3.230-237

An Improved KNN Text Classification Algorithm Based on Clustering

Yong Zhou, +2 more

- 03 Jan 2009

- Journal of Computers

TL;DR: The simulation results show that the algorithm proposed in this paper can not only effectively reduce the actual number of training samples and lower the calculation complexity, but also improve the accuracy of KNN text classification algorithm.

...read moreread less

216

An Improved k-Nearest Neighbor Classification Using Genetic Algorithm

N. Suguna, +1 more

- 01 Jan 2010

TL;DR: In this article, an improved version of KNN is proposed in which GA is combined with KNN to improve its classification performance, instead of considering all the training samples and taking k-neighbors, the GA is employed to take k-NEighbors straightaway and then calculate the distance to classify the test samples.

...read moreread less

214

Proceedings Article•10.1109/ICITECH.2017.8079924

Using KNN algorithm for classification of textual documents

Aiman Moldagulova, +1 more

- 17 May 2017

TL;DR: An approach for building a machine learning system in R that uses K-Nearest Neighbors (KNN) method for the classification of textual documents and challenges the KNN algorithm to find the proper value of k which represents the number of neighbors.

...read moreread less

83

Techniques for text classification: Literature review and current trends

Rajni Jindal, +2 more

- 01 Dec 2015

TL;DR: This paper has studied the existing work in the area of text classification and tried to summarize all existing information in a comprehensive and succinct manner to have a fair evaluation of the progress made in this field till date.

...read moreread less

69

...

Expand

References

Proceedings Article•10.1145/312624.312647

A re-examination of text categorization methods

Yiming Yang, +1 more

- 01 Aug 1999

TL;DR: The results show that SVM, kNN and LLSF signi cantly outperform NNet and NB when the number of positive training instances per category are small, and that all the methods perform comparably when the categories are over 300 instances.

...read moreread less

3K

•Proceedings Article•10.1109/ICDE.1996.492202

Similarity indexing with the SS-tree

David A. White, +1 more

- 26 Feb 1996

TL;DR: This work describes the fundamental types of "similarity queries" that should be supported and proposes a new dynamic structure for similarity indexing called the similarity search tree or SS-tree, which performs better than the R*-tree in nearly every test.

...read moreread less

736

Similarity Indexing with SS-tree

Da White

- 01 Jan 1996

708

•Journal Article•10.1016/J.CELL.2005.04.022

Erratum: Facilitated transport of a Dpp/Scw heterodimer by Sog/Tsg leads to robust patterning of the Drosophila blastoderm embryo (Cell (2005) 120 (873-886) )

Osamu Shimmi, +3 more

- 06 May 2005

- Cell

TL;DR: It is demonstrated mathematically that heterodimer levels can be less sensitive to changes in gene dosage than homodimer levels, thereby providing further selective advantage for using heterodimers as morphogens.

...read moreread less

330

•Proceedings Article•10.1145/564691.564729

Efficient k-NN search on vertically decomposed data

Arjen P. de Vries, +3 more

- 03 Jun 2002

TL;DR: The suggested (physical) database design accommodates well a novel variant of branch-and-bound search, that reduces the high dimensional space quickly to a small candidate set, especially suited for high dimensional spaces.

...read moreread less

108

A Fast KNN Algorithm for Text Categorization

Chat with Paper

AI Agents for this Paper

Citations

An improved K-nearest-neighbor algorithm for text categorization

An Improved KNN Text Classification Algorithm Based on Clustering

An Improved k-Nearest Neighbor Classification Using Genetic Algorithm

Using KNN algorithm for classification of textual documents

Techniques for text classification: Literature review and current trends

References

A re-examination of text categorization methods

Similarity indexing with the SS-tree

Similarity Indexing with SS-tree

Erratum: Facilitated transport of a Dpp/Scw heterodimer by Sog/Tsg leads to robust patterning of the Drosophila blastoderm embryo (Cell (2005) 120 (873-886) )

Efficient k-NN search on vertically decomposed data

Related Papers (5)

Nearest neighbor pattern classification

Discriminatory Analysis - Nonparametric Discrimination: Consistency Properties

The Novel k Nearest Neighbor Algorithm

Using kNN model for automatic text categorization

A simple KNN algorithm for text categorization