Open AccessBook
Text Mining: Predictive Methods for Analyzing Unstructured Information
Sholom M. Weiss,Nitin Indurkhya,Tong Zhang,Fred J. Damerau +3 more
- 25 Oct 2004
714
TL;DR: This book introduces the new world of text mining and examines proven methods for various critical text-mining tasks, such as automated document indexing and information retrieval and search, as well as new research areas that rely on evolving text- mining techniques.
read more
Abstract: The growth of the web can be seen as an expanding public digital library collection. Online digital information extends far beyond the web and its publicly available information. Huge amounts of information are private and are of interest to local communities, such as the records of customers of a business. This information is overwhelmingly text and has its record-keeping purpose, but an automated analysis might be desirable to find patterns in the stored records. Analogous to this data mining is text mining, which also finds patterns and trends in information samples but which does so with far less structured--though with greater immediate utility for users--ingredients. This book focuses on the concepts and methods needed to expand horizons beyond structured, numeric data to automated mining of text samples. It introduces the new world of text mining and examines proven methods for various critical text-mining tasks, such as automated document indexing and information retrieval and search. New research areas are explored, such as information extraction and document summarization, that rely on evolving text-mining techniques.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Data Mining: Concepts and Techniques
G. Thamaraiselvi,A. Kaliammal +1 more
TL;DR: This article explains What is data mining?
4.4K
•Book
Natural Language Processing with Python
Steven Bird,Steven Bird,Ewan Klein,Edward Loper +3 more
- 12 Jun 2009
TL;DR: This book offers a highly accessible introduction to natural language processing, the field that supports a variety of language technologies, from predictive text and email filtering to automatic summarization and translation.
4.3K
Data Mining: Concepts and Techniques (2nd edition)
Jiawei Han,Micheline Kamber +1 more
- 01 Jan 2006
TL;DR: There have been many data mining books published in recent years, including Predictive Data Mining by Weiss and Indurkhya [WI98], Data Mining Solutions: Methods and Tools for Solving Real-World Problems by Westphal and Blaxton [WB98], Mastering Data Mining: The Art and Science of Customer Relationship Management by Berry and Linofi [BL99].
Text Mining Infrastructure in R
TL;DR: The tm package is presented which provides a framework for text mining applications within R and techniques for count-based analysis methods, text clustering, text classification and string kernels are presented.
1.2K
Large-Scale Bayesian Logistic Regression for Text Categorization
TL;DR: In this article, a simple Bayesian logistic regression approach that uses a Laplace prior to avoid overfitting and produces sparse predictive models for text data is presented. But this approach is not suitable for document classification problems.
Related Papers (5)
Jiawei Han,Micheline Kamber,Jian Pei +2 more
- 08 Sep 2000
Christopher D. Manning,Prabhakar Raghavan,Hinrich Schütze +2 more
- 01 Jan 2008
Christopher D. Manning,Hinrich Schütze +1 more
- 28 May 1999