Modeling Tree Structures, Machine Learning, and Information Extraction

Open Access

Modeling Tree Structures, Machine Learning, and Information Extraction

- 01 Jan 2007

10

TL;DR: This project wants to incorporate novel approaches for modeling tree structure and emerging techniques for machine learning into adaptive information extraction systems for the Web.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1145/242224.242229

Machine learning

Thomas G. Dietterich

- 01 Dec 1996

- ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

14K

Efficient Learning of Semi-structured Data from Queries

Hiroki Arimura, +5 more

- 01 May 2001

TL;DR: This paper presents a polynomial time learning algorithm for µ-OGT, the subclass of OGT without repeated tree variables, and gives representation-independent hardness results which indicate that both of equivalence and membership queries are necessary to learn µ- OGT.

...read moreread less

37

•Journal Article

Parallelism and tree regular constraints

Joachim Niehren, +1 more

- 01 Jan 2002

- Lecture Notes in Computer Science

TL;DR: It is proved that parallelism constraints and context unification remain equivalent when extended with tree regular constraints.

...read moreread less

5

Proceedings Article•10.1142/9789812792464_0021

On decidability of boundedness property for regular path queries

Yves Andre, +2 more

- 01 Nov 2000

TL;DR: In this paper, the authors studied the evaluation of regular path queries on semi-structured data, i.e. path queries of the form nd all objects reachable by path whose labels form a word in p where p is a regular expression.

...read moreread less

3

Analyzing the Average-Case Behavior of Conjunctive Learning Algorithms

Rüdiger Reischuk, +1 more

- 01 Aug 1998

TL;DR: A new learning model, stochastic nite learning, in which, in contrast to PAC learning, some information about the underlying distribution is given and the goal is to find a correct (not only approximatively correct) hypothesis.

...read moreread less

2

References

Journal Article•10.1145/242224.242229

Machine learning

Thomas G. Dietterich

- 01 Dec 1996

- ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

14K

•Book

Foundations of Statistical Natural Language Processing

Christopher D. Manning, +1 more

- 28 May 1999

TL;DR: This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear and provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.

...read moreread less

10.9K

Proceedings Article•10.1145/279943.279962

Combining labeled and unlabeled data with co-training

Avrim Blum, +1 more

- 24 Jul 1998

TL;DR: A PAC-style analysis is provided for a problem setting motivated by the task of learning to classify web pages, in which the description of each example can be partitioned into two distinct views, to allow inexpensive unlabeled data to augment, a much smaller set of labeled examples.

...read moreread less

6.4K

•Journal Article•10.1016/S0019-9958(67)91165-5

Language identification in the limit

E. Mark Gold

- 01 May 1967

- Information & Computation

TL;DR: It was found that theclass of context-sensitive languages is learnable from an informant, but that not even the class of regular languages is learningable from a text.

...read moreread less

3.8K

•Journal Article•10.1023/A:1007692713085

Text Classification from Labeled and Unlabeled Documents using EM

Kamal Nigam, +3 more

- 01 May 2000

- Machine Learning

TL;DR: This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents, and presents two extensions to the algorithm that improve classification accuracy under these conditions.

...read moreread less

3.4K