Multi-Relational Learning, Text Mining, and Semi-Supervised Learning for Functional Genomics

doi:10.1023/B:MACH.0000035472.73496.0C

Open AccessJournal Article10.1023/B:MACH.0000035472.73496.0C

Multi-Relational Learning, Text Mining, and Semi-Supervised Learning for Functional Genomics

Mark-A. Krogel, +1 more

- 01 Oct 2004

- Machine Learning

- Vol. 57, Iss: 1, pp 61-81

81

TL;DR: The goal is to study the effectiveness of approaches that utilize all data sources that are available in this problem setting, including relational data, abstracts of research papers, and unlabeled data, and a propositionalization approach which uses relational gene interaction data.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book•10.7551/MITPRESS/9780262033589.001.0001

Semi-Supervised Learning

Olivier Chapelle, +2 more

- 31 Mar 2010

TL;DR: Semi-supervised learning (SSL) as discussed by the authors is the middle ground between supervised learning (in which all training examples are labeled) and unsupervised training (where no label data are given).

...read moreread less

4.2K

•Journal Article•10.1109/TPAMI.2018.2798607

Multimodal Machine Learning: A Survey and Taxonomy

Tadas Baltrusaitis, +2 more

- 01 Feb 2019

- IEEE Transactions on Pattern Analysis an...

TL;DR: This paper surveys the recent advances in multimodal machine learning itself and presents them in a common taxonomy to enable researchers to better understand the state of the field and identify directions for future research.

...read moreread less

3.4K

•Journal Article•10.1186/GB-2008-9-S2-S2

Overview of BioCreative II gene mention recognition

Larry Smith, +36 more

- 01 Sep 2008

- Genome Biology

TL;DR: It is demonstrated that, by combining the results from all submissions, an F score of 0.9066 is feasible, and furthermore that the best result makes use of the lowest scoring submissions.

...read moreread less

581

•Posted Content

Data Programming: Creating Large Training Sets, Quickly

Alexander Ratner, +4 more

- 25 May 2016

- arXiv: Machine Learning

TL;DR: A paradigm for the programmatic creation of training sets called data programming is proposed in which users express weak supervision strategies or domain heuristics as labeling functions, which are programs that label subsets of the data, but that are noisy and may conflict.

...read moreread less

460

•Journal Article•10.1109/TPAMI.2005.224

Open set face recognition using transduction

Fayin Li, +1 more

- 01 Nov 2005

- IEEE Transactions on Pattern Analysis an...

TL;DR: Open set TCM-kNN (transduction confidence machine-k nearest neighbors), suitable for multiclass authentication operational scenarios that have to include a rejection option for classes never enrolled in the gallery, is shown to be suitable for PSEI (pattern specific error inhomogeneities) error analysis in order to identify difficult to recognize faces.

...read moreread less

195

...

Expand

References

Journal Article•10.1108/EB046814

An algorithm for suffix stripping

M. F. Porter

- 01 Dec 1997

- Program: Electronic Library and Informat...

TL;DR: An algorithm for suffix stripping is described, which has been implemented as a short, fast program in BCPL, and performs slightly better than a much more elaborate system with which it has been compared.

...read moreread less

9.1K

•Proceedings Article•10.5555/299094

Advances in kernel methods: support vector learning

Bernhard Schölkopf, +2 more

- 08 Feb 1999

TL;DR: Support vector machines for dynamic reconstruction of a chaotic system, Klaus-Robert Muller et al pairwise classification and support vector machines, Ulrich Kressel.

...read moreread less

7.3K

•Journal Article•10.1016/S0031-3203(96)00142-2

The use of the area under the ROC curve in the evaluation of machine learning algorithms

Andrew P. Bradley

- 01 Jul 1997

- Pattern Recognition

TL;DR: AUC exhibits a number of desirable properties when compared to overall accuracy: increased sensitivity in Analysis of Variance (ANOVA) tests; a standard error that decreased as both AUC and the number of test samples increased; decision threshold independent; and it is invariant to a priori class probabilities.

...read moreread less

7K

Journal Article•10.1016/S0167-739X(97)00014-9

Data mining

Se June Hong

- 01 Nov 1997

- Future Generation Computer Systems

TL;DR: The graduate certificate’s narrow focus allows you to dig deep into this specific topic, and start applying your knowledge sooner.

...read moreread less

6.8K

Proceedings Article•10.1145/279943.279962

Combining labeled and unlabeled data with co-training

Avrim Blum, +1 more

- 24 Jul 1998

TL;DR: A PAC-style analysis is provided for a problem setting motivated by the task of learning to classify web pages, in which the description of each example can be partitioned into two distinct views, to allow inexpensive unlabeled data to augment, a much smaller set of labeled examples.

...read moreread less

6.4K