Knowledge-based biomedical Data Science.

doi:10.3233/DS-170001

Open AccessJournal Article10.3233/DS-170001

Knowledge-based biomedical Data Science.

Lawrence Hunter

- 01 Jan 2017

- Vol. 1, pp 19-25

44

TL;DR: This position paper argues that knowledge-based data science research is ripe for expansion, and expanded application.

Abstract: Computational manipulation of knowledge is an important, and often under-appreciated, aspect of biomedical Data Science. The first Data Science initiative from the US National Institutes of Health was entitled "Big Data to Knowledge (BD2K)." The main emphasis of the more than $200M allocated to that program has been on "Big Data;" the "Knowledge" component has largely been the implicit assumption that the work will lead to new biomedical knowledge. However, there is long-standing and highly productive work in computational knowledge representation and reasoning, and computational processing of knowledge has a role in the world of Data Science. Knowledge-based biomedical Data Science involves the design and implementation of computer systems that act as if they knew about biomedicine. There are many ways in which a computational approach might act as if it knew something: for example, it might be able to answer a natural language question about a biomedical topic, or pass an exam; it might be able to use existing biomedical knowledge to rank or evaluate hypotheses; it might explain or interpret data in light of prior knowledge, either in a Bayesian or other sort of framework. These are all examples of automated reasoning that act on computational representations of knowledge. After a brief survey of existing approaches to knowledge-based data science, this position paper argues that such research is ripe for expansion, and expanded application.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1016/J.JNCA.2021.103076

Domain-specific knowledge graphs: A survey

Bilal Abu-Salih

- 01 Jul 2021

- Journal of Network and Computer Applicat...

TL;DR: This survey is the first to provide an inclusive definition to the notion of domain KG, and a comprehensive review of the state-of-the-art approaches drawn from academic works relevant to seven dissimilar domains of knowledge is provided.

...read moreread less

308

Journal Article•10.1038/s41551-022-00942-x

Graph representation learning in biomedicine and healthcare

Michelle Li, +2 more

- 31 Oct 2022

- Nature Biomedical Engineering

TL;DR: It is argued that graph representation learning will keep pushing forward machine learning for biomedicine and healthcare applications, including the identification of genetic variants underlying complex traits, the disentanglement of single-cell behaviours and their effects on health, the assistance of patients in diagnosis and treatment, and the development of safe and effective medicines.

...read moreread less

196

•Journal Article•10.1093/BIB/BBAA199

Semantic similarity and machine learning with ontologies

Maxat Kulmanov, +3 more

- 20 Jul 2021

- Briefings in Bioinformatics

TL;DR: An overview over the methods that use ontologies to compute similarity and incorporate them in machine learning methods is provided, which outlines how semantic similarity measures and ontology embeddings can exploit the background knowledge in ontologies and how ontologies can provide constraints that improve machine learning models.

...read moreread less

160

•Journal Article•10.1038/NPRE.2009.3212.1

Reflect: Augmented Browsing for the Life Scientist

Evangelos Pafilis, +6 more

- 04 May 2009

- Nature Precedings

TL;DR: Reflect as discussed by the authors tags gene, protein, and small molecule names in any web page, typically within a few seconds, and without affecting document layout, and shows a concise summary that includes synonyms, database identifiers, sequence, domains, 3D structure, interaction partners, subcellular location, and related literature.

...read moreread less

69

•Posted Content

Erratum: Link prediction in drug-target interactions network using similarity indices

Yiding Lu, +2 more

- 01 Nov 2017

- arXiv: Artificial Intelligence

TL;DR: This paper proposes a new, alternative method for DTI prediction that makes use of only network topology information attempting to solve the problem of in silico drug-target interaction (DTI) prediction, and shows that when applied to the MATADOR database, the approach based on node neighborhoods yield higher precision for high-ranking predictions than RBM when no information regarding DTI types is available.

...read moreread less

59

...

Expand

References

•Journal Article•10.1101/GR.1239303

Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks

Paul Shannon, +8 more

- 01 Nov 2003

- Genome Research

TL;DR: Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models.

...read moreread less

46.4K

•Journal Article•10.1093/NAR/GKN923

Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists

Da-Wei Huang, +2 more

- 01 Jan 2009

- Nucleic Acids Research

TL;DR: The survey will help tool designers/developers and experienced end users understand the underlying algorithms and pertinent details of particular tool categories/tools, enabling them to make the best choices for their particular research interests.

...read moreread less

14.9K

Journal Article•10.1038/NBT1206-1565

What is a support vector machine

William Stafford Noble

- 01 Dec 2006

- Nature Biotechnology

TL;DR: Support vector machines are becoming popular in a wide variety of biological applications, but how do they work and what are their most promising applications in the life sciences?

...read moreread less

6K

•Journal Article•10.1093/NAR/GKV1351

The Reactome Pathway Knowledgebase.

Antonio Fabregat, +25 more

- 01 Jan 2014

- Nucleic Acids Research

TL;DR: The Reactome Knowledgebase provides molecular details of signal transduction, transport, DNA replication, metabolism and other cellular processes as an ordered network of molecular transformations—an extended version of a classic metabolic map, in a single consistent data model.

...read moreread less

5.9K

•Proceedings Article•10.1145/2736277.2741093

LINE: Large-scale Information Network Embedding

Jian Tang, +5 more

- 18 May 2015

TL;DR: A novel network embedding method called the ``LINE,'' which is suitable for arbitrary types of information networks: undirected, directed, and/or weighted, and optimizes a carefully designed objective function that preserves both the local and global network structures.

...read moreread less

4.9K

...

Expand

Knowledge-based biomedical Data Science.

Chat with Paper

AI Agents for this Paper

Citations

Domain-specific knowledge graphs: A survey

Graph representation learning in biomedicine and healthcare

Semantic similarity and machine learning with ontologies

Reflect: Augmented Browsing for the Life Scientist

Erratum: Link prediction in drug-target interactions network using similarity indices

References

Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks

Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists

What is a support vector machine

The Reactome Pathway Knowledgebase.

LINE: Large-scale Information Network Embedding

Related Papers (5)

Knowledge-based Biomedical Data Science 2019.

Guest Editors' Introduction: Approaches to Knowledge Representation

Reflections on 25+ years of knowledge acquisition

Computational Knowledge and Ontology

Knowledge Graphs: A Tutorial on the History of Knowledge Graph's Main Ideas