MAPI: a software framework for distributed biomedical applications.
Johan Karlsson,Oswaldo Trelles +1 more
TL;DR: This manuscript describes a framework, called MAPI, which provides a uniform representation of resources available over the Internet, in particular for Web Services, which enhances their interoperability and collaborative use by enabling a uniform and remote access.
read more
Abstract: The amount of web-based resources (databases, tools etc.) in biomedicine has increased, but the integrated usage of those resources is complex due to differences in access protocols and data formats. However, distributed data processing is becoming inevitable in several domains, in particular in biomedicine, where researchers face rapidly increasing data sizes. This big data is difficult to process locally because of the large processing, memory and storage capacity required. This manuscript describes a framework, called MAPI, which provides a uniform representation of resources available over the Internet, in particular for Web Services. The framework enhances their interoperability and collaborative use by enabling a uniform and remote access. The framework functionality is organized in modules that can be combined and configured in different ways to fulfil concrete development requirements. The framework has been tested in the biomedical application domain where it has been a base for developing several clients that are able to integrate different web resources. The MAPI binaries and documentation are freely available at http://www.bitlab-es.com/mapi
under the Creative Commons Attribution-No Derivative Works 2.5 Spain License. The MAPI source code is available by request (GPL v3 license).
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Toward a Literature-Driven Definition of Big Data in Healthcare
TL;DR: A definition of big data in healthcare is proposed, defined by volume, which can be defined as datasets with Log(n∗p) ≥ 7 and its great variety and high velocity.
BioServices: a Common Python Package to Access Biological Web Services Programmatically
TL;DR: BioServices is a comprehensive Python framework that provides programmatic access to major bioinformatics Web Services (e.g. KEGG, UniProt, BioModels, ChEMBLdb) and wrapping additional Web Services based either on Representational State Transfer or Simple Object Access Protocol/Web Services Description Language technologies is eased by the usage of object-oriented programming.
127
A systematic literature review of data science, data analytics and machine learning applied to healthcare engineering systems
Roberto Salazar-Reyna,Fernando González-Aleu,Edgar M.A. Granda-Gutierrez,Jenny Díaz-Ramírez,Jose Arturo Garza-Reyes,Anil Kumar +5 more
TL;DR: This paper represents the most comprehensive literature-based study on the fields of data analytics, big data, data mining, and machine learning applied to healthcare engineering systems.
37
GXD: a community resource of mouse Gene Expression Data
Constance M. Smith,Jacqueline H. Finger,Terry F. Hayamizu,Ingeborg J. McCright,Jingxia Xu,Janan T. Eppig,James A. Kadin,Joel E. Richardson,Martin Ringwald +8 more
TL;DR: The Gene Expression Database (GXD) is an extensive, easily searchable, and freely available database of mouse gene expression information that can provide researchers with critical insights into the functions of genes and the molecular mechanisms of development, differentiation, and disease.
The mouse gene expression database: New features and how to use them effectively.
Jacqueline H. Finger,Constance M. Smith,Terry F. Hayamizu,Ingeborg J. McCright,Jingxia Xu,Janan T. Eppig,James A. Kadin,Joel E. Richardson,Martin Ringwald +8 more
TL;DR: The Gene Expression Database (GXD) curates and integrates expression data from the literature, via electronic data submissions, and by collaborations with large‐scale projects, and provides tools for the research community to search for and analyze expression data in this larger context.
16
References
Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences
TL;DR: Galaxy Pages are interactive, web-based documents that provide users with a medium to communicate a complete computational analysis and provide support for capturing the context and intent of computational methods.
The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants
TL;DR: The FASTQ format is defined, covering the original Sanger standard, the Solexa/Illumina variants and conversion between them, based on publicly available information such as the MAQ documentation and conventions recently agreed by the Open Bioinformatics Foundation projects Biopython, BioPerl, BioRuby, BioJava and EMBOSS.
1.8K
Principled design of the modern Web architecture
TL;DR: The Representational State Transfer (REST) architectural style is introduced, developed as an abstract model of the Web architecture and used to guide the redesign and definition of the Hypertext Transfer Protocol and Uniform Resource Identifiers.
1.6K
SURVEY AND SUMMARY The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants
Peter J. A. Cock,Christopher J. Fields,Naohisa Goto,Michael L. Heuer,Peter M. Rice +4 more
- 01 Jan 2010
TL;DR: FASTQ has emerged as a common file format for sharing sequencing read data combining both the sequence and an associated per base quality score, despite lacking any formal definition to date, and existing in at least three incompatible variants as mentioned in this paper.
BioCatalogue: a universal catalogue of web services for the life sciences
Jiten Bhagat,Franck Tanoh,Eric Nzuobontane,Thomas Laurent,Jerzy Orlowski,Marco Roos,Katy Wolstencroft,Sergejs Aleksejevs,Robert Stevens,Steve Pettifer,Rodrigo Lopez,Carole Goble +11 more
TL;DR: The use of Web Services to enable programmatic access to on-line bioinformatics is becoming increasingly important in the Life Sciences, but their number, distribution and the variable quality of their documentation can make their discovery and subsequent use difficult.