SIMAP

Papers

MIPS: analysis and annotation of proteins from whole genomes

Hans-Werner Mewes, Clara Amid, Roland Arnold, Dmitrij Frishman, Ulrich Güldener, Gertrud Mannhaupt, Martin Münsterkötter, Philipp Pagel, Normann Strack, Volker Stümpflen, Jens Warfsmann, Andreas Ruepp

01 Jan 2004-Nucleic Acids Research

TL;DR: The Munich Information Center for Protein Sequences (MIPS-GSF), Neuherberg, Germany, maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences.

Abstract: The Munich Information Center for Protein Sequences (MIPS-GSF), Neuherberg, Germany, provides protein sequence-related information based on whole-genome analysis. The main focus of the work is directed toward the systematic organization of sequence-related attributes as gathered by a variety of algorithms, primary information from experimental data together with information compiled from the scientific literature. MIPS maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the database of complete cDNAs (German Human Genome Project, NGFN), the database of mammalian protein-protein interactions (MPPI), the database of FASTA homologies (SIMAP), and the interface for the fast retrieval of protein-associated information (QUIPOS). The Arabidopsis thaliana database, the rice database, the plant EST databases (MATDB, MOsDB, SPUTNIK), as well as the databases for the comprehensive set of genomes (PEDANT genomes) are described elsewhere in the 2003 and 2004 NAR database issues, respectively. All databases described, and the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).

890 citations

Journal Article•10.1093/NAR/GKJ148•

MIPS: analysis and annotation of proteins from whole genomes in 2005.

Hans-Werner Mewes, Dmitrij Frishman, Klaus F. X. Mayer, Martin Münsterkötter, Octave Noubibou, Philipp Pagel, Thomas Rattei, Matthias Oesterheld, Andreas Ruepp, Volker Stümpflen

01 Jan 2006-Nucleic Acids Research

TL;DR: The Munich Information Center for Protein Sequences (MIPS at the GSF), Neuherberg, Germany, maintains manually curated databases for several reference organisms alongside more than 400 genomes automatically annotated with the PEDANT system, and develops databases covering computable information such as the basic evolutionary relations among all genes (SIMAP).

Abstract: The Munich Information Center for Protein Sequences (MIPS at the GSF), Neuherberg, Germany, provides resources related to genome information. Manually curated databases for several reference organisms are maintained. Several of these databases are described elsewhere in this and other recent NAR database issues. In a complementary effort, a comprehensive set of >400 genomes automatically annotated with the PEDANT system is maintained. The main goal of our current work on creating and maintaining genome databases is to extend gene-centered information to information on interactions within a generic comprehensive framework. We have concentrated our efforts along three lines: (i) the development of suitable comprehensive data structures and database technology, communication and query tools to include a wide range of different types of information, enabling the representation of complex information such as functional modules or networks (Genome Research Environment System); (ii) the development of databases covering computable information such as the basic evolutionary relations among all genes, namely SIMAP, the sequence similarity matrix, and the CABiNet network analysis framework; and (iii) the compilation and manual annotation of information related to interactions such as protein-protein interactions or other types of relations (e.g. MPCDB, MPPI, CYGD). All databases described and the detailed descriptions of our projects can be accessed through the MIPS WWW server (http://mips.gsf.de).

632 citations

Journal Article•10.1016/J.COMPIND.2006.02.011•

SIMAP: intelligent system for predictive maintenance application to the health condition monitoring of a windturbine gearbox

Mari Cruz Garcia¹, Miguel A. Sanz-Bobi¹, Javier del Pico

Comillas Pontifical University¹

01 Aug 2006-Computers in Industry

TL;DR: In this real-world case, SIMAP optimizes and dynamically adapts the maintenance calendar of a monitored wind turbine according to the turbine's actual needs and operating life, as well as other technical and economic criteria.

345 citations

Journal Article•10.1093/NAR/GKJ106•

SIMAP: the similarity matrix of proteins

Thomas Rattei¹, Roland Arnold, Patrick Tischler, Dominik Lindner, Volker Stümpflen, H. Werner Mewes

Technische Universität München¹

01 Jan 2006-Nucleic Acids Research

TL;DR: This work implements SIMAP, a database containing the similarity space formed by almost all amino acid sequences from public databases and completely sequenced genomes, together with a backbone for similarity computation based on FASTA heuristics.

Abstract: Similarity Matrix of Proteins (SIMAP) (http://mips.gsf.de/simap) provides a database based on a pre-computed similarity matrix covering the similarity space formed by >4 million amino acid sequences from public databases and completely sequenced genomes. The database is capable of handling very large datasets and is updated incrementally. For sequence similarity searches and pairwise alignments, we implemented a grid-enabled software system, which is based on FASTA heuristics and the Smith–Waterman algorithm. Our ProtInfo system allows querying by protein sequences covered by the SIMAP dataset as well as by fragments of these sequences, highly similar sequences and title words. Each sequence in the database is supplemented with pre-calculated features generated by detailed sequence analyses. By providing WWW interfaces as well as web-services, we offer the SIMAP resource as an efficient and comprehensive tool for sequence similarity searches.
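The abstract above names the Smith–Waterman algorithm as the basis for pairwise alignments. As a rough illustration of that algorithm only, and not of SIMAP's grid-enabled implementation, a minimal score-only sketch with a linear gap penalty might look like the following. The function name and the scoring values (match 2, mismatch -1, gap -2) are assumptions chosen for readability; protein searches in practice typically use a substitution matrix and affine gap penalties.

```python
def smith_waterman_score(a: str, b: str,
                         match: int = 2, mismatch: int = -1, gap: int = -2) -> int:
    """Return the best local alignment score between sequences a and b.

    Hypothetical helper with illustrative scoring parameters.
    Keeps only two rows of the dynamic-programming table in memory.
    """
    prev = [0] * (len(b) + 1)  # row i-1 of the score table
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)  # row i; column 0 stays 0
        for j in range(1, len(b) + 1):
            # Score for aligning a[i-1] with b[j-1].
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below 0, so an
            # alignment can restart anywhere in either sequence.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    # The shared fragment "ACDE" dominates the score; flanking residues differ.
    print(smith_waterman_score("XXACDEXX", "YYACDEYY"))
```

The score is symmetric in its two arguments under this scoring scheme, which is one reason a pre-computed similarity matrix only needs to store each pair once.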

64 citations

Journal Article•10.1093/NAR/GKP949•

SIMAP--a comprehensive database of pre-calculated protein sequence similarities, domains, annotations and clusters.

Thomas Rattei¹, Patrick Tischler¹, Stefan Götz¹, Marc-André Jehl¹, Jonathan Hoser¹, Roland Arnold¹, Ana Conesa¹, Hans-Werner Mewes¹

Technische Universität München¹

01 Jan 2010-Nucleic Acids Research

TL;DR: The SIMAP database provides a comprehensive and up-to-date pre-calculation of the protein sequence similarity matrix, sequence-based features and sequence clusters, which can be used for predicting protein function and reconstructing evolutionary genesis through large-scale sequence comparison.

Abstract: The prediction of protein function as well as the reconstruction of evolutionary genesis employing sequence comparison at large is still the most powerful tool in sequence analysis. Due to the exponential growth of the number of known protein sequences and the subsequent quadratic growth of the similarity matrix, the computation of the Similarity Matrix of Proteins (SIMAP) becomes a computationally intensive task. The SIMAP database provides a comprehensive and up-to-date pre-calculation of the protein sequence similarity matrix, sequence-based features and sequence clusters. As of September 2009, SIMAP covers 48 million proteins and more than 23 million non-redundant sequences. Novel features of SIMAP include the expansion of the sequence space by including databases such as ENSEMBL as well as the integration of metagenomes based on their consistent processing and annotation. Furthermore, protein function predictions by Blast2GO are pre-calculated for all sequences in SIMAP and the data access and query functions have been improved. SIMAP assists biologists to query the up-to-date sequence space systematically and facilitates large-scale downstream projects in computational biology. Access to SIMAP is freely provided through the web portal for individuals (http://mips.gsf.de/simap/) and for programmatic access through DAS (http://webclu.bio.wzw.tum.de/das/) and Web-Service (http://mips.gsf.de/webservices/services/SimapService2.0?wsdl).
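To make the quadratic growth mentioned in the abstract concrete: an all-against-all comparison of n sequences involves n(n-1)/2 unordered pairs. The sketch below applies that formula to the figure of 23 million non-redundant sequences, treated here as exactly 23,000,000 for illustration (the abstract says "more than"). The function name is hypothetical, and the count is an upper bound on the work, since heuristic pre-filtering would avoid fully aligning most pairs.

```python
def pairwise_comparisons(n: int) -> int:
    """Number of unordered sequence pairs among n sequences: n(n-1)/2."""
    return n * (n - 1) // 2


if __name__ == "__main__":
    n = 23_000_000  # assumed round figure, taken from the abstract's lower bound
    pairs = pairwise_comparisons(n)
    print(f"{n:,} sequences -> {pairs:,} pairs")
    # Doubling the number of sequences roughly quadruples the number of pairs.
    print(pairwise_comparisons(2 * n) / pairs)
```

Because the pair count scales with the square of the sequence count, incremental updates (comparing only new sequences against the existing set) are far cheaper than recomputing the full matrix.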

53 citations

Year	Papers
2014	2
2011	1
2010	2
2009	1
2008	1
2007	4