InterPro

Papers

Journal Article • DOI: 10.1093/NAR/30.7.1575

An efficient algorithm for large-scale detection of protein families

Anton J. Enright¹, S. Van Dongen, Christos A. Ouzounis•Institutions (1)

European Bioinformatics Institute¹

01 Apr 2002-Nucleic Acids Research

TL;DR: This work presents TRIBE-MCL, a novel approach for rapid and accurate clustering of protein sequences into families based on precomputed sequence similarity information. The method has been rigorously tested and validated on a number of very large databases.

Abstract: Detection of protein families in large databases is one of the principal research objectives in structural and functional genomics. Protein family classification can significantly contribute to the delineation of functional diversity of homologous proteins, the prediction of function based on domain architecture or the presence of sequence motifs as well as comparative genomics, providing valuable evolutionary insights. We present a novel approach called TRIBE-MCL for rapid and accurate clustering of protein sequences into families. The method relies on the Markov cluster (MCL) algorithm for the assignment of proteins into families based on precomputed sequence similarity information. This novel approach does not suffer from the problems that normally hinder other protein sequence clustering algorithms, such as the presence of multi-domain proteins, promiscuous domains and fragmented proteins. The method has been rigorously tested and validated on a number of very large databases, including SwissProt, InterPro, SCOP and the draft human genome. Our results indicate that the method is ideally suited to the rapid and accurate detection of protein families on a large scale. The method has been used to detect and categorise protein families within the draft human genome and the resulting families have been used to annotate a large proportion of human proteins.

3,949 citations
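
The TRIBE-MCL abstract above says that families are assigned by the Markov cluster (MCL) algorithm from precomputed similarity information. The following is a minimal sketch of the general MCL idea (alternating expansion and inflation of a column-stochastic matrix), not the authors' implementation. The function names, the inflation value of 2.0, the self-loop weight of 1.0, the thresholds and the toy similarity matrix are all assumptions made for illustration.

```python
def normalize_columns(matrix):
    """Scale each column so it sums to 1 (column-stochastic matrix)."""
    n = len(matrix)
    result = [row[:] for row in matrix]
    for j in range(n):
        total = sum(matrix[i][j] for i in range(n))
        if total > 0:
            for i in range(n):
                result[i][j] = matrix[i][j] / total
    return result


def multiply(a, b):
    """Plain square-matrix product, kept dependency-free for clarity."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def markov_cluster(similarity, inflation=2.0, max_iter=100, tol=1e-9):
    """Cluster nodes of a symmetric similarity matrix (hypothetical helper)."""
    n = len(similarity)
    m = [[float(v) for v in row] for row in similarity]
    # Assumption: add a self-loop of weight 1.0 where none is given.
    for i in range(n):
        if m[i][i] == 0.0:
            m[i][i] = 1.0
    m = normalize_columns(m)
    for _ in range(max_iter):
        # Expansion: take two random-walk steps at once.
        expanded = multiply(m, m)
        # Inflation: raise entries to a power and renormalize,
        # which strengthens strong flows and weakens weak ones.
        inflated = normalize_columns(
            [[v ** inflation for v in row] for row in expanded])
        change = max(abs(inflated[i][j] - m[i][j])
                     for i in range(n) for j in range(n))
        m = inflated
        if change < tol:
            break
    # Each row with non-negligible entries is an attractor; the columns it
    # reaches form one cluster. Identical clusters are merged by the set.
    clusters = {frozenset(j for j in range(n) if m[i][j] > 1e-6)
                for i in range(n)}
    clusters.discard(frozenset())
    return sorted(sorted(c) for c in clusters)


# Toy input: proteins 0-2 are mutually similar, as are proteins 3-4.
similarity = [
    [0, 1, 1, 0, 0],
    [1, 0, 1, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 0, 1],
    [0, 0, 0, 1, 0],
]
families = markov_cluster(similarity)
print(families)  # [[0, 1, 2], [3, 4]]
```

In this sketch a higher inflation value would tend to produce smaller, tighter clusters. A real run over a large database would need sparse matrices rather than nested lists.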

Journal Article • DOI: 10.1093/NAR/GKI442

InterProScan: protein domains identifier.

Emmanuel Quevillon¹, Ville Silventoinen¹, Sharmila Pillai¹, Nicola Harte¹, Nicola Mulder¹, Rolf Apweiler¹, Rodrigo Lopez¹•Institutions (1)

European Bioinformatics Institute¹

01 Jul 2005-Nucleic Acids Research

TL;DR: InterProScan is a tool that combines different protein signature recognition methods from the InterPro consortium member databases into one resource and can analyse protein as well as DNA sequences.

Abstract: InterProScan [E. M. Zdobnov and R. Apweiler (2001) Bioinformatics, 17, 847-848] is a tool that combines different protein signature recognition methods from the InterPro [N. J. Mulder, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bradley, P. Bork, P. Bucher, L. Cerutti et al. (2005) Nucleic Acids Res., 33, D201-D205] consortium member databases into one resource. At the time of writing there are 10 distinct publicly available databases in the application. Protein as well as DNA sequences can be analysed. A web-based version is accessible for academic and commercial organizations from the EBI (http://www.ebi.ac.uk/InterProScan/). In addition, a standalone Perl version and a SOAP Web Service [J. Snell, D. Tidwell and P. Kulchenko (2001) Programming Web Services with SOAP, 1st edn. O'Reilly Publishers, Sebastopol, CA, http://www.w3.org/TR/soap/] are also available to the users. Various output formats are supported and include text tables, XML documents, as well as various graphs to help interpret the results.

3,091 citations

Journal Article • DOI: 10.1093/BIOINFORMATICS/17.9.847

InterProScan--an integration platform for the signature-recognition methods in InterPro.

Evgeni M. Zdobnov¹, Rolf Apweiler¹•Institutions (1)

European Bioinformatics Institute¹

01 Sep 2001-Bioinformatics

TL;DR: The Perl-based InterProScan implementation is available from the EBI ftp server (ftp://ftp.ebi.ac.uk/pub/software/unix/iprscan/) and the SRS-based InterProScan is available upon request.

Abstract: Summary: InterProScan is a tool that scans given protein sequences against the protein signatures of the InterPro member databases, currently PROSITE, PRINTS, Pfam, ProDom and SMART. The number of signature databases and their associated scanning tools as well as the further refinement procedures make the problem complex. InterProScan is designed to be a scalable and extensible system with a robust internal architecture. Availability: The Perl-based InterProScan implementation is available from the EBI ftp server (ftp://ftp.ebi.ac.uk/pub/software/unix/iprscan/) and the SRS-based InterProScan is available upon request. We provide the public web interface (http://www.ebi.ac.uk/interpro/scan.html) as well

3,082 citations

Journal Article • DOI: 10.1155/2008/619832

Blast2GO: A comprehensive suite for functional analysis in plant genomics.

Ana Conesa, Stefan Götz

30 Apr 2008-International Journal of Plant Genomics

TL;DR: The Blast2GO suite is described as a comprehensive bioinformatics tool for functional annotation of sequences and data mining on the resulting annotations, primarily based on the gene ontology (GO) vocabulary.

Abstract: Functional annotation of novel sequence data is a primary requirement for the utilization of functional genomics approaches in plant research. In this paper, we describe the Blast2GO suite as a comprehensive bioinformatics tool for functional annotation of sequences and data mining on the resulting annotations, primarily based on the gene ontology (GO) vocabulary. Blast2GO optimizes function transfer from homologous sequences through an elaborate algorithm that considers similarity, the extension of the homology, the database of choice, the GO hierarchy, and the quality of the original annotations. The tool includes numerous functions for the visualization, management, and statistical analysis of annotation results, including gene set enrichment analysis. The application supports InterPro, enzyme codes, KEGG pathways, GO directed acyclic graphs (DAGs), and GOSlim. Blast2GO is a suitable tool for plant genomics research because of its versatility, easy installation, and user-friendliness.

2,174 citations

Journal Article • DOI: 10.1093/NAR/GKN785

InterPro: the integrative protein signature database

01 Jan 2009-Nucleic Acids Research

TL;DR: The InterPro database integrates together predictive models or ‘signatures’ representing protein domains, families and functional sites from multiple, diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs.

Abstract: The InterPro database (http://www.ebi.ac.uk/interpro/) integrates predictive models or 'signatures' representing protein domains, families and functional sites from multiple, diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. Integration is performed manually, and approximately half of the roughly 58,000 signatures available in the source databases belong to an InterPro entry. Recently, we have started to also display the remaining un-integrated signatures via our web interface. Other developments include the provision of non-signature data, such as structural data, in new XML files on our FTP site, as well as the inclusion of matchless UniProtKB proteins in the existing match XML files. The web interface has been extended and now links out to the ADAN predicted protein-protein interaction database and the SPICE and Dasty viewers. The latest public release (v18.0) covers 79.8% of UniProtKB (v14.1) and consists of 16,549 entries. InterPro data may be accessed either via the web address above, via web services, by downloading files by anonymous FTP or by using the InterProScan search software (http://www.ebi.ac.uk/Tools/InterProScan/).

2,133 citations

Papers published on a yearly basis

Year	Papers
2021	7
2020	10
2019	10
2018	12
2017	9
2016	11