CDD/SPARCLE: the conserved domain database in 2020
Shennan Lu,Jiyao Wang,Farideh Chitsaz,Myra K. Derbyshire,Renata C. Geer,Noreen R. Gonzales,Marc Gwadz,David I. Hurwitz,Gabriele H. Marchler,James S. Song,Narmada Thanki,Roxanne A. Yamashita,Mingzhang Yang,Dachuan Zhang,Chanjuan Zheng,Christopher J. Lanczycki,Aron Marchler-Bauer +16 more
TL;DR: As NLM's Conserved Domain Database (CDD) enters its 20th year of operations as a publicly available resource, curation staff continues to develop hierarchical classifications of widely distributed protein domain families, and to record conserved sites associated with molecular function, so that they can be mapped onto user queries in support of hypothesis-driven biomolecular research.
read more
Abstract: As NLM's Conserved Domain Database (CDD) enters its 20th year of operations as a publicly available resource, CDD curation staff continues to develop hierarchical classifications of widely distributed protein domain families, and to record conserved sites associated with molecular function, so that they can be mapped onto user queries in support of hypothesis-driven biomolecular research. CDD offers both an archive of pre-computed domain annotations as well as live search services for both single protein or nucleotide queries and larger sets of protein query sequences. CDD staff has continued to characterize protein families via conserved domain architectures and has built up a significant corpus of curated domain architectures in support of naming bacterial proteins in RefSeq. These architecture definitions are available via SPARCLE, the Subfamily Protein Architecture Labeling Engine. CDD can be accessed at https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Genome-wide analysis of the ABC gene family in almond and functional predictions during flower development, freezing stress, and salt stress
Dongdong Zhang,Zhenfan Yu,Bin Zeng,Xingyue Liu +3 more
TL;DR: This study preliminarily investigated the expression characteristics of ABC genes in different tissues of almond during flower development, freezing stress and salt stress, and the results will provide a reference for further in-depth research and breeding of almond in the future.
5
Neuropeptidergic regulation of compulsive ethanol seeking in C. elegans
TL;DR: In this article , the authors found that neuropeptide signaling via SEB-3, a CRF receptor-like GPCR, facilitates the development of ethanol preference and compels animals to seek ethanol compulsively.
Completion of Maize Stripe Virus Genome Sequence and Analysis of Diverse Isolates.
Stephen Bolus,Kathryn S Braithwaite,Samuel Grinstead,Irazema Fuentes-Bueno,Robert Beiriger,Bryce W. Falk,Dimitre Mollov +6 more
TL;DR: Maize stripe virus is a pathogen of corn and sorghum in subtropical and tropical regions worldwide as mentioned in this paper, and it is most closely related to rice stripe virus.
Whole-Genome Sequencing of a Chlorimuron-Ethyl-Degrading Strain: Chenggangzhangella methanolivorans CHL1 and Its Degrading Enzymes
Zhi-hao Yu,Wu Gu,Yi Yang,Xiang Zhen Li,Xinyu Li,Tingting Li,Jian Dong Wang,Zhencheng Su,Xu Li,Yumeng Dai,Mingkai Xu,Huiwen Zhang +11 more
TL;DR: Three key functional genes of strain CHL1 that are involved in the degradation of chlorimuron-ethyl are identified and new approaches by which to construct engineered bacteria for the bioremediation of environments polluted with sulfonylurea herbicides are provided.
5
A New Pathway for Forming Acetate and Synthesizing ATP during Fermentation in Bacteria
Bo Zhang,Christopher Lingga,Courtney Bowman,Timothy J. Hackmann +3 more
TL;DR: A biochemical pathway for forming acetate and synthesizing ATP that was unknown in fermentative bacteria is found in the fermentative bacterium Cutibacterium granulosum, suggesting the SCACT/SCS pathway is important for acetate formation in many branches of the tree of life.
5
References
The Pfam protein families database in 2019.
Sara El-Gebali,Jaina Mistry,Alex Bateman,Sean R. Eddy,Aurelien Luciani,Simon C. Potter,Matloob Qureshi,Lorna Richardson,Gustavo A. Salazar,Alfredo Smart,Erik L. L. Sonnhammer,Layla Hirsh,Layla Hirsh,Lisanna Paladin,Damiano Piovesan,Silvio C. E. Tosatto,Robert D. Finn +16 more
TL;DR: A significant comparison to the structural classification database that led to the creation of 825 new families based on their set of uncharacterized families (EUFs) was carried out and Pfam entries were connected to the Sequence Ontology (SO) through mapping of the Pfam type definitions to SO terms.
4.7K
CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.
Aron Marchler-Bauer,Yu Bo,Lianyi Han,Jane He,Christopher J. Lanczycki,Shennan Lu,Farideh Chitsaz,Myra K. Derbyshire,Renata C. Geer,Noreen R. Gonzales,Marc Gwadz,David I. Hurwitz,Fu Lu,Gabriele H. Marchler,James S. Song,Narmada Thanki,Zhouxi Wang,Roxanne A. Yamashita,Dachuan Zhang,Chanjuan Zheng,Lewis Y. Geer,Stephen H. Bryant +21 more
TL;DR: NCBI's Conserved Domain Database (CDD) aims at annotating biomolecular sequences with the location of evolutionarily conserved protein domain footprints, and functional sites inferred from such footprints.
2.6K
CD-Search: protein domain annotations on the fly
TL;DR: The Conserved Domain Search service (CD-Search), a web-based tool for the detection of structural and functional domains in protein sequences, uses BLAST(R) heuristics to provide a fast, interactive service, and searches a comprehensive collection of domain models.
The COG database: new developments in phylogenetic classification of proteins from complete genomes
Roman L. Tatusov,Darren A. Natale,Igor Garkavtsev,Tatiana Tatusova,Uma Shankavaram,Bachoti S. Rao,Boris Kiryutin,Michael Y. Galperin,Natalie D. Fedorova,Eugene V. Koonin +9 more
TL;DR: The new features added to the COG database include information pages with structural and functional details on each COG and literature references, improvements of the COGNITOR program that is used to fit new proteins into the COGs, and classification of genomes and COGs constructed by using principal component analysis.
20 years of the SMART protein domain annotation resource.
Ivica Letunic,Peer Bork +1 more
TL;DR: In its 20th year, the SMART analysis results pages have been streamlined again and its information sources have been updated, and the internal full text search engine has been redesigned and updated, resulting in greatly increased search speed.