CDD/SPARCLE: the conserved domain database in 2020
Shennan Lu,Jiyao Wang,Farideh Chitsaz,Myra K. Derbyshire,Renata C. Geer,Noreen R. Gonzales,Marc Gwadz,David I. Hurwitz,Gabriele H. Marchler,James S. Song,Narmada Thanki,Roxanne A. Yamashita,Mingzhang Yang,Dachuan Zhang,Chanjuan Zheng,Christopher J. Lanczycki,Aron Marchler-Bauer
TL;DR: As NLM's Conserved Domain Database (CDD) enters its 20th year of operations as a publicly available resource, curation staff continues to develop hierarchical classifications of widely distributed protein domain families, and to record conserved sites associated with molecular function, so that they can be mapped onto user queries in support of hypothesis-driven biomolecular research.
Abstract: As NLM's Conserved Domain Database (CDD) enters its 20th year of operations as a publicly available resource, CDD curation staff continues to develop hierarchical classifications of widely distributed protein domain families, and to record conserved sites associated with molecular function, so that they can be mapped onto user queries in support of hypothesis-driven biomolecular research. CDD offers both an archive of pre-computed domain annotations and live search services, the latter for single protein or nucleotide queries as well as for larger sets of protein query sequences. CDD staff has continued to characterize protein families via conserved domain architectures and has built up a significant corpus of curated domain architectures in support of naming bacterial proteins in RefSeq. These architecture definitions are available via SPARCLE, the Subfamily Protein Architecture Labeling Engine. CDD can be accessed at https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.
Citations
Omics‐based molecular analyses of adhesion by aquatic invertebrates
Peter John Davey,Anne Marie Power,Romana Santos,Philip Bertemes,Peter Ladurner,Pawel Palmowski,Jessica L. Clarke,Patrick Flammang,Birgit Lengerer,Elise Hennebert,Ute Rothbächer,Robert Pjeta,Julia Wunderer,Michal Zurovec,Nick Aldred
TL;DR: This paper reviews the various ways in which 'omics have contributed to our understanding of adhesion by aquatic invertebrates, with new data to illustrate key points.
43
Convergent evolution of processivity in bacterial and fungal cellulases.
Taku Uchiyama,Takayuki Uchihashi,Akihiko Nakamura,Hiroki Watanabe,Satoshi Kaneko,Masahiro Samejima,Kiyohiko Igarashi
TL;DR: High-speed atomic force microscopic observations of the movement of four types of cellulases derived from the cellulolytic bacterium Cellulomonas fimi on various insoluble cellulose substrates indicate that bacteria utilize family 6 cellulases as high-processivity enzymes for efficient degradation of crystalline cellulose, whereas family 7 enzymes have the same function in fungi.
42
The Natural Product Domain Seeker version 2 (NaPDoS2) webtool relates ketosynthase phylogeny to biosynthetic function
Leesa J. Klau,Sheila Podell,Kaitlin E. Creamer,Alyssa M. Demko,Hans W. Singh,Eric E. Allen,Bradley S. Moore,Nadine Ziemert,Anne-Catrin Letzel,Paul R. Jensen
TL;DR: The Natural Product Domain Seeker (NaPDoS) detects and classifies ketosynthase (KS) and condensation domains from genomic, metagenomic, and amplicon sequence data.
42
New Lineage of Microbial Predators Adds Complexity to Reconstructing the Evolutionary Origin of Animals.
Denis V. Tikhonenkov,Kirill V. Mikhailov,Elisabeth Hehenberger,Sergei A. Karpov,Kristina I. Prokina,Anton S. Esaulov,Olga I. Belyakova,Yuri Mazei,Alexander P. Mylnikov,Vladimir V. Aleoshin,Patrick J. Keeling
TL;DR: Phylogenomics, including Tunicaraptor, challenges the existing framework used to reconstruct the evolution of animal-specific genes and emphasizes that the diversity of animal-related lineages may be better understood only once the smaller, more inconspicuous animal-related lineages are better studied.
41
Mitotic recombination between homologous chromosomes drives genomic diversity in diatoms
Petra Bulankova,Mirna Sekulic,Denis Jallet,Charlotte Nef,Cock van Oosterhout,Tom O. Delmont,Ilse Vercauteren,Cristina Maria Osuna-Cruz,Emmelien Vancaester,Thomas Mock,Koen Sabbe,Fayza Daboussi,Chris Bowler,Wim Vyverman,Klaas Vandepoele,Lieven De Veylder
TL;DR: In this article, the authors quantified haplotype diversity by next-generation sequencing and amplicon re-sequencing of selected loci, and documented a rapid accumulation of multiple haplotypes accompanied by the appearance of novel protein variants in cell cultures initiated from a single founder cell.
40