eggNOG 6.0: enabling comparative genomics across 12 535 organisms
Ana Hernández-Plaza,Damian Szklarczyk,J. Botas,Carlos Pérez Cantalapiedra,Joaquín Giner-Lamia,Daniel R. Mende,Rebecca Kirsch,Thomas Rattei,Ivica Letunic,Lars Juhl Jensen,Peer Bork,Christian von Mering,Jaime Huerta-Cepas +12 more
148
TL;DR: The eggNOG 6.0 as discussed by the authors database provides a hierarchy of over 17M orthologous groups (OGs) computed at 1601 taxonomic levels, spanning 10 756 bacterial, 457 archaeal and 1322 eukaryotic organisms.
read more
Abstract: Abstract The eggNOG (evolutionary gene genealogy Non-supervised Orthologous Groups) database is a bioinformatics resource providing orthology data and comprehensive functional information for organisms from all domains of life. Here, we present a major update of the database and website (version 6.0), which increases the number of covered organisms to 12 535 reference species, expands functional annotations, and implements new functionality. In total, eggNOG 6.0 provides a hierarchy of over 17M orthologous groups (OGs) computed at 1601 taxonomic levels, spanning 10 756 bacterial, 457 archaeal and 1322 eukaryotic organisms. OGs have been thoroughly annotated using recent knowledge from functional databases, including KEGG, Gene Ontology, UniProtKB, BiGG, CAZy, CARD, PFAM and SMART. eggNOG also offers phylogenetic trees for all OGs, maximising utility and versatility for end users while allowing researchers to investigate the evolutionary history of speciation and duplication events as well as the phylogenetic distribution of functional terms within each OG. Furthermore, the eggNOG 6.0 website contains new functionality to mine orthology and functional data with ease, including the possibility of generating phylogenetic profiles for multiple OGs across species or identifying single-copy OGs at custom taxonomic levels. eggNOG 6.0 is available at http://eggnog6.embl.de.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Antibiotic-resistance plasmid amplified among MRSA cases in an urban jail and its connected communities
Stephanie N. Thiede,Hannah D. Steinberg,Emily Benedict,Sarah Sansom,Alla Aroutcheva,Kyle J Gontjes,Katherine Winner,Sonya Royzenblat,Ali Pirani,Robert A. Weinstein,Chad Zawitz,Nicholas M. Moore,Kyle J Popovich,Evan S. Snitkin,Stephanie N. Thiede,Kyle J Gontjes,Robert Weinstein,Nicholas M. Moore,Kyle J Popovich,Evan Snitkin,Stephanie N. Thiede,Hannah D. Steinberg,Emily Benedict,Sarah Sansom,Alla Aroutcheva,Kyle J Gontjes,Katherine Winner,Sonya Royzenblat,Ali Pirani,Robert Weinstein,Chad Zawitz,Nicholas M. Moore,Kyle J Popovich,Evan Snitkin +33 more
- 24 Oct 2025
Abstract: Abstract Jails have been hypothesized to be hotspots for the spread of methicillin-resistant Staphylococcus aureus (MRSA). We integrate genomic and epidemiologic data to investigate USA300 MRSA transmission in an urban jail and its connected communities. A genome-wide association study of 308 jail isolates from 2015-2018 revealed a plasmid encoding the ermC clindamycin/erythromycin resistance gene was associated with a 6-fold increased odds of MRSA genetic linkages among detainees. Additionally, 52% of jail-onset MRSA infections carried this plasmid compared to 14% of intake colonization isolates, supporting its role in MRSA spread in the jail. Extending our analysis to 774 isolates from a local healthcare system from 2011-2014, the ermC- carrying plasmid was also associated with MRSA transmission in the larger community and was enriched among former jail detainees and those with related isolates to recently incarcerated cases. Lastly, topical clindamycin exposure before MRSA infection was associated with ermC plasmid presence in both settings, but exposure prevalence was higher in jail versus community cases (7.5% vs. 0.9%), suggesting antibiotic use in the jail may have created a favorable environment for the spread of ermC -carrying strains. These findings highlight the impact of antibiotic use in jails on antibiotic resistance in both jails and their surrounding communities.
Global Characterization of the Avian Gut Virome Reveals Extensive Viral Diversity and Functional Implications
Xueqin Yang,Peng-Ying Li,Yunsheng Zhang,Zezhong Wang,Di Hu,Li Yang,Zhigang Hu,Li Deng,Xia Wang,Jinlong Ru +9 more
- 16 Oct 2025
Abstract: Abstract Background The avian gut virome plays a crucial role in shaping the gastrointestinal microbial ecosystem of birds. However, its taxonomic and functional diversity remains poorly characterized due to the absence of a dedicated reference database. This limitation hampers our understanding of the complex interactions among viruses, their bacterial hosts, and the overarching avian host, as well as viral contributions to gut microbial ecology.Results To address this gap, we developed the Avian Virome Database (AvianViromeDB) by integrating 2,692 gut metagenomic samples from poultry and wild birds. This effort yielded 252,752 viral contigs, which are clustered into 61,608 high-quality, species-level viral operational taxonomic units (vOTUs). Taxonomic analysis revealed that 99.05% of these vOTUs belonged to the class Caudoviricetes , yet only 4.69% could be assigned to known viral families—suggesting over 95% likely represent novel viral lineages. Prediction of prokaryotic hosts indicated that these viruses primarily target core gut microbiota, particularly Bacillota and Bacteroidota , both central to carbohydrate metabolism. Functional annotation uncovered tens of thousands of auxiliary metabolic genes (AMGs), with enrichments in glycolysis, amino acid metabolism, and nucleotide biosynthesis pathways.Conclusion These findings demonstrate that avian gut viruses may modulate microbial communities both through direct lysis of their bacterial hosts (“top-down” control) and by altering host metabolism via AMGs (“bottom-up” modulation). The resulting high-quality genome catalog reveals the remarkable diversity and functional potential of the avian gut virome, offering a valuable resource for future research into avian microbial ecology and the intricate interplay between viruses, bacteria, and their avian hosts. The AvianViromeDB is publicly accessible at: https://phagebyte.github.io/avianviromedb.
The evolutionary tale of tailocins in <i>Pectobacterium</i> show genus-wide homologous recombination and inversions of tail fiber genes
Lakhansing Pardeshi,Anne Kupczok,Dick de Ridder,Sandra Smit,Théo van der Lee +4 more
- 01 Aug 2025
TL;DR: This study reveals the evolutionary tale of tailocins in Pectobacterium, a globally present phytopathogen, showing genus-wide homologous recombination and inversions of tail fiber genes, which enables tailocin polymorphism and host range specificity.
Additional file 1 of FAM171B stabilizes vimentin and enhances CCL2-mediated TAM infiltration to promote bladder cancer progression
HU Wei-min,Li Ming,Tang Yu-qi,Song Tian-bao,Zou Fan,Cheng Fan,YU Wei-min +6 more
- 02 Nov 2023
Abstract: Supplementary Material 1
The protein interactome of Escherichia coli carbohydrate metabolism
Shomeek Chowdhury,Stephen S. Fong,Peter Uetz +2 more
TL;DR: The protein interactome of E. coli carbohydrate metabolism regulates carbohydrate metabolism by modulating enzyme activity and stoichiometry.
References
The Protein Data Bank
Helen M. Berman,John D. Westbrook,Zukang Feng,Gary L. Gilliland,Talapady N. Bhat,Helge Weissig,Ilya N. Shindyalov,Philip E. Bourne +7 more
TL;DR: The goals of the PDB are described, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource are described.
KEGG: new perspectives on genomes, pathways, diseases and drugs
TL;DR: The content has been expanded and the quality improved irrespective of whether or not the KOs appear in the three molecular network databases, and the newly introduced addendum category of the GENES database is a collection of individual proteins whose functions are experimentally characterized and from which an increasing number of KOs are defined.
8K
The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.
Damian Szklarczyk,Annika L. Gable,Katerina C. Nastou,David Lyon,Rebecca Kirsch,Sampo Pyysalo,Nadezhda Tsankova Doncheva,Marc Legeay,Tao Fang,Peer Bork,Lars Juhl Jensen,Christian von Mering +11 more
TL;DR: Changes to the text-mining system, a new scoring-mode for physical interactions, as well as extensive user interface features for customizing, extending and sharing protein networks are described.
UniProt: the universal protein knowledgebase in 2021
Alex Bateman,Maria Jesus Martin,Sandra Orchard,Michele Magrane,Rahat Agivetova,Shadab Ahmad,Emanuele Alpi,Emily H Bowler-Barnett,Ramona Britto,Borisas Bursteinas,Hema Bye-A-Jee,Ray Coetzee,Austra Cukura,Alan Wilter Sousa da Silva,Paul Denny,Tunca Doğan,ThankGod Ebenezer,Jun Fan,Leyla Jael Garcia Castro,Penelope Garmiri,George Georghiou,Leonardo Gonzales,Emma Hatton-Ellis,Abdulrahman Hussein,Alexandr Ignatchenko,Giuseppe Insana,Rizwan Ishtiaq,Petteri Jokinen,Vishal Joshi,Dushyanth Jyothi,Antonia Lock,Rodrigo Lopez,Aurelien Luciani,Jie Luo,Yvonne Lussi,Alistair MacDougall,Fábio Madeira,Mahdi Mahmoudy,Manuela Menchi,Alok Mishra,Katie Moulang,Andrew Nightingale,Carla Susana Oliveira,Sangya Pundir,Guoying Qi,Shriya Raj,Daniel Rice,Milagros Rodriguez Lopez,Rabie Saidi,Joseph Sampson,Tony Sawford,Elena Speretta,Edward Turner,Nidhi Tyagi,Preethi Vasudev,Vladimir Volynkin,Kate Warner,Xavier Watkins,Rossana Zaru,Hermann Zellner,Alan Bridge,Sylvain Poux,Nicole Redaschi,Lucila Aimo,Ghislaine Argoud-Puy,Andrea H. Auchincloss,Kristian B. Axelsen,Parit Bansal,Delphine Baratin,Marie-Claude Blatter,Jerven Bolleman,Emmanuel Boutet,Lionel Breuza,Cristina Casals-Casas,Edouard de Castro,Kamal Chikh Echioukh,Elisabeth Coudert,Béatrice A. Cuche,M Doche,Dolnide Dornevil,Anne Estreicher,Maria Livia Famiglietti,Marc Feuermann,Elisabeth Gasteiger,Sebastien Gehant,Vivienne Baillie Gerritsen,Arnaud Gos,Nadine Gruaz-Gumowski,Ursula Hinz,Chantal Hulo,Nevila Hyka-Nouspikel,Florence Jungo,Guillaume Keller,Arnaud Kerhornou,Vicente Lara,Philippe Le Mercier,Damien Lieberherr,Thierry Lombardot,Xavier D. Martin,Patrick Masson,Anne Morgat,Teresa Batista Neto,Salvo Paesano,Ivo Pedruzzi,Sandrine Pilbout,Lucille Pourcel,Monica Pozzato,Manuela Pruess,Catherine Rivoire,Christian J. A. Sigrist,K Sonesson,Andre Stutz,Shyamala Sundaram,Michael Tognolli,Laure Verbregue,Cathy H. Wu,Cecilia N. Arighi,Leslie Arminski,Chuming Chen,Yongxing Chen,John S. Garavelli,Hongzhan Huang,Kati Laiho,Peter B. McGarvey,Darren A. Natale,Karen E. Ross,C. R. Vinayaka,Qinghua Wang,Yuqi Wang,Lai-Su L. Yeh,Jian Zhang,Patrick Ruch,Douglas Teodoro +132 more
TL;DR: The UniProtKB responded to the COVID-19 pandemic through expert curation of relevant entries that were rapidly made available to the research community through a dedicated portal and a credit-based publication submission interface was developed.
Pfam: The protein families database in 2021.
Jaina Mistry,Sara Chuguransky,Lowri Williams,Matloob Qureshi,Gustavo A. Salazar,Erik L. L. Sonnhammer,Silvio C. E. Tosatto,Lisanna Paladin,Shriya Raj,Lorna Richardson,Robert D. Finn,Alex Bateman +11 more
TL;DR: The Pfam database is a widely used resource for classifying protein sequences into families and domains and the reintroduced Pfam-B which provides an automatically generated supplement to Pfam and contains 136 730 novel clusters of sequences that are not yet matched by a Pfam family.
5.6K