Machine Learning Methods for Predicting Human-Adaptive Influenza A Viruses Based on Viral Nucleotide Compositions.
Jing Li, Sen Zhang, Bo Li, Yi Hu, Xiaoping Kang, Wu Xiaoyan, Meng-Ting Huang, Li Yuchang, Zhongpeng Zhao, Cheng-Feng Qin, Tao Jiang
TL;DR: ML models built on codon position-based nucleotide compositions performed well in predicting the human adaptation of swine/avian IAVs before and after the 2009 H1N1 pandemic, and can facilitate the identification of key viral factors that affect virus transmission/pathogenicity.
Abstract: Each influenza pandemic was caused at least partly by avian- and/or swine-origin influenza A viruses (IAVs). The timing of the next pandemic and the IAVs potentially involved in it are currently unpredictable. We aimed to build machine learning (ML) models that predict the human adaptation of IAVs from their nucleotide composition. A total of 217,549 IAV full-length coding sequences of the PB2 (polymerase basic protein-2), PB1, PA (polymerase acidic protein), HA (hemagglutinin), NP (nucleoprotein), and NA (neuraminidase) segments were decomposed into their codon position-based mononucleotide (12 nts) and dinucleotide (48 dnts) compositions. A total of 68,742 human sequences and 68,739 avian sequences (1:1) were resampled to characterize the human adaptation-associated (d)nts with principal component analysis (PCA) and other ML models. The human adaptation of IAV sequences was then predicted from the characterized (d)nts. For the six segments, 9, 12, 11, 13, 10, and 9 human-adaptive (d)nts were optimized, respectively. PCA and hierarchical clustering analysis revealed that the optimized (d)nts linearly separate the human-adaptive and avian-adaptive sets. The confusion matrices and the areas under the receiver operating characteristic curve indicated that the ML models predict human adaptation of IAVs with high performance. Our model performed well in predicting the human adaptation of swine/avian IAVs before and after the 2009 H1N1 pandemic. In conclusion, we identified the human adaptation-associated genomic composition of IAV segments. ML models for predicting IAV human adaptation from large IAV genomic data sets can facilitate the identification of key viral factors that affect virus transmission/pathogenicity. Most importantly, such models allow the prediction of pandemic influenza.
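The decomposition step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `codon_position_features`, the feature labels (such as `A1` or `AT_12`), and the normalization (each count divided by the total for its codon position or frame) are assumptions made here. It also assumes that the 48 dinucleotide features are the 16 dinucleotides counted in three frames: codon positions 1-2, 2-3, and 3-1 (the third base of one codon followed by the first base of the next).

```python
from itertools import product

BASES = "ACGT"
FRAMES = ("12", "23", "31")  # assumed dinucleotide frames by codon position


def codon_position_features(cds):
    """Return 12 mononucleotide and 48 dinucleotide frequencies for one CDS."""
    seq = cds.upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")

    # Initialise all 60 features so every sequence yields the same keys.
    counts = {f"{b}{p}": 0 for p in (1, 2, 3) for b in BASES}
    counts.update(
        {f"{a}{b}_{f}": 0 for f in FRAMES for a, b in product(BASES, repeat=2)}
    )
    mono_totals = {1: 0, 2: 0, 3: 0}
    di_totals = {f: 0 for f in FRAMES}

    for i, base in enumerate(seq):
        if base not in BASES:
            continue  # skip ambiguous bases such as N
        pos = i % 3 + 1
        counts[f"{base}{pos}"] += 1
        mono_totals[pos] += 1
        # Count the dinucleotide starting at this base, if the next base is valid.
        if i + 1 < len(seq) and seq[i + 1] in BASES:
            frame = FRAMES[i % 3]
            counts[f"{base}{seq[i + 1]}_{frame}"] += 1
            di_totals[frame] += 1

    # Convert counts to frequencies within each codon position or frame.
    features = {}
    for key, n in counts.items():
        if "_" in key:
            total = di_totals[key.split("_")[1]]
        else:
            total = mono_totals[int(key[1])]
        features[key] = n / total if total else 0.0
    return features


if __name__ == "__main__":
    feats = codon_position_features("ATGGCCATG")
    print(len(feats), feats["A1"], feats["AT_12"])
```

Each sequence then becomes a fixed-length vector of 60 values, which is the kind of input that PCA or a classifier can take directly; the per-segment subsets of optimized (d)nts reported in the abstract would be selected from these 60 features.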