PhenotypeSimulator: A comprehensive framework for simulating multi-trait, multi-locus genotype to phenotype relationships.
Hannah Meyer,Ewan Birney +1 more
TL;DR: PhenotypeSimulator is developed, a comprehensive phenotype simulation scheme that can model multiple traits with multiple underlying genetic loci as well as complex covariate and observational noise structure.
read more
Abstract: Motivation: Simulation is a critical part of method development and assessment. With the increasing sophistication of multi-trait and multi-locus genetic analysis techniques, it is important that the community has flexible simulation tools to challenge and explore the properties of these methods. Results: We have developed PhenotypeSimulator, a comprehensive phenotype simulation scheme that can model multiple traits with multiple underlying genetic loci as well as complex covariate and observational noise structure. This package has been designed to work with many common genetic tools both for input and output. We describe the underlying components of this simulation tool and illustrate its use on an example dataset. Availability and implementation: PhenotypeSimulator is available as a well documented R/CRAN package and the code is available on github: https://github.com/HannahVMeyer/PhenotypeSimulator. Supplementary information: Supplementary data are available at Bioinformatics online.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Multi-ancestry eQTL meta-analysis of human brain identifies candidate causal variants for brain-related traits
TL;DR: A meta-analysis of 3,983 RNA-seq samples from 2,119 donors using the multivariate multiple QTL (mmQTL) approach characterizes the genetics of gene expression in the human brain and identifies candidate causal variants for brain-related traits as mentioned in this paper .
72
Persistent Overactive Cytotoxic Immune Response in a Spanish Cohort of Individuals With Long-COVID: Identification of Diagnostic Biomarkers
Miguel Ladero Galán,Lorena Vigón,Daniel Fuertes,M. A. Murciano-Antón,G. Casado-Fernández,Susana Domínguez-Mateos,Elena Mateos,F. Ramos-Martín,Vicente Planelles,M. Torres,Sara Rodríguez-Mora,María Rosa López-Huertas,Mayte Coiras +12 more
TL;DR: It was determined that individuals with Long-COVID showed significantly increased levels of functional memory cells with high antiviral cytotoxic activity such as CD8+ TEMRA cells, CD8±TCRγδ+ cells, and NK cells with CD56+CD57+NKG2C+ phenotype.
67
simplePHENOTYPES: SIMulation of pleiotropic, linked and epistatic phenotypes
TL;DR: simplePHENOTYPES as discussed by the authors is an R/CRAN package that simulates pleiotropy, partial pleioropy, and spurious pleiotropies in a wide range of genetic architectures, including additive, dominance and epistatic models.
Large-scale genomic analyses reveal insights into pleiotropy across circulatory system diseases and nervous system disorders
Xinyuan Zhang,Anastasia Lucas,Yogasudha Veturi,Theodore G. Drivas,William P. Bone,Anurag Verma,Wendy K. Chung,David R. Crosslin,Joshua C. Denny,Scott J. Hebbring,Gail P. Jarvik,Iftikhar J. Kullo,Eric B. Larson,Laura J. Rasmussen-Torvik,Daniel J. Schaid,Jordan W. Smoller,Ian B. Stanaway,Wei Wei,Chunhua Weng,Marylyn D. Ritchie +19 more
TL;DR: In this article , the authors characterized pleiotropy across 107 circulatory system and 40 nervous system traits using an ensemble of methods in the eMERGE Network and UK Biobank.
Identifying novel associations in GWAS by hierarchical Bayesian latent variable detection of differentially misclassified phenotypes
TL;DR: PheLEx shows promise in reanalyzing GWAS datasets to provide supplemental candidate loci that are ignored by traditional GWAS analysis methodologies and dramatically improves recovery of the correct disease state when considering realistic allele effect sizes compared to existing methodologies designed for Bayesian recovery of disease phenotypes.
References
A power primer.
TL;DR: A convenient, although not comprehensive, presentation of required sample sizes is providedHere the sample sizes necessary for .80 power to detect effects at these levels are tabled for eight standard statistical tests.
43.7K
Second-generation PLINK: rising to the challenge of larger and richer datasets
Christopher C. Chang,Carson C. Chow,Laurent C. A. M. Tellier,Shashaank Vattikuti,Shaun Purcell,James J. Lee +5 more
TL;DR: The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility, and for the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.
An integrated map of genetic variation from 1,092 human genomes
Gonçalo R. Abecasis,Adam Auton,Lisa D. Brooks,Mark A. DePristo,Richard Durbin,Robert E. Handsaker,Robert E. Handsaker,Hyun Min Kang,Gabor T. Marth,Gil McVean +9 more
TL;DR: It is shown that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites.
Second-generation PLINK: rising to the challenge of larger and richer datasets
Christopher C. Chang,Carson C. Chow,Laurent C. A. M. Tellier,Shashaank Vattikuti,Shaun Purcell,James J. Lee +5 more
TL;DR: PLINK as discussed by the authors is a C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics, which has been widely used in the literature.
3.5K
A new multipoint method for genome-wide association studies by imputation of genotypes
TL;DR: This work proposes a coherent analysis framework that treats the genome-wide association problem as one involving missing or uncertain genotypes, and proposes a model-based imputation method for inferring genotypes at observed or unobserved SNPs, leading to improved power over existing methods for multipoint association mapping.
2.9K