TL;DR: A suite of computational tools that incorporate state-of-the-art statistical techniques for the analysis of DNAm data are described that include methods for preprocessing, quality assessment and detection of differentially methylated regions from the kilobase to the megabase scale.
Abstract: Motivation The recently released Infinium HumanMethylation450 array (the '450k' array) provides a high-throughput assay to quantify DNA methylation (DNAm) at ∼450 000 loci across a range of genomic features. Although less comprehensive than high-throughput sequencing-based techniques, this product is more cost-effective and promises to be the most widely used DNAm high-throughput measurement technology over the next several years. Results Here we describe a suite of computational tools that incorporate state-of-the-art statistical techniques for the analysis of DNAm data. The software is structured to easily adapt to future versions of the technology. We include methods for preprocessing, quality assessment and detection of differentially methylated regions from the kilobase to the megabase scale. We show how our software provides a powerful and flexible development platform for future methods. We also illustrate how our methods empower the technology to make discoveries previously thought to be possible only with sequencing-based methods. Availability and implementation http://bioconductor.org/packages/release/bioc/html/minfi.html. Contact khansen@jhsph.edu; rafa@jimmy.harvard.edu Supplementary information Supplementary data are available at Bioinformatics online.
TL;DR: Methylation changes in cancer are at sites that vary normally in tissue differentiation, consistent with the epigenetic progenitor model of cancer, which proposes that epigenetic alterations affecting tissue-specific differentiation are the predominant mechanism by which epigenetic changes cause cancer.
Abstract: For the past 25 years, it has been known that alterations in DNA methylation (DNAm) occur in cancer, including hypomethylation of oncogenes and hypermethylation of tumor suppressor genes. However, most studies of cancer methylation have assumed that functionally important DNAm will occur in promoters, and that most DNAm changes in cancer occur in CpG islands. Here we show that most methylation alterations in colon cancer occur not in promoters, and also not in CpG islands, but in sequences up to 2 kb distant, which we term 'CpG island shores'. CpG island shore methylation was strongly related to gene expression, and it was highly conserved in mouse, discriminating tissue types regardless of species of origin. There was a notable overlap (45-65%) of the locations of colon cancer-related methylation changes with those that distinguished normal tissues, with hypermethylation enriched closer to the associated CpG islands, and hypomethylation enriched further from the associated CpG island and resembling that of noncolon normal tissues. Thus, methylation changes in cancer are at sites that vary normally in tissue differentiation, consistent with the epigenetic progenitor model of cancer, which proposes that epigenetic alterations affecting tissue-specific differentiation are the predominant mechanism by which epigenetic changes cause cancer.
TL;DR: Using large scale validation data from thousands of individuals, it is demonstrated that DNAm GrimAge stands out among existing epigenetic clocks in terms of its predictive ability for time-to-death, and a novel measure of epigenetic age acceleration, AgeAccelGrim.
Abstract: It was unknown whether plasma protein levels can be estimated based on DNA methylation (DNAm) levels, and if so, how the resulting surrogates can be consolidated into a powerful predictor of lifespan. We present here, seven DNAm-based estimators of plasma proteins including those of plasminogen activator inhibitor 1 (PAI-1) and growth differentiation factor 15. The resulting predictor of lifespan, DNAm GrimAge (in units of years), is a composite biomarker based on the seven DNAm surrogates and a DNAm-based estimator of smoking pack-years. Adjusting DNAm GrimAge for chronological age generated novel measure of epigenetic age acceleration, AgeAccelGrim.Using large scale validation data from thousands of individuals, we demonstrate that DNAm GrimAge stands out among existing epigenetic clocks in terms of its predictive ability for time-to-death (Cox regression P=2.0E-75), time-to-coronary heart disease (Cox P=6.2E-24), time-to-cancer (P= 1.3E-12), its strong relationship with computed tomography data for fatty liver/excess visceral fat, and age-at-menopause (P=1.6E-12). AgeAccelGrim is strongly associated with a host of age-related conditions including comorbidity count (P=3.45E-17). Similarly, age-adjusted DNAm PAI-1 levels are associated with lifespan (P=5.4E-28), comorbidity count (P= 7.3E-56) and type 2 diabetes (P=2.0E-26). These DNAm-based biomarkers show the expected relationship with lifestyle factors including healthy diet and educational attainment.Overall, these epigenetic biomarkers are expected to find many applications including human anti-aging studies.
TL;DR: It is shown that an exogenously expressed algal nucleotide triphosphate transporter efficiently imports the triph phosphates of both d5SICS and dNaM into Escherichia coli, and that the endogenous replication machinery uses them to accurately replicate a plasmid containing d5 SICS–dNaM.
Abstract: Organisms are defined by the information encoded in their genomes, and since the origin of life this information has been encoded using a two-base-pair genetic alphabet (A-T and G-C). In vitro, the alphabet has been expanded to include several unnatural base pairs (UBPs). We have developed a class of UBPs formed between nucleotides bearing hydrophobic nucleobases, exemplified by the pair formed between d5SICS and dNaM (d5SICS-dNaM), which is efficiently PCR-amplified and transcribed in vitro, and whose unique mechanism of replication has been characterized. However, expansion of an organism's genetic alphabet presents new and unprecedented challenges: the unnatural nucleoside triphosphates must be available inside the cell; endogenous polymerases must be able to use the unnatural triphosphates to faithfully replicate DNA containing the UBP within the complex cellular milieu; and finally, the UBP must be stable in the presence of pathways that maintain the integrity of DNA. Here we show that an exogenously expressed algal nucleotide triphosphate transporter efficiently imports the triphosphates of both d5SICS and dNaM (d5SICSTP and dNaMTP) into Escherichia coli, and that the endogenous replication machinery uses them to accurately replicate a plasmid containing d5SICS-dNaM. Neither the presence of the unnatural triphosphates nor the replication of the UBP introduces a notable growth burden. Lastly, we find that the UBP is not efficiently excised by DNA repair pathways. Thus, the resulting bacterium is the first organism to propagate stably an expanded genetic alphabet.
TL;DR: A novel and highly robust DNAm age estimator for human fibroblasts, keratinocytes, buccal cells, endothelial cells, lymphoblastoid cells, skin, blood, and saliva samples that uncovered an epigenetic age acceleration with a magnitude below the sensitivity levels of other DNAm-based biomarkers.
Abstract: DNA methylation (DNAm)-based biomarkers of aging have been developed for many tissues and organs However, these biomarkers have sub-optimal accuracy in fibroblasts and other cell types used in ex vivo studies To address this challenge, we developed a novel and highly robust DNAm age estimator (based on 391 CpGs) for human fibroblasts, keratinocytes, buccal cells, endothelial cells, lymphoblastoid cells, skin, blood, and saliva samples High age correlations can also be observed in sorted neurons, glia, brain, liver, and even bone samples Gestational age correlates with DNAm age in cord blood When used on fibroblasts from Hutchinson Gilford Progeria Syndrome patients, this age estimator (referred to as the skin & blood clock) uncovered an epigenetic age acceleration with a magnitude that is below the sensitivity levels of other DNAm-based biomarkers Furthermore, this highly sensitive age estimator accurately tracked the dynamic aging of cells cultured ex vivo and revealed that their proliferation is accompanied by a steady increase in epigenetic age The skin & blood clock predicts lifespan and it relates to many age-related conditions Overall, this biomarker is expected to become useful for forensic applications (eg blood or buccal swabs) and for a quantitative ex vivo human cell aging assay