TL;DR: High contiguity human genomes can be assembled de novo in 6 h using nanopore long-read sequences and the Shasta toolkit, and the assembly performance is compared to existing methods for diploid, haploid and trio-binned human samples and report superior accuracy and speed.
Abstract: De novo assembly of a human genome using nanopore long-read sequences has been reported, but it used more than 150,000 CPU hours and weeks of wall-clock time. To enable rapid human genome assembly, we present Shasta, a de novo long-read assembler, and polishing algorithms named MarginPolish and HELEN. Using a single PromethION nanopore sequencer and our toolkit, we assembled 11 highly contiguous human genomes de novo in 9 d. We achieved roughly 63× coverage, 42-kb read N50 values and 6.5× coverage in reads >100 kb using three flow cells per sample. Shasta produced a complete haploid human genome assembly in under 6 h on a single commercial compute node. MarginPolish and HELEN polished haploid assemblies to more than 99.9% identity (Phred quality score QV = 30) with nanopore reads alone. Addition of proximity-ligation sequencing enabled near chromosome-level scaffolds for all 11 genomes. We compare our assembly performance to existing methods for diploid, haploid and trio-binned human samples and report superior accuracy and speed. High contiguity human genomes can be assembled de novo in 6 h using nanopore long-read sequences and the Shasta toolkit.
TL;DR: Semiconductor high-throughput sequencing, represented by Ion Torrent PGM/Proton, proves to be feasible in the noninvasive prenatal diagnosis of fetal aneuploidies and is recommended to researchers or clinicians to use the appropriate chip with barcoding technology on the basis of the sample number.
Abstract: Semiconductor high-throughput sequencing, represented by Ion Torrent PGM/Proton, proves to be feasible in the noninvasive prenatal diagnosis of fetal aneuploidies. It is commendable that, with less data and relevant cost also, an accurate result can be achieved owing to the high sensitivity and specificity of such kind of technology. We conducted a comparative analysis of the performance of four different Ion chips in detecting fetal chromosomal aneuploidies. Eight maternal plasma DNA samples, including four pregnancies with normal fetuses and four with trisomy 21 fetuses, were sequenced on Ion Torrent 314/316/318/PI chips, respectively. Results such as read mapped ratio, correlation coefficient and phred quality score were calculated and parallelly compared. All samples were correctly classified even with low-throughput chip, and, among the four chips, the 316 chip had the highest read mapped ratio, correlation coefficient, mean read length and phred quality score. All chips were well consistent with each other. Our results showed that all Ion chips are applicable in noninvasive prenatal fetal aneuploidy diagnosis. We recommend researchers or clinicians to use the appropriate chip with barcoding technology on the basis of the sample number.
TL;DR: The results from NGS identified 12 novel missense mutations in the following genes: NOTCH1, ATM, STK36, MAGI1, DST, RECQL4, NUMA1, THBS1, MYH11, MALT1, SMARCA4 and CDH20, which had not been reported in previous database.
Abstract: Choroid plexus papilloma (CPP) is a rare benign tumor of the central nervous system that is usually confined to the cerebral ventricles. According to the World Health Organization, CPP corresponds to a grade I atypical CPP (a-CPP); however, it can become more aggressive and reach grade II, which can rarely undergo malignant transformation into a choroid plexus carcinoma (grade III). To the best of our knowledge, identification of these tumors mutations by next generation DNA sequencing (NGS) has not been yet reported. In the present study, NGS analysis of an a-CPP case was performed. Data were analyzed using Advaita Bioinformatics i-VariantGuide and Ion Reporter 5.6 programs. The results from NGS identified 12 novel missense mutations in the following genes: NOTCH1, ATM, STK36, MAGI1, DST, RECQL4, NUMA1, THBS1, MYH11, MALT1, SMARCA4 and CDH20. The PolyPhen score of six variants viz., DST, RECQL4, NUMA1, THBS1, MYHI1 and SMARCA4 were high, which suggested these variants represents pathogenic variants. Two novel insertions that caused frameshift were also found. Furthermore, two novel nonsense mutations and 14 novel intronic variants were identified in this tumor. The novel missense mutation detected in ATM gene was situated in c.5808A>T; p. (Leu1936Phe) in exon 39, and a known ATM mutation was in c.5948A>G; p. (Asn1983Ser). These novel mutations had not been reported in previous database. Subsequently, the quality statistics of these variants, including allele coverage, allele ratio, P-value, Phred quality score, sequencing coverage, PolyPhen score and alleles frequency was performed. For all variants, P-value was highly significant and the Phred quality score was high. In addition, the results from sequencing coverage demonstrated that 97.02% reads were on target and that 97.88% amplicons had at least 500 reads. These findings may serve at determining new strategies to distinguish the types of choroid plexus tumor, and at developing novel targeted therapies. Development of NGS technologies in the Kingdom of Saudi Arabia may be used in molecular pathology laboratories.
TL;DR: The aim was to characterize colon Microbiota in CRC patients regarding location, gene markers and outcome and implement the linear model (GLM) was implemented in the DESeq2 R kit.
Abstract: 3615Background: Location and somatic gene signature of CRCs may impact prognosis and therapy response. The aim was to characterize colon Microbiota in CRC patients regarding location, gene markers and outcome. Methods: Patients (N = 173) signed consent for whole metagenome (shot gun sequencing on Illumina HiSeq2500) analysis of stool DNA: 72 CRC (53 sporadic-S, 19 Lynch-L), 87 asymptomatic subjects (normal colonoscopy), 14 first degree healthy relatives from Lynch families. "MOCAT" pipeline was used, library sorted (Phred quality score i20 Alientrimmer v0.4.0) after exclusion of < 35 nt, human genes or phage sequences. Quality sequences were aligned (REFMG.V13) and most abundant genes constructed (MBMA program v0.1). The Shaman program (shaman.c3bi.pasteur.fr) was used. The number of bacteria was estimated (REFMG program). The linear model (GLM) was implemented in the DESeq2 R kit. Dfferences between Control (N = 87) and CRCs (N = 69), between L (N = 19) and S CRCs(N = 50), and between LCRC (N = 19) and H...
TL;DR: The Phred quality score can measure the sequence quality, and quality scores are normally stored together with the nucleotide sequence in the widely accepted FASTQ format, and the range of scores will depend on the technology and the base caller used.