Top 298 Genome Research papers published in 2003

Large-Scale Discovery of Induced Point Mutations With High-Throughput TILLING

[...]

Bradley J. Till¹, Steven H. Reynolds, Elizabeth A. Greene, Christine A. Codomo, Linda C. Enns, Jessica Johnson, Christopher R. Burtner, Anthony R. Odden, Kim Young, Nicholas E. Taylor, Jorja G. Henikoff, Luca Comai, Steven Henikoff - Show less +9 more•Institutions (1)

Fred Hutchinson Cancer Research Center¹

01 Mar 2003-Genome Research

TL;DR: The goal is to rapidly deliver allelic series of ethylmethanesulfonate-induced mutations in target 1-kb loci requested by the international research community.

...read moreread less

Abstract: TILLING (Targeting Induced Local Lesions in Genomes) is a general reverse-genetic strategy that provides an allelic series of induced point mutations in genes of interest High-throughput TILLING allows the rapid and low-cost discovery of induced point mutations in populations of chemically mutagenized individuals As chemical mutagenesis is widely applicable and mutation detection for TILLING is dependent only on sufficient yield of PCR products, TILLING can be applied to most organisms We have developed TILLING as a service to the Arabidopsis community known as the Arabidopsis TILLING Project (ATP) Our goal is to rapidly deliver allelic series of ethylmethanesulfonate-induced mutations in target 1-kb loci requested by the international research community In the first year of public operation, ATP has discovered, sequenced, and delivered >1000 mutations in >100 genes ordered by Arabidopsis researchers The tools and methodologies described here can be adapted to create similar facilities for other organisms

...read moreread less

Journal Article•10.1101/GR.1272403•

Decay Rates of Human mRNAs: Correlation With Functional Characteristics and Sequence Attributes

[...]

Edward Yang¹, Erik van Nimwegen, Mihaela Zavolan, Nikolaus Rajewsky, Mark D Schroeder, Marcelo O. Magnasco, James E. Darnell - Show less +3 more•Institutions (1)

Rockefeller University¹

01 Aug 2003-Genome Research

TL;DR: This work measures mRNA decay rates in two human cell lines with high-density oligonucleotide arrays and investigates the dependence of decay rates on sequence composition, that is, the presence or absence of short mRNA motifs in various regions of the mRNA transcript.

...read moreread less

Abstract: Although mRNA decay rates are a key determinant of the steady-state concentration for any given mRNA species, relatively little is known, on a population level, about what factors influence turnover rates and how these rates are integrated into cellular decisions. We decided to measure mRNA decay rates in two human cell lines with high-density oligonucleotide arrays that enable the measurement of decay rates simultaneously for thousands of mRNA species. Using existing annotation and the Gene Ontology hierarchy of biological processes, we assign mRNAs to functional classes at various levels of resolution and compare the decay rate statistics between these classes. The results show statistically significant organizational principles in the variation of decay rates among functional classes. In particular, transcription factor mRNAs have increased average decay rates compared with other transcripts and are enriched in "fast-decaying" mRNAs with half-lives <2 h. In contrast, we find that mRNAs for biosynthetic proteins have decreased average decay rates and are deficient in fast-decaying mRNAs. Our analysis of data from a previously published study of Saccharomyces cerevisiae mRNA decay shows the same functional organization of decay rates, implying that it is a general organizational scheme for eukaryotes. Additionally, we investigated the dependence of decay rates on sequence composition, that is, the presence or absence of short mRNA motifs in various regions of the mRNA transcript. Our analysis recovers the positive correlation of mRNA decay with known AU-rich mRNA motifs, but we also uncover further short mRNA motifs that show statistically significant correlation with decay. However, we also note that none of these motifs are strong predictors of mRNA decay rate, indicating that the regulation of mRNA decay is more complex and may involve the cooperative binding of several RNA-binding proteins at different sites.

...read moreread less

Journal Article•10.1101/GR.789803•

AVID: A Global Alignment Program

[...]

Nicholas L. Bray¹, Inna Dubchak, Lior Pachter•Institutions (1)

Lawrence Berkeley National Laboratory¹

01 Jan 2003-Genome Research

TL;DR: A new global alignment method called AVID is described, designed to be fast, memory efficient, and practical for sequence alignments of large genomic regions up to megabases long, and a format is established for the representation of alignments and methods for their comparison.

...read moreread less

Abstract: In this paper we describe a new global alignment method called AVID. The method is designed to be fast, memory efficient, and practical for sequence alignments of large genomic regions up to megabases long. We present numerous applications of the method, ranging from the comparison of assemblies to alignment of large syntenic genomic regions and whole genome human/mouse alignments. We have also performed a quantitative comparison of AVID with other popular alignment tools. To this end, we have established a format for the representation of alignments and methods for their comparison. These formats and methods should be useful for future studies. The tools we have developed for the alignment comparisons, as well as the AVID program, are publicly available. See Web Site References section for AVID Web address and Web addresses for other programs discussed in this paper.

...read moreread less

Journal Article•10.1101/GR.1196503•

Fast Evaluation of Fluctuations in Biochemical Networks With the Linear Noise Approximation

[...]

Johan Elf¹, Måns Ehrenberg•Institutions (1)

Uppsala University¹

01 Nov 2003-Genome Research

TL;DR: The method complements bifurcation studies of the system's parameter dependence by providing estimates of sizes, correlations, and time scales of stochastic fluctuations by suitable variable changes and elimination of fast variables.

...read moreread less

Abstract: Biochemical networks in single cells can display large fluctuations in molecule numbers, making mesoscopic approaches necessary for correct system descriptions. We present a general method that allows rapid characterization of the stochastic properties of intracellular networks. The starting point is a macroscopic description that identifies the system's elementary reactions in terms of rate laws and stoichiometries. From this formulation follows directly the stationary solution of the linear noise approximation (LNA) of the Master equation for all the components in the network. The method complements bifurcation studies of the system's parameter dependence by providing estimates of sizes, correlations, and time scales of stochastic fluctuations. We describe how the LNA can give precise system descriptions also near macroscopic instabilities by suitable variable changes and elimination of fast variables.

...read moreread less

Journal Article•10.1101/GR.1006603•

Allelic Variation in Gene Expression Is Common in the Human Genome

[...]

H. Shuen Lo, Zhining Wang¹, Ying Hu¹, Howard H. Yang¹, Sheryl Gere¹, Kenneth H. Buetow¹, Maxwell P. Lee¹ - Show less +3 more•Institutions (1)

National Institutes of Health¹

01 Aug 2003-Genome Research

TL;DR: It is demonstrated that variation of gene expression between alleles is common, and this variation may contribute to human variability, as shown by real-time quantitative PCR experiments.

...read moreread less

Abstract: Variations in gene sequence and expression underlie much of human variability. Despite the known biological roles of differential allelic gene expression resulting from X-chromosome inactivation and genomic imprinting, a large-scale analysis of allelic gene expression in human is lacking. We examined allele-specific gene expression of 1063 transcribed single-nucleotide polymorphisms (SNPs) by using Affymetrix HuSNP oligo arrays. Among the 602 genes that were heterozygous and expressed in kidney or liver tissues from seven individuals, 326 (54%) showed preferential expression of one allele in at least one individual, and 170 of those showed greater than fourfold difference between the two alleles. The allelic variation has been confirmed by real-time quantitative PCR experiments. Some of these 170 genes are known to be imprinted, such as SNRPN, IPW, HTR2A, and PEG3. Most of the differentially expressed genes are not in known imprinting domains but instead are distributed throughout the genome. Our studies demonstrate that variation of gene expression between alleles is common, and this variation may contribute to human variability.

...read moreread less

Journal Article•10.1101/GR.1725103•

[...]

01 Jul 2003-Genome Research

TL;DR: Comparative genomics revealed trends in amino acid and tRNA composition, and structural features of proteins from cold-adapted Archaea, and indicated that GC content is the major factor influencing tRNA stability in hyperthermophiles, but not in the psychrophiles, mesophiles or moderate thermophiles

...read moreread less

Abstract: We generated draft genome sequences for two cold-adapted Archaea, Methanogenium frigidum and Methanococcoides burtonii, to identify genotypic characteristics that distinguish them from Archaea with a higher optimal growth temperature (OGT). Comparative genomics revealed trends in amino acid and tRNA composition, and structural features of proteins. Proteins from the cold-adapted Archaea are characterized by a higher content of non-charged polar amino acids, particularly Gln and Thr and a lower content of hydrophobic amino acids, particularly Leu. Sequence data from nine methanogen genomes (OGT 15-98 C) was used to generate 1 111 modeled protein structures. Analysis of the models from the cold-adapted Archaea showed a strong tendency in the solvent accessible area for more Gln, Thr an hydrophobic residues and fewer charged residues. A cold shock domain (CSD) protein (CspA homolog) was identified in M. frigidum, two hypothetical proteins with CSD-folds in M. burtonii, and a unique winged helix DNA-binding domain protein in M. burtonii. This suggests that these types of nucleic acid binding proteins have a critical role in cold-adapted Archaea. Structural analysis of tRNA sequences from the Archaea indicated that GC content is the major factor influencing tRNA stability in hyperthermophiles, but not in the psychrophiles, mesophiles or moderate thermophiles. Below an OGT of 60 C, the GC content in tRNA was largely unchanged, indicating that any requirement for flexibility of tRNA in psychrophiles is mediated by other means. This is the first time that comparisons have been performed with genome data from Archaea spanning the growth temperature extremes from psychrophiles to hyperthermophiles.

...read moreread less

Journal Article•10.1101/GR.982903•

Antisense Transcripts With FANTOM2 Clone Set and Their Implications for Gene Regulation

[...]

Hidenori Kiyosawa, Itaru Yamanaka, Naoki Osato, Shinji Kondo, GSLMembers, Yoshihide Hayashizaki¹ - Show less +2 more•Institutions (1)

Yokohama City University¹

01 Jun 2003-Genome Research

TL;DR: Using the FANTOM2 mouse cDNA set, public mRNA data, and mouse genome sequence data, the analysis greatly expands the number of known examples of sense-antisense transcript and nonantisense bidirectional transcription pairs in mammals and implies that the regulation of gene expression by antisense transcripts is more common that previously recognized.

...read moreread less

Abstract: We have used the FANTOM2 mouse cDNA set (60,770 clones), public mRNA data, and mouse genome sequence data to identify 2481 pairs of sense-antisense transcripts and 899 further pairs of nonantisense bidirectional transcription based upon genomic mapping. The analysis greatly expands the number of known examples of sense-antisense transcript and nonantisense bidirectional transcription pairs in mammals. The FANTOM2 cDNA set appears to contain substantially large numbers of noncoding transcripts suitable for antisense transcript analysis. The average proportion of loci encoding sense-antisense transcript and nonantisense bidirectional transcription pairs on autosomes was 15.1 and 5.4%, respectively. Those on the X chromosome were 6.3 and 4.2%, respectively. Sense-antisense transcript pairs, rather than nonantisense bidirectional transcription pairs, may be less prevalent on the X chromosome, possibly due to X chromosome inactivation. Sense and antisense transcripts tended to be isolated from the same libraries, where nonantisense bidirectional transcription pairs were not apparently coregulated. The existence of large numbers of natural antisense transcripts implies that the regulation of gene expression by antisense transcripts is more common that previously recognized. The viewer showing mapping patterns of sense-antisense transcript pairs and nonantisense bidirectional transcription pairs on the genome and other related statistical data is available on our Web site.

...read moreread less

Journal Article•10.1101/GR.1271603•

A Biophysical Approach to Transcription Factor Binding Site Discovery

[...]

Marko Djordjevic¹, Anirvan M. Sengupta, Boris I. Shraiman•Institutions (1)

Columbia University¹

01 Nov 2003-Genome Research

TL;DR: A novel bioinformatics method that bases classification of potential binding sites explicitly on the estimate of sequence-specific binding energy of a given transcription factor, resulting in a significant improvement in the number of expected false positives.

...read moreread less

Abstract: Identification of transcription factor binding sites within regulatory segments of genomic DNA is an important step toward understanding of the regulatory circuits that control expression of genes. Here, we describe a novel bioinformatics method that bases classification of potential binding sites explicitly on the estimate of sequence-specific binding energy of a given transcription factor. The method also estimates the chemical potential of the factor that defines the threshold of binding. In contrast with the widely used information-theoretic weight matrix method, the new approach correctly describes saturation in the transcription factor/DNA binding probability. This results in a significant improvement in the number of expected false positives, particularly in the ubiquitous case of low-specificity factors. In the strong binding limit, the algorithm is related to the "support vector machine" approach to pattern recognition. The new method is used to identify likely genomic binding sites for the E. coli transcription factors collected in the DPInteract database. In addition, for CRP (a global regulatory factor), the likely regulatory modality (i.e., repressor or activator) of predicted binding sites is determined.

...read moreread less

...

Expand