TL;DR: Combining information about genomic segmental duplications, gene family phylogenies, and gene positions provides a method to evaluate contributions of tandem duplication and segmental genome duplication in the generation and maintenance of gene families.
Abstract: Background
Most genes in Arabidopsis thaliana are members of gene families. How do the members of gene families arise, and how are gene family copy numbers maintained? Some gene families may evolve primarily through tandem duplication and high rates of birth and death in clusters, and others through infrequent polyploidy or large-scale segmental duplications and subsequent losses.
TL;DR: It is shown that the yeast Saccharomyces cerevisiae arose from ancient whole-genome duplication, by sequencing and analysing Kluyveromyces waltii, a related yeast species that diverged before the duplication.
Abstract: Whole-genome duplication followed by massive gene loss and specialization has long been postulated as a powerful mechanism of evolutionary innovation. Recently, it has become possible to test this notion by searching complete genome sequence for signs of ancient duplication. Here, we show that the yeast Saccharomyces cerevisiae arose from ancient whole-genome duplication, by sequencing and analysing Kluyveromyces waltii, a related yeast species that diverged before the duplication. The two genomes are related by a 1:2 mapping, with each region of K. waltii corresponding to two regions of S. cerevisiae, as expected for whole-genome duplication. This resolves the long-standing controversy on the ancestry of the yeast genome, and makes it possible to study the fate of duplicated genes directly. Strikingly, 95% of cases of accelerated evolution involve only one member of a gene pair, providing strong support for a specific model of evolution, and allowing us to distinguish ancestral and derived functions.
TL;DR: The unusual age profile of tandem gene duplications in Arabidopsis indicates that other scenarios, such as variation in the rate at which duplicated genes are deleted, must also be considered.
Abstract: It is often anticipated that many of today’s diploid plant species are in fact paleopolyploids. Given that an ancient large-scale duplication will result in an excess of relatively old duplicated genes with similar ages, we analyzed the timing of duplication of pairs of paralogous genes in 14 model plant species. Using EST contigs (unigenes), we identified pairs of paralogous genes in each species and used the level of synonymous nucleotide substitution to estimate the relative ages of gene duplication. For nine of the investigated species (wheat [Triticum aestivum], maize [Zea mays], tetraploid cotton [Gossypium hirsutum], diploid cotton [G. arboretum], tomato [Lycopersicon esculentum], potato [Solanum tuberosum], soybean [Glycine max], barrel medic [Medicago truncatula], and Arabidopsis thaliana), the age distributions of duplicated genes contain peaks corresponding to short evolutionary periods during which large numbers of duplicated genes were accumulated. Large-scale duplications (polyploidy or aneuploidy) are strongly suspected to be the cause of these temporal peaks of gene duplication. However, the unusual age profile of tandem gene duplications in Arabidopsis indicates that other scenarios, such as variation in the rate at which duplicated genes are deleted, must also be considered.
TL;DR: This study is the first direct evidence that PIK3CA is an oncogene in ovarian cancer and greatly extends recent findings in breast cancer.
Abstract: Phosphatidylinositol 3'-kinases are lipid kinases with important roles in neoplasia. Recently, a very high frequency of somatic mutations in PIK3CA has been reported among a large series of colorectal cancers. However, the relevance of PIK3CA mutation in other cancer types remains unclear because of the limited number of tumors investigated. We have screened a total of 284 primary human tumors for mutations in all coding exons of PIK3CA using a combination of single stranded conformational polymorphism and denaturing high-performance liquid chromatography analysis. Among 70 primary breast cancers, 40% (28 of 70) harbored mutations in PIK3CA, making it the most common mutation described to date in this cancer type. Mutations were not associated with histologic subtype, estrogen receptor status, grade or presence of tumor in lymph nodes. Among the primary epithelial ovarian cancers only 11 of 167 (6.6%) contain somatic mutations, but there was a clear histologic subtype bias in their distribution. Only 2 of 88 (2.3%) of serous carcinomas had PIK3CA mutations compared with 8 of 40 (20.0%) endometrioid and clear cell cancers, which was highly significant (P = 0.001). In contrast, PIK3CA gene amplification (>7-fold) was common among all histologic subtypes (24.5%) and was inversely associated with the presence of mutations. Overall, PIK3CA mutation or gene amplification was detected in 30.5% of all ovarian cancers and 45% of the endometrioid and clear cell subtypes. Our study is the first direct evidence that PIK3CA is an oncogene in ovarian cancer and greatly extends recent findings in breast cancer.
TL;DR: Results suggest that the C13orf25 gene is the most likely candidate gene for the 13q31-q32 amplicon found in hematopoietic malignancies.
Abstract: The amplification at 13q31-q32 has been reported in not only hematopoietic malignancies but also in other solid tumors. We identified previously frequent amplification of chromosomal band 13q31-q32 in 70 cases of diffuse large B-cell lymphoma patients by conventional comparative genomic hybridization analysis. In an attempt to identify a candidate gene within this region, we used array comparative genomic hybridization and fluorescent in situ hybridization to map the 13q31-q32 amplicon. We then screened the 65 expressed sequence tags and Glypican 5 (GPC5) by reverse transcription-PCR and Northern blotting. As a result, we identified a novel gene, designated Chromosome 13 open reading frame 25 (C13orf25), which was overexpressed in B-cell lymphoma cell lines and diffuse large B-cell lymphoma patients with 13q31-q32 amplifications. However, GPC5, which has been reported to be a target gene for 13q31-q32 amplification, was truncated in one cell line, Rec1, possessing the amplification, and its expression in various cell lines with amplification at 13q31-q32 was not significantly different from that in other cell lines without amplification, suggesting that GPC5 is not likely to be the candidate gene. Additional analysis identified two major transcripts in the C13orf25 gene. The two transcripts A and B predicted open reading frames of 32 and 70-amino acid polypeptides, respectively. The former has been reported as bA121J7.2, which is conserved among species. Transcript-B also contained seven mature microRNAs in its untranslated region. These results suggest that the C13orf25 gene is the most likely candidate gene for the 13q31-q32 amplicon found in hematopoietic malignancies.
TL;DR: Evidence is provided that genes encoding miRNAs in plants originated by inverted duplication of target gene sequences, and a model for miRNA evolution is proposed that suggests a mechanism for de novo generation of new miRNA genes with unique target specificities.
Abstract: MicroRNAs (miRNAs) in plants and animals function as post-transcriptional regulators of target genes, many of which are involved in multicellular development. miRNAs guide effector complexes to target mRNAs through base-pair complementarity, facilitating site-specific cleavage or translational repression. Biogenesis of miRNAs involves nucleolytic processing of a precursor transcript with extensive foldback structure. Here, we provide evidence that genes encoding miRNAs in plants originated by inverted duplication of target gene sequences. Several recently evolved genes encoding miRNAs in Arabidopsis thaliana and other small RNA‐generating loci possess the hallmarks of inverted duplication events that formed the arms on each side of their respective foldback precursors. We propose a model for miRNA evolution that suggests a mechanism for de novo generation of new miRNA genes with unique target specificities.
TL;DR: Individual gene trees for these three genes and a concatenated dataset support the hypothesis that the fish-specific genome duplication event took place after the split of the Acipenseriformes and the Semionotiformes from the lineage leading to teleost fish but before the divergence of Osteoglossiformes.
Abstract: For many genes, ray-finned fish (Actin- opterygii) have two paralogous copies, where only one ortholog is present in tetrapods. The discovery of an additional, almost-complete set of Hox clusters in teleosts (zebrafish, pufferfish, medaka, and cichlid) but not in basal actinopterygian lineages (Polypterus) led to the formulation of the fish-specific genome duplication hypothesis. The phylogenetic timing of this genome duplication during the evolution of ray- finned fish is unknown, since only a few species of basal fish lineages have been investigated so far. In this study, three nuclear genes (fzd8, sox11, tyrosin- ase) were sequenced from sturgeons (Acipenseri- formes), gars (Semionotiformes), bony tongues (Osteoglossomorpha), and a tenpounder (Elopo- morpha). For these three genes, two copies have been described previously teleosts (e.g., zebrafish, puffer- fish), but only one orthologous copy is found in tetrapods. Individual gene trees for these three genes and a concatenated dataset support the hypothesis that the fish-specific genome duplication event took place after the split of the Acipenseriformes and the Semionotiformes from the lineage leading to teleost fish but before the divergence of Osteoglossiformes. If these three genes were duplicated during the pro- posed fish-specific genome duplication event, then this event separates the species-poor early-branching lineages from the species-rich teleost lineage. The additional number of genes resulting from this event might have facilitated the evolutionary radiation and the phenotypic diversification of the teleost fish.
TL;DR: The duplication of genes and their subsequent diversification has had a key role in evolution and a range of fates can befall a duplicated gene.
Abstract: The duplication of genes and their subsequent diversification has had a key role in evolution. A range of fates can befall a duplicated gene.
TL;DR: Members of the GRAS gene family encode transcriptional regulators that have diverse functions in plant growth and development such as gibberellin signal transduction, root radial patterning, axillary meristem formation, phytochrome A signalTransduction, and gametogenesis.
Abstract: Members of the GRAS gene family encode transcriptional regulators that have diverse functions in plant growth and development such as gibberellin signal transduction, root radial patterning, axillary meristem formation, phytochrome A signal transduction, and gametogenesis. Bioinformatic analysis identified 57 and 32 GRAS genes in rice and Arabidopsis, respectively. Here, we provide a complete overview of this gene family, describing the gene structure, gene expression, chromosome localization, protein motif organization, phylogenetic analysis, and comparative analysis between rice and Arabidopsis. Phylogenetic analysis divides the GRAS gene family into eight subfamilies, which have distinct conserved domains and functions. Both genome/segmental duplication and tandem duplication contributed to the expansion of the GRAS gene family in the rice and Arabidopsis genomes. The existence of GRAS-like genes in bryophytes suggests that GRAS is an ancient family of transcription factors, which arose before the appearance of land plants over 400 million years ago.
TL;DR: Notably, differential gene loss played an important role in the evolution of different nuclear receptor sets in bilaterian lineages and was also shaped by periods of gene duplication in vertebrates, as well as a lineage-specific duplication burst in nematodes.
Abstract: Bilaterian animals are notably characterized by complex endocrine systems. The receptors for many steroids, retinoids, and other hormones belong to the superfamily of nuclear receptors, which are transcription factors regulating many aspects of development and homeostasis. Despite a diversity of regulatory mechanisms and physiological roles, nuclear receptors share a common protein organization. To obtain the broad picture of bilaterian nuclear hormone receptor evolution, we have characterized the complete set of nuclear receptor genes from nine animal genome sequences and analyzed it in a phylogenetic framework. In addition, expressed sequence tags from key lineages with no available genome sequence were also searched. This allows us to date the evolutionary events that led from an ancestral nuclear receptor gene, in an early metazoan, to present day diversity. We show that there were ;25 nuclear receptor genes in Urbilateria, the ancestor of bilaterians, at which point the fundamental diversity of the subfamily was already established. Surprisingly, differential gene loss played an important role in the evolution of different nuclear receptor sets in bilaterian lineages. The nuclear receptor distribution was also shaped by periods of gene duplication, essentially in vertebrates, as well as a lineage-specific duplication burst in nematodes. Our results imply that the genes for major receptors such as steroid receptors or thyroid hormone receptors were present in Urbilateria.
TL;DR: In this article, the fidelity and genome representation of f29 polymerase-based genome amplification (f29MDA) using direct sequencing and high density oligonucleotide arrays probing >10 000 SNP alleles were tested.
Abstract: Major efforts are underway to systematically define the somatic and germline genetic variations causally associated with disease. Genome-wide genetic analysis of actual clinical samples is, however, limited by the paucity of genomic DNA available. Here we have tested the fidelity and genome representation of f29 polymerase-based genome amplification (f29MDA) using direct sequencing and high density oligonucleotide arrays probing >10 000 SNP alleles. Genome representation was comprehensive and estimated to be 99.82% complete, although six regions encompassing a maximum of 5.62 Mb failed to amplify. There was no degradation in the accuracy of SNP genotyping and, in direct sequencing experiments sampling 500 000 bp, the estimated error rate (9.5 3 10 ‐6 ) was the same as in paired unamplified samples. The detection of cancer-associated loss of heterozygosity and copy number changes, including homozygous deletion and gene amplification, were similarly robust. These results suggest that f29MDA yields high fidelity, near-complete genome representation suitable for high resolution genetic analysis.
TL;DR: It is found that the extant Aux/IAA loci arose primarily through segmental duplication events, in sharp contrast to the ARF family and to the general pattern of gene family proliferation in Arabidopsis.
Abstract: The complete genomic sequence for Arabidopsis provides the opportunity to combine phylogenetic and genomic approaches to study the evolution of gene families in plants. The Aux/IAA and ARF gene families, consisting of 29 and 23 loci in Arabidopsis, respectively, encode proteins that interact to mediate auxin responses and regulate various aspects of plant morphological development. We developed scenarios for the genomic proliferation of the Aux/IAA and ARF families by combining phylogenetic analysis with information on the relationship between each locus and the previously identified duplicated genomic segments in Arabidopsis. This analysis shows that both gene families date back at least to the origin of land plants and that the major Aux/IAA and ARF lineages originated before the monocot-eudicot divergence. We found that the extant Aux/IAA loci arose primarily through segmental duplication events, in sharp contrast to the ARF family and to the general pattern of gene family proliferation in Arabidopsis. Possible explanations for the unusual mode of Aux/IAA duplication include evolutionary constraints imposed by complex interactions among proteins and pathways, or the presence of long-distance cis-regulatory sequences. The antiquity of the two gene families and the unusual mode of Aux/IAA diversification have a number of potential implications for understanding both the functional and evolutionary roles of these genes.
TL;DR: It is concluded that the chicken genome encodes only β-defensin sequences and that all mammalian defensins are evolved from a common β- defensin-like ancestor.
Abstract: Defensins comprise a large family of cationic antimicrobial peptides that are characterized by the presence of a conserved cysteine-rich defensin motif. Based on the spacing pattern of cysteines, these defensins are broadly divided into five groups, namely plant, invertebrate, α-, β-, and θ-defensins, with the last three groups being mostly found in mammalian species. However, the evolutionary relationships among these five groups of defensins remain controversial. Following a comprehensive screen, here we report that the chicken genome encodes a total of 13 different β-defensins but with no other groups of defensins being discovered. These chicken β-defensin genes, designated as Gallinacin 1–13, are clustered densely within a 86-Kb distance on the chromosome 3q3.5-q3.7. The deduced peptides vary from 63 to 104 amino acid residues in length sharing the characteristic defensin motif. Based on the tissue expression pattern, 13 β-defensin genes can be divided into two subgroups with Gallinacin 1–7 being predominantly expressed in bone marrow and the respiratory tract and the remaining genes being restricted to liver and the urogenital tract. Comparative analysis of the defensin clusters among chicken, mouse, and human suggested that vertebrate defensins have evolved from a single β-defensin-like gene, which has undergone rapid duplication, diversification, and translocation in various vertebrate lineages during evolution. We conclude that the chicken genome encodes only β-defensin sequences and that all mammalian defensins are evolved from a common β-defensin-like ancestor. The α-defensins arose from β-defensins by gene duplication, which may have occurred after the divergence of mammals from other vertebrates, and θ-defensins have arisen from α-defensins specific to the primate lineage. Further analysis of these defensins in different vertebrate lineages will shed light on the mechanisms of host defense and evolution of innate immunity.
TL;DR: The genes identified here likely represent a significant majority of the major gene copy number changes that have occurred over the past 15 million years of human and great ape evolution and are likely to underlie some of the key phenotypic characteristics that distinguish these species.
Abstract: Given that gene duplication is a major driving force of evolutionary change and the key mechanism underlying the emergence of new genes and biological processes, this study sought to use a novel genome-wide approach to identify genes that have undergone lineage-specific duplications or contractions among several hominoid lineages. Interspecies cDNA array-based comparative genomic hybridization was used to individually compare copy number variation for 39,711 cDNAs, representing 29,619 human genes, across five hominoid species, including human. We identified 1,005 genes, either as isolated genes or in clusters positionally biased toward rearrangement-prone genomic regions, that produced relative hybridization signals unique to one or more of the hominoid lineages. Measured as a function of the evolutionary age of each lineage, genes showing copy number expansions were most pronounced in human (134) and include a number of genes thought to be involved in the structure and function of the brain. This work represents, to our knowledge, the first genome-wide gene-based survey of gene duplication across hominoid species. The genes identified here likely represent a significant majority of the major gene copy number changes that have occurred over the past 15 million years of human and great ape evolution and are likely to underlie some of the key phenotypic characteristics that distinguish these species.
TL;DR: Plant MADS-box genes form a large gene family for transcription factors and are involved in various aspects of developmental processes, including flower development, and the higher rate of birth-and-death evolution in type I genes appeared partly due to a higher frequency of segmental gene duplication and weaker purifying selection.
Abstract: Plant MADS-box genes form a large gene family for transcription factors and are involved in various aspects of developmental processes, including flower development. They are known to be subject to birth-and-death evolution, but the detailed features of this mode of evolution remain unclear. To have a deeper insight into the evolutionary pattern of this gene family, we enumerated all available functional and nonfunctional (pseudogene) MADS-box genes from the Arabidopsis and rice genomes. Plant MADS-box genes can be classified into types I and II genes on the basis of phylogenetic analysis. Conducting extensive homology search and phylogenetic analysis, we found 64 presumed functional and 37 nonfunctional type I genes and 43 presumed functional and 4 nonfunctional type II genes in Arabidopsis. We also found 24 presumed functional and 6 nonfunctional type I genes and 47 presumed functional and 1 nonfunctional type II genes in rice. Our phylogenetic analysis indicated there were at least about four to eight type I genes and ≈15–20 type II genes in the most recent common ancestor of Arabidopsis and rice. It has also been suggested that type I genes have experienced a higher rate of birth-and-death evolution than type II genes in angiosperms. Furthermore, the higher rate of birth-and-death evolution in type I genes appeared partly due to a higher frequency of segmental gene duplication and weaker purifying selection in type I than in type II genes.
TL;DR: CD34 cells in monosomy 7 showed up-regulation of genes inducing leukemia transformation and tumorigenesis and apoptosis and down- regulation of genes controlling cell growth and differentiation, which imply distinct molecular mechanisms for monosome 7 and trisomy 8 MDS and implicate specific pathogenic pathways.
TL;DR: This analysis supports a nonrandom model of chromosomal evolution that implicates specific regions within the mammalian genome as having been predisposed to both recurrent small-scale duplication and large-scale evolutionary rearrangements.
Abstract: Chromosomal evolution is thought to occur through a random process of breakage and rearrangement that leads to karyotype differences and disruption of gene order. With the availability of both the human and mouse genomic sequences, detailed analysis of the sequence properties underlying these breakpoints is now possible. We report an abundance of primate-specific segmental duplications at the breakpoints of syntenic blocks in the human genome. Using conservative criteria, we find that 25% (122/461) of all breakpoints contain ≥ 10 kb of duplicated sequence. This association is highly significant (p < 0.0001) when compared to a simulated random-breakage model. The significance is robust under a variety of parameters, multiple sets of conserved synteny data, and for orthologous breakpoints between and within chromosomes. A comparison of mouse lineage-specific breakpoints since the divergence of rat and mouse showed a similar association with regions associated with segmental duplications in the primate genome. These results indicate that segmental duplications are associated with syntenic rearrangements, even when pericentromeric and subtelomeric regions are excluded. However, segmental duplications are not necessarily the cause of the rearrangements. Rather, our analysis supports a nonrandom model of chromosomal evolution that implicates specific regions within the mammalian genome as having been predisposed to both recurrent small-scale duplication and large-scale evolutionary rearrangements.
TL;DR: Protein sequence comparisons suggest that intramolecular duplications have played an important role in the evolution of this gene family and three new genes encoding these CHAC-similar proteins are characterized.
TL;DR: The challenge ahead is to uncover the relationship between the individual biochemical activities of homologous recombination proteins and their coordinated action in the context of the living cell.
TL;DR: Haplo(in)sufficiency of a gene and its propensity for duplication might have a common evolutionary basis, supporting the idea that gene dosage could be important for the initial fixation of duplications.
TL;DR: There is a remarkable but not perfect synteny among the clusters of the three mammalian species, and most remarkably, the human K3 gene, which causes Meesmann's corneal dystrophy when mutated, lacks a counterpart in the mouse genome.
TL;DR: From the organization of paralogs in the genome, it is concluded that duplicated genes in bacteria appear to have been mainly created by small-scale duplication events, such as tandem and operon duplications.
TL;DR: The impact of MDR1 polymorphisms on pharmacokinetics and pharmacodynamics of Pgp substrates is moderate, and clinical studies on the effects of the C3435T polymorphism and drug treatment with cardiac glycosides, the immunosuppressants cyclosporine and tacrolimus, HIV protease inhibitors, and tricyclic antidepressants are discussed.
Abstract: Transporter proteins, in particular P-glycoprotein (Pgp), are important determinants in absorption, tissue targeting, and elimination of drugs. In addition to physiological and environmental factors, its expression and function are modified by genetic polymorphisms of the MDR1 gene. So far, several MDR1 SNPs have been identified, and mutations at positions 2677 and 3435 were associated with alteration of Pgp expression and/or function. In contrast to drug-metabolizing enzymes (eg, CYP2D6), for which loss of function mutations or gene amplification manifests as distinct phenotypes in the population, the impact of MDR1 polymorphisms on pharmacokinetics and pharmacodynamics of Pgp substrates is moderate. Clinical studies on the effects of the C3435T polymorphism and drug treatment with cardiac glycosides, the immunosuppressants cyclosporine and tacrolimus, HIV protease inhibitors, and tricyclic antidepressants are discussed.
TL;DR: Although multiple mechanisms are known to regulate normal and aberrent expression of c-myc, in this study, where in situ methodologies were used to evaluate high-grade human breast cancers, gene amplification of c -myc appears to play a key role in regulating expression of its mRNA and protein.
Abstract: In this study, we analysed gene amplification, RNA expression and protein expression of the c-myc gene on archival tissue specimens of high-grade human breast cancer, using fluorescent in situ hybridisation (FISH), nonradioactive in situ hybridisation and immunohistochemistry. The specific question that we addressed was whether expression of c-Myc mRNA and protein were correlated with its gene copy amplification, as determined by FISH. Although c-Myc is one of the most commonly amplified oncogenes in human breast cancer, few studies have utilised in situ approaches to directly analyse the gene copy amplification, RNA transcription and protein expression on human breast tumour tissue sections. We now report that by using the sensitive FISH technique, a high proportion (70%) of high-grade breast carcinoma were amplified for the c-myc gene, irrespective of status of the oestrogen receptor. However, the level of amplification was low, ranging between one and four copies of gene gains, and the majority (84%) of the cases with this gene amplification gained only one to two copies. Approximately 92% of the cases were positive for c-myc RNA transcription, and essentially all demonstrated c-myc protein expression. In fact, a wide range of expression levels were detected. Statistically significant correlations were identified among the gene amplification indices, the RNA expression scores and protein expression scores. c-myc gene amplification, as detected by FISH, was significantly associated with expression of its mRNA, as measured by the intensity of in situ hybridisation in invasive cells (P=0.0067), and by the percentage of invasive cells positive for mRNA expression (P=0.0006). c-myc gene amplification was also correlated with the percentage of tumour cells which expressed high levels of its protein, as detected by immunohistochemistry in invasive cells (P=0.0016). Thus, although multiple mechanisms are known to regulate normal and aberrent expression of c-myc, in this study, where in situ methodologies were used to evaluate high-grade human breast cancers, gene amplification of c-myc appears to play a key role in regulating expression of its mRNA and protein.
TL;DR: A genomics approach based on the conservation of synteny was used to develop a bacterial artificial chromosome (BAC) contig across the chicken T2 cytokine gene cluster, revealing that a recently characterized cytokinelike transcript, KK34, is also encoded in this cluster.
Abstract: A genomics approach based on the conservation of synteny was used to develop a bacterial artificial chromosome (BAC) contig across the chicken T2 cytokine gene cluster. Sequencing of representative BACs showed that the chicken genome encodes genes for the homologs of mammalian interleukin-3 (IL-3), IL-4, IL-5, IL-13, and granulocyte-macrophage colony-stimulating factor (GM-CSF). These sequences represent the first T2 cytokines found outside of mammals, and their location demonstrates that the T2 cluster is ancient (at least 300 million years old). Four of these genes (IL-3, IL-4, IL-13, and GM-CSF) are expressed at the mRNA level and can be expressed as recombinant protein. In contrast to the other four genes, the chicken IL-5 (ChIL-5) gene we sequenced lacks a recognizable promoter and regulatory sequences in the predicted 3'-untranslated region (3'-UTR). Further, there is no evidence for its expression at the mRNA level. We, therefore, hypothesize that it is a pseudogene. Genomic analysis revealed that a recently characterized cytokinelike transcript, KK34, not identified in our initial analysis of the BAC sequence, is also encoded in this cluster. This gene may represent a duplication of an ancestral IL-5 gene and may encode the functional homolog of IL-5 in the chicken.
TL;DR: Subtelomeric screening can be offered to all mentally retarded patients, although clinical preselection increases the percentage of chromosomal aberrations detected.
Abstract: Background: Subtelomeric rearrangements contribute to idiopathic mental retardation and human malformations, sometimes as distinct mental retardation syndromes. However, for most subtelomeric defects a characteristic clinical phenotype remains to be elucidated.
Objective: To screen for submicroscopic subtelomeric aberrations using multiplex ligation dependent probe amplification (MLPA).
Methods: 210 individuals with unexplained mental retardation were studied. A new set of subtelomeric probes, the SALSA P036 human telomere test kit, was used.
Results: A subtelomeric aberration was identified in 14 patients (6.7%) (10 deletions and four duplications). Five deletions were de novo; four were inherited from phenotypically normal parents, suggesting that these were polymorphisms. For one deletion, DNA samples of the parents were not available. Two de novo submicroscopic duplications were detected (dup 5qter, dup 12pter), while the other duplications (dup 18qter and dup 22qter) were inherited from phenotypically similarly affected parents. All clinically relevant aberrations (de novo or inherited from similarly affected parents) occurred in patients with a clinical score of ⩾3 using an established checklist for subtelomeric rearrangements. Testing of patients with a clinical score of ⩾3 increased the diagnostic yield twofold to 12.4%. Abnormalities with clinical relevance occurred in 6.3%, 5.1%, and 1.7% of mildly, moderately, and severely retarded patients, respectively, indicating that testing for subtelomeric aberrations among mildly retarded individuals is necessary.
Conclusions: The value of MLPA is confirmed. Subtelomeric screening can be offered to all mentally retarded patients, although clinical preselection increases the percentage of chromosomal aberrations detected. Duplications may be a more common cause of mental retardation than has been appreciated.
TL;DR: This article investigated two different types of duplications, tandem and segmental duplications in Arabidopsis for correlation, conservation, and differences of expression characteristics by making use of large genome-wide expression data as measured by the massively parallel signature sequencing method.
Abstract: In plants, duplication of individual genes, long chromosomal regions, and complete genomes provides a major source for evolutionary innovation. We investigated two different types of duplications, tandem and segmental duplications, in Arabidopsis for correlation, conservation, and differences of expression characteristics by making use of large genome-wide expression data as measured by the massively parallel signature sequencing method. Our analysis indicates that large fractions of duplicated gene pairs still share transcriptional characteristics. However, our results also indicate that expression divergence occurs frequently between duplicated gene pairs, a process which frequently might be employed for the retention of sequence redundant gene pairs. Preserved overall similarity between promoters of duplicated genes as well as preservation of individual cis-elements within the respective promoters indicates that the process of transcriptional neo- and subfunctionalization is restricted to only a fraction of cis-elements. We show that sequence similarities and shared regulatory properties within duplicated promoters provide a powerful means to undertake large-scale cis-regulatory element identification by applying an intragenomic phylogenetic footprinting approach. Our work lays a foundation for future comparative studies to elucidate the molecular manifestation of regulatory similarities and dissimilarities of duplicated genes. Plant genomes are rich in duplicated genes. Various mechanisms can lead to duplication of individual genes or longer chromosomal regions. In addition to the duplication of individual chromosomal regions, polyploidization, and subsequent reorganization of the genome, tandem duplication and the generation of dispersed, duplicated genes account for the generation of sequence redundant copies. Within Arabidopsis, the evolutionary history of the modern genome structure has been exhaustively analyzed. Large fractions of the genome are derived from ancient polyploidization events, and as much as 17% of the genome reportedly is composed of tandemly repeated genes (The Arabidopsis Genome Initiative, 2000; Vision et al., 2000;
TL;DR: The results suggest that the CYP2D6 gene duplication is a possible factor that influences the development of persistence in patients with mood disorders probably by ultrarapid drug metabolism.
Abstract: Recent studies have revealed that genetic polymorphisms of cytochrome P
450 2D6 (CYP2D6) are among the factors that determine the interindividual differences in the metabolism and response to antidepressants. We investigated the relationship between persistent mood disorders and the duplication of the CYP2D6 gene, which encodes an enzyme with increased activity. We screened the prevalence of the CYP2D6 genotypes in 108 patients with persistent mood disorders using long polymerase chain reaction (PCR) and the real-time PCR methods. Clinical correlates with the genotypes were also analyzed. Among the 108 patients, 81 had failed to respond to antidepressants shown to be metabolized by CYP2D6. Of those 81, 8 had a CYP2D6 gene duplication (9.9%, 95% confidence interval 3.4–16.4%) which was higher than the 0.8–1.0% incidence previously observed in healthy Nordic Caucasians. The worst week scores of the Hamilton Depression Rating Scale were higher in the patients with the duplication compared with those without the duplication (P=0.026, student’s t-test). These results suggest that the CYP2D6 gene duplication is a possible factor that influences the development of persistence in patients with mood disorders probably by ultrarapid drug metabolism.
TL;DR: This new, combined approach to studying viral evolution enhances the ability to infer the origins of virus and host genes as well as the complex evolutionary history of large DNA viruses.
TL;DR: Identification of amplified MYCN in neuroblastomas has marked the clinical debut of oncogenes, and MYCN status now is being used world wide as a standard marker for neuroblastoma stratification.