TL;DR: The model proposed herein leads to quantitative predictions that are consistent with observations on the frequency of long-term duplicate gene preservation and with observations that indicate that a common fate of the members of duplicate-gene pairs is the partitioning of tissue-specific patterns of expression of the ancestral gene.
Abstract: It has often been argued that gene-duplication events are most commonly followed by a mutational event that silences one member of the pair, while on rare occasions both members of the pair are preserved as one acquires a mutation with a beneficial function and the other retains the original function. However, empirical evidence from genome duplication events suggests that gene duplicates are preserved in genomes far more commonly and for periods far in excess of the expectations under this model, and whereas some gene duplicates clearly evolve new functions, there is little evidence that this is the most common mechanism of duplicate-gene preservation. An alternative hypothesis is that gene duplicates are frequently preserved by subfunctionalization, whereby both members of a pair experience degenerative mutations that reduce their joint levels and patterns of activity to that of the single ancestral gene. We consider the ways in which the probability of duplicate-gene preservation by such complementary mutations is modified by aspects of gene structure, degree of linkage, mutation rates and effects, and population size. Even if most mutations cause complete loss-of-subfunction, the probability of duplicate-gene preservation can be appreciable if the long-term effective population size is on the order of 10 5 or smaller, especially if there are more than two independently mutable subfunctions per locus. Even a moderate incidence of partial loss-of-function mutations greatly elevates the probability of preservation. The model proposed herein leads to quantitative predictions that are consistent with observations on the frequency of long-term duplicate gene preservation and with observations that indicate that a common fate of the members of duplicate-gene pairs is the partitioning of tissue-specific patterns of expression of the ancestral gene.
TL;DR: Processes and mechanisms of gene and genome evolution in polyploids are reviewed, including the role of transposable elements in structural and regulatory gene evolution; processes and significance of epigenetic silencing; underlying controls of chromosome pairing and mechanisms and functional significance of rapid genome changes are reviewed.
Abstract: Polyploidy is a prominent process in plants and has been significant in the evolutionary history of vertebrates and other eukaryotes. In plants, interdisciplinary approaches combining phylogenetic and molecular genetic perspectives have enhanced our awareness of the myriad genetic interactions made possible by polyploidy. Here, processes and mechanisms of gene and genome evolution in polyploids are reviewed. Genes duplicated by polyploidy may retain their original or similar function, undergo diversification in protein function or regulation, or one copy may become silenced through mutational or epigenetic means. Duplicated genes also may interact through inter-locus recombination, gene conversion, or concerted evolution. Recent experiments have illuminated important processes in polyploids that operate above the organizational level of duplicated genes. These include inter-genomic chromosomal exchanges, saltational, non-Mendelian genomic evolution in nascent polyploids, inter-genomic invasion, and cytonuclear stabilization. Notwithstanding many recent insights, much remains to be learned about many aspects of polyploid evolution, including: the role of transposable elements in structural and regulatory gene evolution; processes and significance of epigenetic silencing; underlying controls of chromosome pairing; mechanisms and functional significance of rapid genome changes; cytonuclear accommodation; and coordination of regulatory factors contributed by two, sometimes divergent progenitor genomes. Continued application of molecular genetic approaches to questions of polyploid genome evolution holds promise for producing lasting insight into processes by which novel genotypes are generated and ultimately into how polyploidy facilitates evolution and adaptation.
TL;DR: A program called NOTUNG is described that facilitates large scale analysis, using both rooted and unrooted trees, and provides a basic building block for inferring duplication dates from gene trees automatically and can be used as an exploratory analysis tool for evaluating alternative hypotheses.
Abstract: Large scale gene duplication is a major force driving the evolution of genetic functional innovation. Whole genome duplications are widely believed to have played an important role in the evolution of the maize, yeast, and vertebrate genomes. The use of evolutionary trees to analyze the history of gene duplication and estimate duplication times provides a powerful tool for studying this process. Many studies in the molecular evolution literature have used this approach on small data sets, using analyses performed by hand. The rapid growth of genetic sequence data will soon allow similar studies on a genomic scale, but such studies will be limited unless the analysis can be automated. Even existing data sets admit alternative hypotheses that would be too tedious to consider without automation. In this paper, we describe a program called NOTUNG that facilitates large scale analysis, using both rooted and unrooted trees. When tested on trees analyzed in the literature, NOTUNG consistently yielded results that agree with the assessments in the original publications. Thus, NOTUNG provides a basic building block for inferring duplication dates from gene trees automatically and can also be used as an exploratory analysis tool for evaluating alternative hypotheses.
TL;DR: It is suggested that a gene duplication ancestral to the divergence of plants and animals gave rise to two main lineages of MADS-box genes: TypeI and TypeII, and the root of the eukaryotic MADs-box gene family is located between these two lineages.
Abstract: Changes in genes encoding transcriptional regulators can alter development and are important components of the molecular mechanisms of morphological evolution. MADS-box genes encode transcriptional regulators of diverse and important biological functions. In plants, MADS-box genes regulate flower, fruit, leaf, and root development. Recent sequencing efforts in Arabidopsis have allowed a nearly complete sampling of the MADS-box gene family from a single plant, something that was lacking in previous phylogenetic studies. To test the long-suspected parallel between the evolution of the MADS-box gene family and the evolution of plant form, a polarized gene phylogeny is necessary. Here we suggest that a gene duplication ancestral to the divergence of plants and animals gave rise to two main lineages of MADS-box genes: TypeI and TypeII. We locate the root of the eukaryotic MADS-box gene family between these two lineages. A novel monophyletic group of plant MADS domains (AGL34 like) seems to be more closely related to previously identified animal SRF-like MADS domains to form TypeI lineage. Most other plant sequences form a clear monophyletic group with animal MEF2-like domains to form TypeII lineage. Only plant TypeII members have a K domain that is downstream of the MADS domain in most plant members previously identified. This suggests that the K domain evolved after the duplication that gave rise to the two lineages. Finally, a group of intermediate plant sequences could be the result of recombination events. These analyses may guide the search for MADS-box sequences in basal eukaryotes and the phylogenetic placement of new genes from other plant species.
TL;DR: It is demonstrated that interactions among unrelated genes are the major cause of robustness against mutations, which is probably an evolved response of genetic networks to stabilizing selection.
Abstract: There are two principal mechanisms that are responsible for the ability of an organism's physiological and developmental processes to compensate for mutations. In the first, genes have overlapping functions, and loss-of-function mutations in one gene will have little phenotypic effect if there are one or more additional genes with similar functions. The second mechanism has its origin in interactions between genes with unrelated functions, and has been documented in metabolic and regulatory gene networks. Here I analyse, on a genome-wide scale, which of these mechanisms of robustness against mutations is more prevalent. I used functional genomics data from the yeast Saccharomyces cerevisiae to test hypotheses related to the following: if gene duplications are mostly responsible for robustness, then a correlation is expected between the similarity of two duplicated genes and the effect of mutations in one of these genes. My results demonstrate that interactions among unrelated genes are the major cause of robustness against mutations. This type of robustness is probably an evolved response of genetic networks to stabilizing selection.
TL;DR: It is proposed that, on the scale of BAC-sized segments of DNA, chromosomal rearrangements have been only a minor factor in the divergence of genome organization among plants and the dominating factors have been repeated rounds of large-scale genome duplication followed by selective gene loss.
Abstract: A 105-kilobase bacterial artificial chromosome (BAC) clone from the ovate-containing region of tomato chromosome 2 was sequenced and annotated. The tomato BAC sequence was then compared, gene by gene, with the sequenced portions of the Arabidopsis thaliana genome. Rather than matching a single portion of the Arabidopsis genome, the tomato clone shows conservation of gene content and order with four different segments of Arabidopsis chromosomes 2–5. The gene order and content of these individual Arabidopsis segments indicate that they derived from a common ancestral segment through two or more rounds of large-scale genome duplication events—possibly polyploidy. One of these duplication events is ancient and may predate the divergence of the Arabidopsis and tomato lineages. The other is more recent and is estimated to have occurred after the divergence of tomato and Arabidopsis ≈112 million years ago. Together, these data suggest that, on the scale of BAC-sized segments of DNA, chromosomal rearrangements (e.g., inversions and translocations) have been only a minor factor in the divergence of genome organization among plants. Rather, the dominating factors have been repeated rounds of large-scale genome duplication followed by selective gene loss. We hypothesize that these processes have led to the network of synteny revealed between tomato and Arabidopsis and predict that such networks of synteny will be common when making comparisons among higher plant taxa (e.g., families).
TL;DR: MLL tandem duplications are less common than previously reported, are preferentially observed in AML with normal karyotypes, but can also be found in the presence of chromosome alterations; are not strongly associated with an FAB subtype; and identifies a subgroup of patients with an unfavorable prognosis.
Abstract: Partial tandem duplications of the MLL gene have been associated with trisomy 11 in acute myeloid leukemia (AML) and recently, have also been reported for karyotypically normal AML. In order to test the incidence and prognostic importance of this molecular marker, we have analyzed eight cases of AML with trisomy 11 and 387 unselected consecutive cases with AML for partial duplications of the MLL gene. Patients with normal karyotypes and those with various chromosome aberrations were included. De novo as well as secondary leukemias including all FAB subtypes were analyzed. Performing a one-step RT-PCR with 35 cycles using an exon 9 forward primer and an exon 3 reverse primer partial tandem duplications of the MLL gene were demonstrated in 3/8 (37.5%) patients with trisomy 11. In addition, 13/387 (3.4%) of unselected cases revealed a tandem duplication. Ten of these 13 cases were cytogenetically normal, the other three cases had < or =2 additional chromosomal alterations. Sequencing of the RT-PCR products of all 16 positive cases revealed fusions of MLL exon 9/exon 3 (e9/e3) (six cases), e10/e3 (three cases), e11/e3 (four cases) or combinations of differentially spliced e10/e3 and e11/e3 (three cases) transcripts. The duplications were confirmed by genomic long range PCR and Southern blot hybridization. Twelve cases with the MLL duplication were de novo myeloid leukemia, one was a secondary AML after MDS, three were therapy-related AML (t-AML). Of the 16 MLL-duplication positive cases, seven were classified as FAB M2, two as M1, five as M4, one as M0, one as M5b. The mean age was 62.3 years for patients with MLL duplication vs 50.3 years for the control group. Of 15 adult patients, 12 received treatment. Of these, three were nonresponders, five had early relapse (< or =6 months), four relapsed between 7 and 12 months. Median survival and relapse-free interval of the MLL duplication positive group was significantly worse than those of an age-matched karyotypically normal control group. In conclusion, MLL tandem duplications (1) are less common than previously reported; (2) are preferentially observed in AML with normal karyotypes, but can also be found in the presence of chromosome alterations; (3) are not strongly associated with an FAB subtype; (4) were not observed with the prognostically favorable t(8;21), inv(16), and t(15;17), other recurrent translocations, or in complex karyotypes; and (5) identifies a subgroup of patients with an unfavorable prognosis.
TL;DR: It is demonstrated that the FLT3 ITD mutation occurs in a significant percentage of adult AML cases and is an important adverse prognostic factor that appears independent of conventional karyotypic findings.
Abstract: Genomic DNA from 106 cases of adult de novo acute myeloid leukaemia (AML) was screened by polymerase chain reaction (PCR) and gel electrophoresis for FLT3 internal tandem duplication (ITD) mutations within the juxtamembrane (JM) domain. FLT3 mutations were detected in 14 cases (13.2%) and occurred in FAB types M1 (4 out of 14 cases), M3 (1 out of 10 cases), M4 (5 out of 37 cases) and M5 (4 out of 11 cases). Sequence analysis of four cases with abnormal PCR electrophoretic patterns revealed in frame duplications in the region of exon 11 of between 27 and 111 base pairs. Three are predicted to result in the tandem duplication of adjacent amino acid residues and one to result in a tandem duplication plus insertion of a novel amino acid motif. Statistical analysis showed the FLT3 mutations to be a strong prognostic factor, with patients lacking the mutation surviving significantly longer from diagnosis (mean 29.1 months) than those with an ITD (mean 12.8 months; P = 0.0002). Thirteen of the 14 patients with FLT3 mutations died within 18 months of diagnosis. FLT3 mutations were of prognostic significance in good risk disease (P = 0.04), as well as in patients with standard risk disease (P = 0.0096). This study demonstrates that the FLT3 ITD mutation occurs in a significant percentage of adult AML cases and is an important adverse prognostic factor that appears independent of conventional karyotypic findings.
TL;DR: The atomic structures of two proteins in the histidine biosynthesis pathway consist of beta/alpha barrels with a twofold repeat pattern and the detection of subdomain structures may be useful in efforts to search genome sequences for functionally and structurally related proteins.
Abstract: The atomic structures of two proteins in the histidine biosynthesis pathway consist of beta/alpha barrels with a twofold repeat pattern. It is likely that these proteins evolved by twofold gene duplication and gene fusion from a common half-barrel ancestor. These ancestral domains are not visible as independent domains in the extant proteins but can be inferred from a combination of sequence and structural analysis. The detection of subdomain structures may be useful in efforts to search genome sequences for functionally and structurally related proteins.
TL;DR: Intergeneric relationships of Pinaceae using sequences of the chloroplast matK gene, the mitochondrial nad5 gene, and the low-copy nuclear gene 4CL are studied to suggest morphology of both vegetative and reproductive organs has undergone convergent evolution within the pine family.
Abstract: In Pinaceae, the chloroplast, mitochondrial, and nuclear genomes are paternally, maternally, and biparentally inherited, respectively. Examining congruence and incongruence of gene phylogenies among the three genomes should provide insights into phylogenetic relationships within the family. Here we studied intergeneric relationships of Pinaceae using sequences of the chloroplast matK gene, the mitochondrial nad5 gene, and the low-copy nuclear gene 4CL. The 4CL gene may exist as a single copy in some species of Pinaceae, but it constitutes a small gene family with two or three members in others. Duplication and deletion of the 4CL gene occurred at a tempo such that paralogous loci are maintained within but not between genera. Exons of the 4CL gene have diverged approximately twice as fast as the matK gene and five times more rapidly than the nad5 gene. The partition-homogeneity test indicates that the three data sets are homogeneous. A combined analysis of the three gene sequences generated a well-resolved and strongly supported phylogeny. The combined phylogeny, which is topologically congruent with the three individual gene trees based on the Templeton test, is likely to represent the organismal phylogeny of Pinaceae. This phylogeny agrees to a certain extent with previous phylogenetic hypotheses based on morphological, anatomical, and immunological data. Disagreement between the previous hypotheses and the three-genome phylogeny suggests that morphology of both vegetative and reproductive organs has undergone convergent evolution within the pine family. The strongly supported monophyly of Nothotsuga longibracteata, Tsuga mertensiana, and Tsuga canadensis on all three gene phylogenies provides evidence against previous hypotheses of intergeneric hybrid origins of N. longibracteata and T. mertensiana. Divergence times of the genera were estimated based on sequence divergence of the matK gene, and they correspond well with the fossil record.
TL;DR: The nearly neutral theory is presented with special reference to the nature of weak selection, where the mean selection coefficient is negative, and the variance is dependent on the environmental diversity.
Abstract: Both drift and selection are important for nucleotide substitutions in evolution. The nearly neutral theory was developed to clarify the effects of these processes. In this article, the nearly neutral theory is presented with special reference to the nature of weak selection. The mean selection coefficient is negative, and the variance is dependent on the environmental diversity. Some facts relating to the theory are reviewed. As well as nucleotide substitutions, illegitimate recombination events such as duplications, deletions and gene conversions leave indelible marks on molecular evolution. Gene duplication and conversion are sources of the evolution of new gene functions. Positive selection is necessary for the evolution of novel functions. However, many examples of current gene families suggest that both drift and selection are at work on their evolution.
TL;DR: In this article, an analysis of variations in mitochondrial gene arrangement observed so far among the best sampled group, the deuterostome animals, is presented, and the duplication/random loss model is invoked to potentially reconstruct many of these rearrangements.
Abstract: One model for gene rearrangement specifies that a portion of the genome is duplicated, followed by the random loss of the now supernumerary genes. Depending on which genes are lost, this process may reorder the duplicated block of genes. Comparisons of mitochondrial genomes have many advantages for addressing the processes of molecular evolution, including gene rearrangement. Here is an analysis of those variations in mitochondrial gene arrangement observed so far among the best sampled group, the deuterostome animals. The duplication/ random loss model is invoked to potentially reconstruct many, though not all, of these rearrangements.
TL;DR: The maize genome is replete with chromosomal duplications and repetitive DNA, and studies of diversity in the wild relatives of maize indicate that different genes have different histories and also show that domestication and intensive breeding have had heterogeneous effects on genetic diversity across genes.
Abstract: The maize genome is replete with chromosomal duplications and repetitive DNA. The duplications resulted from an ancient polyploid event that occurred over 11 million years ago. Based on DNA sequence data, the polyploid event occurred after the divergence between sorghum and maize, and hence the polyploid event explains some of the difference in DNA content between these two species. Genomic rearrangement and diploidization followed the polyploid event. Most of the repetitive DNA in the maize genome is retrotransposable elements, and they comprise 50% of the genome. Retrotransposon multiplication has been relatively recent—within the last 5–6 million years—suggesting that the proliferation of retrotransposons has also contributed to differences in DNA content between sorghum and maize. There are still unanswered questions about repetitive DNA, including the distribution of repetitive DNA throughout the genome, the relative impacts of retrotransposons and chromosomal duplication in plant genome evolution, and the hypothesized correlation of duplication events with transposition. Population genetic processes also affect the evolution of genomes. We discuss how centromeric genes should, in theory, contain less genetic diversity than noncentromeric genes. In addition, studies of diversity in the wild relatives of maize indicate that different genes have different histories and also show that domestication and intensive breeding have had heterogeneous effects on genetic diversity across genes.
TL;DR: The results show that unresolved DNA damage triggers this instability, and that XRCC2 andXRCC3 are potential tumour-suppressor genes in mammals.
Abstract: Growth and development are dependent on the faithful duplication of cells. Duplication requires accurate genome replication, the repair of any DNA damage, and the precise segregation of chromosomes at mitosis; molecular checkpoints ensure the proper progression and fidelity of each stage. Loss of any of these highly conserved functions may result in genetic instability and proneness to cancer. Here we show that highly significant increases in chromosome missegregation occur in cell lines lacking the RAD51-like genes XRCC2 and XRCC3. This increased missegregation is associated with fragmentation of the centrosome, a component of the mitotic spindle, and not with loss of the spindle checkpoint. Our results show that unresolved DNA damage triggers this instability, and that XRCC2 and XRCC3 are potential tumour-suppressor genes in mammals.
TL;DR: The findings suggest that overexpressed GASC1 may play an important role in the development and/or progression of various types of cancer including ESC.
Abstract: In a recent study, we identified frequent amplification of DNA copy
number at chromosome 9p23–24 in cell lines derived from esophageal
squamous cell carcinomas (ESCs), using comparative genomic
hybridization. Because amplified regions often harbor oncogenes and/or
other tumor-associated genes, and because 9p23–24 amplification had
been reported in various other types of cancers, we used fluorescence
in situ hybridization and Southern blot analysis to map
the 9p23–24 amplicon. We then screened target genes/transcripts
present within this amplicon by Northern blotting. With this strategy,
we successfully cloned a novel gene, designated gene amplified
in squamous cell carcinoma 1 (GASC1), that was amplified and
overexpressed in several ESC cell lines. The deduced amino acid
sequence of GASC1 contains two PHD-finger motifs and a PX domain.
PHD-finger motifs are found in nuclear proteins that participate in
chromatin-mediated transcriptional regulation and are present in a
number of proto-oncogenes. Our findings suggest that overexpressed
GASC1 may play an important role in the development and/or progression
of various types of cancer including ESC.
TL;DR: In the authors' opinion, real-time quantitative PCR is the first method of choice in diagnosing PMP22 duplication and deletion and is fast, allowing 13 patients to be diagnosed in 2 h, and involves no radioisotopes and requires no post-PCR handling.
Abstract: In Charcot-Marie-Tooth type 1A disease (CMTIA), heterozygosity for the peripheral myelin protein 22 (PMP22) duplication increases the gene dose from two to three, whereas, in hereditary neuropathy with liability to pressure palsies (HNPP), heterozygosity for the PMP22 deletion reduces the gene dose from two to one. Thirty-eight Norwegian patients with CMT1, 4 patients with HNPP, 15 asymptomatic family members, and 45 normal controls were studied using the ABI 7700 sequence detection system and the TaqMan method of real-time quantitative polymerase chain reaction (PCR). Using a comparative threshold cycle (Ct) method and albumin as reference gene, the gene copy number by PMP22 gene duplication or deletion on chromosome 17p11.2-12 was quantified. The PMP22 duplication ratio ranged from 1.50 to 2.21, the PMP22 deletion ratio ranged from 0.44 to 0.55, and the PMP22 ratio in normals ranged from 0.82 to 1.27. All samples were run in triplicate, with a mean standard deviation of 0.07 (range 0.01-0.17). Thirty-four of thirty-eight CMT1 patients (89.6%) had the PMP22 duplication and the four HNPP patients had the PMP22 deletion. This was not found in any of the asymptomatic family members or the controls. Real-time quantitative PCR is a sensitive, specific, and reproducible method for diagnosing PMP22 duplication and deletion. The method is fast, allowing 13 patients to be diagnosed in 2 h. It involves no radioisotopes and requires no post-PCR handling. In our opinion, real-time quantitative PCR is the first method of choice in diagnosing PMP22 duplication and deletion.
TL;DR: The results show that T NDM can be associated with a methylation change and identify a novel methylation imprint on chromosome 6 associated with TNDM.
Abstract: Recently, we reported the localization of a gene for transient neonatal diabetes mellitus (TNDM), a rare form of childhood diabetes, to an approximately 5.4 Mb region of chromosome 6q24. We have also shown that TNDM is associated with both paternal uniparental disomy (UPD) of chromosome 6 and paternal duplications of the critical region. The sequencing of P1-derived artificial chromosome clones from within the region of interest has allowed us to further localize the gene and to investigate the methylation status of the region. The gene is now known to reside in a 300-400 kb region of 6q24 which contains several CpG islands. At one island we have demonstrated differential DNA methylation between patients with paternal UPD of chromosome 6 and normal controls. In addition, two patients with TNDM, in whom neither paternal UPD of chromosome 6 nor duplication of 6q24 have been found, show a DNA methylation pattern identical to that of patients with paternal UPD of chromosome 6. Control individuals show a hemizygous methylation pattern. These results show that TNDM can be associated with a methylation change and identify a novel methylation imprint on chromosome 6 associated with TNDM.
TL;DR: Domain believed to be involved in recognitional specificity, such as the leucine-rich repeat (LRR), are subject to adaptive selection and may generate further variation at these complexes.
Abstract: Several common themes have shaped the evolution of plant disease resistance genes These include duplication events of progenitor resistance genes and further expansion to create clustered gene families Variation can arise from both intragenic and intergenic recombination and gene conversion Recombination has also been implicated in the generation of novel resistance specificities Resistance gene clusters appear to evolve more rapidly than other regions of the genome In addition, domains believed to be involved in recognitional specificity, such as the leucine-rich repeat (LRR), are subject to adaptive selection Transposable elements have been associated with some resistance gene clusters, and may generate further variation at these complexes
TL;DR: A simple semiquantitative procedure based on the multiplex PCR of short fluorescent fragments allowed this work to confirm in HNPCC families three known deletions of MLH1 or MSH2 and to detect in 19 H NPCC families, in which analysis of mismatch repair genes using classical methods had revealed no alteration.
Abstract: Large genomic deletions within the mismatch repair MLH1 and MSH2 genes have been identified in families with the hereditary nonpolyposis colorectal cancer (HNPCC) syndrome, and their detection represents a technical problem. To facilitate their detection, we developed a simple semiquantitative procedure based on the multiplex PCR of short fluorescent fragments. This method allowed us to confirm in HNPCC families three known deletions of MLH1 or MSH2 and to detect in 19 HNPCC families, in which analysis of mismatch repair genes using classical methods had revealed no alteration, a deletion of exon 5 and a duplication of MSH2 involving exons 9 and 10. The presence of such duplications, the frequency of which is probably underestimated, must be considered in HNPCC families in which conventional screening methods have failed to detect mutations.
TL;DR: It is concluded that recurrent gene duplication and subsequent differentiation is a major adaptive strategy in plant genome evolution.
Abstract: Plant genomes appear to exploit the process of gene duplication as a primary means of acquiring biochemical and developmental flexibility. Thus, for example, most of the enzymatic components of plant secondary metabolism are encoded by small families of genes that originated through duplication over evolutionary time. The dynamics of gene family evolution are well illustrated by the genes that encode chalcone synthase (CHS), the first committed step in flavonoid biosynthesis. We review pertinent facts about CHS evolution in flowering plants with special reference to the morning glory genus, Ipomoea. Our review shows that new CHS genes are recruited recurrently in flowering plant evolution. Rates of nucleotide substitution are frequently accelerated in new duplicate genes, and there is clear evidence for repeated shifts in enzymatic function among duplicate copies of CHS genes. In addition, we present new data on expression patterns of CHS genes as a function of tissue and developmental stage in the common morning glory (I. purpurea). These data show extensive differentiation in gene expression among duplicate copies of CHS genes. We also show that a single mutation which blocks anthocyanin biosynthesis in the floral limb is correlated with a loss of expression of one of the six duplicate CHS genes present in the morning glory genome. This suggests that different duplicate copies of CHS have acquired specialized functional roles over the course of evolution. We conclude that recurrent gene duplication and subsequent differentiation is a major adaptive strategy in plant genome evolution.
TL;DR: A two-step model is proposed to reconstruct the possible evolutionary history of the four nitrogen fixation genes, finding that an ancestor gene gave rise to an ancestral bicistronic operon, which underwent a paralogous operon duplication event followed by evolutionary divergence leading to the ancestors of the present-day nifDK and nifEN operons.
Abstract: The pairs of nitrogen fixation genes nifDK and nifEN encode for the alpha and beta subunits of nitrogenase and for the two subunits of the NifNE protein complex, involved in the biosynthesis of the FeMo cofactor, respectively. Comparative analysis of the amino acid sequences of the four NifD, NifK, NifE, and NifN in several archaeal and bacterial diazotrophs showed extensive sequence similarity between them, suggesting that their encoding genes constitute a novel paralogous gene family. We propose a two-step model to reconstruct the possible evolutionary history of the four genes. Accordingly, an ancestor gene gave rise, by an in-tandem paralogous duplication event followed by divergence, to an ancestral bicistronic operon; the latter, in turn, underwent a paralogous operon duplication event followed by evolutionary divergence leading to the ancestors of the present-day nifDK and nifEN operons. Both these paralogous duplication events very likely predated the appearance of the last universal common ancestor. The possible role of the ancestral gene and operon in nitrogen fixation is also discussed.
TL;DR: The Ehlers-Danlos syndromes are a heterogeneous group of inherited connective tissue disorders that are characterized by joint hypermobility and skin fragility and hyperextensibility, and at least 20 different mutations have been identified in the LH1 gene that contribute to LH deficiency and the clinical characteristics of EDS VI.
TL;DR: The results suggest that amplification and over‐expression of AIB1 are likely to increase the number of malignant phenotypes of gastric cancers and that it can be expected to be useful as a marker of poor prognosis.
Abstract: Our analysis of chromosomal aberrations in primary gastric cancers using comparative genomic hybridization has revealed novel, high and frequent copy number increases in the long arm of chromosome 20, indicating that this region contains novel amplified genes involved in gastric cancer progression. AIB1, a member of the steroid receptor co-activator-1 family, has been cloned on 20q12 as a candidate target gene for this amplification in human breast cancers. In this study, we examined the numbers of AIB1 copies as well as their expression and relation to clinico-pathological features in 72 primary gastric cancers. AIB1 amplification was observed in 7% and over-expression in 40% of the specimens. AIB1 amplification always coincided with its over-expression, but several cases showed AIB1 over-expression without amplification, suggesting that expression of AIB1 is regulated not only by gene amplification but also by other mechanisms, such as transcriptional activation, in human gastric cancer. Gastric cancers with AIB1 amplification showed extensive lymph node metastases, liver metastases and poorer prognosis compared to those without amplification. Our results suggest that amplification and over-expression of AIB1 are likely to increase the number of malignant phenotypes of gastric cancers and that it can be expected to be useful as a marker of poor prognosis.
TL;DR: The present findings imply that the hTERT locus may be a frequent target for amplification during tumorigenesis and that this genetic event probably contributes to the dysregulation of telomerase activity occurring in human tumors.
Abstract: Activation of telomerase is a crucial step during cellular immortalization and malignant transformation of human cells and requires the induction of the catalytic component, human telomerase reverse transcriptase (hTERT), encoded by the hTERT gene. It is poorly understood how the hTERT gene is activated in human cancer cells. In the present study, we examined the hTERT gene copy number in human cancer cell lines and in primary tumor tissues. Amplification of the hTERT gene was observed in 8 of 26 (31%) tumor cell lines and 17 of 58 (30%) primary tumors examined (8 of 21 lung tumors, 3 of 10 cervical tumors, 5 of 19 breast carcinomas, and 1 of 8 neuroblastomas). In addition, 13 of 26 (50%) cell lines and 13 of 58 (22%) primary tumors displayed gain of hTERT gene copies with 3-4 copies/cell. The present findings imply that the hTERT locus may be a frequent target for amplification during tumorigenesis and that this genetic event probably contributes to the dysregulation of telomerase activity occurring in human tumors.
TL;DR: In this paper, a de novo duplication of chromosome 7p11.2-p13 in a proband with features characteristic of Silver-Russell syndrome (SRS) was identified.
Abstract: Summary Silver-Russell syndrome (SRS) is characterized by pre- and postnatal growth failure and other dysmorphic features. The syndrome is genetically heterogenous, but maternal uniparental disomy of chromosome 7 has been demonstrated in ∼7% of cases. This suggests that at least one gene on chromosome 7 is imprinted and involved in the pathogenesis of SRS. We have identified a de novo duplication of 7p11.2-p13 in a proband with features characteristic of SRS. FISH confirmed the presence of a tandem duplication encompassing the genes for growth factor receptor–binding protein 10 ( GRB10 ) and insulin-like growth factor–binding proteins 1 and 3 ( IGFBP1 and - 3 ) but not that for epidermal growth factor–receptor ( EGFR ). Microsatellite markers showed that the duplication was of maternal origin. These findings provide the first evidence that SRS may result from overexpression of a maternally expressed imprinted gene, rather than from absent expression of a paternally expressed gene. GRB10 lies within the duplicated region and is a strong candidate, since it is a known growth supressor. Futhermore, the mouse homologue ( Grb10/Meg1 ) is reported to be maternally expressed and maps to the imprinted region of proximal mouse chromosome 11 that demonstrates prenatal growth failure when it is maternally disomic. We have demonstrated that the GRB10 genomic interval replicates asynchronously in human lymphocytes, suggestive of imprinting. An additional 36 SRS probands were investigated for duplication of GRB10 , but none were found. However, it remains possible that GRB10 and/or other genes within 7p11.2-p13 are responsible for some cases of SRS.
TL;DR: Only a weak correlation is observed, suggesting that coding sequence and mRNA expression patterns of duplicate gene pairs evolve independently and at vastly different rates.
Abstract: The neutralist perspective on molecular evolution maintains that the vast majority of mutations affecting gene function are neutral or deleterious. After a gene duplication where both genes are retained, it predicts that original and duplicate genes diverge at clock-like rates. This prediction is usually tested for coding sequences, but can also be applied to another important aspect of gene function, the genes' expression pattern. Moreover, if both sequence and expression pattern diverge at clock-like rates, a correlation between divergence in sequence and divergence in expression patterns is expected. Duplicate gene pairs with more highly diverged sequences should also show more highly diverged expression patterns. This prediction is tested for a large sample of duplicated genes in the yeast Saccharomyces cerevisiae, using both genome sequence and microarray expression data. Only a weak correlation is observed, suggesting that coding sequence and mRNA expression patterns of duplicate gene pairs evolve independently and at vastly different rates. Implications of this finding for the neutralist–selectionist debate are discussed.
TL;DR: In this brief survey, the source materials derive from some expected sources, such as the progenitor of the chloroplast genome and genes inherited from the common ancestor of all eukaryotes, but also include a surprising number of novel genes of unknown origin.
TL;DR: The response to treatment of patients with tumors wild- type for p53 and without MDM2 amplification was quite varied, indicating that expression of a wild-type p53 gene at diagnosis cannot always facilitate a favorable outcome.
TL;DR: The results suggest that the MHC class III complement region was established in the tetrapod lineage, or lost in the teleost lineage.
Abstract: The thioester-containing complement components, C3 and C4, are believed to have arisen by gene duplication from a common ancestor, and the mammalian C4 gene resides in the vicinity of the C2 and B genes within the major histocompatibility complex (MHC) class III region. To analyze the evolution of both the complement system and the MHC, we determined the complete primary structures of two C3 genes, termed Orla C3-1 and Orla C3-2, and one C4 gene, termed Orla C4, of a teleost, Japanese medaka fish (Oryzias latipes), by analyzing cDNA clones isolated from a liver library constructed using the inbred AA2 strain. The deduced basic structures of Orla C3-1, C3-2, and C4, such as the subunit chain structure, the thioester site, and the proteolytic activation site, are similar to their mammalian counterparts. However, the catalytic His residue which greatly increases the rate of thioester reaction, is replaced by Ala in Orla C3-2, implying functional differentiation between two C3 molecules. Mapping analysis revealed a close linkage between the C3-1 and C3-2 genes, indicating that they arose by a local duplication rather than by a genome-wide tetraploidization. The C4 gene belongs to a different linkage group, and no linkage was observed among the C3, C4, Bf/C2, MHC class I, and MHC class II loci. These results suggest that the MHC class III complement region was established in the tetrapod lineage, or lost in the teleost lineage.