TL;DR: The pattern of historical recombination in A. thaliana is characterized and an enrichment of hotspots in its intergenic regions and repetitive DNA is observed, which is consistent with the pattern that is observed for humans but which is strikingly different from that observed in other plant species.
Abstract: Arabidopsis thaliana is native to Eurasia and is naturalized across the world. Its ability to be easily propagated and its high phenotypic variability make it an ideal model system for functional, ecological and evolutionary genetics. To date, analyses of the natural genetic variation of A. thaliana have involved small numbers of individual plants or genetic markers. Here we genotype 1,307 worldwide accessions, including several regional samples, using a 250K SNP chip. This allowed us to produce a high-resolution description of the global pattern of genetic variation. We applied three complementary selection tests and identified new targets of selection. Further, we characterized the pattern of historical recombination in A. thaliana and observed an enrichment of hotspots in its intergenic regions and repetitive DNA, which is consistent with the pattern that is observed for humans but which is strikingly different from that observed in other plant species. We have made the seeds we used to produce this Regional Mapping (RegMap) panel publicly available. This panel comprises one of the largest genomic mapping resources currently available for global natural isolates of a non-human species.
TL;DR: Analytical methods for genetic mapping using the JAX Diversity Outbred population are described and the power and high mapping resolution achieved with this population are demonstrated by mapping a serum cholesterol trait to a 2-Mb region on chromosome 3 containing only 11 genes.
Abstract: The JAX Diversity Outbred population is a new mouse resource derived from partially inbred Collaborative Cross strains and maintained by randomized outcrossing. As such, it segregates the same allelic variants as the Collaborative Cross but embeds these in a distinct population architecture in which each animal has a high degree of heterozygosity and carries a unique combination of alleles. Phenotypic diversity is striking and often divergent from phenotypes seen in the founder strains of the Collaborative Cross. Allele frequencies and recombination density in early generations of Diversity Outbred mice are consistent with expectations based on simulations of the mating design. We describe analytical methods for genetic mapping using this resource and demonstrate the power and high mapping resolution achieved with this population by mapping a serum cholesterol trait to a 2-Mb region on chromosome 3 containing only 11 genes. Analysis of the estimated allele effects in conjunction with complete genome sequence data of the founder strains reduced the pool of candidate polymorphisms to seven SNPs, five of which are located in an intergenic region upstream of the Foxo1 gene.
TL;DR: A modification of BSA (BSR-Seq) is developed that makes use of RNA-Sequ reads to efficiently map genes even in populations for which no polymorphic markers have been previously identified.
Abstract: Bulked segregant analysis (BSA) is an efficient method to rapidly and efficiently map genes responsible for mutant phenotypes. BSA requires access to quantitative genetic markers that are polymorphic in the mapping population. We have developed a modification of BSA (BSR-Seq) that makes use of RNA-Seq reads to efficiently map genes even in populations for which no polymorphic markers have been previously identified. Because of the digital nature of next-generation sequencing (NGS) data, it is possible to conduct de novo SNP discovery and quantitatively genotype BSA samples by analyzing the same RNA-Seq data using an empirical Bayesian approach. In addition, analysis of the RNA-Seq data provides information on the effects of the mutant on global patterns of gene expression at no extra cost. In combination these results greatly simplify gene cloning experiments. To demonstrate the utility of this strategy BSR-Seq was used to clone the glossy3 (gl3) gene of maize. Mutants of the glossy loci exhibit altered accumulation of epicuticular waxes on juvenile leaves. By subjecting the reference allele of gl3 to BSR-Seq, we were able to map the gl3 locus to an ∼2 Mb interval. The single gene located in the ∼2 Mb mapping interval whose expression was down-regulated in the mutant pool was subsequently demonstrated to be the gl3 gene via the analysis of multiple independent transposon induced mutant alleles. The gl3 gene encodes a putative myb transcription factor, which directly or indirectly affects the expression of a number of genes involved in the biosynthesis of very-long-chain fatty acids.
TL;DR: This chapter discusses molecular marker maps of major crop species, and some concepts and new methods for molecular mapping in plants using PCR-based marker systems and information systems approaches to support discovery in agricultural genomics.
Abstract: General preface. Preface. 1. Some concepts and new methods for molecular mapping in plants* B. Burr. 2. PCR-based marker systems R. Reiter. 3. Constructing a plant genetic linkage map with DNA markers N.D. Young. 4. Use of DNA markers in introgression and isolation of genes associated with durable resistance to rice blast D.-H. Chen, et al. 5. Mapping quantitative trait loci S.J. Knapp. 6. Comparative mapping of plant chromosomes A.H. Paterson, J.L. Bennetzen. 7. Breeding multigenic traits C.W. Stuber. 8. Information systems approaches to support discovery in agricultural genomics B.W.S. Sobral, et al. 9. Introduction: molecular marker maps of major crop species R.L. Phillips, I.K. Vasil. 10. Molecular marker analyses in alfalfa and related species E.C. Brummer, et al. 11. An integrated RFLP map of Arabidopsis thaliana* H.M. Goodman, et al. 12. An integrated map of the barley genome A. Kleinhofs, A. Graner. 13. DNA-based marker maps of Brassica, C.F. Quiros. 14. Molecular genetic map of cotton A.H. Paterson. 15. Maize molecular maps: Markers, bins, and database E.H. Coe, et al. 16. RFLP map of peanut H.T. Stalker, et al. 17. Phaseolus vulgaris - The common bean integration of RFLP and RAPD-based linkage maps C.E. Vallejos, et al. 18. RFLP map of the potato C. Gebhardt, et al. 19. Rice molecular map S.R. McCouch. 20. A framework genetic map of sorghum containing RFLP, SSR and morphological markers J.L. Bennetzen, et al. 21. RFLP map of soybean R.C. Shoemaker, et al. 22. Genetic mapping in sunflowers S.J. Knapp, et al. 23. The molecular map of tomato A. Frary, S.D. Tanksley. 24. Molecular-marker maps of the cultivated wheats and other Triticum species G.E. Hart. 25. Molecular marker linkage maps in diploid and hexaploid oat (Avena sp.) S.F. Kianian, et al. 26. A compilation of molecular genetic maps of cultivated plants O. Riera-Lizarazu, et al. List of Contributors. Subject Index.
TL;DR: The high degree of concordance between the linkage maps and the pseudomolecules demonstrates both the quality of the potato genome sequence and the functionality of the Infinium 8303 Potato Array.
Abstract: To facilitate genome-guided breeding in potato, we developed an 8303 Single Nucleotide Polymorphism (SNP) marker array using potato genome and transcriptome resources. To validate the Infinium 8303 Potato Array, we developed linkage maps from two diploid populations (DRH and D84) and compared these maps with the assembled potato genome sequence. Both populations used the doubled monoploid reference genotype DM1-3 516 R44 as the female parent but had different heterozygous diploid male parents (RH89-039-16 and 84SD22). Over 4,400 markers were mapped (1,960 in DRH and 2,454 in D84, 787 in common) resulting in map sizes of 965 (DRH) and 792 (D84) cM, covering 87% (DRH) and 88% (D84) of genome sequence length. Of the mapped markers, 33.5% were in candidate genes selected for the array, 4.5% were markers from existing genetic maps, and 61% were selected based on distribution across the genome. Markers with distorted segregation ratios occurred in blocks in both linkage maps, accounting for 4% (DRH) and 9% (D84) of mapped markers. Markers with distorted segregation ratios were unique to each population with blocks on chromosomes 9 and 12 in DRH and 3, 4, 6 and 8 in D84. Chromosome assignment of markers based on linkage mapping differed from sequence alignment with the Potato Genome Sequencing Consortium (PGSC) pseudomolecules for 1% of the mapped markers with some disconcordant markers attributable to paralogs. In total, 126 (DRH) and 226 (D84) mapped markers were not anchored to the pseudomolecules and provide new scaffold anchoring data to improve the potato genome assembly. The high degree of concordance between the linkage maps and the pseudomolecules demonstrates both the quality of the potato genome sequence and the functionality of the Infinium 8303 Potato Array. The broad genome coverage of the Infinium 8303 Potato Array compared to other marker sets will enable numerous downstream applications.
TL;DR: In this article, a high-density genetic map for Cucumis sativus was developed that contained 735 marker loci in seven linkage groups spanning 707.8 cM. Integration of genetic and physical maps resulted in a chromosomelevel draft genome assembly comprising 193 Mbp, or 53% of the 367 Mbp cucumber genome.
Abstract: Cucumber, Cucumis sativus L. is the only taxon with 2n = 2x = 14 chromosomes in the genus Cucumis. It consists of two cross-compatible botanical varieties: the cultivated C. sativus var. sativus and the wild C. sativus var. hardwickii. There is no consensus on the evolutionary relationship between the two taxa. Whole-genome sequencing of the cucumber genome provides a new opportunity to advance our understanding of chromosome evolution and the domestication history of cucumber. In this study, a high-density genetic map for cultivated cucumber was developed that contained 735 marker loci in seven linkage groups spanning 707.8 cM. Integration of genetic and physical maps resulted in a chromosome-level draft genome assembly comprising 193 Mbp, or 53% of the 367 Mbp cucumber genome. Strategically selected markers from the genetic map and draft genome assembly were employed to screen for fosmid clones for use as probes in comparative fluorescence in situ hybridization analysis of pachytene chromosomes to investigate genetic differentiation between wild and cultivated cucumbers. Significant differences in the amount and distribution of heterochromatins, as well as chromosomal rearrangements, were uncovered between the two taxa. In particular, six inversions, five paracentric and one pericentric, were revealed in chromosomes 4, 5 and 7. Comparison of the order of fosmid loci on chromosome 7 of cultivated and wild cucumbers, and the syntenic melon chromosome I suggested that the paracentric inversion in this chromosome occurred during domestication of cucumber. The results support the sub-species status of these two cucumber taxa, and suggest that C. sativus var. hardwickii is the progenitor of cultivated cucumber.
TL;DR: RNA-Seq at low sequence coverage of divergent populations is a fast and effective means of identifying SNPs, with allelic imbalances between phenotypes, suitable for marker development in non-model species lacking complete and well-annotated genome reference sequences.
Abstract: Fast growth is an important and highly desired trait, which affects the profitability of food animal production, with feed costs accounting for the largest proportion of production costs. Traditional phenotype-based selection is typically used to select for growth traits; however, genetic improvement is slow over generations. Single nucleotide polymorphisms (SNPs) explain 90% of the genetic differences between individuals; therefore, they are most suitable for genetic evaluation and strategies that employ molecular genetics for selective breeding. SNPs found within or near a coding sequence are of particular interest because they are more likely to alter the biological function of a protein. We aimed to use SNPs to identify markers and genes associated with genetic variation in growth. RNA-Seq whole-transcriptome analysis of pooled cDNA samples from a population of rainbow trout selected for improved growth versus unselected genetic cohorts (10 fish from 1 full-sib family each) identified SNP markers associated with growth-rate. The allelic imbalances (the ratio between the allele frequencies of the fast growing sample and that of the slow growing sample) were considered at scores .5.0 as an amplification and ,0.2 as loss of heterozygosity. A subset of SNPs (n=54) were validated and evaluated for association with growth traits in 778 individuals of a three-generation parent/offspring panel representing 40 families. Twenty-two SNP markers and one mitochondrial haplotype were significantly associated with growth traits. Polymorphism of 48 of the markers was confirmed in other commercially important aquaculture stocks. Many markers were clustered into genes of metabolic energy production pathways and are suitable candidates for genetic selection. The study demonstrates that RNASeq at low sequence coverage of divergent populations is a fast and effective means of identifying SNPs, with allelic imbalances between phenotypes. This technique is suitable for marker development in non-model species lacking complete and well-annotated genome reference sequences.
TL;DR: Single-nucleotide amplified polymorphisms markers for FAD3 alleles were demonstrated to be a reliable type of SNP markers for unambiguous identification of genotypes with different content of C18:3 in amphidiploid B. napus.
Abstract: Modification of oleic acid (C18:1) and linolenic acid (C18:3) contents in seeds is one of the major goals for quality breeding after removal of erucic acid in oilseed rape (Brassica napus). The fatty acid desaturase genes FAD2 and FAD3 have been shown as the major genes for the control of C18:1 and C18:3 contents. However, the genome structure and locus distributions of the two gene families in amphidiploid B. napus are still not completely understood to date. In the present study, all copies of FAD2 and FAD3 genes in the A- and C-genome of B. napus and its two diploid progenitor species, Brassica rapa and Brassica oleracea, were identified through bioinformatic analysis and extensive molecular cloning. Two FAD2 genes exist in B. rapa and B. oleracea, and four copies of FAD2 genes exist in B. napus. Three and six copies of FAD3 genes were identified in diploid species and amphidiploid species, respectively. The genetic control of high C18:1 and low C18:3 contents in a double haploid population was investigated through mapping of the quantitative trait loci (QTL) for the traits and the molecular cloning of the underlying genes. One major QTL of BnaA.FAD2.a located on A5 chromosome was responsible for the high C18:1 content. A deleted mutation in the BnaA.FAD2.a locus was uncovered, which represented a previously unidentified allele for the high oleic variation in B. napus species. Two major QTLs on A4 and C4 chromosomes were found to be responsible for the low C18:3 content in the DH population as well as in SW Hickory. Furthermore, several single base pair changes in BnaA.FAD3.b and BnaC.FAD3.b were identified to cause the phenotype of low C18:3 content. Based on the results of genetic mapping and identified sequences, allele-specific markers were developed for FAD2 and FAD3 genes. Particularly, single-nucleotide amplified polymorphisms markers for FAD3 alleles were demonstrated to be a reliable type of SNP markers for unambiguous identification of genotypes with different content of C18:3 in amphidiploid B. napus.
TL;DR: The high resolution genetic map developed here should be a useful platform for the assembly of the watermelon genome, for the development of sequence-based markers used in breeding programs, and for the identification of genes associated with important agricultural traits.
Abstract: As part of our ongoing efforts to sequence and map the watermelon (Citrullus spp.) genome, we have constructed a high density genetic linkage map. The map positioned 234 watermelon genome sequence scaffolds (an average size of 1.41 Mb) that cover about 330 Mb and account for 93.5% of the 353 Mb of the assembled genomic sequences of the elite Chinese watermelon line 97103 (Citrullus lanatus var. lanatus). The genetic map was constructed using an F8 population of 103 recombinant inbred lines (RILs). The RILs are derived from a cross between the line 97103 and the United States Plant Introduction (PI) 296341-FR (C. lanatus var. citroides) that contains resistance to fusarium wilt (races 0, 1, and 2). The genetic map consists of eleven linkage groups that include 698 simple sequence repeat (SSR), 219 insertion-deletion (InDel) and 36 structure variation (SV) markers and spans ∼800 cM with a mean marker interval of 0.8 cM. Using fluorescent in situ hybridization (FISH) with 11 BACs that produced chromosome-specifc signals, we have depicted watermelon chromosomes that correspond to the eleven linkage groups constructed in this study. The high resolution genetic map developed here should be a useful platform for the assembly of the watermelon genome, for the development of sequence-based markers used in breeding programs, and for the identification of genes associated with important agricultural traits.
TL;DR: A fast and cost-effective method for genetic mapping using next-generation sequencing that combines single nucleotide polymorphism discovery, mutation localization, and potential identification of causal sequence variants is reported.
Abstract: Genetic mapping of mutations in model systems has facilitated the identification of genes contributing to fundamental biological processes including human diseases. However, this approach has historically required the prior characterization of informative markers. Here we report a fast and cost-effective method for genetic mapping using next-generation sequencing that combines single nucleotide polymorphism discovery, mutation localization, and potential identification of causal sequence variants. In contrast to prior approaches, we have developed a hidden Markov model to narrowly define the mutation area by inferring recombination breakpoints of chromosomes in the mutant pool. In addition, we created an interactive online software resource to facilitate automated analysis of sequencing data and demonstrate its utility in the zebrafish and mouse models. Our novel methodology and online tools will make next-generation sequencing an easily applicable resource for mutation mapping in all model systems.
TL;DR: Despite the ascertainment biases in relation to the Clementine origin, the SNP data confirm the important stratification of the gene pools around C. reticulata as well as previous hypothesis on the origin of secondary species.
Abstract: With the increasing availability of EST databases and whole genome sequences, SNPs have become the most abundant and powerful polymorphic markers. However, SNP chip data generally suffers from ascertainment biases caused by the SNP discovery and selection process in which a small number of individuals are used as discovery panels. The ongoing International Citrus Genome Consortium sequencing project of the highly heterozygous Clementine and sweet orange genomes will soon result in the release of several hundred thousand SNPs. The primary goals of this study were: (i) to estimate the transferability within the genus Citrus of SNPs discovered from Clementine BACend sequencing (BES), (ii) to estimate bias associated with the very narrow discovery panel, and (iii) to evaluate the usefulness of the Clementine-derived SNP markers for diversity analysis and comparative mapping studies between the different cultivated Citrus species. Fifty-four accessions covering the main Citrus species and 52 interspecific hybrids between pummelo and Clementine were genotyped on a GoldenGate array platform using 1,457 SNPs mined from Clementine BES and 37 SNPs identified between and within C. maxima, C. medica, C. reticulata and C. micrantha. Consistent results were obtained from 622 SNP loci. Of these markers, 116 displayed incomplete transferability primarily in C. medica, C. maxima and wild Citrus species. The two primary biases associated with the SNP mining in Clementine were an overestimation of the C. reticulata diversity and an underestimation of the interspecific differentiation. However, the genetic stratification of the gene pool was high, with very frequent significant linkage disequilibrium. Furthermore, the shared intraspecific polymorphism and accession heterozygosity were generally enough to perform interspecific comparative genetic mapping. A set of 622 SNP markers providing consistent results was selected. Of the markers mined from Clementine, 80.5% were successfully transferred to the whole Citrus gene pool. Despite the ascertainment biases in relation to the Clementine origin, the SNP data confirm the important stratification of the gene pools around C. maxima, C. medica and C. reticulata as well as previous hypothesis on the origin of secondary species. The implemented SNP marker set will be very useful for comparative genetic mapping in Citrus and genetic association in C. reticulata.
TL;DR: The intra-specific linkage map should have utility for elucidating the genetic basis of other phenotypic traits in eggplant, and the identification of potential candidate genes for the eggplant QTLs related to anthocyanin accumulation.
Abstract: Both inter- and intra-specific maps have been developed in eggplant (Solanum melongena L.). The former benefit from an enhanced frequency of marker polymorphism, but their relevance to marker-assisted crop breeding is limited. Combining the restriction-site associated DNA strategy with high throughput sequencing has facilitated the discovery of a large number of functional single nucleotide polymorphism (SNP) markers discriminating between the two eggplant mapping population parental lines ‘305E40’ and ‘67/3’. A set of 347 de novo SNPs, together with 84 anchoring markers, were applied to the F2 mapping population bred from the cross ‘305E40’ x ‘67/3’ to construct a linkage map. In all, 415 of the 431 markers were assembled into twelve major and one minor linkage group, spanning 1,390 cM, and the inclusion of established markers allowed each linkage group to be assigned to one of the 12 eggplant chromosomes. The map was then used to discover the genetic basis of seven traits associated with anthocyanin content. Each of the traits proved to be controlled by between one and six quantitative trait loci (QTL), of which at least one was a major QTL. Exploitation of syntenic relationships between the eggplant and tomato genomes facilitated the identification of potential candidate genes for the eggplant QTLs related to anthocyanin accumulation. The intra-specific linkage map should have utility for elucidating the genetic basis of other phenotypic traits in eggplant.
TL;DR: The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using designed SNP markers.
Abstract: Background: Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. Results: An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. Conclusions: The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison.
TL;DR: Mapped PKAMs showed a higher degree of synteny with the genome of Glycine max followed by Medicago truncatula and Lotus japonicus and least with Vigna unguiculata and an integrated genetic map of 996.21 cM was constructed.
Abstract: Single-nucleotide polymorphisms (SNPs, >2000) were discovered by using RNA-seq and allele-specific sequencingapproachesinpigeonpea(Cajanuscajan).FormakingtheSNPgenotypingcost-effective,successful competitive allele-specific polymerase chain reaction (KASPar) assays were developed for 1616 SNPs and referredtoasPKAMs(pigeonpeaKASParassaymarkers).ScreeningofPKAMson24genotypes[23fromcultivated species and 1 wild species (Cajanus scarabaeoides)] defined a set of 1154 polymorphic markers (77.4%) with a polymorphism information content (PIC) value from 0.04 to 0.38. One thousand and ninety-four PKAMs showed polymorphisms between parental lines of the reference mapping population (C. cajan ICP 28 3 C. scarabaeoides ICPW 94). By using high-quality marker genotyping data on 167 F2 lines from the population, a comprehensive genetic map comprising 875 PKAMs with an average intermarker distance of 1.11 cM was developed. Previously mapped 35 simple sequence repeat markers were integrated into the PKAM map and an integrated genetic map of 996.21 cM was constructed. Mapped PKAMs showed a higher degree of synteny with the genome of Glycine max followed by Medicago truncatula and Lotus japonicus and least with Vigna unguiculata. These PKAMs will be useful for genetics research and breeding applications in pigeonpea and for utilizing genome information from other legume species.
TL;DR: Combined inter-/intra-species comparative and phylogenetic analysis of CMF, COL and PRR gene families indicates they evolved prior to the monocot/dicot divergence ∼200 mya, with Poaceae CMF evolution described as the interplay between whole genome duplication in the ancestral cereal, and subsequent clade-specific mutation, deletion and duplication events.
Abstract: Numerous CCT domain genes are known to control flowering in plants. They belong to the CONSTANS-like (COL) and PREUDORESPONSE REGULATOR (PRR) gene families, which in addition to a CCT domain possess B-box or response-regulator domains, respectively. Ghd7 is the most recently identified COL gene to have a proven role in the control of flowering time in the Poaceae. However, as it lacks B-box domains, its inclusion within the COL gene family, technically, is incorrect. Here, we show Ghd7 belongs to a larger family of previously uncharacterized Poaceae genes which possess just a single CCT domain, termed here CCT MOTIF FAMILY (CMF) genes. We molecularly describe the CMF (and related COL and PRR) gene families in four sequenced Poaceae species, as well as in the draft genome assembly of barley (Hordeum vulgare). Genetic mapping of the ten barley CMF genes identified, as well as twelve previously unmapped HvCOL and HvPRR genes, finds the majority map to colinear positions relative to their Poaceae orthologues. Combined inter-/intra-species comparative and phylogenetic analysis of CMF, COL and PRR gene families indicates they evolved prior to the monocot/dicot divergence ∼200 mya, with Poaceae CMF evolution described as the interplay between whole genome duplication in the ancestral cereal, and subsequent clade-specific mutation, deletion and duplication events. Given the proven role of CMF genes in the modulation of cereals flowering, the molecular, phylogenetic and comparative analysis of the Poaceae CMF, COL and PRR gene families presented here provides the foundation from which functional investigation can be undertaken.
TL;DR: Extreme QTL mapping is used to identify the genetic basis of resistance to 13 chemicals in all 6 pairwise crosses of four ecologically and genetically diverse yeast strains, and it is found that the number of loci detected in each experiment was primarily a function of the trait rather than the cross, suggesting that the level of genetic complexity is a consistent property of a trait across different genetic backgrounds.
Abstract: Many questions about the genetic basis of complex traits remain unanswered. This is in part due to the low statistical power of traditional genetic mapping studies. We used a statistically powerful approach, extreme QTL mapping (X-QTL), to identify the genetic basis of resistance to 13 chemicals in all 6 pairwise crosses of four ecologically and genetically diverse yeast strains, and we detected a total of more than 800 loci. We found that the number of loci detected in each experiment was primarily a function of the trait (explaining 46% of the variance) rather than the cross (11%), suggesting that the level of genetic complexity is a consistent property of a trait across different genetic backgrounds. Further, we observed that most loci had trait-specific effects, although a small number of loci with effects in many conditions were identified. We used the patterns of resistance and susceptibility alleles in the four parent strains to make inferences about the allele frequency spectrum of functional variants. We also observed evidence of more complex allelic series at a number of loci, as well as strain-specific signatures of selection. These results improve our understanding of complex traits in yeast and have implications for study design in other organisms.
TL;DR: The results suggest that the upper region of ‘Robusta 5’ linkage group 3 contains multiple genes contributing to fire blight resistance and that their contributions to resistance can vary depending upon pathogen virulence and other factors.
Abstract: Breeding of fire blight resistant scions and rootstocks is a goal of several international apple breeding programs, as options are limited for management of this destructive disease caused by the bacterial pathogen Erwinia amylovora. A broad, large-effect quantitative trait locus (QTL) for fire blight resistance has been reported on linkage group 3 of Malus ‘Robusta 5’. In this study we identified markers derived from putative fire blight resistance genes associated with the QTL by integrating further genetic mapping studies with bioinformatics analysis of transcript profiling data and genome sequence databases. When several defined E.amylovora strains were used to inoculate three progenies from international breeding programs, all with ‘Robusta 5’ as a common parent, two distinct QTLs were detected on linkage group 3, where only one had previously been mapped. In the New Zealand ‘Malling 9’ X ‘Robusta 5’ population inoculated with E. amylovora ICMP11176, the proximal QTL co-located with SNP markers derived from a leucine-rich repeat, receptor-like protein ( MxdRLP1) and a closely linked class 3 peroxidase gene. While the QTL detected in the German ‘Idared’ X ‘Robusta 5’ population inoculated with E. amylovora strains Ea222_JKI or ICMP11176 was approximately 6 cM distal to this, directly below a SNP marker derived from a heat shock 90 family protein gene ( HSP90). In the US ‘Otawa3’ X ‘Robusta5’ population inoculated with E. amylovora strains Ea273 or E2002a, the position of the LOD score peak on linkage group 3 was dependent upon the pathogen strains used for inoculation. One of the five MxdRLP1 alleles identified in fire blight resistant and susceptible cultivars was genetically associated with resistance and used to develop a high resolution melting PCR marker. A resistance QTL detected on linkage group 7 of the US population co-located with another HSP90 gene-family member and a WRKY transcription factor previously associated with fire blight resistance. However, this QTL was not observed in the New Zealand or German populations. The results suggest that the upper region of ‘Robusta 5’ linkage group 3 contains multiple genes contributing to fire blight resistance and that their contributions to resistance can vary depending upon pathogen virulence and other factors. Mapping markers derived from putative fire blight resistance genes has proved a useful aid in defining these QTLs and developing markers for marker-assisted breeding of fire blight resistance.
TL;DR: In this paper, a total of 313 new gene-based markers were developed at target genes, including intron-spanning EST-SNP markers for linkage and association mapping in common beans.
Abstract: In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. In short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation with the usefulness of microsatellites, the molecular markers by excellence in this crop.
TL;DR: The constructed linkage map and determined QTLs can provide a tool for further genetic analysis of the traits and be potential for marker-assisted selection in C. gigas breeding.
Abstract: The Pacific oyster (Crassostrea gigas) is one of the most important oysters cultured worldwide. To analyze the oyster genome and dissect growth-related traits, we constructed a sex-averaged linkage map by combining 64 genomic simple sequence repeats, 42 expressed sequence tag-derived SSRs, and 320 amplified fragment length polymorphism markers in an F1 full-sib family. A total of 426 markers were assigned to 11 linkage groups, spanning 558.2 cM with an average interval of 1.3 cM and 94.7% of genome coverage. Segregation distortion was significant for 18.8% of the markers (P < 0.05), and distorted markers tended to occur on some genetic regions or linkage groups. Most growth-related quantitative traits were highly significantly (P < 0.01) correlated, and principal component analysis obtained four principal components. Quantitative trait locus (QTL) analysis identified three significant QTLs for two principal components, which explained 0.6–13.9% of the phenotypic variation. One QTL for sex was detected on linkage group 6, and the inheritabilities of sex for parental alleles and maternal alleles on that locus C15 are 39.8% and 0.01%, respectively. The constructed linkage map and determined QTLs can provide a tool for further genetic analysis of the traits and be potential for marker-assisted selection in C. gigas breeding.
TL;DR: A new gene near the Rag2 locus for soybean aphid resistance that should be useful in breeding for new aphid-resistant soybean cultivars is mapped and closely linked to this gene are available for marker-assisted breeding.
Abstract: The soybean aphid (Aphis glycines Matsumura) is the most damaging insect pest of soybean [Glycine max (L.) Merr.] in North America. New soybean aphid biotypes have been evolving quickly and at least three confirmed biotypes have been reported in USA. These biotypes are capable of defeating most known aphid resistant soybean genes indicating the need for identification of new genes. Plant Introduction (PI) 567301B was earlier identified to have antixenosis resistance against biotype 1 and 2 of the soybean aphid. Two hundred and three F7:9 recombinant inbred lines (RILs) developed from a cross of soybean aphid susceptible cultivar Wyandot and resistant PI 567301B were used for mapping aphid resistance genes using the quantitative trait loci (QTL) mapping approach. A subset of 94 RILs and 516 polymorphic SNP makers were used to construct a genome-wide molecular linkage map. Two candidate QTL regions for aphid resistance were identified on this linkage map. Fine mapping of the QTL regions was conducted with SSR markers using all 203 RILs. A major gene on chromosome 13 was mapped near the previously identified Rag2 gene. However, an earlier study revealed that the detached leaves of PI 567301B had no resistance against the soybean aphids while the detached leaves of PI 243540 (source of Rag2) maintained aphid resistance. These results and the earlier finding that PI 243540 showed antibiosis resistance and PI 567301B showed antixenosis type resistance, indicating that the aphid resistances in the two PIs are not controlled by the same gene. Thus, we have mapped a new gene near the Rag2 locus for soybean aphid resistance that should be useful in breeding for new aphid-resistant soybean cultivars. Molecular markers closely linked to this gene are available for marker-assisted breeding. Also, the minor locus found on chromosome 8 represents the first reported soybean aphid-resistant locus on this chromosome.
TL;DR: In this paper, a comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes.
TL;DR: The applicability of microsatellite markers in various genotyping applications, determining phylogenetic relationships and comparative mapping in several important grass species is demonstrated and the utility of markers in germplasm characterizations is demonstrated.
Abstract: The unavailability of microsatellite markers and saturated genetic linkage map has restricted the genetic improvement of foxtail millet [Setaria italica (L.) P. Beauv.], despite the fact that in recent times it has been documented as a new model species for biofuel grasses. With the objective to generate a good number of microsatellite markers in foxtail millet cultivar ‘Prasad’, 690 clones were sequenced which generated 112.95 kb high quality sequences obtained from three genomic libraries each enriched with different microsatellite repeat motifs. Microsatellites were identified in 512 (74.2%) of the 690 positive clones and 172 primer pairs (pp) were successfully designed from 249 (48.6%) unique SSR-containing clones. The efficacies of the microsatellite containing genomic sequences were established by superior primer designing ability (69%), PCR amplification efficiency (85.5%) and polymorphic potential (52%) in the parents of F2 mapping population. Out of 172 pp, functional 147 markers showed high level of cross-species amplification (~74%) in six grass species. Higher polymorphism rate and broad range of genetic diversity (0.30–0.69 averaging 0.58) obtained in constructed phylogenetic tree using 52 microsatellite markers, demonstrated the utility of markers in germplasm characterizations. In silico comparative mapping of 147 foxtail millet microsatellite containing sequences against the mapping data of sorghum (~18%), maize (~16%) and rice (~5%) indicated the presence of orthologous sequences of the foxtail millet in the respective species. The result thus demonstrates the applicability of microsatellite markers in various genotyping applications, determining phylogenetic relationships and comparative mapping in several important grass species.
TL;DR: High-density genetic linkage maps of half-smooth tongue sole were developed with 1007 microsatellite markers, two SCAR markers and an F1 family containing 94.65% of the genome and should prove to be very useful for improving growth traits using molecular MAS.
Abstract: High-density genetic linkage maps of half-smooth tongue sole were developed with 1007 microsatellite markers, two SCAR markers and an F1 family containing 94. The female map was composed of 828 markers in 21 linkage groups, covering a total of 1447.3 cM, with an average interval 1.83 cM between markers. The male map consisted of 794 markers in 21 linkage groups, spanning 1497.5 cM, with an average interval of 1.96 cM. The female and male maps had 812 and 785 unique positions, respectively. The genome length of half-smooth tongue sole was estimated to be 1527.7 cM for the females and 1582.1 cM for the males. Based on estimations of the map lengths, the female and male maps covered 94.74 and 94.65% of the genome, respectively. The consensus map was composed of 1007 microsatellite markers and two SCAR markers in 21 linkage groups, covering a total of 1624 cM with an average interval of 1.67 cM. Furthermore, 159 sex-linked SSR markers were identified. Five sex-linked microsatellite markers were confirmed in their association with sex in a large number of individuals selected from different families. These sex-linked markers were mapped on the female map LG1f with zero recombination. Two QTLs that were identified for body weight, designated as We-1 and We-2, accounted for 26.39% and 10.60% of the phenotypic variation. Two QTLs for body width, designated Wi-1 and Wi-2, were mapped in LG4f and accounted for 14.33% and 12.83% of the phenotypic variation, respectively. Seven sex-related loci were mapped in LG1f, LG14f and LG1m by CIM, accounting for 12.5–25.2% of the trait variation. The results should prove to be very useful for improving growth traits using molecular MAS.
TL;DR: This study identified a dominant R gene, Pi2-2, at the Pi2/9 locus from Jefferson, an elite U.S. rice cultivar, through genetic and physical mapping and fine-mapped the gene within a 270-kb region.
Abstract: Utilization of broad-spectrum resistance (R) genes is an effective and economical strategy to control the fungal pathogen Magnaporthe oryzae, the causal agent of the rice blast disease. Among the cloned blast resistance genes, Pi9, Pi2 and Piz-t confer broad-spectrum resistance to diverse M. oryzae isolates and were isolated from the Pi2/9 locus on chromosome 6. Identification and isolation of additional R genes with different resistance spectra from this locus will provide novel genetic resources for better control of this important rice disease. In this study, we identified a dominant R gene, Pi2-2, at the Pi2/9 locus from Jefferson, an elite U.S. rice cultivar, through genetic and physical mapping. Inoculation tests showed that Jefferson has different resistant specificities to M. oryzae isolates compared rice lines with the Pi9, Pi2 and Piz-t genes. Fine mapping delimited Pi2-2 to a 270-kb interval between the markers AP5659-3 and RM19817, and this interval contains three nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes in the Nipponbare genome. Five bacterial artificial chromosome (BAC) clones spanning the region were identified, and a BAC contig covering the Pi2-2 locus was constructed. We identified a new allelic gene at the Pi2/9 locus and fine-mapped the gene within a 270-kb region. Our results provide essential information for the isolation of the Pi2-2 gene and tightly linked DNA markers for rice blast resistance breeding.
TL;DR: Because molecular markers show Mendelian inheritance, it is possible to trace the fingerprint of each organism and determine the evolutionary history of the species by phylogenetic analysis, studies of genetic relationship, population genetic structures and genetic mapping.
Abstract: Powerful tools for the analysis of genetic biodiversity are molecular markers, which are based on DNA sequence polymorphisms. Indeed, DNA sequences determine the diversity of organisms, and therefore, the techniques used to evaluate DNA polymorphisms directly measure the genetic diversity. Because molecular markers show Mendelian inheritance, it is possible to trace the fingerprint of each organism and determine the evolutionary history of the species by phylogenetic analysis, studies of genetic relationship, population genetic structures and genetic mapping.
TL;DR: The results indicate that resistance segregates as expected for a single dominant gene, which is designated Barley stripe mosaic virus resistance 1 (Bsr1), and the power of using RILs to rapidly map the genetic determinants of BSMV resistance in Brachypodium is demonstrated.
Abstract: The ND18 strain of Barley stripe mosaic virus (BSMV) infects several lines of Brachypodium distachyon, a recently developed model system for genomics research in cereals. Among the inbred lines tested, Bd3-1 is highly resistant at 20 to 25°C, whereas Bd21 is susceptible and infection results in an intense mosaic phenotype accompanied by high levels of replicating virus. We generated an F6∶7 recombinant inbred line (RIL) population from a cross between Bd3-1 and Bd21 and used the RILs, and an F2 population of a second Bd21 × Bd3-1 cross to evaluate the inheritance of resistance. The results indicate that resistance segregates as expected for a single dominant gene, which we have designated Barley stripe mosaic virus resistance 1 (Bsr1). We constructed a genetic linkage map of the RIL population using SNP markers to map this gene to within 705 Kb of the distal end of the top of chromosome 3. Additional CAPS and Indel markers were used to fine map Bsr1 to a 23 Kb interval containing five putative genes. Our study demonstrates the power of using RILs to rapidly map the genetic determinants of BSMV resistance in Brachypodium. Moreover, the RILs and their associated genetic map, when combined with the complete genomic sequence of Brachypodium, provide new resources for genetic analyses of many other traits.
TL;DR: A dense genetic map was constructed using a high-throughput 1,536 EST-derived SNP GoldenGate genotyping platform and a global consensus map established by combining the new genetic map with four existing reliable genetic maps of apple, identifying markers with both major and minor conflicts in positioning across all five maps.
Abstract: Genetic maps serve as frameworks for determining the genetic architecture of quantitative traits, assessing structure of a genome, as well as aid in pursuing association mapping and comparative genetic studies. In this study, a dense genetic map was constructed using a high-throughput 1,536 EST-derived SNP GoldenGate genotyping platform and a global consensus map established by combining the new genetic map with four existing reliable genetic maps of apple. The consensus map identified markers with both major and minor conflicts in positioning across all five maps. These major inconsistencies among marker positions were attributed either to structural variations within the apple genome, or among mapping populations, or genotyping technical errors. These also highlighted problems in assembly and anchorage of the reference draft apple genome sequence in regions with known segmental duplications. Markers common across all five apple genetic maps resulted in successful positioning of 2875 markers, consisting of 2033 SNPs and 843 SSRs as well as other specific markers, on the global consensus map. These markers were distributed across all 17 linkage groups, with an average of 169±33 marker per linkage group and with an average distance of 0.70±0.14 cM between markers. The total length of the consensus map was 1991.38 cM with an average length of 117.14±24.43 cM per linkage group. A total of 569 SNPs were mapped onto the genetic map, consisting of 140 recombinant individuals, from our recently developed apple Oligonucleotide pool assays (OPA). The new functional SNPs, along with the dense consensus genetic map, will be useful for high resolution QTL mapping of important traits in apple and for pursuing comparative genetic studies in Rosaceae.
TL;DR: This is the first comprehensive study dedicated to PAL gene family characterization in coffee, allowing us to advance functional studies which are indispensable to learning to decipher what role this family plays in channeling the metabolism of coffee phenylpropanoids.
Abstract: Phenylalanine ammonia lyase (PAL) is the first entry enzyme of the phenylpropanoid pathway producing phenolics, widespread constituents of plant foods and beverages, including chlorogenic acids, polyphenols found at remarkably high levels in the coffee bean and long recognized as powerful antioxidants. To date, whereas PAL is generally encoded by a small gene family, only one gene has been characterized in Coffea canephora (CcPAL1), an economically important species of cultivated coffee. In this study, a molecular- and bioinformatic-based search for CcPAL1 paralogues resulted successfully in identifying two additional genes, CcPAL2 and CcPAL3, presenting similar genomic structures and encoding proteins with close sequences. Genetic mapping helped position each gene in three different coffee linkage groups, CcPAL2 in particular, located in a coffee genome linkage group (F) which is syntenic to a region of Tomato Chromosome 9 containing a PAL gene. These results, combined with a phylogenetic study, strongly suggest that CcPAL2 may be the ancestral gene of C. canephora. A quantitative gene expression analysis was also conducted in coffee tissues, showing that all genes are transcriptionally active, but they present distinct expression levels and patterns. We discovered that CcPAL2 transcripts appeared predominantly in flower, fruit pericarp and vegetative/lignifying tissues like roots and branches, whereas CcPAL1 and CcPAL3 were highly expressed in immature fruit. This is the first comprehensive study dedicated to PAL gene family characterization in coffee, allowing us to advance functional studies which are indispensable to learning to decipher what role this family plays in channeling the metabolism of coffee phenylpropanoids.
TL;DR: High-density genetic linkage maps were constructed for the Japanese flounder and four quantiative trait loci associated with growth traits were mapped on the genetic map, which should prove to be very useful for improving growth traits using molecular MAS.
Abstract: High-density genetic linkage maps were constructed for the Japanese flounder (Paralichthys olivaceus). A total of 1624 microsatellite markers were polymorphic in the reference family. Linkage analysis using JoinMap 4.0 resulted in the mapping of 1487 markers to 24 linkage groups, a result which was consistent with the 24 chromosomes seen in chromosome spreads. The female map was composed of 1257 markers, covering a total of 1663.8 cM with an average interval 1.35 cM between markers. The male map consisted of 1224 markers, spanning 1726.5 cM, with an average interval of 1.44 cM. The genome length in the Japanese flounder was estimated to be 1730.3 cM for the females and 1798.0 cM for the males, a coverage of 96.2% for the female and 96.0% for the male map. The mean recombination at common intervals throughout the genome revealed a slight difference between sexes, i.e. 1.07 times higher in the male than female. High-density genetic linkage maps are very useful for marker-assisted selection (MAS) programs for economically valuable traits in this species and for further evolutionary studies in flatfish and vertebrate species. Furthermore, four quantiative trait loci (QTL) associated with growth traits were mapped on the genetic map. One QTL was identified for body weight on LG 14 f, which explained 14.85% of the total variation of the body weight. Three QTL were identified for body width on LG14f and LG14m, accounting for 16.75%, 13.62% and 13.65% of the total variation in body width, respectively. The additive effects were evident as negative values. There were four QTL for growth traits clustered on LG14, which should prove to be very useful for improving growth traits using molecular MAS.
TL;DR: This study illustrates the power of intron-based markers for linkage and association mapping in common bean and the utility of these markers is discussed in relation with the usefulness of microsatellites, the molecular markers by excellence in this crop.
Abstract: BackgroundIn common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies.ResultsIn this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings.ConclusionsIn short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation with the usefulness of microsatellites, the molecular markers by excellence in this crop.