Sample records for haploid genome size

  1. Dramatic improvement in genome assembly achieved using doubled-haploid genomes.

    PubMed

    Zhang, Hong; Tan, Engkong; Suzuki, Yutaka; Hirose, Yusuke; Kinoshita, Shigeharu; Okano, Hideyuki; Kudoh, Jun; Shimizu, Atsushi; Saito, Kazuyoshi; Watabe, Shugo; Asakawa, Shuichi

    2014-10-27

    Improvement in de novo assembly of large genomes is still to be desired. Here, we improved draft genome sequence quality by employing doubled-haploid individuals. We sequenced wildtype and doubled-haploid Takifugu rubripes genomes, under the same conditions, using the Illumina platform and assembled contigs with SOAPdenovo2. We observed 5.4-fold and 2.6-fold improvement in the sizes of the N50 contig and scaffold of doubled-haploid individuals, respectively, compared to the wildtype, indicating that the use of a doubled-haploid genome aids in accurate genome analysis.

  2. The arbuscular mycorrhizal fungus Glomus intraradices is haploid and has a small genome size in the lower limit of eukaryotes.

    PubMed

    Hijri, Mohamed; Sanders, Ian R

    2004-02-01

    The genome size, complexity, and ploidy of the arbuscular mycorrhizal fungus (AMF) Glomus intraradices was determined using flow cytometry, reassociation kinetics, and genomic reconstruction. Nuclei of G. intraradices from in vitro culture, were analyzed by flow cytometry. The estimated average length of DNA per nucleus was 14.07+/-3.52 Mb. Reassociation kinetics on G. intraradices DNA indicated a haploid genome size of approximately 16.54 Mb, comprising 88.36% single copy DNA, 1.59% repetitive DNA, and 10.05% fold-back DNA. To determine ploidy, the DNA content per nucleus measured by flow cytometry was compared with the genome estimate of reassociation kinetics. G. intraradices was found to have a DNA index (DNA per nucleus per haploid genome size) of approximately 0.9, indicating that it is haploid. Genomic DNA of G. intraradices was also analyzed by genomic reconstruction using four genes (Malate synthase, RecA, Rad32, and Hsp88). Because we used flow cytometry and reassociation kinetics to reveal the genome size of G. intraradices and show that it is haploid, then a similar value for genome size should be found when using genomic reconstruction as long as the genes studied are single copy. The average genome size estimate was 15.74+/-1.69 Mb indicating that these four genes are single copy per haploid genome and per nucleus of G. intraradices. Our results show that the genome size of G. intraradices is much smaller than estimates of other AMF and that the unusually high within-spore genetic variation that is seen in this fungus cannot be due to high ploidy.

  3. Haploid plants produced by centromere-mediated genome elimination.

    PubMed

    Ravi, Maruthachalam; Chan, Simon W L

    2010-03-25

    Production of haploid plants that inherit chromosomes from only one parent can greatly accelerate plant breeding. Haploids generated from a heterozygous individual and converted to diploid create instant homozygous lines, bypassing generations of inbreeding. Two methods are generally used to produce haploids. First, cultured gametophyte cells may be regenerated into haploid plants, but many species and genotypes are recalcitrant to this process. Second, haploids can be induced from rare interspecific crosses, in which one parental genome is eliminated after fertilization. The molecular basis for genome elimination is not understood, but one theory posits that centromeres from the two parent species interact unequally with the mitotic spindle, causing selective chromosome loss. Here we show that haploid Arabidopsis thaliana plants can be easily generated through seeds by manipulating a single centromere protein, the centromere-specific histone CENH3 (called CENP-A in human). When cenh3 null mutants expressing altered CENH3 proteins are crossed to wild type, chromosomes from the mutant are eliminated, producing haploid progeny. Haploids are spontaneously converted into fertile diploids through meiotic non-reduction, allowing their genotype to be perpetuated. Maternal and paternal haploids can be generated through reciprocal crosses. We have also exploited centromere-mediated genome elimination to convert a natural tetraploid Arabidopsis into a diploid, reducing its ploidy to simplify breeding. As CENH3 is universal in eukaryotes, our method may be extended to produce haploids in any plant species.

  4. Effective de novo assembly of fish genome using haploid larvae.

    PubMed

    Iwasaki, Yuki; Nishiki, Issei; Nakamura, Yoji; Yasuike, Motoshige; Kai, Wataru; Nomura, Kazuharu; Yoshida, Kazunori; Nomura, Yousuke; Fujiwara, Atushi; Kobayashi, Takanori; Ototake, Mitsuru

    2016-02-01

    Recent improvements in next-generation sequencing technology have made it possible to do whole genome sequencing, on even non-model eukaryote species with no available reference genomes. However, de novo assembly of diploid genomes is still a big challenge because of allelic variation. The aim of this study was to determine the feasibility of utilizing the genome of haploid fish larvae for de novo assembly of whole-genome sequences. We compared the efficiency of assembly using the haploid genome of yellowtail (Seriola quinqueradiata) with that using the diploid genome obtained from the dam. De novo assembly from the haploid and the diploid sequence reads (100 million reads per each datasets) generated by the Ion Proton sequencer (200 bp) was done under two different assembly algorithms, namely overlap-layout-consensus (OLC) and de Bruijn graph (DBG). This revealed that the assembly of the haploid genome significantly reduced (approximately 22% for OLC, 9% for DBG) the total number of contigs (with longer average and N50 contig lengths) when compared to the diploid genome assembly. The haploid assembly also improved the quality of the scaffolds by reducing the number of regions with unassigned nucleotides (Ns) (total length of Ns; 45,331,916 bp for haploids and 67,724,360 bp for diploids) in OLC-based assemblies. It appears clear that the haploid genome assembly is better because the allelic variation in the diploid genome disrupts the extension of contigs during the assembly process. Our results indicate that utilizing the genome of haploid larvae leads to a significant improvement in the de novo assembly process, thus providing a novel strategy for the construction of reference genomes from non-model diploid organisms such as fish. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.

  5. The Small Nuclear Genomes of Selaginella Are Associated with a Low Rate of Genome Size Evolution.

    PubMed

    Baniaga, Anthony E; Arrigo, Nils; Barker, Michael S

    2016-06-03

    The haploid nuclear genome size (1C DNA) of vascular land plants varies over several orders of magnitude. Much of this observed diversity in genome size is due to the proliferation and deletion of transposable elements. To date, all vascular land plant lineages with extremely small nuclear genomes represent recently derived states, having ancestors with much larger genome sizes. The Selaginellaceae represent an ancient lineage with extremely small genomes. It is unclear how small nuclear genomes evolved in Selaginella We compared the rates of nuclear genome size evolution in Selaginella and major vascular plant clades in a comparative phylogenetic framework. For the analyses, we collected 29 new flow cytometry estimates of haploid genome size in Selaginella to augment publicly available data. Selaginella possess some of the smallest known haploid nuclear genome sizes, as well as the lowest rate of genome size evolution observed across all vascular land plants included in our analyses. Additionally, our analyses provide strong support for a history of haploid nuclear genome size stasis in Selaginella Our results indicate that Selaginella, similar to other early diverging lineages of vascular land plants, has relatively low rates of genome size evolution. Further, our analyses highlight that a rapid transition to a small genome size is only one route to an extremely small genome. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  6. Centromere Size and Its Relationship to Haploid Formation in Plants.

    PubMed

    Wang, Na; Dawe, R Kelly

    2018-03-05

    Wide species crosses often result in uniparental genome elimination and visible failures in centromere function. Crosses involving lines with mutated forms of the CENH3 histone variant that organizes the centromere/kinetochore interface have been shown to have similar effects, inducing haploids at high frequencies. Here, we propose a simple centromere size model that endeavors to explain both observations. It is based on the idea of a quantitative centromere architecture where each centromere in an individual is the same size, and the average size is dictated by a natural equilibrium between bound and unbound CENH3 (and its chaperones or binding proteins). While centromere size is determined by the cellular milieu, centromere positions are heritable and defined by the interactions of a small set of proteins that bind to both DNA and CENH3. Lines with defective or mutated CENH3 have a lower loading capacity and support smaller centromeres. In cases where a line with small or defective centromeres is crossed to a line with larger or normal centromeres, the smaller/defective centromeres are selectively degraded or not maintained, resulting in chromosome loss from the small-centromere parent. The model is testable and generalizable, and helps to explain the counterintuitive observation that inducer lines do not induce haploids when crossed to themselves. Copyright © 2017 The Author. Published by Elsevier Inc. All rights reserved.

  7. Genome size variation in the pine fusiform rust pathogen Cronartium quercuum f.sp. fusiforme as determined by flow cytometry

    Treesearch

    Claire L Anderson; Thomas L Kubisiak; C Dana Nelson; Jason A Smith; John M Davis

    2010-01-01

    The genome size of the pine fusiform rust pathogen Cronartium quercuum f.sp. fusiforme (Cqf) was determined by flow cytometric analysis of propidium iodide-stained, intact haploid pycniospores with haploid spores of two genetically well characterized fungal species, Sclerotinia sclerotiorum and Puccinia graminis f.sp. tritici, as size standards. The Cqf haploid genome...

  8. Novel technologies in doubled haploid line development.

    PubMed

    Ren, Jiaojiao; Wu, Penghao; Trampe, Benjamin; Tian, Xiaolong; Lübberstedt, Thomas; Chen, Shaojiang

    2017-11-01

    haploid inducer line can be transferred (DH) technology can not only shorten the breeding process but also increase genetic gain. Haploid induction and subsequent genome doubling are the two main steps required for DH technology. Haploids have been generated through the culture of immature male and female gametophytes, and through inter- and intraspecific via chromosome elimination. Here, we focus on haploidization via chromosome elimination, especially the recent advances in centromere-mediated haploidization. Once haploids have been induced, genome doubling is needed to produce DH lines. This study has proposed a new strategy to improve haploid genome doubling by combing haploids and minichromosome technology. With the progress in haploid induction and genome doubling methods, DH technology can facilitate reverse breeding, cytoplasmic male sterile (CMS) line production, gene stacking and a variety of other genetic analysis. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  9. Recovery and characterization of a Citrus clementina Hort. ex Tan. 'Clemenules' haploid plant selected to establish the reference whole Citrus genome sequence.

    PubMed

    Aleza, Pablo; Juárez, José; Hernández, María; Pina, José A; Ollitrault, Patrick; Navarro, Luis

    2009-08-22

    In recent years, the development of structural genomics has generated a growing interest in obtaining haploid plants. The use of homozygous lines presents a significant advantage for the accomplishment of sequencing projects. Commercial citrus species are characterized by high heterozygosity, making it difficult to assemble large genome sequences. Thus, the International Citrus Genomic Consortium (ICGC) decided to establish a reference whole citrus genome sequence from a homozygous plant. Due to the existence of important molecular resources and previous success in obtaining haploid clementine plants, haploid clementine was selected as the target for the implementation of the reference whole genome citrus sequence. To obtain haploid clementine lines we used the technique of in situ gynogenesis induced by irradiated pollen. Flow cytometry, chromosome counts and SSR marker (Simple Sequence Repeats) analysis facilitated the identification of six different haploid lines (2n = x = 9), one aneuploid line (2n = 2x+4 = 22) and one doubled haploid plant (2n = 2x = 18) of 'Clemenules' clementine. One of the haploids, obtained directly from an original haploid embryo, grew vigorously and produced flowers after four years. This is the first haploid plant of clementine that has bloomed and we have, for the first time, characterized the histology of haploid and diploid flowers of clementine. Additionally a double haploid plant was obtained spontaneously from this haploid line. The first haploid plant of 'Clemenules' clementine produced directly by germination of a haploid embryo, which grew vigorously and produced flowers, has been obtained in this work. This haploid line has been selected and it is being used by the ICGC to establish the reference sequence of the nuclear genome of citrus.

  10. A Trichosporonales genome tree based on 27 haploid and three evolutionarily conserved 'natural' hybrid genomes.

    PubMed

    Takashima, Masako; Sriswasdi, Sira; Manabe, Ri-Ichiroh; Ohkuma, Moriya; Sugita, Takashi; Iwasaki, Wataru

    2018-01-01

    To construct a backbone tree consisting of basidiomycetous yeasts, draft genome sequences from 25 species of Trichosporonales (Tremellomycetes, Basidiomycota) were generated. In addition to the hybrid genomes of Trichosporon coremiiforme and Trichosporon ovoides that we described previously, we identified an interspecies hybrid genome in Cutaneotrichosporon mucoides (formerly Trichosporon mucoides). This hybrid genome had a gene retention rate of ~55%, and its closest haploid relative was Cutaneotrichosporon dermatis. After constructing the C. mucoides subgenomes, we generated a phylogenetic tree using genome data from the 27 haploid species and the subgenome data from the three hybrid genome species. It was a high-quality tree with 100% bootstrap support for all of the branches. The genome-based tree provided superior resolution compared with previous multi-gene analyses. Although our backbone tree does not include all Trichosporonales genera (e.g. Cryptotrichosporon), it will be valuable for future analyses of genome data. Interest in interspecies hybrid fungal genomes has recently increased because they may provide a basis for new technologies. The three Trichosporonales hybrid genomes described in this study are different from well-characterized hybrid genomes (e.g. those of Saccharomyces pastorianus and Saccharomyces bayanus) because these hybridization events probably occurred in the distant evolutionary past. Hence, they will be useful for studying genome stability following hybridization and speciation events. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  11. A stable hybrid containing haploid genomes of two obligate diploid Candida species.

    PubMed

    Chakraborty, Uttara; Mohamed, Aiyaz; Kakade, Pallavi; Mugasimangalam, Raja C; Sadhale, Parag P; Sanyal, Kaustuv

    2013-08-01

    Candida albicans and Candida dubliniensis are diploid, predominantly asexual human-pathogenic yeasts. In this study, we constructed tetraploid (4n) strains of C. albicans of the same or different lineages by spheroplast fusion. Induction of chromosome loss in the tetraploid C. albicans generated diploid or near-diploid progeny strains but did not produce any haploid progeny. We also constructed stable heterotetraploid somatic hybrid strains (2n + 2n) of C. albicans and C. dubliniensis by spheroplast fusion. Heterodiploid (n + n) progeny hybrids were obtained after inducing chromosome loss in a stable heterotetraploid hybrid. To identify a subset of hybrid heterodiploid progeny strains carrying at least one copy of all chromosomes of both species, unique centromere sequences of various chromosomes of each species were used as markers in PCR analysis. The reduction of chromosome content was confirmed by a comparative genome hybridization (CGH) assay. The hybrid strains were found to be stably propagated. Chromatin immunoprecipitation (ChIP) assays with antibodies against centromere-specific histones (C. albicans Cse4/C. dubliniensis Cse4) revealed that the centromere identity of chromosomes of each species is maintained in the hybrid genomes of the heterotetraploid and heterodiploid strains. Thus, our results suggest that the diploid genome content is not obligatory for the survival of either C. albicans or C. dubliniensis. In keeping with the recent discovery of the existence of haploid C. albicans strains, the heterodiploid strains of our study can be excellent tools for further species-specific genome elimination, yielding true haploid progeny of C. albicans or C. dubliniensis in future.

  12. The evolutionary dynamics of haplodiploidy: Genome architecture and haploid viability

    PubMed Central

    Blackmon, Heath; Hardy, Nate B.; Ross, Laura

    2015-01-01

    Haplodiploid reproduction, in which males are haploid and females are diploid, is widespread among animals, yet we understand little about the forces responsible for its evolution. The current theory is that haplodiploidy has evolved through genetic conflicts, as it provides a transmission advantage to mothers. Male viability is thought to be a major limiting factor; diploid individuals tend to harbor many recessive lethal mutations. This theory predicts that the evolution of haplodiploidy is more likely in male heterogametic lineages with few chromosomes, as genes on the X chromosome are often expressed in a haploid environment, and the fewer the chromosome number, the greater the proportion of the total genome that is X‐linked. We test this prediction with comparative phylogenetic analyses of mites, among which haplodiploidy has evolved repeatedly. We recover a negative correlation between chromosome number and haplodiploidy, find evidence that low chromosome number evolved prior to haplodiploidy, and that it is unlikely that diplodiploidy has reevolved from haplodiploid lineages of mites. These results are consistent with the predicted importance of haploid male viability. PMID:26462452

  13. Genome size of 14 species of fireflies (Insecta, Coleoptera, Lampyridae)

    PubMed Central

    Liu, Gui-Chun; Dong, Zhi-Wei; He, Jin-Wu; Zhao, Ruo-Ping; Wang, Wen; Li, Xue-Yan

    2017-01-01

    Eukaryotic genome size data are important both as the basis for comparative research into genome evolution and as estimators of the cost and difficulty of genome sequencing programs for non-model organisms. In this study, the genome size of 14 species of fireflies (Lampyridae) (two genera in Lampyrinae, three genera in Luciolinae, and one genus in subfamily incertae sedis) were estimated by propidium iodide (PI)-based flow cytometry. The haploid genome sizes of Lampyridae ranged from 0. 42 to 1. 31 pg, a 3. 1-fold span. Genome sizes of the fireflies varied within the tested subfamilies and genera. Lamprigera and Pyrocoelia species had large and small genome sizes, respectively. No correlation was found between genome size and morphological traits such as body length, body width, eye width, and antennal length. Our data provide additional information on genome size estimation of the firefly family Lampyridae. Furthermore, this study will help clarify the cost and difficulty of genome sequencing programs for non-model organisms and will help promote studies on firefly genome evolution. PMID:29280364

  14. The evolutionary dynamics of haplodiploidy: Genome architecture and haploid viability.

    PubMed

    Blackmon, Heath; Hardy, Nate B; Ross, Laura

    2015-11-01

    Haplodiploid reproduction, in which males are haploid and females are diploid, is widespread among animals, yet we understand little about the forces responsible for its evolution. The current theory is that haplodiploidy has evolved through genetic conflicts, as it provides a transmission advantage to mothers. Male viability is thought to be a major limiting factor; diploid individuals tend to harbor many recessive lethal mutations. This theory predicts that the evolution of haplodiploidy is more likely in male heterogametic lineages with few chromosomes, as genes on the X chromosome are often expressed in a haploid environment, and the fewer the chromosome number, the greater the proportion of the total genome that is X-linked. We test this prediction with comparative phylogenetic analyses of mites, among which haplodiploidy has evolved repeatedly. We recover a negative correlation between chromosome number and haplodiploidy, find evidence that low chromosome number evolved prior to haplodiploidy, and that it is unlikely that diplodiploidy has reevolved from haplodiploid lineages of mites. These results are consistent with the predicted importance of haploid male viability. © 2015 The Author(s). Evolution published by Wiley Periodicals, Inc. on behalf of The Society for the Study of Evolution.

  15. Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies

    PubMed Central

    2014-01-01

    Background The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. Results We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. Conclusions In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied. PMID:24647006

  16. Haploids: Constraints and opportunities in plant breeding.

    PubMed

    Dwivedi, Sangam L; Britt, Anne B; Tripathi, Leena; Sharma, Shivali; Upadhyaya, Hari D; Ortiz, Rodomiro

    2015-11-01

    The discovery of haploids in higher plants led to the use of doubled haploid (DH) technology in plant breeding. This article provides the state of the art on DH technology including the induction and identification of haploids, what factors influence haploid induction, molecular basis of microspore embryogenesis, the genetics underpinnings of haploid induction and its use in plant breeding, particularly to fix traits and unlock genetic variation. Both in vitro and in vivo methods have been used to induce haploids that are thereafter chromosome doubled to produce DH. Various heritable factors contribute to the successful induction of haploids, whose genetics is that of a quantitative trait. Genomic regions associated with in vitro and in vivo DH production were noted in various crops with the aid of DNA markers. It seems that F2 plants are the most suitable for the induction of DH lines than F1 plants. Identifying putative haploids is a key issue in haploid breeding. DH technology in Brassicas and cereals, such as barley, maize, rice, rye and wheat, has been improved and used routinely in cultivar development, while in other food staples such as pulses and root crops the technology has not reached to the stage leading to its application in plant breeding. The centromere-mediated haploid induction system has been used in Arabidopsis, but not yet in crops. Most food staples are derived from genomic resources-rich crops, including those with sequenced reference genomes. The integration of genomic resources with DH technology provides new opportunities for the improving selection methods, maximizing selection gains and accelerate cultivar development. Marker-aided breeding and DH technology have been used to improve host plant resistance in barley, rice, and wheat. Multinational seed companies are using DH technology in large-scale production of inbred lines for further development of hybrid cultivars, particularly in maize. The public sector provides support to

  17. A Perfect Match Genomic Landscape Provides a Unified Framework for the Precise Detection of Variation in Natural and Synthetic Haploid Genomes

    PubMed Central

    Palacios-Flores, Kim; García-Sotelo, Jair; Castillo, Alejandra; Uribe, Carina; Aguilar, Luis; Morales, Lucía; Gómez-Romero, Laura; Reyes, José; Garciarubio, Alejandro; Boege, Margareta; Dávila, Guillermo

    2018-01-01

    We present a conceptually simple, sensitive, precise, and essentially nonstatistical solution for the analysis of genome variation in haploid organisms. The generation of a Perfect Match Genomic Landscape (PMGL), which computes intergenome identity with single nucleotide resolution, reveals signatures of variation wherever a query genome differs from a reference genome. Such signatures encode the precise location of different types of variants, including single nucleotide variants, deletions, insertions, and amplifications, effectively introducing the concept of a general signature of variation. The precise nature of variants is then resolved through the generation of targeted alignments between specific sets of sequence reads and known regions of the reference genome. Thus, the perfect match logic decouples the identification of the location of variants from the characterization of their nature, providing a unified framework for the detection of genome variation. We assessed the performance of the PMGL strategy via simulation experiments. We determined the variation profiles of natural genomes and of a synthetic chromosome, both in the context of haploid yeast strains. Our approach uncovered variants that have previously escaped detection. Moreover, our strategy is ideally suited for further refining high-quality reference genomes. The source codes for the automated PMGL pipeline have been deposited in a public repository. PMID:29367403

  18. A Perfect Match Genomic Landscape Provides a Unified Framework for the Precise Detection of Variation in Natural and Synthetic Haploid Genomes.

    PubMed

    Palacios-Flores, Kim; García-Sotelo, Jair; Castillo, Alejandra; Uribe, Carina; Aguilar, Luis; Morales, Lucía; Gómez-Romero, Laura; Reyes, José; Garciarubio, Alejandro; Boege, Margareta; Dávila, Guillermo

    2018-04-01

    We present a conceptually simple, sensitive, precise, and essentially nonstatistical solution for the analysis of genome variation in haploid organisms. The generation of a Perfect Match Genomic Landscape (PMGL), which computes intergenome identity with single nucleotide resolution, reveals signatures of variation wherever a query genome differs from a reference genome. Such signatures encode the precise location of different types of variants, including single nucleotide variants, deletions, insertions, and amplifications, effectively introducing the concept of a general signature of variation. The precise nature of variants is then resolved through the generation of targeted alignments between specific sets of sequence reads and known regions of the reference genome. Thus, the perfect match logic decouples the identification of the location of variants from the characterization of their nature, providing a unified framework for the detection of genome variation. We assessed the performance of the PMGL strategy via simulation experiments. We determined the variation profiles of natural genomes and of a synthetic chromosome, both in the context of haploid yeast strains. Our approach uncovered variants that have previously escaped detection. Moreover, our strategy is ideally suited for further refining high-quality reference genomes. The source codes for the automated PMGL pipeline have been deposited in a public repository. Copyright © 2018 by the Genetics Society of America.

  19. Characterization of in vitro haploid and doubled haploid Chrysanthemum morifolium plants via unfertilized ovule culture for phenotypical traits and DNA methylation pattern

    PubMed Central

    Wang, Haibin; Dong, Bin; Jiang, Jiafu; Fang, Weimin; Guan, Zhiyong; Liao, Yuan; Chen, Sumei; Chen, Fadi

    2014-01-01

    Chrysanthemum is one of important ornamental species in the world. Its highly heterozygous state complicates molecular analysis, so it is of interest to derive haploid forms. A total of 2579 non-fertilized chrysanthemum ovules pollinated by Argyranthemum frutescens were cultured in vitro to isolate haploid progeny. One single regenerant emerged from each of three of the 105 calli produced. Chromosome counts and microsatellite fingerprinting showed that only one of the regenerants was a true haploid. Nine doubled haploid derivatives were subsequently generated by colchicine treatment of 80 in vitro cultured haploid nodal segments. Morphological screening showed that the haploid plant was shorter than the doubled haploids, and developed smaller leaves, flowers, and stomata. An in vitro pollen germination test showed that few of the haploid's pollen were able to germinate and those which did so were abnormal. Both the haploid and the doubled haploids produced yellow flowers, whereas those of the maternal parental cultivar were mauve. Methylation-sensitive amplification polymorphism (MSAP) profiling was further used to detect alterations in cytosine methylation caused by the haploidization and/or the chromosome doubling processes. While 52.2% of the resulting amplified fragments were cytosine methylated in the maternal parent's genome, the corresponding proportions for the haploid's and doubled haploids' genomes were, respectively, 47.0 and 51.7%, demonstrating a reduction in global cytosine methylation caused by haploidization and a partial recovery following chromosome doubling. PMID:25566305

  20. Genome size of termites (Insecta, Dictyoptera, Isoptera) and wood roaches (Insecta, Dictyoptera, Cryptocercidae)

    NASA Astrophysics Data System (ADS)

    Koshikawa, Shigeyuki; Miyazaki, Satoshi; Cornette, Richard; Matsumoto, Tadao; Miura, Toru

    2008-09-01

    The evolution of genome size has been discussed in relation to the evolution of various biological traits. In the present study, the genome sizes of 22 dictyopteran species were estimated by Feulgen image analysis densitometry and 6-diamidino-2-phenylindole (DAPI)-based flow cytometry. The haploid genome sizes ( C-values) of termites (Isoptera) ranged from 0.58 to 1.90 pg, and those of Cryptocercus wood roaches (Cryptocercidae) were 1.16 to 1.32 pg. Compared to known values of other cockroaches (Blattaria) and mantids (Mantodea), these values are low. A relatively small genome size appears to be a (syn)apomorphy of Isoptera + Cryptocercus, together with their sociality. In some phylogenetic groups, genome size evolution is thought to be influenced by selective pressure on a particular trait, such as cell size or rate of development. The present results raise the possibility that genome size is influenced by selective pressures on traits associated with the evolution of sociality.

  1. Genome size of termites (Insecta, Dictyoptera, Isoptera) and wood roaches (Insecta, Dictyoptera, Cryptocercidae).

    PubMed

    Koshikawa, Shigeyuki; Miyazaki, Satoshi; Cornette, Richard; Matsumoto, Tadao; Miura, Toru

    2008-09-01

    The evolution of genome size has been discussed in relation to the evolution of various biological traits. In the present study, the genome sizes of 22 dictyopteran species were estimated by Feulgen image analysis densitometry and 6-diamidino-2-phenylindole (DAPI)-based flow cytometry. The haploid genome sizes (C-values) of termites (Isoptera) ranged from 0.58 to 1.90 pg, and those of Cryptocercus wood roaches (Cryptocercidae) were 1.16 to 1.32 pg. Compared to known values of other cockroaches (Blattaria) and mantids (Mantodea), these values are low. A relatively small genome size appears to be a (syn)apomorphy of Isoptera + Cryptocercus, together with their sociality. In some phylogenetic groups, genome size evolution is thought to be influenced by selective pressure on a particular trait, such as cell size or rate of development. The present results raise the possibility that genome size is influenced by selective pressures on traits associated with the evolution of sociality.

  2. Megacycles of atmospheric carbon dioxide concentration correlate with fossil plant genome size.

    PubMed

    Franks, Peter J; Freckleton, Rob P; Beaulieu, Jeremy M; Leitch, Ilia J; Beerling, David J

    2012-02-19

    Tectonic processes drive megacycles of atmospheric carbon dioxide (CO(2)) concentration, c(a), that force large fluctuations in global climate. With a period of several hundred million years, these megacycles have been linked to the evolution of vascular plants, but adaptation at the subcellular scale has been difficult to determine because fossils typically do not preserve this information. Here we show, after accounting for evolutionary relatedness using phylogenetic comparative methods, that plant nuclear genome size (measured as the haploid DNA amount) and the size of stomatal guard cells are correlated across a broad taxonomic range of extant species. This phylogenetic regression was used to estimate the mean genome size of fossil plants from the size of fossil stomata. For the last 400 Myr, spanning almost the full evolutionary history of vascular plants, we found a significant correlation between fossil plant genome size and c(a), modelled independently using geochemical data. The correlation is consistent with selection for stomatal size and genome size by c(a) as plants adapted towards optimal leaf gas exchange under a changing CO(2) regime. Our findings point to the possibility that major episodes of change in c(a) throughout Earth history might have selected for changes in genome size, influencing plant diversification.

  3. Evolution of haploid-diploid life cycles when haploid and diploid fitnesses are not equal.

    PubMed

    Scott, Michael F; Rescan, Marie

    2017-02-01

    Many organisms spend a significant portion of their life cycle as haploids and as diploids (a haploid-diploid life cycle). However, the evolutionary processes that could maintain this sort of life cycle are unclear. Most previous models of ploidy evolution have assumed that the fitness effects of new mutations are equal in haploids and homozygous diploids, however, this equivalency is not supported by empirical data. With different mutational effects, the overall (intrinsic) fitness of a haploid would not be equal to that of a diploid after a series of substitution events. Intrinsic fitness differences between haploids and diploids can also arise directly, for example because diploids tend to have larger cell sizes than haploids. Here, we incorporate intrinsic fitness differences into genetic models for the evolution of time spent in the haploid versus diploid phases, in which ploidy affects whether new mutations are masked. Life-cycle evolution can be affected by intrinsic fitness differences between phases, the masking of mutations, or a combination of both. We find parameter ranges where these two selective forces act and show that the balance between them can favor convergence on a haploid-diploid life cycle, which is not observed in the absence of intrinsic fitness differences. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.

  4. Evolution of haploid selection in predominantly diploid organisms

    PubMed Central

    Otto, Sarah P.; Scott, Michael F.; Immler, Simone

    2015-01-01

    Diploid organisms manipulate the extent to which their haploid gametes experience selection. Animals typically produce sperm with a diploid complement of most proteins and RNA, limiting selection on the haploid genotype. Plants, however, exhibit extensive expression in pollen, with actively transcribed haploid genomes. Here we analyze models that track the evolution of genes that modify the strength of haploid selection to predict when evolution intensifies and when it dampens the “selective arena” within which male gametes compete for fertilization. Considering deleterious mutations, evolution leads diploid mothers to strengthen selection among haploid sperm/pollen, because this reduces the mutation load inherited by their diploid offspring. If, however, selection acts in opposite directions in haploids and diploids (“ploidally antagonistic selection”), mothers evolve to reduce haploid selection to avoid selectively amplifying alleles harmful to their offspring. Consequently, with maternal control, selection in the haploid phase either is maximized or reaches an intermediate state, depending on the deleterious mutation rate relative to the extent of ploidally antagonistic selection. By contrast, evolution generally leads diploid fathers to mask mutations in their gametes to the maximum extent possible, whenever masking (e.g., through transcript sharing) increases the average fitness of a father’s gametes. We discuss the implications of this maternal–paternal conflict over the extent of haploid selection and describe empirical studies needed to refine our understanding of haploid selection among seemingly diploid organisms. PMID:26669442

  5. Genome size evolution at the speciation level: the cryptic species complex Brachionus plicatilis (Rotifera).

    PubMed

    Stelzer, Claus-Peter; Riss, Simone; Stadler, Peter

    2011-04-07

    Studies on genome size variation in animals are rarely done at lower taxonomic levels, e.g., slightly above/below the species level. Yet, such variation might provide important clues on the tempo and mode of genome size evolution. In this study we used the flow-cytometry method to study the evolution of genome size in the rotifer Brachionus plicatilis, a cryptic species complex consisting of at least 14 closely related species. We found an unexpectedly high variation in this species complex, with genome sizes ranging approximately seven-fold (haploid '1C' genome sizes: 0.056-0.416 pg). Most of this variation (67%) could be ascribed to the major clades of the species complex, i.e. clades that are well separated according to most species definitions. However, we also found substantial variation (32%) at lower taxonomic levels--within and among genealogical species--and, interestingly, among species pairs that are not completely reproductively isolated. In one genealogical species, called B. 'Austria', we found greatly enlarged genome sizes that could roughly be approximated as multiples of the genomes of its closest relatives, which suggests that whole-genome duplications have occurred early during separation of this lineage. Overall, genome size was significantly correlated to egg size and body size, even though the latter became non-significant after controlling for phylogenetic non-independence. Our study suggests that substantial genome size variation can build up early during speciation, potentially even among isolated populations. An alternative, but not mutually exclusive interpretation might be that reproductive isolation tends to build up unusually slow in this species complex.

  6. Genome size evolution at the speciation level: The cryptic species complex Brachionus plicatilis (Rotifera)

    PubMed Central

    2011-01-01

    Background Studies on genome size variation in animals are rarely done at lower taxonomic levels, e.g., slightly above/below the species level. Yet, such variation might provide important clues on the tempo and mode of genome size evolution. In this study we used the flow-cytometry method to study the evolution of genome size in the rotifer Brachionus plicatilis, a cryptic species complex consisting of at least 14 closely related species. Results We found an unexpectedly high variation in this species complex, with genome sizes ranging approximately seven-fold (haploid '1C' genome sizes: 0.056-0.416 pg). Most of this variation (67%) could be ascribed to the major clades of the species complex, i.e. clades that are well separated according to most species definitions. However, we also found substantial variation (32%) at lower taxonomic levels - within and among genealogical species - and, interestingly, among species pairs that are not completely reproductively isolated. In one genealogical species, called B. 'Austria', we found greatly enlarged genome sizes that could roughly be approximated as multiples of the genomes of its closest relatives, which suggests that whole-genome duplications have occurred early during separation of this lineage. Overall, genome size was significantly correlated to egg size and body size, even though the latter became non-significant after controlling for phylogenetic non-independence. Conclusions Our study suggests that substantial genome size variation can build up early during speciation, potentially even among isolated populations. An alternative, but not mutually exclusive interpretation might be that reproductive isolation tends to build up unusually slow in this species complex. PMID:21473744

  7. Coconut genome size determined by flow cytometry: Tall versus Dwarf types.

    PubMed

    Freitas Neto, M; Pereira, T N S; Geronimo, I G C; Azevedo, A O N; Ramos, S R R; Pereira, M G

    2016-02-11

    Coconuts (Cocos nucifera L.) are tropical palm trees that are classified into Tall and Dwarf types based on height, and both types are diploid (2n = 2x = 32 chromosomes). The reproduction mode is autogamous for Dwarf types and allogamous for Tall types. One hypothesis for the origin of the Dwarf coconut suggests that it is a Tall variant that resulted from either mutation or inbreeding, and differences in genome size between the two types would support this hypothesis. In this study, we estimated the genome sizes of 14 coconut accessions (eight Tall and six Dwarf types) using flow cytometry. Nuclei were extracted from leaf discs and stained with propidium iodide, and Pisum sativum (2C = 9.07 pg DNA) was used as an internal standard. Histograms with good resolution and low coefficients of variation (2.5 to 3.2%) were obtained. The 2C DNA content ranged from 5.72 to 5.48 pg for Tall accessions and from 5.58 to 5.52 pg for Dwarf accessions. The mean genome sizes for Tall and Dwarf specimens were 5.59 and 5.55 pg, respectively. Among all accessions, Rennel Island Tall had the highest mean DNA content (5.72 pg), whereas West African Tall had the lowest (5.48 pg). The mean coconut genome size (2C = 5.57 pg, corresponding to 2723.73 Mbp/haploid set) was classified as small. Only small differences in genome size existed among the coconut accessions, suggesting that the Dwarf type did not evolve from the Tall type.

  8. Evolution of genome size and chromosome number in the carnivorous plant genus Genlisea (Lentibulariaceae), with a new estimate of the minimum genome size in angiosperms

    PubMed Central

    Fleischmann, Andreas; Michael, Todd P.; Rivadavia, Fernando; Sousa, Aretuza; Wang, Wenqin; Temsch, Eva M.; Greilhuber, Johann; Müller, Kai F.; Heubl, Günther

    2014-01-01

    Background and Aims Some species of Genlisea possess ultrasmall nuclear genomes, the smallest known among angiosperms, and some have been found to have chromosomes of diminutive size, which may explain why chromosome numbers and karyotypes are not known for the majority of species of the genus. However, other members of the genus do not possess ultrasmall genomes, nor do most taxa studied in related genera of the family or order. This study therefore examined the evolution of genome sizes and chromosome numbers in Genlisea in a phylogenetic context. The correlations of genome size with chromosome number and size, with the phylogeny of the group and with growth forms and habitats were also examined. Methods Nuclear genome sizes were measured from cultivated plant material for a comprehensive sampling of taxa, including nearly half of all species of Genlisea and representing all major lineages. Flow cytometric measurements were conducted in parallel in two laboratories in order to compare the consistency of different methods and controls. Chromosome counts were performed for the majority of taxa, comparing different staining techniques for the ultrasmall chromosomes. Key Results Genome sizes of 15 taxa of Genlisea are presented and interpreted in a phylogenetic context. A high degree of congruence was found between genome size distribution and the major phylogenetic lineages. Ultrasmall genomes with 1C values of <100 Mbp were almost exclusively found in a derived lineage of South American species. The ancestral haploid chromosome number was inferred to be n = 8. Chromosome numbers in Genlisea ranged from 2n = 2x = 16 to 2n = 4x = 32. Ascendant dysploid series (2n = 36, 38) are documented for three derived taxa. The different ploidy levels corresponded to the two subgenera, but were not directly correlated to differences in genome size; the three different karyotype ranges mirrored the different sections of the genus. The smallest known plant genomes were not found in

  9. Induction of gynogenetic and androgenetic haploid and doubled haploid development in the brown trout (Salmo trutta Linnaeus 1758).

    PubMed

    Michalik, O; Dobosz, S; Zalewski, T; Sapota, M; Ocalewicz, K

    2015-04-01

    Gynogenetic and androgenetic brown trout (Salmo trutta Linnaeus 1758) haploids (Hs) and doubled haploids (DHs) were produced in the present research. Haploid development was induced by radiation-induced genetic inactivation of spermatozoa (gynogenesis) or eggs (androgenesis) before insemination. To provide DHs, gynogenetic and androgenetic haploid zygotes were subjected to the high pressure shock to suppress the first mitotic cleavage. Among haploids, gynogenetic embryos were showing lower mortality when compared to the androgenetic embryos; however, most of them die before the first feeding stage. Gynogenetic doubled haploids provided in the course of the brown trout eggs activation performed by homologous and heterologous sperm (rainbow trout) were developing equally showing hatching rates of 14.76 ± 2.4% and 16.14 ± 2.90% and the survival rates at the first feeding stage of 10.48 ± 3.48% and 12.78 ± 2.18%, respectively. Significantly, lower survival rate was observed among androgenetic progenies from the diploid groups with only few specimens that survived to the first feeding stage. Cytogenetic survey showed that among embryos from the diploid variants of the research, only gynogenetic individuals possessed doubled sets of chromosomes. Thus, it is reasonable to assume that radiation employed for the genetic inactivation of the brown trout eggs misaligned mechanism responsible for the cell divisions and might have delayed or even arrested the first mitotic cleavage in the androgenetic brown trout zygotes. Moreover, protocol for the radiation-induced inactivation of the paternal and maternal genome should be adjusted as some of the cytogenetically surveyed gynogenetic and androgenetic embryos exhibited fragments of the irradiated chromosomes. © 2015 Blackwell Verlag GmbH.

  10. Sequencing and assembly of the 22-gb loblolly pine genome.

    PubMed

    Zimin, Aleksey; Stevens, Kristian A; Crepeau, Marc W; Holtz-Morris, Ann; Koriabine, Maxim; Marçais, Guillaume; Puiu, Daniela; Roberts, Michael; Wegrzyn, Jill L; de Jong, Pieter J; Neale, David B; Salzberg, Steven L; Yorke, James A; Langley, Charles H

    2014-03-01

    Conifers are the predominant gymnosperm. The size and complexity of their genomes has presented formidable technical challenges for whole-genome shotgun sequencing and assembly. We employed novel strategies that allowed us to determine the loblolly pine (Pinus taeda) reference genome sequence, the largest genome assembled to date. Most of the sequence data were derived from whole-genome shotgun sequencing of a single megagametophyte, the haploid tissue of a single pine seed. Although that constrained the quantity of available DNA, the resulting haploid sequence data were well-suited for assembly. The haploid sequence was augmented with multiple linking long-fragment mate pair libraries from the parental diploid DNA. For the longest fragments, we used novel fosmid DiTag libraries. Sequences from the linking libraries that did not match the megagametophyte were identified and removed. Assembly of the sequence data were aided by condensing the enormous number of paired-end reads into a much smaller set of longer "super-reads," rendering subsequent assembly with an overlap-based assembly algorithm computationally feasible. To further improve the contiguity and biological utility of the genome sequence, additional scaffolding methods utilizing independent genome and transcriptome assemblies were implemented. The combination of these strategies resulted in a draft genome sequence of 20.15 billion bases, with an N50 scaffold size of 66.9 kbp.

  11. The resurgence of haploids in higher plants.

    PubMed

    Forster, Brian P; Heberle-Bors, Erwin; Kasha, Ken J; Touraev, Alisher

    2007-08-01

    The life cycle of plants proceeds via alternating generations of sporophytes and gametophytes. The dominant and most obvious life form of higher plants is the free-living sporophyte. The sporophyte is the product of fertilization of male and female gametes and contains a set of chromosomes from each parent; its genomic constitution is 2n. Chromosome reduction at meiosis means cells of the gametophytes carry half the sporophytic complement of chromosomes (n). Plant haploid research began with the discovery that sporophytes can be produced in higher plants carrying the gametic chromosome number (n instead of 2n) and that their chromosome number can subsequently be doubled up by colchicine treatment. Recent technological innovations, greater understanding of underlying control mechanisms and an expansion of end-user applications has brought about a resurgence of interest in haploids in higher plants.

  12. Chromosome number reduction in the sister clade of Carica papaya with concomitant genome size doubling.

    PubMed

    Rockinger, Alexander; Sousa, Aretuza; Carvalho, Fernanda A; Renner, Susanne S

    2016-06-01

    Caricaceae include six genera and 34 species, among them papaya, a model species in plant sex chromosome research. The family was held to have a conserved karyotype with 2n = 18 chromosomes, an assumption based on few counts. We examined the karyotypes and genome size of species from all genera to test for possible cytogenetic variation. We used fluorescent in situ hybridization using standard telomere, 5S, and 45S rDNA probes. New and published data were combined with a phylogeny, molecular clock dating, and C values (available for ∼50% of the species) to reconstruct genome evolution. The African genus Cylicomorpha, which is sister to the remaining Caricaceae (all neotropical), has 2n = 18, as do the species in two other genera. A Mexican clade of five species that includes papaya, however, has 2n = 18 (papaya), 2n = 16 (Horovitzia cnidoscoloides), and 2n = 14 (Jarilla caudata and J. heterophylla; third Jarilla not counted), with the phylogeny indicating that the dysploidy events occurred ∼16.6 and ∼5.5 million years ago and that Jarilla underwent genome size doubling (∼450 to 830-920 Mbp/haploid genome). Pericentromeric interstitial telomere repeats occur in both Jarilla adjacent to 5S rDNA sites, and the variability of 5S rDNA sites across all genera is high. On the basis of outgroup comparison, 2n = 18 is the ancestral number, and repeated chromosomal fusions with simultaneous genome size increase as a result of repetitive elements accumulating near centromeres characterize the papaya clade. These results have implications for ongoing genome assemblies in Caricaceae. © 2016 Botanical Society of America.

  13. Identifying structural variation in haploid microbial genomes from short-read resequencing data using breseq.

    PubMed

    Barrick, Jeffrey E; Colburn, Geoffrey; Deatherage, Daniel E; Traverse, Charles C; Strand, Matthew D; Borges, Jordan J; Knoester, David B; Reba, Aaron; Meyer, Austin G

    2014-11-29

    Mutations that alter chromosomal structure play critical roles in evolution and disease, including in the origin of new lifestyles and pathogenic traits in microbes. Large-scale rearrangements in genomes are often mediated by recombination events involving new or existing copies of mobile genetic elements, recently duplicated genes, or other repetitive sequences. Most current software programs for predicting structural variation from short-read DNA resequencing data are intended primarily for use on human genomes. They typically disregard information in reads mapping to repeat sequences, and significant post-processing and manual examination of their output is often required to rule out false-positive predictions and precisely describe mutational events. We have implemented an algorithm for identifying structural variation from DNA resequencing data as part of the breseq computational pipeline for predicting mutations in haploid microbial genomes. Our method evaluates the support for new sequence junctions present in a clonal sample from split-read alignments to a reference genome, including matches to repeat sequences. Then, it uses a statistical model of read coverage evenness to accept or reject these predictions. Finally, breseq combines predictions of new junctions and deleted chromosomal regions to output biologically relevant descriptions of mutations and their effects on genes. We demonstrate the performance of breseq on simulated Escherichia coli genomes with deletions generating unique breakpoint sequences, new insertions of mobile genetic elements, and deletions mediated by mobile elements. Then, we reanalyze data from an E. coli K-12 mutation accumulation evolution experiment in which structural variation was not previously identified. Transposon insertions and large-scale chromosomal changes detected by breseq account for ~25% of spontaneous mutations in this strain. In all cases, we find that breseq is able to reliably predict structural variation

  14. Intron size and genome size in plants.

    Treesearch

    J. Wendel; R. Cronn; I. Alvarez; B. Liu; R. Small; D. Senchina

    2002-01-01

    It has long been known that genomes vary over a remarkable range of sizes in both plants (Bennett, Cox, and Leitch 1997) and animals (Gregory 2001). It also has become evident that across the broad phylogenetic sweep, genome size may be correlated with intron size (Deutsch and Long 1999; Vinogradov 1999; McLysaght et al. 2000), suggesting that some component of genome...

  15. A genotyping system capable of simultaneously analyzing >1000 single nucleotide polymorphisms in a haploid genome.

    PubMed

    Wang, Hui-Yun; Luo, Minjie; Tereshchenko, Irina V; Frikker, Danielle M; Cui, Xiangfeng; Li, James Y; Hu, Guohong; Chu, Yi; Azaro, Marco A; Lin, Yong; Shen, Li; Yang, Qifeng; Kambouris, Manousos E; Gao, Richeng; Shih, Weichung; Li, Honghua

    2005-02-01

    A high-throughput genotyping system for scoring single nucleotide polymorphisms (SNPs) has been developed. With this system, >1000 SNPs can be analyzed in a single assay, with a sensitivity that allows the use of single haploid cells as starting material. In the multiplex polymorphic sequence amplification step, instead of attaching universal sequences to the amplicons, primers that are unlikely to have nonspecific and productive interactions are used. Genotypes of SNPs are then determined by using the widely accessible microarray technology and the simple single-base extension assay. Three SNP panels, each consisting of >1000 SNPs, were incorporated into this system. The system was used to analyze 24 human genomic DNA samples. With 5 ng of human genomic DNA, the average detection rate was 98.22% when single probes were used, and 96.71% could be detected by dual probes in different directions. When single sperm cells were used, 91.88% of the SNPs were detectable, which is comparable to the level that was reached when very few genetic markers were used. By using a dual-probe assay, the average genotyping accuracy was 99.96% for 5 ng of human genomic DNA and 99.95% for single sperm. This system may be used to significantly facilitate large-scale genetic analysis even if the amount of DNA template is very limited or even highly degraded as that obtained from paraffin-embedded cancer specimens, and to make many unpractical research projects highly realistic and affordable.

  16. Ninety-six haploid yeast strains with individual disruptions of open reading frames between YOR097C and YOR192C, constructed for the Saccharomyces genome deletion project, have an additional mutation in the mismatch repair gene MSH3.

    PubMed

    Lehner, Kevin R; Stone, Megan M; Farber, Rosann A; Petes, Thomas D

    2007-11-01

    As part of the Saccharomyces Genome Deletion Project, sets of presumably isogenic haploid and diploid strains that differed only by single gene deletions were constructed. We found that one set of 96 strains (containing deletions of ORFs located between YOR097C and YOR192C) in the collection, which was derived from the haploid BY4741, has an additional mutation in the MSH3 mismatch repair gene.

  17. Salix transect of Europe: variation in ploidy and genome size in willow-associated common nettle, Urtica dioica L. sens. lat., from Greece to arctic Norway.

    PubMed

    Cronk, Quentin; Hidalgo, Oriane; Pellicer, Jaume; Percy, Diana; Leitch, Ilia J

    2016-01-01

    The common stinging nettle, Urtica dioica L. sensu lato, is an invertebrate "superhost", its clonal patches maintaining large populations of insects and molluscs. It is extremely widespread in Europe and highly variable, and two ploidy levels (diploid and tetraploid) are known. However, geographical patterns in cytotype variation require further study. We assembled a collection of nettles in conjunction with a transect of Europe from the Aegean to Arctic Norway (primarily conducted to examine the diversity of Salix and Salix -associated insects). Using flow cytometry to measure genome size, our sample of 29 plants reveals 5 diploids and 24 tetraploids. Two diploids were found in SE Europe (Bulgaria and Romania) and three diploids in S. Finland. More detailed cytotype surveys in these regions are suggested. The tetraploid genome size (2C value) varied between accessions from 2.36 to 2.59 pg. The diploids varied from 1.31 to 1.35 pg per 2C nucleus, equivalent to a haploid genome size of c. 650 Mbp. Within the tetraploids, we find that the most northerly samples (from N. Finland and arctic Norway) have a generally higher genome size. This is possibly indicative of a distinct population in this region.

  18. Semiconservative quasispecies equations for polysomic genomes: The general case

    NASA Astrophysics Data System (ADS)

    Itan, Eran; Tannenbaum, Emmanuel

    2010-06-01

    This paper develops a formulation of the quasispecies equations appropriate for polysomic, semiconservatively replicating genomes. This paper is an extension of previous work on the subject, which considered the case of haploid genomes. Here, we develop a more general formulation of the quasispecies equations that is applicable to diploid and even polyploid genomes. Interestingly, with an appropriate classification of population fractions, we obtain a system of equations that is formally identical to the haploid case. As with the work for haploid genomes, we consider both random and immortal DNA strand chromosome segregation mechanisms. However, in contrast to the haploid case, we have found that an analytical solution for the mean fitness is considerably more difficult to obtain for the polyploid case. Accordingly, whereas for the haploid case we obtained expressions for the mean fitness for the case of an analog of the single-fitness-peak landscape for arbitrary lesion repair probabilities (thereby allowing for noncomplementary genomes), here we solve for the mean fitness for the restricted case of perfect lesion repair.

  19. Sauropod dinosaurs evolved moderately sized genomes unrelated to body size.

    PubMed

    Organ, Chris L; Brusatte, Stephen L; Stein, Koen

    2009-12-22

    Sauropodomorph dinosaurs include the largest land animals to have ever lived, some reaching up to 10 times the mass of an African elephant. Despite their status defining the upper range for body size in land animals, it remains unknown whether sauropodomorphs evolved larger-sized genomes than non-avian theropods, their sister taxon, or whether a relationship exists between genome size and body size in dinosaurs, two questions critical for understanding broad patterns of genome evolution in dinosaurs. Here we report inferences of genome size for 10 sauropodomorph taxa. The estimates are derived from a Bayesian phylogenetic generalized least squares approach that generates posterior distributions of regression models relating genome size to osteocyte lacunae volume in extant tetrapods. We estimate that the average genome size of sauropodomorphs was 2.02 pg (range of species means: 1.77-2.21 pg), a value in the upper range of extant birds (mean = 1.42 pg, range: 0.97-2.16 pg) and near the average for extant non-avian reptiles (mean = 2.24 pg, range: 1.05-5.44 pg). The results suggest that the variation in size and architecture of genomes in extinct dinosaurs was lower than the variation found in mammals. A substantial difference in genome size separates the two major clades within dinosaurs, Ornithischia (large genomes) and Saurischia (moderate to small genomes). We find no relationship between body size and estimated genome size in extinct dinosaurs, which suggests that neutral forces did not dominate the evolution of genome size in this group.

  20. Sauropod dinosaurs evolved moderately sized genomes unrelated to body size

    PubMed Central

    Organ, Chris L.; Brusatte, Stephen L.; Stein, Koen

    2009-01-01

    Sauropodomorph dinosaurs include the largest land animals to have ever lived, some reaching up to 10 times the mass of an African elephant. Despite their status defining the upper range for body size in land animals, it remains unknown whether sauropodomorphs evolved larger-sized genomes than non-avian theropods, their sister taxon, or whether a relationship exists between genome size and body size in dinosaurs, two questions critical for understanding broad patterns of genome evolution in dinosaurs. Here we report inferences of genome size for 10 sauropodomorph taxa. The estimates are derived from a Bayesian phylogenetic generalized least squares approach that generates posterior distributions of regression models relating genome size to osteocyte lacunae volume in extant tetrapods. We estimate that the average genome size of sauropodomorphs was 2.02 pg (range of species means: 1.77–2.21 pg), a value in the upper range of extant birds (mean = 1.42 pg, range: 0.97–2.16 pg) and near the average for extant non-avian reptiles (mean = 2.24 pg, range: 1.05–5.44 pg). The results suggest that the variation in size and architecture of genomes in extinct dinosaurs was lower than the variation found in mammals. A substantial difference in genome size separates the two major clades within dinosaurs, Ornithischia (large genomes) and Saurischia (moderate to small genomes). We find no relationship between body size and estimated genome size in extinct dinosaurs, which suggests that neutral forces did not dominate the evolution of genome size in this group. PMID:19793755

  1. A haploid system of sex determination in the brown alga Ectocarpus sp.

    PubMed

    Ahmed, Sophia; Cock, J Mark; Pessia, Eugenie; Luthringer, Remy; Cormier, Alexandre; Robuchon, Marine; Sterck, Lieven; Peters, Akira F; Dittami, Simon M; Corre, Erwan; Valero, Myriam; Aury, Jean-Marc; Roze, Denis; Van de Peer, Yves; Bothwell, John; Marais, Gabriel A B; Coelho, Susana M

    2014-09-08

    A common feature of most genetic sex-determination systems studied so far is that sex is determined by nonrecombining genomic regions, which can be of various sizes depending on the species. These regions have evolved independently and repeatedly across diverse groups. A number of such sex-determining regions (SDRs) have been studied in animals, plants, and fungi, but very little is known about the evolution of sexes in other eukaryotic lineages. We report here the sequencing and genomic analysis of the SDR of Ectocarpus, a brown alga that has been evolving independently from plants, animals, and fungi for over one giga-annum. In Ectocarpus, sex is expressed during the haploid phase of the life cycle, and both the female (U) and the male (V) sex chromosomes contain nonrecombining regions. The U and V of this species have been diverging for more than 70 mega-annum, yet gene degeneration has been modest, and the SDR is relatively small, with no evidence for evolutionary strata. These features may be explained by the occurrence of strong purifying selection during the haploid phase of the life cycle and the low level of sexual dimorphism. V is dominant over U, suggesting that femaleness may be the default state, adopted when the male haplotype is absent. The Ectocarpus UV system has clearly had a distinct evolutionary trajectory not only to the well-studied XY and ZW systems but also to the UV systems described so far. Nonetheless, some striking similarities exist, indicating remarkable universality of the underlying processes shaping sex chromosome evolution across distant lineages. Copyright © 2014 Elsevier Ltd. All rights reserved.

  2. Genome size analyses of Pucciniales reveal the largest fungal genomes.

    PubMed

    Tavares, Sílvia; Ramos, Ana Paula; Pires, Ana Sofia; Azinheira, Helena G; Caldeirinha, Patrícia; Link, Tobias; Abranches, Rita; Silva, Maria do Céu; Voegele, Ralf T; Loureiro, João; Talhinhas, Pedro

    2014-01-01

    Rust fungi (Basidiomycota, Pucciniales) are biotrophic plant pathogens which exhibit diverse complexities in their life cycles and host ranges. The completion of genome sequencing of a few rust fungi has revealed the occurrence of large genomes. Sequencing efforts for other rust fungi have been hampered by uncertainty concerning their genome sizes. Flow cytometry was recently applied to estimate the genome size of a few rust fungi, and confirmed the occurrence of large genomes in this order (averaging 225.3 Mbp, while the average for Basidiomycota was 49.9 Mbp and was 37.7 Mbp for all fungi). In this work, we have used an innovative and simple approach to simultaneously isolate nuclei from the rust and its host plant in order to estimate the genome size of 30 rust species by flow cytometry. Genome sizes varied over 10-fold, from 70 to 893 Mbp, with an average genome size value of 380.2 Mbp. Compared to the genome sizes of over 1800 fungi, Gymnosporangium confusum possesses the largest fungal genome ever reported (893.2 Mbp). Moreover, even the smallest rust genome determined in this study is larger than the vast majority of fungal genomes (94%). The average genome size of the Pucciniales is now of 305.5 Mbp, while the average Basidiomycota genome size has shifted to 70.4 Mbp and the average for all fungi reached 44.2 Mbp. Despite the fact that no correlation could be drawn between the genome sizes, the phylogenomics or the life cycle of rust fungi, it is interesting to note that rusts with Fabaceae hosts present genomes clearly larger than those with Poaceae hosts. Although this study comprises only a small fraction of the more than 7000 rust species described, it seems already evident that the Pucciniales represent a group where genome size expansion could be a common characteristic. This is in sharp contrast to sister taxa, placing this order in a relevant position in fungal genomics research.

  3. Genome size analyses of Pucciniales reveal the largest fungal genomes

    PubMed Central

    Tavares, Sílvia; Ramos, Ana Paula; Pires, Ana Sofia; Azinheira, Helena G.; Caldeirinha, Patrícia; Link, Tobias; Abranches, Rita; Silva, Maria do Céu; Voegele, Ralf T.; Loureiro, João; Talhinhas, Pedro

    2014-01-01

    Rust fungi (Basidiomycota, Pucciniales) are biotrophic plant pathogens which exhibit diverse complexities in their life cycles and host ranges. The completion of genome sequencing of a few rust fungi has revealed the occurrence of large genomes. Sequencing efforts for other rust fungi have been hampered by uncertainty concerning their genome sizes. Flow cytometry was recently applied to estimate the genome size of a few rust fungi, and confirmed the occurrence of large genomes in this order (averaging 225.3 Mbp, while the average for Basidiomycota was 49.9 Mbp and was 37.7 Mbp for all fungi). In this work, we have used an innovative and simple approach to simultaneously isolate nuclei from the rust and its host plant in order to estimate the genome size of 30 rust species by flow cytometry. Genome sizes varied over 10-fold, from 70 to 893 Mbp, with an average genome size value of 380.2 Mbp. Compared to the genome sizes of over 1800 fungi, Gymnosporangium confusum possesses the largest fungal genome ever reported (893.2 Mbp). Moreover, even the smallest rust genome determined in this study is larger than the vast majority of fungal genomes (94%). The average genome size of the Pucciniales is now of 305.5 Mbp, while the average Basidiomycota genome size has shifted to 70.4 Mbp and the average for all fungi reached 44.2 Mbp. Despite the fact that no correlation could be drawn between the genome sizes, the phylogenomics or the life cycle of rust fungi, it is interesting to note that rusts with Fabaceae hosts present genomes clearly larger than those with Poaceae hosts. Although this study comprises only a small fraction of the more than 7000 rust species described, it seems already evident that the Pucciniales represent a group where genome size expansion could be a common characteristic. This is in sharp contrast to sister taxa, placing this order in a relevant position in fungal genomics research. PMID:25206357

  4. Use of doubled haploid technology for development of stable drought tolerant bread wheat (Triticum aestivum L.) transgenics.

    PubMed

    Chauhan, Harsh; Khurana, Paramjit

    2011-04-01

    Anther culture-derived haploid embryos were used as explants for Agrobacterium-mediated genetic transformation of bread wheat (Triticum aestivum L. cv CPAN1676) using barley HVA1 gene for drought tolerance. Regenerated plantlets were checked for transgene integration in T₀ generation, and positive transgenic haploid plants were doubled by colchicine treatment. Stable transgenic doubled haploid plants were obtained, and transgene expression was monitored till T₄ generation, and no transgene silencing was observed over the generations. Doubled haploid transgenic plants have faster seed germination and seedling establishment and show better drought tolerance in comparison with nontransgenic, doubled haploid plants, as measured by per cent germination, seedling growth and biomass accumulation. Physiological evaluation for abiotic stress by assessing nitrate reductase enzyme activity and plant yield under post-anthesis water limitation revealed a better tolerance of the transgenics over the wild type. This is the first report on the production of double haploid transgenic wheat through anther culture technique in a commercial cultivar for a desirable trait. This method would also be useful in functional genomics of wheat and other allopolyploids of agronomic importance. © 2010 The Authors. Plant Biotechnology Journal © 2010 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.

  5. Androgenesis, gynogenesis, and parthenogenesis haploids in cucurbit species.

    PubMed

    Dong, Yan-Qi; Zhao, Wei-Xing; Li, Xiao-Hui; Liu, Xi-Cun; Gao, Ning-Ning; Huang, Jin-Hua; Wang, Wen-Ying; Xu, Xiao-Li; Tang, Zhen-Hai

    2016-10-01

    Haploids and doubled haploids are critical components of plant breeding. This review is focused on studies on haploids and double haploids inducted in cucurbits through in vitro pollination with irradiated pollen, unfertilized ovule/ovary culture, and anther/microspore culture during the last 30 years, as well as comprehensive analysis of the main factors of each process and comparison between chromosome doubling and ploidy identification methods, with special focus on the application of double haploids in plant breeding and genetics. This review identifies existing problems affecting the efficiency of androgenesis, gynogenesis, and parthenogenesis in cucurbit species. Donor plant genotypes and surrounding environments, developmental stages of explants, culture media, stress factors, and chromosome doubling and ploidy identification are compared at length and discussed as methodologies and protocols for androgenesis, gynogenesis, and parthenogenesis in haploid and double haploid production technologies.

  6. Detection and mapping of QTL for temperature tolerance and body size in Chinook salmon (Oncorhynchus tshawytscha) using genotyping by sequencing

    PubMed Central

    Everett, Meredith V; Seeb, James E

    2014-01-01

    Understanding how organisms interact with their environments is increasingly important for conservation efforts in many species, especially in light of highly anticipated climate changes. One method for understanding this relationship is to use genetic maps and QTL mapping to detect genomic regions linked to phenotypic traits of importance for adaptation. We used high-throughput genotyping by sequencing (GBS) to both detect and map thousands of SNPs in haploid Chinook salmon (Oncorhynchus tshawytscha). We next applied this map to detect QTL related to temperature tolerance and body size in families of diploid Chinook salmon. Using these techniques, we mapped 3534 SNPs in 34 linkage groups which is consistent with the haploid chromosome number for Chinook salmon. We successfully detected three QTL for temperature tolerance and one QTL for body size at the experiment-wide level, as well as additional QTL significant at the chromosome-wide level. The use of haploids coupled with GBS provides a robust pathway to rapidly develop genomic resources in nonmodel organisms; these QTL represent preliminary progress toward linking traits of conservation interest to regions in the Chinook salmon genome. PMID:24822082

  7. Salix transect of Europe: variation in ploidy and genome size in willow-associated common nettle, Urtica dioica L. sens. lat., from Greece to arctic Norway

    PubMed Central

    Hidalgo, Oriane; Pellicer, Jaume; Percy, Diana; Leitch, Ilia J.

    2016-01-01

    Abstract Background The common stinging nettle, Urtica dioica L. sensu lato, is an invertebrate "superhost", its clonal patches maintaining large populations of insects and molluscs. It is extremely widespread in Europe and highly variable, and two ploidy levels (diploid and tetraploid) are known. However, geographical patterns in cytotype variation require further study. New information We assembled a collection of nettles in conjunction with a transect of Europe from the Aegean to Arctic Norway (primarily conducted to examine the diversity of Salix and Salix-associated insects). Using flow cytometry to measure genome size, our sample of 29 plants reveals 5 diploids and 24 tetraploids. Two diploids were found in SE Europe (Bulgaria and Romania) and three diploids in S. Finland. More detailed cytotype surveys in these regions are suggested. The tetraploid genome size (2C value) varied between accessions from 2.36 to 2.59 pg. The diploids varied from 1.31 to 1.35 pg per 2C nucleus, equivalent to a haploid genome size of c. 650 Mbp. Within the tetraploids, we find that the most northerly samples (from N. Finland and arctic Norway) have a generally higher genome size. This is possibly indicative of a distinct population in this region. PMID:27932918

  8. Polyploid titan cells produce haploid and aneuploid progeny to promote stress adaptation.

    PubMed

    Gerstein, Aleeza C; Fu, Man Shun; Mukaremera, Liliane; Li, Zhongming; Ormerod, Kate L; Fraser, James A; Berman, Judith; Nielsen, Kirsten

    2015-10-13

    Cryptococcus neoformans is a major life-threatening fungal pathogen. In response to the stress of the host environment, C. neoformans produces large polyploid titan cells. Titan cell production enhances the virulence of C. neoformans, yet whether the polyploid aspect of titan cells is specifically influential remains unknown. We show that titan cells were more likely to survive and produce offspring under multiple stress conditions than typical cells and that even their normally sized daughters maintained an advantage over typical cells in continued exposure to stress. Although polyploid titan cells generated haploid daughter cell progeny upon in vitro replication under nutrient-replete conditions, titan cells treated with the antifungal drug fluconazole produced fluconazole-resistant diploid and aneuploid daughter cells. Interestingly, a single titan mother cell was capable of generating multiple types of aneuploid daughter cells. The increased survival and genomic diversity of titan cell progeny promote rapid adaptation to new or high-stress conditions. The ability to adapt to stress is a key element for survival of pathogenic microbes in the host and thus plays an important role in pathogenesis. Here we investigated the predominantly haploid human fungal pathogen Cryptococcus neoformans, which is capable of ploidy and cell size increases during infection through production of titan cells. The enlarged polyploid titan cells are then able to rapidly undergo ploidy reduction to generate progeny with reduced ploidy and/or aneuploidy. Under stressful conditions, titan cell progeny have a growth and survival advantage over typical cell progeny. Understanding how titan cells enhance the rate of cryptococcal adaptation under stress conditions may assist in the development of novel drugs aimed at blocking ploidy transitions. Copyright © 2015 Gerstein et al.

  9. Karyotype and genome size of Iberochondrostoma almacai (Teleostei, Cyprinidae) and comparison with the sister-species I.lusitanicum

    PubMed Central

    2009-01-01

    This study aimed to define the karyotype of the recently described Iberian endemic Iberochondrostoma almacai, to revisit the previously documented chromosome polymorphisms of its sister species I.lusitanicum using C-, Ag-/CMA3 and RE-banding, and to compare the two species genome sizes. A 2n = 50 karyotype (with the exception of a triploid I.lusitanicum specimen) and a corresponding haploid chromosome formula of 7M:15SM:3A (FN = 94) were found. Multiple NORs were observed in both species (in two submetacentric chromosome pairs, one of them clearly homologous) and a higher intra and interpopulational variability was evidenced in I.lusitanicum. Flow cytometry measurements of nuclear DNA content showed some significant differences in genome size both between and within species: the genome of I. almacai was smaller than that of I.lusitanicum (mean values 2.61 and 2.93 pg, respectively), which presented a clear interpopulational variability (mean values ranging from 2.72 to 3.00 pg). These data allowed the distinction of both taxa and confirmed the existence of two well differentiated groups within I. lusitanicum: one that includes the populations from the right bank of the Tejo and Samarra drainages, and another that reunites the southern populations. The peculiar differences between the two species, presently listed as “Critically Endangered”, reinforced the importance of this study for future conservation plans. PMID:21637679

  10. Genome size variation in the genus Avena.

    PubMed

    Yan, Honghai; Martin, Sara L; Bekele, Wubishet A; Latta, Robert G; Diederichsen, Axel; Peng, Yuanying; Tinker, Nicholas A

    2016-03-01

    Genome size is an indicator of evolutionary distance and a metric for genome characterization. Here, we report accurate estimates of genome size in 99 accessions from 26 species of Avena. We demonstrate that the average genome size of C genome diploid species (2C = 10.26 pg) is 15% larger than that of A genome species (2C = 8.95 pg), and that this difference likely accounts for a progression of size among tetraploid species, where AB < AC < CC (average 2C = 16.76, 18.60, and 21.78 pg, respectively). All accessions from three hexaploid species with the ACD genome configuration had similar genome sizes (average 2C = 25.74 pg). Genome size was mostly consistent within species and in general agreement with current information about evolutionary distance among species. Results also suggest that most of the polyploid species in Avena have experienced genome downsizing in relation to their diploid progenitors. Genome size measurements could provide additional quality control for species identification in germplasm collections, especially in cases where diploid and polyploid species have similar morphology.

  11. Genome size evolution in Ontario ferns (Polypodiidae): evolutionary correlations with cell size, spore size, and habitat type and an absence of genome downsizing.

    PubMed

    Henry, Thomas A; Bainard, Jillian D; Newmaster, Steven G

    2014-10-01

    Genome size is known to correlate with a number of traits in angiosperms, but less is known about the phenotypic correlates of genome size in ferns. We explored genome size variation in relation to a suite of morphological and ecological traits in ferns. Thirty-six fern taxa were collected from wild populations in Ontario, Canada. 2C DNA content was measured using flow cytometry. We tested for genome downsizing following polyploidy using a phylogenetic comparative analysis to explore the correlation between 1Cx DNA content and ploidy. There was no compelling evidence for the occurrence of widespread genome downsizing during the evolution of Ontario ferns. The relationship between genome size and 11 morphological and ecological traits was explored using a phylogenetic principal component regression analysis. Genome size was found to be significantly associated with cell size, spore size, spore type, and habitat type. These results are timely as past and recent studies have found conflicting support for the association between ploidy/genome size and spore size in fern polyploid complexes; this study represents the first comparative analysis of the trend across a broad taxonomic group of ferns.

  12. Patterns of genome size variation in snapping shrimp.

    PubMed

    Jeffery, Nicholas W; Hultgren, Kristin; Chak, Solomon Tin Chi; Gregory, T Ryan; Rubenstein, Dustin R

    2016-06-01

    Although crustaceans vary extensively in genome size, little is known about how genome size may affect the ecology and evolution of species in this diverse group, in part due to the lack of large genome size datasets. Here we investigate interspecific, intraspecific, and intracolony variation in genome size in 39 species of Synalpheus shrimps, representing one of the largest genome size datasets for a single genus within crustaceans. We find that genome size ranges approximately 4-fold across Synalpheus with little phylogenetic signal, and is not related to body size. In a subset of these species, genome size is related to chromosome size, but not to chromosome number, suggesting that despite large genomes, these species are not polyploid. Interestingly, there appears to be 35% intraspecific genome size variation in Synalpheus idios among geographic regions, and up to 30% variation in Synalpheus duffyi genome size within the same colony.

  13. Genome size variation in deep-sea amphipods

    PubMed Central

    Jamieson, A. J.; Piertney, S. B.

    2017-01-01

    Genome size varies considerably across taxa, and extensive research effort has gone into understanding whether variation can be explained by differences in key ecological and life-history traits among species. The extreme environmental conditions that characterize the deep sea have been hypothesized to promote large genome sizes in eukaryotes. Here we test this supposition by examining genome sizes among 13 species of deep-sea amphipods from the Mariana, Kermadec and New Hebrides trenches. Genome sizes were estimated using flow cytometry and found to vary nine-fold, ranging from 4.06 pg (4.04 Gb) in Paralicella caperesca to 34.79 pg (34.02 Gb) in Alicella gigantea. Phylogenetic independent contrast analysis identified a relationship between genome size and maximum body size, though this was largely driven by those species that display size gigantism. There was a distinct shift in the genome size trait diversification rate in the supergiant amphipod A. gigantea relative to the rest of the group. The variation in genome size observed is striking and argues against genome size being driven by a common evolutionary history, ecological niche and life-history strategy in deep-sea amphipods. PMID:28989783

  14. Fixation Probability in a Haploid-Diploid Population.

    PubMed

    Bessho, Kazuhiro; Otto, Sarah P

    2017-01-01

    Classical population genetic theory generally assumes either a fully haploid or fully diploid life cycle. However, many organisms exhibit more complex life cycles, with both free-living haploid and diploid stages. Here we ask what the probability of fixation is for selected alleles in organisms with haploid-diploid life cycles. We develop a genetic model that considers the population dynamics using both the Moran model and Wright-Fisher model. Applying a branching process approximation, we obtain an accurate fixation probability assuming that the population is large and the net effect of the mutation is beneficial. We also find the diffusion approximation for the fixation probability, which is accurate even in small populations and for deleterious alleles, as long as selection is weak. These fixation probabilities from branching process and diffusion approximations are similar when selection is weak for beneficial mutations that are not fully recessive. In many cases, particularly when one phase predominates, the fixation probability differs substantially for haploid-diploid organisms compared to either fully haploid or diploid species. Copyright © 2017 by the Genetics Society of America.

  15. Fixation Probability in a Haploid-Diploid Population

    PubMed Central

    Bessho, Kazuhiro; Otto, Sarah P.

    2017-01-01

    Classical population genetic theory generally assumes either a fully haploid or fully diploid life cycle. However, many organisms exhibit more complex life cycles, with both free-living haploid and diploid stages. Here we ask what the probability of fixation is for selected alleles in organisms with haploid-diploid life cycles. We develop a genetic model that considers the population dynamics using both the Moran model and Wright–Fisher model. Applying a branching process approximation, we obtain an accurate fixation probability assuming that the population is large and the net effect of the mutation is beneficial. We also find the diffusion approximation for the fixation probability, which is accurate even in small populations and for deleterious alleles, as long as selection is weak. These fixation probabilities from branching process and diffusion approximations are similar when selection is weak for beneficial mutations that are not fully recessive. In many cases, particularly when one phase predominates, the fixation probability differs substantially for haploid-diploid organisms compared to either fully haploid or diploid species. PMID:27866168

  16. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly

    PubMed Central

    Schneider, Valerie A.; Graves-Lindsay, Tina; Howe, Kerstin; Bouk, Nathan; Chen, Hsiu-Chuan; Kitts, Paul A.; Murphy, Terence D.; Pruitt, Kim D.; Thibaud-Nissen, Françoise; Albracht, Derek; Fulton, Robert S.; Kremitzki, Milinn; Magrini, Vincent; Markovic, Chris; McGrath, Sean; Steinberg, Karyn Meltz; Auger, Kate; Chow, William; Collins, Joanna; Harden, Glenn; Hubbard, Timothy; Pelan, Sarah; Simpson, Jared T.; Threadgold, Glen; Torrance, James; Wood, Jonathan M.; Clarke, Laura; Koren, Sergey; Boitano, Matthew; Peluso, Paul; Li, Heng; Chin, Chen-Shan; Phillippy, Adam M.; Durbin, Richard; Wilson, Richard K.; Flicek, Paul; Eichler, Evan E.; Church, Deanna M.

    2017-01-01

    The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health. PMID:28396521

  17. Cell size, genome size and the dominance of Angiosperms

    NASA Astrophysics Data System (ADS)

    Simonin, K. A.; Roddy, A. B.

    2016-12-01

    Angiosperms are capable of maintaining the highest rates of photosynthetic gas exchange of all land plants. High rates of photosynthesis depends mechanistically both on efficiently transporting water to the sites of evaporation in the leaf and on regulating the loss of that water to the atmosphere as CO2 diffuses into the leaf. Angiosperm leaves are unique in their ability to sustain high fluxes of liquid and vapor phase water transport due to high vein densities and numerous, small stomata. Despite the ubiquity of studies characterizing the anatomical and physiological adaptations that enable angiosperms to maintain high rates of photosynthesis, the underlying mechanism explaining why they have been able to develop such high leaf vein densities, and such small and abundant stomata, is still incomplete. Here we ask whether the scaling of genome size and cell size places a fundamental constraint on the photosynthetic metabolism of land plants, and whether genome downsizing among the angiosperms directly contributed to their greater potential and realized primary productivity relative to the other major groups of terrestrial plants. Using previously published data we show that a single relationship can predict guard cell size from genome size across the major groups of terrestrial land plants (e.g. angiosperms, conifers, cycads and ferns). Similarly, a strong positive correlation exists between genome size and both stomatal density and vein density that together ultimately constrains maximum potential (gs, max) and operational stomatal conductance (gs, op). Further the difference in the slopes describing the covariation between genome size and both gs, max and gs, op suggests that genome downsizing brings gs, op closer to gs, max. Taken together the data presented here suggests that the smaller genomes of angiosperms allow their final cell sizes to vary more widely and respond more directly to environmental conditions and in doing so bring operational photosynthetic

  18. Genome size diversity in orchids: consequences and evolution

    PubMed Central

    Leitch, I. J.; Kahandawala, I.; Suda, J.; Hanson, L.; Ingrouille, M. J.; Chase, M. W.; Fay, M. F.

    2009-01-01

    Background The amount of DNA comprising the genome of an organism (its genome size) varies a remarkable 40 000-fold across eukaryotes, yet most groups are characterized by much narrower ranges (e.g. 14-fold in gymnosperms, 3- to 4-fold in mammals). Angiosperms stand out as one of the most variable groups with genome sizes varying nearly 2000-fold. Nevertheless within angiosperms the majority of families are characterized by genomes which are small and vary little. Species with large genomes are mostly restricted to a few monocots families including Orchidaceae. Scope A survey of the literature revealed that genome size data for Orchidaceae are comparatively rare representing just 327 species. Nevertheless they reveal that Orchidaceae are currently the most variable angiosperm family with genome sizes ranging 168-fold (1C = 0·33–55·4 pg). Analysing the data provided insights into the distribution, evolution and possible consequences to the plant of this genome size diversity. Conclusions Superimposing the data onto the increasingly robust phylogenetic tree of Orchidaceae revealed how different subfamilies were characterized by distinct genome size profiles. Epidendroideae possessed the greatest range of genome sizes, although the majority of species had small genomes. In contrast, the largest genomes were found in subfamilies Cypripedioideae and Vanilloideae. Genome size evolution within this subfamily was analysed as this is the only one with reasonable representation of data. This approach highlighted striking differences in genome size and karyotype evolution between the closely related Cypripedium, Paphiopedilum and Phragmipedium. As to the consequences of genome size diversity, various studies revealed that this has both practical (e.g. application of genetic fingerprinting techniques) and biological consequences (e.g. affecting where and when an orchid may grow) and emphasizes the importance of obtaining further genome size data given the considerable

  19. Doubled haploid production in Flax (Linum usitatissimum L.).

    PubMed

    Obert, Bohus; Zácková, Zuzana; Samaj, Jozef; Pretová, Anna

    2009-01-01

    There is a requirement of haploid and double haploid material and homozygous lines for cell culture studies and breeding in flax. Anther culture is currently the most successful method producing doubled haploid lines in flax. Recently, ovary culture was also described as a good source of doubled haploids. In this review we focus on tissue and plants regeneration using anther culture, and cultivation of ovaries containing unfertilized ovules. The effect of genotype, physiological status of donor plants, donor material pre-treatment and cultivation conditions for flax anthers and ovaries is discussed here. The process of plant regeneration from anther and ovary derived calli is also in the focus of this review. Attention is paid to the ploidy level of regenerated tissue and to the use of molecular markers for determining of gametic origin of flax plants derived from anther and ovary cultures. Finally, some future prospects on the use of doubled haploids in flax biotechnology are outlined here.

  20. Polyploid Titan Cells Produce Haploid and Aneuploid Progeny To Promote Stress Adaptation

    PubMed Central

    Gerstein, Aleeza C.; Fu, Man Shun; Mukaremera, Liliane; Li, Zhongming; Ormerod, Kate L.; Fraser, James A.; Berman, Judith

    2015-01-01

    ABSTRACT Cryptococcus neoformans is a major life-threatening fungal pathogen. In response to the stress of the host environment, C. neoformans produces large polyploid titan cells. Titan cell production enhances the virulence of C. neoformans, yet whether the polyploid aspect of titan cells is specifically influential remains unknown. We show that titan cells were more likely to survive and produce offspring under multiple stress conditions than typical cells and that even their normally sized daughters maintained an advantage over typical cells in continued exposure to stress. Although polyploid titan cells generated haploid daughter cell progeny upon in vitro replication under nutrient-replete conditions, titan cells treated with the antifungal drug fluconazole produced fluconazole-resistant diploid and aneuploid daughter cells. Interestingly, a single titan mother cell was capable of generating multiple types of aneuploid daughter cells. The increased survival and genomic diversity of titan cell progeny promote rapid adaptation to new or high-stress conditions. PMID:26463162

  1. Evolution of genome size and genomic GC content in carnivorous holokinetics (Droseraceae).

    PubMed

    Veleba, Adam; Šmarda, Petr; Zedek, František; Horová, Lucie; Šmerda, Jakub; Bureš, Petr

    2017-02-01

    Studies in the carnivorous family Lentibulariaceae in the last years resulted in the discovery of the smallest plant genomes and an unusual pattern of genomic GC content evolution. However, scarcity of genomic data in other carnivorous clades still prevents a generalization of the observed patterns. Here the aim was to fill this gap by mapping genome evolution in the second largest carnivorous family, Droseraceae, where this evolution may be affected by chromosomal holokinetism in Drosera METHODS: The genome size and genomic GC content of 71 Droseraceae species were measured by flow cytometry. A dated phylogeny was constructed, and the evolution of both genomic parameters and their relationship to species climatic niches were tested using phylogeny-based statistics. The 2C genome size of Droseraceae varied between 488 and 10 927 Mbp, and the GC content ranged between 37·1 and 44·7 %. The genome sizes and genomic GC content of carnivorous and holocentric species did not differ from those of their non-carnivorous and monocentric relatives. The genomic GC content positively correlated with genome size and annual temperature fluctuations. The genome size and chromosome numbers were inversely correlated in the Australian clade of Drosera CONCLUSIONS: Our results indicate that neither carnivory (nutrient scarcity) nor the holokinetism have a prominent effect on size and DNA base composition of Droseraceae genomes. However, the holokinetic drive seems to affect karyotype evolution in one of the major clades of Drosera Our survey confirmed that the evolution of GC content is tightly connected with the evolution of genome size and also with environmental conditions. © The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  2. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.

    PubMed

    Schneider, Valerie A; Graves-Lindsay, Tina; Howe, Kerstin; Bouk, Nathan; Chen, Hsiu-Chuan; Kitts, Paul A; Murphy, Terence D; Pruitt, Kim D; Thibaud-Nissen, Françoise; Albracht, Derek; Fulton, Robert S; Kremitzki, Milinn; Magrini, Vincent; Markovic, Chris; McGrath, Sean; Steinberg, Karyn Meltz; Auger, Kate; Chow, William; Collins, Joanna; Harden, Glenn; Hubbard, Timothy; Pelan, Sarah; Simpson, Jared T; Threadgold, Glen; Torrance, James; Wood, Jonathan M; Clarke, Laura; Koren, Sergey; Boitano, Matthew; Peluso, Paul; Li, Heng; Chin, Chen-Shan; Phillippy, Adam M; Durbin, Richard; Wilson, Richard K; Flicek, Paul; Eichler, Evan E; Church, Deanna M

    2017-05-01

    The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health. © 2017 Schneider et al.; Published by Cold Spring Harbor Laboratory Press.

  3. Production of haploids and doubled haploids in oil palm

    PubMed Central

    2010-01-01

    Background Oil palm is the world's most productive oil-food crop despite yielding well below its theoretical maximum. This maximum could be approached with the introduction of elite F1 varieties. The development of such elite lines has thus far been prevented by difficulties in generating homozygous parental types for F1 generation. Results Here we present the first high-throughput screen to identify spontaneously-formed haploid (H) and doubled haploid (DH) palms. We secured over 1,000 Hs and one DH from genetically diverse material and derived further DH/mixoploid palms from Hs using colchicine. We demonstrated viability of pollen from H plants and expect to generate 100% homogeneous F1 seed from intercrosses between DH/mixoploids once they develop female inflorescences. Conclusions This study has generated genetically diverse H/DH palms from which parental clones can be selected in sufficient numbers to enable the commercial-scale breeding of F1 varieties. The anticipated step increase in productivity may help to relieve pressure to extend palm cultivation, and limit further expansion into biodiverse rainforest. PMID:20929530

  4. Genome engineering in human cells.

    PubMed

    Song, Minjung; Kim, Young-Hoon; Kim, Jin-Soo; Kim, Hyongbum

    2014-01-01

    Genome editing in human cells is of great value in research, medicine, and biotechnology. Programmable nucleases including zinc-finger nucleases, transcription activator-like effector nucleases, and RNA-guided engineered nucleases recognize a specific target sequence and make a double-strand break at that site, which can result in gene disruption, gene insertion, gene correction, or chromosomal rearrangements. The target sequence complexities of these programmable nucleases are higher than 3.2 mega base pairs, the size of the haploid human genome. Here, we briefly introduce the structure of the human genome and the characteristics of each programmable nuclease, and review their applications in human cells including pluripotent stem cells. In addition, we discuss various delivery methods for nucleases, programmable nickases, and enrichment of gene-edited human cells, all of which facilitate efficient and precise genome editing in human cells.

  5. A p53-dependent response limits the viability of mammalian haploid cells

    PubMed Central

    Olbrich, Teresa; Mayor-Ruiz, Cristina; Vega-Sendino, Maria; Gomez, Carmen; Ortega, Sagrario; Ruiz, Sergio; Fernandez-Capetillo, Oscar

    2017-01-01

    The recent development of haploid cell lines has facilitated forward genetic screenings in mammalian cells. These lines include near-haploid human cell lines isolated from a patient with chronic myelogenous leukemia (KBM7 and HAP1), as well as haploid embryonic stem cells derived from several organisms. In all cases, haploidy was shown to be an unstable state, so that cultures of mammalian haploid cells rapidly become enriched in diploids. Here we show that the observed diploidization is due to a proliferative disadvantage of haploid cells compared with diploid cells. Accordingly, single-cell–sorted haploid mammalian cells maintain the haploid state for prolonged periods, owing to the absence of competing diploids. Although the duration of interphase is similar in haploid and diploid cells, haploid cells spend longer in mitosis, indicative of problems in chromosome segregation. In agreement with this, a substantial proportion of the haploids die at or shortly after the last mitosis through activation of a p53-dependent cytotoxic response. Finally, we show that p53 deletion stabilizes haploidy in human HAP1 cells and haploid mouse embryonic stem cells. We propose that, similar to aneuploidy or tetraploidy, haploidy triggers a p53-dependent response that limits the fitness of mammalian cells. PMID:28808015

  6. Verification and characterization of chromosome duplication in haploid maize.

    PubMed

    de Oliveira Couto, E G; Resende Von Pinho, E V; Von Pinho, R G; Veiga, A D; de Carvalho, M R; de Oliveira Bustamante, F; Nascimento, M S

    2015-06-26

    Doubled haploid technology has been used by various private companies. However, information regarding chromosome duplication methodologies, particularly those concerning techniques used to identify duplication in cells, is limited. Thus, we analyzed and characterized artificially doubled haploids using microsatellites molecular markers, pollen viability, and flow cytometry techniques. Evaluated material was obtained using two different chromosome duplication protocols in maize seeds considered haploids, resulting from the cross between the haploid inducer line KEMS and 4 hybrids (GNS 3225, GNS 3032, GNS 3264, and DKB 393). Fourteen days after duplication, plant samples were collected and assessed by flow cytometry. Further, the plants were transplanted to a field, and samples were collected for DNA analyses using microsatellite markers. The tassels were collected during anthesis for pollen viability analyses. Haploid, diploid, and mixoploid individuals were detected using flow cytometry, demonstrating that this technique was efficient for identifying doubled haploids. The microsatellites markers were also efficient for confirming the ploidies preselected by flow cytometry and for identifying homozygous individuals. Pollen viability showed a significant difference between the evaluated ploidies when the Alexander and propionic-carmin stains were used. The viability rates between the plodies analyzed show potential for fertilization.

  7. Critical Mutation Rate Has an Exponential Dependence on Population Size in Haploid and Diploid Populations

    PubMed Central

    Aston, Elizabeth; Channon, Alastair; Day, Charles; Knight, Christopher G.

    2013-01-01

    Understanding the effect of population size on the key parameters of evolution is particularly important for populations nearing extinction. There are evolutionary pressures to evolve sequences that are both fit and robust. At high mutation rates, individuals with greater mutational robustness can outcompete those with higher fitness. This is survival-of-the-flattest, and has been observed in digital organisms, theoretically, in simulated RNA evolution, and in RNA viruses. We introduce an algorithmic method capable of determining the relationship between population size, the critical mutation rate at which individuals with greater robustness to mutation are favoured over individuals with greater fitness, and the error threshold. Verification for this method is provided against analytical models for the error threshold. We show that the critical mutation rate for increasing haploid population sizes can be approximated by an exponential function, with much lower mutation rates tolerated by small populations. This is in contrast to previous studies which identified that critical mutation rate was independent of population size. The algorithm is extended to diploid populations in a system modelled on the biological process of meiosis. The results confirm that the relationship remains exponential, but show that both the critical mutation rate and error threshold are lower for diploids, rather than higher as might have been expected. Analyzing the transition from critical mutation rate to error threshold provides an improved definition of critical mutation rate. Natural populations with their numbers in decline can be expected to lose genetic material in line with the exponential model, accelerating and potentially irreversibly advancing their decline, and this could potentially affect extinction, recovery and population management strategy. The effect of population size is particularly strong in small populations with 100 individuals or less; the exponential model has

  8. Ancestral chromosomal blocks are triplicated in Brassiceae species with varying chromosome number and genome size.

    PubMed

    Lysak, Martin A; Cheung, Kwok; Kitschke, Michaela; Bures, Petr

    2007-10-01

    The paleopolyploid character of genomes of the economically important genus Brassica and closely related species (tribe Brassiceae) is still fairly controversial. Here, we report on the comparative painting analysis of block F of the crucifer Ancestral Karyotype (AK; n = 8), consisting of 24 conserved genomic blocks, in 10 species traditionally treated as members of the tribe Brassiceae. Three homeologous copies of block F were identified per haploid chromosome complement in Brassiceae species with 2n = 14, 18, 20, 32, and 36. In high-polyploid (n >or= 30) species Crambe maritima (2n = 60), Crambe cordifolia (2n = 120), and Vella pseudocytisus (2n = 68), six, 12, and six copies of the analyzed block have been revealed, respectively. Homeologous regions resembled the ancestral structure of block F within the AK or were altered by inversions and/or translocations. In two species of the subtribe Zillineae, two of the three homeologous regions were combined via a reciprocal translocation onto one chromosome. Altogether, these findings provide compelling evidence of an ancient hexaploidization event and corresponding whole-genome triplication shared by the tribe Brassiceae. No direct relationship between chromosome number and genome size variation (1.2-2.5 pg/2C) has been found in Brassiceae species with 2n = 14 to 36. Only two homeologous copies of block F suggest a whole-genome duplication but not the triplication event in Orychophragmus violaceus (2n = 24), and confirm a phylogenetic position of this species outside the tribe Brassiceae. Chromosome duplication detected in Orychophragmus as well as chromosome rearrangements shared by Zillineae species demonstrate the usefulness of comparative cytogenetics for elucidation of phylogenetic relationships.

  9. The Evolution of Haploid Chromosome Numbers in the Sunflower Family

    PubMed Central

    Mota, Lucie; Torices, Rubén; Loureiro, João

    2016-01-01

    Chromosome number changes during the evolution of angiosperms are likely to have played a major role in speciation. Their study is of utmost importance, especially now, as a probabilistic model is available to study chromosome evolution within a phylogenetic framework. In the present study, likelihood models of chromosome number evolution were fitted to the largest family of flowering plants, the Asteraceae. Specifically, a phylogenetic supertree of this family was used to reconstruct the ancestral chromosome number and infer genomic events. Our approach inferred that the ancestral chromosome number of the family is n = 9. Also, according to the model that best explained our data, the evolution of haploid chromosome numbers in Asteraceae was a very dynamic process, with genome duplications and descending dysploidy being the most frequent genomic events in the evolution of this family. This model inferred more than one hundred whole genome duplication events; however, it did not find evidence for a paleopolyploidization at the base of this family, which has previously been hypothesized on the basis of sequence data from a limited number of species. The obtained results and potential causes of these discrepancies are discussed. PMID:27797951

  10. Plant Genome Size Research: A Field In Focus

    PubMed Central

    BENNETT, M. D.; LEITCH, I. J.

    2005-01-01

    This Special Issue contains 18 papers arising from presentations at the Second Plant Genome Size Workshop and Discussion Meeting (hosted by the Royal Botanic Gardens, Kew, 8–12 September, 2003). This preface provides an overview of these papers, setting their key contents in the broad framework of this highly active field. It also highlights a few overarching issues with wide biological impact or interest, including (1) the need to unify terminology relating to C-value and genome size, (2) the ongoing quest for accurate gold standards for accurate plant genome size estimation, (3) how knowledge of species' DNA amounts has increased in recent years, (4) the existence, causes and significance of intraspecific variation, (5) recent progress in understanding the mechanisms and evolutionary patterns of genome size change, and (6) the impact of genome size knowledge on related biological activities such as genetic fingerprinting and quantitative genetics. The paper offers a vision of how increased knowledge and understanding of genome size will contribute to holisitic genomic studies in both plants and animals in the next decade. PMID:15596455

  11. Ancestral Chromosomal Blocks Are Triplicated in Brassiceae Species with Varying Chromosome Number and Genome Size1

    PubMed Central

    Lysak, Martin A.; Cheung, Kwok; Kitschke, Michaela; Bureš, Petr

    2007-01-01

    The paleopolyploid character of genomes of the economically important genus Brassica and closely related species (tribe Brassiceae) is still fairly controversial. Here, we report on the comparative painting analysis of block F of the crucifer Ancestral Karyotype (AK; n = 8), consisting of 24 conserved genomic blocks, in 10 species traditionally treated as members of the tribe Brassiceae. Three homeologous copies of block F were identified per haploid chromosome complement in Brassiceae species with 2n = 14, 18, 20, 32, and 36. In high-polyploid (n ≥ 30) species Crambe maritima (2n = 60), Crambe cordifolia (2n = 120), and Vella pseudocytisus (2n = 68), six, 12, and six copies of the analyzed block have been revealed, respectively. Homeologous regions resembled the ancestral structure of block F within the AK or were altered by inversions and/or translocations. In two species of the subtribe Zillineae, two of the three homeologous regions were combined via a reciprocal translocation onto one chromosome. Altogether, these findings provide compelling evidence of an ancient hexaploidization event and corresponding whole-genome triplication shared by the tribe Brassiceae. No direct relationship between chromosome number and genome size variation (1.2–2.5 pg/2C) has been found in Brassiceae species with 2n = 14 to 36. Only two homeologous copies of block F suggest a whole-genome duplication but not the triplication event in Orychophragmus violaceus (2n = 24), and confirm a phylogenetic position of this species outside the tribe Brassiceae. Chromosome duplication detected in Orychophragmus as well as chromosome rearrangements shared by Zillineae species demonstrate the usefulness of comparative cytogenetics for elucidation of phylogenetic relationships. PMID:17720758

  12. Plasmodium copy number variation scan: gene copy numbers evaluation in haploid genomes.

    PubMed

    Beghain, Johann; Langlois, Anne-Claire; Legrand, Eric; Grange, Laura; Khim, Nimol; Witkowski, Benoit; Duru, Valentine; Ma, Laurence; Bouchier, Christiane; Ménard, Didier; Paul, Richard E; Ariey, Frédéric

    2016-04-12

    In eukaryotic genomes, deletion or amplification rates have been estimated to be a thousand more frequent than single nucleotide variation. In Plasmodium falciparum, relatively few transcription factors have been identified, and the regulation of transcription is seemingly largely influenced by gene amplification events. Thus copy number variation (CNV) is a major mechanism enabling parasite genomes to adapt to new environmental changes. Currently, the detection of CNVs is based on quantitative PCR (qPCR), which is significantly limited by the relatively small number of genes that can be analysed at any one time. Technological advances that facilitate whole-genome sequencing, such as next generation sequencing (NGS) enable deeper analyses of the genomic variation to be performed. Because the characteristics of Plasmodium CNVs need special consideration in algorithms and strategies for which classical CNV detection programs are not suited a dedicated algorithm to detect CNVs across the entire exome of P. falciparum was developed. This algorithm is based on a custom read depth strategy through NGS data and called PlasmoCNVScan. The analysis of CNV identification on three genes known to have different levels of amplification and which are located either in the nuclear, apicoplast or mitochondrial genomes is presented. The results are correlated with the qPCR experiments, usually used for identification of locus specific amplification/deletion. This tool will facilitate the study of P. falciparum genomic adaptation in response to ecological changes: drug pressure, decreased transmission, reduction of the parasite population size (transition to pre-elimination endemic area).

  13. A first exploration of genome size diversity in sponges.

    PubMed

    Jeffery, Nicholas W; Jardine, Catherine B; Gregory, T Ryan

    2013-08-01

    The phyla known as early-branching lineages of animals have become the subject of increasing interest from the perspectives of genomics and evolutionary biology. Unfortunately, data on even the most fundamental properties of their genomes, such as genome size, remain very scarce. In this study, genome size estimates are reported for 75 species of sponges (phylum Porifera) representing 33 families and 12 orders, marking the first large survey of genome size diversity for an early-branching phylum. Sponge genome sizes averaged around 0.2 pg but exhibited a 17-fold range overall (0.04-0.63 pg). In addition, the results of comparisons of two methods of genome size quantification (flow cytometry and Feulgen image analysis densitometry) are presented, thereby facilitating future work on these animals. Some particularly promising avenues for future investigation are highlighted.

  14. Total centromere size and genome size are strongly correlated in ten grass species.

    PubMed

    Zhang, Han; Dawe, R Kelly

    2012-05-01

    It has been known for decades that centromere size varies across species, but the factors involved in setting centromere boundaries are unknown. As a means to address this question, we estimated centromere sizes in ten species of the grass family including rice, maize, and wheat, which diverged 60~80 million years ago and vary by 40-fold in genome size. Measurements were made using a broadly reactive antibody to rice centromeric histone H3 (CENH3). In species-wide comparisons, we found a clear linear relationship between total centromere size and genome size. Species with large genomes and few chromosomes tend to have the largest centromeres (e.g., rye) while species with small genomes and many chromosomes have the smallest centromeres (e.g., rice). However, within a species, centromere size is surprisingly uniform. We present evidence from three oat-maize addition lines that support this claim, indicating that each of three maize centromeres propagated in oat are not measurably different from each other. In the context of previously published data, our results suggest that the apparent correlation between chromosome and centromere size is incidental to a larger trend that reflects genome size. Centromere size may be determined by a limiting component mechanism similar to that described for Caenorhabditis elegans centrosomes.

  15. Stomatal vs. genome size in angiosperms: the somatic tail wagging the genomic dog?

    PubMed Central

    Hodgson, J. G.; Sharafi, M.; Jalili, A.; Díaz, S.; Montserrat-Martí, G.; Palmer, C.; Cerabolini, B.; Pierce, S.; Hamzehee, B.; Asri, Y.; Jamzad, Z.; Wilson, P.; Raven, J. A.; Band, S. R.; Basconcelo, S.; Bogard, A.; Carter, G.; Charles, M.; Castro-Díez, P.; Cornelissen, J. H. C.; Funes, G.; Jones, G.; Khoshnevis, M.; Pérez-Harguindeguy, N.; Pérez-Rontomé, M. C.; Shirvany, F. A.; Vendramini, F.; Yazdani, S.; Abbas-Azimi, R.; Boustani, S.; Dehghan, M.; Guerrero-Campo, J.; Hynd, A.; Kowsary, E.; Kazemi-Saeed, F.; Siavash, B.; Villar-Salvador, P.; Craigie, R.; Naqinezhad, A.; Romo-Díez, A.; de Torres Espuny, L.; Simmons, E.

    2010-01-01

    Background and Aims Genome size is a function, and the product, of cell volume. As such it is contingent on ecological circumstance. The nature of ‘this ecological circumstance’ is, however, hotly debated. Here, we investigate for angiosperms whether stomatal size may be this ‘missing link’: the primary determinant of genome size. Stomata are crucial for photosynthesis and their size affects functional efficiency. Methods Stomatal and leaf characteristics were measured for 1442 species from Argentina, Iran, Spain and the UK and, using PCA, some emergent ecological and taxonomic patterns identified. Subsequently, an assessment of the relationship between genome-size values obtained from the Plant DNA C-values database and measurements of stomatal size was carried out. Key Results Stomatal size is an ecologically important attribute. It varies with life-history (woody species < herbaceous species < vernal geophytes) and contributes to ecologically and physiologically important axes of leaf specialization. Moreover, it is positively correlated with genome size across a wide range of major taxa. Conclusions Stomatal size predicts genome size within angiosperms. Correlation is not, however, proof of causality and here our interpretation is hampered by unexpected deficiencies in the scientific literature. Firstly, there are discrepancies between our own observations and established ideas about the ecological significance of stomatal size; very large stomata, theoretically facilitating photosynthesis in deep shade, were, in this study (and in other studies), primarily associated with vernal geophytes of unshaded habitats. Secondly, the lower size limit at which stomata can function efficiently, and the ecological circumstances under which these minute stomata might occur, have not been satisfactorally resolved. Thus, our hypothesis, that the optimization of stomatal size for functional efficiency is a major ecological determinant of genome size, remains unproven

  16. Whole genome duplication and transposable element proliferation drive genome expansion in Corydoradinae catfishes.

    PubMed

    Marburger, Sarah; Alexandrou, Markos A; Taggart, John B; Creer, Simon; Carvalho, Gary; Oliveira, Claudio; Taylor, Martin I

    2018-02-14

    Genome size varies significantly across eukaryotic taxa and the largest changes are typically driven by macro-mutations such as whole genome duplications (WGDs) and proliferation of repetitive elements. These two processes may affect the evolutionary potential of lineages by increasing genetic variation and changing gene expression. Here, we elucidate the evolutionary history and mechanisms underpinning genome size variation in a species-rich group of Neotropical catfishes (Corydoradinae) with extreme variation in genome size-0.6 to 4.4 pg per haploid cell. First, genome size was quantified in 65 species and mapped onto a novel fossil-calibrated phylogeny. Two evolutionary shifts in genome size were identified across the tree-the first between 43 and 49 Ma (95% highest posterior density (HPD) 36.2-68.1 Ma) and the second at approximately 19 Ma (95% HPD 15.3-30.14 Ma). Second, restriction-site-associated DNA (RAD) sequencing was used to identify potential WGD events and quantify transposable element (TE) abundance in different lineages. Evidence of two lineage-scale WGDs was identified across the phylogeny, the first event occurring between 54 and 66 Ma (95% HPD 42.56-99.5 Ma) and the second at 20-30 Ma (95% HPD 15.3-45 Ma) based on haplotype numbers per contig and between 35 and 44 Ma (95% HPD 30.29-64.51 Ma) and 20-30 Ma (95% HPD 15.3-45 Ma) based on SNP read ratios. TE abundance increased considerably in parallel with genome size, with a single TE-family (TC1-IS630-Pogo) showing several increases across the Corydoradinae, with the most recent at 20-30 Ma (95% HPD 15.3-45 Ma) and an older event at 35-44 Ma (95% HPD 30.29-64.51 Ma). We identified signals congruent with two WGD duplication events, as well as an increase in TE abundance across different lineages, making the Corydoradinae an excellent model system to study the effects of WGD and TEs on genome and organismal evolution. © 2018 The Authors.

  17. Genome size variation affects song attractiveness in grasshoppers: evidence for sexual selection against large genomes.

    PubMed

    Schielzeth, Holger; Streitner, Corinna; Lampe, Ulrike; Franzke, Alexandra; Reinhold, Klaus

    2014-12-01

    Genome size is largely uncorrelated to organismal complexity and adaptive scenarios. Genetic drift as well as intragenomic conflict have been put forward to explain this observation. We here study the impact of genome size on sexual attractiveness in the bow-winged grasshopper Chorthippus biguttulus. Grasshoppers show particularly large variation in genome size due to the high prevalence of supernumerary chromosomes that are considered (mildly) selfish, as evidenced by non-Mendelian inheritance and fitness costs if present in high numbers. We ranked male grasshoppers by song characteristics that are known to affect female preferences in this species and scored genome sizes of attractive and unattractive individuals from the extremes of this distribution. We find that attractive singers have significantly smaller genomes, demonstrating that genome size is reflected in male courtship songs and that females prefer songs of males with small genomes. Such a genome size dependent mate preference effectively selects against selfish genetic elements that tend to increase genome size. The data therefore provide a novel example of how sexual selection can reinforce natural selection and can act as an agent in an intragenomic arms race. Furthermore, our findings indicate an underappreciated route of how choosy females could gain indirect benefits. © 2014 The Author(s). Evolution © 2014 The Society for the Study of Evolution.

  18. Chromosomes in a genome-wise order: evidence for metaphase architecture.

    PubMed

    Weise, Anja; Bhatt, Samarth; Piaszinski, Katja; Kosyakova, Nadezda; Fan, Xiaobo; Altendorf-Hofmann, Annelore; Tanomtong, Alongklod; Chaveerach, Arunrat; de Cioffi, Marcelo Bello; de Oliveira, Edivaldo; Walther, Joachim-U; Liehr, Thomas; Chaudhuri, Jyoti P

    2016-01-01

    One fundamental finding of the last decade is that, besides the primary DNA sequence information there are several epigenetic "information-layers" like DNA-and histone modifications, chromatin packaging and, last but not least, the position of genes in the nucleus. We postulate that the functional genomic architecture is not restricted to the interphase of the cell cycle but can also be observed in the metaphase stage, when chromosomes are most condensed and microscopically visible. If so, it offers the unique opportunity to directly analyze the functional aspects of genomic architecture in different cells, species and diseases. Another aspect not directly accessible by molecular techniques is the genome merged from two different haploid parental genomes represented by the homologous chromosome sets. Our results show that there is not only a well-known and defined nuclear architecture in interphase but also in metaphase leading to a bilateral organization of the two haploid sets of chromosomes. Moreover, evidence is provided for the parental origin of the haploid grouping. From our findings we postulate an additional epigenetic information layer within the genome including the organization of homologous chromosomes and their parental origin which may now substantially change the landscape of genetics.

  19. Production of haploids from anther culture of banana [Musa balbisiana (BB)].

    PubMed

    Assani, A; Bakry, F; Kerbellec, F; Haïcour, R; Wenzel, G; Foroughi-Wehr, B

    2003-02-01

    We report here, for the first time, the production of haploid plants of banana Musa balbisiana (BB). Callus was induced from anthers in which the majority of the microspores were at the uninucleate stage. The frequency of callus induction was 77%. Callus proliferation usually preceded embryo formation. About 8% of the anthers developed androgenic embryos. Of the 147 plantlets obtained, 41 were haploids (n=x=11). The frequency of haploid production depended on genotypes used: 18 haploid plants were produced from genotype Pisang klutuk, 12 from Pisang batu, seven from Pisang klutuk wulung and four from Tani. The frequency of regeneration was 1.1%, which was based on the total number of anthers cultured. Diploid plants (2n=2x=22) were also observed in the regenerated plants. The haploid banana plants that were developed will be important material for the improvement of banana through breeding programmes.

  20. The Genome Sizes of Ostracod Crustaceans Correlate with Body Size and Evolutionary History, but not Environment.

    PubMed

    Jeffery, Nicholas W; Ellis, Emily A; Oakley, Todd H; Gregory, T Ryan

    2017-09-01

    Within animals, a positive correlation between genome size and body size has been detected in several taxa but not in others, such that it remains unknown how pervasive this pattern may be. Here, we provide another example of a positive relationship in a group of crustaceans whose genome sizes have not previously been investigated. We analyze genome size estimates for 46 species across the 2 most diverse orders of Class Ostracoda, commonly known as seed shrimps, including 29 new estimates made using Feulgen image analysis densitometry and flow cytometry. Genome sizes in this group range ~80-fold, a level of variability that is otherwise not seen in crustaceans with the exception of some malacostracan orders. We find a strong positive correlation between genome size and body size across all species, including after phylogenetic correction. We additionally detect evidence of XX/XO sex determination in 3 species of marine ostracods where male and female genome sizes were estimated. On average, genome sizes are larger but less variable in Order Myodocopida than in Order Podocopida, and marine ostracods have larger genomes than freshwater species, but this appears to be explained by phylogenetic inertia. The relationship between phylogeny, genome size, body size, and habitat is complex in this system and provides a baseline for future studies examining the interactions of these biological traits. © The American Genetic Association 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  1. Chironomid midges (Diptera, chironomidae) show extremely small genome sizes.

    PubMed

    Cornette, Richard; Gusev, Oleg; Nakahara, Yuichi; Shimura, Sachiko; Kikawada, Takahiro; Okuda, Takashi

    2015-06-01

    Chironomid midges (Diptera; Chironomidae) are found in various environments from the high Arctic to the Antarctic, including temperate and tropical regions. In many freshwater habitats, members of this family are among the most abundant invertebrates. In the present study, the genome sizes of 25 chironomid species were determined by flow cytometry and the resulting C-values ranged from 0.07 to 0.20 pg DNA (i.e. from about 68 to 195 Mbp). These genome sizes were uniformly very small and included, to our knowledge, the smallest genome sizes recorded to date among insects. Small proportion of transposable elements and short intron sizes were suggested to contribute to the reduction of genome sizes in chironomids. We discuss about the possible developmental and physiological advantages of having a small genome size and about putative implications for the ecological success of the family Chironomidae.

  2. Reconstructing relative genome size of vascular plants through geological time.

    PubMed

    Lomax, Barry H; Hilton, Jason; Bateman, Richard M; Upchurch, Garland R; Lake, Janice A; Leitch, Ilia J; Cromwell, Avery; Knight, Charles A

    2014-01-01

    The strong positive relationship evident between cell and genome size in both animals and plants forms the basis of using the size of stomatal guard cells as a proxy to track changes in plant genome size through geological time. We report for the first time a taxonomic fine-scale investigation into changes in stomatal guard-cell length and use these data to infer changes in genome size through the evolutionary history of land plants. Our data suggest that many of the earliest land plants had exceptionally large genome sizes and that a predicted overall trend of increasing genome size within individual lineages through geological time is not supported. However, maximum genome size steadily increases from the Mississippian (c. 360 million yr ago (Ma)) to the present. We hypothesise that the functional relationship between stomatal size, genome size and atmospheric CO2 may contribute to the dichotomy reported between preferential extinction of neopolyploids and the prevalence of palaeopolyploidy observed in DNA sequence data of extant vascular plants. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.

  3. Recent updates and developments to plant genome size databases

    PubMed Central

    Garcia, Sònia; Leitch, Ilia J.; Anadon-Rosell, Alba; Canela, Miguel Á.; Gálvez, Francisco; Garnatje, Teresa; Gras, Airy; Hidalgo, Oriane; Johnston, Emmeline; Mas de Xaxars, Gemma; Pellicer, Jaume; Siljak-Yakovlev, Sonja; Vallès, Joan; Vitales, Daniel; Bennett, Michael D.

    2014-01-01

    Two plant genome size databases have been recently updated and/or extended: the Plant DNA C-values database (http://data.kew.org/cvalues), and GSAD, the Genome Size in Asteraceae database (http://www.asteraceaegenomesize.com). While the first provides information on nuclear DNA contents across land plants and some algal groups, the second is focused on one of the largest and most economically important angiosperm families, Asteraceae. Genome size data have numerous applications: they can be used in comparative studies on genome evolution, or as a tool to appraise the cost of whole-genome sequencing programs. The growing interest in genome size and increasing rate of data accumulation has necessitated the continued update of these databases. Currently, the Plant DNA C-values database (Release 6.0, Dec. 2012) contains data for 8510 species, while GSAD has 1219 species (Release 2.0, June 2013), representing increases of 17 and 51%, respectively, in the number of species with genome size data, compared with previous releases. Here we provide overviews of the most recent releases of each database, and outline new features of GSAD. The latter include (i) a tool to visually compare genome size data between species, (ii) the option to export data and (iii) a webpage containing information about flow cytometry protocols. PMID:24288377

  4. The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum.

    PubMed

    Zimin, Aleksey V; Puiu, Daniela; Hall, Richard; Kingan, Sarah; Clavijo, Bernardo J; Salzberg, Steven L

    2017-11-01

    Common bread wheat, Triticum aestivum, has one of the most complex genomes known to science, with 6 copies of each chromosome, enormous numbers of near-identical sequences scattered throughout, and an overall haploid size of more than 15 billion bases. Multiple past attempts to assemble the genome have produced assemblies that were well short of the estimated genome size. Here we report the first near-complete assembly of T. aestivum, using deep sequencing coverage from a combination of short Illumina reads and very long Pacific Biosciences reads. The final assembly contains 15 344 693 583 bases and has a weighted average (N50) contig size of 232 659 bases. This represents by far the most complete and contiguous assembly of the wheat genome to date, providing a strong foundation for future genetic studies of this important food crop. We also report how we used the recently published genome of Aegilops tauschii, the diploid ancestor of the wheat D genome, to identify 4 179 762 575 bp of T. aestivum that correspond to its D genome components. © The Author 2017. Published by Oxford University Press.

  5. Metabolic 'engines' of flight drive genome size reduction in birds.

    PubMed

    Wright, Natalie A; Gregory, T Ryan; Witt, Christopher C

    2014-03-22

    The tendency for flying organisms to possess small genomes has been interpreted as evidence of natural selection acting on the physical size of the genome. Nonetheless, the flight-genome link and its mechanistic basis have yet to be well established by comparative studies within a volant clade. Is there a particular functional aspect of flight such as brisk metabolism, lift production or maneuverability that impinges on the physical genome? We measured genome sizes, wing dimensions and heart, flight muscle and body masses from a phylogenetically diverse set of bird species. In phylogenetically controlled analyses, we found that genome size was negatively correlated with relative flight muscle size and heart index (i.e. ratio of heart to body mass), but positively correlated with body mass and wing loading. The proportional masses of the flight muscles and heart were the most important parameters explaining variation in genome size in multivariate models. Hence, the metabolic intensity of powered flight appears to have driven genome size reduction in birds.

  6. The genome of melon (Cucumis melo L.)

    PubMed Central

    Garcia-Mas, Jordi; Benjak, Andrej; Sanseverino, Walter; Bourgeois, Michael; Mir, Gisela; González, Víctor M.; Hénaff, Elizabeth; Câmara, Francisco; Cozzuto, Luca; Lowy, Ernesto; Alioto, Tyler; Capella-Gutiérrez, Salvador; Blanca, Jose; Cañizares, Joaquín; Ziarsolo, Pello; Gonzalez-Ibeas, Daniel; Rodríguez-Moreno, Luis; Droege, Marcus; Du, Lei; Alvarez-Tejado, Miguel; Lorente-Galdos, Belen; Melé, Marta; Yang, Luming; Weng, Yiqun; Navarro, Arcadi; Marques-Bonet, Tomas; Aranda, Miguel A.; Nuez, Fernando; Picó, Belén; Gabaldón, Toni; Roma, Guglielmo; Guigó, Roderic; Casacuberta, Josep M.; Arús, Pere; Puigdomènech, Pere

    2012-01-01

    We report the genome sequence of melon, an important horticultural crop worldwide. We assembled 375 Mb of the double-haploid line DHL92, representing 83.3% of the estimated melon genome. We predicted 27,427 protein-coding genes, which we analyzed by reconstructing 22,218 phylogenetic trees, allowing mapping of the orthology and paralogy relationships of sequenced plant genomes. We observed the absence of recent whole-genome duplications in the melon lineage since the ancient eudicot triplication, and our data suggest that transposon amplification may in part explain the increased size of the melon genome compared with the close relative cucumber. A low number of nucleotide-binding site–leucine-rich repeat disease resistance genes were annotated, suggesting the existence of specific defense mechanisms in this species. The DHL92 genome was compared with that of its parental lines allowing the quantification of sequence variability in the species. The use of the genome sequence in future investigations will facilitate the understanding of evolution of cucurbits and the improvement of breeding strategies. PMID:22753475

  7. Induced parthenogenesis by gamma-irradiated pollen in loquat for haploid production.

    PubMed

    Blasco, Manuel; Badenes, María Luisa; Del Mar Naval, María

    2016-09-01

    Successful haploid induction in loquat ( Eriobotrya japonica (Thunb.) Lindl.) through in situ-induced parthenogenesis with gamma-ray irradiated pollen has been achieved. Female flowers of cultivar 'Algerie' were pollinated using pollen of cultivars 'Changhong-3', 'Cox' and 'Saval Brasil' irradiated with two doses of gamma rays, 150 and 300 Gy. The fruits were harvested 90, 105 and 120 days after pollination (dap). Four haploid plants were obtained from 'Algerie' pollinated with 300-Gy-treated pollen of 'Saval Brasil' from fruits harvested 105 dap. Haploidy was confirmed by flow cytometry and chromosome count. The haploids showed a very weak development compared to the diploid plants. This result suggests that irradiated pollen can be used to obtain parthenogenetic haploids.

  8. Gametic embryogenesis and haploid technology as valuable support to plant breeding.

    PubMed

    Germanà, Maria Antonietta

    2011-05-01

    Plant breeding is focused on continuously increasing crop production to meet the needs of an ever-growing world population, improving food quality to ensure a long and healthy life and address the problems of global warming and environment pollution, together with the challenges of developing novel sources of biofuels. The breeders' search for novel genetic combinations, with which to select plants with improved traits to satisfy both farmers and consumers, is endless. About half of the dramatic increase in crop yield obtained in the second half of the last century has been achieved thanks to the results of genetic improvement, while the residual advance has been due to the enhanced management techniques (pest and disease control, fertilization, and irrigation). Biotechnologies provide powerful tools for plant breeding, and among these ones, tissue culture, particularly haploid and doubled haploid technology, can effectively help to select superior plants. In fact, haploids (Hs), which are plants with gametophytic chromosome number, and doubled haploids (DHs), which are haploids that have undergone chromosome duplication, represent a particularly attractive biotechnological method to accelerate plant breeding. Currently, haploid technology, making possible through gametic embryogenesis the single-step development of complete homozygous lines from heterozygous parents, has already had a huge impact on agricultural systems of many agronomically important crops, representing an integral part in their improvement programmes. The aim of this review was to provide some background, recent advances, and future prospective on the employment of haploid technology through gametic embryogenesis as a powerful tool to support plant breeding.

  9. Evolution of Genome Size and Complexity in Pinus

    PubMed Central

    Morse, Alison M.; Peterson, Daniel G.; Islam-Faridi, M. Nurul; Smith, Katherine E.; Magbanua, Zenaida; Garcia, Saul A.; Kubisiak, Thomas L.; Amerson, Henry V.; Carlson, John E.; Nelson, C. Dana; Davis, John M.

    2009-01-01

    Background Genome evolution in the gymnosperm lineage of seed plants has given rise to many of the most complex and largest plant genomes, however the elements involved are poorly understood. Methodology/Principal Findings Gymny is a previously undescribed retrotransposon family in Pinus that is related to Athila elements in Arabidopsis. Gymny elements are dispersed throughout the modern Pinus genome and occupy a physical space at least the size of the Arabidopsis thaliana genome. In contrast to previously described retroelements in Pinus, the Gymny family was amplified or introduced after the divergence of pine and spruce (Picea). If retrotransposon expansions are responsible for genome size differences within the Pinaceae, as they are in angiosperms, then they have yet to be identified. In contrast, molecular divergence of Gymny retrotransposons together with other families of retrotransposons can account for the large genome complexity of pines along with protein-coding genic DNA, as revealed by massively parallel DNA sequence analysis of Cot fractionated genomic DNA. Conclusions/Significance Most of the enormous genome complexity of pines can be explained by divergence of retrotransposons, however the elements responsible for genome size variation are yet to be identified. Genomic resources for Pinus including those reported here should assist in further defining whether and how the roles of retrotransposons differ in the evolution of angiosperm and gymnosperm genomes. PMID:19194510

  10. The dynamic evolutionary history of genome size in North American woodland salamanders.

    PubMed

    Newman, Catherine E; Gregory, T Ryan; Austin, Christopher C

    2017-04-01

    The genus Plethodon is the most species-rich salamander genus in North America, and nearly half of its species face an uncertain future. It is also one of the most diverse families in terms of genome sizes, which range from 1C = 18.2 to 69.3 pg, or 5-20 times larger than the human genome. Large genome size in salamanders results in part from accumulation of transposable elements and is associated with various developmental and physiological traits. However, genome sizes have been reported for only 25% of the species of Plethodon (14 of 55). We collected genome size data for Plethodon serratus to supplement an ongoing phylogeographic study, reconstructed the evolutionary history of genome size in Plethodontidae, and inferred probable genome sizes for the 41 species missing empirical data. Results revealed multiple genome size changes in Plethodon: genomes of western Plethodon increased, whereas genomes of eastern Plethodon decreased, followed by additional decreases or subsequent increases. The estimated genome size of P. serratus was 21 pg. New understanding of variation in genome size evolution, along with genome size inferences for previously unstudied taxa, provide a foundation for future studies on the biology of plethodontid salamanders.

  11. Induced parthenogenesis by gamma-irradiated pollen in loquat for haploid production

    PubMed Central

    Blasco, Manuel; Badenes, María Luisa; del Mar Naval, María

    2016-01-01

    Successful haploid induction in loquat (Eriobotrya japonica (Thunb.) Lindl.) through in situ-induced parthenogenesis with gamma-ray irradiated pollen has been achieved. Female flowers of cultivar ‘Algerie’ were pollinated using pollen of cultivars ‘Changhong-3’, ‘Cox’ and ‘Saval Brasil’ irradiated with two doses of gamma rays, 150 and 300 Gy. The fruits were harvested 90, 105 and 120 days after pollination (dap). Four haploid plants were obtained from ‘Algerie’ pollinated with 300-Gy-treated pollen of ‘Saval Brasil’ from fruits harvested 105 dap. Haploidy was confirmed by flow cytometry and chromosome count. The haploids showed a very weak development compared to the diploid plants. This result suggests that irradiated pollen can be used to obtain parthenogenetic haploids. PMID:27795686

  12. Transcriptome Analysis of Honeybee (Apis Mellifera) Haploid and Diploid Embryos Reveals Early Zygotic Transcription during Cleavage

    PubMed Central

    Pires, Camilla Valente; Freitas, Flávia Cristina de Paula; Cristino, Alexandre S.; Dearden, Peter K.; Simões, Zilá Luz Paulino

    2016-01-01

    In honeybees, the haplodiploid sex determination system promotes a unique embryogenesis process wherein females develop from fertilized eggs and males develop from unfertilized eggs. However, the developmental strategies of honeybees during early embryogenesis are virtually unknown. Similar to most animals, the honeybee oocytes are supplied with proteins and regulatory elements that support early embryogenesis. As the embryo develops, the zygotic genome is activated and zygotic products gradually replace the preloaded maternal material. The analysis of small RNA and mRNA libraries of mature oocytes and embryos originated from fertilized and unfertilized eggs has allowed us to explore the gene expression dynamics in the first steps of development and during the maternal-to-zygotic transition (MZT). We localized a short sequence motif identified as TAGteam motif and hypothesized to play a similar role in honeybees as in fruit flies, which includes the timing of early zygotic expression (MZT), a function sustained by the presence of the zelda ortholog, which is the main regulator of genome activation. Predicted microRNA (miRNA)-target interactions indicated that there were specific regulators of haploid and diploid embryonic development and an overlap of maternal and zygotic gene expression during the early steps of embryogenesis. Although a number of functions are highly conserved during the early steps of honeybee embryogenesis, the results showed that zygotic genome activation occurs earlier in honeybees than in Drosophila based on the presence of three primary miRNAs (pri-miRNAs) (ame-mir-375, ame-mir-34 and ame-mir-263b) during the cleavage stage in haploid and diploid embryonic development. PMID:26751956

  13. The Oxytricha trifallax Macronuclear Genome: A Complex Eukaryotic Genome with 16,000 Tiny Chromosomes

    PubMed Central

    Swart, Estienne C.; Bracht, John R.; Magrini, Vincent; Minx, Patrick; Chen, Xiao; Zhou, Yi; Khurana, Jaspreet S.; Goldman, Aaron D.; Nowacki, Mariusz; Schotanus, Klaas; Jung, Seolkyoung; Fulton, Robert S.; Ly, Amy; McGrath, Sean; Haub, Kevin; Wiggins, Jessica L.; Storton, Donna; Matese, John C.; Parsons, Lance; Chang, Wei-Jen; Bowen, Michael S.; Stover, Nicholas A.; Jones, Thomas A.; Eddy, Sean R.; Herrick, Glenn A.; Doak, Thomas G.; Wilson, Richard K.; Mardis, Elaine R.; Landweber, Laura F.

    2013-01-01

    The macronuclear genome of the ciliate Oxytricha trifallax displays an extreme and unique eukaryotic genome architecture with extensive genomic variation. During sexual genome development, the expressed, somatic macronuclear genome is whittled down to the genic portion of a small fraction (∼5%) of its precursor “silent” germline micronuclear genome by a process of “unscrambling” and fragmentation. The tiny macronuclear “nanochromosomes” typically encode single, protein-coding genes (a small portion, 10%, encode 2–8 genes), have minimal noncoding regions, and are differentially amplified to an average of ∼2,000 copies. We report the high-quality genome assembly of ∼16,000 complete nanochromosomes (∼50 Mb haploid genome size) that vary from 469 bp to 66 kb long (mean ∼3.2 kb) and encode ∼18,500 genes. Alternative DNA fragmentation processes ∼10% of the nanochromosomes into multiple isoforms that usually encode complete genes. Nucleotide diversity in the macronucleus is very high (SNP heterozygosity is ∼4.0%), suggesting that Oxytricha trifallax may have one of the largest known effective population sizes of eukaryotes. Comparison to other ciliates with nonscrambled genomes and long macronuclear chromosomes (on the order of 100 kb) suggests several candidate proteins that could be involved in genome rearrangement, including domesticated MULE and IS1595-like DDE transposases. The assembly of the highly fragmented Oxytricha macronuclear genome is the first completed genome with such an unusual architecture. This genome sequence provides tantalizing glimpses into novel molecular biology and evolution. For example, Oxytricha maintains tens of millions of telomeres per cell and has also evolved an intriguing expansion of telomere end-binding proteins. In conjunction with the micronuclear genome in progress, the O. trifallax macronuclear genome will provide an invaluable resource for investigating programmed genome rearrangements, complementing

  14. Microeconomic principles explain an optimal genome size in bacteria.

    PubMed

    Ranea, Juan A G; Grant, Alastair; Thornton, Janet M; Orengo, Christine A

    2005-01-01

    Bacteria can clearly enhance their survival by expanding their genetic repertoire. However, the tight packing of the bacterial genome and the fact that the most evolved species do not necessarily have the biggest genomes suggest there are other evolutionary factors limiting their genome expansion. To clarify these restrictions on size, we studied those protein families contributing most significantly to bacterial-genome complexity. We found that all bacteria apply the same basic and ancestral 'molecular technology' to optimize their reproductive efficiency. The same microeconomics principles that define the optimum size in a factory can also explain the existence of a statistical optimum in bacterial genome size. This optimum is reached when the bacterial genome obtains the maximum metabolic complexity (revenue) for minimal regulatory genes (logistic cost).

  15. Production of viable homozygous, doubled haploid channel catfish (Ictalurus punctatus)

    USDA-ARS?s Scientific Manuscript database

    Production of doubled haploids via mitotic gynogenesis is a useful tool for the creation of completely inbred fish. In order to produce viable doubled haploid channel catfish, we utilized hydrostatic pressure or thermal treatments on eggs fertilized with sperm that had been exposed to ultraviolet l...

  16. Genome size diversity in angiosperms and its influence on gene space.

    PubMed

    Dodsworth, Steven; Leitch, Andrew R; Leitch, Ilia J

    2015-12-01

    Genome size varies c. 2400-fold in angiosperms (flowering plants), although the range of genome size is skewed towards small genomes, with a mean genome size of 1C=5.7Gb. One of the most crucial factors governing genome size in angiosperms is the relative amount and activity of repetitive elements. Recently, there have been new insights into how these repeats, previously discarded as 'junk' DNA, can have a significant impact on gene space (i.e. the part of the genome comprising all the genes and gene-related DNA). Here we review these new findings and explore in what ways genome size itself plays a role in influencing how repeats impact genome dynamics and gene space, including gene expression. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.

  17. REGULATION OF GEOGRAPHIC VARIABILITY IN HAPLOID:DIPLOD RATIOS OF BIPHASIC SEAWEED LIFE CYCLES(1).

    PubMed

    da Silva Vieira, Vasco Manuel Nobre de Carvalho; Santos, Rui Orlando Pimenta

    2012-08-01

    The relative abundance of haploid and diploid individuals (H:D) in isomorphic marine algal biphasic cycles varies spatially, but only if vital rates of haploid and diploid phases vary differently with environmental conditions (i.e. conditional differentiation between phases). Vital rates of isomorphic phases in particular environments may be determined by subtle morphological or physiological differences. Herein, we test numerically how geographic variability in H:D is regulated by conditional differentiation between isomorphic life phases and the type of life strategy of populations (i.e. life cycles dominated by reproduction, survival or growth). Simulation conditions were selected using available data on H:D spatial variability in seaweeds. Conditional differentiation between ploidy phases had a small effect on the H:D variability for species with life strategies that invest either in fertility or in growth. Conversely, species with life strategies that invest mainly in survival, exhibited high variability in H:D through a conditional differentiation in stasis (the probability of staying in the same size class), breakage (the probability of changing to a smaller size class) or growth (the probability of changing to a bigger size class). These results were consistent with observed geographic variability in H:D of natural marine algae populations. © 2012 Phycological Society of America.

  18. Dynamics of genome size evolution in birds and mammals.

    PubMed

    Kapusta, Aurélie; Suh, Alexander; Feschotte, Cédric

    2017-02-21

    Genome size in mammals and birds shows remarkably little interspecific variation compared with other taxa. However, genome sequencing has revealed that many mammal and bird lineages have experienced differential rates of transposable element (TE) accumulation, which would be predicted to cause substantial variation in genome size between species. Thus, we hypothesize that there has been covariation between the amount of DNA gained by transposition and lost by deletion during mammal and avian evolution, resulting in genome size equilibrium. To test this model, we develop computational methods to quantify the amount of DNA gained by TE expansion and lost by deletion over the last 100 My in the lineages of 10 species of eutherian mammals and 24 species of birds. The results reveal extensive variation in the amount of DNA gained via lineage-specific transposition, but that DNA loss counteracted this expansion to various extents across lineages. Our analysis of the rate and size spectrum of deletion events implies that DNA removal in both mammals and birds has proceeded mostly through large segmental deletions (>10 kb). These findings support a unified "accordion" model of genome size evolution in eukaryotes whereby DNA loss counteracting TE expansion is a major determinant of genome size. Furthermore, we propose that extensive DNA loss, and not necessarily a dearth of TE activity, has been the primary force maintaining the greater genomic compaction of flying birds and bats relative to their flightless relatives.

  19. Dynamics of genome size evolution in birds and mammals

    PubMed Central

    Feschotte, Cédric

    2017-01-01

    Genome size in mammals and birds shows remarkably little interspecific variation compared with other taxa. However, genome sequencing has revealed that many mammal and bird lineages have experienced differential rates of transposable element (TE) accumulation, which would be predicted to cause substantial variation in genome size between species. Thus, we hypothesize that there has been covariation between the amount of DNA gained by transposition and lost by deletion during mammal and avian evolution, resulting in genome size equilibrium. To test this model, we develop computational methods to quantify the amount of DNA gained by TE expansion and lost by deletion over the last 100 My in the lineages of 10 species of eutherian mammals and 24 species of birds. The results reveal extensive variation in the amount of DNA gained via lineage-specific transposition, but that DNA loss counteracted this expansion to various extents across lineages. Our analysis of the rate and size spectrum of deletion events implies that DNA removal in both mammals and birds has proceeded mostly through large segmental deletions (>10 kb). These findings support a unified “accordion” model of genome size evolution in eukaryotes whereby DNA loss counteracting TE expansion is a major determinant of genome size. Furthermore, we propose that extensive DNA loss, and not necessarily a dearth of TE activity, has been the primary force maintaining the greater genomic compaction of flying birds and bats relative to their flightless relatives. PMID:28179571

  20. Genome size and chromosome number in velvet worms (Onychophora).

    PubMed

    Jeffery, Nicholas W; Oliveira, Ivo S; Gregory, T Ryan; Rowell, David M; Mayer, Georg

    2012-12-01

    The Onychophora (velvet worms) represents a small group of invertebrates (~180 valid species), which is commonly united with Tardigrada and Arthropoda in a clade called Panarthropoda. As with the majority of invertebrate taxa, genome size data are very limited for the Onychophora, with only one previously published estimate. Here we use both flow cytometry and Feulgen image analysis densitometry to provide genome size estimates for seven species of velvet worms from both major subgroups, Peripatidae and Peripatopsidae, along with karyotype data for each species. Genome sizes in these species range from roughly 5-19 pg, with densitometric estimates being slightly larger than those obtained by flow cytometry for all species. Chromosome numbers range from 2n = 8 to 2n = 54. No relationship is evident between genome size, chromosome number, or reproductive mode. Various avenues for future genomic research are presented based on these results.

  1. Maximization of Markers Linked in Coupling for Tetraploid Potatoes via Monoparental Haploids

    PubMed Central

    Bartkiewicz, Annette M.; Chilla, Friederike; Terefe-Ayana, Diro; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Linde, Marcus; Debener, Thomas

    2018-01-01

    Haploid potato populations derived from a single tetraploid donor constitute an efficient strategy to analyze markers segregating from a single donor genotype. Analysis of marker segregation in populations derived from crosses between polysomic tetraploids is complicated by a maximum of eight segregating alleles, multiple dosages of the markers and problems related to linkage analysis of marker segregation in repulsion. Here, we present data on two monoparental haploid populations generated by prickle pollination of two tetraploid cultivars with Solanum phureja and genotyped with the 12.8 k SolCAP single nucleotide polymorphism (SNP) array. We show that in a population of monoparental haploids, the number of biallelic SNP markers segregating in linkage to loci from the tetraploid donor genotype is much larger than in putative crosses of this genotype to a diverse selection of 125 tetraploid cultivars. Although this strategy is more laborious than conventional breeding, the generation of haploid progeny for efficient marker analysis is straightforward if morphological markers and flow cytometry are utilized to select true haploid progeny. The level of introgressed fragments from S. phureja, the haploid inducer, is very low, supporting its suitability for genetic analysis. Mapping with single-dose markers allowed the analysis of quantitative trait loci (QTL) for four phenotypic traits. PMID:29868076

  2. A comprehensively molecular haplotype-resolved genome of a European individual

    PubMed Central

    Suk, Eun-Kyung; McEwen, Gayle K.; Duitama, Jorge; Nowick, Katja; Schulz, Sabrina; Palczewski, Stefanie; Schreiber, Stefan; Holloway, Dustin T.; McLaughlin, Stephen; Peckham, Heather; Lee, Clarence; Huebsch, Thomas; Hoehe, Margret R.

    2011-01-01

    Independent determination of both haplotype sequences of an individual genome is essential to relate genetic variation to genome function, phenotype, and disease. To address the importance of phase, we have generated the most complete haplotype-resolved genome to date, “Max Planck One” (MP1), by fosmid pool-based next generation sequencing. Virtually all SNPs (>99%) and 80,000 indels were phased into haploid sequences of up to 6.3 Mb (N50 ∼1 Mb). The completeness of phasing allowed determination of the concrete molecular haplotype pairs for the vast majority of genes (81%) including potential regulatory sequences, of which >90% were found to be constituted by two different molecular forms. A subset of 159 genes with potentially severe mutations in either cis or trans configurations exemplified in particular the role of phase for gene function, disease, and clinical interpretation of personal genomes (e.g., BRCA1). Extended genomic regions harboring manifold combinations of physically and/or functionally related genes and regulatory elements were resolved into their underlying “haploid landscapes,” which may define the functional genome. Moreover, the majority of genes and functional sequences were found to contain individual or rare SNPs, which cannot be phased from population data alone, emphasizing the importance of molecular phasing for characterizing a genome in its molecular individuality. Our work provides the foundation to understand that the distinction of molecular haplotypes is essential to resolve the (inherently individual) biology of genes, genomes, and disease, establishing a reference point for “phase-sensitive” personal genomics. MP1's annotated haploid genomes are available as a public resource. PMID:21813624

  3. Patterns of genome size diversity in bats (order Chiroptera).

    PubMed

    Smith, Jillian D L; Bickham, John W; Gregory, T Ryan

    2013-08-01

    Despite being a group of particular interest in considering relationships between genome size and metabolic parameters, bats have not been well studied from this perspective. This study presents new estimates for 121 "microbat" species from 12 families and complements a previous study on members of the family Pteropodidae ("megabats"). The results confirm that diversity in genome size in bats is very limited even compared with other mammals, varying approximately 2-fold from 1.63 pg in Lophostoma carrikeri to 3.17 pg in Rhinopoma hardwickii and averaging only 2.35 pg ± 0.02 SE (versus 3.5 pg overall for mammals). However, contrary to some other vertebrate groups, and perhaps owing to the narrow range observed, genome size correlations were not apparent with any chromosomal, physiological, flight-related, developmental, or ecological characteristics within the order Chiroptera. Genome size is positively correlated with measures of body size in bats, though the strength of the relationships differs between pteropodids ("megabats") and nonpteropodids ("microbats").

  4. Whole genome duplication and transposable element proliferation drive genome expansion in Corydoradinae catfishes

    PubMed Central

    Marburger, Sarah; Alexandrou, Markos A.; Creer, Simon

    2018-01-01

    Genome size varies significantly across eukaryotic taxa and the largest changes are typically driven by macro-mutations such as whole genome duplications (WGDs) and proliferation of repetitive elements. These two processes may affect the evolutionary potential of lineages by increasing genetic variation and changing gene expression. Here, we elucidate the evolutionary history and mechanisms underpinning genome size variation in a species-rich group of Neotropical catfishes (Corydoradinae) with extreme variation in genome size—0.6 to 4.4 pg per haploid cell. First, genome size was quantified in 65 species and mapped onto a novel fossil-calibrated phylogeny. Two evolutionary shifts in genome size were identified across the tree—the first between 43 and 49 Ma (95% highest posterior density (HPD) 36.2–68.1 Ma) and the second at approximately 19 Ma (95% HPD 15.3–30.14 Ma). Second, restriction-site-associated DNA (RAD) sequencing was used to identify potential WGD events and quantify transposable element (TE) abundance in different lineages. Evidence of two lineage-scale WGDs was identified across the phylogeny, the first event occurring between 54 and 66 Ma (95% HPD 42.56–99.5 Ma) and the second at 20–30 Ma (95% HPD 15.3–45 Ma) based on haplotype numbers per contig and between 35 and 44 Ma (95% HPD 30.29–64.51 Ma) and 20–30 Ma (95% HPD 15.3–45 Ma) based on SNP read ratios. TE abundance increased considerably in parallel with genome size, with a single TE-family (TC1-IS630-Pogo) showing several increases across the Corydoradinae, with the most recent at 20–30 Ma (95% HPD 15.3–45 Ma) and an older event at 35–44 Ma (95% HPD 30.29–64.51 Ma). We identified signals congruent with two WGD duplication events, as well as an increase in TE abundance across different lineages, making the Corydoradinae an excellent model system to study the effects of WGD and TEs on genome and organismal evolution. PMID:29445022

  5. Intrapopulation Genome Size Variation in D. melanogaster Reflects Life History Variation and Plasticity

    PubMed Central

    Ellis, Lisa L.; Huang, Wen; Quinn, Andrew M.; Ahuja, Astha; Alfrejd, Ben; Gomez, Francisco E.; Hjelmen, Carl E.; Moore, Kristi L.; Mackay, Trudy F. C.; Johnston, J. Spencer; Tarone, Aaron M.

    2014-01-01

    We determined female genome sizes using flow cytometry for 211 Drosophila melanogaster sequenced inbred strains from the Drosophila Genetic Reference Panel, and found significant conspecific and intrapopulation variation in genome size. We also compared several life history traits for 25 lines with large and 25 lines with small genomes in three thermal environments, and found that genome size as well as genome size by temperature interactions significantly correlated with survival to pupation and adulthood, time to pupation, female pupal mass, and female eclosion rates. Genome size accounted for up to 23% of the variation in developmental phenotypes, but the contribution of genome size to variation in life history traits was plastic and varied according to the thermal environment. Expression data implicate differences in metabolism that correspond to genome size variation. These results indicate that significant genome size variation exists within D. melanogaster and this variation may impact the evolutionary ecology of the species. Genome size variation accounts for a significant portion of life history variation in an environmentally dependent manner, suggesting that potential fitness effects associated with genome size variation also depend on environmental conditions. PMID:25057905

  6. Enterovirus D68 receptor requirements unveiled by haploid genetics

    PubMed Central

    Baggen, Jim; Thibaut, Hendrik Jan; Staring, Jacqueline; Jae, Lucas T.; Liu, Yue; Guo, Hongbo; Slager, Jasper J.; de Bruin, Jost W.; van Vliet, Arno L. W.; Blomen, Vincent A.; Overduin, Pieter; Sheng, Ju; de Haan, Cornelis A. M.; de Vries, Erik; Meijer, Adam; Rossmann, Michael G.; Brummelkamp, Thijn R.; van Kuppeveld, Frank J. M.

    2016-01-01

    Enterovirus D68 (EV-D68) is an emerging pathogen that can cause severe respiratory disease and is associated with cases of paralysis, especially among children. Heretofore, information on host factor requirements for EV-D68 infection is scarce. Haploid genetic screening is a powerful tool to reveal factors involved in the entry of pathogens. We performed a genome-wide haploid screen with the EV-D68 prototype Fermon strain to obtain a comprehensive overview of cellular factors supporting EV-D68 infection. We identified and confirmed several genes involved in sialic acid (Sia) biosynthesis, transport, and conjugation to be essential for infection. Moreover, by using knockout cell lines and gene reconstitution, we showed that both α2,6- and α2,3-linked Sia can be used as functional cellular EV-D68 receptors. Importantly, the screen did not reveal a specific protein receptor, suggesting that EV-D68 can use multiple redundant sialylated receptors. Upon testing recent clinical strains, we identified strains that showed a similar Sia dependency, whereas others could infect cells lacking surface Sia, indicating they can use an alternative, nonsialylated receptor. Nevertheless, these Sia-independent strains were still able to bind Sia on human erythrocytes, raising the possibility that these viruses can use multiple receptors. Sequence comparison of Sia-dependent and Sia-independent EV-D68 strains showed that many changes occurred near the canyon that might allow alternative receptor binding. Collectively, our findings provide insights into the identity of the EV-D68 receptor and suggest the possible existence of Sia-independent viruses, which are essential for understanding tropism and disease. PMID:26787879

  7. Genome size expansion and the relationship between nuclear DNA content and spore size in the Asplenium monanthes fern complex (Aspleniaceae)

    PubMed Central

    2013-01-01

    Background Homosporous ferns are distinctive amongst the land plant lineages for their high chromosome numbers and enigmatic genomes. Genome size measurements are an under exploited tool in homosporous ferns and show great potential to provide an overview of the mechanisms that define genome evolution in these ferns. The aim of this study is to investigate the evolution of genome size and the relationship between genome size and spore size within the apomictic Asplenium monanthes fern complex and related lineages. Results Comparative analyses to test for a relationship between spore size and genome size show that they are not correlated. The data do however provide evidence for marked genome size variation between species in this group. These results indicate that Asplenium monanthes has undergone a two-fold expansion in genome size. Conclusions Our findings challenge the widely held assumption that spore size can be used to infer ploidy levels within apomictic fern complexes. We argue that the observed genome size variation is likely to have arisen via increases in both chromosome number due to polyploidy and chromosome size due to amplification of repetitive DNA (e.g. transposable elements, especially retrotransposons). However, to date the latter has not been considered to be an important process of genome evolution within homosporous ferns. We infer that genome evolution, at least in some homosporous fern lineages, is a more dynamic process than existing studies would suggest. PMID:24354467

  8. Genome Sequences of Marine Shrimp Exopalaemon carinicauda Holthuis Provide Insights into Genome Size Evolution of Caridea.

    PubMed

    Yuan, Jianbo; Gao, Yi; Zhang, Xiaojun; Wei, Jiankai; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

    2017-07-05

    Crustacea, particularly Decapoda, contains many economically important species, such as shrimps and crabs. Crustaceans exhibit enormous (nearly 500-fold) variability in genome size. However, limited genome resources are available for investigating these species. Exopalaemon carinicauda Holthuis, an economical caridean shrimp, is a potential ideal experimental animal for research on crustaceans. In this study, we performed low-coverage sequencing and de novo assembly of the E. carinicauda genome. The assembly covers more than 95% of coding regions. E. carinicauda possesses a large complex genome (5.73 Gb), with size twice higher than those of many decapod shrimps. As such, comparative genomic analyses were implied to investigate factors affecting genome size evolution of decapods. However, clues associated with genome duplication were not identified, and few horizontally transferred sequences were detected. Ultimately, the burst of transposable elements, especially retrotransposons, was determined as the major factor influencing genome expansion. A total of 2 Gb repeats were identified, and RTE-BovB, Jockey, Gypsy, and DIRS were the four major retrotransposons that significantly expanded. Both recent (Jockey and Gypsy) and ancestral (DIRS) originated retrotransposons responsible for the genome evolution. The E. carinicauda genome also exhibited potential for the genomic and experimental research of shrimps.

  9. The evolution of sex chromosomes in organisms with separate haploid sexes.

    PubMed

    Immler, Simone; Otto, Sarah Perin

    2015-03-01

    The evolution of dimorphic sex chromosomes is driven largely by the evolution of reduced recombination and the subsequent accumulation of deleterious mutations. Although these processes are increasingly well understood in diploid organisms, the evolution of dimorphic sex chromosomes in haploid organisms (U/V) has been virtually unstudied theoretically. We analyze a model to investigate the evolution of linkage between fitness loci and the sex-determining region in U/V species. In a second step, we test how prone nonrecombining regions are to degeneration due to accumulation of deleterious mutations. Our modeling predicts that the decay of recombination on the sex chromosomes and the addition of strata via fusions will be just as much a part of the evolution of haploid sex chromosomes as in diploid sex chromosome systems. Reduced recombination is broadly favored, as long as there is some fitness difference between haploid males and females. The degeneration of the sex-determining region due to the accumulation of deleterious mutations is expected to be slower in haploid organisms because of the absence of masking. Nevertheless, balancing selection often drives greater differentiation between the U/V sex chromosomes than in X/Y and Z/W systems. We summarize empirical evidence for haploid sex chromosome evolution and discuss our predictions in light of these findings. © 2015 The Author(s).

  10. Generation of genetically modified mice using CRISPR/Cas9 and haploid embryonic stem cell systems

    PubMed Central

    JIN, Li-Fang; LI, Jin-Song

    2016-01-01

    With the development of high-throughput sequencing technology in the post-genomic era, researchers have concentrated their efforts on elucidating the relationships between genes and their corresponding functions. Recently, important progress has been achieved in the generation of genetically modified mice based on CRISPR/Cas9 and haploid embryonic stem cell (haESC) approaches, which provide new platforms for gene function analysis, human disease modeling, and gene therapy. Here, we review the CRISPR/Cas9 and haESC technology for the generation of genetically modified mice and discuss the key challenges in the application of these approaches. PMID:27469251

  11. A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing.

    PubMed

    Gao, Guangtu; Nome, Torfinn; Pearse, Devon E; Moen, Thomas; Naish, Kerry A; Thorgaard, Gary H; Lien, Sigbjørn; Palti, Yniv

    2018-01-01

    Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout ( Oncorhynchus mykiss ), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup , followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity

  12. Evolution and maintenance of haploid-diploid life cycles in natural populations: The case of the marine brown alga Ectocarpus.

    PubMed

    Couceiro, Lucía; Le Gac, Mickael; Hunsperger, Heather M; Mauger, Stéphane; Destombe, Christophe; Cock, J Mark; Ahmed, Sophia; Coelho, Susana M; Valero, Myriam; Peters, Akira F

    2015-07-01

    The evolutionary stability of haploid-diploid life cycles is still controversial. Mathematical models indicate that niche differences between ploidy phases may be a necessary condition for the evolution and maintenance of these life cycles. Nevertheless, experimental support for this prediction remains elusive. In the present work, we explored this hypothesis in natural populations of the brown alga Ectocarpus. Consistent with the life cycle described in culture, Ectocarpus crouaniorum in NW France and E. siliculosus in SW Italy exhibited an alternation between haploid gametophytes and diploid sporophytes. Our field data invalidated, however, the long-standing view of an isomorphic alternation of generations. Gametophytes and sporophytes displayed marked differences in size and, conforming to theoretical predictions, occupied different spatiotemporal niches. Gametophytes were found almost exclusively on the alga Scytosiphon lomentaria during spring whereas sporophytes were present year-round on abiotic substrata. Paradoxically, E. siliculosus in NW France exhibited similar habitat usage despite the absence of alternation of ploidy phases. Diploid sporophytes grew both epilithically and epiphytically, and this mainly asexual population gained the same ecological advantage postulated for haploid-diploid populations. Consequently, an ecological interpretation of the niche differences between haploid and diploid individuals does not seem to satisfactorily explain the evolution of the Ectocarpus life cycle. © 2015 The Author(s). Evolution © 2015 The Society for the Study of Evolution.

  13. The effects of quantitative fecundity in the haploid stage on reproductive success and diploid fitness in the aquatic peat moss Sphagnum macrophyllum

    PubMed Central

    Johnson, M G; Shaw, A J

    2016-01-01

    A major question in evolutionary biology is how mating patterns affect the fitness of offspring. However, in animals and seed plants it is virtually impossible to investigate the effects of specific gamete genotypes. In bryophytes, haploid gametophytes grow via clonal propagation and produce millions of genetically identical gametes throughout a population. The main goal of this research was to test whether gamete identity has an effect on the fitness of their diploid offspring in a population of the aquatic peat moss Sphagnum macrophyllum. We observed a heavily male-biased sex ratio in gametophyte plants (ramets) and in multilocus microsatellite genotypes (genets). There was a steeper relationship between mating success (number of different haploid mates) and fecundity (number of diploid offspring) for male genets compared with female genets. At the sporophyte level, we observed a weak effect of inbreeding on offspring fitness, but no effect of brood size (number of sporophytes per maternal ramet). Instead, the identities of the haploid male and haploid female parents were significant contributors to variance in fitness of sporophyte offspring in the population. Our results suggest that intrasexual gametophyte/gamete competition may play a role in determining mating success in this population. PMID:26905464

  14. The effects of quantitative fecundity in the haploid stage on reproductive success and diploid fitness in the aquatic peat moss Sphagnum macrophyllum.

    PubMed

    Johnson, M G; Shaw, A J

    2016-06-01

    A major question in evolutionary biology is how mating patterns affect the fitness of offspring. However, in animals and seed plants it is virtually impossible to investigate the effects of specific gamete genotypes. In bryophytes, haploid gametophytes grow via clonal propagation and produce millions of genetically identical gametes throughout a population. The main goal of this research was to test whether gamete identity has an effect on the fitness of their diploid offspring in a population of the aquatic peat moss Sphagnum macrophyllum. We observed a heavily male-biased sex ratio in gametophyte plants (ramets) and in multilocus microsatellite genotypes (genets). There was a steeper relationship between mating success (number of different haploid mates) and fecundity (number of diploid offspring) for male genets compared with female genets. At the sporophyte level, we observed a weak effect of inbreeding on offspring fitness, but no effect of brood size (number of sporophytes per maternal ramet). Instead, the identities of the haploid male and haploid female parents were significant contributors to variance in fitness of sporophyte offspring in the population. Our results suggest that intrasexual gametophyte/gamete competition may play a role in determining mating success in this population.

  15. Mapping PrBn and Other Quantitative Trait Loci Responsible for the Control of Homeologous Chromosome Pairing in Oilseed Rape (Brassica napus L.) Haploids

    PubMed Central

    Liu, Zhiqian; Adamczyk, Katarzyna; Manzanares-Dauleux, Maria; Eber, Frédérique; Lucas, Marie-Odile; Delourme, Régine; Chèvre, Anne Marie; Jenczewski, Eric

    2006-01-01

    In allopolyploid species, fair meiosis could be challenged by homeologous chromosome pairing and is usually achieved by the action of homeologous pairing suppressor genes. Oilseed rape (Brassica napus) haploids (AC, n = 19) represent an attractive model for studying the mechanisms used by allopolyploids to ensure the diploid-like meiotic pairing pattern. In oilseed rape haploids, homeologous chromosome pairing at metaphase I was found to be genetically based and controlled by a major gene, PrBn, segregating in a background of polygenic variation. In this study, we have mapped PrBn within a 10-cM interval on the C genome linkage group DY15 and shown that PrBn displays incomplete penetrance or variable expressivity. We have identified three to six minor QTL/BTL that have slight additive effects on the amount of pairing at metaphase I but do not interact with PrBn. We have also detected a number of other loci that interact epistatically, notably with PrBn. Our results support the idea that, as in other polyploid species, metaphase I homeologous pairing in oilseed rape haploids is controlled by an integrated system of several genes, which function in a complex manner. PMID:16951054

  16. Genome size and metabolic intensity in tetrapods: a tale of two lines

    PubMed Central

    Vinogradov, Alexander E; Anatskaya, Olga V

    2005-01-01

    We show the negative link between genome size and metabolic intensity in tetrapods, using the heart index (relative heart mass) as a unified indicator of metabolic intensity in poikilothermal and homeothermal animals. We found two separate regression lines of heart index on genome size for reptiles–birds and amphibians–mammals (the slope of regression is steeper in reptiles–birds). We also show a negative correlation between GC content and nucleosome formation potential in vertebrate DNA, and, consistent with this relationship, a positive correlation between genome GC content and nuclear size (independent of genome size). It is known that there are two separate regression lines of genome GC content on genome size for reptiles–birds and amphibians–mammals: reptiles–birds have the relatively higher GC content (for their genome sizes) compared to amphibians–mammals. Our results suggest uniting all these data into one concept. The slope of negative regression between GC content and nucleosome formation potential is steeper in exons than in non-coding DNA (where nucleosome formation potential is generally higher), which indicates a special role of non-coding DNA for orderly chromatin organization. The chromatin condensation and nuclear size are supposed to be key parameters that accommodate the effects of both genome size and GC content and connect them with metabolic intensity. Our data suggest that the reptilian–birds clade evolved special relationships among these parameters, whereas mammals preserved the amphibian-like relationships. Surprisingly, mammals, although acquiring a more complex general organization, seem to retain certain genome-related properties that are similar to amphibians. At the same time, the slope of regression between nucleosome formation potential and GC content is steeper in poikilothermal than in homeothermal genomes, which suggests that mammals and birds acquired certain common features of genomic organization. PMID:16519230

  17. No evidence that sex and transposable elements drive genome size variation in evening primroses.

    PubMed

    Ågren, J Arvid; Greiner, Stephan; Johnson, Marc T J; Wright, Stephen I

    2015-04-01

    Genome size varies dramatically across species, but despite an abundance of attention there is little agreement on the relative contributions of selective and neutral processes in governing this variation. The rate of sex can potentially play an important role in genome size evolution because of its effect on the efficacy of selection and transmission of transposable elements (TEs). Here, we used a phylogenetic comparative approach and whole genome sequencing to investigate the contribution of sex and TE content to genome size variation in the evening primrose (Oenothera) genus. We determined genome size using flow cytometry for 30 species that vary in genetic system and find that variation in sexual/asexual reproduction cannot explain the almost twofold variation in genome size. Moreover, using whole genome sequences of three species of varying genome sizes and reproductive system, we found that genome size was not associated with TE abundance; instead the larger genomes had a higher abundance of simple sequence repeats. Although it has long been clear that sexual reproduction may affect various aspects of genome evolution in general and TE evolution in particular, it does not appear to have played a major role in genome size evolution in the evening primroses. © 2015 The Author(s).

  18. Competition between the sperm of a single male can increase the evolutionary rate of haploid expressed genes.

    PubMed

    Ezawa, Kiyoshi; Innan, Hideki

    2013-07-01

    The population genetic behavior of mutations in sperm genes is theoretically investigated. We modeled the processes at two levels. One is the standard population genetic process, in which the population allele frequencies change generation by generation, depending on the difference in selective advantages. The other is the sperm competition during each genetic transmission from one generation to the next generation. For the sperm competition process, we formulate the situation where a huge number of sperm with alleles A and B, produced by a single heterozygous male, compete to fertilize a single egg. This "minimal model" demonstrates that a very slight difference in sperm performance amounts to quite a large difference between the alleles' winning probabilities. By incorporating this effect of paternity-sharing sperm competition into the standard population genetic process, we show that fierce sperm competition can enhance the fixation probability of a mutation with a very small phenotypic effect at the single-sperm level, suggesting a contribution of sperm competition to rapid amino acid substitutions in haploid-expressed sperm genes. Considering recent genome-wide demonstrations that a substantial fraction of the mammalian sperm genes are haploid expressed, our model could provide a potential explanation of rapid evolution of sperm genes with a wide variety of functions (as long as they are expressed in the haploid phase). Another advantage of our model is that it is applicable to a wide range of species, irrespective of whether the species is externally fertilizing, polygamous, or monogamous. The theoretical result was applied to mammalian data to estimate the selection intensity on nonsynonymous mutations in sperm genes.

  19. Genetic Analysis of Haploids from Industrial Strains of Baker's Yeast

    PubMed Central

    Oda, Yuji; Ouchi, Kozo

    1989-01-01

    Strains of baker's yeast conventionally used by the baking industry in Japan were tested for the ability to sporulate and produce viable haploid spores. Three isolates which possessed the properties of baker's yeasts were obtained from single spores. Each strain was a haploid, and one of these strains, YOY34, was characterized. YOY34 fermented maltose and sucrose, but did not utilize galactose, unlike its parental strain. Genetic analysis showed that YOY34 carried two MAL genes, one functional and one cryptic; two SUC genes; and one defective gal gene. The genotype of YOY34 was identified as MATα MAL1 MAL3g SUC2 SUC4 gall. The MAL1 gene from this haploid was constitutively expressed, was dominant over other wild-type MAL tester genes, and gave a weak sucrose fermentation. YOY34 was suitable for both bakery products, like conventional baker's yeasts, and for genetic analysis, like laboratory strains. PMID:16347967

  20. The Peculiar Landscape of Repetitive Sequences in the Olive (Olea europaea L.) Genome

    PubMed Central

    Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

    2014-01-01

    Analyzing genome structure in different species allows to gain an insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome. PMID:24671744

  1. The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.

    PubMed

    Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

    2014-04-01

    Analyzing genome structure in different species allows to gain an insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome.

  2. Parallel altitudinal clines reveal trends in adaptive evolution of genome size in Zea mays

    PubMed Central

    Berg, Jeremy J.; Birchler, James A.; Grote, Mark N.; Lorant, Anne; Quezada, Juvenal

    2018-01-01

    While the vast majority of genome size variation in plants is due to differences in repetitive sequence, we know little about how selection acts on repeat content in natural populations. Here we investigate parallel changes in intraspecific genome size and repeat content of domesticated maize (Zea mays) landraces and their wild relative teosinte across altitudinal gradients in Mesoamerica and South America. We combine genotyping, low coverage whole-genome sequence data, and flow cytometry to test for evidence of selection on genome size and individual repeat abundance. We find that population structure alone cannot explain the observed variation, implying that clinal patterns of genome size are maintained by natural selection. Our modeling additionally provides evidence of selection on individual heterochromatic knob repeats, likely due to their large individual contribution to genome size. To better understand the phenotypes driving selection on genome size, we conducted a growth chamber experiment using a population of highland teosinte exhibiting extensive variation in genome size. We find weak support for a positive correlation between genome size and cell size, but stronger support for a negative correlation between genome size and the rate of cell production. Reanalyzing published data of cell counts in maize shoot apical meristems, we then identify a negative correlation between cell production rate and flowering time. Together, our data suggest a model in which variation in genome size is driven by natural selection on flowering time across altitudinal clines, connecting intraspecific variation in repetitive sequence to important differences in adaptive phenotypes. PMID:29746459

  3. Exact Markov chains versus diffusion theory for haploid random mating.

    PubMed

    Tyvand, Peder A; Thorvaldsen, Steinar

    2010-05-01

    Exact discrete Markov chains are applied to the Wright-Fisher model and the Moran model of haploid random mating. Selection and mutations are neglected. At each discrete value of time t there is a given number n of diploid monoecious organisms. The evolution of the population distribution is given in diffusion variables, to compare the two models of random mating with their common diffusion limit. Only the Moran model converges uniformly to the diffusion limit near the boundary. The Wright-Fisher model allows the population size to change with the generations. Diffusion theory tends to under-predict the loss of genetic information when a population enters a bottleneck. 2010 Elsevier Inc. All rights reserved.

  4. In vitro propagation of the microsporidian pathogen Brachiola algerae and studies of its chromosome and ribosomal DNA organization in the context of the complete genome sequencing project.

    PubMed

    Belkorchia, Abdel; Biderre, Corinne; Militon, Cécile; Polonais, Valérie; Wincker, Patrick; Jubin, Claire; Delbac, Frédéric; Peyretaillade, Eric; Peyret, Pierre

    2008-03-01

    Brachiola algerae has a broad host spectrum from human to mosquitoes. The successful infection of two mosquito cell lines (Mos55: embryonic cells and Sua 4.0: hemocyte-like cells) and a human cell line (HFF) highlights the efficient adaptive capacity of this microsporidian pathogen. The molecular karyotype of this microsporidian species was determined in the context of the B. algerae genome sequencing project, showing that its haploid genome consists of 30 chromosomal-sized DNAs ranging from 160 to 2240 kbp giving an estimated genome size of 23 Mbp. A contig of 12,269 bp including the DNA sequence of the B. algerae ribosomal transcription unit has been built from initial genomic sequences and the secondary structure of the large subunit rRNA constructed. The data obtained indicate that B. algerae should be an excellent parasitic model to understand genome evolution in relation to infectious capacity.

  5. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

    PubMed

    Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J; Leitch, Ilia J

    2015-01-01

    The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.

  6. Reassessment of the Genome Size in Elaeis guineensis and Elaeis oleifera, and Its Interspecific Hybrid

    PubMed Central

    Camillo, Julceia; Leão, André P; Alves, Alexandre A; Formighieri, Eduardo F; Azevedo, Ana LS; Nunes, Juliana D; de Capdeville, Guy; de A Mattos, Jean K; Souza, Manoel T

    2014-01-01

    Aiming at generating a comprehensive genomic database on Elaeis spp., our group is leading several R&D initiatives with Elaeis guineensis (African oil palm) and Elaeis oleifera (American oil palm), including the whole-genome sequencing of the last. Genome size estimates currently available for this genus are controversial, as they indicate that American oil palm genome is about half the size of the African oil palm genome and that the genome of the interspecific hybrid is bigger than both the parental species genomes. We estimated the genome size of three E. guineensis genotypes, five E. oleifera genotypes, and two interspecific hybrids genotypes. On average, the genome size of E. guineensis is 4.32 ± 0.173 pg, while that of E. oleifera is 4.43 ± 0.018 pg. This indicates that both genomes are similar in size, even though E. oleifera is in fact bigger. As expected, the hybrid genome size is around the average of the two genomes, 4.40 ± 0.016 pg. Additionally, we demonstrate that both species present around 38% of GC content. As our results contradict the currently available data on Elaeis spp. genome sizes, we propose that the actual genome size of the Elaeis species is around 4 pg and that American oil palm possesses a larger genome than African oil palm. PMID:26203259

  7. Genomic Selection Outperforms Marker Assisted Selection for Grain Yield and Physiological Traits in a Maize Doubled Haploid Population Across Water Treatments.

    PubMed

    Cerrudo, Diego; Cao, Shiliang; Yuan, Yibing; Martinez, Carlos; Suarez, Edgar Antonio; Babu, Raman; Zhang, Xuecai; Trachsel, Samuel

    2018-01-01

    To increase genetic gain for tolerance to drought, we aimed to identify environmentally stable QTL in per se and testcross combination under well-watered (WW) and drought stressed (DS) conditions and evaluate the possible deployment of QTL using marker assisted and/or genomic selection (QTL/GS-MAS). A total of 169 doubled haploid lines derived from the cross between CML495 and LPSC7F64 and 190 testcrosses (tester CML494) were evaluated in a total of 11 treatment-by-population combinations under WW and DS conditions. In response to DS, grain yield (GY) and plant height (PHT) were reduced while time to anthesis and the anthesis silking interval (ASI) increased for both lines and hybrids. Forty-eight QTL were detected for a total of nine traits. The allele derived from CML495 generally increased trait values for anthesis, ASI, PHT, the normalized difference vegetative index (NDVI) and the green leaf area duration (GLAD; a composite trait of NDVI, PHT and senescence) while it reduced trait values for leaf rolling and senescence. The LOD scores for all detected QTL ranged from 2.0 to 7.2 explaining 4.4 to 19.4% of the observed phenotypic variance with R 2 ranging from 0 (GY, DS, lines) to 37.3% (PHT, WW, lines). Prediction accuracy of the model used for genomic selection was generally higher than phenotypic variance explained by the sum of QTL for individual traits indicative of the polygenic control of traits evaluated here. We therefore propose to use QTL-MAS in forward breeding to enrich the allelic frequency for a few desired traits with strong additive QTL in early selection cycles while GS-MAS could be used in more mature breeding programs to additionally capture alleles with smaller additive effects.

  8. Contrasting growth phenology of native and invasive forest shrubs mediated by genome size.

    PubMed

    Fridley, Jason D; Craddock, Alaä

    2015-08-01

    Examination of the significance of genome size to plant invasions has been largely restricted to its association with growth rate. We investigated the novel hypothesis that genome size is related to forest invasions through its association with growth phenology, as a result of the ability of large-genome species to grow more effectively through cell expansion at cool temperatures. We monitored the spring leaf phenology of 54 species of eastern USA deciduous forests, including native and invasive shrubs of six common genera. We used new measurements of genome size to evaluate its association with spring budbreak, cell size, summer leaf production rate, and photosynthetic capacity. In a phylogenetic hierarchical model that differentiated native and invasive species as a function of summer growth rate and spring budbreak timing, species with smaller genomes exhibited both faster growth and delayed budbreak compared with those with larger nuclear DNA content. Growth rate, but not budbreak timing, was associated with whether a species was native or invasive. Our results support genome size as a broad indicator of the growth behavior of woody species. Surprisingly, invaders of deciduous forests show the same small-genome tendencies of invaders of more open habitats, supporting genome size as a robust indicator of invasiveness. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  9. Competition Between the Sperm of a Single Male Can Increase the Evolutionary Rate of Haploid Expressed Genes

    PubMed Central

    Ezawa, Kiyoshi; Innan, Hideki

    2013-01-01

    The population genetic behavior of mutations in sperm genes is theoretically investigated. We modeled the processes at two levels. One is the standard population genetic process, in which the population allele frequencies change generation by generation, depending on the difference in selective advantages. The other is the sperm competition during each genetic transmission from one generation to the next generation. For the sperm competition process, we formulate the situation where a huge number of sperm with alleles A and B, produced by a single heterozygous male, compete to fertilize a single egg. This “minimal model” demonstrates that a very slight difference in sperm performance amounts to quite a large difference between the alleles’ winning probabilities. By incorporating this effect of paternity-sharing sperm competition into the standard population genetic process, we show that fierce sperm competition can enhance the fixation probability of a mutation with a very small phenotypic effect at the single-sperm level, suggesting a contribution of sperm competition to rapid amino acid substitutions in haploid-expressed sperm genes. Considering recent genome-wide demonstrations that a substantial fraction of the mammalian sperm genes are haploid expressed, our model could provide a potential explanation of rapid evolution of sperm genes with a wide variety of functions (as long as they are expressed in the haploid phase). Another advantage of our model is that it is applicable to a wide range of species, irrespective of whether the species is externally fertilizing, polygamous, or monogamous. The theoretical result was applied to mammalian data to estimate the selection intensity on nonsynonymous mutations in sperm genes. PMID:23666936

  10. Small genomes and large seeds: chromosome numbers, genome size and seed mass in diploid Aesculus species (Sapindaceae).

    PubMed

    Krahulcová, Anna; Trávnícek, Pavel; Krahulec, František; Rejmánek, Marcel

    2017-04-01

    Aesculus L. (horse chestnut, buckeye) is a genus of 12-19 extant woody species native to the temperate Northern Hemisphere. This genus is known for unusually large seeds among angiosperms. While chromosome counts are available for many Aesculus species, only one has had its genome size measured. The aim of this study is to provide more genome size data and analyse the relationship between genome size and seed mass in this genus. Chromosome numbers in root tip cuttings were confirmed for four species and reported for the first time for three additional species. Flow cytometric measurements of 2C nuclear DNA values were conducted on eight species, and mean seed mass values were estimated for the same taxa. The same chromosome number, 2 n = 40, was determined in all investigated taxa. Original measurements of 2C values for seven Aesculus species (eight taxa), added to just one reliable datum for A. hippocastanum , confirmed the notion that the genome size in this genus with relatively large seeds is surprisingly low, ranging from 0·955 pg 2C -1 in A. parviflora to 1·275 pg 2C -1 in A. glabra var. glabra. The chromosome number of 2 n = 40 seems to be conclusively the universal 2 n number for non-hybrid species in this genus. Aesculus genome sizes are relatively small, not only within its own family, Sapindaceae, but also within woody angiosperms. The genome sizes seem to be distinct and non-overlapping among the four major Aesculus clades. These results provide an extra support for the most recent reconstruction of Aesculus phylogeny. The correlation between the 2C values and seed masses in examined Aesculus species is slightly negative and not significant. However, when the four major clades are treated separately, there is consistent positive association between larger genome size and larger seed mass within individual lineages. © The Author 2017. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For

  11. Small genomes and large seeds: chromosome numbers, genome size and seed mass in diploid Aesculus species (Sapindaceae)

    PubMed Central

    Krahulcová, Anna; Trávníček, Pavel; Rejmánek, Marcel

    2017-01-01

    Background and Aims Aesculus L. (horse chestnut, buckeye) is a genus of 12–19 extant woody species native to the temperate Northern Hemisphere. This genus is known for unusually large seeds among angiosperms. While chromosome counts are available for many Aesculus species, only one has had its genome size measured. The aim of this study is to provide more genome size data and analyse the relationship between genome size and seed mass in this genus. Methods Chromosome numbers in root tip cuttings were confirmed for four species and reported for the first time for three additional species. Flow cytometric measurements of 2C nuclear DNA values were conducted on eight species, and mean seed mass values were estimated for the same taxa. Key Results The same chromosome number, 2n = 40, was determined in all investigated taxa. Original measurements of 2C values for seven Aesculus species (eight taxa), added to just one reliable datum for A. hippocastanum, confirmed the notion that the genome size in this genus with relatively large seeds is surprisingly low, ranging from 0·955 pg 2C–1 in A. parviflora to 1·275 pg 2C–1 in A. glabra var. glabra. Conclusions The chromosome number of 2n = 40 seems to be conclusively the universal 2n number for non-hybrid species in this genus. Aesculus genome sizes are relatively small, not only within its own family, Sapindaceae, but also within woody angiosperms. The genome sizes seem to be distinct and non-overlapping among the four major Aesculus clades. These results provide an extra support for the most recent reconstruction of Aesculus phylogeny. The correlation between the 2C values and seed masses in examined Aesculus species is slightly negative and not significant. However, when the four major clades are treated separately, there is consistent positive association between larger genome size and larger seed mass within individual lineages. PMID:28065925

  12. Maize Haploid Induction and Doubling II – Experience with Exotic and Elite Maize Populations

    USDA-ARS?s Scientific Manuscript database

    As a follow-up to our previous study, second year information will be presented addressing questions on haploid induction and doubling, utilizing exotic and elite maize. These projects result from collaborations between Iowa State Doubled Haploid Facility (http://www.plantbreeding.iastate.edu/DHF/D...

  13. Maize Haploid Induction and Doubling – Recent Experience with Exotic and Elite Maize Populations

    USDA-ARS?s Scientific Manuscript database

    Experience from three maize research projects utilizing the haploid inducer RWS x RWK-76 from the University of Hohenheim will be summarized. These projects result from collaborations between Iowa State Doubled Haploid Facility (http://www.plantbreeding.iastate.edu/DHF/DHF.htm) researchers and USDA...

  14. Evolution of genome size and complexity in the rhabdoviridae.

    PubMed

    Walker, Peter J; Firth, Cadhla; Widen, Steven G; Blasdell, Kim R; Guzman, Hilda; Wood, Thomas G; Paradkar, Prasad N; Holmes, Edward C; Tesh, Robert B; Vasilakis, Nikos

    2015-02-01

    RNA viruses exhibit substantial structural, ecological and genomic diversity. However, genome size in RNA viruses is likely limited by a high mutation rate, resulting in the evolution of various mechanisms to increase complexity while minimising genome expansion. Here we conduct a large-scale analysis of the genome sequences of 99 animal rhabdoviruses, including 45 genomes which we determined de novo, to identify patterns of genome expansion and the evolution of genome complexity. All but seven of the rhabdoviruses clustered into 17 well-supported monophyletic groups, of which eight corresponded to established genera, seven were assigned as new genera, and two were taxonomically ambiguous. We show that the acquisition and loss of new genes appears to have been a central theme of rhabdovirus evolution, and has been associated with the appearance of alternative, overlapping and consecutive ORFs within the major structural protein genes, and the insertion and loss of additional ORFs in each gene junction in a clade-specific manner. Changes in the lengths of gene junctions accounted for as much as 48.5% of the variation in genome size from the smallest to the largest genome, and the frequency with which new ORFs were observed increased in the 3' to 5' direction along the genome. We also identify several new families of accessory genes encoded in these regions, and show that non-canonical expression strategies involving TURBS-like termination-reinitiation, ribosomal frame-shifts and leaky ribosomal scanning appear to be common. We conclude that rhabdoviruses have an unusual capacity for genomic plasticity that may be linked to their discontinuous transcription strategy from the negative-sense single-stranded RNA genome, and propose a model that accounts for the regular occurrence of genome expansion and contraction throughout the evolution of the Rhabdoviridae.

  15. Evolution of Genome Size and Complexity in the Rhabdoviridae

    PubMed Central

    Walker, Peter J.; Firth, Cadhla; Widen, Steven G.; Blasdell, Kim R.; Guzman, Hilda; Wood, Thomas G.; Paradkar, Prasad N.; Holmes, Edward C.; Tesh, Robert B.; Vasilakis, Nikos

    2015-01-01

    RNA viruses exhibit substantial structural, ecological and genomic diversity. However, genome size in RNA viruses is likely limited by a high mutation rate, resulting in the evolution of various mechanisms to increase complexity while minimising genome expansion. Here we conduct a large-scale analysis of the genome sequences of 99 animal rhabdoviruses, including 45 genomes which we determined de novo, to identify patterns of genome expansion and the evolution of genome complexity. All but seven of the rhabdoviruses clustered into 17 well-supported monophyletic groups, of which eight corresponded to established genera, seven were assigned as new genera, and two were taxonomically ambiguous. We show that the acquisition and loss of new genes appears to have been a central theme of rhabdovirus evolution, and has been associated with the appearance of alternative, overlapping and consecutive ORFs within the major structural protein genes, and the insertion and loss of additional ORFs in each gene junction in a clade-specific manner. Changes in the lengths of gene junctions accounted for as much as 48.5% of the variation in genome size from the smallest to the largest genome, and the frequency with which new ORFs were observed increased in the 3’ to 5’ direction along the genome. We also identify several new families of accessory genes encoded in these regions, and show that non-canonical expression strategies involving TURBS-like termination-reinitiation, ribosomal frame-shifts and leaky ribosomal scanning appear to be common. We conclude that rhabdoviruses have an unusual capacity for genomic plasticity that may be linked to their discontinuous transcription strategy from the negative-sense single-stranded RNA genome, and propose a model that accounts for the regular occurrence of genome expansion and contraction throughout the evolution of the Rhabdoviridae. PMID:25679389

  16. Germplasm enhancement of maize: A look into haploid induction and chromosomal doubling of haploids from temperate-adapted tropical sources

    USDA-ARS?s Scientific Manuscript database

    Doubled haploid technology is used to develop completely homozygous inbred lines, where each of the chromatids making up a chromosome pair are identical. Two inbred lines, PHB47 and PHZ51, were used to make backcrosses to 18 maize landraces, generating 36 populations. The landraces were chosen bas...

  17. Genome size of Alexandrium catenella and Gracilariopsis lemaneiformis estimated by flow cytometry

    NASA Astrophysics Data System (ADS)

    Du, Qingwei; Sui, Zhenghong; Chang, Lianpeng; Wei, Huihui; Liu, Yuan; Mi, Ping; Shang, Erlei; Zeeshan, Niaz; Que, Zhou

    2016-08-01

    Flow cytometry (FCM) technique has been widely applied to estimating the genome size of various higher plants. However, there is few report about its application in algae. In this study, an optimized procedure of FCM was exploited to estimate the genome size of two eukaryotic algae. For analyzing Alexandrium catenella, an important red tide species, the whole cell instead of isolated nucleus was studied, and chicken erythrocytes were used as an internal reference. The genome size of A. catenella was estimated to be 56.48 ± 4.14 Gb (1C), approximately nineteen times larger than that of human genome. For analyzing Gracilariopsis lemaneiformis, an important economical red alga, the purified nucleus was employed, and Arabidopsis thaliana and Chondrus crispus were used as internal references, respectively. The genome size of Gp. lemaneiformis was 97.35 ± 2.58 Mb (1C) and 112.73 ± 14.00 Mb (1C), respectively, depending on the different internal references. The results of this research will promote the related studies on the genomics and evolution of these two species.

  18. A Genome-Wide Association Study Identifies Genomic Regions for Virulence in the Non-Model Organism Heterobasidion annosum s.s

    PubMed Central

    Dalman, Kerstin; Himmelstrand, Kajsa; Olson, Åke; Lind, Mårten; Brandström-Durling, Mikael; Stenlid, Jan

    2013-01-01

    The dense single nucleotide polymorphisms (SNP) panels needed for genome wide association (GWA) studies have hitherto been expensive to establish and use on non-model organisms. To overcome this, we used a next generation sequencing approach to both establish SNPs and to determine genotypes. We conducted a GWA study on a fungal species, analysing the virulence of Heterobasidion annosum s.s., a necrotrophic pathogen, on its hosts Picea abies and Pinus sylvestris. From a set of 33,018 single nucleotide polymorphisms (SNP) in 23 haploid isolates, twelve SNP markers distributed on seven contigs were associated with virulence (P<0.0001). Four of the contigs harbour known virulence genes from other fungal pathogens and the remaining three harbour novel candidate genes. Two contigs link closely to virulence regions recognized previously by QTL mapping in the congeneric hybrid H. irregulare × H. occidentale. Our study demonstrates the efficiency of GWA studies for dissecting important complex traits of small populations of non-model haploid organisms with small genomes. PMID:23341945

  19. The Arabidopsis lyrata genome sequence and the basis of rapid genome size change

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hu, Tina T.; Pattyn, Pedro; Bakker, Erica G.

    2011-04-29

    In our manuscript, we present a high-quality genome sequence of the Arabidopsis thaliana relative, Arabidopsis lyrata, produced by dideoxy sequencing. We have performed the usual types of genome analysis (gene annotation, dN/dS studies etc. etc.), but this is relegated to the Supporting Information. Instead, we focus on what was a major motivation for sequencing this genome, namely to understand how A. thaliana lost half its genome in a few million years and lived to tell the tale. The rather surprising conclusion is that there is not a single genomic feature that accounts for the reduced genome, but that every aspectmore » centromeres, intergenic regions, transposable elements, gene family number is affected through hundreds of thousands of cuts. This strongly suggests that overall genome size in itself is what has been under selection, a suggestion that is strongly supported by our demonstration (using population genetics data from A. thaliana) that new deletions seem to be driven to fixation.« less

  20. Genome size evolution in relation to leaf strategy and metabolic rates revisited.

    PubMed

    Beaulieu, Jeremy M; Leitch, Ilia J; Knight, Charles A

    2007-03-01

    It has been proposed that having too much DNA may carry physiological consequences for plants. The strong correlation between DNA content, cell size and cell division rate could lead to predictable morphological variation in plants, including a negative relationship with leaf mass per unit area (LMA). In addition, the possible increased demand for resources in species with high DNA content may have downstream effects on maximal metabolic efficiency, including decreased metabolic rates. Tests were made for genome size-dependent variation in LMA and metabolic rates (mass-based photosynthetic rate and dark respiration rate) using our own measurements and data from a plant functional trait database (Glopnet). These associations were tested using two metrics of genome size: bulk DNA amount (2C DNA) and monoploid genome size (1Cx DNA). The data were analysed using an evolutionary framework that included a regression analysis and independent contrasts using a phylogenetic tree with estimates of molecular diversification times. A contribution index for the LMA data set was also calculated to determine which divergences have the greatest influence on the relationship between genome size and LMA. A significant negative association was found between bulk DNA amount and LMA in angiosperms. This was primarily a result of influential divergences that may represent early shifts in growth form. However, divergences in bulk DNA amount were positively associated with divergences in LMA, suggesting that the relationship may be indirect and mediated through other traits directly related to genome size. There was a significant negative association between genome size and metabolic rates that was driven by a basal divergence between angiosperms and gymnosperms; no significant independent contrast results were found. Therefore, it is concluded that genome size-dependent constraints acting on metabolic efficiency may not exist within seed plants.

  1. Genome size differentiates co-occurring populations of the planktonic diatom Ditylum brightwellii (Bacillariophyta)

    PubMed Central

    2010-01-01

    Background Diatoms are one of the most species-rich groups of eukaryotic microbes known. Diatoms are also the only group of eukaryotic micro-algae with a diplontic life history, suggesting that the ancestral diatom switched to a life history dominated by a duplicated genome. A key mechanism of speciation among diatoms could be a propensity for additional stable genome duplications. Across eukaryotic taxa, genome size is directly correlated to cell size and inversely correlated to physiological rates. Differences in relative genome size, cell size, and acclimated growth rates were analyzed in isolates of the diatom Ditylum brightwellii. Ditylum brightwellii consists of two main populations with identical 18s rDNA sequences; one population is distributed globally at temperate latitudes and the second appears to be localized to the Pacific Northwest coast of the USA. These two populations co-occur within the Puget Sound estuary of WA, USA, although their peak abundances differ depending on local conditions. Results All isolates from the more regionally-localized population (population 2) possessed 1.94 ± 0.74 times the amount of DNA, grew more slowly, and were generally larger than isolates from the more globally distributed population (population 1). The ITS1 sequences, cell sizes, and genome sizes of isolates from New Zealand were the same as population 1 isolates from Puget Sound, but their growth rates were within the range of the slower-growing population 2 isolates. Importantly, the observed genome size difference between isolates from the two populations was stable regardless of time in culture or the changes in cell size that accompany the diatom life history. Conclusions The observed two-fold difference in genome size between the D. brightwellii populations suggests that whole genome duplication occurred within cells of population 1 ultimately giving rise to population 2 cells. The apparent regional localization of population 2 is consistent with a recent

  2. Selective significance of genome size in a plant community with heavy metal pollution.

    PubMed

    Vidic, T; Greilhuber, J; Vilhar, B; Dermastia, M

    2009-09-01

    In eukaryotes, nuclear genome sizes vary by more than five orders of magnitude. This variation is not related to organismal complexity, and its origin and biological significance are still disputed. One of the open questions is whether genome size has an adaptive role. We tested the hypothesis that genome size has selective significance, using five grassland communities occurring on a gradient of metal pollution of the soil as a model. We detected a negative correlation between the concentration of contaminating metals in the soil and the number of vascular plant species. Analysis of genome sizes of 70 herbaceous dicot perennial species occurring on the investigated plots revealed a negative correlation between the concentration of contaminating metals in the soil and the proportion of species with large genomes in plant communities. Consistent with the hypothesis, these results show that species with large genomes are at selective disadvantage in extreme environmental conditions.

  3. Genome sequence of the highly weak-acid-tolerant Zygosaccharomyces bailii IST302, amenable to genetic manipulations and physiological studies.

    PubMed

    Palma, Margarida; Münsterkötter, Martin; Peça, João; Güldener, Ulrich; Sá-Correia, Isabel

    2017-06-01

    Zygosaccharomyces bailii is one of the most problematic spoilage yeast species found in the food and beverage industry particularly in acidic products, due to its exceptional resistance to weak acid stress. This article describes the annotation of the genome sequence of Z. bailii IST302, a strain recently proven to be amenable to genetic manipulations and physiological studies. The work was based on the annotated genomes of strain ISA1307, an interspecies hybrid between Z. bailii and a closely related species, and the Z. bailii reference strain CLIB 213T. The resulting genome sequence of Z. bailii IST302 is distributed through 105 scaffolds, comprising a total of 5142 genes and a size of 10.8 Mb. Contrasting with CLIB 213T, strain IST302 does not form cell aggregates, allowing its manipulation in the laboratory for genetic and physiological studies. Comparative cell cycle analysis with the haploid and diploid Saccharomyces cerevisiae strains BY4741 and BY4743, respectively, suggests that Z. bailii IST302 is haploid. This is an additional trait that makes this strain attractive for the functional analysis of non-essential genes envisaging the elucidation of mechanisms underlying its high tolerance to weak acid food preservatives, or the investigation and exploitation of the potential of this resilient yeast species as cell factory. © FEMS 2017.

  4. The role of epistatic interactions underpinning resistance to parasitic Varroa mites in haploid honey bee (Apis mellifera) drones.

    PubMed

    Conlon, Benjamin H; Frey, Eva; Rosenkranz, Peter; Locke, Barbara; Moritz, Robin F A; Routtu, Jarkko

    2018-06-01

    The Red Queen hypothesis predicts that host-parasite coevolutionary dynamics can select for host resistance through increased genetic diversity, recombination and evolutionary rates. However, in haplodiploid organisms such as the honeybee (Apis mellifera), models suggest the selective pressure is weaker than in diploids. Haplodiploid sex determination, found in A. mellifera, can allow deleterious recessive alleles to persist in the population through the diploid sex with negative effects predominantly expressed in the haploid sex. To overcome these negative effects in haploid genomes, epistatic interactions have been hypothesized to play an important role. Here, we use the interaction between A. mellifera and the parasitic mite Varroa destructor to test epistasis in the expression of resistance, through the inhibition of parasite reproduction, in haploid drones. We find novel loci on three chromosomes which explain over 45% of the resistance phenotype. Two of these loci interact only additively, suggesting their expression is independent of each other, but both loci interact epistatically with the third locus. With drone offspring inheriting only one copy of the queen's chromosomes, the drones will only possess one of two queen alleles throughout the years-long lifetime of the honeybee colony. Varroa, in comparison, completes its highly inbred reproductive cycle in a matter of weeks, allowing it to rapidly evolve resistance. Faced with the rapidly evolving Varroa, a diversity of pathways and epistatic interactions for the inhibition of Varroa reproduction could therefore provide a selective advantage to the high levels of recombination seen in A. mellifera. This allows for the remixing of phenotypes despite a fixed queen genotype. © 2018 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2018 European Society For Evolutionary Biology.

  5. Schrödinger's Cheshire Cat: Are Haploid Emiliania huxleyi Cells Resistant to Viral Infection or Not?

    PubMed

    Mordecai, Gideon J; Verret, Frederic; Highfield, Andrea; Schroeder, Declan C

    2017-03-18

    Emiliania huxleyi is the main calcite producer on Earth and is routinely infected by a virus (EhV); a double stranded DNA (dsDNA) virus belonging to the family Phycodnaviridae . E. huxleyi exhibits a haplodiploid life cycle; the calcified diploid stage is non-motile and forms extensive blooms. The haploid phase is a non-calcified biflagellated cell bearing organic scales. Haploid cells are thought to resist infection, through a process deemed the "Cheshire Cat" escape strategy; however, a recent study detected the presence of viral lipids in the same haploid strain. Here we report on the application of an E. huxleyi CCMP1516 EhV-86 combined tiling array (TA) that further confirms an EhV infection in the RCC1217 haploid strain, which grew without any signs of cell lysis. Reverse transcription polymerase chain reaction (RT-PCR) and PCR verified the presence of viral RNA in the haploid cells, yet indicated an absence of viral DNA, respectively. These infected cells are an alternative stage of the virus life cycle deemed the haplococcolithovirocell. In this instance, the host is both resistant to and infected by EhV, i.e., the viral transcriptome is present in haploid cells whilst there is no evidence of viral lysis. This superimposed state is reminiscent of Schrödinger's cat; of being simultaneously both dead and alive.

  6. Proteomic strategy for the identification of critical actors in reorganization of the post-meiotic male genome.

    PubMed

    Govin, Jerome; Gaucher, Jonathan; Ferro, Myriam; Debernardi, Alexandra; Garin, Jerome; Khochbin, Saadi; Rousseaux, Sophie

    2012-01-01

    After meiosis, during the final stages of spermatogenesis, the haploid male genome undergoes major structural changes, resulting in a shift from a nucleosome-based genome organization to the sperm-specific, highly compacted nucleoprotamine structure. Recent data support the idea that region-specific programming of the haploid male genome is of high importance for the post-fertilization events and for successful embryo development. Although these events constitute a unique and essential step in reproduction, the mechanisms by which they occur have remained completely obscure and the factors involved have mostly remained uncharacterized. Here, we sought a strategy to significantly increase our understanding of proteins controlling the haploid male genome reprogramming, based on the identification of proteins in two specific pools: those with the potential to bind nucleic acids (basic proteins) and proteins capable of binding basic proteins (acidic proteins). For the identification of acidic proteins, we developed an approach involving a transition-protein (TP)-based chromatography, which has the advantage of retaining not only acidic proteins due to the charge interactions, but also potential TP-interacting factors. A second strategy, based on an in-depth bioinformatic analysis of the identified proteins, was then applied to pinpoint within the lists obtained, male germ cells expressed factors relevant to the post-meiotic genome organization. This approach reveals a functional network of DNA-packaging proteins and their putative chaperones and sheds a new light on the way the critical transitions in genome organizations could take place. This work also points to a new area of research in male infertility and sperm quality assessments.

  7. Standing at the Gateway to Europe - The Genetic Structure of Western Balkan Populations Based on Autosomal and Haploid Markers

    PubMed Central

    Kovacevic, Lejla; Tambets, Kristiina; Ilumäe, Anne-Mai; Kushniarevich, Alena; Yunusbayev, Bayazit; Solnik, Anu; Bego, Tamer; Primorac, Dragan; Skaro, Vedrana; Leskovac, Andreja; Jakovski, Zlatko; Drobnic, Katja; Tolk, Helle-Viivi; Kovacevic, Sandra; Rudan, Pavao; Metspalu, Ene; Marjanovic, Damir

    2014-01-01

    Contemporary inhabitants of the Balkan Peninsula belong to several ethnic groups of diverse cultural background. In this study, three ethnic groups from Bosnia and Herzegovina - Bosniacs, Bosnian Croats and Bosnian Serbs - as well as the populations of Serbians, Croatians, Macedonians from the former Yugoslav Republic of Macedonia, Montenegrins and Kosovars have been characterized for the genetic variation of 660 000 genome-wide autosomal single nucleotide polymorphisms and for haploid markers. New autosomal data of the 70 individuals together with previously published data of 20 individuals from the populations of the Western Balkan region in a context of 695 samples of global range have been analysed. Comparison of the variation data of autosomal and haploid lineages of the studied Western Balkan populations reveals a concordance of the data in both sets and the genetic uniformity of the studied populations, especially of Western South-Slavic speakers. The genetic variation of Western Balkan populations reveals the continuity between the Middle East and Europe via the Balkan region and supports the scenario that one of the major routes of ancient gene flows and admixture went through the Balkan Peninsula. PMID:25148043

  8. Standing at the gateway to Europe--the genetic structure of Western balkan populations based on autosomal and haploid markers.

    PubMed

    Kovacevic, Lejla; Tambets, Kristiina; Ilumäe, Anne-Mai; Kushniarevich, Alena; Yunusbayev, Bayazit; Solnik, Anu; Bego, Tamer; Primorac, Dragan; Skaro, Vedrana; Leskovac, Andreja; Jakovski, Zlatko; Drobnic, Katja; Tolk, Helle-Viivi; Kovacevic, Sandra; Rudan, Pavao; Metspalu, Ene; Marjanovic, Damir

    2014-01-01

    Contemporary inhabitants of the Balkan Peninsula belong to several ethnic groups of diverse cultural background. In this study, three ethnic groups from Bosnia and Herzegovina - Bosniacs, Bosnian Croats and Bosnian Serbs - as well as the populations of Serbians, Croatians, Macedonians from the former Yugoslav Republic of Macedonia, Montenegrins and Kosovars have been characterized for the genetic variation of 660 000 genome-wide autosomal single nucleotide polymorphisms and for haploid markers. New autosomal data of the 70 individuals together with previously published data of 20 individuals from the populations of the Western Balkan region in a context of 695 samples of global range have been analysed. Comparison of the variation data of autosomal and haploid lineages of the studied Western Balkan populations reveals a concordance of the data in both sets and the genetic uniformity of the studied populations, especially of Western South-Slavic speakers. The genetic variation of Western Balkan populations reveals the continuity between the Middle East and Europe via the Balkan region and supports the scenario that one of the major routes of ancient gene flows and admixture went through the Balkan Peninsula.

  9. Oilseed rape seeds with ablated defence cells of the glucosinolate–myrosinase system. Production and characteristics of double haploid MINELESS plants of Brassica napus L.

    PubMed Central

    Ahuja, Ishita; Borgen, Birgit Hafeld; Hansen, Magnor; Honne, Bjørn Ivar; Müller, Caroline; Rohloff, Jens; Rossiter, John Trevor; Bones, Atle Magnar

    2011-01-01

    Oilseed rape and other crop plants of the family Brassicaceae contain a unique defence system known as the glucosinolate–myrosinase system or the ‘mustard oil bomb’. The ‘mustard oil bomb’ which includes myrosinase and glucosinolates is triggered by abiotic and biotic stress, resulting in the formation of toxic products such as nitriles and isothiocyanates. Myrosinase is present in specialist cells known as ‘myrosin cells’ and can also be known as toxic mines. The myrosin cell idioblasts of Brassica napus were genetically reprogrammed to undergo controlled cell death (ablation) during seed development. These myrosin cell-free plants have been named MINELESS as they lack toxic mines. This has led to the production of oilseed rape with a significant reduction both in myrosinase levels and in the hydrolysis of glucosinolates. Even though the myrosinase activity in MINELESS was very low compared with the wild type, variation was observed. This variability was overcome by producing homozygous seeds. A microspore culture technique involving non-fertile haploid MINELESS plants was developed and these plants were treated with colchicine to produce double haploid MINELESS plants with full fertility. Double haploid MINELESS plants had significantly reduced myrosinase levels and glucosinolate hydrolysis products. Wild-type and MINELESS plants exhibited significant differences in growth parameters such as plant height, leaf traits, matter accumulation, and yield parameters. The growth and developmental pattern of MINELESS plants was relatively slow compared with the wild type. The characteristics of the pure double haploid MINELESS plant are described and its importance for future biochemical, agricultural, dietary, functional genomics, and plant defence studies is discussed. PMID:21778185

  10. Random Distribution Pattern and Non-adaptivity of Genome Size in a Highly Variable Population of Festuca pallens

    PubMed Central

    Šmarda, Petr; Bureš, Petr; Horová, Lucie

    2007-01-01

    Background and Aims The spatial and statistical distribution of genome sizes and the adaptivity of genome size to some types of habitat, vegetation or microclimatic conditions were investigated in a tetraploid population of Festuca pallens. The population was previously documented to vary highly in genome size and is assumed as a model for the study of the initial stages of genome size differentiation. Methods Using DAPI flow cytometry, samples were measured repeatedly with diploid Festuca pallens as the internal standard. Altogether 172 plants from 57 plots (2·25 m2), distributed in contrasting habitats over the whole locality in South Moravia, Czech Republic, were sampled. The differences in DNA content were confirmed by the double peaks of simultaneously measured samples. Key Results At maximum, a 1·115-fold difference in genome size was observed. The statistical distribution of genome sizes was found to be continuous and best fits the extreme (Gumbel) distribution with rare occurrences of extremely large genomes (positive-skewed), as it is similar for the log-normal distribution of the whole Angiosperms. Even plants from the same plot frequently varied considerably in genome size and the spatial distribution of genome sizes was generally random and unautocorrelated (P > 0·05). The observed spatial pattern and the overall lack of correlations of genome size with recognized vegetation types or microclimatic conditions indicate the absence of ecological adaptivity of genome size in the studied population. Conclusions These experimental data on intraspecific genome size variability in Festuca pallens argue for the absence of natural selection and the selective non-significance of genome size in the initial stages of genome size differentiation, and corroborate the current hypothetical model of genome size evolution in Angiosperms (Bennetzen et al., 2005, Annals of Botany 95: 127–132). PMID:17565968

  11. Transcriptome analysis of functional differentiation between haploid and diploid cells of Emiliania huxleyi, a globally significant photosynthetic calcifying cell.

    PubMed

    von Dassow, Peter; Ogata, Hiroyuki; Probert, Ian; Wincker, Patrick; Da Silva, Corinne; Audic, Stéphane; Claverie, Jean-Michel; de Vargas, Colomban

    2009-01-01

    Eukaryotes are classified as either haplontic, diplontic, or haplo-diplontic, depending on which ploidy levels undergo mitotic cell division in the life cycle. Emiliania huxleyi is one of the most abundant phytoplankton species in the ocean, playing an important role in global carbon fluxes, and represents haptophytes, an enigmatic group of unicellular organisms that diverged early in eukaryotic evolution. This species is haplo-diplontic. Little is known about the haploid cells, but they have been hypothesized to allow persistence of the species between the yearly blooms of diploid cells. We sequenced over 38,000 expressed sequence tags from haploid and diploid E. huxleyi normalized cDNA libraries to identify genes involved in important processes specific to each life phase (2N calcification or 1N motility), and to better understand the haploid phase of this prominent haplo-diplontic organism. The haploid and diploid transcriptomes showed a dramatic differentiation, with approximately 20% greater transcriptome richness in diploid cells than in haploid cells and only haploids included signal transduction and motility genes. Diploid-specific transcripts included Ca2+, H+, and HCO3- pumps. Potential factors differentiating the transcriptomes included haploid-specific Myb transcription factor homologs and an unusual diploid-specific histone H4 homolog. This study permitted the identification of genes likely involved in diploid-specific biomineralization, haploid-specific motility, and transcriptional control. Greater transcriptome richness in diploid cells suggests they may be more versatile for exploiting a diversity of rich environments whereas haploid cells are intrinsically more streamlined.

  12. Transcriptome analysis of functional differentiation between haploid and diploid cells of Emiliania huxleyi, a globally significant photosynthetic calcifying cell

    PubMed Central

    2009-01-01

    Background Eukaryotes are classified as either haplontic, diplontic, or haplo-diplontic, depending on which ploidy levels undergo mitotic cell division in the life cycle. Emiliania huxleyi is one of the most abundant phytoplankton species in the ocean, playing an important role in global carbon fluxes, and represents haptophytes, an enigmatic group of unicellular organisms that diverged early in eukaryotic evolution. This species is haplo-diplontic. Little is known about the haploid cells, but they have been hypothesized to allow persistence of the species between the yearly blooms of diploid cells. We sequenced over 38,000 expressed sequence tags from haploid and diploid E. huxleyi normalized cDNA libraries to identify genes involved in important processes specific to each life phase (2N calcification or 1N motility), and to better understand the haploid phase of this prominent haplo-diplontic organism. Results The haploid and diploid transcriptomes showed a dramatic differentiation, with approximately 20% greater transcriptome richness in diploid cells than in haploid cells and only ≤ 50% of transcripts estimated to be common between the two phases. The major functional category of transcripts differentiating haploids included signal transduction and motility genes. Diploid-specific transcripts included Ca2+, H+, and HCO3- pumps. Potential factors differentiating the transcriptomes included haploid-specific Myb transcription factor homologs and an unusual diploid-specific histone H4 homolog. Conclusions This study permitted the identification of genes likely involved in diploid-specific biomineralization, haploid-specific motility, and transcriptional control. Greater transcriptome richness in diploid cells suggests they may be more versatile for exploiting a diversity of rich environments whereas haploid cells are intrinsically more streamlined. PMID:19832986

  13. Larger Daphnia at lower temperature: a role for cell size and genome configuration?

    PubMed

    Jalal, Marwa; Wojewodzic, Marcin W; Laane, Carl Morten M; Hessen, Dag O

    2013-09-01

    Experiments with Daphnia magna and Daphnia pulex raised at 10 and 20 °C yielded larger adult size at the lower temperature. This must reflect increased cell size, increased cell numbers, or a combination of both. As it is difficult to achieve good estimates on cell size in crustaceans, we, therefore, measured nucleus and genome size using flow cytometry at 10 and 20 °C. DNA was stained with propidium iodide, ethidium bromide, and DAPI. Both nucleus and genome size estimates were elevated at 10 °C compared with 20 °C, suggesting that larger body size at low temperature could partly be accredited to an enlarged nucleus and thus cell size. Confocal microscopy observations confirmed the staining properties of fluorochromes. As differences in nucleotide numbers in response of growth temperature within a life span is unlikely, these results seem accredited to changed DNA-fluorochrome binding properties, presumably reflecting increased DNA condensation at low temperature. This implies that genome size comparisons may be impacted by ambient temperature in ectotherms. It also suggests that temperature-induced structural changes in the genome could affect cell size and for some species even body size.

  14. Genome sizes of cranes (Aves: Gruiformes).

    PubMed

    Rasch, Ellen M

    2006-12-01

    The DNA content of blood cell nuclei of 15 species of cranes was determined by Feulgen-DNA cytophotometry. Genome sizes agree with values reported elsewhere for several crane species analyzed by flow cytometry. Males have more DNA per cell than females in several species. A karyotype where 2n = 80 is reported for a male greater sandhill crane. Copyright 2006 Wiley-Liss, Inc.

  15. Reproductive Mode and the Evolution of Genome Size and Structure in Caenorhabditis Nematodes

    PubMed Central

    Fierst, Janna L.; Willis, John H.; Thomas, Cristel G.; Wang, Wei; Reynolds, Rose M.; Ahearne, Timothy E.; Cutter, Asher D.; Phillips, Patrick C.

    2015-01-01

    The self-fertile nematode worms Caenorhabditis elegans, C. briggsae, and C. tropicalis evolved independently from outcrossing male-female ancestors and have genomes 20-40% smaller than closely related outcrossing relatives. This pattern of smaller genomes for selfing species and larger genomes for closely related outcrossing species is also seen in plants. We use comparative genomics, including the first high quality genome assembly for an outcrossing member of the genus (C. remanei) to test several hypotheses for the evolution of genome reduction under a change in mating system. Unlike plants, it does not appear that reductions in the number of repetitive elements, such as transposable elements, are an important contributor to the change in genome size. Instead, all functional genomic categories are lost in approximately equal proportions. Theory predicts that self-fertilization should equalize the effective population size, as well as the resulting effects of genetic drift, between the X chromosome and autosomes. Contrary to this, we find that the self-fertile C. briggsae and C. elegans have larger intergenic spaces and larger protein-coding genes on the X chromosome when compared to autosomes, while C. remanei actually has smaller introns on the X chromosome than either self-reproducing species. Rather than being driven by mutational biases and/or genetic drift caused by a reduction in effective population size under self reproduction, changes in genome size in this group of nematodes appear to be caused by genome-wide patterns of gene loss, most likely generated by genomic adaptation to self reproduction per se. PMID:26114425

  16. Fully-Automated High-Throughput NMR System for Screening of Haploid Kernels of Maize (Corn) by Measurement of Oil Content

    PubMed Central

    Xu, Xiaoping; Huang, Qingming; Chen, Shanshan; Yang, Peiqiang; Chen, Shaojiang; Song, Yiqiao

    2016-01-01

    One of the modern crop breeding techniques uses doubled haploid plants that contain an identical pair of chromosomes in order to accelerate the breeding process. Rapid haploid identification method is critical for large-scale selections of double haploids. The conventional methods based on the color of the endosperm and embryo seeds are slow, manual and prone to error. On the other hand, there exists a significant difference between diploid and haploid seeds generated by high oil inducer, which makes it possible to use oil content to identify the haploid. This paper describes a fully-automated high-throughput NMR screening system for maize haploid kernel identification. The system is comprised of a sampler unit to select a single kernel to feed for measurement of NMR and weight, and a kernel sorter to distribute the kernel according to the measurement result. Tests of the system show a consistent accuracy of 94% with an average screening time of 4 seconds per kernel. Field test result is described and the directions for future improvement are discussed. PMID:27454427

  17. Schrödinger’s Cheshire Cat: Are Haploid Emiliania huxleyi Cells Resistant to Viral Infection or Not?

    PubMed Central

    Mordecai, Gideon J.; Verret, Frederic; Highfield, Andrea; Schroeder, Declan C.

    2017-01-01

    Emiliania huxleyi is the main calcite producer on Earth and is routinely infected by a virus (EhV); a double stranded DNA (dsDNA) virus belonging to the family Phycodnaviridae. E. huxleyi exhibits a haplodiploid life cycle; the calcified diploid stage is non-motile and forms extensive blooms. The haploid phase is a non-calcified biflagellated cell bearing organic scales. Haploid cells are thought to resist infection, through a process deemed the “Cheshire Cat” escape strategy; however, a recent study detected the presence of viral lipids in the same haploid strain. Here we report on the application of an E. huxleyi CCMP1516 EhV-86 combined tiling array (TA) that further confirms an EhV infection in the RCC1217 haploid strain, which grew without any signs of cell lysis. Reverse transcription polymerase chain reaction (RT-PCR) and PCR verified the presence of viral RNA in the haploid cells, yet indicated an absence of viral DNA, respectively. These infected cells are an alternative stage of the virus life cycle deemed the haplococcolithovirocell. In this instance, the host is both resistant to and infected by EhV, i.e., the viral transcriptome is present in haploid cells whilst there is no evidence of viral lysis. This superimposed state is reminiscent of Schrödinger’s cat; of being simultaneously both dead and alive. PMID:28335465

  18. The genome of the fire ant Solenopsis invicta

    USDA-ARS?s Scientific Manuscript database

    Ants have evolved very complex societies and are key ecosystem members. Some of them are also major pests, as exemplified by the fire ant Solenopsis invicta. We present here the draft genome of S. invicta, assembled from 454 and Illumina reads obtained from a focal haploid male and his brothers. In ...

  19. Analysis of the giant genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size.

    PubMed

    Kelly, Laura J; Renny-Byfield, Simon; Pellicer, Jaume; Macas, Jiří; Novák, Petr; Neumann, Pavel; Lysak, Martin A; Day, Peter D; Berger, Madeleine; Fay, Michael F; Nichols, Richard A; Leitch, Andrew R; Leitch, Ilia J

    2015-10-01

    Plants exhibit an extraordinary range of genome sizes, varying by > 2000-fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size. Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria. We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low-abundance repeat-derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal. Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high-abundance repeat families. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  20. Sizing up arthropod genomes: an evaluation of the impact of environmental variation on genome size estimates by flow cytometry and the use of qPCR as a method of estimation.

    PubMed

    Gregory, T Ryan; Nathwani, Paula; Bonnett, Tiffany R; Huber, Dezene P W

    2013-09-01

    A study was undertaken to evaluate both a pre-existing method and a newly proposed approach for the estimation of nuclear genome sizes in arthropods. First, concerns regarding the reliability of the well-established method of flow cytometry relating to impacts of rearing conditions on genome size estimates were examined. Contrary to previous reports, a more carefully controlled test found negligible environmental effects on genome size estimates in the fly Drosophila melanogaster. Second, a more recently touted method based on quantitative real-time PCR (qPCR) was examined in terms of ease of use, efficiency, and (most importantly) accuracy using four test species: the flies Drosophila melanogaster and Musca domestica and the beetles Tribolium castaneum and Dendroctonus ponderosa. The results of this analysis demonstrated that qPCR has the tendency to produce substantially different genome size estimates from other established techniques while also being far less efficient than existing methods.

  1. Transposable element distribution, abundance and role in genome size variation in the genus Oryza.

    PubMed

    Zuccolo, Andrea; Sebastian, Aswathy; Talag, Jayson; Yu, Yeisoo; Kim, HyeRan; Collura, Kristi; Kudrna, Dave; Wing, Rod A

    2007-08-29

    The genus Oryza is composed of 10 distinct genome types, 6 diploid and 4 polyploid, and includes the world's most important food crop - rice (Oryza sativa [AA]). Genome size variation in the Oryza is more than 3-fold and ranges from 357 Mbp in Oryza glaberrima [AA] to 1283 Mbp in the polyploid Oryza ridleyi [HHJJ]. Because repetitive elements are known to play a significant role in genome size variation, we constructed random sheared small insert genomic libraries from 12 representative Oryza species and conducted a comprehensive study of the repetitive element composition, distribution and phylogeny in this genus. Particular attention was paid to the role played by the most important classes of transposable elements (Long Terminal Repeats Retrotransposons, Long interspersed Nuclear Elements, helitrons, DNA transposable elements) in shaping these genomes and in their contributing to genome size variation. We identified the elements primarily responsible for the most strikingly genome size variation in Oryza. We demonstrated how Long Terminal Repeat retrotransposons belonging to the same families have proliferated to very different extents in various species. We also showed that the pool of Long Terminal Repeat Retrotransposons is substantially conserved and ubiquitous throughout the Oryza and so its origin is ancient and its existence predates the speciation events that originated the genus. Finally we described the peculiar behavior of repeats in the species Oryza coarctata [HHKK] whose placement in the Oryza genus is controversial. Long Terminal Repeat retrotransposons are the major component of the Oryza genomes analyzed and, along with polyploidization, are the most important contributors to the genome size variation across the Oryza genus. Two families of Ty3-gypsy elements (RIRE2 and Atlantys) account for a significant portion of the genome size variations present in the Oryza genus.

  2. Intra-specific variation in genome size in maize: cytological and phenotypic correlates

    PubMed Central

    Realini, María Florencia; Poggio, Lidia; Cámara-Hernández, Julián; González, Graciela Esther

    2016-01-01

    Genome size variation accompanies the diversification and evolution of many plant species. Relationships between DNA amount and phenotypic and cytological characteristics form the basis of most hypotheses that ascribe a biological role to genome size. The goal of the present research was to investigate the intra-specific variation in the DNA content in maize populations from Northeastern Argentina and further explore the relationship between genome size and the phenotypic traits seed weight and length of the vegetative cycle. Moreover, cytological parameters such as the percentage of heterochromatin as well as the number, position and sequence composition of knobs were analysed and their relationships with 2C DNA values were explored. The populations analysed presented significant differences in 2C DNA amount, from 4.62 to 6.29 pg, representing 36.15 % of the inter-populational variation. Moreover, intra-populational genome size variation was found, varying from 1.08 to 1.63-fold. The variation in the percentage of knob heterochromatin as well as in the number, chromosome position and sequence composition of the knobs was detected among and within the populations. Although a positive relationship between genome size and the percentage of heterochromatin was observed, a significant correlation was not found. This confirms that other non-coding repetitive DNA sequences are contributing to the genome size variation. A positive relationship between DNA amount and the seed weight has been reported in a large number of species, this relationship was not found in the populations studied here. The length of the vegetative cycle showed a positive correlation with the percentage of heterochromatin. This result allowed attributing an adaptive effect to heterochromatin since the length of this cycle would be optimized via selection for an appropriate percentage of heterochromatin. PMID:26644343

  3. Rapid and accurate identification of in vivo-induced haploid seeds based on oil content in maize

    PubMed Central

    Melchinger, Albrecht E.; Schipprack, Wolfgang; Würschum, Tobias; Chen, Shaojiang; Technow, Frank

    2013-01-01

    The needs of a growing human population require rapid and efficient development of improved cultivars by plant breeders. The doubled haploid (DH) technology enables generating completely homozygous lines in a single step and, thus, is central to modern genetics and breeding approaches. Rapid and reliable identification of seeds with a haploid embryo after in vivo haploid induction is elementary in the method utilized in maize but current systems have severe shortcomings preventing their use in many germplasm types. Here, we describe an alternative method for discrimination of haploid from diploid seeds based on differences in their oil content stemming from pollination with high oil inducers. After presenting some fundamental theory, we provide a proof-of-concept with experimental results, demonstrating acceptable error rates across different germplasm. Our approach represents a breakthrough in DH technology in maize, because it is amenable to automated high-throughput screening and applicable to any maize germplasm worldwide. PMID:23820577

  4. Diploid, but not haploid, human embryonic stem cells can be derived from microsurgically repaired tripronuclear human zygotes

    PubMed Central

    Fan, Yong; Li, Rong; Huang, Jin; Yu, Yang; Qiao, Jie

    2013-01-01

    Human embryonic stem cells have shown tremendous potential in regenerative medicine, and the recent progress in haploid embryonic stem cells provides new insights for future applications of embryonic stem cells. Disruption of normal fertilized embryos remains controversial; thus, the development of a new source for human embryonic stem cells is important for their usefulness. Here, we investigated the feasibility of haploid and diploid embryo reconstruction and embryonic stem cell derivation using microsurgically repaired tripronuclear human zygotes. Diploid and haploid zygotes were successfully reconstructed, but a large proportion of them still had a tripolar spindle assembly. The reconstructed embryos developed to the blastocyst stage, although the loss of chromosomes was observed in these zygotes. Finally, triploid and diploid human embryonic stem cells were derived from tripronuclear and reconstructed zygotes (from which only one pronucleus was removed), but haploid human embryonic stem cells were not successfully derived from the reconstructed zygotes when two pronuclei were removed. Both triploid and diploid human embryonic stem cells showed the general characteristics of human embryonic stem cells. These results indicate that the lower embryo quality resulting from abnormal spindle assembly contributed to the failure of the haploid embryonic stem cell derivation. However, the successful derivation of diploid embryonic stem cells demonstrated that microsurgical tripronuclear zygotes are an alternative source of human embryonic stem cells. In the future, improving spindle assembly will facilitate the application of triploid zygotes to the field of haploid embryonic stem cells. PMID:23255130

  5. A universe of dwarfs and giants: genome size and chromosome evolution in the monocot family Melanthiaceae.

    PubMed

    Pellicer, Jaume; Kelly, Laura J; Leitch, Ilia J; Zomlefer, Wendy B; Fay, Michael F

    2014-03-01

    • Since the occurrence of giant genomes in angiosperms is restricted to just a few lineages, identifying where shifts towards genome obesity have occurred is essential for understanding the evolutionary mechanisms triggering this process. • Genome sizes were assessed using flow cytometry in 79 species and new chromosome numbers were obtained. Phylogenetically based statistical methods were applied to infer ancestral character reconstructions of chromosome numbers and nuclear DNA contents. • Melanthiaceae are the most diverse family in terms of genome size, with C-values ranging more than 230-fold. Our data confirmed that giant genomes are restricted to tribe Parideae, with most extant species in the family characterized by small genomes. Ancestral genome size reconstruction revealed that the most recent common ancestor (MRCA) for the family had a relatively small genome (1C = 5.37 pg). Chromosome losses and polyploidy are recovered as the main evolutionary mechanisms generating chromosome number change. • Genome evolution in Melanthiaceae has been characterized by a trend towards genome size reduction, with just one episode of dramatic DNA accumulation in Parideae. Such extreme contrasting profiles of genome size evolution illustrate the key role of transposable elements and chromosome rearrangements in driving the evolution of plant genomes. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.

  6. A new rainbow trout (Oncorhynchus mykiss) reference genome assembly

    USDA-ARS?s Scientific Manuscript database

    In an effort to improve the rainbow trout reference genome assembly, we have re-sequenced the doubled-haploid Swanson line using the longest available reads from the Illumina technology. Overall we generated over 510 million 260nt paired-end shotgun reads, and 1 billion 160nt mate-pair reads from f...

  7. A human haploid gene trap collection to study lncRNAs with unusual RNA biology.

    PubMed

    Kornienko, Aleksandra E; Vlatkovic, Irena; Neesen, Jürgen; Barlow, Denise P; Pauler, Florian M

    2016-01-01

    Many thousand long non-coding (lnc) RNAs are mapped in the human genome. Time consuming studies using reverse genetic approaches by post-transcriptional knock-down or genetic modification of the locus demonstrated diverse biological functions for a few of these transcripts. The Human Gene Trap Mutant Collection in haploid KBM7 cells is a ready-to-use tool for studying protein-coding gene function. As lncRNAs show remarkable differences in RNA biology compared to protein-coding genes, it is unclear if this gene trap collection is useful for functional analysis of lncRNAs. Here we use the uncharacterized LOC100288798 lncRNA as a model to answer this question. Using public RNA-seq data we show that LOC100288798 is ubiquitously expressed, but inefficiently spliced. The minor spliced LOC100288798 isoforms are exported to the cytoplasm, whereas the major unspliced isoform is nuclear localized. This shows that LOC100288798 RNA biology differs markedly from typical mRNAs. De novo assembly from RNA-seq data suggests that LOC100288798 extends 289kb beyond its annotated 3' end and overlaps the downstream SLC38A4 gene. Three cell lines with independent gene trap insertions in LOC100288798 were available from the KBM7 gene trap collection. RT-qPCR and RNA-seq confirmed successful lncRNA truncation and its extended length. Expression analysis from RNA-seq data shows significant deregulation of 41 protein-coding genes upon LOC100288798 truncation. Our data shows that gene trap collections in human haploid cell lines are useful tools to study lncRNAs, and identifies the previously uncharacterized LOC100288798 as a potential gene regulator.

  8. Chromosome Numbers and Genome Size Variation in Indian Species of Curcuma (Zingiberaceae)

    PubMed Central

    Leong-Škorničková, Jana; Šída, Otakar; Jarolímová, Vlasta; Sabu, Mamyil; Fér, Tomáš; Trávníček, Pavel; Suda, Jan

    2007-01-01

    Background and Aims Genome size and chromosome numbers are important cytological characters that significantly influence various organismal traits. However, geographical representation of these data is seriously unbalanced, with tropical and subtropical regions being largely neglected. In the present study, an investigation was made of chromosomal and genome size variation in the majority of Curcuma species from the Indian subcontinent, and an assessment was made of the value of these data for taxonomic purposes. Methods Genome size of 161 homogeneously cultivated plant samples classified into 51 taxonomic entities was determined by propidium iodide flow cytometry. Chromosome numbers were counted in actively growing root tips using conventional rapid squash techniques. Key Results Six different chromosome counts (2n = 22, 42, 63, >70, 77 and 105) were found, the last two representing new generic records. The 2C-values varied from 1·66 pg in C. vamana to 4·76 pg in C. oligantha, representing a 2·87-fold range. Three groups of taxa with significantly different homoploid genome sizes (Cx-values) and distinct geographical distribution were identified. Five species exhibited intraspecific variation in nuclear DNA content, reaching up to 15·1 % in cultivated C. longa. Chromosome counts and genome sizes of three Curcuma-like species (Hitchenia caulina, Kaempferia scaposa and Paracautleya bhatii) corresponded well with typical hexaploid (2n = 6x = 42) Curcuma spp. Conclusions The basic chromosome number in the majority of Indian taxa (belonging to subgenus Curcuma) is x = 7; published counts correspond to 6x, 9x, 11x, 12x and 15x ploidy levels. Only a few species-specific C-values were found, but karyological and/or flow cytometric data may support taxonomic decisions in some species alliances with morphological similarities. Close evolutionary relationships among some cytotypes are suggested based on the similarity in homoploid genome sizes and geographical grouping

  9. Multiple Pairwise Analysis of Non-homologous Centromere Coupling Reveals Preferential Chromosome Size-Dependent Interactions and a Role for Bouquet Formation in Establishing the Interaction Pattern

    PubMed Central

    Lefrançois, Philippe; Rockmill, Beth; Xie, Pingxing; Roeder, G. Shirleen; Snyder, Michael

    2016-01-01

    During meiosis, chromosomes undergo a homology search in order to locate their homolog to form stable pairs and exchange genetic material. Early in prophase, chromosomes associate in mostly non-homologous pairs, tethered only at their centromeres. This phenomenon, conserved through higher eukaryotes, is termed centromere coupling in budding yeast. Both initiation of recombination and the presence of homologs are dispensable for centromere coupling (occurring in spo11 mutants and haploids induced to undergo meiosis) but the presence of the synaptonemal complex (SC) protein Zip1 is required. The nature and mechanism of coupling have yet to be elucidated. Here we present the first pairwise analysis of centromere coupling in an effort to uncover underlying rules that may exist within these non-homologous interactions. We designed a novel chromosome conformation capture (3C)-based assay to detect all possible interactions between non-homologous yeast centromeres during early meiosis. Using this variant of 3C-qPCR, we found a size-dependent interaction pattern, in which chromosomes assort preferentially with chromosomes of similar sizes, in haploid and diploid spo11 cells, but not in a coupling-defective mutant (spo11 zip1 haploid and diploid yeast). This pattern is also observed in wild-type diploids early in meiosis but disappears as meiosis progresses and homologous chromosomes pair. We found no evidence to support the notion that ancestral centromere homology plays a role in pattern establishment in S. cerevisiae post-genome duplication. Moreover, we found a role for the meiotic bouquet in establishing the size dependence of centromere coupling, as abolishing bouquet (using the bouquet-defective spo11 ndj1 mutant) reduces it. Coupling in spo11 ndj1 rather follows telomere clustering preferences. We propose that a chromosome size preference for centromere coupling helps establish efficient homolog recognition. PMID:27768699

  10. Optimization of the genotyping-by-sequencing strategy for population genomic analysis in conifers.

    PubMed

    Pan, Jin; Wang, Baosheng; Pei, Zhi-Yong; Zhao, Wei; Gao, Jie; Mao, Jian-Feng; Wang, Xiao-Ru

    2015-07-01

    Flexibility and low cost make genotyping-by-sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI-MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference-free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000-11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking. © 2014 John Wiley & Sons Ltd.

  11. Genome scaffolding and annotation for the pathogen vector Ixodes ricinus by ultra-long single molecule sequencing.

    PubMed

    Cramaro, Wibke J; Hunewald, Oliver E; Bell-Sakyi, Lesley; Muller, Claude P

    2017-02-08

    Global warming and other ecological changes have facilitated the expansion of Ixodes ricinus tick populations. Ixodes ricinus is the most important carrier of vector-borne pathogens in Europe, transmitting viruses, protozoa and bacteria, in particular Borrelia burgdorferi (sensu lato), the causative agent of Lyme borreliosis, the most prevalent vector-borne disease in humans in the Northern hemisphere. To faster control this disease vector, a better understanding of the I. ricinus tick is necessary. To facilitate such studies, we recently published the first reference genome of this highly prevalent pathogen vector. Here, we further extend these studies by scaffolding and annotating the first reference genome by using ultra-long sequencing reads from third generation single molecule sequencing. In addition, we present the first genome size estimation for I. ricinus ticks and the embryo-derived cell line IRE/CTVM19. 235,953 contigs were integrated into 204,904 scaffolds, extending the currently known genome lengths by more than 30% from 393 to 516 Mb and the N50 contig value by 87% from 1643 bp to a N50 scaffold value of 3067 bp. In addition, 25,263 sequences were annotated by comparison to the tick's North American relative Ixodes scapularis. After (conserved) hypothetical proteins, zinc finger proteins, secreted proteins and P450 coding proteins were the most prevalent protein categories annotated. Interestingly, more than 50% of the amino acid sequences matching the homology threshold had 95-100% identity to the corresponding I. scapularis gene models. The sequence information was complemented by the first genome size estimation for this species. Flow cytometry-based genome size analysis revealed a haploid genome size of 2.65Gb for I. ricinus ticks and 3.80 Gb for the cell line. We present a first draft sequence map of the I. ricinus genome based on a PacBio-Illumina assembly. The I. ricinus genome was shown to be 26% (500 Mb) larger than the genome of its

  12. Evolution of the Australian lungfish (Neoceratodus forsteri) genome: a major role for CR1 and L2 LINE elements.

    PubMed

    Metcalfe, Cushla J; Filée, Jonathan; Germon, Isabelle; Joss, Jean; Casane, Didier

    2012-11-01

    Haploid genomes greater than 25,000 Mb are rare, within the animals only the lungfish and some of the salamanders and crustaceans are known to have genomes this large. There is very little data on the structure of genomes this size. It is known, however, that for animal genomes up to 3,000 Mb, there is in general a good correlation between genome size and the percent of the genome composed of repetitive sequence and that this repetitive component is highly dynamic. In this study, we sampled the Australian lungfish genome using three mini-genomic libraries and found that with very little sequence, the results converged on an estimate of 40% of the genome being composed of recognizable transposable elements (TEs), chiefly from the CR1 and L2 long interspersed nuclear element clades. We further characterized the CR1 and L2 elements in the lungfish genome and show that although most CR1 elements probably represent recent amplifications, the L2 elements are more diverse and are more likely the result of a series of amplifications. We suggest that our sampling method has probably underestimated the recognizable TE content. However, on the basis of the most likely sources of error, we suggest that this very large genome is not largely composed of recently amplified, undetected TEs but may instead include a large component of older degenerate TEs. Based on these estimates, and on Thomson's (Thomson K. 1972. An attempt to reconstruct evolutionary changes in the cellular DNA content of lungfish. J Exp Zool. 180:363-372) inference that in the lineage leading to the extant Australian lungfish, there was massive increase in genome size between 350 and 200 mya, after which the size of the genome changed little, we speculate that the very large Australian lungfish genome may be the result of a massive amplification of TEs followed by a long period with a very low rate of sequence removal and some ongoing TE activity.

  13. A nine-scaffold genome assembly of the nine chromosome sugar beet

    USDA-ARS?s Scientific Manuscript database

    Over the course of 20 months, we assembled a sugar beet genome (700 - 800 Mb) into a close representation of the nine haploid chromosomes of beet. This result was obtained by sequentially assembling sequences >40 kb in length, orienting these assemblies via optical mapping, and scaffolding with in v...

  14. Agrobacterium-mediated transformation of the haploid liverwort Marchantia polymorpha L., an emerging model for plant biology.

    PubMed

    Ishizaki, Kimitsune; Chiyoda, Shota; Yamato, Katsuyuki T; Kohchi, Takayuki

    2008-07-01

    Agrobacterium-mediated transformation has not been practical in pteridophytes, bryophytes and algae to date, although it is commonly used in model plants including Arabidopsis and rice. Here we present a rapid Agrobacterium-mediated transformation system for the haploid liverwort Marchantia polymorpha L. using immature thalli developed from spores. Hundreds of hygromycin-resistant plants per sporangium were obtained by co-cultivation of immature thalli with Agrobacterium carrying the binary vector that contains a reporter, the beta-glucuronidase (GUS) gene with an intron, and a selection marker, the hygromycin phosphotransferase (hpt) gene. In this system, individual gemmae, which arise asexually from single initial cells, were analyzed as isogenic transformants. GUS activity staining showed that all hygromycin-resistant plants examined expressed the GUS transgene in planta. DNA analyses verified random integration of 1-5 copies of the intact T-DNA between the right and the left borders into the M. polymorpha genome. The efficient and rapid Agrobacterium-mediated transformation of M. polymorpha should provide molecular techniques to facilitate comparative genomics, taking advantage of this unique model plant that retains many features of the common ancestor of land plants.

  15. Challenges of flow-cytometric estimation of nuclear genome size in orchids, a plant group with both whole-genome and progressively partial endoreplication.

    PubMed

    Trávníček, Pavel; Ponert, Jan; Urfus, Tomáš; Jersáková, Jana; Vrána, Jan; Hřibová, Eva; Doležel, Jaroslav; Suda, Jan

    2015-10-01

    Nuclear genome size is an inherited quantitative trait of eukaryotic organisms with both practical and biological consequences. A detailed analysis of major families is a promising approach to fully understand the biological meaning of the extensive variation in genome size in plants. Although Orchidaceae accounts for ∼10% of the angiosperm diversity, the knowledge of patterns and dynamics of their genome size is limited, in part due to difficulties in flow cytometric analyses. Cells in various somatic tissues of orchids undergo extensive endoreplication, either whole-genome or partial, and the G1-phase nuclei with 2C DNA amounts may be lacking, resulting in overestimated genome size values. Interpretation of DNA content histograms is particularly challenging in species with progressively partial endoreplication, in which the ratios between the positions of two neighboring DNA peaks are lower than two. In order to assess distributions of nuclear DNA amounts and identify tissue suitable for reliable estimation of nuclear DNA content, we analyzed six different tissue types in 48 orchid species belonging to all recognized subfamilies. Although traditionally used leaves may provide incorrect C-values, particularly in species with progressively partial endoreplication, young ovaries and pollinaria consistently yield 2C and 1C peaks of their G1-phase nuclei, respectively, and are, therefore, the most suitable parts for genome size studies in orchids. We also provide new DNA C-values for 22 orchid genera and 42 species. Adhering to the proposed methodology would allow for reliable genome size estimates in this largest plant family. Although our research was limited to orchids, the need to find a suitable tissue with dominant 2C peak of G1-phase nuclei applies to all endopolyploid species. © 2015 International Society for Advancement of Cytometry.

  16. Genome size estimates for crustaceans using Feulgen image analysis densitometry of ethanol-preserved tissues.

    PubMed

    Jeffery, Nicholas W; Gregory, T Ryan

    2014-10-01

    Crustaceans are enormously diverse both phylogenetically and ecologically, but they remain substantially underrepresented in the existing genome size database. An expansion of this dataset could be facilitated if it were possible to obtain genome size estimates from ethanol-preserved specimens. In this study, two tests were performed in order to assess the reliability of genome size data generated using preserved material. First, the results of estimates based on flash-frozen versus ethanol-preserved material were compared across 37 species of crustaceans that differ widely in genome size. Second, a comparison was made of specimens from a single species that had been stored in ethanol for 1-14 years. In both cases, the use of gill tissue in Feulgen image analysis densitometry proved to be a very viable approach. This finding is of direct relevance to both new studies of field-collected crustaceans as well as potential studies based on existing collections. © 2014 International Society for Advancement of Cytometry.

  17. On the need for widespread horizontal gene transfers under genome size constraint.

    PubMed

    Isambert, Hervé; Stein, Richard R

    2009-08-25

    While eukaryotes primarily evolve by duplication-divergence expansion (and reduction) of their own gene repertoire with only rare horizontal gene transfers, prokaryotes appear to evolve under both gene duplications and widespread horizontal gene transfers over long evolutionary time scales. But, the evolutionary origin of this striking difference in the importance of horizontal gene transfers remains by and large a mystery. We propose that the abundance of horizontal gene transfers in free-living prokaryotes is a simple but necessary consequence of two opposite effects: i) their apparent genome size constraint compared to typical eukaryote genomes and ii) their underlying genome expansion dynamics through gene duplication-divergence evolution, as demonstrated by the presence of many tandem and block repeated genes. In principle, this combination of genome size constraint and underlying duplication expansion should lead to a coalescent-like process with extensive turnover of functional genes. This would, however, imply the unlikely, systematic reinvention of functions from discarded genes within independent phylogenetic lineages. Instead, we propose that the long-term evolutionary adaptation of free-living prokaryotes must have resulted in the emergence of efficient non-phylogenetic pathways to circumvent gene loss. This need for widespread horizontal gene transfers due to genome size constraint implies, in particular, that prokaryotes must remain under strong selection pressure in order to maintain the long-term evolutionary adaptation of their "mutualized" gene pool, beyond the inevitable turnover of individual prokaryote species. By contrast, the absence of genome size constraint for typical eukaryotes has presumably relaxed their need for widespread horizontal gene transfers and strong selection pressure. Yet, the resulting loss of genetic functions, due to weak selection pressure and inefficient gene recovery mechanisms, must have ultimately favored the

  18. Whole Genome Sequencing of the Braconid Parasitoid Wasp Fopius arisanus, an Important Biocontrol Agent of Pest Tepritid Fruit Flies

    PubMed Central

    Geib, Scott M.; Liang, Guang Hong; Murphy, Terence D.; Sim, Sheina B.

    2017-01-01

    The braconid wasp Fopius arisanus (Sonan) is an important biological control agent of tropical and subtropical pest fruit flies, including two important global pests, the Mediterranean fruit fly (Ceratitis capitata), and the oriental fruit fly (Bactrocera dorsalis). The goal of this study was to develop foundational genomic resources for this species to provide tools that can be used to answer questions exploring the multitrophic interactions between the host and parasitoid in this important research system. Here, we present a whole genome assembly of F. arisanus, derived from a pool of haploid offspring from a single unmated female. The genome is ∼154 Mb in size, with a N50 contig and scaffold size of 51,867 bp and 0.98 Mb, respectively. Utilizing existing RNA-Seq data for this species, as well as publicly available peptide sequences from related Hymenoptera, a high quality gene annotation set, which includes 10,991 protein coding genes, was generated. Prior to this assembly submission, no RefSeq proteins were present for this species. Parasitic wasps play an important role in a diverse ecosystem as well as a role in biological control of agricultural pests. This whole genome assembly and annotation data represents the first genome-scale assembly for this species or any closely related Opiine, and are publicly available in the National Center for Biotechnology Information Genome and RefSeq databases, providing a much needed genomic resource for this hymenopteran group. PMID:28584080

  19. True-breeding targeted gene knock-out in barley using designer TALE-nuclease in haploid cells.

    PubMed

    Gurushidze, Maia; Hensel, Goetz; Hiekel, Stefan; Schedel, Sindy; Valkov, Vladimir; Kumlehn, Jochen

    2014-01-01

    Transcription activator-like effector nucleases (TALENs) are customizable fusion proteins able to cleave virtually any genomic DNA sequence of choice, and thereby to generate site-directed genetic modifications in a wide range of cells and organisms. In the present study, we expressed TALENs in pollen-derived, regenerable cells to establish the generation of instantly true-breeding mutant plants. A gfp-specific TALEN pair was expressed via Agrobacterium-mediated transformation in embryogenic pollen of transgenic barley harboring a functional copy of gfp. Thanks to the haploid nature of the target cells, knock-out mutations were readily detected, and homozygous primary mutant plants obtained following genome duplication. In all, 22% of the TALEN transgenics proved knocked out with respect to gfp, and the loss of function could be ascribed to the deletions of between four and 36 nucleotides in length. The altered gfp alleles were transmitted normally through meiosis, and the knock-out phenotype was consistently shown by the offspring of two independent mutants. Thus, here we describe the efficient production of TALEN-mediated gene knock-outs in barley that are instantaneously homozygous and non-chimeric in regard to the site-directed mutations induced. This TALEN approach has broad applicability for both elucidating gene function and tailoring the phenotype of barley and other crop species.

  20. Convergent evolution of a fused sexual cycle promotes the haploid lifestyle

    NASA Astrophysics Data System (ADS)

    Sherwood, Racquel Kim; Scaduto, Christine M.; Torres, Sandra E.; Bennett, Richard J.

    2014-02-01

    Sexual reproduction is restricted to eukaryotic species and involves the fusion of haploid gametes to form a diploid cell that subsequently undergoes meiosis to generate recombinant haploid forms. This process has been extensively studied in the unicellular yeast Saccharomyces cerevisiae, which exhibits separate regulatory control over mating and meiosis. Here we address the mechanism of sexual reproduction in the related hemiascomycete species Candida lusitaniae. We demonstrate that, in contrast to S. cerevisiae, C. lusitaniae exhibits a highly integrated sexual program in which the programs regulating mating and meiosis have fused. Profiling of the C. lusitaniae sexual cycle revealed that gene expression patterns during mating and meiosis were overlapping, indicative of co-regulation. This was particularly evident for genes involved in pheromone MAPK signalling, which were highly induced throughout the sexual cycle of C. lusitaniae. Furthermore, genetic analysis showed that the orthologue of IME2, a `diploid-specific' factor in S. cerevisiae, and STE12, the master regulator of S. cerevisiae mating, were each required for progression through both mating and meiosis in C. lusitaniae. Together, our results establish that sexual reproduction has undergone significant rewiring between S. cerevisiae and C. lusitaniae, and that a concerted sexual cycle operates in C. lusitaniae that is more reminiscent of the distantly related ascomycete, Schizosaccharomyces pombe. We discuss these results in light of the evolution of sexual reproduction in yeast, and propose that regulatory coupling of mating and meiosis has evolved multiple times as an adaptation to promote the haploid lifestyle.

  1. A New and Improved Rainbow Trout (Oncorhynchus mykiss) Reference Genome Assembly

    USDA-ARS?s Scientific Manuscript database

    In an effort to improve the rainbow trout reference genome assembly, we re-sequenced the doubled-haploid Swanson line using the longest available reads from the Illumina technology; generating over 510 million paired-end shotgun reads (2x260nt), and 1 billion mate-pair reads (2x160nt) from four sequ...

  2. Rapid Increase in Genome Size as a Consequence of Transposable Element Hyperactivity in Wood-White (Leptidea) Butterflies

    PubMed Central

    Talla, Venkat; Suh, Alexander; Kalsoom, Faheema; Dincă, Vlad; Vila, Roger; Friberg, Magne; Wiklund, Christer

    2017-01-01

    Abstract Characterizing and quantifying genome size variation among organisms and understanding if genome size evolves as a consequence of adaptive or stochastic processes have been long-standing goals in evolutionary biology. Here, we investigate genome size variation and association with transposable elements (TEs) across lepidopteran lineages using a novel genome assembly of the common wood-white (Leptidea sinapis) and population re-sequencing data from both L. sinapis and the closely related L. reali and L. juvernica together with 12 previously available lepidopteran genome assemblies. A phylogenetic analysis confirms established relationships among species, but identifies previously unknown intraspecific structure within Leptidea lineages. The genome assembly of L. sinapis is one of the largest of any lepidopteran taxon so far (643 Mb) and genome size is correlated with abundance of TEs, both in Lepidoptera in general and within Leptidea where L. juvernica from Kazakhstan has considerably larger genome size than any other Leptidea population. Specific TE subclasses have been active in different Lepidoptera lineages with a pronounced expansion of predominantly LINEs, DNA elements, and unclassified TEs in the Leptidea lineage after the split from other Pieridae. The rate of genome expansion in Leptidea in general has been in the range of four Mb/Million year (My), with an increase in a particular L. juvernica population to 72 Mb/My. The considerable differences in accumulation rates of specific TE classes in different lineages indicate that TE activity plays a major role in genome size evolution in butterflies and moths. PMID:28981642

  3. Medium-sized tandem repeats represent an abundant component of the Drosophila virilis genome.

    PubMed

    Abdurashitov, Murat A; Gonchar, Danila A; Chernukhin, Valery A; Tomilov, Victor N; Tomilova, Julia E; Schostak, Natalia G; Zatsepina, Olga G; Zelentsova, Elena S; Evgen'ev, Michael B; Degtyarev, Sergey K H

    2013-11-09

    Previously, we developed a simple method for carrying out a restriction enzyme analysis of eukaryotic DNA in silico, based on the known DNA sequences of the genomes. This method allows the user to calculate lengths of all DNA fragments that are formed after a whole genome is digested at the theoretical recognition sites of a given restriction enzyme. A comparison of the observed peaks in distribution diagrams with the results from DNA cleavage using several restriction enzymes performed in vitro have shown good correspondence between the theoretical and experimental data in several cases. Here, we applied this approach to the annotated genome of Drosophila virilis which is extremely rich in various repeats. Here we explored the combined approach to perform the restriction analysis of D. virilis DNA. This approach enabled to reveal three abundant medium-sized tandem repeats within the D. virilis genome. While the 225 bp repeats were revealed previously in intergenic non-transcribed spacers between ribosomal genes of D. virilis, two other families comprised of 154 bp and 172 bp repeats were not described. Tandem Repeats Finder search demonstrated that 154 bp and 172 bp units are organized in multiple clusters in the genome of D. virilis. Characteristically, only 154 bp repeats derived from Helitron transposon are transcribed. Using in silico digestion in combination with conventional restriction analysis and sequencing of repeated DNA fragments enabled us to isolate and characterize three highly abundant families of medium-sized repeats present in the D. virilis genome. These repeats comprise a significant portion of the genome and may have important roles in genome function and structural integrity. Therefore, we demonstrated an approach which makes possible to investigate in detail the gross arrangement and expression of medium-sized repeats basing on sequencing data even in the case of incompletely assembled and/or annotated genomes.

  4. Genome Size Variation in the Genus Carthamus (Asteraceae, Cardueae): Systematic Implications and Additive Changes During Allopolyploidization

    PubMed Central

    GARNATJE, TERESA; GARCIA, SÒNIA; VILATERSANA, ROSER; VALLÈS, JOAN

    2006-01-01

    • Background and Aims Plant genome size is an important biological characteristic, with relationships to systematics, ecology and distribution. Currently, there is no information regarding nuclear DNA content for any Carthamus species. In addition to improving the knowledge base, this research focuses on interspecific variation and its implications for the infrageneric classification of this genus. Genome size variation in the process of allopolyploid formation is also addressed. • Methods Nuclear DNA samples from 34 populations of 16 species of the genus Carthamus were assessed by flow cytometry using propidium iodide. • Key Results The 2C values ranged from 2·26 pg for C. leucocaulos to 7·46 pg for C. turkestanicus, and monoploid genome size (1Cx-value) ranged from 1·13 pg in C. leucocaulos to 1·53 pg in C. alexandrinus. Mean genome sizes differed significantly, based on sectional classification. Both allopolyploid species (C. creticus and C. turkestanicus) exhibited nuclear DNA contents in accordance with the sum of the putative parental C-values (in one case with a slight reduction, frequent in polyploids), supporting their hybrid origin. • Conclusions Genome size represents a useful tool in elucidating systematic relationships between closely related species. A considerable reduction in monoploid genome size, possibly due to the hybrid formation, is also reported within these taxa. PMID:16390843

  5. Rapid Increase in Genome Size as a Consequence of Transposable Element Hyperactivity in Wood-White (Leptidea) Butterflies.

    PubMed

    Talla, Venkat; Suh, Alexander; Kalsoom, Faheema; Dinca, Vlad; Vila, Roger; Friberg, Magne; Wiklund, Christer; Backström, Niclas

    2017-10-01

    Characterizing and quantifying genome size variation among organisms and understanding if genome size evolves as a consequence of adaptive or stochastic processes have been long-standing goals in evolutionary biology. Here, we investigate genome size variation and association with transposable elements (TEs) across lepidopteran lineages using a novel genome assembly of the common wood-white (Leptidea sinapis) and population re-sequencing data from both L. sinapis and the closely related L. reali and L. juvernica together with 12 previously available lepidopteran genome assemblies. A phylogenetic analysis confirms established relationships among species, but identifies previously unknown intraspecific structure within Leptidea lineages. The genome assembly of L. sinapis is one of the largest of any lepidopteran taxon so far (643 Mb) and genome size is correlated with abundance of TEs, both in Lepidoptera in general and within Leptidea where L. juvernica from Kazakhstan has considerably larger genome size than any other Leptidea population. Specific TE subclasses have been active in different Lepidoptera lineages with a pronounced expansion of predominantly LINEs, DNA elements, and unclassified TEs in the Leptidea lineage after the split from other Pieridae. The rate of genome expansion in Leptidea in general has been in the range of four Mb/Million year (My), with an increase in a particular L. juvernica population to 72 Mb/My. The considerable differences in accumulation rates of specific TE classes in different lineages indicate that TE activity plays a major role in genome size evolution in butterflies and moths. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  6. Meiosis and Haploid Gametes in the Pathogen Trypanosoma brucei

    PubMed Central

    Peacock, Lori; Bailey, Mick; Carrington, Mark; Gibson, Wendy

    2014-01-01

    Summary In eukaryote pathogens, sex is an important driving force in spreading genes for drug resistance, pathogenicity, and virulence [1]. For the parasitic trypanosomes that cause African sleeping sickness, mating occurs during transmission by the tsetse vector [2, 3] and involves meiosis [4], but haploid gametes have not yet been identified. Here, we show that meiosis is a normal part of development in the insect salivary glands for all subspecies of Trypanosoma brucei, including the human pathogens. By observing insect-derived trypanosomes during the window of peak expression of meiosis-specific genes, we identified promastigote-like (PL) cells that interacted with each other via their flagella and underwent fusion, as visualized by the mixing of cytoplasmic red and green fluorescent proteins. PL cells had a short, wide body, a very long anterior flagellum, and either one or two kinetoplasts, but only the anterior kinetoplast was associated with the flagellum. Measurement of nuclear DNA contents showed that PL cells were haploid relative to diploid metacyclics. Trypanosomes are among the earliest diverging eukaryotes, and our results support the hypothesis that meiosis and sexual reproduction are ubiquitous in eukaryotes and likely to have been early innovations [5]. PMID:24388851

  7. Genome size variation among sex types in dioecious and triecious Caricaceae species

    USDA-ARS?s Scientific Manuscript database

    Caricaceae is a small family consisting of 35 species of varying sexual systems and includes economically important fruit crop, Carica papaya, and other species of “highland papayas”. Flow cytometry was used to obtain genome sizes for 11 species in three genera of Caricaceae to determine if genome s...

  8. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication

    USDA-ARS?s Scientific Manuscript database

    Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes—a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-o...

  9. First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Traditional Medicinal Plant

    PubMed Central

    Liu, Di; Zeng, Shao-Hua; Chen, Jian-Jun; Zhang, Yan-Jun; Xiao, Gong; Zhu, Lin-Yao; Wang, Ying

    2013-01-01

    Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant. PMID:23807511

  10. Human Genome Sequencing in Health and Disease

    PubMed Central

    Gonzaga-Jauregui, Claudia; Lupski, James R.; Gibbs, Richard A.

    2013-01-01

    Following the “finished,” euchromatic, haploid human reference genome sequence, the rapid development of novel, faster, and cheaper sequencing technologies is making possible the era of personalized human genomics. Personal diploid human genome sequences have been generated, and each has contributed to our better understanding of variation in the human genome. We have consequently begun to appreciate the vastness of individual genetic variation from single nucleotide to structural variants. Translation of genome-scale variation into medically useful information is, however, in its infancy. This review summarizes the initial steps undertaken in clinical implementation of personal genome information, and describes the application of whole-genome and exome sequencing to identify the cause of genetic diseases and to suggest adjuvant therapies. Better analysis tools and a deeper understanding of the biology of our genome are necessary in order to decipher, interpret, and optimize clinical utility of what the variation in the human genome can teach us. Personal genome sequencing may eventually become an instrument of common medical practice, providing information that assists in the formulation of a differential diagnosis. We outline herein some of the remaining challenges. PMID:22248320

  11. Reassessment of genome size in turtle and crocodile based on chromosome measurement by flow karyotyping: close similarity to chicken

    PubMed Central

    Kasai, Fumio; O'Brien, Patricia C. M.; Ferguson-Smith, Malcolm A.

    2012-01-01

    The genome size in turtles and crocodiles is thought to be much larger than the 1.2 Gb of the chicken (Gallus gallus domesticus, GGA), according to the animal genome size database. However, GGA macrochromosomes show extensive homology in the karyotypes of the red eared slider (Trachemys scripta elegans, TSC) and the Nile crocodile (Crocodylus niloticus, CNI), and bird and reptile genomes have been highly conserved during evolution. In this study, size and GC content of all chromosomes are measured from the flow karyotypes of GGA, TSC and CNI. Genome sizes estimated from the total chromosome size demonstrate that TSC and CNI are 1.21 Gb and 1.29 Gb, respectively. This refines previous overestimations and reveals similar genome sizes in chicken, turtle and crocodile. Analysis of chromosome GC content in each of these three species shows a higher GC content in smaller chromosomes than in larger chromosomes. This contrasts with mammals and squamates in which GC content does not correlate with chromosome size. These data suggest that a common ancestor of birds, turtles and crocodiles had a small genome size and a chromosomal size-dependent GC bias, distinct from the squamate lineage. PMID:22491763

  12. Draft genome sequence of the rubber tree Hevea brasiliensis.

    PubMed

    Rahman, Ahmad Yamin Abdul; Usharraj, Abhilash O; Misra, Biswapriya B; Thottathil, Gincy P; Jayasekaran, Kandakumar; Feng, Yun; Hou, Shaobin; Ong, Su Yean; Ng, Fui Ling; Lee, Ling Sze; Tan, Hock Siew; Sakaff, Muhd Khairul Luqman Muhd; Teh, Beng Soon; Khoo, Bee Feong; Badai, Siti Suriawati; Aziz, Nurohaida Ab; Yuryev, Anton; Knudsen, Bjarne; Dionne-Laporte, Alexandre; Mchunu, Nokuthula P; Yu, Qingyi; Langston, Brennick J; Freitas, Tracey Allen K; Young, Aaron G; Chen, Rui; Wang, Lei; Najimudin, Nazalan; Saito, Jennifer A; Alam, Maqsudul

    2013-02-02

    Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber.

  13. Characters that differ between diploid and haploid honey bee (Apis mellifera) drones.

    PubMed

    Herrmann, Matthias; Trenzcek, Tina; Fahrenhorst, Hartmut; Engels, Wolf

    2005-12-30

    Diploid males have long been considered a curiosity contradictory to the haplo-diploid mode of sex determination in the Hymenoptera. In Apis mellifera, 'false' diploid male larvae are eliminated by worker cannibalism immediately after hatching. A 'cannibalism substance' produced by diploid drone larvae to induce worker-assisted suicide has been hypothesized, but it has never been detected. Diploid drones are only removed some hours after hatching. Older larvae are evidently not regarded as 'false males' and instead are regularly nursed by the brood-attending worker bees. As the pheromonal cues presumably are located on the surface of newly hatched bee larvae, we extracted the cuticular secretions and analyzed their chemical composition by gas chromatograph-mass spectrometry (GC-MS) analyses. Larvae were sexed and then reared in vitro for up to three days. The GC-MS pattern that was obtained, with alkanes as the major compounds, was compared between diploid and haploid drone larvae. We also examined some physical parameters of adult drones. There was no difference between diploid and haploid males in their weight at the day of emergence. The diploid adult drones had fewer wing hooks and smaller testes. The sperm DNA content was 0.30 and 0.15 pg per nucleus, giving an exact 2:1 ratio for the gametocytes of diploid and haploid drones, respectively. Vitellogenin was found in the hemolymph of both types of imaginal drones at 5 to 6 days, with a significantly lower titer in the diploids.

  14. Insights into the Evolution of Mitochondrial Genome Size from Complete Sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae)

    PubMed Central

    Alverson, Andrew J.; Wei, XiaoXin; Rice, Danny W.; Stern, David B.; Barry, Kerrie; Palmer, Jeffrey D.

    2010-01-01

    The mitochondrial genomes of seed plants are unusually large and vary in size by at least an order of magnitude. Much of this variation occurs within a single family, the Cucurbitaceae, whose genomes range from an estimated 390 to 2,900 kb in size. We sequenced the mitochondrial genomes of Citrullus lanatus (watermelon: 379,236 nt) and Cucurbita pepo (zucchini: 982,833 nt)—the two smallest characterized cucurbit mitochondrial genomes—and determined their RNA editing content. The relatively compact Citrullus mitochondrial genome actually contains more and longer genes and introns, longer segmental duplications, and more discernibly nuclear-derived DNA. The large size of the Cucurbita mitochondrial genome reflects the accumulation of unprecedented amounts of both chloroplast sequences (>113 kb) and short repeated sequences (>370 kb). A low mutation rate has been hypothesized to underlie increases in both genome size and RNA editing frequency in plant mitochondria. However, despite its much larger genome, Cucurbita has a significantly higher synonymous substitution rate (and presumably mutation rate) than Citrullus but comparable levels of RNA editing. The evolution of mutation rate, genome size, and RNA editing are apparently decoupled in Cucurbitaceae, reflecting either simple stochastic variation or governance by different factors. PMID:20118192

  15. Insights on genome size evolution from a miniature inverted repeat transposon driving a satellite DNA.

    PubMed

    Scalvenzi, Thibault; Pollet, Nicolas

    2014-12-01

    The genome size in eukaryotes does not correlate well with the number of genes they contain. We can observe this so-called C-value paradox in amphibian species. By analyzing an amphibian genome we asked how repetitive DNA can impact genome size and architecture. We describe here our discovery of a Tc1/mariner miniature inverted-repeat transposon family present in Xenopus frogs. These transposons named miDNA4 are unique since they contain a satellite DNA motif. We found that miDNA4 measured 331 bp, contained 25 bp long inverted terminal repeat sequences and a sequence motif of 119 bp present as a unique copy or as an array of 2-47 copies. We characterized the structure, dynamics, impact and evolution of the miDNA4 family and its satellite DNA in Xenopus frog genomes. This led us to propose a model for the evolution of these two repeated sequences and how they can synergize to increase genome size. Copyright © 2014 Elsevier Inc. All rights reserved.

  16. Effects of Caffeine and Chlorogenic Acid on Propidium Iodide Accessibility to DNA: Consequences on Genome Size Evaluation in Coffee Tree

    PubMed Central

    NOIROT, M.; BARRE, P.; DUPERRAY, C.; LOUARN, J.; HAMON, S.

    2003-01-01

    Estimates of genome size using flow cytometry can be biased by the presence of cytosolic compounds, leading to pseudo‐intraspecific variation in genome size. Two important compounds present in coffee trees—caffeine and chlorogenic acid—modify accessibility of the dye propidium iodide to Petunia DNA, a species used as internal standard in our genome size evaluation. These compounds could be responsible for intraspecific variation in genome size since their contents vary between trees. They could also be implicated in environmental variations in genome size, such as those revealed when comparing the results of evaluations carried out on different dates on several genotypes. PMID:12876189

  17. Genome size and invasiveness traits in the hybrid meadow knapweed complex (Centaurea x moncktonii) in eastern North America

    USDA-ARS?s Scientific Manuscript database

    Hybridization and genomic admixture between divergent populations or species may be an important driver of plant invasiveness. Recent studies have emphasized the critical role that reductions in genome size may play in facilitating the rapid evolution of invasiveness, and small genome size has been ...

  18. BAC end sequencing of Pacific white shrimp Litopenaeus vannamei: a glimpse into the genome of Penaeid shrimp

    NASA Astrophysics Data System (ADS)

    Zhao, Cui; Zhang, Xiaojun; Liu, Chengzhang; Huan, Pin; Li, Fuhua; Xiang, Jianhai; Huang, Chao

    2012-05-01

    Little is known about the genome of Pacific white shrimp ( Litopenaeus vannamei). To address this, we conducted BAC (bacterial artificial chromosome) end sequencing of L. vannamei. We selected and sequenced 7 812 BAC clones from the BAC library LvHE from the two ends of the inserts by Sanger sequencing. After trimming and quality filtering, 11 279 BAC end sequences (BESs) including 4 609 pairedends BESs were obtained. The total length of the BESs was 4 340 753 bp, representing 0.18% of the L. vannamei haploid genome. The lengths of the BESs ranged from 100 bp to 660 bp with an average length of 385 bp. Analysis of the BESs indicated that the L. vannamei genome is AT-rich and that the primary repeats patterns were simple sequence repeats (SSRs) and low complexity sequences. Dinucleotide and hexanucleotide repeats were the most common SSR types in the BESs. The most abundant transposable element was gypsy, which may contribute to the generation of the large genome size of L. vannamei. We successfully annotated 4 519 BESs by BLAST searching, including genes involved in immunity and sex determination. Our results provide an important resource for functional gene studies, map construction and integration, and complete genome assembly for this species.

  19. Diallel crossing among doubled haploids of cucumber reveals significant reciprocal-cross differences

    USDA-ARS?s Scientific Manuscript database

    Cucumber is an excellent plant for studying organellar effects on phenotypes because chloroplasts show maternal and mitochondria paternal transmission. We produced doubled haploids (DH) from divergent cucumber populations, generated reciprocal crosses in a diallel mating scheme, measured fresh and d...

  20. Meiosis and haploid gametes in the pathogen Trypanosoma brucei.

    PubMed

    Peacock, Lori; Bailey, Mick; Carrington, Mark; Gibson, Wendy

    2014-01-20

    In eukaryote pathogens, sex is an important driving force in spreading genes for drug resistance, pathogenicity, and virulence. For the parasitic trypanosomes that cause African sleeping sickness, mating occurs during transmission by the tsetse vector and involves meiosis, but haploid gametes have not yet been identified. Here, we show that meiosis is a normal part of development in the insect salivary glands for all subspecies of Trypanosoma brucei, including the human pathogens. By observing insect-derived trypanosomes during the window of peak expression of meiosis-specific genes, we identified promastigote-like (PL) cells that interacted with each other via their flagella and underwent fusion, as visualized by the mixing of cytoplasmic red and green fluorescent proteins. PL cells had a short, wide body, a very long anterior flagellum, and either one or two kinetoplasts, but only the anterior kinetoplast was associated with the flagellum. Measurement of nuclear DNA contents showed that PL cells were haploid relative to diploid metacyclics. Trypanosomes are among the earliest diverging eukaryotes, and our results support the hypothesis that meiosis and sexual reproduction are ubiquitous in eukaryotes and likely to have been early innovations. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  1. The genome sequence of the colonial chordate, Botryllus schlosseri

    PubMed Central

    Voskoboynik, Ayelet; Neff, Norma F; Sahoo, Debashis; Newman, Aaron M; Pushkarev, Dmitry; Koh, Winston; Passarelli, Benedetto; Fan, H Christina; Mantalas, Gary L; Palmeri, Karla J; Ishizuka, Katherine J; Gissi, Carmela; Griggio, Francesca; Ben-Shlomo, Rachel; Corey, Daniel M; Penland, Lolita; White, Richard A; Weissman, Irving L; Quake, Stephen R

    2013-01-01

    Botryllus schlosseri is a colonial urochordate that follows the chordate plan of development following sexual reproduction, but invokes a stem cell-mediated budding program during subsequent rounds of asexual reproduction. As urochordates are considered to be the closest living invertebrate relatives of vertebrates, they are ideal subjects for whole genome sequence analyses. Using a novel method for high-throughput sequencing of eukaryotic genomes, we sequenced and assembled 580 Mbp of the B. schlosseri genome. The genome assembly is comprised of nearly 14,000 intron-containing predicted genes, and 13,500 intron-less predicted genes, 40% of which could be confidently parceled into 13 (of 16 haploid) chromosomes. A comparison of homologous genes between B. schlosseri and other diverse taxonomic groups revealed genomic events underlying the evolution of vertebrates and lymphoid-mediated immunity. The B. schlosseri genome is a community resource for studying alternative modes of reproduction, natural transplantation reactions, and stem cell-mediated regeneration. DOI: http://dx.doi.org/10.7554/eLife.00569.001 PMID:23840927

  2. Construction and Analysis of Siberian Tiger Bacterial Artificial Chromosome Library with Approximately 6.5-Fold Genome Equivalent Coverage

    PubMed Central

    Liu, Changqing; Bai, Chunyu; Guo, Yu; Liu, Dan; Lu, Taofeng; Li, Xiangchen; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2014-01-01

    Bacterial artificial chromosome (BAC) libraries are extremely valuable for the genome-wide genetic dissection of complex organisms. The Siberian tiger, one of the most well-known wild primitive carnivores in China, is an endangered animal. In order to promote research on its genome, a high-redundancy BAC library of the Siberian tiger was constructed and characterized. The library is divided into two sub-libraries prepared from blood cells and two sub-libraries prepared from fibroblasts. This BAC library contains 153,600 individually archived clones; for PCR-based screening of the library, BACs were placed into 40 superpools of 10 × 384-deep well microplates. The average insert size of BAC clones was estimated to be 116.5 kb, representing approximately 6.46 genome equivalents of the haploid genome and affording a 98.86% statistical probability of obtaining at least one clone containing a unique DNA sequence. Screening the library with 19 microsatellite markers and a SRY sequence revealed that each of these markers were present in the library; the average number of positive clones per marker was 6.74 (range 2 to 12), consistent with 6.46 coverage of the tiger genome. Additionally, we identified 72 microsatellite markers that could potentially be used as genetic markers. This BAC library will serve as a valuable resource for physical mapping, comparative genomic study and large-scale genome sequencing in the tiger. PMID:24608928

  3. Recombination and genetic variance among maize doubled haploids induced from F1 and F2 plants.

    PubMed

    Sleper, Joshua A; Bernardo, Rex

    2016-12-01

    Inducing maize doubled haploids from F 2 plants (DHF2) instead of F 1 plants (DHF1) led to more recombination events. However, the best DHF2 lines did not outperform the best DHF1 lines. Maize (Zea mays L.) breeders rely on doubled haploid (DH) technology for fast and efficient production of inbreds. Breeders can induce DH lines most quickly from F 1 plants (DHF1), or induce DH lines from F 2 plants (DHF2) to allow selection prior to DH induction and have more recombinations. Our objective was to determine if the additional recombinations in maize DHF2 lines lead to a larger genetic variance and a superior mean of the best lines. A total of 311 DHF1 and 241 DHF2 lines, derived from the same biparental cross, were crossed to two testers and evaluated in multilocation trials in Europe and the US. The mean number of recombinations per genome was 14.48 among the DHF1 lines and 21.38 among the DHF1 lines. The means of the DHF1 and DHF2 lines did not differ for yield, moisture, and plant height. The genetic variance was higher among DHF2 lines than among DHF1 lines for moisture, but not for yield and plant height. The ratio of repulsion to coupling linkages, which was estimated from genomewide marker effects, was higher among DHF1 lines than among DHF2 lines for moisture, but not for yield and plant height. The higher genetic variance for moisture among DHF2 lines did not lead to lower moisture of the best 10 % of the lines. Our results indicated that the decision of inducing DH lines from F 1 or F 2 plants needs to be made from considerations other than the performance of the resulting DHF1 or DHF2 lines.

  4. The first genetic map of a synthesized allohexaploid Brassica with A, B and C genomes based on simple sequence repeat markers.

    PubMed

    Yang, S; Chen, S; Geng, X X; Yan, G; Li, Z Y; Meng, J L; Cowling, W A; Zhou, W J

    2016-04-01

    We present the first genetic map of an allohexaploid Brassica species, based on segregating microsatellite markers in a doubled haploid mapping population generated from a hybrid between two hexaploid parents. This study reports the first genetic map of trigenomic Brassica. A doubled haploid mapping population consisting of 189 lines was obtained via microspore culture from a hybrid H16-1 derived from a cross between two allohexaploid Brassica lines (7H170-1 and Y54-2). Simple sequence repeat primer pairs specific to the A genome (107), B genome (44) and C genome (109) were used to construct a genetic linkage map of the population. Twenty-seven linkage groups were resolved from 274 polymorphic loci on the A genome (109), B genome (49) and C genome (116) covering a total genetic distance of 3178.8 cM with an average distance between markers of 11.60 cM. This is the first genetic framework map for the artificially synthesized Brassica allohexaploids. The linkage groups represent the expected complement of chromosomes in the A, B and C genomes from the original diploid and tetraploid parents. This framework linkage map will be valuable for QTL analysis and future genetic improvement of a new allohexaploid Brassica species, and in improving our understanding of the genetic control of meiosis in new polyploids.

  5. Evolutionary and Taxonomic Implications of Variation in Nuclear Genome Size: Lesson from the Grass Genus Anthoxanthum (Poaceae).

    PubMed

    Chumová, Zuzana; Krejčíková, Jana; Mandáková, Terezie; Suda, Jan; Trávníček, Pavel

    2015-01-01

    The genus Anthoxanthum (sweet vernal grass, Poaceae) represents a taxonomically intricate polyploid complex with large phenotypic variation and its evolutionary relationships still poorly resolved. In order to get insight into the geographic distribution of ploidy levels and assess the taxonomic value of genome size data, we determined C- and Cx-values in 628 plants representing all currently recognized European species collected from 197 populations in 29 European countries. The flow cytometric estimates were supplemented by conventional chromosome counts. In addition to diploids, we found two low (rare 3x and common 4x) and one high (~16x-18x) polyploid levels. Mean holoploid genome sizes ranged from 5.52 pg in diploid A. alpinum to 44.75 pg in highly polyploid A. amarum, while the size of monoploid genomes ranged from 2.75 pg in tetraploid A. alpinum to 9.19 pg in diploid A. gracile. In contrast to Central and Northern Europe, which harboured only limited cytological variation, a much more complex pattern of genome sizes was revealed in the Mediterranean, particularly in Corsica. Eight taxonomic groups that partly corresponded to traditionally recognized species were delimited based on genome size values and phenotypic variation. Whereas our data supported the merger of A. aristatum and A. ovatum, eastern Mediterranean populations traditionally referred to as diploid A. odoratum were shown to be cytologically distinct, and may represent a new taxon. Autopolyploid origin was suggested for 4x A. alpinum. In contrast, 4x A. odoratum seems to be an allopolyploid, based on the amounts of nuclear DNA. Intraspecific variation in genome size was observed in all recognized species, the most striking example being the A. aristatum/ovatum complex. Altogether, our study showed that genome size can be a useful taxonomic marker in Anthoxathum to not only guide taxonomic decisions but also help resolve evolutionary relationships in this challenging grass genus.

  6. Quantifying the Variation in the Effective Population Size Within a Genome

    PubMed Central

    Gossmann, Toni I.; Woolfit, Megan; Eyre-Walker, Adam

    2011-01-01

    The effective population size (Ne) is one of the most fundamental parameters in population genetics. It is thought to vary across the genome as a consequence of differences in the rate of recombination and the density of selected sites due to the processes of genetic hitchhiking and background selection. Although it is known that there is intragenomic variation in the effective population size in some species, it is not known whether this is widespread or how much variation in the effective population size there is. Here, we test whether the effective population size varies across the genome, between protein-coding genes, in 10 eukaryotic species by considering whether there is significant variation in neutral diversity, taking into account differences in the mutation rate between loci by using the divergence between species. In most species we find significant evidence of variation. We investigate whether the variation in Ne is correlated to recombination rate and the density of selected sites in four species, for which these data are available. We find that Ne is positively correlated to recombination rate in one species, Drosophila melanogaster, and negatively correlated to a measure of the density of selected sites in two others, humans and Arabidopsis thaliana. However, much of the variation remains unexplained. We use a hierarchical Bayesian analysis to quantify the amount of variation in the effective population size and show that it is quite modest in all species—most genes have an Ne that is within a few fold of all other genes. Nonetheless we show that this modest variation in Ne is sufficient to cause significant differences in the efficiency of natural selection across the genome, by demonstrating that the ratio of the number of nonsynonymous to synonymous polymorphisms is significantly correlated to synonymous diversity and estimates of Ne, even taking into account the obvious nonindependence between these measures. PMID:21954163

  7. Draft genome sequence of the rubber tree Hevea brasiliensis

    PubMed Central

    2013-01-01

    Background Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Results Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. Conclusions The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber. PMID:23375136

  8. Distribution and diversity of cytotypes in Dianthus broteri as evidenced by genome size variations.

    PubMed

    Balao, Francisco; Casimiro-Soriguer, Ramón; Talavera, María; Herrera, Javier; Talavera, Salvador

    2009-10-01

    Studying the spatial distribution of cytotypes and genome size in plants can provide valuable information about the evolution of polyploid complexes. Here, the spatial distribution of cytological races and the amount of DNA in Dianthus broteri, an Iberian carnation with several ploidy levels, is investigated. Sample chromosome counts and flow cytometry (using propidium iodide) were used to determine overall genome size (2C value) and ploidy level in 244 individuals of 25 populations. Both fresh and dried samples were investigated. Differences in 2C and 1Cx values among ploidy levels within biogeographical provinces were tested using ANOVA. Geographical correlations of genome size were also explored. Extensive variation in chromosomes numbers (2n = 2x = 30, 2n = 4x = 60, 2n = 6x = 90 and 2n = 12x =180) was detected, and the dodecaploid cytotype is reported for the first time in this genus. As regards cytotype distribution, six populations were diploid, 11 were tetraploid, three were hexaploid and five were dodecaploid. Except for one diploid population containing some triploid plants (2n = 45), the remaining populations showed a single cytotype. Diploids appeared in two disjunct areas (south-east and south-west), and so did tetraploids (although with a considerably wider geographic range). Dehydrated leaf samples provided reliable measurements of DNA content. Genome size varied significantly among some cytotypes, and also extensively within diploid (up to 1.17-fold) and tetraploid (1.22-fold) populations. Nevertheless, variations were not straightforwardly congruent with ecology and geographical distribution. Dianthus broteri shows the highest diversity of cytotypes known to date in the genus Dianthus. Moreover, some cytotypes present remarkable internal genome size variation. The evolution of the complex is discussed in terms of autopolyploidy, with primary and secondary contact zones.

  9. Distribution and diversity of cytotypes in Dianthus broteri as evidenced by genome size variations

    PubMed Central

    Balao, Francisco; Casimiro-Soriguer, Ramón; Talavera, María; Herrera, Javier; Talavera, Salvador

    2009-01-01

    Background and Aims Studying the spatial distribution of cytotypes and genome size in plants can provide valuable information about the evolution of polyploid complexes. Here, the spatial distribution of cytological races and the amount of DNA in Dianthus broteri, an Iberian carnation with several ploidy levels, is investigated. Methods Sample chromosome counts and flow cytometry (using propidium iodide) were used to determine overall genome size (2C value) and ploidy level in 244 individuals of 25 populations. Both fresh and dried samples were investigated. Differences in 2C and 1Cx values among ploidy levels within biogeographical provinces were tested using ANOVA. Geographical correlations of genome size were also explored. Key Results Extensive variation in chromosomes numbers (2n = 2x = 30, 2n = 4x = 60, 2n = 6x = 90 and 2n = 12x =180) was detected, and the dodecaploid cytotype is reported for the first time in this genus. As regards cytotype distribution, six populations were diploid, 11 were tetraploid, three were hexaploid and five were dodecaploid. Except for one diploid population containing some triploid plants (2n = 45), the remaining populations showed a single cytotype. Diploids appeared in two disjunct areas (south-east and south-west), and so did tetraploids (although with a considerably wider geographic range). Dehydrated leaf samples provided reliable measurements of DNA content. Genome size varied significantly among some cytotypes, and also extensively within diploid (up to 1·17-fold) and tetraploid (1·22-fold) populations. Nevertheless, variations were not straightforwardly congruent with ecology and geographical distribution. Conclusions Dianthus broteri shows the highest diversity of cytotypes known to date in the genus Dianthus. Moreover, some cytotypes present remarkable internal genome size variation. The evolution of the complex is discussed in terms of autopolyploidy, with primary and secondary contact zones. PMID:19633312

  10. Point mutation impairs centromeric CENH3 loading and induces haploid plants.

    PubMed

    Karimi-Ashtiyani, Raheleh; Ishii, Takayoshi; Niessen, Markus; Stein, Nils; Heckmann, Stefan; Gurushidze, Maia; Banaei-Moghaddam, Ali Mohammad; Fuchs, Jörg; Schubert, Veit; Koch, Kerstin; Weiss, Oda; Demidov, Dmitri; Schmidt, Klaus; Kumlehn, Jochen; Houben, Andreas

    2015-09-08

    The chromosomal position of the centromere-specific histone H3 variant CENH3 (also called "CENP-A") is the assembly site for the kinetochore complex of active centromeres. Any error in transcription, translation, modification, or incorporation can affect the ability to assemble intact CENH3 chromatin and can cause centromere inactivation [Allshire RC, Karpen GH (2008) Nat Rev Genet 9 (12):923-937]. Here we show that a single-point amino acid exchange in the centromere-targeting domain of CENH3 leads to reduced centromere loading of CENH3 in barley, sugar beet, and Arabidopsis thaliana. Haploids were obtained after cenh3 L130F-complemented cenh3-null mutant plants were crossed with wild-type A. thaliana. In contrast, in a noncompeting situation (i.e., centromeres possessing only mutated or only wild-type CENH3), no uniparental chromosome elimination occurs during early embryogenesis. The high degree of evolutionary conservation of the identified mutation site offers promising opportunities for application in a wide range of crop species in which haploid technology is of interest.

  11. Point mutation impairs centromeric CENH3 loading and induces haploid plants

    PubMed Central

    Karimi-Ashtiyani, Raheleh; Ishii, Takayoshi; Niessen, Markus; Stein, Nils; Heckmann, Stefan; Gurushidze, Maia; Banaei-Moghaddam, Ali Mohammad; Fuchs, Jörg; Schubert, Veit; Koch, Kerstin; Weiss, Oda; Demidov, Dmitri; Schmidt, Klaus; Kumlehn, Jochen; Houben, Andreas

    2015-01-01

    The chromosomal position of the centromere-specific histone H3 variant CENH3 (also called “CENP-A”) is the assembly site for the kinetochore complex of active centromeres. Any error in transcription, translation, modification, or incorporation can affect the ability to assemble intact CENH3 chromatin and can cause centromere inactivation [Allshire RC, Karpen GH (2008) Nat Rev Genet 9 (12):923–937]. Here we show that a single-point amino acid exchange in the centromere-targeting domain of CENH3 leads to reduced centromere loading of CENH3 in barley, sugar beet, and Arabidopsis thaliana. Haploids were obtained after cenh3 L130F-complemented cenh3-null mutant plants were crossed with wild-type A. thaliana. In contrast, in a noncompeting situation (i.e., centromeres possessing only mutated or only wild-type CENH3), no uniparental chromosome elimination occurs during early embryogenesis. The high degree of evolutionary conservation of the identified mutation site offers promising opportunities for application in a wide range of crop species in which haploid technology is of interest. PMID:26294252

  12. Mating system and gene flow in the red seaweed Gracilaria gracilis: effect of haploid-diploid life history and intertidal rocky shore landscape on fine-scale genetic structure.

    PubMed

    Engel, C R; Destombe, C; Valero, M

    2004-04-01

    The impact of haploid-diploidy and the intertidal landscape on a fine-scale genetic structure was explored in a red seaweed Gracilaria gracilis. The pattern of genetic structure was compared in haploid and diploid stages at a microgeographic scale (< 5 km): a total of 280 haploid and 296 diploid individuals located in six discrete, scattered rock pools were genotyped using seven microsatellite loci. Contrary to the theoretical expectation of predominantly endogamous mating systems in haploid-diploid organisms, G. gracilis showed a clearly allogamous mating system. Although within-population allele frequencies were similar between haploids and diploids, genetic differentiation among haploids was more than twice that of diploids, suggesting that there may be a lag between migration and (local) breeding due to the long generation times in G. gracilis. Weak, but significant, population differentiation was detected in both haploids and diploids and varied with landscape features, and not with geographic distance. Using an assignment test, we establish that effective migration rates varied according to height on the shore. In this intertidal species, biased spore dispersal may occur during the transport of spores and gametes at low tide when small streams flow from high- to lower-shore pools. The longevity of both haploid and diploid free-living stages and the long generation times typical of G. gracilis populations may promote the observed pattern of high genetic diversity within populations relative to that among populations.

  13. Creation of BAC genomic resources for cocoa ( Theobroma cacao L.) for physical mapping of RGA containing BAC clones.

    PubMed

    Clément, D; Lanaud, C; Sabau, X; Fouet, O; Le Cunff, L; Ruiz, E; Risterucci, A M; Glaszmann, J C; Piffanelli, P

    2004-05-01

    We have constructed and validated the first cocoa ( Theobroma cacao L.) BAC library, with the aim of developing molecular resources to study the structure and evolution of the genome of this perennial crop. This library contains 36,864 clones with an average insert size of 120 kb, representing approximately ten haploid genome equivalents. It was constructed from the genotype Scavina-6 (Sca-6), a Forastero clone highly resistant to cocoa pathogens and a parent of existing mapping populations. Validation of the BAC library was carried out with a set of 13 genetically-anchored single copy and one duplicated markers. An average of nine BAC clones per probe was identified, giving an initial experimental estimation of the genome coverage represented in the library. Screening of the library with a set of resistance gene analogues (RGAs), previously mapped in cocoa and co-localizing with QTL for resistance to Phytophthora traits, confirmed at the physical level the tight clustering of RGAs in the cocoa genome and provided the first insights into the relationships between genetic and physical distances in the cocoa genome. This library represents an available BAC resource for structural genomic studies or map-based cloning of genes corresponding to important QTLs for agronomic traits such as resistance genes to major cocoa pathogens like Phytophthora spp ( palmivora and megakarya), Crinipellis perniciosa and Moniliophthora roreri.

  14. Genome size variation in Corchorus olitorius (Malvaceae s.l.) and its correlation with elevation and phenotypic traits.

    PubMed

    Benor, Solomon; Fuchs, Jörg; Blattner, Frank R

    2011-07-01

    In this study, we report genome size variations in Corchorus olitorius L. (Malvaceae s.l.), a crop species known for its morphological plasticity and broad geographical distribution, and Corchorus capsularis L., the second widely cultivated species in the genus. Flow cytometric analyses were conducted with several tissues and nuclei isolation buffers using 69 accessions of C. olitorius and 4 accessions of C. capsularis, representing different habitats and geographical origins. The mean 2C nuclear DNA content (± SD) of C. olitorius was estimated to be 0.918 ± 0.011 pg, with a minimum of 0.882 ± 0.004 pg, and a maximum of 0.942 ± 0.004 pg. All studied plant materials were found to be diploid with 2n = 14. The genome size is negatively correlated with days to flowering (r = -0.29, p < 0.05) and positively with seed surface area (r = 0.38, p < 0.05). Moreover, a statistically significant positive correlation was detected between genome size and growing elevation (r = 0.59, p < 0.001) in wild populations. The mean 2C nuclear DNA content of C. capsularis was estimated to be 0.802 ± 0.008 pg. In comparison to other economically important crop species, the genome sizes of C. olitorius and C. capsularis are much smaller, and therewith closer to that of rice. The relatively small genome sizes will be of general advantage for any efforts into genomics or sequencing approaches of these species.

  15. Transposition of a Ds element from a plasmid into the plant genome in Nicotiana plumbaginifolia protoplast-derived cells.

    PubMed

    Houba-Hérin, N; Domin, M; Pédron, J

    1994-07-01

    Nicotiana plumbaginifolia haploid protoplasts were co-transformed with two plasmids, one with a NPT-II/Ds element and one with a gene encoding an amino-terminal truncated Ac transposase. It is shown that Ds can efficiently transpose from extrachromosomal DNA to N. plumbaginifolia chromosomes when the Ac transposase gene is present in trans. Ds has been shown to have transposed into the plant genome in a limited number of copies (1.9 copies per genome), for 21/32 transgenic lines tested. The flanking sequences present in the original plasmid are missing in these 21 plants. In only two of 21 plants was part of the transposase construct integrated. By segregation analysis of transgenic progeny, Ds was shown to be present in the heterozygous state in 10 lines even though haploid protoplasts had been originally transformed. This observation could indicate that integration occurred after or during DNA replication that leads to protoplast diploidization.

  16. Evolutionary and Taxonomic Implications of Variation in Nuclear Genome Size: Lesson from the Grass Genus Anthoxanthum (Poaceae)

    PubMed Central

    Chumová, Zuzana; Krejčíková, Jana; Mandáková, Terezie; Suda, Jan; Trávníček, Pavel

    2015-01-01

    The genus Anthoxanthum (sweet vernal grass, Poaceae) represents a taxonomically intricate polyploid complex with large phenotypic variation and its evolutionary relationships still poorly resolved. In order to get insight into the geographic distribution of ploidy levels and assess the taxonomic value of genome size data, we determined C- and Cx-values in 628 plants representing all currently recognized European species collected from 197 populations in 29 European countries. The flow cytometric estimates were supplemented by conventional chromosome counts. In addition to diploids, we found two low (rare 3x and common 4x) and one high (~16x–18x) polyploid levels. Mean holoploid genome sizes ranged from 5.52 pg in diploid A. alpinum to 44.75 pg in highly polyploid A. amarum, while the size of monoploid genomes ranged from 2.75 pg in tetraploid A. alpinum to 9.19 pg in diploid A. gracile. In contrast to Central and Northern Europe, which harboured only limited cytological variation, a much more complex pattern of genome sizes was revealed in the Mediterranean, particularly in Corsica. Eight taxonomic groups that partly corresponded to traditionally recognized species were delimited based on genome size values and phenotypic variation. Whereas our data supported the merger of A. aristatum and A. ovatum, eastern Mediterranean populations traditionally referred to as diploid A. odoratum were shown to be cytologically distinct, and may represent a new taxon. Autopolyploid origin was suggested for 4x A. alpinum. In contrast, 4x A. odoratum seems to be an allopolyploid, based on the amounts of nuclear DNA. Intraspecific variation in genome size was observed in all recognized species, the most striking example being the A. aristatum/ovatum complex. Altogether, our study showed that genome size can be a useful taxonomic marker in Anthoxathum to not only guide taxonomic decisions but also help resolve evolutionary relationships in this challenging grass genus. PMID:26207824

  17. Doubled haploid production from Spanish onion (Allium cepa L.) germplasm: embryogenesis induction, plant regeneration and chromosome doubling.

    PubMed

    Fayos, Oreto; Vallés, María P; Garcés-Claver, Ana; Mallor, Cristina; Castillo, Ana M

    2015-01-01

    The use of doubled haploids in onion breeding is limited due to the low gynogenesis efficiency of this species. Gynogenesis capacity from Spanish germplasm, including the sweet cultivar Fuentes de Ebro, the highly pungent landrace BGHZ1354 and the two Valenciana type commercial varieties Recas and Rita, was evaluated and optimized in this study. The OH-1 population, characterized by a high gynogenesis induction, was used as control. Growing conditions of the donor plants were tested with a one-step protocol and field plants produced a slightly higher percentage of embryogenesis induction than growth chamber plants. A one-step protocol was compared with a two-step protocol for embryogenesis induction. Spanish germplasm produced a 2-3 times higher percentage of embryogenesis with the two-step protocol, Recas showing the highest percentage (2.09%) and Fuentes de Ebro the lowest (0.53%). These percentages were significantly lower than those from the OH-1 population, with an average of 15% independently of the protocol used. The effect of different containers on plant regeneration was tested using both protocols. The highest percentage of acclimated plants was obtained with the two-step protocol in combination with Eco2box (70%), whereas the lowest percentage was observed with glass tubes in the two protocols (20-23%). Different amiprofos-methyl (APM) treatments were applied to embryos for chromosome doubling. A similar number of doubled haploid plants were recovered with 25 or 50 μM APM in liquid medium. However, the application of 25 μM in solid medium for 24 h produced the highest number of doubled haploid plants. Somatic regeneration from flower buds of haploid and mixoploid plants proved to be a successful approach for chromosome doubling, since diploid plants were obtained from the four regenerated lines. In this study, doubled haploid plants were produced from the four Spanish cultivars, however further improvements are needed to increase their gynogenesis

  18. Dissection of the complex genetic basis of craniofacial anomalies using haploid genetics and interspecies hybrids in Nasonia wasps

    PubMed Central

    Werren, John H.; Cohen, Lorna B.; Gadau, Juergen; Ponce, Rita; Baudry, Emmanuelle; Lynch, Jeremy A.

    2016-01-01

    The animal head is a complex structure where numerous sensory, structural and alimentary structures are concentrated and integrated, and its ontogeny requires precise and delicate interactions among genes, cells, and tissues. Thus, it is perhaps unsurprising that craniofacial abnormalities are among the most common birth defects in people, or that these defects have a complex genetic basis involving interactions among multiple loci. Developmental processes that depend on such epistatic interactions become exponentially more difficult to study in diploid organisms as the number of genes involved increases. Here, we present hybrid haploid males of the wasp species pair Nasonia vitripennis and Nasonia giraulti, which have distinct male head morphologies, as a genetic model of craniofacial development that possesses the genetic advantages of haploidy, along with many powerful genomic tools. Viable, fertile hybrids can be made between the species, and quantitative trail loci related to shape differences have been identified. In addition, a subset of hybrid males show head abnormalities, including clefting at the midline and asymmetries. Crucially, epistatic interactions among multiple loci underlie several developmental differences and defects observed in the F2 hybrid males. Furthermore, we demonstrate an introgression of a chromosomal region from N. giraulti into N. vitripennis that shows an abnormality in relative eye size, which maps to a region containing a major QTL for this trait. Therefore, the genetic sources of head morphology can, in principle, be identified by positional cloning. Thus, Nasonia is well positioned to be a uniquely powerful model invertebrate system with which to probe both development and complex genetics of craniofacial patterning and defects. PMID:26721604

  19. Efficient genome editing of differentiated renal epithelial cells.

    PubMed

    Hofherr, Alexis; Busch, Tilman; Huber, Nora; Nold, Andreas; Bohn, Albert; Viau, Amandine; Bienaimé, Frank; Kuehn, E Wolfgang; Arnold, Sebastian J; Köttgen, Michael

    2017-02-01

    Recent advances in genome editing technologies have enabled the rapid and precise manipulation of genomes, including the targeted introduction, alteration, and removal of genomic sequences. However, respective methods have been described mainly in non-differentiated or haploid cell types. Genome editing of well-differentiated renal epithelial cells has been hampered by a range of technological issues, including optimal design, efficient expression of multiple genome editing constructs, attainable mutation rates, and best screening strategies. Here, we present an easily implementable workflow for the rapid generation of targeted heterozygous and homozygous genomic sequence alterations in renal cells using transcription activator-like effector nucleases (TALENs) and the clustered regularly interspaced short palindromic repeat (CRISPR) system. We demonstrate the versatility of established protocols by generating novel cellular models for studying autosomal dominant polycystic kidney disease (ADPKD). Furthermore, we show that cell culture-validated genetic modifications can be readily applied to mouse embryonic stem cells (mESCs) for the generation of corresponding mouse models. The described procedure for efficient genome editing can be applied to any cell type to study physiological and pathophysiological functions in the context of precisely engineered genotypes.

  20. Global DNA cytosine methylation as an evolving trait: phylogenetic signal and correlated evolution with genome size in angiosperms

    PubMed Central

    Alonso, Conchita; Pérez, Ricardo; Bazaga, Pilar; Herrera, Carlos M.

    2015-01-01

    DNA cytosine methylation is a widespread epigenetic mechanism in eukaryotes, and plant genomes commonly are densely methylated. Genomic methylation can be associated with functional consequences such as mutational events, genomic instability or altered gene expression, but little is known on interspecific variation in global cytosine methylation in plants. In this paper, we compare global cytosine methylation estimates obtained by HPLC and use a phylogenetically-informed analytical approach to test for significance of evolutionary signatures of this trait across 54 angiosperm species in 25 families. We evaluate whether interspecific variation in global cytosine methylation is statistically related to phylogenetic distance and also whether it is evolutionarily correlated with genome size (C-value). Global cytosine methylation varied widely between species, ranging between 5.3% (Arabidopsis) and 39.2% (Narcissus). Differences between species were related to their evolutionary trajectories, as denoted by the strong phylogenetic signal underlying interspecific variation. Global cytosine methylation and genome size were evolutionarily correlated, as revealed by the significant relationship between the corresponding phylogenetically independent contrasts. On average, a ten-fold increase in genome size entailed an increase of about 10% in global cytosine methylation. Results show that global cytosine methylation is an evolving trait in angiosperms whose evolutionary trajectory is significantly linked to changes in genome size, and suggest that the evolutionary implications of epigenetic mechanisms are likely to vary between plant lineages. PMID:25688257

  1. Mediterranean species of Caulerpa are polyploid with smaller genomes in the invasive ones.

    PubMed

    Varela-Álvarez, Elena; Gómez Garreta, Amelia; Rull Lluch, Jordi; Salvador Soler, Noemi; Serrao, Ester A; Siguán, María Antonia Ribera

    2012-01-01

    Caulerpa species are marine green algae, which often act as invasive species with rapid clonal proliferation when growing outside their native biogeographical borders. Despite many publications on the genetics and ecology of Caulerpa species, their life history and ploidy levels are still to be resolved and are the subject of large controversy. While some authors claimed that the thallus found in nature has a haplodiplobiontic life cycle with heteromorphic alternation of generations, other authors claimed a diploid or haploid life cycle with only one generation involved. DAPI-staining with image analysis and microspectrophotometry were used to estimate relative nuclear DNA contents in three species of Caulerpa from the Mediterranean, at individual, population and species levels. Results show that ploidy levels and genome size vary in these three Caulerpa species, with a reduction in genome size for the invasive ones. Caulerpa species in the Mediterranean are polyploids in different life history phases; all sampled C. taxifolia and C. racemosa var. cylindracea were in haplophasic phase, but in C. prolifera, the native species, individuals were found in both diplophasic and haplophasic phases. Different levels of endopolyploidy were found in both C. prolifera and C. racemosa var. cylindracea. Life history is elucidated for the Mediterranean C. prolifera and it is hypothesized that haplophasic dominance in C. racemosa var. cylindracea and C. taxifolia is a beneficial trait for their invasive strategies.

  2. Consensus generation and variant detection by Celera Assembler.

    PubMed

    Denisov, Gennady; Walenz, Brian; Halpern, Aaron L; Miller, Jason; Axelrod, Nelson; Levy, Samuel; Sutton, Granger

    2008-04-15

    We present an algorithm to identify allelic variation given a Whole Genome Shotgun (WGS) assembly of haploid sequences, and to produce a set of haploid consensus sequences rather than a single consensus sequence. Existing WGS assemblers take a column-by-column approach to consensus generation, and produce a single consensus sequence which can be inconsistent with the underlying haploid alleles, and inconsistent with any of the aligned sequence reads. Our new algorithm uses a dynamic windowing approach. It detects alleles by simultaneously processing the portions of aligned reads spanning a region of sequence variation, assigns reads to their respective alleles, phases adjacent variant alleles and generates a consensus sequence corresponding to each confirmed allele. This algorithm was used to produce the first diploid genome sequence of an individual human. It can also be applied to assemblies of multiple diploid individuals and hybrid assemblies of multiple haploid organisms. Being applied to the individual human genome assembly, the new algorithm detects exactly two confirmed alleles and reports two consensus sequences in 98.98% of the total number 2,033311 detected regions of sequence variation. In 33,269 out of 460,373 detected regions of size >1 bp, it fixes the constructed errors of a mosaic haploid representation of a diploid locus as produced by the original Celera Assembler consensus algorithm. Using an optimized procedure calibrated against 1 506 344 known SNPs, it detects 438 814 new heterozygous SNPs with false positive rate 12%. The open source code is available at: http://wgs-assembler.cvs.sourceforge.net/wgs-assembler/

  3. Whole-Genome Resequencing of Experimental Populations Reveals Polygenic Basis of Egg-Size Variation in Drosophila melanogaster

    PubMed Central

    Jha, Aashish R.; Miles, Cecelia M.; Lippert, Nodia R.; Brown, Christopher D.; White, Kevin P.; Kreitman, Martin

    2015-01-01

    Complete genome resequencing of populations holds great promise in deconstructing complex polygenic traits to elucidate molecular and developmental mechanisms of adaptation. Egg size is a classic adaptive trait in insects, birds, and other taxa, but its highly polygenic architecture has prevented high-resolution genetic analysis. We used replicated experimental evolution in Drosophila melanogaster and whole-genome sequencing to identify consistent signatures of polygenic egg-size adaptation. A generalized linear-mixed model revealed reproducible allele frequency differences between replicated experimental populations selected for large and small egg volumes at approximately 4,000 single nucleotide polymorphisms (SNPs). Several hundred distinct genomic regions contain clusters of these SNPs and have lower heterozygosity than the genomic background, consistent with selection acting on polymorphisms in these regions. These SNPs are also enriched among genes expressed in Drosophila ovaries and many of these genes have well-defined functions in Drosophila oogenesis. Additional genes regulating egg development, growth, and cell size show evidence of directional selection as genes regulating these biological processes are enriched for highly differentiated SNPs. Genetic crosses performed with a subset of candidate genes demonstrated that these genes influence egg size, at least in the large genetic background. These findings confirm the highly polygenic architecture of this adaptive trait, and suggest the involvement of many novel candidate genes in regulating egg size. PMID:26044351

  4. Preparation and screening of an arrayed human genomic library generated with the P1 cloning system.

    PubMed Central

    Shepherd, N S; Pfrogner, B D; Coulby, J N; Ackerman, S L; Vaidyanathan, G; Sauer, R H; Balkenhol, T C; Sternberg, N

    1994-01-01

    We describe here the construction and initial characterization of a 3-fold coverage genomic library of the human haploid genome that was prepared using the bacteriophage P1 cloning system. The cloned DNA inserts were produced by size fractionation of a Sau3AI partial digest of high molecular weight genomic DNA isolated from primary cells of human foreskin fibroblasts. The inserts were cloned into the pAd10sacBII vector and packaged in vitro into P1 phage. These were used to generate recombinant bacterial clones, each of which was picked robotically from an agar plate into a well of a 96-well microtiter dish, grown overnight, and stored at -70 degrees C. The resulting library, designated DMPC-HFF#1 series A, consists of approximately 130,000-140,000 recombinant clones that were stored in 1500 microtiter dishes. To screen the library, clones were combined in a pooling strategy and specific loci were identified by PCR analysis. On average, the library contains two or three different clones for each locus screened. To date we have identified a total of 17 clones containing the hypoxanthine-guanine phosphoribosyltransferase, human serum albumin-human alpha-fetoprotein, p53, cyclooxygenase I, human apurinic endonuclease, beta-polymerase, and DNA ligase I genes. The cloned inserts average 80 kb in size and range from 70 to 95 kb, with one 49-kb insert and one 62-kb insert. Images PMID:8146166

  5. Brewing characteristics of haploid strains isolated from sake yeast Kyokai No. 7.

    PubMed

    Katou, Taku; Kitagaki, Hiroshi; Akao, Takeshi; Shimoi, Hitoshi

    2008-11-01

    Sake yeast exhibit various characteristics that make them more suitable for sake brewing compared to other yeast strains. Since sake yeast strains are Saccharomyces cerevisiae heterothallic diploid strains, it is likely that they have heterozygous alleles on homologous chromosomes (heterozygosity) due to spontaneous mutations. If this is the case, segregation of phenotypic traits in haploid strains after sporulation and concomitant meiosis of sake yeast strains would be expected to occur. To examine this hypothesis, we isolated 100 haploid strains from Kyokai No. 7 (K7), a typical sake yeast strain in Japan, and compared their brewing characteristics in small-scale sake-brewing tests. Analyses of the resultant sake samples showed a smooth and continuous distribution of analytical values for brewing characteristics, suggesting that K7 has multiple heterozygosities that affect brewing characteristics and that these heterozygous alleles do segregate after sporulation. Correlation and principal component analyses suggested that the analytical parameters could be classified into two groups, indicating fermentation ability and sake flavour. (c) 2008 John Wiley & Sons, Ltd.

  6. The number of genes encoding repeat domain-containing proteins positively correlates with genome size in amoebal giant viruses

    PubMed Central

    Shukla, Avi; Chatterjee, Anirvan

    2018-01-01

    Abstract Curiously, in viruses, the virion volume appears to be predominantly driven by genome length rather than the number of proteins it encodes or geometric constraints. With their large genome and giant particle size, amoebal viruses (AVs) are ideally suited to study the relationship between genome and virion size and explore the role of genome plasticity in their evolutionary success. Different genomic regions of AVs exhibit distinct genealogies. Although the vertically transferred core genes and their functions are universally conserved across the nucleocytoplasmic large DNA virus (NCLDV) families and are essential for their replication, the horizontally acquired genes are variable across families and are lineage-specific. When compared with other giant virus families, we observed a near–linear increase in the number of genes encoding repeat domain-containing proteins (RDCPs) with the increase in the genome size of AVs. From what is known about the functions of RDCPs in bacteria and eukaryotes and their prevalence in the AV genomes, we envisage important roles for RDCPs in the life cycle of AVs, their genome expansion, and plasticity. This observation also supports the evolution of AVs from a smaller viral ancestor by the acquisition of diverse gene families from the environment including RDCPs that might have helped in host adaption. PMID:29308275

  7. Transfer RNA gene-targeted integration: an adaptation of retrotransposable elements to survive in the compact Dictyostelium discoideum genome.

    PubMed

    Winckler, T; Szafranski, K; Glöckner, G

    2005-01-01

    Almost every organism carries along a multitude of molecular parasites known as transposable elements (TEs). TEs influence their host genomes in many ways by expanding genome size and complexity, rearranging genomic DNA, mutagenizing host genes, and altering transcription levels of nearby genes. The eukaryotic microorganism Dictyostelium discoideum is attractive for the study of fundamental biological phenomena such as intercellular communication, formation of multicellularity, cell differentiation, and morphogenesis. D. discoideum has a highly compacted, haploid genome with less than 1 kb of genomic DNA separating coding regions. Nevertheless, the D. discoideum genome is loaded with 10% of TEs that managed to settle and survive in this inhospitable environment. In depth analysis of D. discoideum genome project data has provided intriguing insights into the evolutionary challenges that mobile elements face when they invade compact genomes. Two different mechanisms are used by D. discoideum TEs to avoid disruption of host genes upon retrotransposition. Several TEs have invented the specific targeting of tRNA gene-flanking regions as a means to avoid integration into coding regions. These elements have been dispersed on all chromosomes, closely following the distribution of tRNA genes. By contrast, TEs that lack bona fide integration specificities show a strong bias to nested integration, thus forming large TE clusters at certain chromosomal loci that are hardly resolved by bioinformatics approaches. We summarize our current view of D. discoideum TEs and present new data from the analysis of the complete sequences of D. discoideum chromosomes 1 and 2, which comprise more than one third of the total genome.

  8. Segregation distortion causes large-scale differences between male and female genomes in hybrid ants.

    PubMed

    Kulmuni, Jonna; Seifert, Bernhard; Pamilo, Pekka

    2010-04-20

    Hybridization in isolated populations can lead either to hybrid breakdown and extinction or in some cases to speciation. The basis of hybrid breakdown lies in genetic incompatibilities between diverged genomes. In social Hymenoptera, the consequences of hybridization can differ from those in other animals because of haplodiploidy and sociality. Selection pressures differ between sexes because males are haploid and females are diploid. Furthermore, sociality and group living may allow survival of hybrid genotypes. We show that hybridization in Formica ants has resulted in a stable situation in which the males form two highly divergent gene pools whereas all the females are hybrids. This causes an exceptional situation with large-scale differences between male and female genomes. The genotype differences indicate strong transmission ratio distortion depending on offspring sex, whereby the mother transmits some alleles exclusively to her daughters and other alleles exclusively to her sons. The genetic differences between the sexes and the apparent lack of multilocus hybrid genotypes in males can be explained by recessive incompatibilities which cause the elimination of hybrid males because of their haploid genome. Alternatively, differentiation between sexes could be created by prezygotic segregation into male-forming and female-forming gametes in diploid females. Differentiation between sexes is stable and maintained throughout generations. The present study shows a unique outcome of hybridization and demonstrates that hybridization has the potential of generating evolutionary novelties in animals.

  9. The Yeast Deletion Collection: A Decade of Functional Genomics

    PubMed Central

    Giaever, Guri; Nislow, Corey

    2014-01-01

    The yeast deletion collections comprise >21,000 mutant strains that carry precise start-to-stop deletions of ∼6000 open reading frames. This collection includes heterozygous and homozygous diploids, and haploids of both MATa and MATα mating types. The yeast deletion collection, or yeast knockout (YKO) set, represents the first and only complete, systematically constructed deletion collection available for any organism. Conceived during the Saccharomyces cerevisiae sequencing project, work on the project began in 1998 and was completed in 2002. The YKO strains have been used in numerous laboratories in >1000 genome-wide screens. This landmark genome project has inspired development of numerous genome-wide technologies in organisms from yeast to man. Notable spinoff technologies include synthetic genetic array and HIPHOP chemogenomics. In this retrospective, we briefly describe the yeast deletion project and some of its most noteworthy biological contributions and the impact that these collections have had on the yeast research community and on genomics in general. PMID:24939991

  10. Breeding of a xylose-fermenting hybrid strain by mating genetically engineered haploid strains derived from industrial Saccharomyces cerevisiae.

    PubMed

    Inoue, Hiroyuki; Hashimoto, Seitaro; Matsushika, Akinori; Watanabe, Seiya; Sawayama, Shigeki

    2014-12-01

    The industrial Saccharomyces cerevisiae IR-2 is a promising host strain to genetically engineer xylose-utilizing yeasts for ethanol fermentation from lignocellulosic hydrolysates. Two IR-2-based haploid strains were selected based upon the rate of xylulose fermentation, and hybrids were obtained by mating recombinant haploid strains harboring heterogeneous xylose dehydrogenase (XDH) (wild-type NAD(+)-dependent XDH or engineered NADP(+)-dependent XDH, ARSdR), xylose reductase (XR) and xylulose kinase (XK) genes. ARSdR in the hybrids selected for growth rates on yeast extract-peptone-dextrose (YPD) agar and YP-xylose agar plates typically had a higher activity than NAD(+)-dependent XDH. Furthermore, the xylose-fermenting performance of the hybrid strain SE12 with the same level of heterogeneous XDH activity was similar to that of a recombinant strain of IR-2 harboring a single set of genes, XR/ARSdR/XK. These results suggest not only that the recombinant haploid strains retain the appropriate genetic background of IR-2 for ethanol production from xylose but also that ARSdR is preferable for xylose fermentation.

  11. Construction of a nurse shark (Ginglymostoma cirratum) bacterial artificial chromosome (BAC) library and a preliminary genome survey.

    PubMed

    Luo, Meizhong; Kim, Hyeran; Kudrna, Dave; Sisneros, Nicholas B; Lee, So-Jeong; Mueller, Christopher; Collura, Kristi; Zuccolo, Andrea; Buckingham, E Bryan; Grim, Suzanne M; Yanagiya, Kazuyo; Inoko, Hidetoshi; Shiina, Takashi; Flajnik, Martin F; Wing, Rod A; Ohta, Yuko

    2006-05-03

    Sharks are members of the taxonomic class Chondrichthyes, the oldest living jawed vertebrates. Genomic studies of this group, in comparison to representative species in other vertebrate taxa, will allow us to theorize about the fundamental genetic, developmental, and functional characteristics in the common ancestor of all jawed vertebrates. In order to obtain mapping and sequencing data for comparative genomics, we constructed a bacterial artificial chromosome (BAC) library for the nurse shark, Ginglymostoma cirratum. The BAC library consists of 313,344 clones with an average insert size of 144 kb, covering ~4.5 x 1010 bp and thus providing an 11-fold coverage of the haploid genome. BAC end sequence analyses revealed, in addition to LINEs and SINEs commonly found in other animal and plant genomes, two new groups of nurse shark-specific repetitive elements, NSRE1 and NSRE2 that seem to be major components of the nurse shark genome. Screening the library with single-copy or multi-copy gene probes showed 6-28 primary positive clones per probe of which 50-90% were true positives, demonstrating that the BAC library is representative of the different regions of the nurse shark genome. Furthermore, some BAC clones contained multiple genes, making physical mapping feasible. We have constructed a deep-coverage, high-quality, large insert, and publicly available BAC library for a cartilaginous fish. It will be very useful to the scientific community interested in shark genomic structure, comparative genomics, and functional studies. We found two new groups of repetitive elements specific to the nurse shark genome, which may contribute to the architecture and evolution of the nurse shark genome.

  12. Whole-Genome Resequencing of Experimental Populations Reveals Polygenic Basis of Egg-Size Variation in Drosophila melanogaster.

    PubMed

    Jha, Aashish R; Miles, Cecelia M; Lippert, Nodia R; Brown, Christopher D; White, Kevin P; Kreitman, Martin

    2015-10-01

    Complete genome resequencing of populations holds great promise in deconstructing complex polygenic traits to elucidate molecular and developmental mechanisms of adaptation. Egg size is a classic adaptive trait in insects, birds, and other taxa, but its highly polygenic architecture has prevented high-resolution genetic analysis. We used replicated experimental evolution in Drosophila melanogaster and whole-genome sequencing to identify consistent signatures of polygenic egg-size adaptation. A generalized linear-mixed model revealed reproducible allele frequency differences between replicated experimental populations selected for large and small egg volumes at approximately 4,000 single nucleotide polymorphisms (SNPs). Several hundred distinct genomic regions contain clusters of these SNPs and have lower heterozygosity than the genomic background, consistent with selection acting on polymorphisms in these regions. These SNPs are also enriched among genes expressed in Drosophila ovaries and many of these genes have well-defined functions in Drosophila oogenesis. Additional genes regulating egg development, growth, and cell size show evidence of directional selection as genes regulating these biological processes are enriched for highly differentiated SNPs. Genetic crosses performed with a subset of candidate genes demonstrated that these genes influence egg size, at least in the large genetic background. These findings confirm the highly polygenic architecture of this adaptive trait, and suggest the involvement of many novel candidate genes in regulating egg size. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  13. O father where art thou? Paternity analyses in a natural population of the haploid-diploid seaweed Chondrus crispus.

    PubMed

    Krueger-Hadfield, S A; Roze, D; Correa, J A; Destombe, C; Valero, M

    2015-02-01

    The link between life history traits and mating systems in diploid organisms has been extensively addressed in the literature, whereas the degree of selfing and/or inbreeding in natural populations of haploid-diploid organisms, in which haploid gametophytes alternate with diploid sporophytes, has been rarely measured. Dioecy has often been used as a proxy for the mating system in these organisms. Yet, dioecy does not prevent the fusion of gametes from male and female gametophytes originating from the same sporophyte. This is likely a common occurrence when spores from the same parent are dispersed in clumps and recruit together. This pattern of clumped spore dispersal has been hypothesized to explain significant heterozygote deficiency in the dioecious haploid-diploid seaweed Chondrus crispus. Fronds and cystocarps (structures in which zygotes are mitotically amplified) were sampled in two 25 m(2) plots located within a high and a low intertidal zone and genotyped at 5 polymorphic microsatellite loci in order to explore the mating system directly using paternity analyses. Multiple males sired cystocarps on each female, but only one of the 423 paternal genotypes corresponded to a field-sampled gametophyte. Nevertheless, larger kinship coefficients were detected between males siring cystocarps on the same female in comparison with males in the entire population, confirming restricted spermatial and clumped spore dispersal. Such dispersal mechanisms may be a mode of reproductive assurance due to nonmotile gametes associated with putatively reduced effects of inbreeding depression because of the free-living haploid stage in C. crispus.

  14. Insights into the Dekkera bruxellensis Genomic Landscape: Comparative Genomics Reveals Variations in Ploidy and Nutrient Utilisation Potential amongst Wine Isolates

    PubMed Central

    Borneman, Anthony R.; Zeppel, Ryan; Chambers, Paul J.; Curtin, Chris D.

    2014-01-01

    The yeast Dekkera bruxellensis is a major contaminant of industrial fermentations, such as those used for the production of biofuel and wine, where it outlasts and, under some conditions, outcompetes the major industrial yeast Saccharomyces cerevisiae. In order to investigate the level of inter-strain variation that is present within this economically important species, the genomes of four diverse D. bruxellensis isolates were compared. While each of the four strains was shown to contain a core diploid genome, which is clearly sufficient for survival, two of the four isolates have a third haploid complement of chromosomes. The sequences of these additional haploid genomes were both highly divergent from those comprising the diploid core and divergent between the two triploid strains. Similar to examples in the Saccharomyces spp. clade, where some allotriploids have arisen on the basis of enhanced ability to survive a range of environmental conditions, it is likely these strains are products of two independent hybridisation events that may have involved multiple species or distinct sub-species of Dekkera. Interestingly these triploid strains represent the vast majority (92%) of isolates from across the Australian wine industry, suggesting that the additional set of chromosomes may confer a selective advantage in winery environments that has resulted in these hybrid strains all-but replacing their diploid counterparts in Australian winery settings. In addition to the apparent inter-specific hybridisation events, chromosomal aberrations such as strain-specific insertions and deletions and loss-of-heterozygosity by gene conversion were also commonplace. While these events are likely to have affected many phenotypes across these strains, we have been able to link a specific deletion to the inability to utilise nitrate by some strains of D. bruxellensis, a phenotype that may have direct impacts in the ability for these strains to compete with S. cerevisiae. PMID:24550744

  15. Phenotypic diversification by enhanced genome restructuring after induction of multiple DNA double-strand breaks.

    PubMed

    Muramoto, Nobuhiko; Oda, Arisa; Tanaka, Hidenori; Nakamura, Takahiro; Kugou, Kazuto; Suda, Kazuki; Kobayashi, Aki; Yoneda, Shiori; Ikeuchi, Akinori; Sugimoto, Hiroki; Kondo, Satoshi; Ohto, Chikara; Shibata, Takehiko; Mitsukawa, Norihiro; Ohta, Kunihiro

    2018-05-18

    DNA double-strand break (DSB)-mediated genome rearrangements are assumed to provide diverse raw genetic materials enabling accelerated adaptive evolution; however, it remains unclear about the consequences of massive simultaneous DSB formation in cells and their resulting phenotypic impact. Here, we establish an artificial genome-restructuring technology by conditionally introducing multiple genomic DSBs in vivo using a temperature-dependent endonuclease TaqI. Application in yeast and Arabidopsis thaliana generates strains with phenotypes, including improved ethanol production from xylose at higher temperature and increased plant biomass, that are stably inherited to offspring after multiple passages. High-throughput genome resequencing revealed that these strains harbor diverse rearrangements, including copy number variations, translocations in retrotransposons, and direct end-joinings at TaqI-cleavage sites. Furthermore, large-scale rearrangements occur frequently in diploid yeasts (28.1%) and tetraploid plants (46.3%), whereas haploid yeasts and diploid plants undergo minimal rearrangement. This genome-restructuring system (TAQing system) will enable rapid genome breeding and aid genome-evolution studies.

  16. Improved genome recovery and integrated cell-size analyses of individual uncultured microbial cells and viral particles.

    PubMed

    Stepanauskas, Ramunas; Fergusson, Elizabeth A; Brown, Joseph; Poulton, Nicole J; Tupper, Ben; Labonté, Jessica M; Becraft, Eric D; Brown, Julia M; Pachiadaki, Maria G; Povilaitis, Tadas; Thompson, Brian P; Mascena, Corianna J; Bellows, Wendy K; Lubys, Arvydas

    2017-07-20

    Microbial single-cell genomics can be used to provide insights into the metabolic potential, interactions, and evolution of uncultured microorganisms. Here we present WGA-X, a method based on multiple displacement amplification of DNA that utilizes a thermostable mutant of the phi29 polymerase. WGA-X enhances genome recovery from individual microbial cells and viral particles while maintaining ease of use and scalability. The greatest improvements are observed when amplifying high G+C content templates, such as those belonging to the predominant bacteria in agricultural soils. By integrating WGA-X with calibrated index-cell sorting and high-throughput genomic sequencing, we are able to analyze genomic sequences and cell sizes of hundreds of individual, uncultured bacteria, archaea, protists, and viral particles, obtained directly from marine and soil samples, in a single experiment. This approach may find diverse applications in microbiology and in biomedical and forensic studies of humans and other multicellular organisms.Single-cell genomics can be used to study uncultured microorganisms. Here, Stepanauskas et al. present a method combining improved multiple displacement amplification and FACS, to obtain genomic sequences and cell size information from uncultivated microbial cells and viral particles in environmental samples.

  17. One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies

    PubMed Central

    Yuan, Shuai; Johnston, H. Richard; Zhang, Guosheng; Li, Yun; Hu, Yi-Juan; Qin, Zhaohui S.

    2015-01-01

    With rapid decline of the sequencing cost, researchers today rush to embrace whole genome sequencing (WGS), or whole exome sequencing (WES) approach as the next powerful tool for relating genetic variants to human diseases and phenotypes. A fundamental step in analyzing WGS and WES data is mapping short sequencing reads back to the reference genome. This is an important issue because incorrectly mapped reads affect the downstream variant discovery, genotype calling and association analysis. Although many read mapping algorithms have been developed, the majority of them uses the universal reference genome and do not take sequence variants into consideration. Given that genetic variants are ubiquitous, it is highly desirable if they can be factored into the read mapping procedure. In this work, we developed a novel strategy that utilizes genotypes obtained a priori to customize the universal haploid reference genome into a personalized diploid reference genome. The new strategy is implemented in a program named RefEditor. When applying RefEditor to real data, we achieved encouraging improvements in read mapping, variant discovery and genotype calling. Compared to standard approaches, RefEditor can significantly increase genotype calling consistency (from 43% to 61% at 4X coverage; from 82% to 92% at 20X coverage) and reduce Mendelian inconsistency across various sequencing depths. Because many WGS and WES studies are conducted on cohorts that have been genotyped using array-based genotyping platforms previously or concurrently, we believe the proposed strategy will be of high value in practice, which can also be applied to the scenario where multiple NGS experiments are conducted on the same cohort. The RefEditor sources are available at https://github.com/superyuan/refeditor. PMID:26267278

  18. Production of haploid plantlets in anther cultures of Albizzia lebbeck L.

    PubMed

    Gharyal, P K; Rashid, A; Maheshwari, S C

    1983-12-01

    Anthers of Albizzia lebbeck on B5 medium (BM) supplemented with kinetin (2 mg/l) and 2, 4-D (0.5 mg/l) showed callus initiation from microspores. Differentiation of embryoids and shoots was obtained on BM + BAP (1 mg/l) + IAA (0.5 mg/l) and of roots on BM. Root tip squashes of the regenerated plantlets showed the haploid chromosome number (n=13), confirming the microspore origin of the regenerants.

  19. The dynamics of genome replication using deep sequencing

    PubMed Central

    Müller, Carolin A.; Hawkins, Michelle; Retkute, Renata; Malla, Sunir; Wilson, Ray; Blythe, Martin J.; Nakato, Ryuichiro; Komata, Makiko; Shirahige, Katsuhiko; de Moura, Alessandro P.S.; Nieduszynski, Conrad A.

    2014-01-01

    Eukaryotic genomes are replicated from multiple DNA replication origins. We present complementary deep sequencing approaches to measure origin location and activity in Saccharomyces cerevisiae. Measuring the increase in DNA copy number during a synchronous S-phase allowed the precise determination of genome replication. To map origin locations, replication forks were stalled close to their initiation sites; therefore, copy number enrichment was limited to origins. Replication timing profiles were generated from asynchronous cultures using fluorescence-activated cell sorting. Applying this technique we show that the replication profiles of haploid and diploid cells are indistinguishable, indicating that both cell types use the same cohort of origins with the same activities. Finally, increasing sequencing depth allowed the direct measure of replication dynamics from an exponentially growing culture. This is the first time this approach, called marker frequency analysis, has been successfully applied to a eukaryote. These data provide a high-resolution resource and methodological framework for studying genome biology. PMID:24089142

  20. Cryptic Fitness Advantage: Diploids Invade Haploid Populations Despite Lacking Any Apparent Advantage as Measured by Standard Fitness Assays

    PubMed Central

    Gerstein, Aleeza C.; Otto, Sarah P.

    2011-01-01

    Ploidy varies tremendously within and between species, yet the factors that influence when or why ploidy variants are adaptive remains poorly understood. Our previous work found that diploid individuals repeatedly arose within ten replicate haploid populations of Saccharomyces cerevisiae, and in each case we witnessed diploid takeover within 1800 asexual generations of batch culture evolution in the lab. The character that allowed diploids to rise in frequency within haploid populations remains unknown. Here we present a number of experiments conducted with the goal to determine what this trait (or traits) might have been. Experiments were conducted both by sampling a small number of colonies from the stocks frozen every two weeks (93 generations) during the original experiment, as well through sampling a larger number of colonies at the two time points where polymorphism for ploidy was most prevalent. Surprisingly, none of our fitness component measures (lag phase, growth rate, biomass production) indicated an advantage to diploidy. Similarly, competition assays against a common competitor and direct competition between haploid and diploid colonies isolated from the same time point failed to indicate a diploid advantage. Furthermore, we uncovered a tremendous amount of trait variation among colonies of the same ploidy level. Only late-appearing diploids showed a competitive advantage over haploids, indicating that the fitness advantage that allowed eventual takeover was not diploidy per se but an attribute of a subset of diploid lineages. Nevertheless, the initial rise in diploids to intermediate frequency cannot be explained by any of the fitness measures used; we suggest that the resolution to this mystery is negative frequency-dependent selection, which is ignored in the standard fitness measures used. PMID:22174734

  1. Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer's Disease.

    PubMed

    Jin, Sheng Chih; Benitez, Bruno A; Deming, Yuetiva; Cruchaga, Carlos

    2016-01-01

    Analyses of genome-wide association studies (GWAS) for complex disorders usually identify common variants with a relatively small effect size that only explain a small proportion of phenotypic heritability. Several studies have suggested that a significant fraction of heritability may be explained by low-frequency (minor allele frequency (MAF) of 1-5 %) and rare-variants that are not contained in the commercial GWAS genotyping arrays (Schork et al., Curr Opin Genet Dev 19:212, 2009). Rare variants can also have relatively large effects on risk for developing human diseases or disease phenotype (Cruchaga et al., PLoS One 7:e31039, 2012). However, it is necessary to perform next-generation sequencing (NGS) studies in a large population (>4,000 samples) to detect a significant rare-variant association. Several NGS methods, such as custom capture sequencing and amplicon-based sequencing, are designed to screen a small proportion of the genome, but most of these methods are limited in the number of samples that can be multiplexed (i.e. most sequencing kits only provide 96 distinct index). Additionally, the sequencing library preparation for 4,000 samples remains expensive and thus conducting NGS studies with the aforementioned methods are not feasible for most research laboratories.The need for low-cost large scale rare-variant detection makes pooled-DNA sequencing an ideally efficient and cost-effective technique to identify rare variants in target regions by sequencing hundreds to thousands of samples. Our recent work has demonstrated that pooled-DNA sequencing can accurately detect rare variants in targeted regions in multiple DNA samples with high sensitivity and specificity (Jin et al., Alzheimers Res Ther 4:34, 2012). In these studies we used a well-established pooled-DNA sequencing approach and a computational package, SPLINTER (short indel prediction by large deviation inference and nonlinear true frequency estimation by recursion) (Vallania et al., Genome Res

  2. Inferring Population Size History from Large Samples of Genome-Wide Molecular Data - An Approximate Bayesian Computation Approach

    PubMed Central

    Boitard, Simon; Rodríguez, Willy; Jay, Flora; Mona, Stefano; Austerlitz, Frédéric

    2016-01-01

    Inferring the ancestral dynamics of effective population size is a long-standing question in population genetics, which can now be tackled much more accurately thanks to the massive genomic data available in many species. Several promising methods that take advantage of whole-genome sequences have been recently developed in this context. However, they can only be applied to rather small samples, which limits their ability to estimate recent population size history. Besides, they can be very sensitive to sequencing or phasing errors. Here we introduce a new approximate Bayesian computation approach named PopSizeABC that allows estimating the evolution of the effective population size through time, using a large sample of complete genomes. This sample is summarized using the folded allele frequency spectrum and the average zygotic linkage disequilibrium at different bins of physical distance, two classes of statistics that are widely used in population genetics and can be easily computed from unphased and unpolarized SNP data. Our approach provides accurate estimations of past population sizes, from the very first generations before present back to the expected time to the most recent common ancestor of the sample, as shown by simulations under a wide range of demographic scenarios. When applied to samples of 15 or 25 complete genomes in four cattle breeds (Angus, Fleckvieh, Holstein and Jersey), PopSizeABC revealed a series of population declines, related to historical events such as domestication or modern breed creation. We further highlight that our approach is robust to sequencing errors, provided summary statistics are computed from SNPs with common alleles. PMID:26943927

  3. Highly rearranged and size-variable chloroplast genomes in conifers II clade (cupressophytes): evolution towards shorter intergenic spacers.

    PubMed

    Wu, Chung-Shien; Chaw, Shu-Miaw

    2014-04-01

    Although conifers are of immense ecological and economic value, bioengineering of their chloroplasts remains undeveloped. Understanding the chloroplast genomic organization of conifers can facilitate their bioengineering. Members of the conifer II clade (or cupressophytes) are highly diverse in both morphologic features and chloroplast genomic organization. We compared six cupressophyte chloroplast genomes (cpDNAs) that represent four of the five cupressophyte families, including three genomes that are first reported here (Agathis dammara, Calocedrus formosana and Nageia nagi). The six cupressophyte cpDNAs have lost a pair of large inverted repeats (IRs) and vary greatly in size, organization and tRNA copies. We demonstrate that cupressophyte cpDNAs have evolved towards reduced size, largely due to shrunken intergenic spacers. In cupressophytes, cpDNA rearrangements are capable of extending intergenic spacers, and synonymous mutations are negatively associated with the size and frequency of rearrangements. The variable cpDNA sizes of cupressophytes may have been shaped by mutational burden and genomic rearrangements. On the basis of cpDNA organization, our analyses revealed that in gymnosperms, cpDNA rearrangements are phylogenetically informative, which supports the 'gnepines' clade. In addition, removal of a specific IR influences the minimal rearrangements required for the gnepines and cupressophyte clades, whereby Pinaceae favours the removal of IRB but cupressophytes exclusion of IRA. This result strongly suggests that different IR copies have been lost from conifers I and II. Our data help understand the complexity and evolution of cupressophyte cpDNAs. © 2013 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology, The Association of Applied Biologists and John Wiley & Sons Ltd.

  4. Anthocyanin inhibits propidium iodide DNA fluorescence in Euphorbia pulcherrima: implications for genome size variation and flow cytometry.

    PubMed

    Bennett, Michael D; Price, H James; Johnston, J Spencer

    2008-04-01

    Measuring genome size by flow cytometry assumes direct proportionality between nuclear DNA staining and DNA amount. By 1997 it was recognized that secondary metabolites may affect DNA staining, thereby causing inaccuracy. Here experiments are reported with poinsettia (Euphorbia pulcherrima) with green leaves and red bracts rich in phenolics. DNA content was estimated as fluorescence of propidium iodide (PI)-stained nuclei of poinsettia and/or pea (Pisum sativum) using flow cytometry. Tissue was chopped, or two tissues co-chopped, in Galbraith buffer alone or with six concentrations of cyanidin-3-rutinoside (a cyanidin-3-rhamnoglucoside contributing to red coloration in poinsettia). There were large differences in PI staining (35-70 %) between 2C nuclei from green leaf and red bract tissue in poinsettia. These largely disappeared when pea leaflets were co-chopped with poinsettia tissue as an internal standard. However, smaller (2.8-6.9 %) differences remained, and red bracts gave significantly lower 1C genome size estimates (1.69-1.76 pg) than green leaves (1.81 pg). Chopping pea or poinsettia tissue in buffer with 0-200 microm cyanidin-3-rutinoside showed that the effects of natural inhibitors in red bracts of poinsettia on PI staining were largely reproduced in a dose-dependent way by this anthocyanin. Given their near-ubiquitous distribution, many suspected roles and known affects on DNA staining, anthocyanins are a potent, potential cause of significant error variation in genome size estimations for many plant tissues and taxa. This has important implications of wide practical and theoretical significance. When choosing genome size calibration standards it seems prudent to select materials producing little or no anthocyanin. Reviewing the literature identifies clear examples in which claims of intraspecific variation in genome size are probably artefacts caused by natural variation in anthocyanin levels or correlated with environmental factors known to induce

  5. Simulation and estimation of gene number in a biological pathway using almost complete saturation mutagenesis screening of haploid mouse cells.

    PubMed

    Tokunaga, Masahiro; Kokubu, Chikara; Maeda, Yusuke; Sese, Jun; Horie, Kyoji; Sugimoto, Nakaba; Kinoshita, Taroh; Yusa, Kosuke; Takeda, Junji

    2014-11-24

    Genome-wide saturation mutagenesis and subsequent phenotype-driven screening has been central to a comprehensive understanding of complex biological processes in classical model organisms such as flies, nematodes, and plants. The degree of "saturation" (i.e., the fraction of possible target genes identified) has been shown to be a critical parameter in determining all relevant genes involved in a biological function, without prior knowledge of their products. In mammalian model systems, however, the relatively large scale and labor intensity of experiments have hampered the achievement of actual saturation mutagenesis, especially for recessive traits that require biallelic mutations to manifest detectable phenotypes. By exploiting the recently established haploid mouse embryonic stem cells (ESCs), we present an implementation of almost complete saturation mutagenesis in a mammalian system. The haploid ESCs were mutagenized with the chemical mutagen N-ethyl-N-nitrosourea (ENU) and processed for the screening of mutants defective in various steps of the glycosylphosphatidylinositol-anchor biosynthetic pathway. The resulting 114 independent mutant clones were characterized by a functional complementation assay, and were shown to be defective in any of 20 genes among all 22 known genes essential for this well-characterized pathway. Ten mutants were further validated by whole-exome sequencing. The predominant generation of single-nucleotide substitutions by ENU resulted in a gene mutation rate proportional to the length of the coding sequence, which facilitated the experimental design of saturation mutagenesis screening with the aid of computational simulation. Our study enables mammalian saturation mutagenesis to become a realistic proposition. Computational simulation, combined with a pilot mutagenesis experiment, could serve as a tool for the estimation of the number of genes essential for biological processes such as drug target pathways when a positive selection of

  6. Genetic mapping of centromeres in the nine Citrus clementina chromosomes using half-tetrad analysis and recombination patterns in unreduced and haploid gametes.

    PubMed

    Aleza, Pablo; Cuenca, José; Hernández, María; Juárez, José; Navarro, Luis; Ollitrault, Patrick

    2015-03-08

    Mapping centromere locations in plant species provides essential information for the analysis of genetic structures and population dynamics. The centromere's position affects the distribution of crossovers along a chromosome and the parental heterozygosity restitution by 2n gametes is a direct function of the genetic distance to the centromere. Sexual polyploidisation is relatively frequent in Citrus species and is widely used to develop new seedless triploid cultivars. The study's objectives were to (i) map the positions of the centromeres of the nine Citrus clementina chromosomes; (ii) analyse the crossover interference in unreduced gametes; and (iii) establish the pattern of genetic recombination in haploid clementine gametes along each chromosome and its relationship with the centromere location and distribution of genic sequences. Triploid progenies were derived from unreduced megagametophytes produced by second-division restitution. Centromere positions were mapped genetically for all linkage groups using half-tetrad analysis. Inference of the physical locations of centromeres revealed one acrocentric, four metacentric and four submetacentric chromosomes. Crossover interference was observed in unreduced gametes, with variation seen between chromosome arms. For haploid gametes, a strong decrease in the recombination rate occurred in centromeric and pericentromeric regions, which contained a low density of genic sequences. In chromosomes VIII and IX, these low recombination rates extended beyond the pericentromeric regions. The genomic region corresponding to a genetic distance < 5cM from a centromere represented 47% of the genome and 23% of the genic sequences. The centromere positions of the nine citrus chromosomes were genetically mapped. Their physical locations, inferred from the genetic ones, were consistent with the sequence constitution and recombination pattern along each chromosome. However, regions with low recombination rates extended beyond the

  7. Genome size variation in wild and cultivated maize along altitudinal gradients

    PubMed Central

    Díez, Concepción M.; Gaut, Brandon S.; Meca, Esteban; Scheinvar, Enrique; Montes-Hernandez, Salvador; Eguiarte, Luis E.; Tenaillon, Maud I.

    2014-01-01

    Summary • It is still an open question as to whether genome size (GS) variation is shaped by natural selection. One approach to address this question is a population-level survey that assesses both the variation in GS and the relationship of GS to ecological variants. • We assessed GS in Zea mays, a species that includes the cultivated crop, maize, and its closest wild relatives, the teosintes. We measured GS in five plants of each of 22 maize landraces and 21 teosinte populations from Mexico sampled from parallel altitudinal gradients. • GS was significantly smaller in landraces than in teosintes, but the largest component of GS variation was among landraces and among populations. In maize, GS correlated negatively with altitude; more generally, the best GS predictors were linked to geography. By contrast, GS variation in teosintes was best explained by temperature and precipitation. • Overall, our results further document the size flexibility of the Zea genome, but also point to a drastic shift in patterns of GS variation since domestication. We argue that such patterns may reflect the indirect action of selection on GS, through a multiplicity of phenotypes and life-history traits. PMID:23550586

  8. Draft Sequencing of the Heterozygous Diploid Genome of Satsuma (Citrus unshiu Marc.) Using a Hybrid Assembly Approach

    PubMed Central

    Shimizu, Tokurou; Tanizawa, Yasuhiro; Mochizuki, Takako; Nagasaki, Hideki; Yoshioka, Terutaka; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu

    2017-01-01

    Satsuma (Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma (“Miyagawa Wase”) was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummel, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposon were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offsprings of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome. PMID:29259619

  9. Draft Sequencing of the Heterozygous Diploid Genome of Satsuma (Citrus unshiu Marc.) Using a Hybrid Assembly Approach.

    PubMed

    Shimizu, Tokurou; Tanizawa, Yasuhiro; Mochizuki, Takako; Nagasaki, Hideki; Yoshioka, Terutaka; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu

    2017-01-01

    Satsuma ( Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma ("Miyagawa Wase") was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N 50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummel, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposon were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offsprings of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome.

  10. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates

    PubMed Central

    Yuan, Bo; Liu, Pengfei; Gupta, Aditya; Beck, Christine R.; Tejomurtula, Anusha; Campbell, Ian M.; Gambin, Tomasz; Simmons, Alexandra D.; Withers, Marjorie A.; Harris, R. Alan; Rogers, Jeffrey; Schwartz, David C.; Lupski, James R.

    2015-01-01

    Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. PMID:26641089

  11. Estimation of the genome sizes of the chigger mites Leptotrombidium pallidum and Leptotrombidium scutellare based on quantitative PCR and k-mer analysis

    PubMed Central

    2014-01-01

    Background Leptotrombidium pallidum and Leptotrombidium scutellare are the major vector mites for Orientia tsutsugamushi, the causative agent of scrub typhus. Before these organisms can be subjected to whole-genome sequencing, it is necessary to estimate their genome sizes to obtain basic information for establishing the strategies that should be used for genome sequencing and assembly. Method The genome sizes of L. pallidum and L. scutellare were estimated by a method based on quantitative real-time PCR. In addition, a k-mer analysis of the whole-genome sequences obtained through Illumina sequencing was conducted to verify the mutual compatibility and reliability of the results. Results The genome sizes estimated using qPCR were 191 ± 7 Mb for L. pallidum and 262 ± 13 Mb for L. scutellare. The k-mer analysis-based genome lengths were estimated to be 175 Mb for L. pallidum and 286 Mb for L. scutellare. The estimates from these two independent methods were mutually complementary and within a similar range to those of other Acariform mites. Conclusions The estimation method based on qPCR appears to be a useful alternative when the standard methods, such as flow cytometry, are impractical. The relatively small estimated genome sizes should facilitate whole-genome analysis, which could contribute to our understanding of Arachnida genome evolution and provide key information for scrub typhus prevention and mite vector competence. PMID:24947244

  12. Molecular phylogeny and genome size evolution of the genus Betula (Betulaceae)

    PubMed Central

    Wang, Nian; McAllister, Hugh A.; Bartlett, Paul R.; Buggs, Richard J. A.

    2016-01-01

    Background and Aims Betula L. (birch) is a genus of approx. 60 species, subspecies or varieties with a wide distribution in the northern hemisphere, of ecological and economic importance. A new classification of Betula has recently been proposed based on morphological characters. This classification differs somewhat from previously published molecular phylogenies, which may be due to factors such as convergent evolution, hybridization, incomplete taxon sampling or misidentification of samples. While chromosome counts have been made for many species, few have had their genome size measured. The aim of this study is to produce a new phylogenetic and genome size analysis of the genus. Methods Internal transcribed spacer (ITS) regions of nuclear ribosomal DNA were sequenced for 76 Betula samples verified by taxonomic experts, representing approx. 60 taxa, of which approx. 24 taxa have not been included in previous phylogenetic analyses. A further 49 samples from other collections were also sequenced, and 108 ITS sequences were downloaded from GenBank. Phylogenetic trees were built for these sequences. The genome sizes of 103 accessions representing nearly all described species were estimated using flow cytometry. Key Results As expected for a gene tree of a genus where hybridization and allopolyploidy occur, the ITS tree shows clustering, but not resolved monophyly, for the morphological subgenera recently proposed. Most sections show some clustering, but species of the dwarf section Apterocaryon are unusually scattered. Betula corylifolia (subgenus Nipponobetula) unexpectedly clusters with species of subgenus Aspera. Unexpected placements are also found for B. maximowicziana, B. bomiensis, B. nigra and B. grossa. Biogeographical disjunctions were found within Betula between Europe and North America, and also disjunctions between North-east and South-west Asia. The 2C-values for Betula ranged from 0·88 to 5·33 pg, and polyploids are scattered widely throughout the

  13. Variation, Evolution, and Correlation Analysis of C+G Content and Genome or Chromosome Size in Different Kingdoms and Phyla

    PubMed Central

    Li, Xiu-Qing; Du, Donglei

    2014-01-01

    C+G content (GC content or G+C content) is known to be correlated with genome/chromosome size in bacteria but the relationship for other kingdoms remains unclear. This study analyzed genome size, chromosome size, and base composition in most of the available sequenced genomes in various kingdoms. Genome size tends to increase during evolution in plants and animals, and the same is likely true for bacteria. The genomic C+G contents were found to vary greatly in microorganisms but were quite similar within each animal or plant subkingdom. In animals and plants, the C+G contents are ranked as follows: monocot plants>mammals>non-mammalian animals>dicot plants. The variation in C+G content between chromosomes within species is greater in animals than in plants. The correlation between average chromosome C+G content and chromosome length was found to be positive in Proteobacteria, Actinobacteria (but not in other analyzed bacterial phyla), Ascomycota fungi, and likely also in some plants; negative in some animals, insignificant in two protist phyla, and likely very weak in Archaea. Clearly, correlations between C+G content and chromosome size can be positive, negative, or not significant depending on the kingdoms/groups or species. Different phyla or species exhibit different patterns of correlation between chromosome-size and C+G content. Most chromosomes within a species have a similar pattern of variation in C+G content but outliers are common. The data presented in this study suggest that the C+G content is under genetic control by both trans- and cis- factors and that the correlation between C+G content and chromosome length can be positive, negative, or not significant in different phyla. PMID:24551092

  14. Genome-Wide Mutation Avalanches Induced in Diploid Yeast Cells by a Base Analog or an APOBEC Deaminase

    PubMed Central

    Lada, Artem G.; Stepchenkova, Elena I.; Waisertreiger, Irina S. R.; Noskov, Vladimir N.; Dhar, Alok; Eudy, James D.; Boissy, Robert J.; Hirano, Masayuki; Rogozin, Igor B.; Pavlov, Youri I.

    2013-01-01

    Genetic information should be accurately transmitted from cell to cell; conversely, the adaptation in evolution and disease is fueled by mutations. In the case of cancer development, multiple genetic changes happen in somatic diploid cells. Most classic studies of the molecular mechanisms of mutagenesis have been performed in haploids. We demonstrate that the parameters of the mutation process are different in diploid cell populations. The genomes of drug-resistant mutants induced in yeast diploids by base analog 6-hydroxylaminopurine (HAP) or AID/APOBEC cytosine deaminase PmCDA1 from lamprey carried a stunning load of thousands of unselected mutations. Haploid mutants contained almost an order of magnitude fewer mutations. To explain this, we propose that the distribution of induced mutation rates in the cell population is uneven. The mutants in diploids with coincidental mutations in the two copies of the reporter gene arise from a fraction of cells that are transiently hypersensitive to the mutagenic action of a given mutagen. The progeny of such cells were never recovered in haploids due to the lethality caused by the inactivation of single-copy essential genes in cells with too many induced mutations. In diploid cells, the progeny of hypersensitive cells survived, but their genomes were saturated by heterozygous mutations. The reason for the hypermutability of cells could be transient faults of the mutation prevention pathways, like sanitization of nucleotide pools for HAP or an elevated expression of the PmCDA1 gene or the temporary inability of the destruction of the deaminase. The hypothesis on spikes of mutability may explain the sudden acquisition of multiple mutational changes during evolution and carcinogenesis. PMID:24039593

  15. Meraculous: De Novo Genome Assembly with Short Paired-End Reads

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chapman, Jarrod A.; Ho, Isaac; Sunkara, Sirisha

    2011-08-18

    We describe a new algorithm, meraculous, for whole genome assembly of deep paired-end short reads, and apply it to the assembly of a dataset of paired 75-bp Illumina reads derived from the 15.4 megabase genome of the haploid yeast Pichia stipitis. More than 95% of the genome is recovered, with no errors; half the assembled sequence is in contigs longer than 101 kilobases and in scaffolds longer than 269 kilobases. Incorporating fosmid ends recovers entire chromosomes. Meraculous relies on an efficient and conservative traversal of the subgraph of the k-mer (deBruijn) graph of oligonucleotides with unique high quality extensions inmore » the dataset, avoiding an explicit error correction step as used in other short-read assemblers. A novel memory-efficient hashing scheme is introduced. The resulting contigs are ordered and oriented using paired reads separated by ~280 bp or ~3.2 kbp, and many gaps between contigs can be closed using paired-end placements. Practical issues with the dataset are described, and prospects for assembling larger genomes are discussed.« less

  16. Metabolic engineering of a haploid strain derived from a triploid industrial yeast for producing cellulosic ethanol.

    PubMed

    Kim, Soo Rin; Skerker, Jeffrey M; Kong, In Iok; Kim, Heejin; Maurer, Matthew J; Zhang, Guo-Chang; Peng, Dairong; Wei, Na; Arkin, Adam P; Jin, Yong-Su

    2017-03-01

    Many desired phenotypes for producing cellulosic biofuels are often observed in industrial Saccharomyces cerevisiae strains. However, many industrial yeast strains are polyploid and have low spore viability, making it difficult to use these strains for metabolic engineering applications. We selected the polyploid industrial strain S. cerevisiae ATCC 4124 exhibiting rapid glucose fermentation capability, high ethanol productivity, strong heat and inhibitor tolerance in order to construct an optimal yeast strain for producing cellulosic ethanol. Here, we focused on developing a general approach and high-throughput screening method to isolate stable haploid segregants derived from a polyploid parent, such as triploid ATCC 4124 with a poor spore viability. Specifically, we deleted the HO genes, performed random sporulation, and screened the resulting segregants based on growth rate, mating type, and ploidy. Only one stable haploid derivative (4124-S60) was isolated, while 14 other segregants with a stable mating type were aneuploid. The 4124-S60 strain inherited only a subset of desirable traits present in the parent strain, same as other aneuploids, suggesting that glucose fermentation and specific ethanol productivity are likely to be genetically complex traits and/or they might depend on ploidy. Nonetheless, the 4124-60 strain did inherit the ability to tolerate fermentation inhibitors. When additional genetic perturbations known to improve xylose fermentation were introduced into the 4124-60 strain, the resulting engineered strain (IIK1) was able to ferment a Miscanthus hydrolysate better than a previously engineered laboratory strain (SR8), built by making the same genetic changes. However, the IIK1 strain showed higher glycerol and xylitol yields than the SR8 strain. In order to decrease glycerol and xylitol production, an NADH-dependent acetate reduction pathway was introduced into the IIK1 strain. By consuming 2.4g/L of acetate, the resulting strain (IIK1A

  17. Microgeographic genome size differentiation of the carob tree, Ceratonia siliqua, at 'Evolution Canyon', Israel.

    PubMed

    Bures, Petr; Pavlícek, Tomás; Horová, Lucie; Nevo, Eviatar

    2004-05-01

    We tested whether the local differences in genome size recorded earlier in the wild barley, Hordeum spontaneum, at 'Evolution Canyon', Mount Carmel, Israel, can also be found in other organisms. As a model species for our test we chose the evergreen carob tree, Ceratonia siliqua. Genome size was measured by means of DAPI flow cytometry. In adults, significantly more DNA was recorded in trees growing on the more illuminated, warmer, drier, microclimatically more fluctuating 'African' south-facing slope than in trees on the opposite, less illuminated, cooler and more humid, 'European' north-facing slope in spite of an interslope distance of only 100 m at the canyon bottom and 400 m at the top. The amount of DNA was significantly negatively correlated with leaf length and tree circumference. In seedlings, interslope differences in the amount of genome DNA were not found. In addition, the first cases of triploidy and tetraploidy were found in C. siliqua. The data on C. siliqua at 'Evolution Canyon' showed that local variability in the C-value exists in this species and that ecological stress might be a strong evolutionary driving force in shaping the amount of DNA.

  18. Annotation of differentially expressed genes in the somatic embryogenesis of musa and their location in the banana genome.

    PubMed

    Maldonado-Borges, Josefina Ines; Ku-Cauich, José Roberto; Escobedo-Graciamedrano, Rosa Maria

    2013-01-01

    Analysis of cDNA-AFLP was used to study the genes expressed in zygotic and somatic embryogenesis of Musa acuminata Colla ssp. malaccensis, and a comparison was made between their differential transcribed fragments (TDFs) and the sequenced genome of the double haploid- (DH-) Pahang of the malaccensis subspecies that is available in the network. A total of 253 transcript-derived fragments (TDFs) were detected with apparent size of 100-4000 bp using 5 pairs of AFLP primers, of which 21 were differentially expressed during the different stages of banana embryogenesis; 15 of the sequences have matched DH-Pahang chromosomes, with 7 of them being homologous to gene sequences encoding either known or putative protein domains of higher plants. Four TDF sequences were located in all Musa chromosomes, while the rest were located in one or two chromosomes. Their putative individual function is briefly reviewed based on published information, and the potential roles of these genes in embryo development are discussed. Thus the availability of the genome of Musa and the information of TDFs sequences presented here opens new possibilities for an in-depth study of the molecular and biochemical research of zygotic and somatic embryogenesis of Musa.

  19. Annotated Draft Genome Assemblies for the Northern Bobwhite (Colinus virginianus) and the Scaled Quail (Callipepla squamata) Reveal Disparate Estimates of Modern Genome Diversity and Historic Effective Population Size.

    PubMed

    Oldeschulte, David L; Halley, Yvette A; Wilson, Miranda L; Bhattarai, Eric K; Brashear, Wesley; Hill, Joshua; Metz, Richard P; Johnson, Charles D; Rollins, Dale; Peterson, Markus J; Bickhart, Derek M; Decker, Jared E; Sewell, John F; Seabury, Christopher M

    2017-09-07

    Northern bobwhite ( Colinus virginianus ; hereafter bobwhite) and scaled quail ( Callipepla squamata ) populations have suffered precipitous declines across most of their US ranges. Illumina-based first- (v1.0) and second- (v2.0) generation draft genome assemblies for the scaled quail and the bobwhite produced N50 scaffold sizes of 1.035 and 2.042 Mb, thereby producing a 45-fold improvement in contiguity over the existing bobwhite assembly, and ≥90% of the assembled genomes were captured within 1313 and 8990 scaffolds, respectively. The scaled quail assembly (v1.0 = 1.045 Gb) was ∼20% smaller than the bobwhite (v2.0 = 1.254 Gb), which was supported by kmer-based estimates of genome size. Nevertheless, estimates of GC content (41.72%; 42.66%), genome-wide repetitive content (10.40%; 10.43%), and MAKER-predicted protein coding genes (17,131; 17,165) were similar for the scaled quail (v1.0) and bobwhite (v2.0) assemblies, respectively. BUSCO analyses utilizing 3023 single-copy orthologs revealed a high level of assembly completeness for the scaled quail (v1.0; 84.8%) and the bobwhite (v2.0; 82.5%), as verified by comparison with well-established avian genomes. We also detected 273 putative segmental duplications in the scaled quail genome (v1.0), and 711 in the bobwhite genome (v2.0), including some that were shared among both species. Autosomal variant prediction revealed ∼2.48 and 4.17 heterozygous variants per kilobase within the scaled quail (v1.0) and bobwhite (v2.0) genomes, respectively, and estimates of historic effective population size were uniformly higher for the bobwhite across all time points in a coalescent model. However, large-scale declines were predicted for both species beginning ∼15-20 KYA. Copyright © 2017 Oldeschulte et al.

  20. Transcript levels of ten caste-related genes in adult diploid males of Melipona quadrifasciata (Hymenoptera, Apidae) - A comparison with haploid males, queens and workers

    PubMed Central

    Borges, Andreia A.; Humann, Fernanda C.; Oliveira Campos, Lucio A.; Tavares, Mara G.; Hartfelder, Klaus

    2011-01-01

    In Hymenoptera, homozygosity at the sex locus results in the production of diploid males. In social species, these pose a double burden by having low fitness and drawing resources normally spent for increasing the work force of a colony. Yet, diploid males are of academic interest as they can elucidate effects of ploidy (normal males are haploid, whereas the female castes, the queens and workers, are diploid) on morphology and life history. Herein we investigated expression levels of ten caste-related genes in the stingless bee Melipona quadrifasciata, comparing newly emerged and 5-day-old diploid males with haploid males, queens and workers. In diploid males, transcript levels for dunce and paramyosin were increased during the first five days of adult life, while those for diacylglycerol kinase and the transcriptional co-repressor groucho diminished. Two general trends were apparent, (i) gene expression patterns in diploid males were overall more similar to haploid ones and workers than to queens, and (ii) in queens and workers, more genes were up-regulated after emergence until day five, whereas in diploid and especially so in haploid males more genes were down-regulated. This difference between the sexes may be related to longevity, which is much longer in females than in males. PMID:22215977

  1. Transcript levels of ten caste-related genes in adult diploid males of Melipona quadrifasciata (Hymenoptera, Apidae) - A comparison with haploid males, queens and workers.

    PubMed

    Borges, Andreia A; Humann, Fernanda C; Oliveira Campos, Lucio A; Tavares, Mara G; Hartfelder, Klaus

    2011-10-01

    In Hymenoptera, homozygosity at the sex locus results in the production of diploid males. In social species, these pose a double burden by having low fitness and drawing resources normally spent for increasing the work force of a colony. Yet, diploid males are of academic interest as they can elucidate effects of ploidy (normal males are haploid, whereas the female castes, the queens and workers, are diploid) on morphology and life history. Herein we investigated expression levels of ten caste-related genes in the stingless bee Melipona quadrifasciata, comparing newly emerged and 5-day-old diploid males with haploid males, queens and workers. In diploid males, transcript levels for dunce and paramyosin were increased during the first five days of adult life, while those for diacylglycerol kinase and the transcriptional co-repressor groucho diminished. Two general trends were apparent, (i) gene expression patterns in diploid males were overall more similar to haploid ones and workers than to queens, and (ii) in queens and workers, more genes were up-regulated after emergence until day five, whereas in diploid and especially so in haploid males more genes were down-regulated. This difference between the sexes may be related to longevity, which is much longer in females than in males.

  2. Karyotype and genome size in Euterpe Mart. (Arecaceae) species.

    PubMed

    Oliveira, Ludmila Cristina; de Oliveira, Maria do Socorro Padilha; Davide, Lisete Chamma; Torres, Giovana Augusta

    2016-01-01

    Euterpe (Martius, 1823), a genus from Central and South America, has species with high economic importance in Brazil, because of their palm heart and fruits, known as açaí berries. Breeding programs have been conducted to increase yield and establish cultivation systems to replace the extraction of wild material. These programs need basic information about the genome of these species to better explore the available genetic variability. The aim of this study was to compare Euterpe edulis (Martius, 1824), Euterpe oleracea (Martius, 1824) and Euterpe precatoria (Martius, 1842), with regard to karyotype, type of interphase nucleus and nuclear DNA amount. Metaphase chromosomes and interphase nuclei from root tip meristematic cells were obtained by the squashing technique and solid stained for microscope analysis. The DNA amount was estimated by flow cytometry. There were previous reports on the chromosome number of Euterpe edulis and Euterpe oleracea, but chromosome morphology of these two species and the whole karyotype of Euterpe precatoria are reported for the first time. The species have 2n=36, a number considered as a pleisomorphic feature in Arecoideae since the modern species, according to floral morphology, have the lowest chromosome number (2n=28 and 2n=30). The three Euterpe species also have the same type of interphase nuclei, classified as semi-reticulate. The species differed on karyotypic formulas, on localization of secondary constriction and genome size. The data suggest that the main forces driving Euterpe karyotype evolution were structural rearrangements, such as inversions and translocations that alter chromosome morphology, and either deletion or amplification that led to changes in chromosome size.

  3. Karyotype and genome size in Euterpe Mart. (Arecaceae) species

    PubMed Central

    Oliveira, Ludmila Cristina; de Oliveira, Maria do Socorro Padilha; Davide, Lisete Chamma; Torres, Giovana Augusta

    2016-01-01

    Abstract Euterpe (Martius, 1823), a genus from Central and South America, has species with high economic importance in Brazil, because of their palm heart and fruits, known as açaí berries. Breeding programs have been conducted to increase yield and establish cultivation systems to replace the extraction of wild material. These programs need basic information about the genome of these species to better explore the available genetic variability. The aim of this study was to compare Euterpe edulis (Martius, 1824), Euterpe oleracea (Martius, 1824) and Euterpe precatoria (Martius, 1842), with regard to karyotype, type of interphase nucleus and nuclear DNA amount. Metaphase chromosomes and interphase nuclei from root tip meristematic cells were obtained by the squashing technique and solid stained for microscope analysis. The DNA amount was estimated by flow cytometry. There were previous reports on the chromosome number of Euterpe edulis and Euterpe oleracea, but chromosome morphology of these two species and the whole karyotype of Euterpe precatoria are reported for the first time. The species have 2n=36, a number considered as a pleisomorphic feature in Arecoideae since the modern species, according to floral morphology, have the lowest chromosome number (2n=28 and 2n=30). The three Euterpe species also have the same type of interphase nuclei, classified as semi-reticulate. The species differed on karyotypic formulas, on localization of secondary constriction and genome size. The data suggest that the main forces driving Euterpe karyotype evolution were structural rearrangements, such as inversions and translocations that alter chromosome morphology, and either deletion or amplification that led to changes in chromosome size. PMID:27186334

  4. A Genome-Wide Association Study Identifies Multiple Regions Associated with Head Size in Catfish

    PubMed Central

    Geng, Xin; Liu, Shikai; Yao, Jun; Bao, Lisui; Zhang, Jiaren; Li, Chao; Wang, Ruijia; Sha, Jin; Zeng, Peng; Zhi, Degui; Liu, Zhanjiang

    2016-01-01

    Skull morphology is fundamental to evolution and the biological adaptation of species to their environments. With aquaculture fish species, head size is also important for economic reasons because it has a direct impact on fillet yield. However, little is known about the underlying genetic basis of head size. Catfish is the primary aquaculture species in the United States. In this study, we performed a genome-wide association study using the catfish 250K SNP array with backcross hybrid catfish to map the QTL for head size (head length, head width, and head depth). One significantly associated region on linkage group (LG) 7 was identified for head length. In addition, LGs 7, 9, and 16 contain suggestively associated regions for head length. For head width, significantly associated regions were found on LG9, and additional suggestively associated regions were identified on LGs 5 and 7. No region was found associated with head depth. Head size genetic loci were mapped in catfish to genomic regions with candidate genes involved in bone development. Comparative analysis indicated that homologs of several candidate genes are also involved in skull morphology in various other species ranging from amphibian to mammalian species, suggesting possible evolutionary conservation of those genes in the control of skull morphologies. PMID:27558670

  5. Xenopus laevis ribosomal protein genes: isolation of recombinant cDNA clones and study of the genomic organization.

    PubMed Central

    Bozzoni, I; Beccari, E; Luo, Z X; Amaldi, F

    1981-01-01

    Poly-A+ mRNA from Xenopus laevis oocytes, partially enriched for r-protein coding capacity has been used as starting material for preparing a cDNA bank in plasmid pBR322. The clones containing sequences specific for r-proteins have been selected by translation of the complementary mRNAs. Clones for six different r-proteins have been identified and utilized as probes for studying their genomic organization. Two gene copies per haploid genome were found for r-proteins L1, L14, S19, and four-five for protein S1, S8 and L32. Moreover a population polymorphism has been observed for the genomic regions containing sequences for r-protein S1, S8 and L14. Images PMID:6112733

  6. Double-strand breaks in genome-sized DNA caused by mechanical stress under mixing: Quantitative evaluation through single-molecule observation

    NASA Astrophysics Data System (ADS)

    Kikuchi, Hayato; Nose, Keiji; Yoshikawa, Yuko; Yoshikawa, Kenichi

    2018-06-01

    It is becoming increasingly apparent that changes in the higher-order structure of genome-sized DNA molecules of more than several tens kbp play important roles in the self-control of genome activity in living cells. Unfortunately, it has been rather difficult to prepare genome-sized DNA molecules without damage or fragmentation. Here, we evaluated the degree of double-strand breaks (DSBs) caused by mechanical mixing by single-molecule observation with fluorescence microscopy. The results show that DNA breaks are most significant for the first second after the initiation of mechanical agitation. Based on such observation, we propose a novel mixing procedure to significantly decrease DSBs.

  7. De-Novo Assembly and Analysis of the Heterozygous Triploid Genome of the Wine Spoilage Yeast Dekkera bruxellensis AWRI1499

    PubMed Central

    Chambers, Paul J.; Pretorius, Isak S.

    2012-01-01

    Despite its industrial importance, the yeast species Dekkera (Brettanomyces) bruxellensis has remained poorly understood at the genetic level. In this study we describe whole genome sequencing and analysis for a prevalent wine spoilage strain, AWRI1499. The 12.7 Mb assembly, consisting of 324 contigs in 99 scaffolds (super-contigs) at 26-fold coverage, exhibits a relatively high density of single nucleotide polymorphisms (SNPs). Haplotype sampling for 1.2% of open reading frames suggested that the D. bruxellensis AWRI1499 genome is comprised of a moderately heterozygous diploid genome, in combination with a divergent haploid genome. Gene content analysis revealed enrichment in membrane proteins, particularly transporters, along with oxidoreductase enzymes. Availability of this assembly and annotation provides a resource for further investigation of genomic organization in this species, and functional characterization of genes that may confer important phenotypic traits. PMID:22470482

  8. Insights into Land Plant Evolution Garnered from the Marchantia polymorpha Genome.

    PubMed

    Bowman, John L; Kohchi, Takayuki; Yamato, Katsuyuki T; Jenkins, Jerry; Shu, Shengqiang; Ishizaki, Kimitsune; Yamaoka, Shohei; Nishihama, Ryuichi; Nakamura, Yasukazu; Berger, Frédéric; Adam, Catherine; Aki, Shiori Sugamata; Althoff, Felix; Araki, Takashi; Arteaga-Vazquez, Mario A; Balasubrmanian, Sureshkumar; Barry, Kerrie; Bauer, Diane; Boehm, Christian R; Briginshaw, Liam; Caballero-Perez, Juan; Catarino, Bruno; Chen, Feng; Chiyoda, Shota; Chovatia, Mansi; Davies, Kevin M; Delmans, Mihails; Demura, Taku; Dierschke, Tom; Dolan, Liam; Dorantes-Acosta, Ana E; Eklund, D Magnus; Florent, Stevie N; Flores-Sandoval, Eduardo; Fujiyama, Asao; Fukuzawa, Hideya; Galik, Bence; Grimanelli, Daniel; Grimwood, Jane; Grossniklaus, Ueli; Hamada, Takahiro; Haseloff, Jim; Hetherington, Alexander J; Higo, Asuka; Hirakawa, Yuki; Hundley, Hope N; Ikeda, Yoko; Inoue, Keisuke; Inoue, Shin-Ichiro; Ishida, Sakiko; Jia, Qidong; Kakita, Mitsuru; Kanazawa, Takehiko; Kawai, Yosuke; Kawashima, Tomokazu; Kennedy, Megan; Kinose, Keita; Kinoshita, Toshinori; Kohara, Yuji; Koide, Eri; Komatsu, Kenji; Kopischke, Sarah; Kubo, Minoru; Kyozuka, Junko; Lagercrantz, Ulf; Lin, Shih-Shun; Lindquist, Erika; Lipzen, Anna M; Lu, Chia-Wei; De Luna, Efraín; Martienssen, Robert A; Minamino, Naoki; Mizutani, Masaharu; Mizutani, Miya; Mochizuki, Nobuyoshi; Monte, Isabel; Mosher, Rebecca; Nagasaki, Hideki; Nakagami, Hirofumi; Naramoto, Satoshi; Nishitani, Kazuhiko; Ohtani, Misato; Okamoto, Takashi; Okumura, Masaki; Phillips, Jeremy; Pollak, Bernardo; Reinders, Anke; Rövekamp, Moritz; Sano, Ryosuke; Sawa, Shinichiro; Schmid, Marc W; Shirakawa, Makoto; Solano, Roberto; Spunde, Alexander; Suetsugu, Noriyuki; Sugano, Sumio; Sugiyama, Akifumi; Sun, Rui; Suzuki, Yutaka; Takenaka, Mizuki; Takezawa, Daisuke; Tomogane, Hirokazu; Tsuzuki, Masayuki; Ueda, Takashi; Umeda, Masaaki; Ward, John M; Watanabe, Yuichiro; Yazaki, Kazufumi; Yokoyama, Ryusuke; Yoshitake, Yoshihiro; Yotsui, Izumi; Zachgo, Sabine; Schmutz, Jeremy

    2017-10-05

    The evolution of land flora transformed the terrestrial environment. Land plants evolved from an ancestral charophycean alga from which they inherited developmental, biochemical, and cell biological attributes. Additional biochemical and physiological adaptations to land, and a life cycle with an alternation between multicellular haploid and diploid generations that facilitated efficient dispersal of desiccation tolerant spores, evolved in the ancestral land plant. We analyzed the genome of the liverwort Marchantia polymorpha, a member of a basal land plant lineage. Relative to charophycean algae, land plant genomes are characterized by genes encoding novel biochemical pathways, new phytohormone signaling pathways (notably auxin), expanded repertoires of signaling pathways, and increased diversity in some transcription factor families. Compared with other sequenced land plants, M. polymorpha exhibits low genetic redundancy in most regulatory pathways, with this portion of its genome resembling that predicted for the ancestral land plant. PAPERCLIP. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  9. Multiplicity of genome equivalents in the radiation-resistant bacterium Micrococcus radiodurans.

    PubMed Central

    Hansen, M T

    1978-01-01

    The complexity of the genome of Micrococcus radiodurans was determined to be (2.0 +/- 0.3) X 10(9) daltons by DNA renaturation kinetics. The number of genome equivalents of DNA per cell was calculated from the complexity and the content of DNA. A lower limit of four genome equivalents per cell was approached with decreasing growth rate. Thus, no haploid stage appeared to be realized in this organism. The replication time was estimated from the kinetics and amount of residual DNA synthesis after inhibiting initiation of new rounds of replication. From this, the redundancy of terminal genetic markers was calculated to vary with growth rate from four to approximately eight copies per cell. All genetic material, including the least abundant, is thus multiply represented in each cell. The potential significance of the maintenance in each cell of multiple gene copies is discussed in relation to the extreme radiation resistance of M. radiodurans. PMID:649572

  10. Hybrid origin of gynogenetic clones and the introgression of their mitochondrial genome into sexual diploids through meiotic hybridogenesis in the loach, Misgurnus anguillicuadatus.

    PubMed

    Yamada, Aya; Kodo, Yukihiro; Murakami, Masaru; Kuroda, Masamichi; Aoki, Takao; Fujimoto, Takafumi; Arai, Katsutoshi

    2015-11-01

    In a few Japanese populations of the loach Misgurnus anguillicaudatus (Teleostei: Cobitidae), clonal diploid lineages produce unreduced diploid eggs that normally undergo gynogenetic reproduction; however the origin of these clones remains elusive. Here, we show the presence of two diverse clades, A and B, within this loach species from sequence analyses of two nuclear genes RAG1 (recombination activating gene 1) and IRBP2 (interphotoreceptor retinoid-binding protein, 2) and then demonstrate heterozygous genotypes fixed at the two loci as the evidence of the hybrid nature of clonal lineages. All the clonal individuals were identified by clone-specific mitochondrial DNA haplotypes, microsatellite genotypes, and random amplified polymorphic DNA fingerprints; they commonly showed two alleles, one from clade A and another from clade B, whereas other wild-type diploids possessed alleles from either clade A or B. However, we also found wild-type diploids with clone-specific mitochondrial DNA and nuclear genes from clade B. One possible explanation is an introgression of a clone-specific mitochondrial genome from clonal to these wild-type loaches. These individuals likely arose by a cross between haploid sperm from bisexual B clade males and haploid eggs with clone-specific mtDNA and clade B nuclear genome, produced by meiotic hybridogenesis (elimination of unmatched A genome followed by meiosis after preferential pairing between two matched B genomes) in clone-origin triploid individual (ABB). © 2015 Wiley Periodicals, Inc.

  11. Dynamics of chromosome number and genome size variation in a cytogenetically variable sedge (Carex scoparia var. scoparia, Cyperaceae).

    PubMed

    Chung, Kyong-Sook; Weber, Jaime A; Hipp, Andrew L

    2011-01-01

    High intraspecific cytogenetic variation in the sedge genus Carex (Cyperaceae) is hypothesized to be due to the "diffuse" or non-localized centromeres, which facilitate chromosome fission and fusion. If chromosome number changes are dominated by fission and fusion, then chromosome evolution will result primarily in changes in the potential for recombination among populations. Chromosome duplications, on the other hand, entail consequent opportunities for divergent evolution of paralogs. In this study, we evaluate whether genome size and chromosome number covary within species. We used flow cytometry to estimate genome sizes in Carex scoparia var. scoparia, sampling 99 plants (23 populations) in the Chicago region, and we used meiotic chromosome observations to document chromosome numbers and chromosome pairing relations. Chromosome numbers range from 2n = 62 to 2n = 68, and nuclear DNA 1C content from 0.342 to 0.361 pg DNA. Regressions of DNA content on chromosome number are nonsignificant for data analyzed by individual or population, and a regression model that excludes slope is favored over a model in which chromosome number predicts genome size. Chromosome rearrangements within cytogenetically variable Carex species are more likely a consequence of fission and fusion than of duplication and deletion. Moreover, neither genome size nor chromosome number is spatially autocorrelated, which suggests the potential for rapid chromosome evolution by fission and fusion at a relatively fine geographic scale (<350 km). These findings have important implications for ecological restoration and speciation within the largest angiosperm genus of the temperate zone.

  12. The battle of the sexes over seed size: support for both kinship genomic imprinting and interlocus contest evolution.

    PubMed

    Willi, Yvonne

    2013-06-01

    Outcrossing creates a venue for parental conflict. When one sex provides parental care to offspring fertilized by several partners, the nonproviding sex is under selection to maximally exploit the caring sex. The caring sex may counteradapt, and a coevolutionary arms race ensues. Genetic models of this conflict include the kinship theory of genomic imprinting (parent-of-origin-specific expression of maternal-care effectors) and interlocus conflict evolution (interaction between male selfish signals and female abatement). Predictions were tested by measuring the sizes of seeds produced by within-population crosses (diallel design) and between-population crosses in outcrossing and selfing populations of Arabidopsis lyrata. Within-population diallel crosses revealed substantial maternal variance in seed size in most populations. The comparison of between- and within-population crosses showed that seeds were larger when pollen came from another outcrossing population than when pollen came from a selfing or the same population, supporting interlocus contest evolution between male selfish genes and female recognition genes. Evidence for kinship genomic imprinting came from complementary trait means of seed size in reciprocal between-population crosses independent of whether populations were predominantly selfing or outcrossing. Hence, both kinship genomic imprinting and interlocus contest are supported in outcrossing Arabidopsis, whereas only kinship genomic imprinting is important in selfing populations.

  13. Dose dependence of the excision of ultraviolet-induced pyrimidine dimers from nuclear deoxyribonucleic acids of haploid and diploid Saccharomyces cerevisiae.

    PubMed Central

    Waters, R; Moustacchi, E

    1975-01-01

    The yield of ultraviolet-induced dimers is similar for a fixed dose in both haploid and diploid Saccharomyces cerevisiae. The excision of these photo-products from the nuclear deoxyribonucleic acids of cells of both ploidies after ultraviolet incident doses of 2 times 10-3 to 4 times 10-3 ergs/mm2 decreased with the corresponding increasing dose. Postirradiation incubation in saline followed by a further incubation in nutrient medium increases the excision as compared to that seen in either nutrient medium or saline alone. Previous data regarding both pyrimidine dimer removal and the survival of haploid and diploid cells after ultraviolet irradiation and either immediate or delayed plating are discussed. PMID:1090608

  14. One haploid parent contributes 100% of the gene pool for a widespread species in northwest North America.

    PubMed

    Karlin, E F; Andrus, R E; Boles, S B; Shaw, A J

    2011-02-01

    The monoicous peatmoss Sphagnum subnitens has a tripartite distribution that includes disjunct population systems in Europe (including the Azores), northwestern North America and New Zealand. Regional genetic diversity was highest in European S. subnitens but in northwestern North America, a single microsatellite-based multilocus haploid genotype was detected across 16 sites ranging from Coos County, Oregon, to Kavalga Island in the Western Aleutians (a distance of some 4115 km). Two multilocus haploid genotypes were detected across 14 sites on South Island, New Zealand. The microsatellite-based regional genetic diversity detected in New Zealand and North American S. subnitens is the lowest reported for any Sphagnum. The low genetic diversity detected in both of these regions most likely resulted from a founder event associated with vegetative propagation and complete selfing, with one founding haploid plant in northwest North America and two in New Zealand. Thus, one plant appears to have contributed 100% of the gene pool for the population systems of S. subnitens occurring in northwest North America, and this is arguably the most genetically uniform group of plants having a widespread distribution yet detected. Although having a distribution spanning 12.5° of latitude and 56° of longitude, there was no evidence of any genetic diversification in S. subnitens in northwest North America. No genetic structure was detected among the three regions, and it appears that European plants of S. subnitens provided the source for New Zealand and northwest North American populations. © 2010 Blackwell Publishing Ltd.

  15. Combination of reversible male sterility and doubled haploid production by targeted inactivation of cytoplasmic glutamine synthetase in developing anthers and pollen.

    PubMed

    Ribarits, Alexandra; Mamun, A N K; Li, Shipeng; Resch, Tatiana; Fiers, Martijn; Heberle-Bors, Erwin; Liu, Chun-Ming; Touraev, Alisher

    2007-07-01

    Reversible male sterility and doubled haploid plant production are two valuable technologies in F(1)-hybrid breeding. F(1)-hybrids combine uniformity with high yield and improved agronomic traits, and provide self-acting intellectual property protection. We have developed an F(1)-hybrid seed technology based on the metabolic engineering of glutamine in developing tobacco anthers and pollen. Cytosolic glutamine synthetase (GS1) was inactivated in tobacco by introducing mutated tobacco GS genes fused to the tapetum-specific TA29 and microspore-specific NTM19 promoters. Pollen in primary transformants aborted close to the first pollen mitosis, resulting in male sterility. A non-segregating population of homozygous doubled haploid male-sterile plants was generated through microspore embryogenesis. Fertility restoration was achieved by spraying plants with glutamine, or by pollination with pollen matured in vitro in glutamine-containing medium. The combination of reversible male sterility with doubled haploid production results in an innovative environmentally friendly breeding technology. Tapetum-mediated sporophytic male sterility is of use in foliage crops, whereas microspore-specific gametophytic male sterility can be applied to any field crop. Both types of sterility preclude the release of transgenic pollen into the environment.

  16. An ancient genome duplication contributed to the abundance of metabolic genes in the moss Physcomitrella patens

    PubMed Central

    Rensing, Stefan A; Ick, Julia; Fawcett, Jeffrey A; Lang, Daniel; Zimmer, Andreas; Van de Peer, Yves; Reski, Ralf

    2007-01-01

    Background: Analyses of complete genomes and large collections of gene transcripts have shown that most, if not all seed plants have undergone one or more genome duplications in their evolutionary past. Results: In this study, based on a large collection of EST sequences, we provide evidence that the haploid moss Physcomitrella patens is a paleopolyploid as well. Based on the construction of linearized phylogenetic trees we infer the genome duplication to have occurred between 30 and 60 million years ago. Gene Ontology and pathway association of the duplicated genes in P. patens reveal different biases of gene retention compared with seed plants. Conclusion: Metabolic genes seem to have been retained in excess following the genome duplication in P. patens. This might, at least partly, explain the versatility of metabolism, as described for P. patens and other mosses, in comparison to other land plants. PMID:17683536

  17. Genome duplication and mutations in ACE2 cause multicellular, fast-sedimenting phenotypes in evolved Saccharomyces cerevisiae

    PubMed Central

    Oud, Bart; Guadalupe-Medina, Victor; Nijkamp, Jurgen F.; de Ridder, Dick; Pronk, Jack T.; van Maris, Antonius J. A.; Daran, Jean-Marc

    2013-01-01

    Laboratory evolution of the yeast Saccharomyces cerevisiae in bioreactor batch cultures yielded variants that grow as multicellular, fast-sedimenting clusters. Knowledge of the molecular basis of this phenomenon may contribute to the understanding of natural evolution of multicellularity and to manipulating cell sedimentation in laboratory and industrial applications of S. cerevisiae. Multicellular, fast-sedimenting lineages obtained from a haploid S. cerevisiae strain in two independent evolution experiments were analyzed by whole genome resequencing. The two evolved cell lines showed different frameshift mutations in a stretch of eight adenosines in ACE2, which encodes a transcriptional regulator involved in cell cycle control and mother-daughter cell separation. Introduction of the two ace2 mutant alleles into the haploid parental strain led to slow-sedimenting cell clusters that consisted of just a few cells, thus representing only a partial reconstruction of the evolved phenotype. In addition to single-nucleotide mutations, a whole-genome duplication event had occurred in both evolved multicellular strains. Construction of a diploid reference strain with two mutant ace2 alleles led to complete reconstruction of the multicellular-fast sedimenting phenotype. This study shows that whole-genome duplication and a frameshift mutation in ACE2 are sufficient to generate a fast-sedimenting, multicellular phenotype in S. cerevisiae. The nature of the ace2 mutations and their occurrence in two independent evolution experiments encompassing fewer than 500 generations of selective growth suggest that switching between unicellular and multicellular phenotypes may be relevant for competitiveness of S. cerevisiae in natural environments. PMID:24145419

  18. Complete mitochondrial genomes of Trisidos kiyoni and Potiarca pilula: Varied mitochondrial genome size and highly rearranged gene order in Arcidae

    PubMed Central

    Sun, Shao’e; Li, Qi; Kong, Lingfeng; Yu, Hong

    2016-01-01

    We present the complete mitochondrial genomes (mitogenomes) of Trisidos kiyoni and Potiarca pilula, both important species from the family Arcidae (Arcoida: Arcacea). Typical bivalve mtDNA features were described, such as the relatively conserved gene number (36 and 37), a high A + T content (62.73% and 61.16%), the preference for A + T-rich codons, and the evidence of non-optimal codon usage. The mitogenomes of Arcidae species are exceptional for their extraordinarily large and variable sizes and substantial gene rearrangements. The mitogenome of T. kiyoni (19,614 bp) and P. pilula (28,470 bp) are the two smallest Arcidae mitogenomes. The compact mitogenomes are weakly associated with gene number and primarily reflect shrinkage of the non-coding regions. The varied size in Arcidae mitogenomes reflect a dynamic history of expansion. A significant positive correlation is observed between mitogenome size and the combined length of cox1-3, the lengths of Cytb, and the combined length of rRNAs (rrnS and rrnL) (P < 0.001). Both protein coding genes (PCGs) and tRNA rearrangements is observed in P. pilula and T. kiyoni mitogenomes. This analysis imply that the complicated gene rearrangement in mitochondrial genome could be considered as one of key characters in inferring higher-level phylogenetic relationship of Arcidae. PMID:27653979

  19. Nuclear fusion and genome encounter during yeast zygote formation.

    PubMed

    Tartakoff, Alan Michael; Jaiswal, Purnima

    2009-06-01

    When haploid cells of Saccharomyces cerevisiae are crossed, parental nuclei congress and fuse with each other. To investigate underlying mechanisms, we have developed assays that evaluate the impact of drugs and mutations. Nuclear congression is inhibited by drugs that perturb the actin and tubulin cytoskeletons. Nuclear envelope (NE) fusion consists of at least five steps in which preliminary modifications are followed by controlled flux of first outer and then inner membrane proteins, all before visible dilation of the waist of the nucleus or coalescence of the parental spindle pole bodies. Flux of nuclear pore complexes occurs after dilation. Karyogamy requires both the Sec18p/NSF ATPase and ER/NE luminal homeostasis. After fusion, chromosome tethering keeps tagged parental genomes separate from each other. The process of NE fusion and evidence of genome independence in yeast provide a prototype for understanding related events in higher eukaryotes.

  20. Evaluating droplet digital PCR for the quantification of human genomic DNA: converting copies per nanoliter to nanograms nuclear DNA per microliter.

    PubMed

    Duewer, David L; Kline, Margaret C; Romsos, Erica L; Toman, Blaza

    2018-05-01

    The highly multiplexed polymerase chain reaction (PCR) assays used for forensic human identification perform best when used with an accurately determined quantity of input DNA. To help ensure the reliable performance of these assays, we are developing a certified reference material (CRM) for calibrating human genomic DNA working standards. To enable sharing information over time and place, CRMs must provide accurate and stable values that are metrologically traceable to a common reference. We have shown that droplet digital PCR (ddPCR) limiting dilution end-point measurements of the concentration of DNA copies per volume of sample can be traceably linked to the International System of Units (SI). Unlike values assigned using conventional relationships between ultraviolet absorbance and DNA mass concentration, entity-based ddPCR measurements are expected to be stable over time. However, the forensic community expects DNA quantity to be stated in terms of mass concentration rather than entity concentration. The transformation can be accomplished given SI-traceable values and uncertainties for the number of nucleotide bases per human haploid genome equivalent (HHGE) and the average molar mass of a nucleotide monomer in the DNA polymer. This report presents the considerations required to establish the metrological traceability of ddPCR-based mass concentration estimates of human nuclear DNA. Graphical abstract The roots of metrological traceability for human nuclear DNA mass concentration results. Values for the factors in blue must be established experimentally. Values for the factors in red have been established from authoritative source materials. HHGE stands for "haploid human genome equivalent"; there are two HHGE per diploid human genome.

  1. Genome-wide analysis of macrosatellite repeat copy number variation in worldwide populations: evidence for differences and commonalities in size distributions and size restrictions

    PubMed Central

    2013-01-01

    Background Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and function is largely understudied. Here, we describe a detailed study of six autosomal and two X chromosomal MSRs among 270 HapMap individuals from Central Europe, Asia and Africa. Copy number variation, stability and genetic heterogeneity of the autosomal macrosatellite repeats RS447 (chromosome 4p), MSR5p (5p), FLJ40296 (13q), RNU2 (17q) and D4Z4 (4q and 10q) and X chromosomal DXZ4 and CT47 were investigated. Results Repeat array size distribution analysis shows that all of these MSRs are highly polymorphic with the most genetic variation among Africans and the least among Asians. A mitotic mutation rate of 0.4-2.2% was observed, exceeding meiotic mutation rates and possibly explaining the large size variability found for these MSRs. By means of a novel Bayesian approach, statistical support for a distinct multimodal rather than a uniform allele size distribution was detected in seven out of eight MSRs, with evidence for equidistant intervals between the modes. Conclusions The multimodal distributions with evidence for equidistant intervals, in combination with the observation of MSR-specific constraints on minimum array size, suggest that MSRs are limited in their configurations and that deviations thereof may cause disease, as is the case for facioscapulohumeral muscular dystrophy. However, at present we cannot exclude that there are mechanistic constraints for MSRs that are not directly disease-related. This study represents the first comprehensive study of MSRs in different human populations by applying novel statistical methods and identifies commonalities and differences in their organization and function in the human genome. PMID:23496858

  2. Mutation induction in haploid yeast after split-dose radiation-exposure. I. Fractionated UV-irradiation.

    PubMed

    Schenk, K; Zölzer, F; Kiefer, J

    1989-01-01

    Mutation induction was investigated in wild-type haploid yeast Saccharomyces cerevisiae after split-dose UV-irradiation. Cells were exposed to fractionated 254 nm-UV-doses separated by intervals from 0 to 6 h with incubation either on non-nutrient or nutrient agar between. The test parameter was resistance to canavanine. If modifications of sensitivity due to incubation are appropriately taken into account there is no change of mutation frequency.

  3. Zaba: a novel miniature transposable element present in genomes of legume plants.

    PubMed

    Macas, J; Neumann, P; Pozárková, D

    2003-08-01

    A novel family of miniature transposable elements, named Zaba, was identified in pea (Pisum sativum) and subsequently also in other legume species using computer analysis of their DNA sequences. Zaba elements are 141-190 bp long, generate 10-bp target site duplications, and their terminal inverted repeats make up most of the sequence. Zaba elements thus resemble class 3 foldback transposons. The elements are only moderately repetitive in pea (tens to hundreds copies per haploid genome), but they are present in up to thousands of copies in the genomes of several Medicago and Vicia species. More detailed analysis of the elements from pea, including isolation of new sequences from a genomic library, revealed that a fraction of these elements are truncated, and that their last transposition probably did not occur recently. A search for Zaba sequences in EST databases showed that at least some elements are transcribed, most probably due to their association with genic regions.

  4. Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome.

    PubMed

    Wang, Jia; Chen, Dijun; Lei, Yang; Chang, Ji-Wei; Hao, Bao-Hai; Xing, Feng; Li, Sen; Xu, Qiang; Deng, Xiu-Xin; Chen, Ling-Ling

    2014-01-01

    Citrus is one of the most important and widely grown fruit crop with global production ranking firstly among all the fruit crops in the world. Sweet orange accounts for more than half of the Citrus production both in fresh fruit and processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-based organization of sweet orange genomic data, which integrates ab initio gene prediction, EST, RNA-seq and RNA-paired end tag (RNA-PET) evidence-based gene annotation. Furthermore, we provide a user-friendly web interface to show the predicted protein-protein interactions (PPIs) and metabolic pathways in sweet orange. CAP provides comprehensive information beneficial to the researchers of sweet orange and other woody plants, which is freely available at http://citrus.hzau.edu.cn/.

  5. Genetic Dissection of Leaf Development in Brassica rapa Using a Genetical Genomics Approach1[W

    PubMed Central

    Xiao, Dong; Wang, Huange; Basnet, Ram Kumar; Zhao, Jianjun; Lin, Ke; Hou, Xilin; Bonnema, Guusje

    2014-01-01

    The paleohexaploid crop Brassica rapa harbors an enormous reservoir of morphological variation, encompassing leafy vegetables, vegetable and fodder turnips (Brassica rapa, ssp. campestris), and oil crops, with different crops having very different leaf morphologies. In the triplicated B. rapa genome, many genes have multiple paralogs that may be regulated differentially and contribute to phenotypic variation. Using a genetical genomics approach, phenotypic data from a segregating doubled haploid population derived from a cross between cultivar Yellow sarson (oil type) and cultivar Pak choi (vegetable type) were used to identify loci controlling leaf development. Twenty-five colocalized phenotypic quantitative trait loci (QTLs) contributing to natural variation for leaf morphological traits, leaf number, plant architecture, and flowering time were identified. Genetic analysis showed that four colocalized phenotypic QTLs colocalized with flowering time and leaf trait candidate genes, with their cis-expression QTLs and cis- or trans-expression QTLs for homologs of genes playing a role in leaf development in Arabidopsis (Arabidopsis thaliana). The leaf gene BRASSICA RAPA KIP-RELATED PROTEIN2_A03 colocalized with QTLs for leaf shape and plant height; BRASSICA RAPA ERECTA_A09 colocalized with QTLs for leaf color and leaf shape; BRASSICA RAPA LONGIFOLIA1_A10 colocalized with QTLs for leaf size, leaf color, plant branching, and flowering time; while the major flowering time gene, BRASSICA RAPA FLOWERING LOCUS C_A02, colocalized with QTLs explaining variation in flowering time, plant architectural traits, and leaf size. Colocalization of these QTLs points to pleiotropic regulation of leaf development and plant architectural traits in B. rapa. PMID:24394778

  6. Extraordinary Genetic Diversity in a Wood Decay Mushroom.

    PubMed

    Baranova, Maria A; Logacheva, Maria D; Penin, Aleksey A; Seplyarskiy, Vladimir B; Safonova, Yana Y; Naumenko, Sergey A; Klepikova, Anna V; Gerasimov, Evgeny S; Bazykin, Georgii A; James, Timothy Y; Kondrashov, Alexey S

    2015-10-01

    Populations of different species vary in the amounts of genetic diversity they possess. Nucleotide diversity π, the fraction of nucleotides that are different between two randomly chosen genotypes, has been known to range in eukaryotes between 0.0001 in Lynx lynx and 0.16 in Caenorhabditis brenneri. Here, we report the results of a comparative analysis of 24 haploid genotypes (12 from the United States and 12 from European Russia) of a split-gill fungus Schizophyllum commune. The diversity at synonymous sites is 0.20 in the American population of S. commune and 0.13 in the Russian population. This exceptionally high level of nucleotide diversity also leads to extreme amino acid diversity of protein-coding genes. Using whole-genome resequencing of 2 parental and 17 offspring haploid genotypes, we estimate that the mutation rate in S. commune is high, at 2.0 × 10(-8) (95% CI: 1.1 × 10(-8) to 4.1 × 10(-8)) per nucleotide per generation. Therefore, the high diversity of S. commune is primarily determined by its elevated mutation rate, although high effective population size likely also plays a role. Small genome size, ease of cultivation and completion of the life cycle in the laboratory, free-living haploid life stages and exceptionally high variability of S. commune make it a promising model organism for population, quantitative, and evolutionary genetics. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Analysis of BAC end sequences in oak, a keystone forest tree species, providing insight into the composition of its genome

    PubMed Central

    2011-01-01

    Background One of the key goals of oak genomics research is to identify genes of adaptive significance. This information may help to improve the conservation of adaptive genetic variation and the management of forests to increase their health and productivity. Deep-coverage large-insert genomic libraries are a crucial tool for attaining this objective. We report herein the construction of a BAC library for Quercus robur, its characterization and an analysis of BAC end sequences. Results The EcoRI library generated consisted of 92,160 clones, 7% of which had no insert. Levels of chloroplast and mitochondrial contamination were below 3% and 1%, respectively. Mean clone insert size was estimated at 135 kb. The library represents 12 haploid genome equivalents and, the likelihood of finding a particular oak sequence of interest is greater than 99%. Genome coverage was confirmed by PCR screening of the library with 60 unique genetic loci sampled from the genetic linkage map. In total, about 20,000 high-quality BAC end sequences (BESs) were generated by sequencing 15,000 clones. Roughly 5.88% of the combined BAC end sequence length corresponded to known retroelements while ab initio repeat detection methods identified 41 additional repeats. Collectively, characterized and novel repeats account for roughly 8.94% of the genome. Further analysis of the BESs revealed 1,823 putative genes suggesting at least 29,340 genes in the oak genome. BESs were aligned with the genome sequences of Arabidopsis thaliana, Vitis vinifera and Populus trichocarpa. One putative collinear microsyntenic region encoding an alcohol acyl transferase protein was observed between oak and chromosome 2 of V. vinifera. Conclusions This BAC library provides a new resource for genomic studies, including SSR marker development, physical mapping, comparative genomics and genome sequencing. BES analysis provided insight into the structure of the oak genome. These sequences will be used in the assembly of a

  8. Complete mitochondrial genome sequence of Melipona scutellaris, a Brazilian stingless bee.

    PubMed

    Pereira, Ulisses de Padua; Bonetti, Ana Maria; Goulart, Luiz Ricardo; Santos, Anderson Rodrigues Dos; Oliveira, Guilherme Correa de; Cuadros-Orellana, Sara; Ueira-Vieira, Carlos

    2016-09-01

    Melipona scutellaris is a Brazilian stingless bee species and a highly important native pollinator besides its use in rational rearing for honey production. In this study, we present the whole mitochondrial DNA sequence of M. scutellaris from a haploid male. The mitogenome has a size of 14,862 bp and harbors 13 protein-coding genes (PCGs), 2 rRNA genes and 21 tRNA genes.

  9. Annotation of Differentially Expressed Genes in the Somatic Embryogenesis of Musa and Their Location in the Banana Genome

    PubMed Central

    Maldonado-Borges, Josefina Ines; Ku-Cauich, José Roberto; Escobedo-GraciaMedrano, Rosa Maria

    2013-01-01

    Analysis of cDNA-AFLP was used to study the genes expressed in zygotic and somatic embryogenesis of Musa acuminata Colla ssp. malaccensis, and a comparison was made between their differential transcribed fragments (TDFs) and the sequenced genome of the double haploid- (DH-) Pahang of the malaccensis subspecies that is available in the network. A total of 253 transcript-derived fragments (TDFs) were detected with apparent size of 100–4000 bp using 5 pairs of AFLP primers, of which 21 were differentially expressed during the different stages of banana embryogenesis; 15 of the sequences have matched DH-Pahang chromosomes, with 7 of them being homologous to gene sequences encoding either known or putative protein domains of higher plants. Four TDF sequences were located in all Musa chromosomes, while the rest were located in one or two chromosomes. Their putative individual function is briefly reviewed based on published information, and the potential roles of these genes in embryo development are discussed. Thus the availability of the genome of Musa and the information of TDFs sequences presented here opens new possibilities for an in-depth study of the molecular and biochemical research of zygotic and somatic embryogenesis of Musa. PMID:24027442

  10. Quantitative trait loci mapping of heat tolerance in a doubled haploid population of broccoli using genotyping-by-sequencing

    USDA-ARS?s Scientific Manuscript database

    Broccoli is a cool weather vegetable crop with a vernalization requirement to initiate and maintain floral development. Breeding for heat tolerance in broccoli has the potential to both expand viable production areas and extend the growing season. A doubled haploid (DH) population of broccoli (Bras...

  11. Physical mapping and BAC-end sequence analysis provide initial insights into the flax (Linum usitatissimum L.) genome

    PubMed Central

    2011-01-01

    Background Flax (Linum usitatissimum L.) is an important source of oil rich in omega-3 fatty acids, which have proven health benefits and utility as an industrial raw material. Flax seeds also contain lignans which are associated with reducing the risk of certain types of cancer. Its bast fibres have broad industrial applications. However, genomic tools needed for molecular breeding were non existent. Hence a project, Total Utilization Flax GENomics (TUFGEN) was initiated. We report here the first genome-wide physical map of flax and the generation and analysis of BAC-end sequences (BES) from 43,776 clones, providing initial insights into the genome. Results The physical map consists of 416 contigs spanning ~368 Mb, assembled from 32,025 fingerprints, representing roughly 54.5% to 99.4% of the estimated haploid genome (370-675 Mb). The N50 size of the contigs was estimated to be ~1,494 kb. The longest contig was ~5,562 kb comprising 437 clones. There were 96 contigs containing more than 100 clones. Approximately 54.6 Mb representing 8-14.8% of the genome was obtained from 80,337 BES. Annotation revealed that a large part of the genome consists of ribosomal DNA (~13.8%), followed by known transposable elements at 6.1%. Furthermore, ~7.4% of sequence was identified to harbour novel repeat elements. Homology searches against flax-ESTs and NCBI-ESTs suggested that ~5.6% of the transcriptome is unique to flax. A total of 4064 putative genomic SSRs were identified and are being developed as novel markers for their use in molecular breeding. Conclusion The first genome-wide physical map of flax constructed with BAC clones provides a framework for accessing target loci with economic importance for marker development and positional cloning. Analysis of the BES has provided insights into the uniqueness of the flax genome. Compared to other plant genomes, the proportion of rDNA was found to be very high whereas the proportion of known transposable elements was low. The SSRs

  12. Physical mapping and BAC-end sequence analysis provide initial insights into the flax (Linum usitatissimum L.) genome.

    PubMed

    Ragupathy, Raja; Rathinavelu, Rajkumar; Cloutier, Sylvie

    2011-05-09

    Flax (Linum usitatissimum L.) is an important source of oil rich in omega-3 fatty acids, which have proven health benefits and utility as an industrial raw material. Flax seeds also contain lignans which are associated with reducing the risk of certain types of cancer. Its bast fibres have broad industrial applications. However, genomic tools needed for molecular breeding were non existent. Hence a project, Total Utilization Flax GENomics (TUFGEN) was initiated. We report here the first genome-wide physical map of flax and the generation and analysis of BAC-end sequences (BES) from 43,776 clones, providing initial insights into the genome. The physical map consists of 416 contigs spanning ~368 Mb, assembled from 32,025 fingerprints, representing roughly 54.5% to 99.4% of the estimated haploid genome (370-675 Mb). The N50 size of the contigs was estimated to be ~1,494 kb. The longest contig was ~5,562 kb comprising 437 clones. There were 96 contigs containing more than 100 clones. Approximately 54.6 Mb representing 8-14.8% of the genome was obtained from 80,337 BES. Annotation revealed that a large part of the genome consists of ribosomal DNA (~13.8%), followed by known transposable elements at 6.1%. Furthermore, ~7.4% of sequence was identified to harbour novel repeat elements. Homology searches against flax-ESTs and NCBI-ESTs suggested that ~5.6% of the transcriptome is unique to flax. A total of 4064 putative genomic SSRs were identified and are being developed as novel markers for their use in molecular breeding. The first genome-wide physical map of flax constructed with BAC clones provides a framework for accessing target loci with economic importance for marker development and positional cloning. Analysis of the BES has provided insights into the uniqueness of the flax genome. Compared to other plant genomes, the proportion of rDNA was found to be very high whereas the proportion of known transposable elements was low. The SSRs identified from BES will be

  13. A Three-Dimensional Model of the Yeast Genome

    NASA Astrophysics Data System (ADS)

    Noble, William; Duan, Zhi-Jun; Andronescu, Mirela; Schutz, Kevin; McIlwain, Sean; Kim, Yoo Jung; Lee, Choli; Shendure, Jay; Fields, Stanley; Blau, C. Anthony

    Layered on top of information conveyed by DNA sequence and chromatin are higher order structures that encompass portions of chromosomes, entire chromosomes, and even whole genomes. Interphase chromosomes are not positioned randomly within the nucleus, but instead adopt preferred conformations. Disparate DNA elements co-localize into functionally defined aggregates or factories for transcription and DNA replication. In budding yeast, Drosophila and many other eukaryotes, chromosomes adopt a Rabl configuration, with arms extending from centromeres adjacent to the spindle pole body to telomeres that abut the nuclear envelope. Nonetheless, the topologies and spatial relationships of chromosomes remain poorly understood. Here we developed a method to globally capture intra- and inter-chromosomal interactions, and applied it to generate a map at kilobase resolution of the haploid genome of Saccharomyces cerevisiae. The map recapitulates known features of genome organization, thereby validating the method, and identifies new features. Extensive regional and higher order folding of individual chromosomes is observed. Chromosome XII exhibits a striking conformation that implicates the nucleolus as a formidable barrier to interaction between DNA sequences at either end. Inter-chromosomal contacts are anchored by centromeres and include interactions among transfer RNA genes, among origins of early DNA replication and among sites where chromosomal breakpoints occur. Finally, we constructed a three-dimensional model of the yeast genome. Our findings provide a glimpse of the interface between the form and function of a eukaryotic genome.

  14. The GAAS metagenomic tool and its estimations of viral and microbial average genome size in four major biomes.

    PubMed

    Angly, Florent E; Willner, Dana; Prieto-Davó, Alejandra; Edwards, Robert A; Schmieder, Robert; Vega-Thurber, Rebecca; Antonopoulos, Dionysios A; Barott, Katie; Cottrell, Matthew T; Desnues, Christelle; Dinsdale, Elizabeth A; Furlan, Mike; Haynes, Matthew; Henn, Matthew R; Hu, Yongfei; Kirchman, David L; McDole, Tracey; McPherson, John D; Meyer, Folker; Miller, R Michael; Mundt, Egbert; Naviaux, Robert K; Rodriguez-Mueller, Beltran; Stevens, Rick; Wegley, Linda; Zhang, Lixin; Zhu, Baoli; Rohwer, Forest

    2009-12-01

    Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS), a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites) suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and environmental conditions.

  15. Genetic Dissection of Clonally Inherited Genomes of Poeciliopsis. I. Linkage Analysis and Preliminary Assessment of Deleterious Gene Loads

    PubMed Central

    Leslie, James F.; Vrijenhoek, Robert C.

    1978-01-01

    Theoretical considerations suggest that a high load of deleterious mutations should accumulate in asexual genomes. An ideal system for testing this hypothesis occurs in the hybrid all-female fish Poeciliopsis monacha-lucida. The hybrid genotype is retained between generations by an oogenetic process that transmits only a nonrecombinant haploid monacha genome to their ova. The hybrid genotype is re-established in nature by fertilization of these monacha eggs with sperm from a sexual species, P. lucida. The unique reproductive mechanism of these hybrids allows the genetic dissection of the clonal monacha genome by forced matings with males of P. monacha. The resultant F1 hybrids and their backcross progeny were examined to determine the amount and kinds of genetic changes that might have occurred in two clonal monacha genomes.—Using six allozyme markers, four similar linkage groups were identified in each clonal genome. Segregation and assortment at these loci revealed no apparent differences between monacha genomes from sexually and clonally reproducing species. Mortality of F1 and backcross progeny revealed differences between the two clonal genomes, suggesting that deleterious genes may accumulate in genomes sheltered from recombination. PMID:17248875

  16. Population genomics of eusocial insects: the costs of a vertebrate-like effective population size.

    PubMed

    Romiguier, J; Lourenco, J; Gayral, P; Faivre, N; Weinert, L A; Ravel, S; Ballenghien, M; Cahais, V; Bernard, A; Loire, E; Keller, L; Galtier, N

    2014-03-01

    The evolution of reproductive division of labour and social life in social insects has lead to the emergence of several life-history traits and adaptations typical of larger organisms: social insect colonies can reach masses of several kilograms, they start reproducing only when they are several years old, and can live for decades. These features and the monopolization of reproduction by only one or few individuals in a colony should affect molecular evolution by reducing the effective population size. We tested this prediction by analysing genome-wide patterns of coding sequence polymorphism and divergence in eusocial vs. noneusocial insects based on newly generated RNA-seq data. We report very low amounts of genetic polymorphism and an elevated ratio of nonsynonymous to synonymous changes – a marker of the effective population size – in four distinct species of eusocial insects, which were more similar to vertebrates than to solitary insects regarding molecular evolutionary processes. Moreover, the ratio of nonsynonymous to synonymous substitutions was positively correlated with the level of social complexity across ant species. These results are fully consistent with the hypothesis of a reduced effective population size and an increased genetic load in eusocial insects, indicating that the evolution of social life has important consequences at both the genomic and population levels. © 2014 The Authors. Journal of Evolutionary Biology © 2014 European Society For Evolutionary Biology.

  17. Identification of Spen as a Crucial Factor for Xist Function through Forward Genetic Screening in Haploid Embryonic Stem Cells

    PubMed Central

    Monfort, Asun; Di Minin, Giulio; Postlmayr, Andreas; Freimann, Remo; Arieti, Fabiana; Thore, Stéphane; Wutz, Anton

    2015-01-01

    Summary In mammals, the noncoding Xist RNA triggers transcriptional silencing of one of the two X chromosomes in female cells. Here, we report a genetic screen for silencing factors in X chromosome inactivation using haploid mouse embryonic stem cells (ESCs) that carry an engineered selectable reporter system. This system was able to identify several candidate factors that are genetically required for chromosomal repression by Xist. Among the list of candidates, we identify the RNA-binding protein Spen, the homolog of split ends. Independent validation through gene deletion in ESCs confirms that Spen is required for gene repression by Xist. However, Spen is not required for Xist RNA localization and the recruitment of chromatin modifications, including Polycomb protein Ezh2. The identification of Spen opens avenues for further investigation into the gene-silencing pathway of Xist and shows the usefulness of haploid ESCs for genetic screening of epigenetic pathways. PMID:26190100

  18. Sequencing of mitochondrial genomes of nine Aspergillus and Penicillium species identifies mobile introns and accessory genes as main sources of genome size variability.

    PubMed

    Joardar, Vinita; Abrams, Natalie F; Hostetler, Jessica; Paukstelis, Paul J; Pakala, Suchitra; Pakala, Suman B; Zafar, Nikhat; Abolude, Olukemi O; Payne, Gary; Andrianopoulos, Alex; Denning, David W; Nierman, William C

    2012-12-12

    The genera Aspergillus and Penicillium include some of the most beneficial as well as the most harmful fungal species such as the penicillin-producer Penicillium chrysogenum and the human pathogen Aspergillus fumigatus, respectively. Their mitochondrial genomic sequences may hold vital clues into the mechanisms of their evolution, population genetics, and biology, yet only a handful of these genomes have been fully sequenced and annotated. Here we report the complete sequence and annotation of the mitochondrial genomes of six Aspergillus and three Penicillium species: A. fumigatus, A. clavatus, A. oryzae, A. flavus, Neosartorya fischeri (A. fischerianus), A. terreus, P. chrysogenum, P. marneffei, and Talaromyces stipitatus (P. stipitatum). The accompanying comparative analysis of these and related publicly available mitochondrial genomes reveals wide variation in size (25-36 Kb) among these closely related fungi. The sources of genome expansion include group I introns and accessory genes encoding putative homing endonucleases, DNA and RNA polymerases (presumed to be of plasmid origin) and hypothetical proteins. The two smallest sequenced genomes (A. terreus and P. chrysogenum) do not contain introns in protein-coding genes, whereas the largest genome (T. stipitatus), contains a total of eleven introns. All of the sequenced genomes have a group I intron in the large ribosomal subunit RNA gene, suggesting that this intron is fixed in these species. Subsequent analysis of several A. fumigatus strains showed low intraspecies variation. This study also includes a phylogenetic analysis based on 14 concatenated core mitochondrial proteins. The phylogenetic tree has a different topology from published multilocus trees, highlighting the challenges still facing the Aspergillus systematics. The study expands the genomic resources available to fungal biologists by providing mitochondrial genomes with consistent annotations for future genetic, evolutionary and population

  19. Comparative genome analysis of Pseudogymnoascus spp. reveals primarily clonal evolution with small genome fragments exchanged between lineages.

    PubMed

    Leushkin, Evgeny V; Logacheva, Maria D; Penin, Aleksey A; Sutormin, Roman A; Gerasimov, Evgeny S; Kochkina, Galina A; Ivanushkina, Natalia E; Vasilenko, Oleg V; Kondrashov, Alexey S; Ozerskaya, Svetlana M

    2015-05-21

    Pseudogymnoascus spp. is a wide group of fungi lineages in the family Pseudorotiaceae including an aggressive pathogen of bats P. destructans. Although several lineages of P. spp. were shown to produce ascospores in culture, the vast majority of P. spp. demonstrates no evidence of sexual reproduction. P. spp. can tolerate a wide range of different temperatures and salinities and can survive even in permafrost layer. Adaptability of P. spp. to different environments is accompanied by extremely variable morphology and physiology. We sequenced genotypes of 14 strains of P. spp., 5 of which were extracted from permafrost, 1 from a cryopeg, a layer of unfrozen ground in permafrost, and 8 from temperate surface environments. All sequenced genotypes are haploid. Nucleotide diversity among these genomes is very high, with a typical evolutionary distance at synonymous sites dS ≈ 0.5, suggesting that the last common ancestor of these strains lived >50 Mya. The strains extracted from permafrost do not form a separate clade. Instead, each permafrost strain has close relatives from temperate environments. We observed a strictly clonal population structure with no conflicting topologies for ~99% of genome sequences. However, there is a number of short (~100-10,000 nt) genomic segments with the total length of 67.6 Kb which possess phylogenetic patterns strikingly different from the rest of the genome. The most remarkable case is a MAT-locus, which has 2 distinct alleles interspersed along the whole-genome phylogenetic tree. Predominantly clonal structure of genome sequences is consistent with the observations that sexual reproduction is rare in P. spp. Small number of regions with noncanonical phylogenies seem to arise due to some recombination events between derived lineages of P. spp., with MAT-locus being transferred on multiple occasions. All sequenced strains have heterothallic configuration of MAT-locus.

  20. Direct transformation and plant regeneration of the haploid liverwort Marchantia polymorpha L.

    PubMed

    Takenaka, M; Yamaoka, S; Hanajiri, T; Shimizu-Ueda, Y; Yamato, K T; Fukuzawa, H; Ohyama, K

    2000-06-01

    Thalli of the haploid liverwort Marchantial polymorpha were successfully used for direct particle bombardment with plasmid pMT, which carries a hygromycin phosphotransferase gene (hpt) controlled by the CaMV 35S promoter and the NOS polyadenylation region. Hygromycin-resistant cell masses arose from the thallus surface and developed directly into hygromycin-resistant thalli. Southern blot analyses indicated that these thalli carried at least 1-4 copies of the hpt gene, which were stably transmitted to their asexual thallus progenies via gemma propagation for three generations. This transformation and direct plant regeneration protocol is expected to be a valuable tool for the molecular analysis of this lower land plant.

  1. Translating the "Banana Genome" to Delineate Stress Resistance, Dwarfing, Parthenocarpy and Mechanisms of Fruit Ripening.

    PubMed

    Dash, Prasanta K; Rai, Rhitu

    2016-01-01

    Evolutionary frozen, genetically sterile and globally iconic fruit "Banana" remained untouched by the green revolution and, as of today, researchers face intrinsic impediments for its varietal improvement. Recently, this wonder crop entered the genomics era with decoding of structural genome of double haploid Pahang (AA genome constitution) genotype of Musa acuminata . Its complex genome decoded by hybrid sequencing strategies revealed panoply of genes and transcription factors involved in the process of sucrose conversion that imparts sweetness to its fruit. Historically, banana has faced the wrath of pandemic bacterial, fungal, and viral diseases and multitude of abiotic stresses that has ruined the livelihood of small/marginal farmers' and destroyed commercial plantations. Decoding structural genome of this climacteric fruit has given impetus to a deeper understanding of the repertoire of genes involved in disease resistance, understanding the mechanism of dwarfing to develop an ideal plant type, unraveling the process of parthenocarpy, and fruit ripening for better fruit quality. Further, injunction of comparative genomics will usher in integration of information from its decoded genome and other monocots into field applications in banana related but not limited to yield enhancement, food security, livelihood assurance, and energy sustainability. In this mini review, we discuss pre- and post-genomic discoveries and highlight accomplishments in structural genomics, genetic engineering and forward genetic accomplishments with an aim to target genes and transcription factors for translational research in banana.

  2. Cell Size Influences the Reproductive Potential and Total Lifespan of the Saccharomyces cerevisiae Yeast as Revealed by the Analysis of Polyploid Strains.

    PubMed

    Zadrag-Tecza, Renata; Kwolek-Mirek, Magdalena; Alabrudzińska, Małgorzata; Skoneczna, Adrianna

    2018-01-01

    The total lifespan of the yeast Saccharomyces cerevisiae may be divided into two phases: the reproductive phase, during which the cell undergoes mitosis cycles to produce successive buds, and the postreproductive phase, which extends from the last division to cell death. These phases may be regulated by a common mechanism or by distinct ones. In this paper, we proposed a more comprehensive approach to reveal the mechanisms that regulate both reproductive potential and total lifespan in cell size context. Our study was based on yeast cells, whose size was determined by increased genome copy number, ranging from haploid to tetraploid. Such experiments enabled us to test the hypertrophy hypothesis, which postulates that excessive size achieved by the cell-the hypertrophy state-is the reason preventing the cell from further proliferation. This hypothesis defines the reproductive potential value as the difference between the maximal size that a cell can reach and the threshold value, which allows a cell to undergo its first cell cycle and the rate of the cell size to increase per generation. Here, we showed that cell size has an important impact on not only the reproductive potential but also the total lifespan of this cell. Moreover, the maximal cell size value, which limits its reproduction capacity, can be regulated by different factors and differs depending on the strain ploidy. The achievement of excessive size by the cell (hypertrophic state) may lead to two distinct phenomena: the cessation of reproduction without "mother" cell death and the cessation of reproduction with cell death by bursting, which has not been shown before.

  3. The m6A methyltransferase Ime4 epitranscriptionally regulates triacylglycerol metabolism and vacuolar morphology in haploid yeast cells.

    PubMed

    Yadav, Pradeep Kumar; Rajasekharan, Ram

    2017-08-18

    N 6 -Methyladenosine (m 6 A) is among the most common modifications in eukaryotic mRNA. The role of yeast m 6 A methyltransferase, Ime4, in meiosis and sporulation in diploid strains is very well studied, but its role in haploid strains has remained unknown. Here, with the help of an immunoblotting strategy and Ime4-GFP protein localization studies, we establish the physiological role of Ime4 in haploid cells. Our data showed that Ime4 epitranscriptionally regulates triacylglycerol metabolism and vacuolar morphology through the long-chain fatty acyl-CoA synthetase Faa1, independently of the RNA methylation complex (MIS complex). The MIS complex consists of the Ime4, Mum2, and Slz1 proteins. Our affinity enrichment strategy (methylated RNA immunoprecipitation assays) using m 6 A polyclonal antibodies coupled with mRNA isolation, quantitative real-time PCR, and standard PCR analyses confirmed the presence of m 6 A-modified FAA1 transcripts in haploid yeast cells. The term "epitranscriptional regulation" encompasses the RNA modification-mediated regulation of genes. Moreover, we demonstrate that the Aft2 transcription factor up-regulates FAA1 expression. Because the m 6 A methylation machinery is fundamentally conserved throughout eukaryotes, our findings will help advance the rapidly emerging field of RNA epitranscriptomics. The metabolic link identified here between m 6 A methylation and triacylglycerol metabolism via the Ime4 protein provides new insights into lipid metabolism and the pathophysiology of lipid-related metabolic disorders, such as obesity. Because the yeast vacuole is an analogue of the mammalian lysosome, our findings pave the way to better understand the role of m 6 A methylation in lysosome-related functions and diseases. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  4. Continuous Morphological Variation Correlated with Genome Size Indicates Frequent Introgressive Hybridization among Diphasiastrum Species (Lycopodiaceae) in Central Europe

    PubMed Central

    Hanušová, Kristýna; Ekrt, Libor; Vít, Petr; Kolář, Filip; Urfus, Tomáš

    2014-01-01

    Introgressive hybridization is an important evolutionary process frequently contributing to diversification and speciation of angiosperms. Its extent in other groups of land plants has only rarely been studied, however. We therefore examined the levels of introgression in the genus Diphasiastrum, a taxonomically challenging group of Lycopodiophytes, using flow cytometry and numerical and geometric morphometric analyses. Patterns of morphological and cytological variation were evaluated in an extensive dataset of 561 individuals from 57 populations of six taxa from Central Europe, the region with the largest known taxonomic complexity. In addition, genome size values of 63 individuals from Northern Europe were acquired for comparative purposes. Within Central European populations, we detected a continuous pattern in both morphological variation and genome size (strongly correlated together) suggesting extensive levels of interspecific gene flow within this region, including several large hybrid swarm populations. The secondary character of habitats of Central European hybrid swarm populations suggests that man-made landscape changes might have enhanced unnatural contact of species, resulting in extensive hybridization within this area. On the contrary, a distinct pattern of genome size variation among individuals from other parts of Europe indicates that pure populations prevail outside Central Europe. All in all, introgressive hybridization among Diphasiastrum species in Central Europe represents a unique case of extensive interspecific gene flow among spore producing vascular plants that cause serious complications of taxa delimitation. PMID:24932509

  5. A method for determining haploid and triploid genotypes and their association with vascular phenotypes in Williams syndrome and 7q11.23 duplication syndrome.

    PubMed

    Gregory, Michael D; Kolachana, Bhaskar; Yao, Yin; Nash, Tiffany; Dickinson, Dwight; Eisenberg, Daniel P; Mervis, Carolyn B; Berman, Karen F

    2018-04-04

    Williams syndrome ([WS], 7q11.23 hemideletion) and 7q11.23 duplication syndrome (Dup7) show contrasting syndromic symptoms. However, within each group there is considerable interindividual variability in the degree to which these phenotypes are expressed. Though software exists to identify areas of copy number variation (CNV) from commonly-available SNP-chip data, this software does not provide non-diploid genotypes in CNV regions. Here, we describe a method for identifying haploid and triploid genotypes in CNV regions, and then, as a proof-of-concept for applying this information to explain clinical variability, we test for genotype-phenotype associations. Blood samples for 25 individuals with WS and 13 individuals with Dup7 were genotyped with Illumina-HumanOmni5M SNP-chips. PennCNV and in-house code were used to make genotype calls for each SNP in the 7q11.23 locus. We tested for association between the presence of aortic arteriopathy and genotypes of the remaining (haploid in WS) or duplicated (triploid in Dup7) alleles. Haploid calls in the 7q11.23 region were made for 99.0% of SNPs in the WS group, and triploid calls for 98.8% of SNPs in those with Dup7. The G allele of SNP rs2528795 in the ELN gene was associated with aortic stenosis in WS participants (p < 0.0049) while the A allele of the same SNP was associated with aortic dilation in Dup7. Commonly available SNP-chip information can be used to make haploid and triploid calls in individuals with CNVs and then to relate variability in specific genes to variability in syndromic phenotypes, as demonstrated here using aortic arteriopathy. This work sets the stage for similar genotype-phenotype analyses in CNVs where phenotypes may be more complex and/or where there is less information about genetic mechanisms.

  6. A Deep-Coverage Tomato BAC Library and Prospects Toward Development of an STC Framework for Genome Sequencing

    PubMed Central

    Budiman, Muhammad A.; Mao, Long; Wood, Todd C.; Wing, Rod A.

    2000-01-01

    Recently a new strategy using BAC end sequences as sequence-tagged connectors (STCs) was proposed for whole-genome sequencing projects. In this study, we present the construction and detailed characterization of a 15.0 haploid genome equivalent BAC library for the cultivated tomato, Lycopersicon esculentum cv. Heinz 1706. The library contains 129,024 clones with an average insert size of 117.5 kb and a chloroplast content of 1.11%. BAC end sequences from 1490 ends were generated and analyzed as a preliminary evaluation for using this library to develop an STC framework to sequence the tomato genome. A total of 1205 BAC end sequences (80.9%) were obtained, with an average length of 360 high-quality bases, and were searched against the GenBank database. Using a cutoff expectation value of <10−6, and combining the results from BLASTN, BLASTX, and TBLASTX searches, 24.3% of the BAC end sequences were similar to known sequences, of which almost half (48.7%) share sequence similarities to retrotransposons and 7% to known genes. Some of the transposable element sequences were the first reported in tomato, such as sequences similar to maize transposon Activator (Ac) ORF and tobacco pararetrovirus-like sequences. Interestingly, there were no BAC end sequences similar to the highly repeated TGRI and TGRII elements. However, the majority (70.3%) of STCs did not share significant sequence similarities to any sequences in GenBank at either the DNA or predicted protein levels, indicating that a large portion of the tomato genome is still unknown. Our data demonstrate that this BAC library is suitable for developing an STC database to sequence the tomato genome. The advantages of developing an STC framework for whole-genome sequencing of tomato are discussed. [The BAC end sequences described in this paper have been deposited in the GenBank data library under accession nos. AQ367111–AQ368361.] PMID:10645957

  7. Doubled Haploid ‘CUDH2107’ as a Reference for Bulb Onion (Allium cepa L.) Research: Development of a Transcriptome Catalogue and Identification of Transcripts Associated with Male Fertility

    PubMed Central

    Khosa, Jiffinvir S.; Lee, Robyn; Bräuning, Sophia; Lord, Janice; Pither-Joyce, Meeghan; McCallum, John; Macknight, Richard C.

    2016-01-01

    Researchers working on model plants have derived great benefit from developing genomic and genetic resources using ‘reference’ genotypes. Onion has a large and highly heterozygous genome making the sharing of germplasm and analysis of sequencing data complicated. To simplify the discovery and analysis of genes underlying important onion traits, we are promoting the use of the homozygous double haploid line ‘CUDH2107’ by the onion research community. In the present investigation, we performed transcriptome sequencing on vegetative and reproductive tissues of CUDH2107 to develop a multi-organ reference transcriptome catalogue. A total of 396 million 100 base pair paired reads was assembled using the Trinity pipeline, resulting in 271,665 transcript contigs. This dataset was analysed for gene ontology and transcripts were classified on the basis of putative biological processes, molecular function and cellular localization. Significant differences were observed in transcript expression profiles between different tissues. To demonstrate the utility of our CUDH2107 transcriptome catalogue for understanding the genetic and molecular basis of various traits, we identified orthologues of rice genes involved in male fertility and flower development. These genes provide an excellent starting point for studying the molecular regulation, and the engineering of reproductive traits. PMID:27861615

  8. The effects of sample size on population genomic analyses--implications for the tests of neutrality.

    PubMed

    Subramanian, Sankar

    2016-02-20

    One of the fundamental measures of molecular genetic variation is the Watterson's estimator (θ), which is based on the number of segregating sites. The estimation of θ is unbiased only under neutrality and constant population growth. It is well known that the estimation of θ is biased when these assumptions are violated. However, the effects of sample size in modulating the bias was not well appreciated. We examined this issue in detail based on large-scale exome data and robust simulations. Our investigation revealed that sample size appreciably influences θ estimation and this effect was much higher for constrained genomic regions than that of neutral regions. For instance, θ estimated for synonymous sites using 512 human exomes was 1.9 times higher than that obtained using 16 exomes. However, this difference was 2.5 times for the nonsynonymous sites of the same data. We observed a positive correlation between the rate of increase in θ estimates (with respect to the sample size) and the magnitude of selection pressure. For example, θ estimated for the nonsynonymous sites of highly constrained genes (dN/dS < 0.1) using 512 exomes was 3.6 times higher than that estimated using 16 exomes. In contrast this difference was only 2 times for the less constrained genes (dN/dS > 0.9). The results of this study reveal the extent of underestimation owing to small sample sizes and thus emphasize the importance of sample size in estimating a number of population genomic parameters. Our results have serious implications for neutrality tests such as Tajima D, Fu-Li D and those based on the McDonald and Kreitman test: Neutrality Index and the fraction of adaptive substitutions. For instance, use of 16 exomes produced 2.4 times higher proportion of adaptive substitutions compared to that obtained using 512 exomes (24% vs 10 %).

  9. A genomic approach for isolating chloroplast microsatellite markers for Pachyptera kerere (Bignoniaceae)1

    PubMed Central

    Francisco, Jessica N. C.; Nazareno, Alison G.; Lohmann, Lúcia G.

    2016-01-01

    Premise of the study: In this study, we developed chloroplast microsatellite markers (cpSSRs) for Pachyptera kerere (Bignoniaceae) to investigate the population structure and genetic diversity of this species. Methods and Results: We used Illumina HiSeq data to reconstruct the chloroplast genome of P. kerere by a combination of de novo and reference-guided assembly. We then used the chloroplast genome to develop a set of cpSSRs from intergenic regions. Overall, 24 primer pairs were designed, 21 of which amplified successfully and were polymorphic, presenting three to nine alleles per locus. The unbiased haploid diversity per locus varied from 0.207 (Pac28) to 0.817 (Pac04). All but one locus amplified for all other taxa of Pachyptera. Conclusions: The markers reported here will serve as a basis for studies to assess the genetic structure and phylogeographic history of Pachyptera. PMID:27672522

  10. QTLs for important breeding characteristics in the doubled haploid oat progeny.

    PubMed

    Tanhuanpää, Pirjo; Manninen, Outi; Kiviharju, Elina

    2010-06-01

    A homozygous mapping population, consisting of doubled haploid (DH) oat (Avena sativa L.) plants generated through anther culture of F1 plants from the cross between the Finnish cultivar 'Aslak' and the Swedish cultivar 'Matilda', was used to construct an oat linkage map. Ten agronomic and quality traits were analyzed in the DH plants from field trials in 2005 and 2006. Leaf blotch (caused by Pyrenophora avenae) resistance was also evaluated in a greenhouse test with 2 different isolates. One to 8 quantitative trait loci (QTLs) were found to be associated with each trait studied. Some chromosomal regions affected more than 1 trait; for example, 4 regions affected both protein and oil content. This study gives valuable information to oat breeders concerning the inheritance of important traits, and it provides potential tools to assist breeding.

  11. Hybrid incompatibilities are affected by dominance and dosage in the haplodiploid wasp Nasonia

    PubMed Central

    Beukeboom, Leo W.; Koevoets, Tosca; Morales, Hernán E.; Ferber, Steven; van de Zande, Louis

    2015-01-01

    Study of genome incompatibilities in species hybrids is important for understanding the genetic basis of reproductive isolation and speciation. According to Haldane's rule hybridization affects the heterogametic sex more than the homogametic sex. Several theories have been proposed that attribute asymmetry in hybridization effects to either phenotype (sex) or genotype (heterogamety). Here we investigate the genetic basis of hybrid genome incompatibility in the haplodiploid wasp Nasonia using the powerful features of haploid males and sex reversal. We separately investigate the effects of heterozygosity (ploidy level) and sex by generating sex reversed diploid hybrid males and comparing them to genotypically similar haploid hybrid males and diploid hybrid females. Hybrid effects of sterility were more pronounced than of inviability, and were particularly strong in haploid males, but weak to absent in diploid males and females, indicating a strong ploidy level but no sex specific effect. Molecular markers identified a number of genomic regions associated with hybrid inviability in haploid males that disappeared under diploidy in both hybrid males and females. Hybrid inviability was rescued by dominance effects at some genomic regions, but aggravated or alleviated by dosage effects at other regions, consistent with cytonuclear incompatibilities. Dosage effects underlying Bateson–Dobzhansky–Muller (BDM) incompatibilities need more consideration in explaining Haldane's rule in diploid systems. PMID:25926847

  12. Comparisons with Caenorhabditis (approximately 100 Mb) and Drosophila (approximately 175 Mb) using flow cytometry show genome size in Arabidopsis to be approximately 157 Mb and thus approximately 25% larger than the Arabidopsis genome initiative estimate of approximately 125 Mb.

    PubMed

    Bennett, Michael D; Leitch, Ilia J; Price, H James; Johnston, J Spencer

    2003-04-01

    Recent genome sequencing papers have given genome sizes of 180 Mb for Drosophila melanogaster Iso-1 and 125 Mb for Arabidopsis thaliana Columbia. The former agrees with early cytochemical estimates, but numerous cytometric estimates of around 170 Mb imply that a genome size of 125 Mb for arabidopsis is an underestimate. In this study, nuclei of species pairs were compared directly using flow cytometry. Co-run Columbia and Iso-1 female gave a 2C peak for arabidopsis only approx. 15 % below that for drosophila, and 16C endopolyploid Columbia nuclei had approx. 15 % more DNA than 2C chicken nuclei (with >2280 Mb). Caenorhabditis elegans Bristol N2 (genome size approx. 100 Mb) co-run with Columbia or Iso-1 gave a 2C peak for drosophila approx. 75 % above that for 2C C. elegans, and a 2C peak for arabidopsis approx. 57 % above that for C. elegans. This confirms that 1C in drosophila is approx. 175 Mb and, combined with other evidence, leads us to conclude that the genome size of arabidopsis is not approx. 125 Mb, but probably approx. 157 Mb. It is likely that the discrepancy represents extra repeated sequences in unsequenced gaps in heterochromatic regions. Complete sequencing of the arabidopsis genome until no gaps remain at telomeres, nucleolar organizing regions or centromeres is still needed to provide the first precise angiosperm C-value as a benchmark calibration standard for plant genomes, and to ensure that no genes have been missed in arabidopsis, especially in centromeric regions, which are clearly larger than once imagined.

  13. Novel nuclei isolation buffer for flow cytometric genome size estimation of Zingiberaceae: a comparison with common isolation buffers

    PubMed Central

    Sadhu, Abhishek; Bhadra, Sreetama; Bandyopadhyay, Maumita

    2016-01-01

    Background and Aims Cytological parameters such as chromosome numbers and genome sizes of plants are used routinely for studying evolutionary aspects of polyploid plants. Members of Zingiberaceae show a wide range of inter- and intrageneric variation in their reproductive habits and ploidy levels. Conventional cytological study in this group of plants is severely hampered by the presence of diverse secondary metabolites, which also affect their genome size estimation using flow cytometry. None of the several nuclei isolation buffers used in flow cytometry could be used very successfully for members of Zingiberaceae to isolate good quality nuclei from both shoot and root tissues. Methods The competency of eight nuclei isolation buffers was compared with a newly formulated buffer, MB01, in six different genera of Zingiberaceae based on the fluorescence intensity of propidium iodide-stained nuclei using flow cytometric parameters, namely coefficient of variation of the G0/G1 peak, debris factor and nuclei yield factor. Isolated nuclei were studied using fluorescence microscopy and bio-scanning electron microscopy to analyse stain–nuclei interaction and nuclei topology, respectively. Genome contents of 21 species belonging to these six genera were determined using MB01. Key Results Flow cytometric parameters showed significant differences among the analysed buffers. MB01 exhibited the best combination of analysed parameters; photomicrographs obtained from fluorescence and electron microscopy supported the superiority of MB01 buffer over other buffers. Among the 21 species studied, nuclear DNA contents of 14 species are reported for the first time. Conclusions Results of the present study substantiate the enhanced efficacy of MB01, compared to other buffers tested, in the generation of acceptable cytograms from all species of Zingiberaceae studied. Our study facilitates new ways of sample preparation for further flow cytometric analysis of genome size of other members

  14. Single-molecule sequencing and Hi-C-based proximity-guided assembly of amaranth (Amaranthus hypochondriacus) chromosomes provide insights into genome evolution.

    PubMed

    Lightfoot, D J; Jarvis, D E; Ramaraj, T; Lee, R; Jellen, E N; Maughan, P J

    2017-08-31

    Amaranth (Amaranthus hypochondriacus) was a food staple among the ancient civilizations of Central and South America that has recently received increased attention due to the high nutritional value of the seeds, with the potential to help alleviate malnutrition and food security concerns, particularly in arid and semiarid regions of the developing world. Here, we present a reference-quality assembly of the amaranth genome which will assist the agronomic development of the species. Utilizing single-molecule, real-time sequencing (Pacific Biosciences) and chromatin interaction mapping (Hi-C) to close assembly gaps and scaffold contigs, respectively, we improved our previously reported Illumina-based assembly to produce a chromosome-scale assembly with a scaffold N50 of 24.4 Mb. The 16 largest scaffolds contain 98% of the assembly and likely represent the haploid chromosomes (n = 16). To demonstrate the accuracy and utility of this approach, we produced physical and genetic maps and identified candidate genes for the betalain pigmentation pathway. The chromosome-scale assembly facilitated a genome-wide syntenic comparison of amaranth with other Amaranthaceae species, revealing chromosome loss and fusion events in amaranth that explain the reduction from the ancestral haploid chromosome number (n = 18) for a tetraploid member of the Amaranthaceae. The assembly method reported here minimizes cost by relying primarily on short-read technology and is one of the first reported uses of in vivo Hi-C for assembly of a plant genome. Our analyses implicate chromosome loss and fusion as major evolutionary events in the 2n = 32 amaranths and clearly establish the homoeologous relationship among most of the subgenome chromosomes, which will facilitate future investigations of intragenomic changes that occurred post polyploidization.

  15. Comparative genomics of the marine bacterial genus Glaciecola reveals the high degree of genomic diversity and genomic characteristic for cold adaptation.

    PubMed

    Qin, Qi-Long; Xie, Bin-Bin; Yu, Yong; Shu, Yan-Li; Rong, Jin-Cheng; Zhang, Yan-Jiao; Zhao, Dian-Li; Chen, Xiu-Lan; Zhang, Xi-Ying; Chen, Bo; Zhou, Bai-Cheng; Zhang, Yu-Zhong

    2014-06-01

    To what extent the genomes of different species belonging to one genus can be diverse and the relationship between genomic differentiation and environmental factor remain unclear for oceanic bacteria. With many new bacterial genera and species being isolated from marine environments, this question warrants attention. In this study, we sequenced all the type strains of the published species of Glaciecola, a recently defined cold-adapted genus with species from diverse marine locations, to study the genomic diversity and cold-adaptation strategy in this genus.The genome size diverged widely from 3.08 to 5.96 Mb, which can be explained by massive gene gain and loss events. Horizontal gene transfer and new gene emergence contributed substantially to the genome size expansion. The genus Glaciecola had an open pan-genome. Comparative genomic research indicated that species of the genus Glaciecola had high diversity in genome size, gene content and genetic relatedness. This may be prevalent in marine bacterial genera considering the dynamic and complex environments of the ocean. Species of Glaciecola had some common genomic features related to cold adaptation, which enable them to thrive and play a role in biogeochemical cycle in the cold marine environments.

  16. Begin at the beginning: A BAC-end view of the passion fruit (Passiflora) genome.

    PubMed

    Santos, Anselmo Azevedo; Penha, Helen Alves; Bellec, Arnaud; Munhoz, Carla de Freitas; Pedrosa-Harand, Andrea; Bergès, Hélène; Vieira, Maria Lucia Carneiro

    2014-09-26

    The passion fruit (Passiflora edulis) is a tropical crop of economic importance both for juice production and consumption as fresh fruit. The juice is also used in concentrate blends that are consumed worldwide. However, very little is known about the genome of the species. Therefore, improving our understanding of passion fruit genomics is essential and to some degree a pre-requisite if its genetic resources are to be used more efficiently. In this study, we have constructed a large-insert BAC library and provided the first view on the structure and content of the passion fruit genome, using BAC-end sequence (BES) data as a major resource. The library consisted of 82,944 clones and its levels of organellar DNA were very low. The library represents six haploid genome equivalents, and the average insert size was 108 kb. To check its utility for gene isolation, successful macroarray screening experiments were carried out with probes complementary to eight Passiflora gene sequences available in public databases. BACs harbouring those genes were used in fluorescent in situ hybridizations and unique signals were detected for four BACs in three chromosomes (n=9). Then, we explored 10,000 BES and we identified reads likely to contain repetitive mobile elements (19.6% of all BES), simple sequence repeats and putative proteins, and to estimate the GC content (~42%) of the reads. Around 9.6% of all BES were found to have high levels of similarity to plant genes and ontological terms were assigned to more than half of the sequences analysed (940). The vast majority of the top-hits made by our sequences were to Populus trichocarpa (24.8% of the total occurrences), Theobroma cacao (21.6%), Ricinus communis (14.3%), Vitis vinifera (6.5%) and Prunus persica (3.8%). We generated the first large-insert library for a member of Passifloraceae. This BAC library provides a new resource for genetic and genomic studies, as well as it represents a valuable tool for future whole genome

  17. Insights into the red algae and eukaryotic evolution from the genome of Porphyra umbilicalis (Bangiophyceae, Rhodophyta)

    PubMed Central

    Brawley, Susan H.; Blouin, Nicolas A.; Ficko-Blean, Elizabeth; Wheeler, Glen L.; Lohr, Martin; Goodson, Holly V.; Jenkins, Jerry W.; Blaby-Haas, Crysten E.; Helliwell, Katherine E.; Chan, Cheong Xin; Marriage, Tara N.; Klein, Anita S.; Badis, Yacine; Brodie, Juliet; Cao, Yuanyu; Collén, Jonas; Dittami, Simon M.; Gachon, Claire M. M.; Green, Beverley R.; Karpowicz, Steven J.; Kim, Jay W.; Kudahl, Ulrich Johan; Lin, Senjie; Michel, Gurvan; Mittag, Maria; Olson, Bradley J. S. C.; Pangilinan, Jasmyn L.; Peng, Yi; Qiu, Huan; Shu, Shengqiang; Singer, John T.; Sprecher, Brittany N.; Wagner, Volker; Wang, Wenfei; Wang, Zhi-Yong; Yan, Juying; Yarish, Charles; Zäuner-Riek, Simone; Zhuang, Yunyun; Zou, Yong; Lindquist, Erika A.; Grimwood, Jane; Barry, Kerrie W.; Rokhsar, Daniel S.; Schmutz, Jeremy; Stiller, John W.; Grossman, Arthur R.; Prochnik, Simon E.

    2017-01-01

    Porphyra umbilicalis (laver) belongs to an ancient group of red algae (Bangiophyceae), is harvested for human food, and thrives in the harsh conditions of the upper intertidal zone. Here we present the 87.7-Mbp haploid Porphyra genome (65.8% G + C content, 13,125 gene loci) and elucidate traits that inform our understanding of the biology of red algae as one of the few multicellular eukaryotic lineages. Novel features of the Porphyra genome shared by other red algae relate to the cytoskeleton, calcium signaling, the cell cycle, and stress-tolerance mechanisms including photoprotection. Cytoskeletal motor proteins in Porphyra are restricted to a small set of kinesins that appear to be the only universal cytoskeletal motors within the red algae. Dynein motors are absent, and most red algae, including Porphyra, lack myosin. This surprisingly minimal cytoskeleton offers a potential explanation for why red algal cells and multicellular structures are more limited in size than in most multicellular lineages. Additional discoveries further relating to the stress tolerance of bangiophytes include ancestral enzymes for sulfation of the hydrophilic galactan-rich cell wall, evidence for mannan synthesis that originated before the divergence of green and red algae, and a high capacity for nutrient uptake. Our analyses provide a comprehensive understanding of the red algae, which are both commercially important and have played a major role in the evolution of other algal groups through secondary endosymbioses. PMID:28716924

  18. Insights into the red algae and eukaryotic evolution from the genome of Porphyra umbilicalis (Bangiophyceae, Rhodophyta).

    PubMed

    Brawley, Susan H; Blouin, Nicolas A; Ficko-Blean, Elizabeth; Wheeler, Glen L; Lohr, Martin; Goodson, Holly V; Jenkins, Jerry W; Blaby-Haas, Crysten E; Helliwell, Katherine E; Chan, Cheong Xin; Marriage, Tara N; Bhattacharya, Debashish; Klein, Anita S; Badis, Yacine; Brodie, Juliet; Cao, Yuanyu; Collén, Jonas; Dittami, Simon M; Gachon, Claire M M; Green, Beverley R; Karpowicz, Steven J; Kim, Jay W; Kudahl, Ulrich Johan; Lin, Senjie; Michel, Gurvan; Mittag, Maria; Olson, Bradley J S C; Pangilinan, Jasmyn L; Peng, Yi; Qiu, Huan; Shu, Shengqiang; Singer, John T; Smith, Alison G; Sprecher, Brittany N; Wagner, Volker; Wang, Wenfei; Wang, Zhi-Yong; Yan, Juying; Yarish, Charles; Zäuner-Riek, Simone; Zhuang, Yunyun; Zou, Yong; Lindquist, Erika A; Grimwood, Jane; Barry, Kerrie W; Rokhsar, Daniel S; Schmutz, Jeremy; Stiller, John W; Grossman, Arthur R; Prochnik, Simon E

    2017-08-01

    Porphyra umbilicalis (laver) belongs to an ancient group of red algae (Bangiophyceae), is harvested for human food, and thrives in the harsh conditions of the upper intertidal zone. Here we present the 87.7-Mbp haploid Porphyra genome (65.8% G + C content, 13,125 gene loci) and elucidate traits that inform our understanding of the biology of red algae as one of the few multicellular eukaryotic lineages. Novel features of the Porphyra genome shared by other red algae relate to the cytoskeleton, calcium signaling, the cell cycle, and stress-tolerance mechanisms including photoprotection. Cytoskeletal motor proteins in Porphyra are restricted to a small set of kinesins that appear to be the only universal cytoskeletal motors within the red algae. Dynein motors are absent, and most red algae, including Porphyra , lack myosin. This surprisingly minimal cytoskeleton offers a potential explanation for why red algal cells and multicellular structures are more limited in size than in most multicellular lineages. Additional discoveries further relating to the stress tolerance of bangiophytes include ancestral enzymes for sulfation of the hydrophilic galactan-rich cell wall, evidence for mannan synthesis that originated before the divergence of green and red algae, and a high capacity for nutrient uptake. Our analyses provide a comprehensive understanding of the red algae, which are both commercially important and have played a major role in the evolution of other algal groups through secondary endosymbioses.

  19. Insights into the red algae and eukaryotic evolution from the genome of Porphyra umbilicalis (Bangiophyceae, Rhodophyta)

    DOE PAGES

    Brawley, Susan H.; Blouin, Nicolas A.; Ficko-Blean, Elizabeth; ...

    2017-07-17

    Porphyra umbilicalis (laver) belongs to an ancient group of red algae (Bangiophyceae), is harvested for human food, and thrives in the harsh conditions of the upper intertidal zone. Here we present the 87.7-Mbp haploid Porphyra genome (65.8% G + C content, 13,125 gene loci) and elucidate traits that inform our understanding of the biology of red algae as one of the few multicellular eukaryotic lineages. Novel features of the Porphyra genome shared by other red algae relate to the cytoskeleton, calcium signaling, the cell cycle, and stress-tolerance mechanisms including photoprotection. Cytoskeletal motor proteins in Porphyra are restricted to a smallmore » set of kinesins that appear to be the only universal cytoskeletal motors within the red algae. Dynein motors are absent, and most red algae, including Porphyra, lack myosin. This surprisingly minimal cytoskeleton offers a potential explanation for why red algal cells and multicellular structures are more limited in size than in most multicellular lineages. Additional discoveries further relating to the stress tolerance of bangiophytes include ancestral enzymes for sulfation of the hydrophilic galactan-rich cell wall, evidence for mannan synthesis that originated before the divergence of green and red algae, and a high capacity for nutrient uptake. Our analyses provide a comprehensive understanding of the red algae, which are both commercially important and have played a major role in the evolution of other algal groups through secondary endosymbioses.« less

  20. Insights into the red algae and eukaryotic evolution from the genome of Porphyra umbilicalis (Bangiophyceae, Rhodophyta)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brawley, Susan H.; Blouin, Nicolas A.; Ficko-Blean, Elizabeth

    Porphyra umbilicalis (laver) belongs to an ancient group of red algae (Bangiophyceae), is harvested for human food, and thrives in the harsh conditions of the upper intertidal zone. Here we present the 87.7-Mbp haploid Porphyra genome (65.8% G + C content, 13,125 gene loci) and elucidate traits that inform our understanding of the biology of red algae as one of the few multicellular eukaryotic lineages. Novel features of the Porphyra genome shared by other red algae relate to the cytoskeleton, calcium signaling, the cell cycle, and stress-tolerance mechanisms including photoprotection. Cytoskeletal motor proteins in Porphyra are restricted to a smallmore » set of kinesins that appear to be the only universal cytoskeletal motors within the red algae. Dynein motors are absent, and most red algae, including Porphyra, lack myosin. This surprisingly minimal cytoskeleton offers a potential explanation for why red algal cells and multicellular structures are more limited in size than in most multicellular lineages. Additional discoveries further relating to the stress tolerance of bangiophytes include ancestral enzymes for sulfation of the hydrophilic galactan-rich cell wall, evidence for mannan synthesis that originated before the divergence of green and red algae, and a high capacity for nutrient uptake. Our analyses provide a comprehensive understanding of the red algae, which are both commercially important and have played a major role in the evolution of other algal groups through secondary endosymbioses.« less

  1. Phylogeny, rate variation, and genome size evolution of Pelargonium (Geraniaceae).

    PubMed

    Weng, Mao-Lun; Ruhlman, Tracey A; Gibby, Mary; Jansen, Robert K

    2012-09-01

    The phylogeny of 58 Pelargonium species was estimated using five plastid markers (rbcL, matK, ndhF, rpoC1, trnL-F) and one mitochondrial gene (nad5). The results confirmed the monophyly of three major clades and four subclades within Pelargonium but also indicate the need to revise some sectional classifications. This phylogeny was used to examine karyotype evolution in the genus: plotting chromosome sizes, numbers and 2C-values indicates that genome size is significantly correlated with chromosome size but not number. Accelerated rates of nucleotide substitution have been previously detected in both plastid and mitochondrial genes in Pelargonium, but sparse taxon sampling did not enable identification of the phylogenetic distribution of these elevated rates. Using the multigene phylogeny as a constraint, we investigated lineage- and locus-specific heterogeneity of substitution rates in Pelargonium for an expanded number of taxa and demonstrated that both plastid and mitochondrial genes have had accelerated substitution rates but with markedly disparate patterns. In the plastid, the exons of rpoC1 have significantly accelerated substitution rates compared to its intron and the acceleration was mainly due to nonsynonymous substitutions. In contrast, the mitochondrial gene, nad5, experienced substantial acceleration of synonymous substitution rates in three internal branches of Pelargonium, but this acceleration ceased in all terminal branches. Several lineages also have dN/dS ratios significantly greater than one for rpoC1, indicating that positive selection is acting on this gene, whereas the accelerated synonymous substitutions in the mitochondrial gene are the result of elevated mutation rates. Published by Elsevier Inc.

  2. Production of Androgenetic Zebrafish (Danio Rerio)

    PubMed Central

    Corley-Smith, G. E.; Lim, C. J.; Brandhorst, B. P.

    1996-01-01

    To help investigate the evolutionary origin of the imprinting (parent-of-origin mono-allelic expression) of paternal genes observed in mammals, we constructed haploid and diploid androgenetic zebrafish (Danio rerio). Haploid androgenotes were produced by fertilizing eggs that had been X-ray irradiated to eliminate the maternal genome. Subsequent inhibition of the first mitotic division of haploid androgenotes by heat shock produced diploid androgenotes. The lack of inheritance of maternal-specific DNA markers (RAPD and SSR) by putative diploid and haploid androgenotes confirmed the androgenetic origin of their genomes. Marker analysis was performed on 18 putative androgenotes (five diploids and 13 haploids) from six families. None of 157 maternal-specific RAPD markers analyzed, some of which were apparently homozygous, were passed on to any of these putative androgenotes. A mean of 7.7 maternal-specific markers were assessed per family. The survival of androgenetic zebrafish suggests that if paternal imprinting occurs in zebrafish, it does not result in essential genes being inactivated when their expression is required for development. Production of haploid androgenotes can be used to determine the meiotic recombination rate in male zebrafish. Androgenesis may also provide useful information about the mechanism of sex determination in zebrafish. PMID:8846903

  3. Parthenogenesis in a whitetip reef shark Triaenodon obesus involves a reduction in ploidy.

    PubMed

    Portnoy, D S; Hollenbeck, C M; Johnston, J S; Casman, H M; Gold, J R

    2014-08-01

    Genetic analysis of a female whitetip reef shark Triaenodon obesus and her stillborn pup, assumed to be of parthenogenetic origin, revealed that the pup was homozygous at all 24 nuclear-encoded microsatellites assayed, consistent with the idea that diploidy in the pup had been restored via terminal fusion. Flow cytometric analysis, however, indicated that the genome size of the pup was no more than half that of the mother, and microscopy revealed that nuclear volume was c. 1.73 times larger in the mother than in the pup. Together these data suggest that the pup was genetically haploid, developing directly from an unfertilized egg; as far as is known, this is the first observation of a spontaneously produced haploid vertebrate. © 2014 The Fisheries Society of the British Isles.

  4. An in vitro, short-term culture method for mammalian haploid round spermatids amenable for molecular manipulation.

    PubMed

    Dehnugara, Tushna; Dhar, Surbhi; Rao, M R Satyanarayana

    2012-01-01

    Extensive chromatin remodeling is a characteristic feature of mammalian spermiogenesis. To date, methods for the molecular manipulation of haploid spermatids are not available as there is a lack of a well-established culture system. Biochemical experiments and knockout studies reveal only the final outcome; studying the incremental details of the intricate mechanisms involved is still a challenge. We have established an in vitro culture system for pure haploid round spermatids isolated from rat testes that can be maintained with good viability for up to 72 hr. Changes in cell morphology and flagellar growth were also studied in the cultured spermatids. Further, we have demonstrated that upon treatment of cells with specific histone deacetylase inhibitors, sodium butyrate and trichostatin A, there is an increase in the hyperacetylation status of histone H4, mimicking an important event characteristic of histone replacement process that occurs during later stages of spermiogenesis. We have also tried various methods for introducing DNA and protein into these round spermatids in culture, and report that while DNA transfection is still a challenging task, protein transfection could be achieved using Chariot™ peptide as a transfection reagent. Thus, the method described here sets a stage to study the molecular roles of spermatid-specific proteins and chromatin remodelers in the cellular context. Copyright © 2011 Wiley Periodicals, Inc.

  5. Transmission of human mtDNA heteroplasmy in the Genome of the Netherlands families: support for a variable-size bottleneck

    PubMed Central

    Li, Mingkun; Rothwell, Rebecca; Vermaat, Martijn; Wachsmuth, Manja; Schröder, Roland; Laros, Jeroen F.J.; van Oven, Mannis; de Bakker, Paul I.W.; Bovenberg, Jasper A.; van Duijn, Cornelia M.; van Ommen, Gert-Jan B.; Slagboom, P. Eline; Swertz, Morris A.; Wijmenga, Cisca; Kayser, Manfred; Boomsma, Dorret I.; Zöllner, Sebastian; de Knijff, Peter; Stoneking, Mark

    2016-01-01

    Although previous studies have documented a bottleneck in the transmission of mtDNA genomes from mothers to offspring, several aspects remain unclear, including the size and nature of the bottleneck. Here, we analyze the dynamics of mtDNA heteroplasmy transmission in the Genomes of the Netherlands (GoNL) data, which consists of complete mtDNA genome sequences from 228 trios, eight dizygotic (DZ) twin quartets, and 10 monozygotic (MZ) twin quartets. Using a minor allele frequency (MAF) threshold of 2%, we identified 189 heteroplasmies in the trio mothers, of which 59% were transmitted to offspring, and 159 heteroplasmies in the trio offspring, of which 70% were inherited from the mothers. MZ twin pairs exhibited greater similarity in MAF at heteroplasmic sites than DZ twin pairs, suggesting that the heteroplasmy MAF in the oocyte is the major determinant of the heteroplasmy MAF in the offspring. We used a likelihood method to estimate the effective number of mtDNA genomes transmitted to offspring under different bottleneck models; a variable bottleneck size model provided the best fit to the data, with an estimated mean of nine individual mtDNA genomes transmitted. We also found evidence for negative selection during transmission against novel heteroplasmies (in which the minor allele has never been observed in polymorphism data). These novel heteroplasmies are enhanced for tRNA and rRNA genes, and mutations associated with mtDNA diseases frequently occur in these genes. Our results thus suggest that the female germ line is able to recognize and select against deleterious heteroplasmies. PMID:26916109

  6. Cell Size Influences the Reproductive Potential and Total Lifespan of the Saccharomyces cerevisiae Yeast as Revealed by the Analysis of Polyploid Strains

    PubMed Central

    Kwolek-Mirek, Magdalena; Alabrudzińska, Małgorzata

    2018-01-01

    The total lifespan of the yeast Saccharomyces cerevisiae may be divided into two phases: the reproductive phase, during which the cell undergoes mitosis cycles to produce successive buds, and the postreproductive phase, which extends from the last division to cell death. These phases may be regulated by a common mechanism or by distinct ones. In this paper, we proposed a more comprehensive approach to reveal the mechanisms that regulate both reproductive potential and total lifespan in cell size context. Our study was based on yeast cells, whose size was determined by increased genome copy number, ranging from haploid to tetraploid. Such experiments enabled us to test the hypertrophy hypothesis, which postulates that excessive size achieved by the cell—the hypertrophy state—is the reason preventing the cell from further proliferation. This hypothesis defines the reproductive potential value as the difference between the maximal size that a cell can reach and the threshold value, which allows a cell to undergo its first cell cycle and the rate of the cell size to increase per generation. Here, we showed that cell size has an important impact on not only the reproductive potential but also the total lifespan of this cell. Moreover, the maximal cell size value, which limits its reproduction capacity, can be regulated by different factors and differs depending on the strain ploidy. The achievement of excessive size by the cell (hypertrophic state) may lead to two distinct phenomena: the cessation of reproduction without “mother” cell death and the cessation of reproduction with cell death by bursting, which has not been shown before. PMID:29743970

  7. Genome-Wide Estimates of Mutation Rates and Spectrum in Schizosaccharomyces pombe Indicate CpG Sites are Highly Mutagenic Despite the Absence of DNA Methylation

    PubMed Central

    Behringer, Megan G.; Hall, David W.

    2015-01-01

    We accumulated mutations for 1952 generations in 79 initially identical, haploid lines of the fission yeast Schizosaccharomyces pombe, and then performed whole-genome sequencing to determine the mutation rates and spectrum. We captured 696 spontaneous mutations across the 79 mutation accumulation (MA) lines. We compared the mutation spectrum and rate to a recently published equivalent experiment on the same species, and to another model ascomycetous yeast, the budding yeast Saccharomyces cerevisiae. While the two species are approximately 600 million years diverged from each other, they share similar life histories, genome size and genomic G/C content. We found that Sc. pombe and S. cerevisiae have similar mutation rates, but Sc. pombe exhibits a stronger insertion bias. Intriguingly, we observed an increased mutation rate at cytosine nucleotides, specifically CpG nucleotides, which is also seen in S. cerevisiae. However, the absence of methylation in Sc. pombe and the pattern of mutation at these sites, primarily C → A as opposed to C → T, strongly suggest that the increased mutation rate is not caused by deamination of methylated cytosines. This result implies that the high mutability of CpG dinucleotides in other species may be caused in part by a methylation-independent mechanism. Many of our findings mirror those seen in the recent study, despite the use of different passaging conditions, indicating that MA is a reliable method for estimating mutation rates and spectra. PMID:26564949

  8. Nuclear DNA content and base composition in 28 taxa of Musa.

    PubMed

    Kamaté, K; Brown, S; Durand, P; Bureau, J M; De Nay, D; Trinh, T H

    2001-08-01

    The nuclear DNA content of 28 taxa of Musa was assessed by flow cytometry, using line PxPC6 of Petunia hybrida as an internal standard. The 2C DNA value of Musa balbisiana (BB genome) was 1.16 pg, whereas Musa acuminata (AA genome) had an average 2C DNA value of 1.27 pg, with a difference of 11% between its subspecies. The two haploid (IC) genomes, A and B, comprising most of the edible bananas, are therefore of similar size, 0.63 pg (610 million bp) and 0.58 pg (560 million bp), respectively. The genome of diploid Musa is thus threefold that of Arabidopsis thaliana. The genome sizes in a set of triploid Musa cultivars or clones were quite different, with 2C DNA values ranging from 1.61 to 2.23 pg. Likewise, the genome sizes of tetraploid cultivars ranged from 1.94 to 2.37 pg (2C). Apparently, tetraploids (for instance, accession I.C.2) can have a genome size that falls within the range of triploid genome sizes, and vice versa (as in the case of accession Simili Radjah). The 2C values estimated for organs such as leaf, leaf sheath, rhizome, and flower were consistent, whereas root material gave atypical results, owing to browning. The genomic base composition of these Musa taxa had a median value of 40.8% GC (SD = 0.43%).

  9. Single haplotype assembly of the human genome from a hydatidiform mole

    PubMed Central

    Steinberg, Karyn Meltz; Schneider, Valerie A.; Graves-Lindsay, Tina A.; Fulton, Robert S.; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A.; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C.; Church, Deanna M.; Eichler, Evan E.; Wilson, Richard K.

    2014-01-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. PMID:25373144

  10. The banana (Musa acuminata) genome and the evolution of monocotyledonous plants.

    PubMed

    D'Hont, Angélique; Denoeud, France; Aury, Jean-Marc; Baurens, Franc-Christophe; Carreel, Françoise; Garsmeur, Olivier; Noel, Benjamin; Bocs, Stéphanie; Droc, Gaëtan; Rouard, Mathieu; Da Silva, Corinne; Jabbari, Kamel; Cardi, Céline; Poulain, Julie; Souquet, Marlène; Labadie, Karine; Jourda, Cyril; Lengellé, Juliette; Rodier-Goud, Marguerite; Alberti, Adriana; Bernard, Maria; Correa, Margot; Ayyampalayam, Saravanaraj; Mckain, Michael R; Leebens-Mack, Jim; Burgess, Diane; Freeling, Mike; Mbéguié-A-Mbéguié, Didier; Chabannes, Matthieu; Wicker, Thomas; Panaud, Olivier; Barbosa, Jose; Hribova, Eva; Heslop-Harrison, Pat; Habas, Rémy; Rivallan, Ronan; Francois, Philippe; Poiron, Claire; Kilian, Andrzej; Burthia, Dheema; Jenny, Christophe; Bakry, Frédéric; Brown, Spencer; Guignon, Valentin; Kema, Gert; Dita, Miguel; Waalwijk, Cees; Joseph, Steeve; Dievart, Anne; Jaillon, Olivier; Leclercq, Julie; Argout, Xavier; Lyons, Eric; Almeida, Ana; Jeridi, Mouna; Dolezel, Jaroslav; Roux, Nicolas; Risterucci, Ange-Marie; Weissenbach, Jean; Ruiz, Manuel; Glaszmann, Jean-Christophe; Quétier, Francis; Yahiaoui, Nabila; Wincker, Patrick

    2012-08-09

    Bananas (Musa spp.), including dessert and cooking types, are giant perennial monocotyledonous herbs of the order Zingiberales, a sister group to the well-studied Poales, which include cereals. Bananas are vital for food security in many tropical and subtropical countries and the most popular fruit in industrialized countries. The Musa domestication process started some 7,000 years ago in Southeast Asia. It involved hybridizations between diverse species and subspecies, fostered by human migrations, and selection of diploid and triploid seedless, parthenocarpic hybrids thereafter widely dispersed by vegetative propagation. Half of the current production relies on somaclones derived from a single triploid genotype (Cavendish). Pests and diseases have gradually become adapted, representing an imminent danger for global banana production. Here we describe the draft sequence of the 523-megabase genome of a Musa acuminata doubled-haploid genotype, providing a crucial stepping-stone for genetic improvement of banana. We detected three rounds of whole-genome duplications in the Musa lineage, independently of those previously described in the Poales lineage and the one we detected in the Arecales lineage. This first monocotyledon high-continuity whole-genome sequence reported outside Poales represents an essential bridge for comparative genome analysis in plants. As such, it clarifies commelinid-monocotyledon phylogenetic relationships, reveals Poaceae-specific features and has led to the discovery of conserved non-coding sequences predating monocotyledon-eudicotyledon divergence.

  11. A Constant Rate of Spontaneous Mutation in DNA-Based Microbes

    NASA Astrophysics Data System (ADS)

    Drake, John W.

    1991-08-01

    In terms of evolution and fitness, the most significant spontaneous mutation rate is likely to be that for the entire genome (or its nonfrivolous fraction). Information is now available to calculate this rate for several DNA-based haploid microbes, including bacteriophages with single- or double-stranded DNA, a bacterium, a yeast, and a filamentous fungus. Their genome sizes vary by ≈6500-fold. Their average mutation rates per base pair vary by ≈16,000-fold, whereas their mutation rates per genome vary by only ≈2.5-fold, apparently randomly, around a mean value of 0.0033 per DNA replication. The average mutation rate per base pair is inversely proportional to genome size. Therefore, a nearly invariant microbial mutation rate appears to have evolved. Because this rate is uniform in such diverse organisms, it is likely to be determined by deep general forces, perhaps by a balance between the usually deleterious effects of mutation and the physiological costs of further reducing mutation rates.

  12. Three tiers of genome evolution in reptiles

    PubMed Central

    Organ, Chris L.; Moreno, Ricardo Godínez; Edwards, Scott V.

    2008-01-01

    Characterization of reptilian genomes is essential for understanding the overall diversity and evolution of amniote genomes, because reptiles, which include birds, constitute a major fraction of the amniote evolutionary tree. To better understand the evolution and diversity of genomic characteristics in Reptilia, we conducted comparative analyses of online sequence data from Alligator mississippiensis (alligator) and Sphenodon punctatus (tuatara) as well as genome size and karyological data from a wide range of reptilian species. At the whole-genome and chromosomal tiers of organization, we find that reptilian genome size distribution is consistent with a model of continuous gradual evolution while genomic compartmentalization, as manifested in the number of microchromosomes and macrochromosomes, appears to have undergone early rapid change. At the sequence level, the third genomic tier, we find that exon size in Alligator is distributed in a pattern matching that of exons in Gallus (chicken), especially in the 101—200 bp size class. A small spike in the fraction of exons in the 301 bp—1 kb size class is also observed for Alligator, but more so for Sphenodon. For introns, we find that members of Reptilia have a larger fraction of introns within the 101 bp–2 kb size class and a lower fraction of introns within the 5–30 kb size class than do mammals. These findings suggest that the mode of reptilian genome evolution varies across three hierarchical levels of the genome, a pattern consistent with a mosaic model of genomic evolution. PMID:21669810

  13. Genome of an arbuscular mycorrhizal fungus provides insight into the oldest plant symbiosis.

    PubMed

    Tisserant, Emilie; Malbreil, Mathilde; Kuo, Alan; Kohler, Annegret; Symeonidi, Aikaterini; Balestrini, Raffaella; Charron, Philippe; Duensing, Nina; Frei dit Frey, Nicolas; Gianinazzi-Pearson, Vivienne; Gilbert, Luz B; Handa, Yoshihiro; Herr, Joshua R; Hijri, Mohamed; Koul, Raman; Kawaguchi, Masayoshi; Krajinski, Franziska; Lammers, Peter J; Masclaux, Frederic G; Murat, Claude; Morin, Emmanuelle; Ndikumana, Steve; Pagni, Marco; Petitpierre, Denis; Requena, Natalia; Rosikiewicz, Pawel; Riley, Rohan; Saito, Katsuharu; San Clemente, Hélène; Shapiro, Harris; van Tuinen, Diederik; Bécard, Guillaume; Bonfante, Paola; Paszkowski, Uta; Shachar-Hill, Yair Y; Tuskan, Gerald A; Young, J Peter W; Young, Peter W; Sanders, Ian R; Henrissat, Bernard; Rensing, Stefan A; Grigoriev, Igor V; Corradi, Nicolas; Roux, Christophe; Martin, Francis

    2013-12-10

    The mutualistic symbiosis involving Glomeromycota, a distinctive phylum of early diverging Fungi, is widely hypothesized to have promoted the evolution of land plants during the middle Paleozoic. These arbuscular mycorrhizal fungi (AMF) perform vital functions in the phosphorus cycle that are fundamental to sustainable crop plant productivity. The unusual biological features of AMF have long fascinated evolutionary biologists. The coenocytic hyphae host a community of hundreds of nuclei and reproduce clonally through large multinucleated spores. It has been suggested that the AMF maintain a stable assemblage of several different genomes during the life cycle, but this genomic organization has been questioned. Here we introduce the 153-Mb haploid genome of Rhizophagus irregularis and its repertoire of 28,232 genes. The observed low level of genome polymorphism (0.43 SNP per kb) is not consistent with the occurrence of multiple, highly diverged genomes. The expansion of mating-related genes suggests the existence of cryptic sex-related processes. A comparison of gene categories confirms that R. irregularis is close to the Mucoromycotina. The AMF obligate biotrophy is not explained by genome erosion or any related loss of metabolic complexity in central metabolism, but is marked by a lack of genes encoding plant cell wall-degrading enzymes and of genes involved in toxin and thiamine synthesis. A battery of mycorrhiza-induced secreted proteins is expressed in symbiotic tissues. The present comprehensive repertoire of R. irregularis genes provides a basis for future research on symbiosis-related mechanisms in Glomeromycota.

  14. Genome of an arbuscular mycorrhizal fungus provides insight into the oldest plant symbiosis

    PubMed Central

    Tisserant, Emilie; Malbreil, Mathilde; Kuo, Alan; Kohler, Annegret; Symeonidi, Aikaterini; Balestrini, Raffaella; Charron, Philippe; Duensing, Nina; Frei dit Frey, Nicolas; Gianinazzi-Pearson, Vivienne; Gilbert, Luz B.; Handa, Yoshihiro; Herr, Joshua R.; Hijri, Mohamed; Koul, Raman; Kawaguchi, Masayoshi; Krajinski, Franziska; Lammers, Peter J.; Masclaux, Frederic G.; Murat, Claude; Morin, Emmanuelle; Ndikumana, Steve; Pagni, Marco; Petitpierre, Denis; Requena, Natalia; Rosikiewicz, Pawel; Riley, Rohan; Saito, Katsuharu; San Clemente, Hélène; Shapiro, Harris; van Tuinen, Diederik; Bécard, Guillaume; Bonfante, Paola; Paszkowski, Uta; Shachar-Hill, Yair Y.; Tuskan, Gerald A.; Young, J. Peter W.; Sanders, Ian R.; Henrissat, Bernard; Rensing, Stefan A.; Grigoriev, Igor V.; Corradi, Nicolas; Roux, Christophe; Martin, Francis

    2013-01-01

    The mutualistic symbiosis involving Glomeromycota, a distinctive phylum of early diverging Fungi, is widely hypothesized to have promoted the evolution of land plants during the middle Paleozoic. These arbuscular mycorrhizal fungi (AMF) perform vital functions in the phosphorus cycle that are fundamental to sustainable crop plant productivity. The unusual biological features of AMF have long fascinated evolutionary biologists. The coenocytic hyphae host a community of hundreds of nuclei and reproduce clonally through large multinucleated spores. It has been suggested that the AMF maintain a stable assemblage of several different genomes during the life cycle, but this genomic organization has been questioned. Here we introduce the 153-Mb haploid genome of Rhizophagus irregularis and its repertoire of 28,232 genes. The observed low level of genome polymorphism (0.43 SNP per kb) is not consistent with the occurrence of multiple, highly diverged genomes. The expansion of mating-related genes suggests the existence of cryptic sex-related processes. A comparison of gene categories confirms that R. irregularis is close to the Mucoromycotina. The AMF obligate biotrophy is not explained by genome erosion or any related loss of metabolic complexity in central metabolism, but is marked by a lack of genes encoding plant cell wall-degrading enzymes and of genes involved in toxin and thiamine synthesis. A battery of mycorrhiza-induced secreted proteins is expressed in symbiotic tissues. The present comprehensive repertoire of R. irregularis genes provides a basis for future research on symbiosis-related mechanisms in Glomeromycota. PMID:24277808

  15. Genome Sequencing and Comparative Genomics of the Broad Host-Range Pathogen Rhizoctonia solani AG8

    PubMed Central

    Hane, James K.; Anderson, Jonathan P.; Williams, Angela H.; Sperschneider, Jana; Singh, Karam B.

    2014-01-01

    Rhizoctonia solani is a soil-borne basidiomycete fungus with a necrotrophic lifestyle which is classified into fourteen reproductively incompatible anastomosis groups (AGs). One of these, AG8, is a devastating pathogen causing bare patch of cereals, brassicas and legumes. R. solani is a multinucleate heterokaryon containing significant heterozygosity within a single cell. This complexity posed significant challenges for the assembly of its genome. We present a high quality genome assembly of R. solani AG8 and a manually curated set of 13,964 genes supported by RNA-seq. The AG8 genome assembly used novel methods to produce a haploid representation of its heterokaryotic state. The whole-genomes of AG8, the rice pathogen AG1-IA and the potato pathogen AG3 were observed to be syntenic and co-linear. Genes and functions putatively relevant to pathogenicity were highlighted by comparing AG8 to known pathogenicity genes, orthology databases spanning 197 phytopathogenic taxa and AG1-IA. We also observed SNP-level “hypermutation” of CpG dinucleotides to TpG between AG8 nuclei, with similarities to repeat-induced point mutation (RIP). Interestingly, gene-coding regions were widely affected along with repetitive DNA, which has not been previously observed for RIP in mononuclear fungi of the Pezizomycotina. The rate of heterozygous SNP mutations within this single isolate of AG8 was observed to be higher than SNP mutation rates observed across populations of most fungal species compared. Comparative analyses were combined to predict biological processes relevant to AG8 and 308 proteins with effector-like characteristics, forming a valuable resource for further study of this pathosystem. Predicted effector-like proteins had elevated levels of non-synonymous point mutations relative to synonymous mutations (dN/dS), suggesting that they may be under diversifying selection pressures. In addition, the distant relationship to sequenced necrotrophs of the Ascomycota suggests the

  16. Evolutionary and ecological implications of genome size in the North American endemic sagebrushes and allies (Artemisia, Asteraceae)

    Treesearch

    Sonia Garcia; Miguel A. Canela; Teresa Garnatje; E. Durant McArthur; Jaume Pellicer; Stewart C. Sanderson; Joan Valles

    2008-01-01

    The genome size of 51 populations of 20 species of the North American endemic sagebrushes (subgenus Tridentatae), related species, and some hybrid taxa were assessed by flow cytometry, and were analysed in a phylogenetic framework. Results were similar for most Tridentatae species, with the exception of three taxonomically conflictive species: Artemisia bigelovii Gray...

  17. Using Partial Genomic Fosmid Libraries for Sequencing CompleteOrganellar Genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McNeal, Joel R.; Leebens-Mack, James H.; Arumuganathan, K.

    2005-08-26

    Organellar genome sequences provide numerous phylogenetic markers and yield insight into organellar function and molecular evolution. These genomes are much smaller in size than their nuclear counterparts; thus, their complete sequencing is much less expensive than total nuclear genome sequencing, making broader phylogenetic sampling feasible. However, for some organisms it is challenging to isolate plastid DNA for sequencing using standard methods. To overcome these difficulties, we constructed partial genomic libraries from total DNA preparations of two heterotrophic and two autotrophic angiosperm species using fosmid vectors. We then used macroarray screening to isolate clones containing large fragments of plastid DNA. Amore » minimum tiling path of clones comprising the entire genome sequence of each plastid was selected, and these clones were shotgun-sequenced and assembled into complete genomes. Although this method worked well for both heterotrophic and autotrophic plants, nuclear genome size had a dramatic effect on the proportion of screened clones containing plastid DNA and, consequently, the overall number of clones that must be screened to ensure full plastid genome coverage. This technique makes it possible to determine complete plastid genome sequences for organisms that defy other available organellar genome sequencing methods, especially those for which limited amounts of tissue are available.« less

  18. A genomics approach to understanding the role of auxin in apple (Malus x domestica) fruit size control.

    PubMed

    Devoghalaere, Fanny; Doucen, Thomas; Guitton, Baptiste; Keeling, Jeannette; Payne, Wendy; Ling, Toby John; Ross, John James; Hallett, Ian Charles; Gunaseelan, Kularajathevan; Dayatilake, G A; Diak, Robert; Breen, Ken C; Tustin, D Stuart; Costes, Evelyne; Chagné, David; Schaffer, Robert James; David, Karine Myriam

    2012-01-13

    Auxin is an important phytohormone for fleshy fruit development, having been shown to be involved in the initial signal for fertilisation, fruit size through the control of cell division and cell expansion, and ripening related events. There is considerable knowledge of auxin-related genes, mostly from work in model species. With the apple genome now available, it is possible to carry out genomics studies on auxin-related genes to identify genes that may play roles in specific stages of apple fruit development. High amounts of auxin in the seed compared with the fruit cortex were observed in 'Royal Gala' apples, with amounts increasing through fruit development. Injection of exogenous auxin into developing apples at the start of cell expansion caused an increase in cell size. An expression analysis screen of auxin-related genes involved in auxin reception, homeostasis, and transcriptional regulation showed complex patterns of expression in each class of gene. Two mapping populations were phenotyped for fruit size over multiple seasons, and multiple quantitative trait loci (QTLs) were observed. One QTL mapped to a region containing an Auxin Response Factor (ARF106). This gene is expressed during cell division and cell expansion stages, consistent with a potential role in the control of fruit size. The application of exogenous auxin to apples increased cell expansion, suggesting that endogenous auxin concentrations are at least one of the limiting factors controlling fruit size. The expression analysis of ARF106 linked to a strong QTL for fruit weight suggests that the auxin signal regulating fruit size could partially be modulated through the function of this gene. One class of gene (GH3) removes free auxin by conjugation to amino acids. The lower expression of these GH3 genes during rapid fruit expansion is consistent with the apple maximising auxin concentrations at this point.

  19. A genomics approach to understanding the role of auxin in apple (Malus x domestica) fruit size control

    PubMed Central

    2012-01-01

    Background Auxin is an important phytohormone for fleshy fruit development, having been shown to be involved in the initial signal for fertilisation, fruit size through the control of cell division and cell expansion, and ripening related events. There is considerable knowledge of auxin-related genes, mostly from work in model species. With the apple genome now available, it is possible to carry out genomics studies on auxin-related genes to identify genes that may play roles in specific stages of apple fruit development. Results High amounts of auxin in the seed compared with the fruit cortex were observed in 'Royal Gala' apples, with amounts increasing through fruit development. Injection of exogenous auxin into developing apples at the start of cell expansion caused an increase in cell size. An expression analysis screen of auxin-related genes involved in auxin reception, homeostasis, and transcriptional regulation showed complex patterns of expression in each class of gene. Two mapping populations were phenotyped for fruit size over multiple seasons, and multiple quantitative trait loci (QTLs) were observed. One QTL mapped to a region containing an Auxin Response Factor (ARF106). This gene is expressed during cell division and cell expansion stages, consistent with a potential role in the control of fruit size. Conclusions The application of exogenous auxin to apples increased cell expansion, suggesting that endogenous auxin concentrations are at least one of the limiting factors controlling fruit size. The expression analysis of ARF106 linked to a strong QTL for fruit weight suggests that the auxin signal regulating fruit size could partially be modulated through the function of this gene. One class of gene (GH3) removes free auxin by conjugation to amino acids. The lower expression of these GH3 genes during rapid fruit expansion is consistent with the apple maximising auxin concentrations at this point. PMID:22243694

  20. Estimation of (co)variances for genomic regions of flexible sizes: application to complex infectious udder diseases in dairy cattle

    PubMed Central

    2012-01-01

    Background Multi-trait genomic models in a Bayesian context can be used to estimate genomic (co)variances, either for a complete genome or for genomic regions (e.g. per chromosome) for the purpose of multi-trait genomic selection or to gain further insight into the genomic architecture of related traits such as mammary disease traits in dairy cattle. Methods Data on progeny means of six traits related to mastitis resistance in dairy cattle (general mastitis resistance and five pathogen-specific mastitis resistance traits) were analyzed using a bivariate Bayesian SNP-based genomic model with a common prior distribution for the marker allele substitution effects and estimation of the hyperparameters in this prior distribution from the progeny means data. From the Markov chain Monte Carlo samples of the allele substitution effects, genomic (co)variances were calculated on a whole-genome level, per chromosome, and in regions of 100 SNP on a chromosome. Results Genomic proportions of the total variance differed between traits. Genomic correlations were lower than pedigree-based genetic correlations and they were highest between general mastitis and pathogen-specific traits because of the part-whole relationship between these traits. The chromosome-wise genomic proportions of the total variance differed between traits, with some chromosomes explaining higher or lower values than expected in relation to chromosome size. Few chromosomes showed pleiotropic effects and only chromosome 19 had a clear effect on all traits, indicating the presence of QTL with a general effect on mastitis resistance. The region-wise patterns of genomic variances differed between traits. Peaks indicating QTL were identified but were not very distinctive because a common prior for the marker effects was used. There was a clear difference in the region-wise patterns of genomic correlation among combinations of traits, with distinctive peaks indicating the presence of pleiotropic QTL. Conclusions

  1. Karyotype diversity and genome size variation in Neotropical Maxillariinae orchids.

    PubMed

    Moraes, A P; Koehler, S; Cabral, J S; Gomes, S S L; Viccini, L F; Barros, F; Felix, L P; Guerra, M; Forni-Martins, E R

    2017-03-01

    Orchidaceae is a widely distributed plant family with very diverse vegetative and floral morphology, and such variability is also reflected in their karyotypes. However, since only a low proportion of Orchidaceae has been analysed for chromosome data, greater diversity may await to be unveiled. Here we analyse both genome size (GS) and karyotype in two subtribes recently included in the broadened Maxillariinea to detect how much chromosome and GS variation there is in these groups and to evaluate which genome rearrangements are involved in the species evolution. To do so, the GS (14 species), the karyotype - based on chromosome number, heterochromatic banding and 5S and 45S rDNA localisation (18 species) - was characterised and analysed along with published data using phylogenetic approaches. The GS presented a high phylogenetic correlation and it was related to morphological groups in Bifrenaria (larger plants - higher GS). The two largest GS found among genera were caused by different mechanisms: polyploidy in Bifrenaria tyrianthina and accumulation of repetitive DNA in Scuticaria hadwenii. The chromosome number variability was caused mainly through descending dysploidy, and x=20 was estimated as the base chromosome number. Combining GS and karyotype data with molecular phylogeny, our data provide a more complete scenario of the karyotype evolution in Maxillariinae orchids, allowing us to suggest, besides dysploidy, that inversions and transposable elements as two mechanisms involved in the karyotype evolution. Such karyotype modifications could be associated with niche changes that occurred during species evolution. © 2016 German Botanical Society and The Royal Botanical Society of the Netherlands.

  2. Novel nuclei isolation buffer for flow cytometric genome size estimation of Zingiberaceae: a comparison with common isolation buffers.

    PubMed

    Sadhu, Abhishek; Bhadra, Sreetama; Bandyopadhyay, Maumita

    2016-11-01

    Cytological parameters such as chromosome numbers and genome sizes of plants are used routinely for studying evolutionary aspects of polyploid plants. Members of Zingiberaceae show a wide range of inter- and intrageneric variation in their reproductive habits and ploidy levels. Conventional cytological study in this group of plants is severely hampered by the presence of diverse secondary metabolites, which also affect their genome size estimation using flow cytometry. None of the several nuclei isolation buffers used in flow cytometry could be used very successfully for members of Zingiberaceae to isolate good quality nuclei from both shoot and root tissues. The competency of eight nuclei isolation buffers was compared with a newly formulated buffer, MB01, in six different genera of Zingiberaceae based on the fluorescence intensity of propidium iodide-stained nuclei using flow cytometric parameters, namely coefficient of variation of the G 0 /G 1 peak, debris factor and nuclei yield factor. Isolated nuclei were studied using fluorescence microscopy and bio-scanning electron microscopy to analyse stain-nuclei interaction and nuclei topology, respectively. Genome contents of 21 species belonging to these six genera were determined using MB01. Flow cytometric parameters showed significant differences among the analysed buffers. MB01 exhibited the best combination of analysed parameters; photomicrographs obtained from fluorescence and electron microscopy supported the superiority of MB01 buffer over other buffers. Among the 21 species studied, nuclear DNA contents of 14 species are reported for the first time. Results of the present study substantiate the enhanced efficacy of MB01, compared to other buffers tested, in the generation of acceptable cytograms from all species of Zingiberaceae studied. Our study facilitates new ways of sample preparation for further flow cytometric analysis of genome size of other members belonging to this highly complex polyploid family

  3. A Targeted Capture Linkage Map Anchors the Genome of the Schistosomiasis Vector Snail, Biomphalaria glabrata.

    PubMed

    Tennessen, Jacob A; Bollmann, Stephanie R; Blouin, Michael S

    2017-07-05

    The aquatic planorbid snail Biomphalaria glabrata is one of the most intensively-studied mollusks due to its role in the transmission of schistosomiasis. Its 916 Mb genome has recently been sequenced and annotated, but it remains poorly assembled. Here, we used targeted capture markers to map over 10,000 B. glabrata scaffolds in a linkage cross of 94 F1 offspring, generating 24 linkage groups (LGs). We added additional scaffolds to these LGs based on linkage disequilibrium (LD) analysis of targeted capture and whole-genome sequences of 96 unrelated snails. Our final linkage map consists of 18,613 scaffolds comprising 515 Mb, representing 56% of the genome and 75% of genic and nonrepetitive regions. There are 18 large (> 10 Mb) LGs, likely representing the expected 18 haploid chromosomes, and > 50% of the genome has been assigned to LGs of at least 17 Mb. Comparisons with other gastropod genomes reveal patterns of synteny and chromosomal rearrangements. Linkage relationships of key immune-relevant genes may help clarify snail-schistosome interactions. By focusing on linkage among genic and nonrepetitive regions, we have generated a useful resource for associating snail phenotypes with causal genes, even in the absence of a complete genome assembly. A similar approach could potentially improve numerous poorly-assembled genomes in other taxa. This map will facilitate future work on this host of a serious human parasite. Copyright © 2017 Tennessen et al.

  4. Efficient privacy-preserving string search and an application in genomics.

    PubMed

    Shimizu, Kana; Nuida, Koji; Rätsch, Gunnar

    2016-06-01

    Personal genomes carry inherent privacy risks and protecting privacy poses major social and technological challenges. We consider the case where a user searches for genetic information (e.g. an allele) on a server that stores a large genomic database and aims to receive allele-associated information. The user would like to keep the query and result private and the server the database. We propose a novel approach that combines efficient string data structures such as the Burrows-Wheeler transform with cryptographic techniques based on additive homomorphic encryption. We assume that the sequence data is searchable in efficient iterative query operations over a large indexed dictionary, for instance, from large genome collections and employing the (positional) Burrows-Wheeler transform. We use a technique called oblivious transfer that is based on additive homomorphic encryption to conceal the sequence query and the genomic region of interest in positional queries. We designed and implemented an efficient algorithm for searching sequences of SNPs in large genome databases. During search, the user can only identify the longest match while the server does not learn which sequence of SNPs the user queried. In an experiment based on 2184 aligned haploid genomes from the 1000 Genomes Project, our algorithm was able to perform typical queries within [Formula: see text] 4.6 s and [Formula: see text] 10.8 s for client and server side, respectively, on laptop computers. The presented algorithm is at least one order of magnitude faster than an exhaustive baseline algorithm. https://github.com/iskana/PBWT-sec and https://github.com/ratschlab/PBWT-sec shimizu-kana@aist.go.jp or Gunnar.Ratsch@ratschlab.org Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  5. Efficient privacy-preserving string search and an application in genomics

    PubMed Central

    Shimizu, Kana; Nuida, Koji; Rätsch, Gunnar

    2016-01-01

    Motivation: Personal genomes carry inherent privacy risks and protecting privacy poses major social and technological challenges. We consider the case where a user searches for genetic information (e.g. an allele) on a server that stores a large genomic database and aims to receive allele-associated information. The user would like to keep the query and result private and the server the database. Approach: We propose a novel approach that combines efficient string data structures such as the Burrows–Wheeler transform with cryptographic techniques based on additive homomorphic encryption. We assume that the sequence data is searchable in efficient iterative query operations over a large indexed dictionary, for instance, from large genome collections and employing the (positional) Burrows–Wheeler transform. We use a technique called oblivious transfer that is based on additive homomorphic encryption to conceal the sequence query and the genomic region of interest in positional queries. Results: We designed and implemented an efficient algorithm for searching sequences of SNPs in large genome databases. During search, the user can only identify the longest match while the server does not learn which sequence of SNPs the user queried. In an experiment based on 2184 aligned haploid genomes from the 1000 Genomes Project, our algorithm was able to perform typical queries within ≈ 4.6 s and ≈ 10.8 s for client and server side, respectively, on laptop computers. The presented algorithm is at least one order of magnitude faster than an exhaustive baseline algorithm. Availability and implementation: https://github.com/iskana/PBWT-sec and https://github.com/ratschlab/PBWT-sec. Contacts: shimizu-kana@aist.go.jp or Gunnar.Ratsch@ratschlab.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153731

  6. Empirical comparison between different methods for genomic prediction of number of piglets born alive in moderate sized breeding populations.

    PubMed

    Fangmann, A; Sharifi, R A; Heinkel, J; Danowski, K; Schrade, H; Erbe, M; Simianer, H

    2017-04-01

    Currently used multi-step methods to incorporate genomic information in the prediction of breeding values (BV) implicitly involve many assumptions which, if violated, may result in loss of information, inaccuracies and bias. To overcome this, single-step genomic best linear unbiased prediction (ssGBLUP) was proposed combining pedigree, phenotype and genotype of all individuals for genetic evaluation. Our objective was to implement ssGBLUP for genomic predictions in pigs and to compare the accuracy of ssGBLUP with that of multi-step methods with empirical data of moderately sized pig breeding populations. Different predictions were performed: conventional parent average (PA), direct genomic value (DGV) calculated with genomic BLUP (GBLUP), a GEBV obtained by blending the DGV with PA, and ssGBLUP. Data comprised individuals from a German Landrace (LR) and Large White (LW) population. The trait 'number of piglets born alive' (NBA) was available for 182,054 litters of 41,090 LR sows and 15,750 litters from 4534 LW sows. The pedigree contained 174,021 animals, of which 147,461 (26,560) animals were LR (LW) animals. In total, 526 LR and 455 LW animals were genotyped with the Illumina PorcineSNP60 BeadChip. After quality control and imputation, 495 LR (424 LW) animals with 44,368 (43,678) SNP on 18 autosomes remained for the analysis. Predictive abilities, i.e., correlations between de-regressed proofs and genomic BV, were calculated with a five-fold cross validation and with a forward prediction for young genotyped validation animals born after 2011. Generally, predictive abilities for LR were rather small (0.08 for GBLUP, 0.19 for GEBV and 0.18 for ssGBLUP). For LW, ssGBLUP had the greatest predictive ability (0.45). For both breeds, assessment of reliabilities for young genotyped animals indicated that genomic prediction outperforms PA with ssGBLUP providing greater reliabilities (0.40 for LR and 0.32 for LW) than GEBV (0.35 for LR and 0.29 for LW). Grouping of animals

  7. Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes.

    PubMed

    Janicki, Mateusz; Rooke, Rebecca; Yang, Guojun

    2011-08-01

    A major portion of most eukaryotic genomes are transposable elements (TEs). During evolution, TEs have introduced profound changes to genome size, structure, and function. As integral parts of genomes, the dynamic presence of TEs will continue to be a major force in reshaping genomes. Early computational analyses of TEs in genome sequences focused on filtering out "junk" sequences to facilitate gene annotation. When the high abundance and diversity of TEs in eukaryotic genomes were recognized, these early efforts transformed into the systematic genome-wide categorization and classification of TEs. The availability of genomic sequence data reversed the classical genetic approaches to discovering new TE families and superfamilies. Curated TE databases and their accurate annotation of genome sequences in turn facilitated the studies on TEs in a number of frontiers including: (1) TE-mediated changes of genome size and structure, (2) the influence of TEs on genome and gene functions, (3) TE regulation by host, (4) the evolution of TEs and their population dynamics, and (5) genomic scale studies of TE activity. Bioinformatics and genomic approaches have become an integral part of large-scale studies on TEs to extract information with pure in silico analyses or to assist wet lab experimental studies. The current revolution in genome sequencing technology facilitates further progress in the existing frontiers of research and emergence of new initiatives. The rapid generation of large-sequence datasets at record low costs on a routine basis is challenging the computing industry on storage capacity and manipulation speed and the bioinformatics community for improvement in algorithms and their implementations.

  8. Limits of variation, specific infectivity, and genome packaging of massively recoded poliovirus genomes.

    PubMed

    Song, Yutong; Gorbatsevych, Oleksandr; Liu, Ying; Mugavero, JoAnn; Shen, Sam H; Ward, Charles B; Asare, Emmanuel; Jiang, Ping; Paul, Aniko V; Mueller, Steffen; Wimmer, Eckard

    2017-10-10

    Computer design and chemical synthesis generated viable variants of poliovirus type 1 (PV1), whose ORF (6,189 nucleotides) carried up to 1,297 "Max" mutations (excess of overrepresented synonymous codon pairs) or up to 2,104 "SD" mutations (randomly scrambled synonymous codons). "Min" variants (excess of underrepresented synonymous codon pairs) are nonviable except for P2 Min , a variant temperature-sensitive at 33 and 39.5 °C. Compared with WT PV1, P2 Min displayed a vastly reduced specific infectivity (si) (WT, 1 PFU/118 particles vs. P2 Min , 1 PFU/35,000 particles), a phenotype that will be discussed broadly. Si of haploid PV presents cellular infectivity of a single genotype. We performed a comprehensive analysis of sequence and structures of the PV genome to determine if evolutionary conserved cis-acting packaging signal(s) were preserved after recoding. We showed that conserved synonymous sites and/or local secondary structures that might play a role in determining packaging specificity do not survive codon pair recoding. This makes it unlikely that numerous "cryptic, sequence-degenerate, dispersed RNA packaging signals mapping along the entire viral genome" [Patel N, et al. (2017) Nat Microbiol 2:17098] play the critical role in poliovirus packaging specificity. Considering all available evidence, we propose a two-step assembly strategy for +ssRNA viruses: step I, acquisition of packaging specificity, either ( a ) by specific recognition between capsid protein(s) and replication proteins (poliovirus), or ( b ) by the high affinity interaction of a single RNA packaging signal (PS) with capsid protein(s) (most +ssRNA viruses so far studied); step II, cocondensation of genome/capsid precursors in which an array of hairpin structures plays a role in virion formation.

  9. Genomic-based-breeding tools for tropical maize improvement.

    PubMed

    Chakradhar, Thammineni; Hindu, Vemuri; Reddy, Palakolanu Sudhakar

    2017-12-01

    Maize has traditionally been the main staple diet in the Southern Asia and Sub-Saharan Africa and widely grown by millions of resource poor small scale farmers. Approximately, 35.4 million hectares are sown to tropical maize, constituting around 59% of the developing worlds. Tropical maize encounters tremendous challenges besides poor agro-climatic situations with average yields recorded <3 tones/hectare that is far less than the average of developed countries. On the contrary to poor yields, the demand for maize as food, feed, and fuel is continuously increasing in these regions. Heterosis breeding introduced in early 90 s improved maize yields significantly, but genetic gains is still a mirage, particularly for crop growing under marginal environments. Application of molecular markers has accelerated the pace of maize breeding to some extent. The availability of array of sequencing and genotyping technologies offers unrivalled service to improve precision in maize-breeding programs through modern approaches such as genomic selection, genome-wide association studies, bulk segregant analysis-based sequencing approaches, etc. Superior alleles underlying complex traits can easily be identified and introgressed efficiently using these sequence-based approaches. Integration of genomic tools and techniques with advanced genetic resources such as nested association mapping and backcross nested association mapping could certainly address the genetic issues in maize improvement programs in developing countries. Huge diversity in tropical maize and its inherent capacity for doubled haploid technology offers advantage to apply the next generation genomic tools for accelerating production in marginal environments of tropical and subtropical world. Precision in phenotyping is the key for success of any molecular-breeding approach. This article reviews genomic technologies and their application to improve agronomic traits in tropical maize breeding has been reviewed in

  10. In vivo evolutionary engineering for ethanol-tolerance of Saccharomyces cerevisiae haploid cells triggers diploidization.

    PubMed

    Turanlı-Yıldız, Burcu; Benbadis, Laurent; Alkım, Ceren; Sezgin, Tuğba; Akşit, Arman; Gökçe, Abdülmecit; Öztürk, Yavuz; Baykal, Ahmet Tarık; Çakar, Zeynep Petek; François, Jean M

    2017-09-01

    Microbial ethanol production is an important alternative energy resource to replace fossil fuels, but at high level, this product is highly toxic, which hampers its efficient production. Towards increasing ethanol-tolerance of Saccharomyces cerevisiae, the so far best industrial ethanol-producer, we evaluated an in vivo evolutionary engineering strategy based on batch selection under both constant (5%, v v -1 ) and gradually increasing (5-11.4%, v v -1 ) ethanol concentrations. Selection under increasing ethanol levels yielded evolved clones that could tolerate up to 12% (v v -1 ) ethanol and had cross-resistance to other stresses. Quite surprisingly, diploidization of the yeast population took place already at 7% (v v -1 ) ethanol level during evolutionary engineering, and this event was abolished by the loss of MKT1, a gene previously identified as being implicated in ethanol tolerance (Swinnen et al., Genome Res., 22, 975-984, 2012). Transcriptomic analysis confirmed diploidization of the evolved clones with strong down-regulation in mating process, and in several haploid-specific genes. We selected two clones exhibiting the highest viability on 12% ethanol, and found productivity and titer of ethanol significantly higher than those of the reference strain under aerated fed-batch cultivation conditions. This higher fermentation performance could be related with a higher abundance of glycolytic and ribosomal proteins and with a relatively lower respiratory capacity of the evolved strain, as revealed by a comparative transcriptomic and proteomic analysis between the evolved and the reference strains. Altogether, these results emphasize the efficiency of the in vivo evolutionary engineering strategy for improving ethanol tolerance, and the link between ethanol tolerance and diploidization. Copyright © 2017 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  11. Combining fluorescence imaging with Hi-C to study 3D genome architecture of the same single cell.

    PubMed

    Lando, David; Basu, Srinjan; Stevens, Tim J; Riddell, Andy; Wohlfahrt, Kai J; Cao, Yang; Boucher, Wayne; Leeb, Martin; Atkinson, Liam P; Lee, Steven F; Hendrich, Brian; Klenerman, Dave; Laue, Ernest D

    2018-05-01

    Fluorescence imaging and chromosome conformation capture assays such as Hi-C are key tools for studying genome organization. However, traditionally, they have been carried out independently, making integration of the two types of data difficult to perform. By trapping individual cell nuclei inside a well of a 384-well glass-bottom plate with an agarose pad, we have established a protocol that allows both fluorescence imaging and Hi-C processing to be carried out on the same single cell. The protocol identifies 30,000-100,000 chromosome contacts per single haploid genome in parallel with fluorescence images. Contacts can be used to calculate intact genome structures to better than 100-kb resolution, which can then be directly compared with the images. Preparation of 20 single-cell Hi-C libraries using this protocol takes 5 d of bench work by researchers experienced in molecular biology techniques. Image acquisition and analysis require basic understanding of fluorescence microscopy, and some bioinformatics knowledge is required to run the sequence-processing tools described here.

  12. Gene expansion shapes genome architecture in the human pathogen Lichtheimia corymbifera: an evolutionary genomics analysis in the ancient terrestrial mucorales (Mucoromycotina).

    PubMed

    Schwartze, Volker U; Winter, Sascha; Shelest, Ekaterina; Marcet-Houben, Marina; Horn, Fabian; Wehner, Stefanie; Linde, Jörg; Valiante, Vito; Sammeth, Michael; Riege, Konstantin; Nowrousian, Minou; Kaerger, Kerstin; Jacobsen, Ilse D; Marz, Manja; Brakhage, Axel A; Gabaldón, Toni; Böcker, Sebastian; Voigt, Kerstin

    2014-08-01

    Lichtheimia species are the second most important cause of mucormycosis in Europe. To provide broader insights into the molecular basis of the pathogenicity-associated traits of the basal Mucorales, we report the full genome sequence of L. corymbifera and compared it to the genome of Rhizopus oryzae, the most common cause of mucormycosis worldwide. The genome assembly encompasses 33.6 MB and 12,379 protein-coding genes. This study reveals four major differences of the L. corymbifera genome to R. oryzae: (i) the presence of an highly elevated number of gene duplications which are unlike R. oryzae not due to whole genome duplication (WGD), (ii) despite the relatively high incidence of introns, alternative splicing (AS) is not frequently observed for the generation of paralogs and in response to stress, (iii) the content of repetitive elements is strikingly low (<5%), (iv) L. corymbifera is typically haploid. Novel virulence factors were identified which may be involved in the regulation of the adaptation to iron-limitation, e.g. LCor01340.1 encoding a putative siderophore transporter and LCor00410.1 involved in the siderophore metabolism. Genes encoding the transcription factors LCor08192.1 and LCor01236.1, which are similar to GATA type regulators and to calcineurin regulated CRZ1, respectively, indicating an involvement of the calcineurin pathway in the adaption to iron limitation. Genes encoding MADS-box transcription factors are elevated up to 11 copies compared to the 1-4 copies usually found in other fungi. More findings are: (i) lower content of tRNAs, but unique codons in L. corymbifera, (ii) Over 25% of the proteins are apparently specific for L. corymbifera. (iii) L. corymbifera contains only 2/3 of the proteases (known to be essential virulence factors) in comparison to R. oryzae. On the other hand, the number of secreted proteases, however, is roughly twice as high as in R. oryzae.

  13. Single haplotype assembly of the human genome from a hydatidiform mole.

    PubMed

    Steinberg, Karyn Meltz; Schneider, Valerie A; Graves-Lindsay, Tina A; Fulton, Robert S; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C; Church, Deanna M; Eichler, Evan E; Wilson, Richard K

    2014-12-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. © 2014 Steinberg et al.; Published by Cold Spring Harbor Laboratory Press.

  14. GenomeFingerprinter: the genome fingerprint and the universal genome fingerprint analysis for systematic comparative genomics.

    PubMed

    Ai, Yuncan; Ai, Hannan; Meng, Fanmei; Zhao, Lei

    2013-01-01

    No attention has been paid on comparing a set of genome sequences crossing genetic components and biological categories with far divergence over large size range. We define it as the systematic comparative genomics and aim to develop the methodology. First, we create a method, GenomeFingerprinter, to unambiguously produce a set of three-dimensional coordinates from a sequence, followed by one three-dimensional plot and six two-dimensional trajectory projections, to illustrate the genome fingerprint of a given genome sequence. Second, we develop a set of concepts and tools, and thereby establish a method called the universal genome fingerprint analysis (UGFA). Particularly, we define the total genetic component configuration (TGCC) (including chromosome, plasmid, and phage) for describing a strain as a systematic unit, the universal genome fingerprint map (UGFM) of TGCC for differentiating strains as a universal system, and the systematic comparative genomics (SCG) for comparing a set of genomes crossing genetic components and biological categories. Third, we construct a method of quantitative analysis to compare two genomes by using the outcome dataset of genome fingerprint analysis. Specifically, we define the geometric center and its geometric mean for a given genome fingerprint map, followed by the Euclidean distance, the differentiate rate, and the weighted differentiate rate to quantitatively describe the difference between two genomes of comparison. Moreover, we demonstrate the applications through case studies on various genome sequences, giving tremendous insights into the critical issues in microbial genomics and taxonomy. We have created a method, GenomeFingerprinter, for rapidly computing, geometrically visualizing, intuitively comparing a set of genomes at genome fingerprint level, and hence established a method called the universal genome fingerprint analysis, as well as developed a method of quantitative analysis of the outcome dataset. These have set

  15. Application of Response Surface Methods To Determine Conditions for Optimal Genomic Prediction

    PubMed Central

    Howard, Réka; Carriquiry, Alicia L.; Beavis, William D.

    2017-01-01

    An epistatic genetic architecture can have a significant impact on prediction accuracies of genomic prediction (GP) methods. Machine learning methods predict traits comprised of epistatic genetic architectures more accurately than statistical methods based on additive mixed linear models. The differences between these types of GP methods suggest a diagnostic for revealing genetic architectures underlying traits of interest. In addition to genetic architecture, the performance of GP methods may be influenced by the sample size of the training population, the number of QTL, and the proportion of phenotypic variability due to genotypic variability (heritability). Possible values for these factors and the number of combinations of the factor levels that influence the performance of GP methods can be large. Thus, efficient methods for identifying combinations of factor levels that produce most accurate GPs is needed. Herein, we employ response surface methods (RSMs) to find the experimental conditions that produce the most accurate GPs. We illustrate RSM with an example of simulated doubled haploid populations and identify the combination of factors that maximize the difference between prediction accuracies of best linear unbiased prediction (BLUP) and support vector machine (SVM) GP methods. The greatest impact on the response is due to the genetic architecture of the population, heritability of the trait, and the sample size. When epistasis is responsible for all of the genotypic variance and heritability is equal to one and the sample size of the training population is large, the advantage of using the SVM method vs. the BLUP method is greatest. However, except for values close to the maximum, most of the response surface shows little difference between the methods. We also determined that the conditions resulting in the greatest prediction accuracy for BLUP occurred when genetic architecture consists solely of additive effects, and heritability is equal to one. PMID

  16. The Effect of Different Oceanic Abiotic Factors on Prokaryotic Body Sizes

    NASA Astrophysics Data System (ADS)

    Pidathala, S.; Bellon, M.; Heim, N.; Payne, J.

    2016-12-01

    We are studying the impact of abiotic factors in the Pacific and Atlantic on prokaryotic body sizes and genome sizes because we are interested in the manner in which abiotic factors influence genome sizes independent of their influence on body sizes. Some research has been done in the past on marine bacterial evolution, including data collection on marine ecology in relation to bacterial body sizes (Straza 2009). We are using the abiotic factors: temperature, salinity, and pH to compare the biovolumes/genome sizes of different phyla by using R. We made 9 scatter plots to model these relationships. Regardless of the phyla or the ocean, we found that there is no relation between pH, temperature, and body size, with several exceptions: Deinococcus. thermus has an indirect relationship with size in respect to temperature; size only correlates to temperature for phyla that are thermophiles. We also found that bacteria like D. thermus and Thermotogae are taxa only found in higher temperatures. Additionally, almost all phyla have genome sizes restricted by certain pH levels:, Proteobacteria only reach genomes with acidity levels greater than 6. In terms of salinity levels, certain bacteria are only found within a small range, and others, like Proteobacteria, can only reach genomes at low salinity levels. Finally, Proteobacteria have large genome sizes between 30 and 40 °, and Crenarchaeota have constant genome sizes in higher temperatures. Conclusively, we discovered that these abiotic factors generally do not affect body size, with the exception of D. thermus' indirect relationship to temperature due to its small biovolume in high temperatures. However, we determined that these abiotic factors have a great impact on genome sizes. This is due to genome size independence from body size. Also, genome size could have served as an adaptive feature for bacteria in marine environments, explaining why different phyla may have diverged to accommodate their lifestyles.

  17. Genome-Wide Search Identifies 1.9 Mb from the Polar Bear Y Chromosome for Evolutionary Analyses

    PubMed Central

    Bidon, Tobias; Schreck, Nancy; Hailer, Frank; Nilsson, Maria A.; Janke, Axel

    2015-01-01

    The male-inherited Y chromosome is the major haploid fraction of the mammalian genome, rendering Y-linked sequences an indispensable resource for evolutionary research. However, despite recent large-scale genome sequencing approaches, only a handful of Y chromosome sequences have been characterized to date, mainly in model organisms. Using polar bear (Ursus maritimus) genomes, we compare two different in silico approaches to identify Y-linked sequences: 1) Similarity to known Y-linked genes and 2) difference in the average read depth of autosomal versus sex chromosomal scaffolds. Specifically, we mapped available genomic sequencing short reads from a male and a female polar bear against the reference genome and identify 112 Y-chromosomal scaffolds with a combined length of 1.9 Mb. We verified the in silico findings for the longer polar bear scaffolds by male-specific in vitro amplification, demonstrating the reliability of the average read depth approach. The obtained Y chromosome sequences contain protein-coding sequences, single nucleotide polymorphisms, microsatellites, and transposable elements that are useful for evolutionary studies. A high-resolution phylogeny of the polar bear patriline shows two highly divergent Y chromosome lineages, obtained from analysis of the identified Y scaffolds in 12 previously published male polar bear genomes. Moreover, we find evidence of gene conversion among ZFX and ZFY sequences in the giant panda lineage and in the ancestor of ursine and tremarctine bears. Thus, the identification of Y-linked scaffold sequences from unordered genome sequences yields valuable data to infer phylogenomic and population-genomic patterns in bears. PMID:26019166

  18. Genome Sequence of Saccharomyces carlsbergensis, the World’s First Pure Culture Lager Yeast

    PubMed Central

    Walther, Andrea; Hesselbart, Ana; Wendland, Jürgen

    2014-01-01

    Lager yeast beer production was revolutionized by the introduction of pure culture strains. The first established lager yeast strain is known as the bottom fermenting Saccharomyces carlsbergensis, which was originally termed Unterhefe No. 1 by Emil Chr. Hansen and has been used in production in since 1883. S. carlsbergensis belongs to group I/Saaz-type lager yeast strains and is better adapted to cold growth conditions than group II/Frohberg-type lager yeasts, e.g., the Weihenstephan strain WS34/70. Here, we sequenced S. carlsbergensis using next generation sequencing technologies. Lager yeasts are descendants from hybrids formed between a S. cerevisiae parent and a parent similar to S. eubayanus. Accordingly, the S. carlsbergensis 19.5-Mb genome is substantially larger than the 12-Mb S. cerevisiae genome. Based on the sequence scaffolds, synteny to the S. cerevisae genome, and by using directed polymerase chain reaction for gap closure, we generated a chromosomal map of S. carlsbergensis consisting of 29 unique chromosomes. We present evidence for genome and chromosome evolution within S. carlsbergensis via chromosome loss and loss of heterozygosity specifically of parts derived from the S. cerevisiae parent. Based on our sequence data and via fluorescence-activated cell-sorting analysis, we determined the ploidy of S. carlsbergensis. This inferred that this strain is basically triploid with a diploid S. eubayanus and haploid S. cerevisiae genome content. In contrast the Weihenstephan strain, which we resequenced, is essentially tetraploid composed of two diploid S. cerevisiae and S. eubayanus genomes. Based on conserved translocations between the parental genomes in S. carlsbergensis and the Weihenstephan strain we propose a joint evolutionary ancestry for lager yeast strains. PMID:24578374

  19. Stability of transgene integration and expression in subsequent generations of doubled haploid oilseed rape transformed with chitinase and beta-1,3-glucanase genes in a double-gene construct.

    PubMed

    Melander, Margareta; Kamnert, Iréne; Happstadius, Ingrid; Liljeroth, Erland; Bryngelsson, Tomas

    2006-09-01

    A double-gene construct with one chitinase and one beta-1,3-glucanase gene from barley, both driven by enhanced 35S promoters, was transformed into oilseed rape. From six primary transformants expressing both transgenes 10 doubled haploid lines were produced and studied for five generations. The number of inserted copies for both the genes was determined by Southern blotting and real-time PCR with full agreement between the two methods. When copy numbers were analysed in different generations, discrepancies were found, indicating that at least part of the inserted sequences were lost in one of the alleles of some doubled haploids. Chitinase and beta-1,3-glucanase expression was analysed by Western blotting in all five doubled haploid generations. Despite that both the genes were present on the same T-DNA and directed by the same promoter their expression pattern between generations was different. The beta-1,3-glucanase was expressed at high and stable levels in all generations, while the chitinase displayed lower expression that varied between generations. The transgenic plants did not show any major impact on fungal resistance when assayed in greenhouse, although purified beta-1,3-glucanase and chitinase caused retardment of fungal growth in vitro.

  20. Genome-wide haploinsufficiency screen reveals a novel role for γ-TuSC in spindle organization and genome stability

    PubMed Central

    Choy, John S.; O'Toole, Eileen; Schuster, Breanna M.; Crisp, Matthew J.; Karpova, Tatiana S.; McNally, James G.; Winey, Mark; Gardner, Melissa K.; Basrai, Munira A.

    2013-01-01

    How subunit dosage contributes to the assembly and function of multimeric complexes is an important question with implications in understanding biochemical, evolutionary, and disease mechanisms. Toward identifying pathways that are susceptible to decreased gene dosage, we performed a genome-wide screen for haploinsufficient (HI) genes that guard against genome instability in Saccharomyces cerevisiae. This led to the identification of all three genes (SPC97, SPC98, and TUB4) encoding the evolutionarily conserved γ-tubulin small complex (γ-TuSC), which nucleates microtubule assembly. We found that hemizygous γ-TuSC mutants exhibit higher rates of chromosome loss and increases in anaphase spindle length and elongation velocities. Fluorescence microscopy, fluorescence recovery after photobleaching, electron tomography, and model convolution simulation of spc98/+ mutants revealed improper regulation of interpolar (iMT) and kinetochore (kMT) microtubules in anaphase. The underlying cause is likely due to reduced levels of Tub4, as overexpression of TUB4 suppressed the spindle and chromosome segregation defects in spc98/+ mutants. We propose that γ-TuSC is crucial for balanced assembly between iMTs and kMTs for spindle organization and accurate chromosome segregation. Taken together, the results show how gene dosage studies provide critical insights into the assembly and function of multisubunit complexes that may not be revealed by using traditional studies with haploid gene deletion or conditional alleles. PMID:23825022

  1. Genome-wide haploinsufficiency screen reveals a novel role for γ-TuSC in spindle organization and genome stability.

    PubMed

    Choy, John S; O'Toole, Eileen; Schuster, Breanna M; Crisp, Matthew J; Karpova, Tatiana S; McNally, James G; Winey, Mark; Gardner, Melissa K; Basrai, Munira A

    2013-09-01

    How subunit dosage contributes to the assembly and function of multimeric complexes is an important question with implications in understanding biochemical, evolutionary, and disease mechanisms. Toward identifying pathways that are susceptible to decreased gene dosage, we performed a genome-wide screen for haploinsufficient (HI) genes that guard against genome instability in Saccharomyces cerevisiae. This led to the identification of all three genes (SPC97, SPC98, and TUB4) encoding the evolutionarily conserved γ-tubulin small complex (γ-TuSC), which nucleates microtubule assembly. We found that hemizygous γ-TuSC mutants exhibit higher rates of chromosome loss and increases in anaphase spindle length and elongation velocities. Fluorescence microscopy, fluorescence recovery after photobleaching, electron tomography, and model convolution simulation of spc98/+ mutants revealed improper regulation of interpolar (iMT) and kinetochore (kMT) microtubules in anaphase. The underlying cause is likely due to reduced levels of Tub4, as overexpression of TUB4 suppressed the spindle and chromosome segregation defects in spc98/+ mutants. We propose that γ-TuSC is crucial for balanced assembly between iMTs and kMTs for spindle organization and accurate chromosome segregation. Taken together, the results show how gene dosage studies provide critical insights into the assembly and function of multisubunit complexes that may not be revealed by using traditional studies with haploid gene deletion or conditional alleles.

  2. Genome Surfing As Driver of Microbial Genomic Diversity.

    PubMed

    Choudoir, Mallory J; Panke-Buisse, Kevin; Andam, Cheryl P; Buckley, Daniel H

    2017-08-01

    Historical changes in population size, such as those caused by demographic range expansions, can produce nonadaptive changes in genomic diversity through mechanisms such as gene surfing. We propose that demographic range expansion of a microbial population capable of horizontal gene exchange can result in genome surfing, a mechanism that can cause widespread increase in the pan-genome frequency of genes acquired by horizontal gene exchange. We explain that patterns of genetic diversity within Streptomyces are consistent with genome surfing, and we describe several predictions for testing this hypothesis both in Streptomyces and in other microorganisms. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Development and in-house validation of the event-specific polymerase chain reaction detection methods for genetically modified soybean MON89788 based on the cloned integration flanking sequence.

    PubMed

    Liu, Jia; Guo, Jinchao; Zhang, Haibo; Li, Ning; Yang, Litao; Zhang, Dabing

    2009-11-25

    Various polymerase chain reaction (PCR) methods were developed for the execution of genetically modified organism (GMO) labeling policies, of which an event-specific PCR detection method based on the flanking sequence of exogenous integration is the primary trend in GMO detection due to its high specificity. In this study, the 5' and 3' flanking sequences of the exogenous integration of MON89788 soybean were revealed by thermal asymmetric interlaced PCR. The event-specific PCR primers and TaqMan probe were designed based upon the revealed 5' flanking sequence, and the qualitative and quantitative PCR assays were established employing these designed primers and probes. In qualitative PCR, the limit of detection (LOD) was about 0.01 ng of genomic DNA corresponding to 10 copies of haploid soybean genomic DNA. In the quantitative PCR assay, the LOD was as low as two haploid genome copies, and the limit of quantification was five haploid genome copies. Furthermore, the developed PCR methods were in-house validated by five researchers, and the validated results indicated that the developed event-specific PCR methods can be used for identification and quantification of MON89788 soybean and its derivates.

  4. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication

    PubMed Central

    Wu, G. Albert; Prochnik, Simon; Jenkins, Jerry; Salse, Jerome; Hellsten, Uffe; Murat, Florent; Perrier, Xavier; Ruiz, Manuel; Scalabrin, Simone; Terol, Javier; Takita, Marco Aurélio; Labadie, Karine; Poulain, Julie; Couloux, Arnaud; Jabbari, Kamel; Cattonaro, Federica; Del Fabbro, Cristian; Pinosio, Sara; Zuccolo, Andrea; Chapman, Jarrod; Grimwood, Jane; Tadeo, Francisco R.; Estornell, Leandro H.; Muñoz-Sanz, Juan V.; Ibanez, Victoria; Herrero-Ortega, Amparo; Aleza, Pablo; Pérez-Pérez, Julián; Ramón, Daniel; Brunel, Dominique; Luro, François; Chen, Chunxian; Farmerie, William G.; Desany, Brian; Kodira, Chinnappa; Mohiuddin, Mohammed; Harkins, Tim; Fredrikson, Karin; Burns, Paul; Lomsadze, Alexandre; Borodovsky, Mark; Reforgiato, Giuseppe; Freitas-Astúa, Juliana; Quetier, Francis; Navarro, Luis; Roose, Mikeal; Wincker, Patrick; Schmutz, Jeremy; Morgante, Michele; Machado, Marcos Antonio; Talon, Manuel; Jaillon, Olivier; Ollitrault, Patrick; Gmitter, Frederick; Rokhsar, Daniel

    2014-01-01

    The domestication of citrus, is poorly understood. Cultivated types are selections from, or hybrids of, wild progenitor species, whose identities and contributions remain controversial. By comparative analysis of a collection of citrus genomes, including a high quality haploid reference, we show that cultivated types were derived from two progenitor species. Though cultivated pummelos represent selections from a single progenitor species, C. maxima, cultivated mandarins are introgressions of C. maxima into the ancestral mandarin species, C. reticulata. The most widely cultivated citrus, sweet orange, is the offspring of previously admixed individuals, but sour orange is an F1 hybrid of pure C. maxima and C. reticulata parents, implying that wild mandarins were part of the early breeding germplasm. A wild “mandarin” from China exhibited substantial divergence from C. reticulata, suggesting the possibility of other unrecognized wild citrus species. Understanding citrus phylogeny through genome analysis clarifies taxonomic relationships and enables sequence-directed genetic improvement. PMID:24908277

  5. Genome-wide association studies identified multiple genetic loci for body size at four growth stages in Chinese Holstein cattle.

    PubMed

    Zhang, Xu; Chu, Qin; Guo, Gang; Dong, Ganghui; Li, Xizhi; Zhang, Qin; Zhang, Shengli; Zhang, Zhiwu; Wang, Yachun

    2017-01-01

    The growth and maturity of cattle body size affect not only feed efficiency, but also productivity and longevity. Dissecting the genetic architecture of body size is critical for cattle breeding to improve both efficiency and productivity. The volume and weight of body size are indicated by several measurements. Among them, Heart Girth (HG) and Hip Height (HH) are the most important traits. They are widely used as predictors of body weight (BW). Few association studies have been conducted for HG and HH in cattle focusing on single growth stage. In this study, we extended the Genome-wide association studies to a full spectrum of four growth stages (6-, 12-, 18-, and 24-months after birth) in Chinese Holstein heifers. The whole genomic single nucleotide polymorphisms (SNPs) were obtained from the Illumina BovineSNP50 v2 BeadChip genotyped on 3,325 individuals. Estimated breeding values (EBVs) were derived for both HG and HH at the four different ages and analyzed separately for GWAS by using the Fixed and random model Circuitous Probability Unification (FarmCPU) method. In total, 27 SNPs were identified to be significantly associated with HG and HH at different growth stages. We found 66 candidate genes located nearby the associated SNPs, including nine genes that were known as highly related to development and skeletal and muscular growth. In addition, biological function analysis was performed by Ingenuity Pathway Analysis and an interaction network related to development was obtained, which contained 16 genes out of the 66 candidates. The set of putative genes provided valuable resources and can help elucidate the genomic architecture and mechanisms underlying growth traits in dairy cattle.

  6. Genome structure of a Saccharomyces cerevisiae strain widely used in bioethanol production

    PubMed Central

    Argueso, Juan Lucas; Carazzolle, Marcelo F.; Mieczkowski, Piotr A.; Duarte, Fabiana M.; Netto, Osmar V.C.; Missawa, Silvia K.; Galzerani, Felipe; Costa, Gustavo G.L.; Vidal, Ramon O.; Noronha, Melline F.; Dominska, Margaret; Andrietta, Maria G.S.; Andrietta, Sílvio R.; Cunha, Anderson F.; Gomes, Luiz H.; Tavares, Flavio C.A.; Alcarde, André R.; Dietrich, Fred S.; McCusker, John H.; Petes, Thomas D.; Pereira, Gonçalo A.G.

    2009-01-01

    Bioethanol is a biofuel produced mainly from the fermentation of carbohydrates derived from agricultural feedstocks by the yeast Saccharomyces cerevisiae. One of the most widely adopted strains is PE-2, a heterothallic diploid naturally adapted to the sugar cane fermentation process used in Brazil. Here we report the molecular genetic analysis of a PE-2 derived diploid (JAY270), and the complete genome sequence of a haploid derivative (JAY291). The JAY270 genome is highly heterozygous (∼2 SNPs/kb) and has several structural polymorphisms between homologous chromosomes. These chromosomal rearrangements are confined to the peripheral regions of the chromosomes, with breakpoints within repetitive DNA sequences. Despite its complex karyotype, this diploid, when sporulated, had a high frequency of viable spores. Hybrid diploids formed by outcrossing with the laboratory strain S288c also displayed good spore viability. Thus, the rearrangements that exist near the ends of chromosomes do not impair meiosis, as they do not span regions that contain essential genes. This observation is consistent with a model in which the peripheral regions of chromosomes represent plastic domains of the genome that are free to recombine ectopically and experiment with alternative structures. We also explored features of the JAY270 and JAY291 genomes that help explain their high adaptation to industrial environments, exhibiting desirable phenotypes such as high ethanol and cell mass production and high temperature and oxidative stress tolerance. The genomic manipulation of such strains could enable the creation of a new generation of industrial organisms, ideally suited for use as delivery vehicles for future bioenergy technologies. PMID:19812109

  7. Cypress surrogate mother produces haploid progeny from alien pollen.

    PubMed

    Pichot, Christian; Liens, Benjamin; Nava, Juana L Rivera; Bachelier, Julien B; El Maâtaoui, Mohamed

    2008-01-01

    Although most living organisms reproduce sexually, some have developed a uniparental reproduction where the embryo usually derives from the female parent. A unique case of paternal apomixis in plants has been recently reported in Cupressus dupreziana, an endangered Mediterranean conifer. This species produces unreduced pollen that develop into all-paternal embryos within the seed tissues. We analyzed seedlings produced by open-pollinated C. dupreziana seed trees using morphological descriptors, ploidy levels assessed through flow cytometry, and AFLP genetic diversity. In situ C. dupreziana seed trees (from Algeria) produced only diploid C. dupreziana progeny. In contrast, only one-third of the progeny produced by ex situ C. dupreziana seed trees planted in French collections were similar to C. dupreziana seedlings; the other progeny were haploid or diploid C. sempervirens seedlings. These results demonstrate that C. dupreziana ovules allow for the development of all-paternal embryos from pollen produced by another species, C. sempervirens. Thus, the in planta androgenesis is achieved through the combination of the embryogenic behavior of pollen grains and the ability of seed tree ovules to act as a surrogate mother. This phenomenon offers a unique opportunity to produce, by natural means, highly valuable material for genetic studies and selection of sterile cultivars.

  8. The Awesome Power of Yeast Evolutionary Genetics: New Genome Sequences and Strain Resources for the Saccharomyces sensu stricto Genus

    PubMed Central

    Scannell, Devin R.; Zill, Oliver A.; Rokas, Antonis; Payen, Celia; Dunham, Maitreya J.; Eisen, Michael B.; Rine, Jasper; Johnston, Mark; Hittinger, Chris Todd

    2011-01-01

    High-quality, well-annotated genome sequences and standardized laboratory strains fuel experimental and evolutionary research. We present improved genome sequences of three species of Saccharomyces sensu stricto yeasts: S. bayanus var. uvarum (CBS 7001), S. kudriavzevii (IFO 1802T and ZP 591), and S. mikatae (IFO 1815T), and describe their comparison to the genomes of S. cerevisiae and S. paradoxus. The new sequences, derived by assembling millions of short DNA sequence reads together with previously published Sanger shotgun reads, have vastly greater long-range continuity and far fewer gaps than the previously available genome sequences. New gene predictions defined a set of 5261 protein-coding orthologs across the five most commonly studied Saccharomyces yeasts, enabling a re-examination of the tempo and mode of yeast gene evolution and improved inferences of species-specific gains and losses. To facilitate experimental investigations, we generated genetically marked, stable haploid strains for all three of these Saccharomyces species. These nearly complete genome sequences and the collection of genetically marked strains provide a valuable toolset for comparative studies of gene function, metabolism, and evolution, and render Saccharomyces sensu stricto the most experimentally tractable model genus. These resources are freely available and accessible through www.SaccharomycesSensuStricto.org. PMID:22384314

  9. Comparative Genomics of the Cucurbitaceae

    USDA-ARS?s Scientific Manuscript database

    The genome size for watermelon, melon, cucumber, and pumpkin is 425, 454, 367, and 502 Mbp, respectively, and considered medium size as compared with most other crops. Whole-genome duplication is common in angiosperm plants. Research has revealed a paleohexaploidy (') event in the common ancestor of...

  10. A first AFLP-Based Genetic Linkage Map for Brine Shrimp Artemia franciscana and Its Application in Mapping the Sex Locus

    PubMed Central

    De Vos, Stephanie; Bossier, Peter; Van Stappen, Gilbert; Vercauteren, Ilse; Sorgeloos, Patrick; Vuylsteke, Marnik

    2013-01-01

    We report on the construction of sex-specific linkage maps, the identification of sex-linked markers and the genome size estimation for the brine shrimp Artemia franciscana. Overall, from the analysis of 433 AFLP markers segregating in a 112 full-sib family we identified 21 male and 22 female linkage groups (2n = 42), covering 1,041 and 1,313 cM respectively. Fifteen putatively homologous linkage groups, including the sex linkage groups, were identified between the female and male linkage map. Eight sex-linked AFLP marker alleles were inherited from the female parent, supporting the hypothesis of a WZ–ZZ sex-determining system. The haploid Artemia genome size was estimated to 0.93 Gb by flow cytometry. The produced Artemia linkage maps provide the basis for further fine mapping and exploring of the sex-determining region and are a possible marker resource for mapping genomic loci underlying phenotypic differences among Artemia species. PMID:23469207

  11. Development of a CRISPR-Cas9 System for Efficient Genome Editing of Candida lusitaniae.

    PubMed

    Norton, Emily L; Sherwood, Racquel K; Bennett, Richard J

    2017-01-01

    Candida lusitaniae is a member of the Candida clade that includes a diverse group of fungal species relevant to both human health and biotechnology. This species exhibits a full sexual cycle to undergo interconversion between haploid and diploid forms. C. lusitaniae is also an emerging opportunistic pathogen that can cause serious bloodstream infections in the clinic and yet has often proven to be refractory to facile genetic manipulations. In this work, we develop a clustered regularly interspaced short palindromic repeat (CRISPR) and CRISPR-associated gene 9 (Cas9) system to enable genome editing of C. lusitaniae . We demonstrate that expression of CRISPR-Cas9 components under species-specific promoters is necessary for efficient gene targeting and can be successfully applied to multiple genes in both haploid and diploid isolates. Gene deletion efficiencies with CRISPR-Cas9 were further enhanced in C. lusitaniae strains lacking the established nonhomologous end joining (NHEJ) factors Ku70 and DNA ligase 4. These results indicate that NHEJ plays an important role in directing the repair of DNA double-strand breaks (DSBs) in C. lusitaniae and that removal of this pathway increases integration of gene deletion templates by homologous recombination. The described approaches significantly enhance the ability to perform genetic studies in, and promote understanding of, this emerging human pathogen and model sexual species. IMPORTANCE The ability to perform efficient genome editing is a key development for detailed mechanistic studies of a species. Candida lusitaniae is an important member of the Candida clade and is relevant both as an emerging human pathogen and as a model for understanding mechanisms of sexual reproduction. We highlight the development of a CRISPR-Cas9 system for efficient genome manipulation in C. lusitaniae and demonstrate the importance of species-specific promoters for expression of CRISPR components. We also demonstrate that the NHEJ

  12. Parallel or convergent evolution in human population genomic data revealed by genotype networks.

    PubMed

    R Vahdati, Ali; Wagner, Andreas

    2016-08-02

    Genotype networks are representations of genetic variation data that are complementary to phylogenetic trees. A genotype network is a graph whose nodes are genotypes (DNA sequences) with the same broadly defined phenotype. Two nodes are connected if they differ in some minimal way, e.g., in a single nucleotide. We analyze human genome variation data from the 1,000 genomes project, and construct haploid genotype (haplotype) networks for 12,235 protein coding genes. The structure of these networks varies widely among genes, indicating different patterns of variation despite a shared evolutionary history. We focus on those genes whose genotype networks show many cycles, which can indicate homoplasy, i.e., parallel or convergent evolution, on the sequence level. For 42 genes, the observed number of cycles is so large that it cannot be explained by either chance homoplasy or recombination. When analyzing possible explanations, we discovered evidence for positive selection in 21 of these genes and, in addition, a potential role for constrained variation and purifying selection. Balancing selection plays at most a small role. The 42 genes with excess cycles are enriched in functions related to immunity and response to pathogens. Genotype networks are representations of genetic variation data that can help understand unusual patterns of genomic variation.

  13. Contrasting behavior of heterochromatic and euchromatic chromosome portions and pericentric genome separation in pre-bouquet spermatocytes of hybrid mice.

    PubMed

    Scherthan, Harry; Schöfisch, Karina; Dell, Thomas; Illner, Doris

    2014-12-01

    The spatial distribution of parental genomes has attracted much interest because intranuclear chromosome distribution can modulate the transcriptome of cells and influence the efficacy of meiotic homologue pairing. Pairing of parental chromosomes is imperative to sexual reproduction as it translates into homologue segregation and genome haploidization to counteract the genome doubling at fertilization. Differential FISH tagging of parental pericentromeric genome portions and specific painting of euchromatic chromosome arms in Mus musculus (MMU) × Mus spretus (MSP) hybrid spermatogenesis disclosed a phase of homotypic non-homologous pericentromere clustering that led to parental pericentric genome separation from the pre-leptoteneup to zygotene stages. Preferential clustering of MMU pericentromeres correlated with particular enrichment of epigenetic marks (H3K9me3), HP1-γ and structural maintenance of chromosomes SMC6 complex proteins at the MMU major satellite DNA repeats. In contrast to the separation of heterochromatic pericentric genome portions, the euchromatic arms of homeologous chromosomes showed considerable presynaptic pairing already during leptotene stage of all mice investigated. Pericentric genome separation was eventually disbanded by telomere clustering that concentrated both parental pericentric genome portions in a limited nuclear sector of the bouquet nucleus. Our data disclose the differential behavior of pericentromeric heterochromatin and the euchromatic portions of the parental genomes during homologue search. Homotypic pericentromere clustering early in prophase I may contribute to the exclusion of large repetitive DNA domains from homology search, while the telomere bouquet congregates and registers spatially separated portions of the genome to fuel synapsis initiation and high levels of homologue pairing, thus contributing to the fidelity of meiosis and reproduction.

  14. De Novo Assembly and Phasing of Dikaryotic Genomes from Two Isolates of Puccinia coronata f. sp. avenae, the Causal Agent of Oat Crown Rust

    PubMed Central

    Miller, Marisa E.; Zhang, Ying; Omidvar, Vahid; Sperschneider, Jana; Raley, Castle; Palmer, Jonathan M.; Garnica, Diana; Upadhyaya, Narayana; Rathjen, John; Taylor, Jennifer M.; Park, Robert F.; Dodds, Peter N.; Hirsch, Cory D.

    2018-01-01

    ABSTRACT Oat crown rust, caused by the fungus Pucinnia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenae. PMID:29463655

  15. Drosophila Females Undergo Genome Expansion after Interspecific Hybridization

    PubMed Central

    Romero-Soriano, Valèria; Burlet, Nelly; Vela, Doris; Fontdevila, Antonio; Vieira, Cristina; García Guerreiro, María Pilar

    2016-01-01

    Genome size (or C-value) can present a wide range of values among eukaryotes. This variation has been attributed to differences in the amplification and deletion of different noncoding repetitive sequences, particularly transposable elements (TEs). TEs can be activated under different stress conditions such as interspecific hybridization events, as described for several species of animals and plants. These massive transposition episodes can lead to considerable genome expansions that could ultimately be involved in hybrid speciation processes. Here, we describe the effects of hybridization and introgression on genome size of Drosophila hybrids. We measured the genome size of two close Drosophila species, Drosophila buzzatii and Drosophila koepferae, their F1 offspring and the offspring from three generations of backcrossed hybrids; where mobilization of up to 28 different TEs was previously detected. We show that hybrid females indeed present a genome expansion, especially in the first backcross, which could likely be explained by transposition events. Hybrid males, which exhibit more variable C-values among individuals of the same generation, do not present an increased genome size. Thus, we demonstrate that the impact of hybridization on genome size can be detected through flow cytometry and is sex-dependent. PMID:26872773

  16. Ultrafast Comparison of Personal Genomes via Precomputed Genome Fingerprints.

    PubMed

    Glusman, Gustavo; Mauldin, Denise E; Hood, Leroy E; Robinson, Max

    2017-01-01

    We present an ultrafast method for comparing personal genomes. We transform the standard genome representation (lists of variants relative to a reference) into "genome fingerprints" via locality sensitive hashing. The resulting genome fingerprints can be meaningfully compared even when the input data were obtained using different sequencing technologies, processed using different pipelines, represented in different data formats and relative to different reference versions. Furthermore, genome fingerprints are robust to up to 30% missing data. Because of their reduced size, computation on the genome fingerprints is fast and requires little memory. For example, we could compute all-against-all pairwise comparisons among the 2504 genomes in the 1000 Genomes data set in 67 s at high quality (21 μs per comparison, on a single processor), and achieved a lower quality approximation in just 11 s. Efficient computation enables scaling up a variety of important genome analyses, including quantifying relatedness, recognizing duplicative sequenced genomes in a set, population reconstruction, and many others. The original genome representation cannot be reconstructed from its fingerprint, effectively decoupling genome comparison from genome interpretation; the method thus has significant implications for privacy-preserving genome analytics.

  17. Ultrafast Comparison of Personal Genomes via Precomputed Genome Fingerprints

    PubMed Central

    Glusman, Gustavo; Mauldin, Denise E.; Hood, Leroy E.; Robinson, Max

    2017-01-01

    We present an ultrafast method for comparing personal genomes. We transform the standard genome representation (lists of variants relative to a reference) into “genome fingerprints” via locality sensitive hashing. The resulting genome fingerprints can be meaningfully compared even when the input data were obtained using different sequencing technologies, processed using different pipelines, represented in different data formats and relative to different reference versions. Furthermore, genome fingerprints are robust to up to 30% missing data. Because of their reduced size, computation on the genome fingerprints is fast and requires little memory. For example, we could compute all-against-all pairwise comparisons among the 2504 genomes in the 1000 Genomes data set in 67 s at high quality (21 μs per comparison, on a single processor), and achieved a lower quality approximation in just 11 s. Efficient computation enables scaling up a variety of important genome analyses, including quantifying relatedness, recognizing duplicative sequenced genomes in a set, population reconstruction, and many others. The original genome representation cannot be reconstructed from its fingerprint, effectively decoupling genome comparison from genome interpretation; the method thus has significant implications for privacy-preserving genome analytics. PMID:29018478

  18. The draft genome and transcriptome of Cannabis sativa.

    PubMed

    van Bakel, Harm; Stout, Jake M; Cote, Atina G; Tallon, Carling M; Sharpe, Andrew G; Hughes, Timothy R; Page, Jonathan E

    2011-10-20

    Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored. We sequenced genomic DNA and RNA from the marijuana strain Purple Kush using shortread approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp. The availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics.

  19. Characterization of the exogenous insert and development of event-specific PCR detection methods for genetically modified Huanong No. 1 papaya.

    PubMed

    Guo, Jinchao; Yang, Litao; Liu, Xin; Guan, Xiaoyan; Jiang, Lingxi; Zhang, Dabing

    2009-08-26

    Genetically modified (GM) papaya (Carica papaya L.), Huanong No. 1, was approved for commercialization in Guangdong province, China in 2006, and the development of the Huanong No. 1 papaya detection method is necessary for implementing genetically modified organism (GMO) labeling regulations. In this study, we reported the characterization of the exogenous integration of GM Huanong No. 1 papaya by means of conventional polymerase chain reaction (PCR) and thermal asymmetric interlaced (TAIL)-PCR strategies. The results suggested that one intact copy of the initial construction was integrated in the papaya genome and which probably resulted in one deletion (38 bp in size) of the host genomic DNA. Also, one unintended insertion of a 92 bp truncated NptII fragment was observed at the 5' end of the exogenous insert. Furthermore, we revealed its 5' and 3' flanking sequences between the insert DNA and the papaya genomic DNA, and developed the event-specific qualitative and quantitative PCR assays for GM Huanong No. 1 papaya based on the 5' integration flanking sequence. The relative limit of detection (LOD) of the qualitative PCR assay was about 0.01% in 100 ng of total papaya genomic DNA, corresponding to about 25 copies of papaya haploid genome. In the quantitative PCR, the limits of detection and quantification (LOD and LOQ) were as low as 12.5 and 25 copies of papaya haploid genome, respectively. In practical sample quantification, the quantified biases between the test and true values of three samples ranged from 0.44% to 4.41%. Collectively, we proposed that all of these results are useful for the identification and quantification of Huanong No. 1 papaya and its derivates.

  20. Pstl repeat: a family of short interspersed nucleotide element (SINE)-like sequences in the genomes of cattle, goat, and buffalo.

    PubMed

    Sheikh, Faruk G; Mukhopadhyay, Sudit S; Gupta, Prabhakar

    2002-02-01

    The PstI family of elements are short, highly repetitive DNA sequences interspersed throughout the genome of the Bovidae. We have cloned and sequenced some members of the PstI family from cattle, goat, and buffalo. These elements are approximately 500 bp, have a copy number of 2 x 10(5) - 4 x 10(5), and comprise about 4% of the haploid genome. Studies of nucleotide sequence homology indicate that the buffalo and goat PstI repeats (type II) are similar types of short interspersed nucleotide element (SINE) sequences, but the cattle PstI repeat (type I) is considerably more divergent. Additionally, the goat PstI sequence showed significant sequence homology with bovine serine tRNA, and is therefore likely derived from serine tRNA. Interestingly, Southern hybridization suggests that both types of SINEs (I and II) are present in all the species of Bovidae. Dendrogram analysis indicates that cattle PstI SINE is similar to bovine Alu-like SINEs. Goat and buffalo SINEs formed a separate cluster, suggesting that these two types of SINEs evolved separately in the genome of the Bovidae.

  1. Genome Size, Molecular Phylogeny, and Evolutionary History of the Tribe Aquilarieae (Thymelaeaceae), the Natural Source of Agarwood

    PubMed Central

    Farah, Azman H.; Lee, Shiou Yih; Gao, Zhihui; Yao, Tze Leong; Madon, Maria; Mohamed, Rozi

    2018-01-01

    The tribe Aquilarieae of the family Thymelaeaceae consists of two genera, Aquilaria and Gyrinops, with a total of 30 species, distributed from northeast India, through southeast Asia and the south of China, to Papua New Guinea. They are an important botanical resource for fragrant agarwood, a prized product derived from injured or infected stems of these species. The aim of this study was to estimate the genome size of selected Aquilaria species and comprehend the evolutionary history of Aquilarieae speciation through molecular phylogeny. Five non-coding chloroplast DNA regions and a nuclear region were sequenced from 12 Aquilaria and three Gyrinops species. Phylogenetic trees constructed using combined chloroplast DNA sequences revealed relationships of the studied 15 members in Aquilarieae, while nuclear ribosomal DNA internal transcribed spacer (ITS) sequences showed a paraphyletic relationship between Aquilaria species from Indochina and Malesian. We exposed, for the first time, the estimated divergence time for Aquilarieae speciation, which was speculated to happen during the Miocene Epoch. The ancestral split and biogeographic pattern of studied species were discussed. Results showed no large variation in the 2C-values for the five Aquilaria species (1.35–2.23 pg). Further investigation into the genome size may provide additional information regarding ancestral traits and its evolution history. PMID:29896211

  2. The chloroplast genome of the hexaploid Spartina maritima (Poaceae, Chloridoideae): Comparative analyses and molecular dating.

    PubMed

    Rousseau-Gueutin, M; Bellot, S; Martin, G E; Boutte, J; Chelaifa, H; Lima, O; Michon-Coudouel, S; Naquin, D; Salmon, A; Ainouche, K; Ainouche, M

    2015-12-01

    The history of many plant lineages is complicated by reticulate evolution with cases of hybridization often followed by genome duplication (allopolyploidy). In such a context, the inference of phylogenetic relationships and biogeographic scenarios based on molecular data is easier using haploid markers like chloroplast genome sequences. Hybridization and polyploidization occurred recurrently in the genus Spartina (Poaceae, Chloridoideae), as illustrated by the recent formation of the invasive allododecaploid S. anglica during the 19th century in Europe. Until now, only a few plastid markers were available to explore the history of this genus and their low variability limited the resolution of species relationships. We sequenced the complete chloroplast genome (plastome) of S. maritima, the native European parent of S. anglica, and compared it to the plastomes of other Poaceae. Our analysis revealed the presence of fast-evolving regions of potential taxonomic, phylogeographic and phylogenetic utility at various levels within the Poaceae family. Using secondary calibrations, we show that the tetraploid and hexaploid lineages of Spartina diverged 6-10 my ago, and that the two parents of the invasive allopolyploid S. anglica separated 2-4 my ago via long distance dispersal of the ancestor of S. maritima over the Atlantic Ocean. Finally, we discuss the meaning of divergence times between chloroplast genomes in the context of reticulate evolution. Copyright © 2015 Elsevier Inc. All rights reserved.

  3. Rewriting the blueprint of life by synthetic genomics and genome engineering.

    PubMed

    Annaluru, Narayana; Ramalingam, Sivaprakash; Chandrasegaran, Srinivasan

    2015-06-16

    Advances in DNA synthesis and assembly methods over the past decade have made it possible to construct genome-size fragments from oligonucleotides. Early work focused on synthesis of small viral genomes, followed by hierarchical synthesis of wild-type bacterial genomes and subsequently on transplantation of synthesized bacterial genomes into closely related recipient strains. More recently, a synthetic designer version of yeast Saccharomyces cerevisiae chromosome III has been generated, with numerous changes from the wild-type sequence without having an impact on cell fitness and phenotype, suggesting plasticity of the yeast genome. A project to generate the first synthetic yeast genome--the Sc2.0 Project--is currently underway.

  4. New Markers for Predicting Fertility of the Male Gametes in the Post Genomic Age.

    PubMed

    Dipresa, Savina; De Toni, Luca; Foresta, Carlo; Garolla, Andrea

    2018-04-18

    A number of test have been proposed to assess male fertility potential, ranging from routine testing by light microscopic method for evaluating semen samples, to screening test for DNA integrity aimed to look at sperm chromatin abnormalities. Spermatozoa are an extremely differentiated cell, they have critical functions for embryo development and heredity, in addiction to delivering a haploid paternal genome to the oocyte. Towards this goal certain requirements must always be met. The ability of spermatozoa to perform its reproductive function taking place in the spermatogenesis, a highly specialized process depending on multiple factors with effect on male fertility. In the past 30 years, large-scale analyses of transcriptomic and genome expression in mammals have generated a large amount of informations on numberless biomolecules involved in spermatogenesis and male germ cell reproductive function. Sperm proteome represents the protein content that spermatozoa needs to survive and work correctly and modifications of sperm proteome play a role in determining functional changes leading to a decrease of reproductive competence into affected spermatozoa. The post-genomic approach consists of different methodologies for concurrently testicular transcriptome studies, protein compositional analysis and metabolomics findings of the spermatozoa in humans. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  5. Population Genomics of Sub-Saharan Drosophila melanogaster: African Diversity and Non-African Admixture

    PubMed Central

    Pool, John E.; Corbett-Detig, Russell B.; Sugino, Ryuichi P.; Stevens, Kristian A.; Cardeno, Charis M.; Crepeau, Marc W.; Duchen, Pablo; Emerson, J. J.; Saelao, Perot; Begun, David J.; Langley, Charles H.

    2012-01-01

    Drosophila melanogaster has played a pivotal role in the development of modern population genetics. However, many basic questions regarding the demographic and adaptive history of this species remain unresolved. We report the genome sequencing of 139 wild-derived strains of D. melanogaster, representing 22 population samples from the sub-Saharan ancestral range of this species, along with one European population. Most genomes were sequenced above 25X depth from haploid embryos. Results indicated a pervasive influence of non-African admixture in many African populations, motivating the development and application of a novel admixture detection method. Admixture proportions varied among populations, with greater admixture in urban locations. Admixture levels also varied across the genome, with localized peaks and valleys suggestive of a non-neutral introgression process. Genomes from the same location differed starkly in ancestry, suggesting that isolation mechanisms may exist within African populations. After removing putatively admixed genomic segments, the greatest genetic diversity was observed in southern Africa (e.g. Zambia), while diversity in other populations was largely consistent with a geographic expansion from this potentially ancestral region. The European population showed different levels of diversity reduction on each chromosome arm, and some African populations displayed chromosome arm-specific diversity reductions. Inversions in the European sample were associated with strong elevations in diversity across chromosome arms. Genomic scans were conducted to identify loci that may represent targets of positive selection within an African population, between African populations, and between European and African populations. A disproportionate number of candidate selective sweep regions were located near genes with varied roles in gene regulation. Outliers for Europe-Africa FST were found to be enriched in genomic regions of locally elevated cosmopolitan

  6. Genome-Wide Search Identifies 1.9 Mb from the Polar Bear Y Chromosome for Evolutionary Analyses.

    PubMed

    Bidon, Tobias; Schreck, Nancy; Hailer, Frank; Nilsson, Maria A; Janke, Axel

    2015-05-27

    The male-inherited Y chromosome is the major haploid fraction of the mammalian genome, rendering Y-linked sequences an indispensable resource for evolutionary research. However, despite recent large-scale genome sequencing approaches, only a handful of Y chromosome sequences have been characterized to date, mainly in model organisms. Using polar bear (Ursus maritimus) genomes, we compare two different in silico approaches to identify Y-linked sequences: 1) Similarity to known Y-linked genes and 2) difference in the average read depth of autosomal versus sex chromosomal scaffolds. Specifically, we mapped available genomic sequencing short reads from a male and a female polar bear against the reference genome and identify 112 Y-chromosomal scaffolds with a combined length of 1.9 Mb. We verified the in silico findings for the longer polar bear scaffolds by male-specific in vitro amplification, demonstrating the reliability of the average read depth approach. The obtained Y chromosome sequences contain protein-coding sequences, single nucleotide polymorphisms, microsatellites, and transposable elements that are useful for evolutionary studies. A high-resolution phylogeny of the polar bear patriline shows two highly divergent Y chromosome lineages, obtained from analysis of the identified Y scaffolds in 12 previously published male polar bear genomes. Moreover, we find evidence of gene conversion among ZFX and ZFY sequences in the giant panda lineage and in the ancestor of ursine and tremarctine bears. Thus, the identification of Y-linked scaffold sequences from unordered genome sequences yields valuable data to infer phylogenomic and population-genomic patterns in bears. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. LTR Retrotransposons Contribute to Genomic Gigantism in Plethodontid Salamanders

    PubMed Central

    Sun, Cheng; Shepard, Donald B.; Chong, Rebecca A.; López Arriaza, José; Hall, Kathryn; Castoe, Todd A.; Feschotte, Cédric; Pollock, David D.; Mueller, Rachel Lockridge

    2012-01-01

    Among vertebrates, most of the largest genomes are found within the salamanders, a clade of amphibians that includes 613 species. Salamander genome sizes range from ∼14 to ∼120 Gb. Because genome size is correlated with nucleus and cell sizes, as well as other traits, morphological evolution in salamanders has been profoundly affected by genomic gigantism. However, the molecular mechanisms driving genomic expansion in this clade remain largely unknown. Here, we present the first comparative analysis of transposable element (TE) content in salamanders. Using high-throughput sequencing, we generated genomic shotgun data for six species from the Plethodontidae, the largest family of salamanders. We then developed a pipeline to mine TE sequences from shotgun data in taxa with limited genomic resources, such as salamanders. Our summaries of overall TE abundance and diversity for each species demonstrate that TEs make up a substantial portion of salamander genomes, and that all of the major known types of TEs are represented in salamanders. The most abundant TE superfamilies found in the genomes of our six focal species are similar, despite substantial variation in genome size. However, our results demonstrate a major difference between salamanders and other vertebrates: salamander genomes contain much larger amounts of long terminal repeat (LTR) retrotransposons, primarily Ty3/gypsy elements. Thus, the extreme increase in genome size that occurred in salamanders was likely accompanied by a shift in TE landscape. These results suggest that increased proliferation of LTR retrotransposons was a major molecular mechanism contributing to genomic expansion in salamanders. PMID:22200636

  8. Meta-analysis of genome-wide association from genomic prediction models

    USDA-ARS?s Scientific Manuscript database

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...

  9. Périgord black truffle genome uncovers evolutionary origins and mechanisms of symbiosis.

    PubMed

    Martin, Francis; Kohler, Annegret; Murat, Claude; Balestrini, Raffaella; Coutinho, Pedro M; Jaillon, Olivier; Montanini, Barbara; Morin, Emmanuelle; Noel, Benjamin; Percudani, Riccardo; Porcel, Bettina; Rubini, Andrea; Amicucci, Antonella; Amselem, Joelle; Anthouard, Véronique; Arcioni, Sergio; Artiguenave, François; Aury, Jean-Marc; Ballario, Paola; Bolchi, Angelo; Brenna, Andrea; Brun, Annick; Buée, Marc; Cantarel, Brandi; Chevalier, Gérard; Couloux, Arnaud; Da Silva, Corinne; Denoeud, France; Duplessis, Sébastien; Ghignone, Stefano; Hilselberger, Benoît; Iotti, Mirco; Marçais, Benoît; Mello, Antonietta; Miranda, Michele; Pacioni, Giovanni; Quesneville, Hadi; Riccioni, Claudia; Ruotolo, Roberta; Splivallo, Richard; Stocchi, Vilberto; Tisserant, Emilie; Viscomi, Arturo Roberto; Zambonelli, Alessandra; Zampieri, Elisa; Henrissat, Bernard; Lebrun, Marc-Henri; Paolocci, Francesco; Bonfante, Paola; Ottonello, Simone; Wincker, Patrick

    2010-04-15

    The Périgord black truffle (Tuber melanosporum Vittad.) and the Piedmont white truffle dominate today's truffle market. The hypogeous fruiting body of T. melanosporum is a gastronomic delicacy produced by an ectomycorrhizal symbiont endemic to calcareous soils in southern Europe. The worldwide demand for this truffle has fuelled intense efforts at cultivation. Identification of processes that condition and trigger fruit body and symbiosis formation, ultimately leading to efficient crop production, will be facilitated by a thorough analysis of truffle genomic traits. In the ectomycorrhizal Laccaria bicolor, the expansion of gene families may have acted as a 'symbiosis toolbox'. This feature may however reflect evolution of this particular taxon and not a general trait shared by all ectomycorrhizal species. To get a better understanding of the biology and evolution of the ectomycorrhizal symbiosis, we report here the sequence of the haploid genome of T. melanosporum, which at approximately 125 megabases is the largest and most complex fungal genome sequenced so far. This expansion results from a proliferation of transposable elements accounting for approximately 58% of the genome. In contrast, this genome only contains approximately 7,500 protein-coding genes with very rare multigene families. It lacks large sets of carbohydrate cleaving enzymes, but a few of them involved in degradation of plant cell walls are induced in symbiotic tissues. The latter feature and the upregulation of genes encoding for lipases and multicopper oxidases suggest that T. melanosporum degrades its host cell walls during colonization. Symbiosis induces an increased expression of carbohydrate and amino acid transporters in both L. bicolor and T. melanosporum, but the comparison of genomic traits in the two ectomycorrhizal fungi showed that genetic predispositions for symbiosis-'the symbiosis toolbox'-evolved along different ways in ascomycetes and basidiomycetes.

  10. NAHR-mediated copy-number variants in a clinical population: mechanistic insights into both genomic disorders and Mendelizing traits.

    PubMed

    Dittwald, Piotr; Gambin, Tomasz; Szafranski, Przemyslaw; Li, Jian; Amato, Stephen; Divon, Michael Y; Rodríguez Rojas, Lisa Ximena; Elton, Lindsay E; Scott, Daryl A; Schaaf, Christian P; Torres-Martinez, Wilfredo; Stevens, Abby K; Rosenfeld, Jill A; Agadi, Satish; Francis, David; Kang, Sung-Hae L; Breman, Amy; Lalani, Seema R; Bacino, Carlos A; Bi, Weimin; Milosavljevic, Aleksandar; Beaudet, Arthur L; Patel, Ankita; Shaw, Chad A; Lupski, James R; Gambin, Anna; Cheung, Sau Wai; Stankiewicz, Pawel

    2013-09-01

    We delineated and analyzed directly oriented paralogous low-copy repeats (DP-LCRs) in the most recent version of the human haploid reference genome. The computationally defined DP-LCRs were cross-referenced with our chromosomal microarray analysis (CMA) database of 25,144 patients subjected to genome-wide assays. This computationally guided approach to the empirically derived large data set allowed us to investigate genomic rearrangement relative frequencies and identify new loci for recurrent nonallelic homologous recombination (NAHR)-mediated copy-number variants (CNVs). The most commonly observed recurrent CNVs were NPHP1 duplications (233), CHRNA7 duplications (175), and 22q11.21 deletions (DiGeorge/velocardiofacial syndrome, 166). In the ∼25% of CMA cases for which parental studies were available, we identified 190 de novo recurrent CNVs. In this group, the most frequently observed events were deletions of 22q11.21 (48), 16p11.2 (autism, 34), and 7q11.23 (Williams-Beuren syndrome, 11). Several features of DP-LCRs, including length, distance between NAHR substrate elements, DNA sequence identity (fraction matching), GC content, and concentration of the homologous recombination (HR) hot spot motif 5'-CCNCCNTNNCCNC-3', correlate with the frequencies of the recurrent CNVs events. Four novel adjacent DP-LCR-flanked and NAHR-prone regions, involving 2q12.2q13, were elucidated in association with novel genomic disorders. Our study quantitates genome architectural features responsible for NAHR-mediated genomic instability and further elucidates the role of NAHR in human disease.

  11. High-Throughput Sequencing and Linkage Mapping of a Clownfish Genome Provide Insights on the Distribution of Molecular Players Involved in Sex Change.

    PubMed

    Casas, Laura; Saenz-Agudelo, Pablo; Irigoien, Xabier

    2018-03-06

    Clownfishes are an excellent model system for investigating the genetic mechanism governing hermaphroditism and socially-controlled sex change in their natural environment because they are broadly distributed and strongly site-attached. Genomic tools, such as genetic linkage maps, allow fine-mapping of loci involved in molecular pathways underlying these reproductive processes. In this study, a high-density genetic map of Amphiprion bicinctus was constructed with 3146 RAD markers in a full-sib family organized in 24 robust linkage groups which correspond to the haploid chromosome number of the species. The length of the map was 4294.71 cM, with an average marker interval of 1.38 cM. The clownfish linkage map showed various levels of conserved synteny and collinearity with the genomes of Asian and European seabass, Nile tilapia and stickleback. The map provided a platform to investigate the genomic position of genes with differential expression during sex change in A. bicinctus. This study aims to bridge the gap of genome-scale information for this iconic group of species to facilitate the study of the main gene regulatory networks governing social sex change and gonadal restructuring in protandrous hermaphrodites.

  12. Genome wide analysis of flowering time trait in multiple environments via high-throughput genotyping technique in Brassica napus L.

    PubMed

    Li, Lun; Long, Yan; Zhang, Libin; Dalton-Morgan, Jessica; Batley, Jacqueline; Yu, Longjiang; Meng, Jinling; Li, Maoteng

    2015-01-01

    The prediction of the flowering time (FT) trait in Brassica napus based on genome-wide markers and the detection of underlying genetic factors is important not only for oilseed producers around the world but also for the other crop industry in the rotation system in China. In previous studies the low density and mixture of biomarkers used obstructed genomic selection in B. napus and comprehensive mapping of FT related loci. In this study, a high-density genome-wide SNP set was genotyped from a double-haploid population of B. napus. We first performed genomic prediction of FT traits in B. napus using SNPs across the genome under ten environments of three geographic regions via eight existing genomic predictive models. The results showed that all the models achieved comparably high accuracies, verifying the feasibility of genomic prediction in B. napus. Next, we performed a large-scale mapping of FT related loci among three regions, and found 437 associated SNPs, some of which represented known FT genes, such as AP1 and PHYE. The genes tagged by the associated SNPs were enriched in biological processes involved in the formation of flowers. Epistasis analysis showed that significant interactions were found between detected loci, even among some known FT related genes. All the results showed that our large scale and high-density genotype data are of great practical and scientific values for B. napus. To our best knowledge, this is the first evaluation of genomic selection models in B. napus based on a high-density SNP dataset and large-scale mapping of FT loci.

  13. Gene Expansion Shapes Genome Architecture in the Human Pathogen Lichtheimia corymbifera: An Evolutionary Genomics Analysis in the Ancient Terrestrial Mucorales (Mucoromycotina)

    PubMed Central

    Wehner, Stefanie; Linde, Jörg; Valiante, Vito; Sammeth, Michael; Riege, Konstantin; Nowrousian, Minou; Kaerger, Kerstin; Jacobsen, Ilse D.; Marz, Manja; Brakhage, Axel A.; Gabaldón, Toni; Böcker, Sebastian; Voigt, Kerstin

    2014-01-01

    Lichtheimia species are the second most important cause of mucormycosis in Europe. To provide broader insights into the molecular basis of the pathogenicity-associated traits of the basal Mucorales, we report the full genome sequence of L. corymbifera and compared it to the genome of Rhizopus oryzae, the most common cause of mucormycosis worldwide. The genome assembly encompasses 33.6 MB and 12,379 protein-coding genes. This study reveals four major differences of the L. corymbifera genome to R. oryzae: (i) the presence of an highly elevated number of gene duplications which are unlike R. oryzae not due to whole genome duplication (WGD), (ii) despite the relatively high incidence of introns, alternative splicing (AS) is not frequently observed for the generation of paralogs and in response to stress, (iii) the content of repetitive elements is strikingly low (<5%), (iv) L. corymbifera is typically haploid. Novel virulence factors were identified which may be involved in the regulation of the adaptation to iron-limitation, e.g. LCor01340.1 encoding a putative siderophore transporter and LCor00410.1 involved in the siderophore metabolism. Genes encoding the transcription factors LCor08192.1 and LCor01236.1, which are similar to GATA type regulators and to calcineurin regulated CRZ1, respectively, indicating an involvement of the calcineurin pathway in the adaption to iron limitation. Genes encoding MADS-box transcription factors are elevated up to 11 copies compared to the 1–4 copies usually found in other fungi. More findings are: (i) lower content of tRNAs, but unique codons in L. corymbifera, (ii) Over 25% of the proteins are apparently specific for L. corymbifera. (iii) L. corymbifera contains only 2/3 of the proteases (known to be essential virulence factors) in comparision to R. oryzae. On the other hand, the number of secreted proteases, however, is roughly twice as high as in R. oryzae. PMID:25121733

  14. Rhipicephalus (Boophilus) microplus strain Deutsch, whole genome shotgun sequencing project first submission of genome sequence

    USDA-ARS?s Scientific Manuscript database

    The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence difficult. Cot filtration/selection techniques were used to reduce the repetitive fraction of the tick genome and enrich for the fraction of DNA with gene-containing regions. The Cot-selected ...

  15. Size matters: How population size influences genotype–phenotype association studies in anonymized data

    PubMed Central

    Denny, Joshua C.; Haines, Jonathan L.; Roden, Dan M.; Malin, Bradley A.

    2014-01-01

    Objective Electronic medical records (EMRs) data is increasingly incorporated into genome-phenome association studies. Investigators hope to share data, but there are concerns it may be “re-identified” through the exploitation of various features, such as combinations of standardized clinical codes. Formal anonymization algorithms (e.g., k-anonymization) can prevent such violations, but prior studies suggest that the size of the population available for anonymization may influence the utility of the resulting data. We systematically investigate this issue using a large-scale biorepository and EMR system through which we evaluate the ability of researchers to learn from anonymized data for genome- phenome association studies under various conditions. Methods We use a k-anonymization strategy to simulate a data protection process (on data sets containing clinical codes) for resources of similar size to those found at nine academic medical institutions within the United States. Following the protection process, we replicate an existing genome-phenome association study and compare the discoveries using the protected data and the original data through the correlation (r2) of the p-values of association significance. Results Our investigation shows that anonymizing an entire dataset with respect to the population from which it is derived yields significantly more utility than small study-specific datasets anonymized unto themselves. When evaluated using the correlation of genome-phenome association strengths on anonymized data versus original data, all nine simulated sites, results from largest-scale anonymizations (population ∼ 100;000) retained better utility to those on smaller sizes (population ∼ 6000—75;000). We observed a general trend of increasing r2 for larger data set sizes: r2 = 0.9481 for small-sized datasets, r2 = 0.9493 for moderately-sized datasets, r2 = 0.9934 for large-sized datasets. Conclusions This research implies that regardless of the

  16. Comparative genomic data of the Avian Phylogenomics Project.

    PubMed

    Zhang, Guojie; Li, Bo; Li, Cai; Gilbert, M Thomas P; Jarvis, Erich D; Wang, Jun

    2014-01-01

    The evolutionary relationships of modern birds are among the most challenging to understand in systematic biology and have been debated for centuries. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders, and used the genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomics analyses (Jarvis et al. in press; Zhang et al. in press). Here we release assemblies and datasets associated with the comparative genome analyses, which include 38 newly sequenced avian genomes plus previously released or simultaneously released genomes of Chicken, Zebra finch, Turkey, Pigeon, Peregrine falcon, Duck, Budgerigar, Adelie penguin, Emperor penguin and the Medium Ground Finch. We hope that this resource will serve future efforts in phylogenomics and comparative genomics. The 38 bird genomes were sequenced using the Illumina HiSeq 2000 platform and assembled using a whole genome shotgun strategy. The 48 genomes were categorized into two groups according to the N50 scaffold size of the assemblies: a high depth group comprising 23 species sequenced at high coverage (>50X) with multiple insert size libraries resulting in N50 scaffold sizes greater than 1 Mb (except the White-throated Tinamou and Bald Eagle); and a low depth group comprising 25 species sequenced at a low coverage (~30X) with two insert size libraries resulting in an average N50 scaffold size of about 50 kb. Repetitive elements comprised 4%-22% of the bird genomes. The assembled scaffolds allowed the homology-based annotation of 13,000 ~ 17000 protein coding genes in each avian genome relative to chicken, zebra finch and human, as well as comparative and sequence conservation analyses. Here we release full genome assemblies of 38 newly sequenced avian species, link genome assembly downloads for the 7 of the remaining 10 species, and provide a guideline of

  17. Genomes in turmoil: quantification of genome dynamics in prokaryote supergenomes.

    PubMed

    Puigbò, Pere; Lobkovsky, Alexander E; Kristensen, David M; Wolf, Yuri I; Koonin, Eugene V

    2014-08-21

    Genomes of bacteria and archaea (collectively, prokaryotes) appear to exist in incessant flux, expanding via horizontal gene transfer and gene duplication, and contracting via gene loss. However, the actual rates of genome dynamics and relative contributions of different types of event across the diversity of prokaryotes are largely unknown, as are the sizes of microbial supergenomes, i.e. pools of genes that are accessible to the given microbial species. We performed a comprehensive analysis of the genome dynamics in 35 groups (34 bacterial and one archaeal) of closely related microbial genomes using a phylogenetic birth-and-death maximum likelihood model to quantify the rates of gene family gain and loss, as well as expansion and reduction. The results show that loss of gene families dominates the evolution of prokaryotes, occurring at approximately three times the rate of gain. The rates of gene family expansion and reduction are typically seven and twenty times less than the gain and loss rates, respectively. Thus, the prevailing mode of evolution in bacteria and archaea is genome contraction, which is partially compensated by the gain of new gene families via horizontal gene transfer. However, the rates of gene family gain, loss, expansion and reduction vary within wide ranges, with the most stable genomes showing rates about 25 times lower than the most dynamic genomes. For many groups, the supergenome estimated from the fraction of repetitive gene family gains includes about tenfold more gene families than the typical genome in the group although some groups appear to have vast, 'open' supergenomes. Reconstruction of evolution for groups of closely related bacteria and archaea reveals an extremely rapid and highly variable flux of genes in evolving microbial genomes, demonstrates that extensive gene loss and horizontal gene transfer leading to innovation are the two dominant evolutionary processes, and yields robust estimates of the supergenome size.

  18. Induction of diploid gynogenesis in an evolutionary model organism, the three-spined stickleback (Gasterosteus aculeatus)

    PubMed Central

    2011-01-01

    Background Rapid advances in genomics have provided nearly complete genome sequences for many different species. However, no matter how the sequencing technology has improved, natural genetic polymorphism complicates the production of high quality reference genomes. To address this problem, researchers have tried using artificial modes of genome manipulation such as gynogenesis for fast production of inbred lines. Results Here, we present the first successful induction of diploid gynogenesis in an evolutionary model system, the three-spined sticklebacks (Gasterosteus aculeatus), using a combination of UV-irradiation of the sperm and heat shock (HS) of the resulting embryo to inhibit the second meiotic division. Optimal UV irradiation of the sperm was established by exposing stickleback sperm to a UV- light source at various times. Heat shock parameters like temperature, duration, and time of initiation were tested by subjecting eggs fertilized with UV inactivated sperm 5, 10, 15, 20, 25, or 30 minutes post fertilization (mpf) to 30°C, 34°C, or 38°C for 2, 4, 6 or 8 minutes. Gynogen yield was highest when stickleback eggs were activated with 2 minutes UV-irradiated sperm and received HS 5 mpf at 34°C for 4 minutes. Conclusions Diploid gynogenesis has been successfully performed in three-spined stickleback. This has been confirmed by microsatellite DNA analysis which revealed exclusively maternal inheritance in all gynogenetic fry tested. Ploidy verification by flow cytometry showed that gynogenetic embryos/larvae exhibiting abnormalities were haploids and those that developed normally were diploids, i.e., double haploids that can be raised until adult size. PMID:21910888

  19. The draft genome and transcriptome of Cannabis sativa

    PubMed Central

    2011-01-01

    Background Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored. Results We sequenced genomic DNA and RNA from the marijuana strain Purple Kush using shortread approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp. Conclusions The availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics. PMID:22014239

  20. Whole-genome alignment.

    PubMed

    Dewey, Colin N

    2012-01-01

    Whole-genome alignment (WGA) is the prediction of evolutionary relationships at the nucleotide level between two or more genomes. It combines aspects of both colinear sequence alignment and gene orthology prediction, and is typically more challenging to address than either of these tasks due to the size and complexity of whole genomes. Despite the difficulty of this problem, numerous methods have been developed for its solution because WGAs are valuable for genome-wide analyses, such as phylogenetic inference, genome annotation, and function prediction. In this chapter, we discuss the meaning and significance of WGA and present an overview of the methods that address it. We also examine the problem of evaluating whole-genome aligners and offer a set of methodological challenges that need to be tackled in order to make the most effective use of our rapidly growing databases of whole genomes.

  1. Evolution of recombination rates in a multi-locus, haploid-selection, symmetric-viability model.

    PubMed

    Chasnov, J R; Ye, Felix Xiaofeng

    2013-02-01

    A fast algorithm for computing multi-locus recombination is extended to include a recombination-modifier locus. This algorithm and a linear stability analysis is used to investigate the evolution of recombination rates in a multi-locus, haploid-selection, symmetric-viability model for which stable equilibria have recently been determined. When the starting equilibrium is symmetric with two selected loci, we show analytically that modifier alleles that reduce recombination always invade. When the starting equilibrium is monomorphic, and there is a fixed nonzero recombination rate between the modifier locus and the selected loci, we determine analytical conditions for which a modifier allele can invade. In particular, we show that a gap exists between the recombination rates of modifiers that can invade and the recombination rate that specifies the lower stability boundary of the monomorphic equilibrium. A numerical investigation shows that a similar gap exists in a weakened form when the starting equilibrium is fully polymorphic but asymmetric. Copyright © 2012 Elsevier Inc. All rights reserved.

  2. Haploid deletion strains of Saccharomyces cerevisiae that determine survival during space flight

    NASA Astrophysics Data System (ADS)

    Johanson, Kelly; Allen, Patricia L.; Gonzalez-Villalobos, Romer A.; Nesbit, Jacqueline; Nickerson, Cheryl A.; Höner zu Bentrup, Kerstin; Wilson, James W.; Ramamurthy, Rajee; D'Elia, Riccardo; Muse, Kenneth E.; Hammond, Jeffrey; Freeman, Jake; Stodieck, Louis S.; Hammond, Timothy G.

    2007-02-01

    This study identifies genes that determine survival during a space flight, using the model eukaryotic organism, Saccharomyces cerevisiae. Select strains of a haploid yeast deletion series grew during storage in distilled water in space, but not in ground based static or clinorotation controls. The survival advantages in space in distilled water include a 133-fold advantage for the deletion of PEX19, a chaperone and import receptor for newly- synthesized class I peroxisomal membrane proteins, to 77-40 fold for deletion strains lacking elements of aerobic respiration, isocitrate metabolism, and mitochondrial electron transport. Following automated addition of rich growth media, the space flight was associated with a marked survival advantage of strains with deletions in catalytically active genes including hydrolases, oxidoreductases and transferases. When compared to static controls, space flight was associated with a marked survival disadvantage of deletion strains lacking transporter, antioxidant and catalytic activity. This study identifies yeast deletion strains with a survival advantage during storage in distilled water and space flight, and amplifies our understanding of the genes critical for survival in space.

  3. Expression of genomic AtCYCD2;1 in Arabidopsis induces cell division at smaller cell sizes: implications for the control of plant growth.

    PubMed

    Qi, Ruhu; John, Peter Crook Lloyd

    2007-07-01

    The Arabidopsis (Arabidopsis thaliana) CYCD2;1 gene introduced in genomic form increased cell formation in the Arabidopsis root apex and leaf, while generating full-length mRNA, raised CDK/CYCLIN enzyme activity, reduced G1-phase duration, and reduced size of cells at S phase and division. Other cell cycle genes, CDKA;1, CYCLIN B;1, and the cDNA form of CYCD2;1 that produced an aberrantly spliced mRNA, produced smaller or zero increases in CDK/CYCLIN activity and did not increase the number of cells formed. Plants with a homozygous single insert of genomic CYCD2;1 grew with normal morphology and without accelerated growth of root or shoot, not providing evidence that cell formation or CYCLIN D2 controls growth of postembryonic vegetative tissues. At the root apex, cells progressed normally from meristem to elongation, but their smaller size enclosed less growth and a 40% reduction in final size of epidermal and cortical cells was seen. Smaller elongated cell size inhibited endoreduplication, indicating a cell size requirement. Leaf cells were also smaller and more numerous during proliferation and epidermal pavement and palisade cells attained 59% and 69% of controls, whereas laminas reached normal size. Autonomous control of expansion was therefore not evident in abundant cell types that formed tissues of root or leaf. Cell size was reduced by a greater number formed in a tissue prior to cell and tissue expansion. Initiation and termination of expansion did not correlate with cell dimension or number and may be determined by tissue-wide signals acting across cellular boundaries.

  4. A Major Locus for Manganese Tolerance Maps on Chromosome A09 in a Doubled Haploid Population of Brassica napus L.

    PubMed

    Raman, Harsh; Raman, Rosy; McVittie, Brett; Orchard, Beverley; Qiu, Yu; Delourme, Regine

    2017-01-01

    Soil acidity poses a major threat to productivity of several crops; mainly due to the prevalence of toxic levels of Al 3+ and Mn 2+ . Crop productivity could be harnessed on acid soils via the development of plant varieties tolerant to phytotoxic levels of these cations. In this study, we investigated the extent of natural variation for Mn 2+ tolerance among ten parental lines of the Australian and International canola mapping populations. Response to Mn 2+ toxicity was measured on the bases of cotyledon chlorosis, shoot biomass, and leaf area in nutrient solution under control (9 μM of MnCl 2 ⋅4H 2 O) and Mn treatment (125 μM of MnCl 2 ⋅4H 2 O). Among parental lines, we selected Darmor- bzh and Yudal that showed significant and contrasting variation in Mn 2+ tolerance to understand genetic control and identify the quantitative trait loci (QTL) underlying Mn 2+ tolerance. We evaluated parental lines and their doubled haploid (DH) progenies (196 lines) derived from an F 1 cross, Darmor- bzh /Yudal for Mn 2+ tolerance. Mn 2+ -tolerant genotypes had significantly higher shoot biomass and leaf area compared to Mn 2+ -sensitive genotypes. A genetic linkage map based on 7,805 DArTseq markers corresponding to 2,094 unique loci was constructed and further utilized for QTL identification. A major locus, BnMn 2+ . A09 was further mapped with a SNP marker, Bn-A09-p29012402 (LOD score of 34.6) accounting for most of the variation in Mn 2+ tolerance on chromosome A09. This is the first report on the genomic localization of a Mn 2+ tolerance locus in B. napus . Additionally, an ortholog of A. thaliana encoding for cation efflux facilitator transporter was located within 3,991 bp from significant SNP marker associated with BnMn 2+ . A09 . A suite of genome sequence based markers (DArTseq and Illumina Infinium SNPs) flanking the BnMn 2+ . A09 locus would provide an invaluable tool for various molecular breeding applications to improve canola production and profitability on

  5. A Major Locus for Manganese Tolerance Maps on Chromosome A09 in a Doubled Haploid Population of Brassica napus L.

    PubMed Central

    Raman, Harsh; Raman, Rosy; McVittie, Brett; Orchard, Beverley; Qiu, Yu; Delourme, Regine

    2017-01-01

    Soil acidity poses a major threat to productivity of several crops; mainly due to the prevalence of toxic levels of Al3+ and Mn2+. Crop productivity could be harnessed on acid soils via the development of plant varieties tolerant to phytotoxic levels of these cations. In this study, we investigated the extent of natural variation for Mn2+ tolerance among ten parental lines of the Australian and International canola mapping populations. Response to Mn2+ toxicity was measured on the bases of cotyledon chlorosis, shoot biomass, and leaf area in nutrient solution under control (9 μM of MnCl2⋅4H2O) and Mn treatment (125 μM of MnCl2⋅4H2O). Among parental lines, we selected Darmor-bzh and Yudal that showed significant and contrasting variation in Mn2+ tolerance to understand genetic control and identify the quantitative trait loci (QTL) underlying Mn2+ tolerance. We evaluated parental lines and their doubled haploid (DH) progenies (196 lines) derived from an F1 cross, Darmor-bzh/Yudal for Mn2+ tolerance. Mn2+-tolerant genotypes had significantly higher shoot biomass and leaf area compared to Mn2+-sensitive genotypes. A genetic linkage map based on 7,805 DArTseq markers corresponding to 2,094 unique loci was constructed and further utilized for QTL identification. A major locus, BnMn2+.A09 was further mapped with a SNP marker, Bn-A09-p29012402 (LOD score of 34.6) accounting for most of the variation in Mn2+ tolerance on chromosome A09. This is the first report on the genomic localization of a Mn2+ tolerance locus in B. napus. Additionally, an ortholog of A. thaliana encoding for cation efflux facilitator transporter was located within 3,991 bp from significant SNP marker associated with BnMn2+.A09. A suite of genome sequence based markers (DArTseq and Illumina Infinium SNPs) flanking the BnMn2+.A09 locus would provide an invaluable tool for various molecular breeding applications to improve canola production and profitability on Mn2+ toxic soils. PMID:29312361

  6. Comparison of the Exomes of Common Carp (Cyprinus carpio) and Zebrafish (Danio rerio)

    PubMed Central

    Henkel, Christiaan V.; Dirks, Ron P.; Jansen, Hans J.; Forlenza, Maria; Wiegertjes, Geert F.; Howe, Kerstin; van den Thillart, Guido E.E.J.M.

    2012-01-01

    Abstract Research on common carp, Cyprinus carpio, is beneficial for zebrafish research because of resources available owing to its large body size, such as the availability of sufficient organ material for transcriptomics, proteomics, and metabolomics. Here we describe the shot gun sequencing of a clonal double-haploid common carp line. The assembly consists of 511891 scaffolds with an N50 of 17 kb, predicting a total genome size of 1.4–1.5 Gb. A detailed analysis of the ten largest scaffolds indicates that the carp genome has a considerably lower repeat coverage than zebrafish, whilst the average intron size is significantly smaller, making it comparable to the fugu genome. The quality of the scaffolding was confirmed by comparisons with RNA deep sequencing data sets and a manual analysis for synteny with the zebrafish, especially the Hox gene clusters. In the ten largest scaffolds analyzed, the synteny of genes is almost complete. Comparisons of predicted exons of common carp with those of the zebrafish revealed only few genes specific for either zebrafish or carp, most of these being of unknown function. This supports the hypothesis of an additional genome duplication event in the carp evolutionary history, which—due to a higher degree of compactness—did not result in a genome larger than that of zebrafish. PMID:22715948

  7. Genetic map of Triticum turgidum based on a hexaploid wheat population without genetic recombination for D genome.

    PubMed

    Zhang, Li; Luo, Jiang-Tao; Hao, Ming; Zhang, Lian-Quan; Yuan, Zhong-Wei; Yan, Ze-Hong; Liu, Ya-Xi; Zhang, Bo; Liu, Bao-Long; Liu, Chun-Ji; Zhang, Huai-Gang; Zheng, You-Liang; Liu, Deng-Cai

    2012-08-13

    A synthetic doubled-haploid hexaploid wheat population, SynDH1, derived from the spontaneous chromosome doubling of triploid F1 hybrid plants obtained from the cross of hybrids Triticum turgidum ssp. durum line Langdon (LDN) and ssp. turgidum line AS313, with Aegilops tauschii ssp. tauschii accession AS60, was previously constructed. SynDH1 is a tetraploidization-hexaploid doubled haploid (DH) population because it contains recombinant A and B chromosomes from two different T. turgidum genotypes, while all the D chromosomes from Ae. tauschii are homogenous across the whole population. This paper reports the construction of a genetic map using this population. Of the 606 markers used to assemble the genetic map, 588 (97%) were assigned to linkage groups. These included 513 Diversity Arrays Technology (DArT) markers, 72 simple sequence repeat (SSR), one insertion site-based polymorphism (ISBP), and two high-molecular-weight glutenin subunit (HMW-GS) markers. These markers were assigned to the 14 chromosomes, covering 2048.79 cM, with a mean distance of 3.48 cM between adjacent markers. This map showed good coverage of the A and B genome chromosomes, apart from 3A, 5A, 6A, and 4B. Compared with previously reported maps, most shared markers showed highly consistent orders. This map was successfully used to identify five quantitative trait loci (QTL), including two for spikelet number on chromosomes 7A and 5B, two for spike length on 7A and 3B, and one for 1000-grain weight on 4B. However, differences in crossability QTL between the two T. turgidum parents may explain the segregation distortion regions on chromosomes 1A, 3B, and 6B. A genetic map of T. turgidum including 588 markers was constructed using a synthetic doubled haploid (SynDH) hexaploid wheat population. Five QTLs for three agronomic traits were identified from this population. However, more markers are needed to increase the density and resolution of this map in the future study.

  8. An informational diversity framework, illustrated with sexually deceptive orchids in early stages of speciation.

    PubMed

    Smouse, Peter E; Whitehead, Michael R; Peakall, Rod

    2015-11-01

    Reconstructing evolutionary history for emerging species complexes is notoriously difficult, with newly isolated taxa often morphologically cryptic and the signature of reproductive isolation often restricted to a few genes. Evidence from multiple loci and genomes is highly desirable, but multiple inputs require 'common currency' translation. Here we deploy a Shannon information framework, converting into diversity analogue, which provides a common currency analysis for maternally inherited haploid and bi-parentally inherited diploid nuclear markers, and then extend that analysis to construction of minimum-spanning networks for both genomes. The new approach is illustrated with a quartet of cryptic congeners from the sexually deceptive Australian orchid genus Chiloglottis, still in the early stages of speciation. Divergence is more rapid for haploid plastids than for nuclear markers, consistent with the effective population size differential (N(ep) < (N(en)), but divergence patterns are broadly correlated for the two genomes. There are nevertheless intriguing discrepancies between the emerging plastid and nuclear signals of early phylogenetic radiation of these taxa, and neither pattern is entirely consistent with the available information on the sexual cues used by the orchids to lure the pollinators enforcing reproductive isolation. We describe possible extensions of this methodology to multiple ploidy levels and other types of markers, which should increase the range of application to any taxonomic assemblage in the very early stages of reproductive isolation and speciation. © 2015 John Wiley & Sons Ltd.

  9. Second generation noninvasive fetal genome analysis reveals de novo mutations, single-base parental inheritance, and preferred DNA ends

    PubMed Central

    Chan, K. C. Allen; Jiang, Peiyong; Sun, Kun; Cheng, Yvonne K. Y.; Tong, Yu K.; Cheng, Suk Hang; Wong, Ada I. C.; Hudecova, Irena; Leung, Tak Y.; Chiu, Rossa W. K.; Lo, Yuk Ming Dennis

    2016-01-01

    Plasma DNA obtained from a pregnant woman was sequenced to a depth of 270× haploid genome coverage. Comparing the maternal plasma DNA sequencing data with the parental genomic DNA data and using a series of bioinformatics filters, fetal de novo mutations were detected at a sensitivity of 85% and a positive predictive value of 74%. These results represent a 169-fold improvement in the positive predictive value over previous attempts. Improvements in the interpretation of the sequence information of every base position in the genome allowed us to interrogate the maternal inheritance of the fetus for 618,271 of 656,676 (94.2%) heterozygous SNPs within the maternal genome. The fetal genotype at each of these sites was deduced individually, unlike previously, where the inheritance was determined for a collection of sites within a haplotype. These results represent a 90-fold enhancement in the resolution in determining the fetus’s maternal inheritance. Selected genomic locations were more likely to be found at the ends of plasma DNA molecules. We found that a subset of such preferred ends exhibited selectivity for fetal- or maternal-derived DNA in maternal plasma. The ratio of the number of maternal plasma DNA molecules with fetal preferred ends to those with maternal preferred ends showed a correlation with the fetal DNA fraction. Finally, this second generation approach for noninvasive fetal whole-genome analysis was validated in a pregnancy diagnosed with cardiofaciocutaneous syndrome with maternal plasma DNA sequenced to 195× coverage. The causative de novo BRAF mutation was successfully detected through the maternal plasma DNA analysis. PMID:27799561

  10. Signatures of host specialization and a recent transposable element burst in the dynamic one-speed genome of the fungal barley powdery mildew pathogen.

    PubMed

    Frantzeskakis, Lamprinos; Kracher, Barbara; Kusch, Stefan; Yoshikawa-Maekawa, Makoto; Bauer, Saskia; Pedersen, Carsten; Spanu, Pietro D; Maekawa, Takaki; Schulze-Lefert, Paul; Panstruga, Ralph

    2018-05-22

    Powdery mildews are biotrophic pathogenic fungi infecting a number of economically important plants. The grass powdery mildew, Blumeria graminis, has become a model organism to study host specialization of obligate biotrophic fungal pathogens. We resolved the large-scale genomic architecture of B. graminis forma specialis hordei (Bgh) to explore the potential influence of its genome organization on the co-evolutionary process with its host plant, barley (Hordeum vulgare). The near-chromosome level assemblies of the Bgh reference isolate DH14 and one of the most diversified isolates, RACE1, enabled a comparative analysis of these haploid genomes, which are highly enriched with transposable elements (TEs). We found largely retained genome synteny and gene repertoires, yet detected copy number variation (CNV) of secretion signal peptide-containing protein-coding genes (SPs) and locally disrupted synteny blocks. Genes coding for sequence-related SPs are often locally clustered, but neither the SPs nor the TEs reside preferentially in genomic regions with unique features. Extended comparative analysis with different host-specific B. graminis formae speciales revealed the existence of a core suite of SPs, but also isolate-specific SP sets as well as congruence of SP CNV and phylogenetic relationship. We further detected evidence for a recent, lineage-specific expansion of TEs in the Bgh genome. The characteristics of the Bgh genome (largely retained synteny, CNV of SP genes, recently proliferated TEs and a lack of significant compartmentalization) are consistent with a "one-speed" genome that differs in its architecture and (co-)evolutionary pattern from the "two-speed" genomes reported for several other filamentous phytopathogens.

  11. Abr1, a Transposon-Like Element in the Genome of the Cultivated Mushroom Agaricus bisporus (Lange) Imbach

    PubMed Central

    Sonnenberg, Anton S. M.; Baars, Johan J. P.; Mikosch, Thomas S. P.; Schaap, Peter J.; Van Griensven, Leo J. L. D.

    1999-01-01

    A 300-bp repetitive element was found in the genome of the white button mushroom, Agaricus bisporus, and designated Abr1. It is present in ∼15 copies per haploid genome in the commercial strain Horst U1. Analysis of seven copies showed 89 to 97% sequence identity. The repeat has features typical of class II transposons (i.e., terminal inverted repeats, subterminal repeats, and a target site duplication of 7 bp). The latter shows a consensus sequence. When used as probe on Southern blots, Abr1 identifies relatively little variation within traditional and present-day commercial strains, indicating that most strains are identical or have a common origin. In contrast to these cultivars, high variation is found among field-collected strains. Furthermore, a remarkable difference in copy numbers of Abr1 was found between A. bisporus isolates with a secondarily homothallic life cycle and those with a heterothallic life cycle. Abr1 is a type II transposon not previously reported in basidiomycetes and appears to be useful for the identification of strains within the species A. bisporus. PMID:10427018

  12. Genome survey sequencing of red swamp crayfish Procambarus clarkii.

    PubMed

    Shi, Linlin; Yi, Shaokui; Li, Yanhe

    2018-06-21

    Red swamp crayfish, Procambarus clarkii, presently is an important aquatic commercial species in China. The crayfish is a hot area of research focus, and its genetic improvement is quite urgent for the crayfish aquaculture in China. However, the knowledge of its genomic landscape is limited. In this study, a survey of P. clarkii genome was investigated based on Illumina's Solexa sequencing platform. Meanwhile, its genome size was estimated using flow cytometry. Interestingly, the genome size estimated is about 8.50 Gb by flow cytometry and 1.86 Gb with genome survey sequencing. Based on the assembled genome sequences, total of 136,962 genes and 152,268 exons were predicted, and the predicted genes ranged from 150 to 12,807 bp in length. The survey sequences could help accelerate the progress of gene discovery involved in genetic diversity and evolutionary analysis, even though it could not successfully applied for estimation of P. clarkii genome size.

  13. Karyological evidence of hybridogenesis in Greenlings (Teleostei: Hexagrammidae).

    PubMed

    Suzuki, Shota; Arai, Katsutoshi; Munehara, Hiroyuki

    2017-01-01

    Two types of natural hybrids were discovered in populations of three Hexagrammos species (Teleostei: Hexagrammidae) distributed off the southern coast of Hokkaido in the North Pacific Ocean. Both hybrids reproduce by hybridogenesis, in which the maternal haploid genome is transmitted to offspring without recombination and the paternal haploid genome is eliminated during gametogenesis. While natural hybrids are unisexual and reproduce hemiclonally by backcrossing with the paternal species (BC-P), artificial F1-hybrids between the pure species produce recombinant gametes. Thus, despite having the same genome composition, the natural hybrids and the F1-hybrids are not genetically identical. Here, to clarify the differences between both hybrids, we examined the karyotypes of the three Hexagrammos species, their natural hybrids, the artificial F1-hybrids, and several backcrosses. Artificial F1-hybrids have karyotypes and chromosome numbers that are intermediate between those of the parental species. Conversely, the natural hybrids differed from F1-hybrids by having several large metacentric chromosomes and microchromosomes. Since the entire maternal haploid genome is inherited by the natural hybrids, maternal backcrosses (BC-M) between natural hybrids and males of the maternal species (H. octogrammus; Hoc) have a hemiclonal Hoc genome with large chromosomes from the mother and a normal Hoc genome from the father. However, the large chromosomes disappear in offspring of BC-M, probably due to fissuring during gametogenesis. Similarly, microsatellite DNA analysis revealed that chromosomes of BC-M undergo recombination. These findings suggest that genetic factors associated with hemiclonal reproduction may be located on the large metacentric chromosomes of natural hybrids.

  14. Are there laws of genome evolution?

    PubMed

    Koonin, Eugene V

    2011-08-01

    Research in quantitative evolutionary genomics and systems biology led to the discovery of several universal regularities connecting genomic and molecular phenomic variables. These universals include the log-normal distribution of the evolutionary rates of orthologous genes; the power law-like distributions of paralogous family size and node degree in various biological networks; the negative correlation between a gene's sequence evolution rate and expression level; and differential scaling of functional classes of genes with genome size. The universals of genome evolution can be accounted for by simple mathematical models similar to those used in statistical physics, such as the birth-death-innovation model. These models do not explicitly incorporate selection; therefore, the observed universal regularities do not appear to be shaped by selection but rather are emergent properties of gene ensembles. Although a complete physical theory of evolutionary biology is inconceivable, the universals of genome evolution might qualify as "laws of evolutionary genomics" in the same sense "law" is understood in modern physics.

  15. Histone variant H3.3-mediated chromatin remodeling is essential for paternal genome activation in mouse preimplantation embryos.

    PubMed

    Kong, Qingran; Banaszynski, Laura A; Geng, Fuqiang; Zhang, Xiaolei; Zhang, Jiaming; Zhang, Heng; O'Neill, Claire L; Yan, Peidong; Liu, Zhonghua; Shido, Koji; Palermo, Gianpiero D; Allis, C David; Rafii, Shahin; Rosenwaks, Zev; Wen, Duancheng

    2018-03-09

    Derepression of chromatin-mediated transcriptional repression of paternal and maternal genomes is considered the first major step that initiates zygotic gene expression after fertilization. The histone variant H3.3 is present in both male and female gametes and is thought to be important for remodeling the paternal and maternal genomes for activation during both fertilization and embryogenesis. However, the underlying mechanisms remain poorly understood. Using our H3.3B-HA-tagged mouse model, engineered to report H3.3 expression in live animals and to distinguish different sources of H3.3 protein in embryos, we show here that sperm-derived H3.3 (sH3.3) protein is removed from the sperm genome shortly after fertilization and extruded from the zygotes via the second polar bodies (PBII) during embryogenesis. We also found that the maternal H3.3 (mH3.3) protein is incorporated into the paternal genome as early as 2 h postfertilization and is detectable in the paternal genome until the morula stage. Knockdown of maternal H3.3 resulted in compromised embryonic development both of fertilized embryos and of androgenetic haploid embryos. Furthermore, we report that mH3.3 depletion in oocytes impairs both activation of the Oct4 pluripotency marker gene and global de novo transcription from the paternal genome important for early embryonic development. Our results suggest that H3.3-mediated paternal chromatin remodeling is essential for the development of preimplantation embryos and the activation of the paternal genome during embryogenesis. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.

  16. Genome-derived vaccines.

    PubMed

    De Groot, Anne S; Rappuoli, Rino

    2004-02-01

    Vaccine research entered a new era when the complete genome of a pathogenic bacterium was published in 1995. Since then, more than 97 bacterial pathogens have been sequenced and at least 110 additional projects are now in progress. Genome sequencing has also dramatically accelerated: high-throughput facilities can draft the sequence of an entire microbe (two to four megabases) in 1 to 2 days. Vaccine developers are using microarrays, immunoinformatics, proteomics and high-throughput immunology assays to reduce the truly unmanageable volume of information available in genome databases to a manageable size. Vaccines composed by novel antigens discovered from genome mining are already in clinical trials. Within 5 years we can expect to see a novel class of vaccines composed by genome-predicted, assembled and engineered T- and Bcell epitopes. This article addresses the convergence of three forces--microbial genome sequencing, computational immunology and new vaccine technologies--that are shifting genome mining for vaccines onto the forefront of immunology research.

  17. Comparison of genome size and synthesis of structural proteins of Hirame Rhabdovirus, infectious hematopoietic necrosis virus, and viral hemorrhagic Septicemia virus

    USGS Publications Warehouse

    Nishizawa, Toyohiko; Yoshimizu, Mamoru; Winton, James R.; Kimura, Takahisa

    1991-01-01

    Genomic RNA was extracted from purified virions of hirame rhabdovirus (HRV), infectious hematopoietic necrosis virus (IHNV), and viral hemorrhagic septicemia virus (VHSV). The full-length RNA was analyzed using formaldehyde agarose gel electrophoresis followed by ethidium bromide staining. Compared with an internal RNA size standard, all three viral genomic RNAs appeared to have identical relative mobilities and were estimated to be approximately 10.7 kilobases in length or about 3.7 megadaltons in molecular mass. Structural protein synthesis of HRV, IHNV, and VHSV was studied using cell cultures treated with actinomycin D. At 2 h intervals, proteins were labeled with 35S-methionine, extracted, and analyzed by SDS-polyacrylamide gel electrophoresis and autoradiography. The five structural proteins of each of the three viruses appeared in the following order : nucleoprotein (N), matrix protein 1 (M1), matrix protein 2 (M2), glycoprotein (G), and polymerase (L) reflecting both the approximate relative abundance of each protein within infected cells and the gene order within the viral genome.

  18. Effect of phosphorus availability on the selection of species with different ploidy levels and genome sizes in a long-term grassland fertilization experiment.

    PubMed

    Šmarda, Petr; Hejcman, Michal; Březinová, Alexandra; Horová, Lucie; Steigerová, Helena; Zedek, František; Bureš, Petr; Hejcmanová, Pavla; Schellberg, Jürgen

    2013-11-01

    Polyploidy and increased genome size are hypothesized to increase organismal nutrient demands, namely of phosphorus (P), which is an essential and abundant component of nucleic acids. Therefore, polyploids and plants with larger genomes are expected to be selectively disadvantaged in P-limited environments. However, this hypothesis has yet to be experimentally tested. We measured the somatic DNA content and ploidy level in 74 vascular plant species in a long-term fertilization experiment. The differences between the fertilizer treatments regarding the DNA content and ploidy level of the established species were tested using phylogeny-based statistics. The percentage and biomass of polyploid species clearly increased with soil P in particular fertilizer treatments, and a similar but weaker trend was observed for the DNA content. These increases were associated with the dominance of competitive life strategy (particularly advantageous in the P-treated plots) in polyploids and the enhanced competitive ability of dominant polyploid grasses at high soil P concentrations, indicating their increased P limitation. Our results verify the hypothesized effect of P availability on the selection of polyploids and plants with increased genome sizes, although the relative contribution of increased P demands vs increased competitiveness as causes of the observed pattern requires further evaluation. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.

  19. Genome-Wide Structural Variation Detection by Genome Mapping on Nanochannel Arrays.

    PubMed

    Mak, Angel C Y; Lai, Yvonne Y Y; Lam, Ernest T; Kwok, Tsz-Piu; Leung, Alden K Y; Poon, Annie; Mostovoy, Yulia; Hastie, Alex R; Stedman, William; Anantharaman, Thomas; Andrews, Warren; Zhou, Xiang; Pang, Andy W C; Dai, Heng; Chu, Catherine; Lin, Chin; Wu, Jacob J K; Li, Catherine M L; Li, Jing-Woei; Yim, Aldrin K Y; Chan, Saki; Sibert, Justin; Džakula, Željko; Cao, Han; Yiu, Siu-Ming; Chan, Ting-Fung; Yip, Kevin Y; Xiao, Ming; Kwok, Pui-Yan

    2016-01-01

    Comprehensive whole-genome structural variation detection is challenging with current approaches. With diploid cells as DNA source and the presence of numerous repetitive elements, short-read DNA sequencing cannot be used to detect structural variation efficiently. In this report, we show that genome mapping with long, fluorescently labeled DNA molecules imaged on nanochannel arrays can be used for whole-genome structural variation detection without sequencing. While whole-genome haplotyping is not achieved, local phasing (across >150-kb regions) is routine, as molecules from the parental chromosomes are examined separately. In one experiment, we generated genome maps from a trio from the 1000 Genomes Project, compared the maps against that derived from the reference human genome, and identified structural variations that are >5 kb in size. We find that these individuals have many more structural variants than those published, including some with the potential of disrupting gene function or regulation. Copyright © 2016 by the Genetics Society of America.

  20. Novel Approaches to Breast Cancer Prevention and Inhibition of Metastases

    DTIC Science & Technology

    2013-10-01

    allow a functional characterization of human candidate breast cancer genes. The transgenic RNAi library is covering the whole Drosophila genome ...W81XWH-12-1-0093 / Penninger 15. SUBJECT TERMS Genome wide functional genetics, haploid stem cells, Drosophila cancer modeling...With the advent of modern genomics hundreds of candidate genes have been associated with breast cancer both in GWAS studies as well as by cancer genome

  1. Reference quality assembly of the 3.5-Gb genome of Capsicum annuum from a single linked-read library.

    PubMed

    Hulse-Kemp, Amanda M; Maheshwari, Shamoni; Stoffel, Kevin; Hill, Theresa A; Jaffe, David; Williams, Stephen R; Weisenfeld, Neil; Ramakrishnan, Srividya; Kumar, Vijay; Shah, Preyas; Schatz, Michael C; Church, Deanna M; Van Deynze, Allen

    2018-01-01

    Linked-Read sequencing technology has recently been employed successfully for de novo assembly of human genomes, however, the utility of this technology for complex plant genomes is unproven. We evaluated the technology for this purpose by sequencing the 3.5-gigabase (Gb) diploid pepper ( Capsicum annuum ) genome with a single Linked-Read library. Plant genomes, including pepper, are characterized by long, highly similar repetitive sequences. Accordingly, significant effort is used to ensure that the sequenced plant is highly homozygous and the resulting assembly is a haploid consensus. With a phased assembly approach, we targeted a heterozygous F 1 derived from a wide cross to assess the ability to derive both haplotypes and characterize a pungency gene with a large insertion/deletion. The Supernova software generated a highly ordered, more contiguous sequence assembly than all currently available C. annuum reference genomes. Over 83% of the final assembly was anchored and oriented using four publicly available  de novo linkage maps. A comparison of the annotation of conserved eukaryotic genes indicated the completeness of assembly. The validity of the phased assembly is further demonstrated with the complete recovery of both 2.5-Kb insertion/deletion haplotypes of the PUN1 locus in the F 1 sample that represents pungent and nonpungent peppers, as well as nearly full recovery of the BUSCO2 gene set within each of the two haplotypes. The most contiguous pepper genome assembly to date has been generated which demonstrates that Linked-Read library technology provides a tool to de novo assemble complex highly repetitive heterozygous plant genomes. This technology can provide an opportunity to cost-effectively develop high-quality genome assemblies for other complex plants and compare structural and gene differences through accurate haplotype reconstruction.

  2. Precise detection of de novo single nucleotide variants in human genomes.

    PubMed

    Gómez-Romero, Laura; Palacios-Flores, Kim; Reyes, José; García, Delfino; Boege, Margareta; Dávila, Guillermo; Flores, Margarita; Schatz, Michael C; Palacios, Rafael

    2018-05-22

    The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent-offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we developed an alternative approach to accurately identify single nucleotide variants (SNVs) using only perfect matches. However, this approach could be applied only to haploid regions of the genome and was computationally intensive. In this study, we present a unique approach, coverage-based single nucleotide variant identification (COBASI), which allows the exploration of the entire genome using second-generation short sequence reads without extensive computing requirements. COBASI identifies SNVs using changes in coverage of exactly matching unique substrings, and is particularly suited for pinpointing de novo SNVs. Unlike other approaches that require population frequencies across hundreds of samples to filter out any methodological biases, COBASI can be applied to detect de novo SNVs within isolated families. We demonstrate this capability through extensive simulation studies and by studying a parent-offspring trio we sequenced using short reads. Experimental validation of all 58 candidate de novo SNVs and a selection of non-de novo SNVs found in the trio confirmed zero FP calls. COBASI is available as open source at https://github.com/Laura-Gomez/COBASI for any researcher to use. Copyright © 2018 the Author(s). Published by PNAS.

  3. Comparative genomics reveals insights into avian genome evolution and adaptation

    PubMed Central

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  4. Theory of prokaryotic genome evolution.

    PubMed

    Sela, Itamar; Wolf, Yuri I; Koonin, Eugene V

    2016-10-11

    Bacteria and archaea typically possess small genomes that are tightly packed with protein-coding genes. The compactness of prokaryotic genomes is commonly perceived as evidence of adaptive genome streamlining caused by strong purifying selection in large microbial populations. In such populations, even the small cost incurred by nonfunctional DNA because of extra energy and time expenditure is thought to be sufficient for this extra genetic material to be eliminated by selection. However, contrary to the predictions of this model, there exists a consistent, positive correlation between the strength of selection at the protein sequence level, measured as the ratio of nonsynonymous to synonymous substitution rates, and microbial genome size. Here, by fitting the genome size distributions in multiple groups of prokaryotes to predictions of mathematical models of population evolution, we show that only models in which acquisition of additional genes is, on average, slightly beneficial yield a good fit to genomic data. These results suggest that the number of genes in prokaryotic genomes reflects the equilibrium between the benefit of additional genes that diminishes as the genome grows and deletion bias (i.e., the rate of deletion of genetic material being slightly greater than the rate of acquisition). Thus, new genes acquired by microbial genomes, on average, appear to be adaptive. The tight spacing of protein-coding genes likely results from a combination of the deletion bias and purifying selection that efficiently eliminates nonfunctional, noncoding sequences.

  5. Quantifying Temporal Genomic Erosion in Endangered Species.

    PubMed

    Díez-Del-Molino, David; Sánchez-Barreiro, Fatima; Barnes, Ian; Gilbert, M Thomas P; Dalén, Love

    2018-03-01

    Many species have undergone dramatic population size declines over the past centuries. Although stochastic genetic processes during and after such declines are thought to elevate the risk of extinction, comparative analyses of genomic data from several endangered species suggest little concordance between genome-wide diversity and current population sizes. This is likely because species-specific life-history traits and ancient bottlenecks overshadow the genetic effect of recent demographic declines. Therefore, we advocate that temporal sampling of genomic data provides a more accurate approach to quantify genetic threats in endangered species. Specifically, genomic data from predecline museum specimens will provide valuable baseline data that enable accurate estimation of recent decreases in genome-wide diversity, increases in inbreeding levels, and accumulation of deleterious genetic variation. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. RPAN: rice pan-genome browser for ∼3000 rice genomes.

    PubMed

    Sun, Chen; Hu, Zhiqiang; Zheng, Tianqing; Lu, Kuangchen; Zhao, Yue; Wang, Wensheng; Shi, Jianxin; Wang, Chunchao; Lu, Jinyuan; Zhang, Dabing; Li, Zhikang; Wei, Chaochun

    2017-01-25

    A pan-genome is the union of the gene sets of all the individuals of a clade or a species and it provides a new dimension of genome complexity with the presence/absence variations (PAVs) of genes among these genomes. With the progress of sequencing technologies, pan-genome study is becoming affordable for eukaryotes with large-sized genomes. The Asian cultivated rice, Oryza sativa L., is one of the major food sources for the world and a model organism in plant biology. Recently, the 3000 Rice Genome Project (3K RGP) sequenced more than 3000 rice genomes with a mean sequencing depth of 14.3×, which provided a tremendous resource for rice research. In this paper, we present a genome browser, Rice Pan-genome Browser (RPAN), as a tool to search and visualize the rice pan-genome derived from 3K RGP. RPAN contains a database of the basic information of 3010 rice accessions, including genomic sequences, gene annotations, PAV information and gene expression data of the rice pan-genome. At least 12 000 novel genes absent in the reference genome were included. RPAN also provides multiple search and visualization functions. RPAN can be a rich resource for rice biology and rice breeding. It is available at http://cgm.sjtu.edu.cn/3kricedb/ or http://www.rmbreeding.cn/pan3k. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Breeding and identification of novel koji molds with high activity of acid protease by genome recombination between Aspergillus oryzae and Aspergillus niger.

    PubMed

    Xu, Defeng; Pan, Li; Zhao, Haifeng; Zhao, Mouming; Sun, Jiaxin; Liu, Dongmei

    2011-09-01

    Acid protease is essential for degradation of proteins during soy sauce fermentation. To breed more suitable koji molds with high activity of acid protease, interspecific genome recombination between A. oryzae and A. niger was performed. Through stabilization with d-camphor and haploidization with benomyl, several stable fusants with higher activity of acid protease were obtained, showing different degrees of improvement in acid protease activity compared with the parental strain A. oryzae. In addition, analyses of mycelial morphology, expression profiles of extracellular proteins, esterase isoenzyme profiles, and random amplified polymorphic DNA (RAPD) were applied to identify the fusants through their phenotypic and genetic relationships. Morphology analysis of the mycelial shape of fusants indicated a phenotype intermediate between A. oryzae and A. niger. The profiles of extracellular proteins and esterase isoenzyme electrophoresis showed the occurrence of genome recombination during or after protoplast fusion. The dendrogram constructed from RAPD data revealed great heterogeneity, and genetic dissimilarity indices showed there were considerable differences between the fusants and their parental strains. This investigation suggests that genome recombination is a powerful tool for improvement of food-grade industrial strains. Furthermore, the presented strain improvement procedure will be applicable for widespread use for other industrial strains.

  8. Identification and quantification of genetically modified Moonshade carnation lines using conventional and TaqMan real-time polymerase chain reaction methods.

    PubMed

    Li, Peng; Jia, Junwei; Bai, Lan; Pan, Aihu; Tang, Xueming

    2013-07-01

    Genetically modified carnation (Dianthus caryophyllus L.) Moonshade was approved for planting and commercialization in several countries from 2004. Developing methods for analyzing Moonshade is necessary for implementing genetically modified organism labeling regulations. In this study, the 5'-transgene integration sequence was isolated using thermal asymmetric interlaced (TAIL)-PCR. Based upon the 5'-transgene integration sequence, conventional and TaqMan real-time PCR assays were established. The relative limit of detection for the conventional PCR assay was 0.05 % for Moonshade using 100 ng total carnation genomic DNA, corresponding to approximately 79 copies of the carnation haploid genome, and the limits of detection and quantification of the TaqMan real-time PCR assay were estimated to be 51 and 254 copies of haploid carnation genomic DNA, respectively. These results are useful for identifying and quantifying Moonshade and its derivatives.

  9. The cacao Criollo genome v2.0: an improved version of the genome for genetic and functional genomic studies.

    PubMed

    Argout, X; Martin, G; Droc, G; Fouet, O; Labadie, K; Rivals, E; Aury, J M; Lanaud, C

    2017-09-15

    Theobroma cacao L., native to the Amazonian basin of South America, is an economically important fruit tree crop for tropical countries as a source of chocolate. The first draft genome of the species, from a Criollo cultivar, was published in 2011. Although a useful resource, some improvements are possible, including identifying misassemblies, reducing the number of scaffolds and gaps, and anchoring un-anchored sequences to the 10 chromosomes. We used a NGS-based approach to significantly improve the assembly of the Belizian Criollo B97-61/B2 genome. We combined four Illumina large insert size mate paired libraries with 52x of Pacific Biosciences long reads to correct misassembled regions and reduced the number of scaffolds. We then used genotyping by sequencing (GBS) methods to increase the proportion of the assembly anchored to chromosomes. The scaffold number decreased from 4,792 in assembly V1 to 554 in V2 while the scaffold N50 size has increased from 0.47 Mb in V1 to 6.5 Mb in V2. A total of 96.7% of the assembly was anchored to the 10 chromosomes compared to 66.8% in the previous version. Unknown sites (Ns) were reduced from 10.8% to 5.7%. In addition, we updated the functional annotations and performed a new RefSeq structural annotation based on RNAseq evidence. Theobroma cacao Criollo genome version 2 will be a valuable resource for the investigation of complex traits at the genomic level and for future comparative genomics and genetics studies in cacao tree. New functional tools and annotations are available on the Cocoa Genome Hub ( http://cocoa-genome-hub.southgreen.fr ).

  10. Theory of microbial genome evolution

    NASA Astrophysics Data System (ADS)

    Koonin, Eugene

    Bacteria and archaea have small genomes tightly packed with protein-coding genes. This compactness is commonly perceived as evidence of adaptive genome streamlining caused by strong purifying selection in large microbial populations. In such populations, even the small cost incurred by nonfunctional DNA because of extra energy and time expenditure is thought to be sufficient for this extra genetic material to be eliminated by selection. However, contrary to the predictions of this model, there exists a consistent, positive correlation between the strength of selection at the protein sequence level, measured as the ratio of nonsynonymous to synonymous substitution rates, and microbial genome size. By fitting the genome size distributions in multiple groups of prokaryotes to predictions of mathematical models of population evolution, we show that only models in which acquisition of additional genes is, on average, slightly beneficial yield a good fit to genomic data. Thus, the number of genes in prokaryotic genomes seems to reflect the equilibrium between the benefit of additional genes that diminishes as the genome grows and deletion bias. New genes acquired by microbial genomes, on average, appear to be adaptive. Evolution of bacterial and archaeal genomes involves extensive horizontal gene transfer and gene loss. Many microbes have open pangenomes, where each newly sequenced genome contains more than 10% `ORFans', genes without detectable homologues in other species. A simple, steady-state evolutionary model reveals two sharply distinct classes of microbial genes, one of which (ORFans) is characterized by effectively instantaneous gene replacement, whereas the other consists of genes with finite, distributed replacement rates. These findings imply a conservative estimate of at least a billion distinct genes in the prokaryotic genomic universe.

  11. Plant regeneration from haploid cell suspension-derived protoplasts of Mediterranean rice (Oryza sativa L. cv. Miara).

    PubMed

    Guiderdoni, E; Chaïr, H

    1992-11-01

    More than 750 plants were regenerated from protoplasts isolated from microspore callus-derived cell suspensions of the Mediterranean japonica rice Miara, using a nurse-feeder technique and N6-based culture medium. The mean plating efficiency and the mean regeneration ability of the protocalluses were 0.5% and 49% respectively. Flow cytometric evaluation of the DNA contents of 7 month old-cell and protoplast suspensions showed that they were still haploid. Contrastingly, the DNA contents of leaf cell nuclei of the regenerated protoclones ranged from 1C to 5C including 60% 2C plants. This was consistent with the morphological type and the fertility of the mature plants. These results and the absence of chimeric plants suggest that polyploidization occurred during the early phase of protoplast culture.

  12. Determination of the melon chloroplast and mitochondrial genome sequences reveals that the largest reported mitochondrial genome in plants contains a significant amount of DNA having a nuclear origin

    PubMed Central

    2011-01-01

    Background The melon belongs to the Cucurbitaceae family, whose economic importance among vegetable crops is second only to Solanaceae. The melon has a small genome size (454 Mb), which makes it suitable for molecular and genetic studies. Despite similar nuclear and chloroplast genome sizes, cucurbits show great variation when their mitochondrial genomes are compared. The melon possesses the largest plant mitochondrial genome, as much as eight times larger than that of other cucurbits. Results The nucleotide sequences of the melon chloroplast and mitochondrial genomes were determined. The chloroplast genome (156,017 bp) included 132 genes, with 98 single-copy genes dispersed between the small (SSC) and large (LSC) single-copy regions and 17 duplicated genes in the inverted repeat regions (IRa and IRb). A comparison of the cucumber and melon chloroplast genomes showed differences in only approximately 5% of nucleotides, mainly due to short indels and SNPs. Additionally, 2.74 Mb of mitochondrial sequence, accounting for 95% of the estimated mitochondrial genome size, were assembled into five scaffolds and four additional unscaffolded contigs. An 84% of the mitochondrial genome is contained in a single scaffold. The gene-coding region accounted for 1.7% (45,926 bp) of the total sequence, including 51 protein-coding genes, 4 conserved ORFs, 3 rRNA genes and 24 tRNA genes. Despite the differences observed in the mitochondrial genome sizes of cucurbit species, Citrullus lanatus (379 kb), Cucurbita pepo (983 kb) and Cucumis melo (2,740 kb) share 120 kb of sequence, including the predicted protein-coding regions. Nevertheless, melon contained a high number of repetitive sequences and a high content of DNA of nuclear origin, which represented 42% and 47% of the total sequence, respectively. Conclusions Whereas the size and gene organisation of chloroplast genomes are similar among the cucurbit species, mitochondrial genomes show a wide variety of sizes, with a non

  13. CHPA, a Cysteine- and Histidine-Rich-Domain-Containing Protein, Contributes to Maintenance of the Diploid State in Aspergillus nidulans

    PubMed Central

    Sadanandom, Ari; Findlay, Kim; Doonan, John H.; Schulze-Lefert, Paul; Shirasu, Ken

    2004-01-01

    The alternation of eukaryotic life cycles between haploid and diploid phases is crucial for maintaining genetic diversity. In some organisms, the growth and development of haploid and diploid phases are nearly identical, and one might suppose that all genes required for one phase are likely to be critical for the other phase. Here, we show that targeted disruption of the chpA (cysteine- and histidine-rich-domain- [CHORD]-containing protein A) gene in haploid Aspergillus nidulans strains gives rise to chpA knockout haploids and heterozygous diploids but no chpA knockout diploids. A. nidulans chpA heterozygous diploids showed impaired conidiophore development and reduced conidiation. Deletion of chpA from diploid A. nidulans resulted in genome instability and reversion to a haploid state. Thus, our data suggest a vital role for chpA in maintenance of the diploid phase in A. nidulans. Furthermore, the human chpA homolog, Chp-1, was able to complement haploinsufficiency in A. nidulans chpA heterozygotes, suggesting that the function of CHORD-containing proteins is highly conserved in eukaryotes. PMID:15302831

  14. Transposable Element Genomic Fissuring in Pyrenophora teres Is Associated With Genome Expansion and Dynamics of Host–Pathogen Genetic Interactions

    PubMed Central

    Syme, Robert A.; Martin, Anke; Wyatt, Nathan A.; Lawrence, Julie A.; Muria-Gonzalez, Mariano J.; Friesen, Timothy L.; Ellwood, Simon R.

    2018-01-01

    Pyrenophora teres, P. teres f. teres (PTT) and P. teres f. maculata (PTM) cause significant diseases in barley, but little is known about the large-scale genomic differences that may distinguish the two forms. Comprehensive genome assemblies were constructed from long DNA reads, optical and genetic maps. As repeat masking in fungal genomes influences the final gene annotations, an accurate and reproducible pipeline was developed to ensure comparability between isolates. The genomes of the two forms are highly collinear, each composed of 12 chromosomes. Genome evolution in P. teres is characterized by genome fissuring through the insertion and expansion of transposable elements (TEs), a process that isolates blocks of genic sequence. The phenomenon is particularly pronounced in PTT, which has a larger, more repetitive genome than PTM and more recent transposon activity measured by the frequency and size of genome fissures. PTT has a longer cultivated host association and, notably, a greater range of host–pathogen genetic interactions compared to other Pyrenophora spp., a property which associates better with genome size than pathogen lifestyle. The two forms possess similar complements of TE families with Tc1/Mariner and LINE-like Tad-1 elements more abundant in PTT. Tad-1 was only detectable as vestigial fragments in PTM and, within the forms, differences in genome sizes and the presence and absence of several TE families indicated recent lineage invasions. Gene differences between P. teres forms are mainly associated with gene-sparse regions near or within TE-rich regions, with many genes possessing characteristics of fungal effectors. Instances of gene interruption by transposons resulting in pseudogenization were detected in PTT. In addition, both forms have a large complement of secondary metabolite gene clusters indicating significant capacity to produce an array of different molecules. This study provides genomic resources for functional genetics to help

  15. Novel Approaches to Breast Cancer Prevention and Inhibition of Metastases

    DTIC Science & Technology

    2014-10-01

    functional characterization of candidate breast cancer genes. The transgenic RNAi library is covering the whole Drosophila genome , giving us an...cancer prevention trials in BRCA1 carriers using RANKL blockade. Using Drosophila modeling of Ras-driven transformation, we performed a near- genome ... Genome wide functional genetics, haploid stem cells, Drosophila cancer modeling, breast cancer prevention, BRCA1 carriers 16. SECURITY

  16. Comparative genomics reveals insights into avian genome evolution and adaptation.

    PubMed

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M; Lee, Chul; Storz, Jay F; Antunes, Agostinho; Greenwold, Matthew J; Meredith, Robert W; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S; Gatesy, John; Hoffmann, Federico G; Opazo, Juan C; Håstad, Olle; Sawyer, Roger H; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A; Green, Richard E; O'Brien, Stephen J; Griffin, Darren; Johnson, Warren E; Haussler, David; Ryder, Oliver A; Willerslev, Eske; Graves, Gary R; Alström, Per; Fjeldså, Jon; Mindell, David P; Edwards, Scott V; Braun, Edward L; Rahbek, Carsten; Burt, David W; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D; Gilbert, M Thomas P; Wang, Jun

    2014-12-12

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. Copyright © 2014, American Association for the Advancement of Science.

  17. Genome size, cytogenetic data and transferability of EST-SSRs markers in wild and cultivated species of the genus Theobroma L. (Byttnerioideae, Malvaceae)

    PubMed Central

    da Silva, Rangeline Azevedo; Souza, Gustavo; Lemos, Lívia Santos Lima; Lopes, Uilson Vanderlei; Patrocínio, Nara Geórgia Ribeiro Braz; Alves, Rafael Moysés; Marcellino, Lucília Helena; Clement, Didier; Micheli, Fabienne

    2017-01-01

    The genus Theobroma comprises several trees species native to the Amazon. Theobroma cacao L. plays a key economic role mainly in the chocolate industry. Both cultivated and wild forms are described within the genus. Variations in genome size and chromosome number have been used for prediction purposes including the frequency of interspecific hybridization or inference about evolutionary relationships. In this study, the nuclear DNA content, karyotype and genetic diversity using functional microsatellites (EST-SSR) of seven Theobroma species were characterized. The nuclear content of DNA for all analyzed Theobroma species was 1C = ~ 0.46 pg. These species presented 2n = 20 with small chromosomes and only one pair of terminal heterochromatic bands positively stained (CMA+/DAPI− bands). The small size of Theobroma ssp. genomes was equivalent to other Byttnerioideae species, suggesting that the basal lineage of Malvaceae have smaller genomes and that there was an expansion of 2C values in the more specialized family clades. A set of 20 EST-SSR primers were characterized for related species of Theobroma, in which 12 loci were polymorphic. The polymorphism information content (PIC) ranged from 0.23 to 0.65, indicating a high level of information per locus. Combined results of flow cytometry, cytogenetic data and EST-SSRs markers will contribute to better describe the species and infer about the evolutionary relationships among Theobroma species. In addition, the importance of a core collection for conservation purposes is highlighted. PMID:28187131

  18. Genome size, cytogenetic data and transferability of EST-SSRs markers in wild and cultivated species of the genus Theobroma L. (Byttnerioideae, Malvaceae).

    PubMed

    da Silva, Rangeline Azevedo; Souza, Gustavo; Lemos, Lívia Santos Lima; Lopes, Uilson Vanderlei; Patrocínio, Nara Geórgia Ribeiro Braz; Alves, Rafael Moysés; Marcellino, Lucília Helena; Clement, Didier; Micheli, Fabienne; Gramacho, Karina Peres

    2017-01-01

    The genus Theobroma comprises several trees species native to the Amazon. Theobroma cacao L. plays a key economic role mainly in the chocolate industry. Both cultivated and wild forms are described within the genus. Variations in genome size and chromosome number have been used for prediction purposes including the frequency of interspecific hybridization or inference about evolutionary relationships. In this study, the nuclear DNA content, karyotype and genetic diversity using functional microsatellites (EST-SSR) of seven Theobroma species were characterized. The nuclear content of DNA for all analyzed Theobroma species was 1C = ~ 0.46 pg. These species presented 2n = 20 with small chromosomes and only one pair of terminal heterochromatic bands positively stained (CMA+/DAPI- bands). The small size of Theobroma ssp. genomes was equivalent to other Byttnerioideae species, suggesting that the basal lineage of Malvaceae have smaller genomes and that there was an expansion of 2C values in the more specialized family clades. A set of 20 EST-SSR primers were characterized for related species of Theobroma, in which 12 loci were polymorphic. The polymorphism information content (PIC) ranged from 0.23 to 0.65, indicating a high level of information per locus. Combined results of flow cytometry, cytogenetic data and EST-SSRs markers will contribute to better describe the species and infer about the evolutionary relationships among Theobroma species. In addition, the importance of a core collection for conservation purposes is highlighted.

  19. Genome surfing as driver of microbial genomic diversity

    USDA-ARS?s Scientific Manuscript database

    Historical changes in population size, such as those caused by demographic range expansions, can produce nonadaptive changes in genomic diversity through mechanisms such as gene surfing. We propose that demographic range expansion of a microbial population capable of horizontal gene exchange can res...

  20. Evolution and genome architecture in fungal plant pathogens.

    PubMed

    Möller, Mareike; Stukenbrock, Eva H

    2017-12-01

    The fungal kingdom comprises some of the most devastating plant pathogens. Sequencing the genomes of fungal pathogens has shown a remarkable variability in genome size and architecture. Population genomic data enable us to understand the mechanisms and the history of changes in genome size and adaptive evolution in plant pathogens. Although transposable elements predominantly have negative effects on their host, fungal pathogens provide prominent examples of advantageous associations between rapidly evolving transposable elements and virulence genes that cause variation in virulence phenotypes. By providing homogeneous environments at large regional scales, managed ecosystems, such as modern agriculture, can be conducive for the rapid evolution and dispersal of pathogens. In this Review, we summarize key examples from fungal plant pathogen genomics and discuss evolutionary processes in pathogenic fungi in the context of molecular evolution, population genomics and agriculture.

  1. A function accounting for training set size and marker density to model the average accuracy of genomic prediction.

    PubMed

    Erbe, Malena; Gredler, Birgit; Seefried, Franz Reinhold; Bapst, Beat; Simianer, Henner

    2013-01-01

    Prediction of genomic breeding values is of major practical relevance in dairy cattle breeding. Deterministic equations have been suggested to predict the accuracy of genomic breeding values in a given design which are based on training set size, reliability of phenotypes, and the number of independent chromosome segments ([Formula: see text]). The aim of our study was to find a general deterministic equation for the average accuracy of genomic breeding values that also accounts for marker density and can be fitted empirically. Two data sets of 5'698 Holstein Friesian bulls genotyped with 50 K SNPs and 1'332 Brown Swiss bulls genotyped with 50 K SNPs and imputed to ∼600 K SNPs were available. Different k-fold (k = 2-10, 15, 20) cross-validation scenarios (50 replicates, random assignment) were performed using a genomic BLUP approach. A maximum likelihood approach was used to estimate the parameters of different prediction equations. The highest likelihood was obtained when using a modified form of the deterministic equation of Daetwyler et al. (2010), augmented by a weighting factor (w) based on the assumption that the maximum achievable accuracy is [Formula: see text]. The proportion of genetic variance captured by the complete SNP sets ([Formula: see text]) was 0.76 to 0.82 for Holstein Friesian and 0.72 to 0.75 for Brown Swiss. When modifying the number of SNPs, w was found to be proportional to the log of the marker density up to a limit which is population and trait specific and was found to be reached with ∼20'000 SNPs in the Brown Swiss population studied.

  2. Draft genome of the Peruvian scallop Argopecten purpuratus.

    PubMed

    Li, Chao; Liu, Xiao; Liu, Bo; Ma, Bin; Liu, Fengqiao; Liu, Guilong; Shi, Qiong; Wang, Chunde

    2018-04-01

    The Peruvian scallop, Argopecten purpuratus, is mainly cultured in southern Chile and Peru was introduced into China in the last century. Unlike other Argopecten scallops, the Peruvian scallop normally has a long life span of up to 7 to 10 years. Therefore, researchers have been using it to develop hybrid vigor. Here, we performed whole genome sequencing, assembly, and gene annotation of the Peruvian scallop, with an important aim to develop genomic resources for genetic breeding in scallops. A total of 463.19-Gb raw DNA reads were sequenced. A draft genome assembly of 724.78 Mb was generated (accounting for 81.87% of the estimated genome size of 885.29 Mb), with a contig N50 size of 80.11 kb and a scaffold N50 size of 1.02 Mb. Repeat sequences were calculated to reach 33.74% of the whole genome, and 26,256 protein-coding genes and 3,057 noncoding RNAs were predicted from the assembly. We generated a high-quality draft genome assembly of the Peruvian scallop, which will provide a solid resource for further genetic breeding and for the analysis of the evolutionary history of this economically important scallop.

  3. Dynamic properties in the four-state haploid coupled discrete-time mutation-selection model with an infinite population limit

    NASA Astrophysics Data System (ADS)

    Lee, Kyu Sang; Gill, Wonpyong

    2017-11-01

    The dynamic properties, such as the crossing time and time-dependence of the relative density of the four-state haploid coupled discrete-time mutation-selection model, were calculated with the assumption that μ ij = μ ji , where μ ij denotes the mutation rate between the sequence elements, i and j. The crossing time for s = 0 and r 23 = r 42 = 1 in the four-state model became saturated at a large fitness parameter when r 12 > 1, was scaled as a power law in the fitness parameter when r 12 = 1, and diverged when the fitness parameter approached the critical fitness parameter when r 12 < 1, where r ij = μ ij / μ 14.

  4. Reduced representation approaches to interrogate genome diversity in large repetitive plant genomes.

    PubMed

    Hirsch, Cory D; Evans, Joseph; Buell, C Robin; Hirsch, Candice N

    2014-07-01

    Technology and software improvements in the last decade now provide methodologies to access the genome sequence of not only a single accession, but also multiple accessions of plant species. This provides a means to interrogate species diversity at the genome level. Ample diversity among accessions in a collection of species can be found, including single-nucleotide polymorphisms, insertions and deletions, copy number variation and presence/absence variation. For species with small, non-repetitive rich genomes, re-sequencing of query accessions is robust, highly informative, and economically feasible. However, for species with moderate to large sized repetitive-rich genomes, technical and economic barriers prevent en masse genome re-sequencing of accessions. Multiple approaches to access a focused subset of loci in species with larger genomes have been developed, including reduced representation sequencing, exome capture and transcriptome sequencing. Collectively, these approaches have enabled interrogation of diversity on a genome scale for large plant genomes, including crop species important to worldwide food security. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  5. The genomes and comparative genomics of Lactobacillus delbrueckii phages.

    PubMed

    Riipinen, Katja-Anneli; Forsman, Päivi; Alatossava, Tapani

    2011-07-01

    Lactobacillus delbrueckii phages are a great source of genetic diversity. Here, the genome sequences of Lb. delbrueckii phages LL-Ku, c5 and JCL1032 were analyzed in detail, and the genetic diversity of Lb. delbrueckii phages belonging to different taxonomic groups was explored. The lytic isometric group b phages LL-Ku (31,080 bp) and c5 (31,841 bp) showed a minimum nucleotide sequence identity of 90% over about three-fourths of their genomes. The genomic locations of their lysis modules were unique, and the genomes featured several putative overlapping transcription units of genes. LL-Ku and c5 virions displayed peptidoglycan hydrolytic activity associated with a ~36-kDa protein similar in size to the endolysin. Unexpectedly, the 49,433-bp genome of the prolate phage JCL1032 (temperate, group c) revealed a conserved gene order within its structural genes. Lb. delbrueckii phages representing groups a (a phage LL-H), b and c possessed only limited protein sequence homology. Genomic comparison of LL-Ku and c5 suggested that diversification of Lb. delbrueckii phages is mainly due to insertions, deletions and recombination. For the first time, the complete genome sequences of group b and c Lb. delbrueckii phages are reported.

  6. Between Two Fern Genomes

    PubMed Central

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  7. Genome fluctuations in cyanobacteria reflect evolutionary, developmental and adaptive traits.

    PubMed

    Larsson, John; Nylander, Johan Aa; Bergman, Birgitta

    2011-06-30

    Cyanobacteria belong to an ancient group of photosynthetic prokaryotes with pronounced variations in their cellular differentiation strategies, physiological capacities and choice of habitat. Sequencing efforts have shown that genomes within this phylum are equally diverse in terms of size and protein-coding capacity. To increase our understanding of genomic changes in the lineage, the genomes of 58 contemporary cyanobacteria were analysed for shared and unique orthologs. A total of 404 protein families, present in all cyanobacterial genomes, were identified. Two of these are unique to the phylum, corresponding to an AbrB family transcriptional regulator and a gene that escapes functional annotation although its genomic neighbourhood is conserved among the organisms examined. The evolution of cyanobacterial genome sizes involves a mix of gains and losses in the clade encompassing complex cyanobacteria, while a single event of reduction is evident in a clade dominated by unicellular cyanobacteria. Genome sizes and gene family copy numbers evolve at a higher rate in the former clade, and multi-copy genes were predominant in large genomes. Orthologs unique to cyanobacteria exhibiting specific characteristics, such as filament formation, heterocyst differentiation, diazotrophy and symbiotic competence, were also identified. An ancestral character reconstruction suggests that the most recent common ancestor of cyanobacteria had a genome size of approx. 4.5 Mbp and 1678 to 3291 protein-coding genes, 4%-6% of which are unique to cyanobacteria today. The different rates of genome-size evolution and multi-copy gene abundance suggest two routes of genome development in the history of cyanobacteria. The expansion strategy is driven by gene-family enlargment and generates a broad adaptive potential; while the genome streamlining strategy imposes adaptations to highly specific niches, also reflected in their different functional capacities. A few genomes display extreme

  8. Cell wall formation in zoospores of Allomyces arbuscula. II. Development of surface structure of encysted haploid zoospores, rhizoids, and hyphae.

    PubMed

    Kroh, M; Hendriks, H; Kirby, E G; Sassen, M M

    1976-08-01

    Development of haploid meiospores of Allomyces arbuscula into germling cells with rhizoids and hyphae was followed during incubation in complete growth medium. The surface structure of encysted meiospores, rhizoids and hyphae before and after extraction of amorphous materials with ethanolic KOH was studied by means of carbon-platinum replicas. After 2--3 min incubation in complete medium 10% of the meiospores were surrounded by a cell wall containing microfibrils embedded in a matrix. Structure of cell walls of encysted meiospores, rhizoids, and hyphae differ from one another by the location of amorphous materials and by the arrangement of chitin microfibrils.

  9. Analysis of phylogenetic relationships and genome size evolution of the Amaranthus genus using GBS indicates the ancestors of an ancient crop.

    PubMed

    Stetter, Markus G; Schmid, Karl J

    2017-04-01

    The genus Amaranthus consists of 50-70 species and harbors several cultivated and weedy species of great economic importance. A small number of suitable traits, phenotypic plasticity, gene flow and hybridization made it difficult to establish the taxonomy and phylogeny of the whole genus despite various studies using molecular markers. We inferred the phylogeny of the Amaranthus genus using genotyping by sequencing (GBS) of 94 genebank accessions representing 35 Amaranthus species and measured their genome sizes. SNPs were called by de novo and reference-based methods, for which we used the distant sugarbeet Beta vulgaris and the closely related Amaranthus hypochondriacus as references. SNP counts and proportions of missing data differed between methods, but the resulting phylogenetic trees were highly similar. A distance-based neighbor joining tree of individual accessions and a species tree calculated with the multispecies coalescent supported a previous taxonomic classification into three subgenera although the subgenus A. Acnida consists of two highly differentiated clades. The analysis of the Hybridus complex within the A. Amaranthus subgenus revealed insights on the history of cultivated grain amaranths. The complex includes the three cultivated grain amaranths and their wild relatives and was well separated from other species in the subgenus. Wild and cultivated amaranth accessions did not differentiate according to the species assignment but clustered by their geographic origin from South and Central America. Different geographically separated populations of Amaranthus hybridus appear to be the common ancestors of the three cultivated grain species and A. quitensis might be additionally be involved in the evolution of South American grain amaranth (A. caudatus). We also measured genome sizes of the species and observed little variation with the exception of two lineages that showed evidence for a recent polyploidization. With the exception of two lineages

  10. Interactions of photosynthesis with genome size and function.

    PubMed

    Raven, John A; Beardall, John; Larkum, Anthony W D; Sánchez-Baracaldo, Patricia

    2013-07-19

    Photolithotrophs are divided between those that use water as their electron donor (Cyanobacteria and the photosynthetic eukaryotes) and those that use a different electron donor (the anoxygenic photolithotrophs, all of them Bacteria). Photolithotrophs with the most reduced genomes have more genes than do the corresponding chemoorganotrophs, and the fastest-growing photolithotrophs have significantly lower specific growth rates than the fastest-growing chemoorganotrophs. Slower growth results from diversion of resources into the photosynthetic apparatus, which accounts for about half of the cell protein. There are inherent dangers in (especially oxygenic) photosynthesis, including the formation of reactive oxygen species (ROS) and blue light sensitivity of the water spitting apparatus. The extent to which photolithotrophs incur greater DNA damage and repair, and faster protein turnover with increased rRNA requirement, needs further investigation. A related source of environmental damage is ultraviolet B (UVB) radiation (280-320 nm), whose flux at the Earth's surface decreased as oxygen (and ozone) increased in the atmosphere. This oxygenation led to the requirements of defence against ROS, and decreasing availability to organisms of combined (non-dinitrogen) nitrogen and ferrous iron, and (indirectly) phosphorus, in the oxygenated biosphere. Differential codon usage in the genome and, especially, the proteome can lead to economies in the use of potentially growth-limiting elements.

  11. Genetic Diversity in the UV Sex Chromosomes of the Brown Alga Ectocarpus.

    PubMed

    Avia, Komlan; Lipinska, Agnieszka P; Mignerot, Laure; Montecinos, Alejandro E; Jamy, Mahwash; Ahmed, Sophia; Valero, Myriam; Peters, Akira F; Cock, J Mark; Roze, Denis; Coelho, Susana M

    2018-06-06

    Three types of sex chromosome system exist in nature: diploid XY and ZW systems and haploid UV systems. For many years, research has focused exclusively on XY and ZW systems, leaving UV chromosomes and haploid sex determination largely neglected. Here, we perform a detailed analysis of DNA sequence neutral diversity levels across the U and V sex chromosomes of the model brown alga Ectocarpus using a large population dataset. We show that the U and V non-recombining regions of the sex chromosomes (SDR) exhibit about half as much neutral diversity as the autosomes. This difference is consistent with the reduced effective population size of these regions compared with the rest of the genome, suggesting that the influence of additional factors such as background selection or selective sweeps is minimal. The pseudoautosomal region (PAR) of this UV system, in contrast, exhibited surprisingly high neutral diversity and there were several indications that genes in this region may be under balancing selection. The PAR of Ectocarpus is known to exhibit unusual genomic features and our results lay the foundation for further work aimed at understanding whether, and to what extent, these structural features underlie the high level of genetic diversity. Overall, this study fills a gap between available information on genetic diversity in XY/ZW systems and UV systems and significantly contributes to advancing our knowledge of the evolution of UV sex chromosomes.

  12. Transcriptional Regulation During Zygotic Genome Activation in Zebrafish and Other Anamniote Embryos.

    PubMed

    Wragg, J; Müller, F

    2016-01-01

    Embryo development commences with the fusion of two terminally differentiated haploid gametes into the totipotent fertilized egg, which through a series of major cellular and molecular transitions generate a pluripotent cell mass. The activation of the zygotic genome occurs during the so-called maternal to zygotic transition and prepares the embryo for zygotic takeover from maternal factors, in the control of the development of cellular lineages during differentiation. Recent advances in next generation sequencing technologies have allowed the dissection of the genomic and epigenomic processes mediating this transition. These processes include reorganization of the chromatin structure to a transcriptionally permissive state, changes in composition and function of structural and regulatory DNA-binding proteins, and changeover of the transcriptome as it is overhauled from that deposited by the mother in the oocyte to a zygotically transcribed complement. Zygotic genome activation in zebrafish occurs 10 cell cycles after fertilization and provides an ideal experimental platform for elucidating the temporal sequence and dynamics of establishment of a transcriptionally active chromatin state and helps in identifying the determinants of transcription activation at polymerase II transcribed gene promoters. The relatively large number of pluripotent cells generated by the fast cell divisions before zygotic transcription provides sufficient biomass for next generation sequencing technology approaches to establish the temporal dynamics of events and suggest causative relationship between them. However, genomic and genetic technologies need to be improved further to capture the earliest events in development, where cell number is a limiting factor. These technologies need to be complemented with precise, inducible genetic interference studies using the latest genome editing tools to reveal the function of candidate determinants and to confirm the predictions made by classic

  13. Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes.

    PubMed

    Haiminen, Niina; Feltus, F Alex; Parida, Laxmi

    2011-04-15

    We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies.

  14. Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes

    PubMed Central

    2011-01-01

    Background We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. Results The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. Conclusions BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies. PMID:21496274

  15. Comparison of Different Methods for Separation of Haploid Embryo Induced through Irradiated Pollen and Their Economic Analysis in Melon (Cucumis melo var. inodorus)

    PubMed Central

    Baktemur, Gökhan; Taşkın, Hatıra; Büyükalaca, Saadet

    2013-01-01

    Irradiated pollen technique is the most successful haploidization technique within Cucurbitaceae. After harvesting of fruits pollinated with irradiated pollen, classical method called as “inspecting the seeds one by one” is used to find haploid embryos in the seeds. In this study, different methods were used to extract the embryos more easily, quickly, economically, and effectively. “Inspecting the seeds one by one” was used as control treatment. Other four methods tested were “sowing seeds direct nutrient media,” “inspecting seeds in the light source,” “floating seeds on liquid media,” and “floating seeds on liquid media after surface sterilization.” Y2 and Y3 melon genotypes selected from the third backcross population of Yuva were used as plant material. Results of this study show that there is no statistically significant difference among methods “inspecting the seeds one by one,” “sowing seeds direct CP nutrient media,” and “inspecting seeds in the light source,” although the average number of embryos per fruit is slightly different. No embryo production was obtained from liquid culture because of infection. When considered together with labor costs and time required for embryo rescue, the best methods were “sowing seeds directly in the CP nutrient media“ and ”inspecting seeds in the light source.” PMID:23818825

  16. [Prospects for application of breakthrough technologies in breeding: The CRISPR/Cas9 system for plant genome editing].

    PubMed

    Khlestkina, E K; Shumny, V K

    2016-07-01

    Integration of the methods of contemporary genetics and biotechnology into the breeding process is assessed, and the potential role and efficacy of genome editing as a novel approach is discussed. Use of molecular (DNA) markers for breeding was proposed more than 30 years ago. Nowadays, they are widely used as an accessory tool in order to select plants by mono- and olygogenic traits. Presently, the genomic approaches are actively introduced into the breeding processes owing to automatization of DNA polymorphism analyses and development of comparatively cheap methods of DNA sequencing. These approaches provide effective selection by complex quantitative traits, and are based on the full-genome genotyping of the breeding material. Moreover, biotechnological tools, such as doubled haploids production, which provides fast obtainment of homozygotes, are widely used in plant breeding. Use of genomic and biotechnological approaches makes the development of varieties less time consuming. It also decreases the cultivated areas and financial expenditures required for accomplishment of the breeding process. However, the capacities of modern breeding are not limited to only these advantages. Experiments carried out on plants about 10 years ago provided the first data on genome editing. In the last two years, we have observed a sharp increase in the number of publications that report about successful experiments aimed at plant genome editing owing to the use of the relatively simple and convenient CRISPR/Cas9 system. The goal of some of these experiments was to modify agriculturally valuable genes of cultivated plants, such as potato, cabbage, tomato, maize, rice, wheat, barley, soybean and sorghum. These studies show that it is possible to obtain nontransgenic plants carrying stably inherited, specifically determined mutations using the CRISPR/Cas9 system. This possibility offers the challenge to obtain varieties with predetermined mono- and olygogenic traits.

  17. GAAP: Genome-organization-framework-Assisted Assembly Pipeline for prokaryotic genomes.

    PubMed

    Yuan, Lina; Yu, Yang; Zhu, Yanmin; Li, Yulai; Li, Changqing; Li, Rujiao; Ma, Qin; Siu, Gilman Kit-Hang; Yu, Jun; Jiang, Taijiao; Xiao, Jingfa; Kang, Yu

    2017-01-25

    Next-generation sequencing (NGS) technologies have greatly promoted the genomic study of prokaryotes. However, highly fragmented assemblies due to short reads from NGS are still a limiting factor in gaining insights into the genome biology. Reference-assisted tools are promising in genome assembly, but tend to result in false assembly when the assigned reference has extensive rearrangements. Herein, we present GAAP, a genome assembly pipeline for scaffolding based on core-gene-defined Genome Organizational Framework (cGOF) described in our previous study. Instead of assigning references, we use the multiple-reference-derived cGOFs as indexes to assist in order and orientation of the scaffolds and build a skeleton structure, and then use read pairs to extend scaffolds, called local scaffolding, and distinguish between true and chimeric adjacencies in the scaffolds. In our performance tests using both empirical and simulated data of 15 genomes in six species with diverse genome size, complexity, and all three categories of cGOFs, GAAP outcompetes or achieves comparable results when compared to three other reference-assisted programs, AlignGraph, Ragout and MeDuSa. GAAP uses both cGOF and pair-end reads to create assemblies in genomic scale, and performs better than the currently available reference-assisted assembly tools as it recovers more assemblies and makes fewer false locations, especially for species with extensive rearranged genomes. Our method is a promising solution for reconstruction of genome sequence from short reads of NGS.

  18. A reference genetic map of C. clementina hort. ex Tan.; citrus evolution inferences from comparative mapping

    PubMed Central

    2012-01-01

    Background Most modern citrus cultivars have an interspecific origin. As a foundational step towards deciphering the interspecific genome structures, a reference whole genome sequence was produced by the International Citrus Genome Consortium from a haploid derived from Clementine mandarin. The availability of a saturated genetic map of Clementine was identified as an essential prerequisite to assist the whole genome sequence assembly. Clementine is believed to be a ‘Mediterranean’ mandarin × sweet orange hybrid, and sweet orange likely arose from interspecific hybridizations between mandarin and pummelo gene pools. The primary goals of the present study were to establish a Clementine reference map using codominant markers, and to perform comparative mapping of pummelo, sweet orange, and Clementine. Results Five parental genetic maps were established from three segregating populations, which were genotyped with Single Nucleotide Polymorphism (SNP), Simple Sequence Repeats (SSR) and Insertion-Deletion (Indel) markers. An initial medium density reference map (961 markers for 1084.1 cM) of the Clementine was established by combining male and female Clementine segregation data. This Clementine map was compared with two pummelo maps and a sweet orange map. The linear order of markers was highly conserved in the different species. However, significant differences in map size were observed, which suggests a variation in the recombination rates. Skewed segregations were much higher in the male than female Clementine mapping data. The mapping data confirmed that Clementine arose from hybridization between ‘Mediterranean’ mandarin and sweet orange. The results identified nine recombination break points for the sweet orange gamete that contributed to the Clementine genome. Conclusions A reference genetic map of citrus, used to facilitate the chromosome assembly of the first citrus reference genome sequence, was established. The high conservation of marker order

  19. Genome expansion via lineage splitting and genome reduction in the cicada endosymbiont Hodgkinia.

    PubMed

    Campbell, Matthew A; Van Leuven, James T; Meister, Russell C; Carey, Kaitlin M; Simon, Chris; McCutcheon, John P

    2015-08-18

    Comparative genomics from mitochondria, plastids, and mutualistic endosymbiotic bacteria has shown that the stable establishment of a bacterium in a host cell results in genome reduction. Although many highly reduced genomes from endosymbiotic bacteria are stable in gene content and genome structure, organelle genomes are sometimes characterized by dramatic structural diversity. Previous results from Candidatus Hodgkinia cicadicola, an endosymbiont of cicadas, revealed that some lineages of this bacterium had split into two new cytologically distinct yet genetically interdependent species. It was hypothesized that the long life cycle of cicadas in part enabled this unusual lineage-splitting event. Here we test this hypothesis by investigating the structure of the Ca. Hodgkinia genome in one of the longest-lived cicadas, Magicicada tredecim. We show that the Ca. Hodgkinia genome from M. tredecim has fragmented into multiple new chromosomes or genomes, with at least some remaining partitioned into discrete cells. We also show that this lineage-splitting process has resulted in a complex of Ca. Hodgkinia genomes that are 1.1-Mb pairs in length when considered together, an almost 10-fold increase in size from the hypothetical single-genome ancestor. These results parallel some examples of genome fragmentation and expansion in organelles, although the mechanisms that give rise to these extreme genome instabilities are likely different.

  20. Insights into conifer giga-genomes.

    PubMed

    De La Torre, Amanda R; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K; Jansson, Stefan; Jones, Steven J M; Keeling, Christopher I; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

    2014-12-01

    Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world's forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20-30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. © 2014 American Society of Plant Biologists. All Rights Reserved.

  1. Efficient inference of population size histories and locus-specific mutation rates from large-sample genomic variation data.

    PubMed

    Bhaskar, Anand; Wang, Y X Rachel; Song, Yun S

    2015-02-01

    With the recent increase in study sample sizes in human genetics, there has been growing interest in inferring historical population demography from genomic variation data. Here, we present an efficient inference method that can scale up to very large samples, with tens or hundreds of thousands of individuals. Specifically, by utilizing analytic results on the expected frequency spectrum under the coalescent and by leveraging the technique of automatic differentiation, which allows us to compute gradients exactly, we develop a very efficient algorithm to infer piecewise-exponential models of the historical effective population size from the distribution of sample allele frequencies. Our method is orders of magnitude faster than previous demographic inference methods based on the frequency spectrum. In addition to inferring demography, our method can also accurately estimate locus-specific mutation rates. We perform extensive validation of our method on simulated data and show that it can accurately infer multiple recent epochs of rapid exponential growth, a signal that is difficult to pick up with small sample sizes. Lastly, we use our method to analyze data from recent sequencing studies, including a large-sample exome-sequencing data set of tens of thousands of individuals assayed at a few hundred genic regions. © 2015 Bhaskar et al.; Published by Cold Spring Harbor Laboratory Press.

  2. Detection of genomic rearrangements in cucumber using genomecmp software

    NASA Astrophysics Data System (ADS)

    Kulawik, Maciej; Pawełkowicz, Magdalena Ewa; Wojcieszek, Michał; PlÄ der, Wojciech; Nowak, Robert M.

    2017-08-01

    Comparative genomic by increasing information about the genomes sequences available in the databases is a rapidly evolving science. A simple comparison of the general features of genomes such as genome size, number of genes, and chromosome number presents an entry point into comparative genomic analysis. Here we present the utility of the new tool genomecmp for finding rearrangements across the compared sequences and applications in plant comparative genomics.

  3. Genome fluctuations in cyanobacteria reflect evolutionary, developmental and adaptive traits

    PubMed Central

    2011-01-01

    Background Cyanobacteria belong to an ancient group of photosynthetic prokaryotes with pronounced variations in their cellular differentiation strategies, physiological capacities and choice of habitat. Sequencing efforts have shown that genomes within this phylum are equally diverse in terms of size and protein-coding capacity. To increase our understanding of genomic changes in the lineage, the genomes of 58 contemporary cyanobacteria were analysed for shared and unique orthologs. Results A total of 404 protein families, present in all cyanobacterial genomes, were identified. Two of these are unique to the phylum, corresponding to an AbrB family transcriptional regulator and a gene that escapes functional annotation although its genomic neighbourhood is conserved among the organisms examined. The evolution of cyanobacterial genome sizes involves a mix of gains and losses in the clade encompassing complex cyanobacteria, while a single event of reduction is evident in a clade dominated by unicellular cyanobacteria. Genome sizes and gene family copy numbers evolve at a higher rate in the former clade, and multi-copy genes were predominant in large genomes. Orthologs unique to cyanobacteria exhibiting specific characteristics, such as filament formation, heterocyst differentiation, diazotrophy and symbiotic competence, were also identified. An ancestral character reconstruction suggests that the most recent common ancestor of cyanobacteria had a genome size of approx. 4.5 Mbp and 1678 to 3291 protein-coding genes, 4%-6% of which are unique to cyanobacteria today. Conclusions The different rates of genome-size evolution and multi-copy gene abundance suggest two routes of genome development in the history of cyanobacteria. The expansion strategy is driven by gene-family enlargment and generates a broad adaptive potential; while the genome streamlining strategy imposes adaptations to highly specific niches, also reflected in their different functional capacities. A few

  4. Modeling evolution of spatially distributed bacterial communities: a simulation with the haploid evolutionary constructor

    PubMed Central

    2015-01-01

    Background Multiscale approaches for integrating submodels of various levels of biological organization into a single model became the major tool of systems biology. In this paper, we have constructed and simulated a set of multiscale models of spatially distributed microbial communities and study an influence of unevenly distributed environmental factors on the genetic diversity and evolution of the community members. Results Haploid Evolutionary Constructor software http://evol-constructor.bionet.nsc.ru/ was expanded by adding the tool for the spatial modeling of a microbial community (1D, 2D and 3D versions). A set of the models of spatially distributed communities was built to demonstrate that the spatial distribution of cells affects both intensity of selection and evolution rate. Conclusion In spatially heterogeneous communities, the change in the direction of the environmental flow might be reflected in local irregular population dynamics, while the genetic structure of populations (frequencies of the alleles) remains stable. Furthermore, in spatially heterogeneous communities, the chemotaxis might dramatically affect the evolution of community members. PMID:25708911

  5. PGSB/MIPS Plant Genome Information Resources and Concepts for the Analysis of Complex Grass Genomes.

    PubMed

    Spannagl, Manuel; Bader, Kai; Pfeifer, Matthias; Nussbaumer, Thomas; Mayer, Klaus F X

    2016-01-01

    PGSB (Plant Genome and Systems Biology; formerly MIPS-Munich Institute for Protein Sequences) has been involved in developing, implementing and maintaining plant genome databases for more than a decade. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable datasets for model plant genomes as a backbone against which experimental data, e.g., from high-throughput functional genomics, can be organized and analyzed. In addition, genomes from both model and crop plants form a scaffold for comparative genomics, assisted by specialized tools such as the CrowsNest viewer to explore conserved gene order (synteny) between related species on macro- and micro-levels.The genomes of many economically important Triticeae plants such as wheat, barley, and rye present a great challenge for sequence assembly and bioinformatic analysis due to their enormous complexity and large genome size. Novel concepts and strategies have been developed to deal with these difficulties and have been applied to the genomes of wheat, barley, rye, and other cereals. This includes the GenomeZipper concept, reference-guided exome assembly, and "chromosome genomics" based on flow cytometry sorted chromosomes.

  6. Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence

    PubMed Central

    2011-01-01

    Background Many plants have large and complex genomes with an abundance of repeated sequences. Many plants are also polyploid. Both of these attributes typify the genome architecture in the tribe Triticeae, whose members include economically important wheat, rye and barley. Large genome sizes, an abundance of repeated sequences, and polyploidy present challenges to genome-wide SNP discovery using next-generation sequencing (NGS) of total genomic DNA by making alignment and clustering of short reads generated by the NGS platforms difficult, particularly in the absence of a reference genome sequence. Results An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions from repetitive sequences and sequences shared by paralogous genes. Multiple genome equivalents of shotgun reads of another genotype generated with SOLiD or Solexa are then mapped to the annotated Roche 454 reads to identify putative SNPs. A pipeline program package, AGSNP, was developed and used for genome-wide SNP discovery in Aegilops tauschii-the diploid source of the wheat D genome, and with a genome size of 4.02 Gb, of which 90% is repetitive sequences. Genomic DNA of Ae. tauschii accession AL8/78 was sequenced with the Roche 454 NGS platform. Genomic DNA and cDNA of Ae. tauschii accession AS75 was sequenced primarily with SOLiD, although some Solexa and Roche 454 genomic sequences were also generated. A total of 195,631 putative SNPs were discovered in gene sequences, 155,580 putative SNPs were discovered in uncharacterized single-copy regions, and another 145,907 putative SNPs were discovered in repeat junctions. These SNPs were dispersed across the entire Ae. tauschii genome. To assess the false positive SNP discovery rate, DNA containing putative SNPs was

  7. Genome-Wide Association Analyses Highlight the Potential for Different Genetic Mechanisms for Litter Size Among Sheep Breeds

    PubMed Central

    Xu, Song-Song; Gao, Lei; Xie, Xing-Long; Ren, Yan-Ling; Shen, Zhi-Qiang; Wang, Feng; Shen, Min; Eyϸórsdóttir, Emma; Hallsson, Jón H.; Kiseleva, Tatyana; Kantanen, Juha; Li, Meng-Hua

    2018-01-01

    Reproduction is an important trait in sheep breeding as well as in other livestock. However, despite its importance the genetic mechanisms of litter size in domestic sheep (Ovis aries) are still poorly understood. To explore genetic mechanisms underlying the variation in litter size, we conducted multiple independent genome-wide association studies in five sheep breeds of high prolificacy (Wadi, Hu, Icelandic, Finnsheep, and Romanov) and one low prolificacy (Texel) using the Ovine Infinium HD BeadChip, respectively. We identified different sets of candidate genes associated with litter size in different breeds: BMPR1B, FBN1, and MMP2 in Wadi; GRIA2, SMAD1, and CTNNB1 in Hu; NCOA1 in Icelandic; INHBB, NF1, FLT1, PTGS2, and PLCB3 in Finnsheep; ESR2 in Romanov and ESR1, GHR, ETS1, MMP15, FLI1, and SPP1 in Texel. Further annotation of genes and bioinformatics analyses revealed that different biological pathways could be involved in the variation in litter size of females: hormone secretion (FSH and LH) in Wadi and Hu, placenta and embryonic lethality in Icelandic, folliculogenesis and LH signaling in Finnsheep, ovulation and preovulatory follicle maturation in Romanov, and estrogen and follicular growth in Texel. Taken together, our results provide new insights into the genetic mechanisms underlying the prolificacy trait in sheep and other mammals, suggesting targets for selection where the aim is to increase prolificacy in breeding projects. PMID:29692799

  8. Insights into Conifer Giga-Genomes1

    PubMed Central

    De La Torre, Amanda R.; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K.; Jansson, Stefan; Jones, Steven J.M.; Keeling, Christopher I.; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

    2014-01-01

    Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world’s forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20–30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. PMID:25349325

  9. Mitochondrial DNA repairs double-strand breaks in yeast chromosomes.

    PubMed

    Ricchetti, M; Fairhead, C; Dujon, B

    1999-11-04

    The endosymbiotic theory for the origin of eukaryotic cells proposes that genetic information can be transferred from mitochondria to the nucleus of a cell, and genes that are probably of mitochondrial origin have been found in nuclear chromosomes. Occasionally, short or rearranged sequences homologous to mitochondrial DNA are seen in the chromosomes of different organisms including yeast, plants and humans. Here we report a mechanism by which fragments of mitochondrial DNA, in single or tandem array, are transferred to yeast chromosomes under natural conditions during the repair of double-strand breaks in haploid mitotic cells. These repair insertions originate from noncontiguous regions of the mitochondrial genome. Our analysis of the Saccharomyces cerevisiae mitochondrial genome indicates that the yeast nuclear genome does indeed contain several short sequences of mitochondrial origin which are similar in size and composition to those that repair double-strand breaks. These sequences are located predominantly in non-coding regions of the chromosomes, frequently in the vicinity of retrotransposon long terminal repeats, and appear as recent integration events. Thus, colonization of the yeast genome by mitochondrial DNA is an ongoing process.

  10. A post-assembly genome-improvement toolkit (PAGIT) to obtain annotated genomes from contigs.

    PubMed

    Swain, Martin T; Tsai, Isheng J; Assefa, Samual A; Newbold, Chris; Berriman, Matthew; Otto, Thomas D

    2012-06-07

    Genome projects now produce draft assemblies within weeks owing to advanced high-throughput sequencing technologies. For milestone projects such as Escherichia coli or Homo sapiens, teams of scientists were employed to manually curate and finish these genomes to a high standard. Nowadays, this is not feasible for most projects, and the quality of genomes is generally of a much lower standard. This protocol describes software (PAGIT) that is used to improve the quality of draft genomes. It offers flexible functionality to close gaps in scaffolds, correct base errors in the consensus sequence and exploit reference genomes (if available) in order to improve scaffolding and generating annotations. The protocol is most accessible for bacterial and small eukaryotic genomes (up to 300 Mb), such as pathogenic bacteria, malaria and parasitic worms. Applying PAGIT to an E. coli assembly takes ∼24 h: it doubles the average contig size and annotates over 4,300 gene models.

  11. Development of genome- and transcriptome-derived microsatellites in related species of snapping shrimps with highly duplicated genomes.

    PubMed

    Gaynor, Kaitlyn M; Solomon, Joseph W; Siller, Stefanie; Jessell, Linnet; Duffy, J Emmett; Rubenstein, Dustin R

    2017-11-01

    Molecular markers are powerful tools for studying patterns of relatedness and parentage within populations and for making inferences about social evolution. However, the development of molecular markers for simultaneous study of multiple species presents challenges, particularly when species exhibit genome duplication or polyploidy. We developed microsatellite markers for Synalpheus shrimp, a genus in which species exhibit not only great variation in social organization, but also interspecific variation in genome size and partial genome duplication. From the four primary clades within Synalpheus, we identified microsatellites in the genomes of four species and in the consensus transcriptome of two species. Ultimately, we designed and tested primers for 143 microsatellite markers across 25 species. Although the majority of markers were disomic, many markers were polysomic for certain species. Surprisingly, we found no relationship between genome size and the number of polysomic markers. As expected, markers developed for a given species amplified better for closely related species than for more distant relatives. Finally, the markers developed from the transcriptome were more likely to work successfully and to be disomic than those developed from the genome, suggesting that consensus transcriptomes are likely to be conserved across species. Our findings suggest that the transcriptome, particularly consensus sequences from multiple species, can be a valuable source of molecular markers for taxa with complex, duplicated genomes. © 2017 John Wiley & Sons Ltd.

  12. Inter- and intra-specific pan-genomes of Borrelia burgdorferi sensu lato: genome stability and adaptive radiation

    PubMed Central

    2013-01-01

    Background Lyme disease is caused by spirochete bacteria from the Borrelia burgdorferi sensu lato (B. burgdorferi s.l.) species complex. To reconstruct the evolution of B. burgdorferi s.l. and identify the genomic basis of its human virulence, we compared the genomes of 23 B. burgdorferi s.l. isolates from Europe and the United States, including B. burgdorferi sensu stricto (B. burgdorferi s.s., 14 isolates), B. afzelii (2), B. garinii (2), B. “bavariensis” (1), B. spielmanii (1), B. valaisiana (1), B. bissettii (1), and B. “finlandensis” (1). Results Robust B. burgdorferi s.s. and B. burgdorferi s.l. phylogenies were obtained using genome-wide single-nucleotide polymorphisms, despite recombination. Phylogeny-based pan-genome analysis showed that the rate of gene acquisition was higher between species than within species, suggesting adaptive speciation. Strong positive natural selection drives the sequence evolution of lipoproteins, including chromosomally-encoded genes 0102 and 0404, cp26-encoded ospC and b08, and lp54-encoded dbpA, a07, a22, a33, a53, a65. Computer simulations predicted rapid adaptive radiation of genomic groups as population size increases. Conclusions Intra- and inter-specific pan-genome sizes of B. burgdorferi s.l. expand linearly with phylogenetic diversity. Yet gene-acquisition rates in B. burgdorferi s.l. are among the lowest in bacterial pathogens, resulting in high genome stability and few lineage-specific genes. Genome adaptation of B. burgdorferi s.l. is driven predominantly by copy-number and sequence variations of lipoprotein genes. New genomic groups are likely to emerge if the current trend of B. burgdorferi s.l. population expansion continues. PMID:24112474

  13. Similar Ratios of Introns to Intergenic Sequence across Animal Genomes

    PubMed Central

    Wörheide, Gert

    2017-01-01

    Abstract One central goal of genome biology is to understand how the usage of the genome differs between organisms. Our knowledge of genome composition, needed for downstream inferences, is critically dependent on gene annotations, yet problems associated with gene annotation and assembly errors are usually ignored in comparative genomics. Here, we analyze the genomes of 68 species across 12 animal phyla and some single-cell eukaryotes for general trends in genome composition and transcription, taking into account problems of gene annotation. We show that, regardless of genome size, the ratio of introns to intergenic sequence is comparable across essentially all animals, with nearly all deviations dominated by increased intergenic sequence. Genomes of model organisms have ratios much closer to 1:1, suggesting that the majority of published genomes of nonmodel organisms are underannotated and consequently omit substantial numbers of genes, with likely negative impact on evolutionary interpretations. Finally, our results also indicate that most animals transcribe half or more of their genomes arguing against differences in genome usage between animal groups, and also suggesting that the transcribed portion is more dependent on genome size than previously thought. PMID:28633296

  14. X. couchianus and X. hellerii genome models provide genomic variation insight among Xiphophorus species.

    PubMed

    Shen, Yingjia; Chalopin, Domitille; Garcia, Tzintzuni; Boswell, Mikki; Boswell, William; Shiryev, Sergey A; Agarwala, Richa; Volff, Jean-Nicolas; Postlethwait, John H; Schartl, Manfred; Minx, Patrick; Warren, Wesley C; Walter, Ronald B

    2016-01-07

    Xiphophorus fishes are represented by 26 live-bearing species of tropical fish that express many attributes (e.g., viviparity, genetic and phenotypic variation, ecological adaptation, varied sexual developmental mechanisms, ability to produce fertile interspecies hybrids) that have made attractive research models for over 85 years. Use of various interspecies hybrids to investigate the genetics underlying spontaneous and induced tumorigenesis has resulted in the development and maintenance of pedigreed Xiphophorus lines specifically bred for research. The recent availability of the X. maculatus reference genome assembly now provides unprecedented opportunities for novel and exciting comparative research studies among Xiphophorus species. We present sequencing, assembly and annotation of two new genomes representing Xiphophorus couchianus and Xiphophorus hellerii. The final X. couchianus and X. hellerii assemblies have total sizes of 708 Mb and 734 Mb and correspond to 98 % and 102 % of the X. maculatus Jp 163 A genome size, respectively. The rates of single nucleotide change range from 1 per 52 bp to 1 per 69 bp among the three genomes and the impact of putatively damaging variants are presented. In addition, a survey of transposable elements allowed us to deduce an ancestral TE landscape, uncovered potential active TEs and document a recent burst of TEs during evolution of this genus. Two new Xiphophorus genomes and their corresponding transcriptomes were efficiently assembled, the former using a novel guided assembly approach. Three assembled genome sequences within this single vertebrate order of new world live-bearing fishes will accelerate our understanding of relationship between environmental adaptation and genome evolution. In addition, these genome resources provide capability to determine allele specific gene regulation among interspecies hybrids produced by crossing any of the three species that are known to produce progeny predisposed to tumor

  15. Profiling of gene duplication patterns of sequenced teleost genomes: evidence for rapid lineage-specific genome expansion mediated by recent tandem duplications.

    PubMed

    Lu, Jianguo; Peatman, Eric; Tang, Haibao; Lewis, Joshua; Liu, Zhanjiang

    2012-06-15

    Gene duplication has had a major impact on genome evolution. Localized (or tandem) duplication resulting from unequal crossing over and whole genome duplication are believed to be the two dominant mechanisms contributing to vertebrate genome evolution. While much scrutiny has been directed toward discerning patterns indicative of whole-genome duplication events in teleost species, less attention has been paid to the continuous nature of gene duplications and their impact on the size, gene content, functional diversity, and overall architecture of teleost genomes. Here, using a Markov clustering algorithm directed approach we catalogue and analyze patterns of gene duplication in the four model teleost species with chromosomal coordinates: zebrafish, medaka, stickleback, and Tetraodon. Our analyses based on set size, duplication type, synonymous substitution rate (Ks), and gene ontology emphasize shared and lineage-specific patterns of genome evolution via gene duplication. Most strikingly, our analyses highlight the extraordinary duplication and retention rate of recent duplicates in zebrafish and their likely role in the structural and functional expansion of the zebrafish genome. We find that the zebrafish genome is remarkable in its large number of duplicated genes, small duplicate set size, biased Ks distribution toward minimal mutational divergence, and proportion of tandem and intra-chromosomal duplicates when compared with the other teleost model genomes. The observed gene duplication patterns have played significant roles in shaping the architecture of teleost genomes and appear to have contributed to the recent functional diversification and divergence of important physiological processes in zebrafish. We have analyzed gene duplication patterns and duplication types among the available teleost genomes and found that a large number of genes were tandemly and intrachromosomally duplicated, suggesting their origin of independent and continuous duplication

  16. Whole genomic DNA sequencing and comparative genomic analysis of Arthrospira platensis: high genome plasticity and genetic diversity

    PubMed Central

    Xu, Teng; Qin, Song; Hu, Yongwu; Song, Zhijian; Ying, Jianchao; Li, Peizhen; Dong, Wei; Zhao, Fangqing; Yang, Huanming; Bao, Qiyu

    2016-01-01

    Arthrospira platensis is a multi-cellular and filamentous non-N2-fixing cyanobacterium that is capable of performing oxygenic photosynthesis. In this study, we determined the nearly complete genome sequence of A. platensis YZ. A. platensis YZ genome is a single, circular chromosome of 6.62 Mb in size. Phylogenetic and comparative genomic analyses revealed that A. platensis YZ was more closely related to A. platensis NIES-39 than Arthrospira sp. PCC 8005 and A. platensis C1. Broad gene gains were identified between A. platensis YZ and three other Arthrospira speices, some of which have been previously demonstrated that can be laterally transferred among different species, such as restriction-modification systems-coding genes. Moreover, unprecedented extensive chromosomal rearrangements among different strains were observed. The chromosomal rearrangements, particularly the chromosomal inversions, were analysed and estimated to be closely related to palindromes that involved long inverted repeat sequences and the extensively distributed type IIR restriction enzyme in the Arthrospira genome. In addition, species from genus Arthrospira unanimously contained the highest rate of repetitive sequence compared with the other species of order Oscillatoriales, suggested that sequence duplication significantly contributed to Arthrospira genome phylogeny. These results provided in-depth views into the genomic phylogeny and structural variation of A. platensis, as well as provide a valuable resource for functional genomics studies. PMID:27330141

  17. FISH-mapping of the 5S rDNA locus in chili peppers (Capsicum-Solanaceae).

    PubMed

    Aguilera, Patricia M; Debat, Humberto J; Scaldaferro, Marisel A; Martí, Dardo A; Grabiele, Mauro

    2016-03-01

    We present here the physical mapping of the 5S rDNA locus in six wild and five cultivated taxa of Capsicum by means of a genus-specific FISH probe. In all taxa, a single 5S locus per haploid genome that persistently mapped onto the short arm of a unique metacentric chromosome pair at intercalar position, was found. 5S FISH signals of almost the same size and brightness intensity were observed in all the analyzed taxa. This is the first cytological characterization of the 5S in wild taxa of Capsicum by using a genus-derived probe, and the most exhaustive and comprehensive in the chili peppers up to now. The information provided here will aid the cytomolecular characterization of pepper germplasm to evaluate variability and can be instrumental to integrate physical, genetic and genomic maps already generated in the genus.

  18. Genome-wide linkage disequilibrium and past effective population size in three Korean cattle breeds.

    PubMed

    Sudrajad, P; Seo, D W; Choi, T J; Park, B H; Roh, S H; Jung, W Y; Lee, S S; Lee, J H; Kim, S; Lee, S H

    2017-02-01

    The routine collection and use of genomic data are useful for effectively managing breeding programs for endangered populations. Linkage disequilibrium (LD) using high-density DNA markers has been widely used to determine population structures and predict the genomic regions that are associated with economic traits in beef cattle. The extent of LD also provides information about historical events, including past effective population size (N e ), and it allows inferences on the genetic diversity of breeds. The objective of this study was to estimate the LD and N e in three Korean cattle breeds that are genetically similar but have different coat colors (Brown, Brindle and Jeju Black Hanwoo). Brindle and Jeju Black are endangered breeds with small populations, whereas Brown Hanwoo is the main breeding population in Korea. DNA samples from these cattle breeds were genotyped using the Illumina BovineSNP50 Bead Chip. We examined 13 cattle breeds, including European taurines, African taurines and indicines, and hybrids to compare their LD values. Brown Hanwoo consistently had the lowest mean LD compared to Jeju Black, Brindle and the other 13 cattle breeds (0.13, 0.19, 0.21 and 0.15-0.22 respectively). The high LD values of Brindle and Jeju Black contributed to small N e values (53 and 60 respectively), which were distinct from that of Brown Hanwoo (531) for 11 generations ago. The differences in LD and N e for each breed reflect the breeding strategy applied. The N e for these endangered cattle breeds remain low; thus, effort is needed to bring them back to a sustainable tract. © 2016 Stichting International Foundation for Animal Genetics.

  19. Microdiversification in genome-streamlined ubiquitous freshwater Actinobacteria.

    PubMed

    Neuenschwander, Stefan M; Ghai, Rohit; Pernthaler, Jakob; Salcher, Michaela M

    2018-01-01

    Actinobacteria of the acI lineage are the most abundant microbes in freshwater systems, but there are so far no pure living cultures of these organisms, possibly because of metabolic dependencies on other microbes. This, in turn, has hampered an in-depth assessment of the genomic basis for their success in the environment. Here we present genomes from 16 axenic cultures of acI Actinobacteria. The isolates were not only of minute cell size, but also among the most streamlined free-living microbes, with extremely small genome sizes (1.2-1.4 Mbp) and low genomic GC content. Genome reduction in these bacteria might have led to auxotrophy for various vitamins, amino acids and reduced sulphur sources, thus creating dependencies to co-occurring organisms (the 'Black Queen' hypothesis). Genome analyses, moreover, revealed a surprising degree of inter- and intraspecific diversity in metabolic pathways, especially of carbohydrate transport and metabolism, and mainly encoded in genomic islands. The striking genotype microdiversification of acI Actinobacteria might explain their global success in highly dynamic freshwater environments with complex seasonal patterns of allochthonous and autochthonous carbon sources. We propose a new order within Actinobacteria ('Candidatus Nanopelagicales') with two new genera ('Candidatus Nanopelagicus' and 'Candidatus Planktophila') and nine new species.

  20. GenomicTools: a computational platform for developing high-throughput analytics in genomics.

    PubMed

    Tsirigos, Aristotelis; Haiminen, Niina; Bilal, Erhan; Utro, Filippo

    2012-01-15

    Recent advances in sequencing technology have resulted in the dramatic increase of sequencing data, which, in turn, requires efficient management of computational resources, such as computing time, memory requirements as well as prototyping of computational pipelines. We present GenomicTools, a flexible computational platform, comprising both a command-line set of tools and a C++ API, for the analysis and manipulation of high-throughput sequencing data such as DNA-seq, RNA-seq, ChIP-seq and MethylC-seq. GenomicTools implements a variety of mathematical operations between sets of genomic regions thereby enabling the prototyping of computational pipelines that can address a wide spectrum of tasks ranging from pre-processing and quality control to meta-analyses. Additionally, the GenomicTools platform is designed to analyze large datasets of any size by minimizing memory requirements. In practical applications, where comparable, GenomicTools outperforms existing tools in terms of both time and memory usage. The GenomicTools platform (version 2.0.0) was implemented in C++. The source code, documentation, user manual, example datasets and scripts are available online at http://code.google.com/p/ibm-cbc-genomic-tools.

  1. Efficiency of multi-breed genomic selection for dairy cattle breeds with different sizes of reference population.

    PubMed

    Hozé, C; Fritz, S; Phocas, F; Boichard, D; Ducrocq, V; Croiseau, P

    2014-01-01

    for 6 traits and using the different prediction approaches. Compared with pedigree-based BLUP, the average gain in accuracy with GS in small populations was 0.057 for the single-breed and 0.086 for multi-breed approach. This gain was up to 0.193 and 0.209, respectively, with the large reference population. Improvement of EBV prediction due to the multi-breed evaluation was higher for animals not closely related to the reference population. In the case of a breed with a small reference population size, the increase in correlation due to multi-breed GS was 0.141 for bulls without their sire in reference population compared with 0.016 for bulls with their sire in reference population. These results demonstrate that multi-breed GS can contribute to increase genomic evaluation accuracy in small breeds. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  2. Draft genome sequence of an aflatoxigenic Aspergillus species, A. bombycis

    USDA-ARS?s Scientific Manuscript database

    The genome of the A. bombycis Type strain was sequenced using a Personal Genome Machine, followed by annotation of its predicted genes. The genome size for A. bombycis was found to be approximately 37 Mb and contained 12,266 genes. This announcement introduces a sequenced genome for an aflatoxigenic...

  3. Human centromere genomics: now it's personal.

    PubMed

    Hayden, Karen E

    2012-07-01

    Advances in human genomics have accelerated studies in evolution, disease, and cellular regulation. However, centromere sequences, defining the chromosomal interface with spindle microtubules, remain largely absent from ongoing genomic studies and disconnected from functional, genome-wide analyses. This disparity results from the challenge of predicting the linear order of multi-megabase-sized regions that are composed almost entirely of near-identical satellite DNA. Acknowledging these challenges, the field of human centromere genomics possesses the potential to rapidly advance given the availability of individual, or personalized, genome projects matched with the promise of long-read sequencing technologies. Here I review the current genomic model of human centromeres in consideration of those studies involving functional datasets that examine the role of sequence in centromere identity.

  4. EUPAN enables pan-genome studies of a large number of eukaryotic genomes.

    PubMed

    Hu, Zhiqiang; Sun, Chen; Lu, Kuang-Chen; Chu, Xixia; Zhao, Yue; Lu, Jinyuan; Shi, Jianxin; Wei, Chaochun

    2017-08-01

    Pan-genome analyses are routinely carried out for bacteria to interpret the within-species gene presence/absence variations (PAVs). However, pan-genome analyses are rare for eukaryotes due to the large sizes and higher complexities of their genomes. Here we proposed EUPAN, a eukaryotic pan-genome analysis toolkit, enabling automatic large-scale eukaryotic pan-genome analyses and detection of gene PAVs at a relatively low sequencing depth. In the previous studies, we demonstrated the effectiveness and high accuracy of EUPAN in the pan-genome analysis of 453 rice genomes, in which we also revealed widespread gene PAVs among individual rice genomes. Moreover, EUPAN can be directly applied to the current re-sequencing projects primarily focusing on single nucleotide polymorphisms. EUPAN is implemented in Perl, R and C ++. It is supported under Linux and preferred for a computer cluster with LSF and SLURM job scheduling system. EUPAN together with its standard operating procedure (SOP) is freely available for non-commercial use (CC BY-NC 4.0) at http://cgm.sjtu.edu.cn/eupan/index.html . ccwei@sjtu.edu.cn or jianxin.shi@sjtu.edu.cn. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  5. Potent L-lactic acid assimilation of the fermentative and heterothallic haploid yeast Saccharomyces cerevisiae NAM34-4C.

    PubMed

    Tomitaka, Masataka; Taguchi, Hisataka; Matsuoka, Masayoshi; Morimura, Shigeru; Kida, Kenji; Akamatsu, Takashi

    2014-01-01

    We screened an industrial thermotolerant Saccharomyces cerevisiae strain, KF7, as a potent lactic-acid-assimilating yeast. Heterothallic haploid strains KF7-5C and KF7-4B were obtained from the tetrads of the homothallic yeast strain KF7. The inefficient sporulation and poor spore viability of the haploid strains were improved by two strategies. The first strategy was as follows: (i) the KF7-5C was crossed with the laboratory strain SH6710; (ii) the progenies were backcrossed with KF7-5C three times; and (iii) the progenies were inbred three times to maintain a genetic background close to that of KF7. The NAM12 diploid between the cross of the resultant two strains, NAM11-9C and NAM11-13A, showed efficient sporulation and exhibited excellent growth in YPD medium (pH 3.5) at 35°C with 1.4-h generation time, indicating thermotolerance and acid tolerance. The second strategy was successive intrastrain crosses. The resultant two strains, KFG4-6B and KFG4-4B, showed excellent mating capacity. A spontaneous mutant of KFG4-6B, KFG4-6BD, showed a high growth rate with a generation time of 1.1 h in YPD medium (pH 3.0) at 35°C. The KFG4-6BD strain produced ascospores, which were crossed with NAM11-2C and its progeny to produce tetrads. These tetrads were crossed with KFG4-4B to produce NAM26-14A and NAM26-15A. The latter strain had a generation time of 1.6 h at 35°C in pH 2.5, thus exhibiting further thermotolerance and acid tolerance. A progeny from a cross of NAM26-14A and NAM26-15A yielded the strain NAM34-4C, which showed potent lactic acid assimilation and high transformation efficiency, better than those of a standard laboratory strain. Copyright © 2013 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  6. Body lice and head lice (Anoplura: Pediculidae) have the smallest genomes of any hemimetabolous insect reported to date.

    PubMed

    Johnston, J Spencer; Yoon, Kyong Sup; Strycharz, Joseph P; Pittendrigh, Barry R; Clark, J Marshall

    2007-11-01

    The human body louse, Pediculus humanus humanus L. (Anoplura: Pediculidae), is a vector of several diseases, including louse-borne epidemic typhus, relapsing fever, and trench fever, whereas the head louse, Pediculus humanus capitis De Geer (Anoplura: Pediculidae), is more a pest of social concern. Sequencing of the body louse genome has recently been proposed and undertaken by National Human Genome Research Institute. One of the first steps in understanding an organism's genome is to determine its genome size. Here, using flow cytometry determinations, we present evidence that body louse genome size is 104.7 +/- 1.4 Mb for females and 108.3 +/- 1.1 Mb for males. Our results suggest that head lice also have a small genome size, of similar size to the body louse. Thus, Pediculus lice have one of the smallest genome sizes known in insects, suggesting it may be a suitable choice as a minimal hemimetabolous genome.

  7. Quantitative microscopy uncovers ploidy changes during mitosis in live Drosophila embryos and their effect on nuclear size.

    PubMed

    Puah, Wee Choo; Chinta, Rambabu; Wasser, Martin

    2017-03-15

    Time-lapse microscopy is a powerful tool to investigate cellular and developmental dynamics. In Drosophila melanogaster , it can be used to study division cycles in embryogenesis. To obtain quantitative information from 3D time-lapse data and track proliferating nuclei from the syncytial stage until gastrulation, we developed an image analysis pipeline consisting of nuclear segmentation, tracking, annotation and quantification. Image analysis of maternal-haploid ( mh ) embryos revealed that a fraction of haploid syncytial nuclei fused to give rise to nuclei of higher ploidy (2n, 3n, 4n). Moreover, nuclear densities in mh embryos at the mid-blastula transition varied over threefold. By tracking synchronized nuclei of different karyotypes side-by-side, we show that DNA content determines nuclear growth rate and size in early interphase, while the nuclear to cytoplasmic ratio constrains nuclear growth during late interphase. mh encodes the Drosophila ortholog of human Spartan, a protein involved in DNA damage tolerance. To explore the link between mh and chromosome instability, we fluorescently tagged Mh protein to study its subcellular localization. We show Mh-mKO2 localizes to nuclear speckles that increase in numbers as nuclei expand in interphase. In summary, quantitative microscopy can provide new insights into well-studied genes and biological processes. © 2017. Published by The Company of Biologists Ltd.

  8. A gene family for acidic ribosomal proteins in Schizosaccharomyces pombe: two essential and two nonessential genes.

    PubMed Central

    Beltrame, M; Bianchi, M E

    1990-01-01

    We have cloned the genes for small acidic ribosomal proteins (A-proteins) of the fission yeast Schizosaccharomyces pombe. S. pombe contains four transcribed genes for small A-proteins per haploid genome, as is the case for Saccharomyces cerevisiae. In contrast, multicellular eucaryotes contain two transcribed genes per haploid genome. The four proteins of S. pombe, besides sharing a high overall similarity, form two couples of nearly identical sequences. Their corresponding genes have a very conserved structure and are transcribed to a similar level. Surprisingly, of each couple of genes coding for nearly identical proteins, one is essential for cell growth, whereas the other is not. We suggest that the unequal importance of the four small A-proteins for cell survival is related to their physical organization in 60S ribosomal subunits. Images PMID:2325655

  9. The Chromosomal Constitution of Embryos Arising from Monopronuclear Oocytes in Programmes of Assisted Reproduction

    PubMed Central

    2014-01-01

    The assessment of oocytes showing only one pronucleus during assisted reproduction is associated with uncertainty. A compilation of data on the genetic constitution of different developmental stages shows that affected oocytes are able to develop into haploid, diploid, and mosaic embryos with more or less complex chromosomal compositions. In the majority of cases (~80%), haploidy appears to be caused by gynogenesis, whereas parthenogenesis or androgenesis is less common. Most of the diploid embryos result from a fertilization event involving asynchronous formation of the two pronuclei or pronuclear fusion at a very early stage. Uniparental diploidy may sometimes occur if one pronucleus fails to develop and the other pronucleus already contains a diploid genome or alternatively a haploid genome undergoes endoreduplication. In general, the chance of obtaining a biparental diploid embryo appears higher after conventional in vitro fertilization than after intracytoplasmic sperm injection. If a transfer of embryos obtained from monopronuclear oocytes is envisaged, it should be tried to culture them up to the blastocyst since most haploid embryos are not able to reach this stage. Comprehensive counselling of patients on potential risks is advisable before transfer and a preimplantation genetic diagnosis could be offered if available. PMID:25763399

  10. Minimal-assumption inference from population-genomic data

    NASA Astrophysics Data System (ADS)

    Weissman, Daniel; Hallatschek, Oskar

    Samples of multiple complete genome sequences contain vast amounts of information about the evolutionary history of populations, much of it in the associations among polymorphisms at different loci. Current methods that take advantage of this linkage information rely on models of recombination and coalescence, limiting the sample sizes and populations that they can analyze. We introduce a method, Minimal-Assumption Genomic Inference of Coalescence (MAGIC), that reconstructs key features of the evolutionary history, including the distribution of coalescence times, by integrating information across genomic length scales without using an explicit model of recombination, demography or selection. Using simulated data, we show that MAGIC's performance is comparable to PSMC' on single diploid samples generated with standard coalescent and recombination models. More importantly, MAGIC can also analyze arbitrarily large samples and is robust to changes in the coalescent and recombination processes. Using MAGIC, we show that the inferred coalescence time histories of samples of multiple human genomes exhibit inconsistencies with a description in terms of an effective population size based on single-genome data.

  11. Pan-Genomic Analysis Provides Insights into the Genomic Variation and Evolution of Salmonella Paratyphi A

    PubMed Central

    Chen, Chunxia; Cui, Xiaoying; Yu, Jun; Xiao, Jingfa; Kan, Biao

    2012-01-01

    Salmonella Paratyphi A (S. Paratyphi A) is a highly adapted, human-specific pathogen that causes paratyphoid fever. Cases of paratyphoid fever have recently been increasing, and the disease is becoming a major public health concern, especially in Eastern and Southern Asia. To investigate the genomic variation and evolution of S. Paratyphi A, a pan-genomic analysis was performed on five newly sequenced S. Paratyphi A strains and two other reference strains. A whole genome comparison revealed that the seven genomes are collinear and that their organization is highly conserved. The high rate of substitutions in part of the core genome indicates that there are frequent homologous recombination events. Based on the changes in the pan-genome size and cluster number (both in the core functional genes and core pseudogenes), it can be inferred that the sharply increasing number of pseudogene clusters may have strong correlation with the inactivation of functional genes, and indicates that the S. Paratyphi A genome is being degraded. PMID:23028950

  12. Joint scaling laws in functional and evolutionary categories in prokaryotic genomes

    PubMed Central

    Grilli, J.; Bassetti, B.; Maslov, S.; Cosentino Lagomarsino, M.

    2012-01-01

    We propose and study a class-expansion/innovation/loss model of genome evolution taking into account biological roles of genes and their constituent domains. In our model, numbers of genes in different functional categories are coupled to each other. For example, an increase in the number of metabolic enzymes in a genome is usually accompanied by addition of new transcription factors regulating these enzymes. Such coupling can be thought of as a proportional ‘recipe’ for genome composition of the type ‘a spoonful of sugar for each egg yolk’. The model jointly reproduces two known empirical laws: the distribution of family sizes and the non-linear scaling of the number of genes in certain functional categories (e.g. transcription factors) with genome size. In addition, it allows us to derive a novel relation between the exponents characterizing these two scaling laws, establishing a direct quantitative connection between evolutionary and functional categories. It predicts that functional categories that grow faster-than-linearly with genome size to be characterized by flatter-than-average family size distributions. This relation is confirmed by our bioinformatics analysis of prokaryotic genomes. This proves that the joint quantitative trends of functional and evolutionary classes can be understood in terms of evolutionary growth with proportional recipes. PMID:21937509

  13. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration

    PubMed Central

    Thorvaldsdóttir, Helga; Mesirov, Jill P.

    2013-01-01

    Data visualization is an essential component of genomic data analysis. However, the size and diversity of the data sets produced by today’s sequencing and array-based profiling methods present major challenges to visualization tools. The Integrative Genomics Viewer (IGV) is a high-performance viewer that efficiently handles large heterogeneous data sets, while providing a smooth and intuitive user experience at all levels of genome resolution. A key characteristic of IGV is its focus on the integrative nature of genomic studies, with support for both array-based and next-generation sequencing data, and the integration of clinical and phenotypic data. Although IGV is often used to view genomic data from public sources, its primary emphasis is to support researchers who wish to visualize and explore their own data sets or those from colleagues. To that end, IGV supports flexible loading of local and remote data sets, and is optimized to provide high-performance data visualization and exploration on standard desktop systems. IGV is freely available for download from http://www.broadinstitute.org/igv, under a GNU LGPL open-source license. PMID:22517427

  14. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration.

    PubMed

    Thorvaldsdóttir, Helga; Robinson, James T; Mesirov, Jill P

    2013-03-01

    Data visualization is an essential component of genomic data analysis. However, the size and diversity of the data sets produced by today's sequencing and array-based profiling methods present major challenges to visualization tools. The Integrative Genomics Viewer (IGV) is a high-performance viewer that efficiently handles large heterogeneous data sets, while providing a smooth and intuitive user experience at all levels of genome resolution. A key characteristic of IGV is its focus on the integrative nature of genomic studies, with support for both array-based and next-generation sequencing data, and the integration of clinical and phenotypic data. Although IGV is often used to view genomic data from public sources, its primary emphasis is to support researchers who wish to visualize and explore their own data sets or those from colleagues. To that end, IGV supports flexible loading of local and remote data sets, and is optimized to provide high-performance data visualization and exploration on standard desktop systems. IGV is freely available for download from http://www.broadinstitute.org/igv, under a GNU LGPL open-source license.

  15. Intranuclear DNA density affects chromosome condensation in metazoans

    PubMed Central

    Hara, Yuki; Iwabuchi, Mari; Ohsumi, Keita; Kimura, Akatsuki

    2013-01-01

    Chromosome condensation is critical for accurate inheritance of genetic information. The degree of condensation, which is reflected in the size of the condensed chromosomes during mitosis, is not constant. It is differentially regulated in embryonic and somatic cells. In addition to the developmentally programmed regulation of chromosome condensation, there may be adaptive regulation based on spatial parameters such as genomic length or cell size. We propose that chromosome condensation is affected by a spatial parameter called the chromosome amount per nuclear space, or “intranuclear DNA density.” Using Caenorhabditis elegans embryos, we show that condensed chromosome sizes vary during early embryogenesis. Of importance, changing DNA content to haploid or polyploid changes the condensed chromosome size, even at the same developmental stage. Condensed chromosome size correlates with interphase nuclear size. Finally, a reduction in nuclear size in a cell-free system from Xenopus laevis eggs resulted in reduced condensed chromosome sizes. These data support the hypothesis that intranuclear DNA density regulates chromosome condensation. This suggests an adaptive mode of chromosome condensation regulation in metazoans. PMID:23783035

  16. Comparative genomics of Lactobacillus

    PubMed Central

    Kant, Ravi; Blom, Jochen; Palva, Airi; Siezen, Roland J.; de Vos, Willem M.

    2011-01-01

    Summary The genus Lactobacillus includes a diverse group of bacteria consisting of many species that are associated with fermentations of plants, meat or milk. In addition, various lactobacilli are natural inhabitants of the intestinal tract of humans and other animals. Finally, several Lactobacillus strains are marketed as probiotics as their consumption can confer a health benefit to host. Presently, 154 Lactobacillus species are known and a growing fraction of these are subject to draft genome sequencing. However, complete genome sequences are needed to provide a platform for detailed genomic comparisons. Therefore, we selected a total of 20 genomes of various Lactobacillus strains for which complete genomic sequences have been reported. These genomes had sizes varying from 1.8 to 3.3 Mb and other characteristic features, such as G+C content that ranged from 33% to 51%. The Lactobacillus pan genome was found to consist of approximately 14 000 protein‐encoding genes while all 20 genomes shared a total of 383 sets of orthologous genes that defined the Lactobacillus core genome (LCG). Based on advanced phylogeny of the proteins encoded by this LCG, we grouped the 20 strains into three main groups and defined core group genes present in all genomes of a single group, signature group genes shared in all genomes of one group but absent in all other Lactobacillus genomes, and Group‐specific ORFans present in core group genes of one group and absent in all other complete genomes. The latter are of specific value in defining the different groups of genomes. The study provides a platform for present individual comparisons as well as future analysis of new Lactobacillus genomes. PMID:21375712

  17. Similar Ratios of Introns to Intergenic Sequence across Animal Genomes.

    PubMed

    Francis, Warren R; Wörheide, Gert

    2017-06-01

    One central goal of genome biology is to understand how the usage of the genome differs between organisms. Our knowledge of genome composition, needed for downstream inferences, is critically dependent on gene annotations, yet problems associated with gene annotation and assembly errors are usually ignored in comparative genomics. Here, we analyze the genomes of 68 species across 12 animal phyla and some single-cell eukaryotes for general trends in genome composition and transcription, taking into account problems of gene annotation. We show that, regardless of genome size, the ratio of introns to intergenic sequence is comparable across essentially all animals, with nearly all deviations dominated by increased intergenic sequence. Genomes of model organisms have ratios much closer to 1:1, suggesting that the majority of published genomes of nonmodel organisms are underannotated and consequently omit substantial numbers of genes, with likely negative impact on evolutionary interpretations. Finally, our results also indicate that most animals transcribe half or more of their genomes arguing against differences in genome usage between animal groups, and also suggesting that the transcribed portion is more dependent on genome size than previously thought. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  18. Comparative genomics among Saccharomyces cerevisiae × Saccharomyces kudriavzevii natural hybrid strains isolated from wine and beer reveals different origins

    PubMed Central

    2012-01-01

    Background Interspecific hybrids between S. cerevisiae × S. kudriavzevii have frequently been detected in wine and beer fermentations. Significant physiological differences among parental and hybrid strains under different stress conditions have been evidenced. In this study, we used comparative genome hybridization analysis to evaluate the genome composition of different S. cerevisiae × S. kudriavzevii natural hybrids isolated from wine and beer fermentations to infer their evolutionary origins and to figure out the potential role of common S. kudriavzevii gene fraction present in these hybrids. Results Comparative genomic hybridization (CGH) and ploidy analyses carried out in this study confirmed the presence of individual and differential chromosomal composition patterns for most S. cerevisiae × S. kudriavzevii hybrids from beer and wine. All hybrids share a common set of depleted S. cerevisiae genes, which also are depleted or absent in the wine strains studied so far, and the presence a common set of S. kudriavzevii genes, which may be associated with their capability to grow at low temperatures. Finally, a maximum parsimony analysis of chromosomal rearrangement events, occurred in the hybrid genomes, indicated the presence of two main groups of wine hybrids and different divergent lineages of brewing strains. Conclusion Our data suggest that wine and beer S. cerevisiae × S. kudriavzevii hybrids have been originated by different rare-mating events involving a diploid wine S. cerevisiae and a haploid or diploid European S. kudriavzevii strains. Hybrids maintain several S. kudriavzevii genes involved in cold adaptation as well as those related to S. kudriavzevii mitochondrial functions. PMID:22906207

  19. Performances of Different Fragment Sizes for Reduced Representation Bisulfite Sequencing in Pigs.

    PubMed

    Yuan, Xiao-Long; Zhang, Zhe; Pan, Rong-Yang; Gao, Ning; Deng, Xi; Li, Bin; Zhang, Hao; Sangild, Per Torp; Li, Jia-Qi

    2017-01-01

    Reduced representation bisulfite sequencing (RRBS) has been widely used to profile genome-scale DNA methylation in mammalian genomes. However, the applications and technical performances of RRBS with different fragment sizes have not been systematically reported in pigs, which serve as one of the important biomedical models for humans. The aims of this study were to evaluate capacities of RRBS libraries with different fragment sizes to characterize the porcine genome. We found that the Msp I-digested segments between 40 and 220 bp harbored a high distribution peak at 74 bp, which were highly overlapped with the repetitive elements and might reduce the unique mapping alignment. The RRBS library of 110-220 bp fragment size had the highest unique mapping alignment and the lowest multiple alignment. The cost-effectiveness of the 40-110 bp, 110-220 bp and 40-220 bp fragment sizes might decrease when the dataset size was more than 70, 50 and 110 million reads for these three fragment sizes, respectively. Given a 50-million dataset size, the average sequencing depth of the detected CpG sites in the 110-220 bp fragment size appeared to be deeper than in the 40-110 bp and 40-220 bp fragment sizes, and these detected CpG sties differently located in gene- and CpG island-related regions. In this study, our results demonstrated that selections of fragment sizes could affect the numbers and sequencing depth of detected CpG sites as well as the cost-efficiency. No single solution of RRBS is optimal in all circumstances for investigating genome-scale DNA methylation. This work provides the useful knowledge on designing and executing RRBS for investigating the genome-wide DNA methylation in tissues from pigs.

  20. The Drosophila genome nexus: a population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population.

    PubMed

    Lack, Justin B; Cardeno, Charis M; Crepeau, Marc W; Taylor, William; Corbett-Detig, Russell B; Stevens, Kristian A; Langley, Charles H; Pool, John E

    2015-04-01

    Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets. Copyright © 2015 by the Genetics Society of America.