Genome-wide association analysis identifies a meningioma risk locus at 11p15.5.
Claus, Elizabeth B; Cornish, Alex J; Broderick, Peter; Schildkraut, Joellen M; Dobbins, Sara E; Holroyd, Amy; Calvocoressi, Lisa; Lu, Lingeng; Hansen, Helen M; Smirnov, Ivan; Walsh, Kyle M; Schramm, Johannes; Hoffmann, Per; Nöthen, Markus M; Jöckel, Karl-Heinz; Swerdlow, Anthony; Larsen, Signe Benzon; Johansen, Christoffer; Simon, Matthias; Bondy, Melissa; Wrensch, Margaret; Houlston, Richard; Wiemels, Joseph L
2018-05-12
Meningiomas are adult brain tumors originating in the meningeal coverings of the brain and spinal cord, with a significant heritable basis. Genome-wide association studies (GWAS) have previously identified only a single risk locus for meningioma, at 10p12.31. To identify a susceptibility locus for meningioma, we conducted a meta-analysis of two GWAS, imputed using a merged reference panel of 1000 Genomes and UK10K data, with validation in two independent sample series totaling 2,138 cases and 12,081 controls. We identified a new susceptibility locus for meningioma at 11p15.5 (rs2686876, odds ratio = 1.44, P = 9.86 × 10⁻⁹). A number of genes localize to the region of linkage disequilibrium encompassing rs2686876, including RIC8A, which plays a central role in the development of neural crest-derived structures, such as the meninges. This finding advances our understanding of the genetic basis of meningioma development and provides additional support for a polygenic model of meningioma.
Gardiner, Laura-Jayne; Gawroński, Piotr; Olohan, Lisa; Schnurbusch, Thorsten; Hall, Neil; Hall, Anthony
2014-12-01
Mapping-by-sequencing analyses have largely required a complete reference sequence and employed whole genome re-sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re-sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early-flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene-rich regions of hexaploid bread wheat to design a 110-Mbp NimbleGen SeqCap EZ in solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo-chromosomes derived from the capture probe target sequence, with a long-range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval. © 2014 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Francisco, Jessica N. C.; Nazareno, Alison G.; Lohmann, Lúcia G.
2016-01-01
Premise of the study: In this study, we developed chloroplast microsatellite markers (cpSSRs) for Pachyptera kerere (Bignoniaceae) to investigate the population structure and genetic diversity of this species. Methods and Results: We used Illumina HiSeq data to reconstruct the chloroplast genome of P. kerere by a combination of de novo and reference-guided assembly. We then used the chloroplast genome to develop a set of cpSSRs from intergenic regions. Overall, 24 primer pairs were designed, 21 of which amplified successfully and were polymorphic, presenting three to nine alleles per locus. The unbiased haploid diversity per locus varied from 0.207 (Pac28) to 0.817 (Pac04). All but one locus amplified for all other taxa of Pachyptera. Conclusions: The markers reported here will serve as a basis for studies to assess the genetic structure and phylogeographic history of Pachyptera. PMID:27672522
USDA-ARS's Scientific Manuscript database
Next generation sequencing offers new ways to identify the genetic mechanisms that underlie mutant phenotypes. The release of a reference diploid Gossypium raimondii (D5) genome and bioinformatics tools to sort tetraploid reads into subgenomes has brought cotton genetic mapping into the genomics er...
Battlay, Paul; Schmidt, Joshua M; Fournier-Level, Alexandre; Robin, Charles
2016-08-09
Scans of the Drosophila melanogaster genome have identified organophosphate resistance loci among those with the most pronounced signature of positive selection. In this study, the molecular basis of resistance to the organophosphate insecticide azinphos-methyl was investigated using the Drosophila Genetic Reference Panel, and genome-wide association. Recently released full transcriptome data were used to extend the utility of the Drosophila Genetic Reference Panel resource beyond traditional genome-wide association studies to allow systems genetics analyses of phenotypes. We found that both genomic and transcriptomic associations independently identified Cyp6g1, a gene involved in resistance to DDT and neonicotinoid insecticides, as the top candidate for azinphos-methyl resistance. This was verified by transgenically overexpressing Cyp6g1 using natural regulatory elements from a resistant allele, resulting in a 6.5-fold increase in resistance. We also identified four novel candidate genes associated with azinphos-methyl resistance, all of which are involved in either regulation of fat storage, or nervous system development. In Cyp6g1, we find a demonstrable resistance locus, a verification that transcriptome data can be used to identify variants associated with insecticide resistance, and an overlap between peaks of a genome-wide association study, and a genome-wide selective sweep analysis. Copyright © 2016 Battlay et al.
Gardiner, Laura-Jayne; Bansept-Basler, Pauline; Olohan, Lisa; Joynson, Ryan; Brenchley, Rachel; Hall, Neil; O'Sullivan, Donal M; Hall, Anthony
2016-08-01
Previously we extended the utility of mapping-by-sequencing by combining it with sequence capture and mapping sequence data to pseudo-chromosomes that were organized using wheat-Brachypodium synteny. This, with a bespoke haplotyping algorithm, enabled us to map the flowering time locus in the diploid wheat Triticum monococcum L., identifying a set of deleted genes (Gardiner et al., 2014). Here, we develop this combination of gene enrichment and sliding window mapping-by-synteny analysis to map the Yr6 locus for yellow stripe rust resistance in hexaploid wheat. A 110 Mb NimbleGen capture probe set was used to enrich and sequence a doubled haploid mapping population of hexaploid wheat derived from an Avalon and Cadenza cross. The Yr6 locus was identified by mapping to the POPSEQ chromosomal pseudomolecules using a bespoke pipeline and algorithm (Chapman et al., 2015). Furthermore, the same locus was identified using newly developed pseudo-chromosome sequences as a mapping reference that are based on the genic sequence used for sequence enrichment. The pseudo-chromosomes allow us to demonstrate the application of mapping-by-sequencing to even poorly defined polyploid genomes where chromosomes are incomplete and sub-genome assemblies are collapsed. This analysis uniquely enabled us to: compare wheat genome annotations; identify the Yr6 locus - defining a smaller genic region than was previously possible; associate the interval with one wheat sub-genome and increase the density of SNP markers associated. Finally, we built the pipeline in iPlant, making it a user-friendly community resource for phenotype mapping. © 2016 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
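The sliding-window idea mentioned in this abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline or haplotyping algorithm: the function names, the window and step sizes, and the toy allele frequencies are all hypothetical. The sketch assumes each SNP in a pooled sample carries a mutant-parent allele frequency, and that the candidate interval is the window with the highest mean frequency.

```python
from typing import List, Tuple

Window = Tuple[int, int, float]  # (start, end, mean allele frequency)


def sliding_window_scores(
    snps: List[Tuple[int, float]], window: int, step: int
) -> List[Window]:
    """Average allele frequency in overlapping windows along a pseudo-chromosome.

    snps: (position, allele_frequency) pairs, assumed sorted by position.
    Windows that contain no SNPs are skipped.
    """
    if not snps:
        return []
    results: List[Window] = []
    last_pos = snps[-1][0]
    start = 0
    while start <= last_pos:
        end = start + window
        freqs = [freq for pos, freq in snps if start <= pos < end]
        if freqs:
            results.append((start, end, sum(freqs) / len(freqs)))
        start += step
    return results


def best_window(scores: List[Window]) -> Window:
    """Return the window with the highest mean allele frequency."""
    return max(scores, key=lambda w: w[2])


# Hypothetical toy data: frequencies rise towards a linked region near 2-3 kb.
toy_snps = [
    (100, 0.52), (900, 0.55), (1800, 0.81), (2300, 1.0),
    (2700, 1.0), (3100, 0.9), (3600, 0.78), (4500, 0.5),
]
scores = sliding_window_scores(toy_snps, window=1000, step=500)
peak = best_window(scores)
print(peak)
```

A real analysis would also need to handle sequencing depth, genotype error, and the ordering uncertainty of synteny-based pseudo-chromosomes, none of which this sketch models.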
O'Toole, Ronan F; Gautam, Sanjay S
2017-10-01
The genome sequence of Mycobacterium tuberculosis strain H37Rv is an important and valuable reference point in the study of M. tuberculosis phylogeny, molecular epidemiology, and drug-resistance mutations. However, it is becoming apparent that use of H37Rv as a sole reference genome in analysing clinical isolates presents some limitations to fully investigating M. tuberculosis virulence. Here, we examine the presence of single locus variants and the absence of entire genes in H37Rv with respect to strains that are responsible for cases and outbreaks of tuberculosis. We discuss how these polymorphisms may affect phenotypic properties of H37Rv including pathogenicity. Based on our observations and those of other researchers, we propose that use of a single reference genome, H37Rv, is not sufficient for the detection and characterisation of M. tuberculosis virulence-related loci. We recommend incorporation of genome sequences of other reference strains, in particular, direct clinical isolates, in such analyses in addition to H37Rv. Copyright © 2017 Elsevier Inc. All rights reserved.
Reference genome sequence of the model plant Setaria.
Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao; Percifield, Ryan; Hawkins, Jennifer; Pontaroli, Ana C; Estep, Matt; Feng, Liang; Vaughn, Justin N; Grimwood, Jane; Jenkins, Jerry; Barry, Kerrie; Lindquist, Erika; Hellsten, Uffe; Deshpande, Shweta; Wang, Xuewen; Wu, Xiaomei; Mitros, Therese; Triplett, Jimmy; Yang, Xiaohan; Ye, Chu-Yu; Mauro-Herrera, Margarita; Wang, Lin; Li, Pinghua; Sharma, Manoj; Sharma, Rita; Ronald, Pamela C; Panaud, Olivier; Kellogg, Elizabeth A; Brutnell, Thomas P; Doust, Andrew N; Tuskan, Gerald A; Rokhsar, Daniel; Devos, Katrien M
2012-05-13
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Davis, G L; McMullen, M D; Baysdorfer, C; Musket, T; Grant, D; Staebell, M; Xu, G; Polacco, M; Koster, L; Melia-Hancock, S; Houchins, K; Chao, S; Coe, E H
1999-01-01
We have constructed a 1736-locus maize genome map containing 1156 loci probed by cDNAs, 545 probed by random genomic clones, 16 by simple sequence repeats (SSRs), 14 by isozymes, and 5 by anonymous clones. Sequence information is available for 56% of the loci with 66% of the sequenced loci assigned functions. A total of 596 new ESTs were mapped from a B73 library of 5-wk-old shoots. The map contains 237 loci probed by barley, oat, wheat, rice, or tripsacum clones, which serve as grass genome reference points in comparisons between maize and other grass maps. Ninety core markers selected for low copy number, high polymorphism, and even spacing along the chromosome delineate the 100 bins on the map. The average bin size is 17 cM. Use of bin assignments enables comparison among different maize mapping populations and experiments including those involving cytogenetic stocks, mutants, or quantitative trait loci. Integration of nonmaize markers in the map extends the resources available for gene discovery beyond the boundaries of maize mapping information into the expanse of map, sequence, and phenotype information from other grass species. This map provides a foundation for numerous basic and applied investigations including studies of gene organization, gene and genome evolution, targeted cloning, and dissection of complex traits. PMID:10388831
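The figures in this abstract can be cross-checked with a few lines of Python. The sketch below only uses numbers stated in the abstract; the variable names are illustrative, and the derived values (approximate map length and locus counts from percentages) are rough estimates because the published percentages and bin size are rounded.

```python
# Locus counts by probe class, as stated in the abstract.
locus_classes = {
    "cDNA": 1156,
    "random genomic clone": 545,
    "SSR": 16,
    "isozyme": 14,
    "anonymous clone": 5,
}

# The classes should add up to the 1736 loci named in the map title.
total_loci = sum(locus_classes.values())

# 100 bins at an average of 17 cM each implies a map of roughly 1700 cM.
n_bins = 100
avg_bin_cm = 17
approx_map_length_cm = n_bins * avg_bin_cm

# Approximate counts implied by the rounded percentages in the abstract.
approx_sequenced_loci = round(0.56 * total_loci)
approx_loci_with_function = round(0.66 * approx_sequenced_loci)

print(total_loci, approx_map_length_cm)
print(approx_sequenced_loci, approx_loci_with_function)
```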
Xu, Duo; Jaber, Yousef; Pavlidis, Pavlos; Gokcumen, Omer
2017-09-26
Constructing alignments and phylogenies for a given locus from large genome sequencing studies with relevant outgroups allows novel evolutionary and anthropological insights. However, no user-friendly tool has been developed to integrate thousands of recently available and anthropologically relevant genome sequences to construct complete sequence alignments and phylogenies. Here, we provide VCFtoTree, a user-friendly tool with a graphical user interface that directly accesses online databases to download, parse and analyze genome variation data for regions of interest. Our pipeline combines popular sequence datasets and tree building algorithms with custom data parsing to generate accurate alignments and phylogenies using all the individuals from the 1000 Genomes Project, Neanderthal and Denisovan genomes, as well as reference genomes of Chimpanzee and Rhesus Macaque. It can also be applied to other phased human genomes, as well as genomes from other species. The output of our pipeline includes an alignment in FASTA format and a tree file in newick format. VCFtoTree fulfills the increasing demand for constructing alignments and phylogenies for given loci from thousands of available genomes. Our software provides a user-friendly interface for a wider audience without prerequisite knowledge in programming. VCFtoTree can be accessed from https://github.com/duoduoo/VCFtoTree_3.0.0 .
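To make the VCF-to-alignment step concrete, here is a minimal Python sketch of how phased genotypes might be turned into a FASTA alignment. It is not VCFtoTree's code: the function name `vcf_to_haplotype_fasta`, its parameters, and the toy records are hypothetical, and it assumes a single region, phased genotypes (`|` separator), and biallelic SNPs only; everything else is skipped.

```python
from typing import Dict, List


def vcf_to_haplotype_fasta(
    vcf_lines: List[str], reference: str, region_start: int
) -> str:
    """Build two haplotype sequences per sample from phased biallelic SNPs.

    reference: reference sequence of the region.
    region_start: 1-based coordinate of the first reference base.
    """
    samples: List[str] = []
    haplotypes: Dict[str, List[str]] = {}
    for line in vcf_lines:
        if line.startswith("##"):
            continue  # meta-information lines
        fields = line.rstrip("\n").split("\t")
        if line.startswith("#CHROM"):
            samples = fields[9:]
            for sample in samples:
                # Every haplotype starts as a copy of the reference.
                haplotypes[f"{sample}_1"] = list(reference)
                haplotypes[f"{sample}_2"] = list(reference)
            continue
        pos, ref, alt = int(fields[1]), fields[3], fields[4]
        if len(ref) != 1 or len(alt) != 1:
            continue  # skip indels and multi-allelic sites in this sketch
        idx = pos - region_start
        if not 0 <= idx < len(reference):
            continue  # outside the region of interest
        for sample, call in zip(samples, fields[9:]):
            genotype = call.split(":")[0]
            if "|" not in genotype:
                continue  # unphased call, cannot assign to a haplotype
            for hap_no, allele in enumerate(genotype.split("|"), start=1):
                if allele == "1":
                    haplotypes[f"{sample}_{hap_no}"][idx] = alt
    records = []
    for name, seq in haplotypes.items():
        records.append(">" + name + "\n" + "".join(seq) + "\n")
    return "".join(records)


# Hypothetical toy input: two samples, two SNPs in an 8-bp region.
toy_vcf = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\tS2",
    "1\t102\t.\tC\tT\t.\tPASS\t.\tGT\t0|1\t1|1",
    "1\t105\t.\tA\tG\t.\tPASS\t.\tGT\t1|0\t0|0",
]
fasta = vcf_to_haplotype_fasta(toy_vcf, reference="ACGTACGT", region_start=101)
print(fasta)
```

Because only substitutions are applied, every haplotype keeps the reference length, so the output is already an alignment that a tree-building program could read.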
2013-01-01
Background: Faba bean (Vicia faba L.) is among the earliest domesticated crops from the Near East. Today this legume is a key protein feed and food worldwide and continues to serve an important role in culinary traditions throughout the Middle East, Mediterranean region, China and Ethiopia. Adapted to a wide range of soil types, the main faba bean breeding objectives are to improve yield, resistance to biotic and abiotic stresses, seed quality and other agronomic traits. Genomic approaches aimed at enhancing faba bean breeding programs require high-quality genetic linkage maps to facilitate quantitative trait locus analysis and gene tagging for use in marker-assisted selection. The objective of this study was to construct a reference consensus map in faba bean by joining the information from the most relevant maps reported so far in this crop. Results: A combination of two approaches, increasing the number of anchor loci in diverse mapping populations and joining the corresponding genetic maps, was used to develop a reference consensus map in faba bean. The map was constructed from three main recombinant inbred populations derived from four parental lines, incorporates 729 markers and is based on 69 common loci. It spans 4,602 cM with a range from 323 to 1041 loci in six main linkage groups or chromosomes, and an average marker density of one locus every 6 cM. Locus order is generally well maintained between the consensus map and the individual maps. Conclusion: We have constructed a reliable and fairly dense consensus genetic linkage map that will serve as a basis for genomic approaches in faba bean research and breeding. The core map contains a larger number of markers than any previous individual map, covers existing gaps and achieves a wider coverage of the large faba bean genome as a whole. This tool can be used as a reference resource for studies in different genetic backgrounds, and provides a framework for transferring genetic information when using different marker technologies. Combined with syntenic approaches, the consensus map will increase marker density in selected genomic regions and will be useful for future faba bean molecular breeding applications. PMID:24377374
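The marker density quoted in this abstract follows directly from the map length and marker count. The short Python check below uses only those two stated numbers; the variable names are illustrative.

```python
# Values stated in the abstract.
map_length_cm = 4602
n_markers = 729

# Average spacing between markers, in cM per marker.
avg_spacing_cm = map_length_cm / n_markers

print(f"{avg_spacing_cm:.2f} cM per marker")
```

The result, a little over 6 cM per marker, is consistent with the reported density of about one locus every 6 cM.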
Chiara, Matteo; Horner, David S; Spada, Alberto
2013-01-01
De novo transcriptome characterization from Next Generation Sequencing data has become an important approach in the study of non-model plants. Despite notable advances in the assembly of short reads, the clustering of transcripts into unigene-like (locus-specific) clusters remains a somewhat neglected subject. Indeed, closely related paralogous transcripts are often merged into single clusters by current approaches. Here, a novel heuristic method for locus-specific clustering is compared to that implemented in the de novo assembler Oases, using the same initial transcript collections, derived from Arabidopsis thaliana and the developmental model Streptocarpus rexii. We show that the proposed approach improves cluster specificity in the A. thaliana dataset for which the reference genome is available. Furthermore, for the S. rexii data our filtered transcript collection matches a larger number of distinct annotated loci in reference genomes than the Oases set, while containing a reduced overall number of loci. A detailed discussion of advantages and limitations of our approach in processing de novo transcriptome reconstructions is presented. The proposed method should be widely applicable to other organisms, irrespective of the transcript assembly method employed. The S. rexii transcriptome is available as a sophisticated and augmented publicly available online database.
Morrison, Cheryl L.; Iwanowicz, Luke R.; Work, Thierry M.; Fahsbender, Elizabeth; Breitbart, Mya; Adams, Cynthia; Iwanowicz, Deborah; Sanders, Lakyn; Ackermann, Mathias; Cornman, Robert S.
2018-01-01
Chelonid alphaherpesvirus 5 (ChHV5) is a herpesvirus associated with fibropapillomatosis (FP) in sea turtles worldwide. Single-locus typing has previously shown differentiation between Atlantic and Pacific strains of this virus, with low variation within each geographic clade. However, a lack of multi-locus genomic sequence data hinders understanding of the rate and mechanisms of ChHV5 evolutionary divergence, as well as how these genomic changes may contribute to differences in disease manifestation. To assess genomic variation in ChHV5 among five Hawaii and three Florida green sea turtles, we used high-throughput short-read sequencing of long-range PCR products amplified from tumor tissue using primers designed from the single available ChHV5 reference genome from a Hawaii green sea turtle. This strategy recovered sequence data from both geographic regions for approximately 75% of the predicted ChHV5 coding sequences. The average nucleotide divergence between geographic populations was 1.5%; most of the substitutions were fixed differences between regions. Protein divergence was generally low (average 0.08%), and ranged between 0 and 5.3%. Several atypical genes originally identified and annotated in the reference genome were confirmed in ChHV5 genomes from both geographic locations. Unambiguous recombination events between geographic regions were identified, and clustering of private alleles suggests the prevalence of recombination in the evolutionary history of ChHV5. This study significantly increased the amount of sequence data available from ChHV5 strains, enabling informed selection of loci for future population genetic and natural history studies, and suggesting the (possibly latent) co-infection of individuals by well-differentiated geographic variants. PMID:29479497
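Average nucleotide divergence of the kind reported here is commonly summarized as the proportion of differing sites between aligned sequences (the p-distance). The Python sketch below shows that calculation under stated assumptions: the function name `p_distance` and the toy sequences are hypothetical, the inputs are assumed to be pre-aligned and of equal length, and the abstract does not say which divergence estimator the authors used.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Columns where either sequence has a gap or an ambiguous base
    (anything other than A, C, G, T) are ignored.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "ACGT" and b in "ACGT":
            compared += 1
            if a != b:
                differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Hypothetical toy alignment: 200 sites with 3 substitutions.
toy_a = "ACGT" * 50
toy_b_sites = list(toy_a)
toy_b_sites[10] = "T"   # G -> T
toy_b_sites[77] = "A"   # C -> A
toy_b_sites[150] = "C"  # G -> C
toy_b = "".join(toy_b_sites)

divergence = p_distance(toy_a, toy_b)
print(f"{divergence:.1%}")
```

The toy sequences are constructed to give 1.5% only for illustration; they are not ChHV5 data.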
Bossé, Janine T; Li, Yanwen; Sárközi, Rita; Gottschalk, Marcelo; Angen, Øystein; Nedbalcova, Katerina; Rycroft, Andrew N; Fodor, László; Langford, Paul R
2017-03-01
Actinobacillus pleuropneumoniae causes pleuropneumonia, an economically significant lung disease of pigs. Recently, isolates of A. pleuropneumoniae that were serologically distinct from the previously characterized 15 serovars were described, and a proposal was put forward that they comprised a new serovar, serovar 16. Here we used whole-genome sequencing of the proposed serovar 16 reference strain A-85/14 to confirm the presence of a unique capsular polysaccharide biosynthetic locus. For molecular diagnostics, primers were designed from the capsule locus of strain A-85/14, and a PCR was formulated that differentiated serovar 16 isolates from all 15 known serovars and other common respiratory pathogenic/commensal bacteria of pigs. Analysis of the capsule locus of strain A-85/14 combined with the previous serological data show the existence of a sixteenth serovar, designated serovar 16, of A. pleuropneumoniae. Copyright © 2017 Bossé et al.
Cormier, Alexandre; Avia, Komlan; Sterck, Lieven; Derrien, Thomas; Wucher, Valentin; Andres, Gwendoline; Monsoor, Misharl; Godfroy, Olivier; Lipinska, Agnieszka; Perrineau, Marie-Mathilde; Van De Peer, Yves; Hitte, Christophe; Corre, Erwan; Coelho, Susana M; Cock, J Mark
2017-04-01
The genome of the filamentous brown alga Ectocarpus was the first to be completely sequenced from within the brown algal group and has served as a key reference genome both for this lineage and for the stramenopiles. We present a complete structural and functional reannotation of the Ectocarpus genome. The large-scale assembly of the Ectocarpus genome was significantly improved and genome-wide gene re-annotation using extensive RNA-seq data improved the structure of 11 108 existing protein-coding genes and added 2030 new loci. A genome-wide analysis of splicing isoforms identified an average of 1.6 transcripts per locus. A large number of previously undescribed noncoding genes were identified and annotated, including 717 loci that produce long noncoding RNAs. Conservation of lncRNAs between Ectocarpus and another brown alga, the kelp Saccharina japonica, suggests that at least a proportion of these loci serve a function. Finally, a large collection of single nucleotide polymorphism-based markers was developed for genetic analyses. These resources are available through an updated and improved genome database. This study significantly improves the utility of the Ectocarpus genome as a high-quality reference for the study of many important aspects of brown algal biology and as a reference for genomic analyses across the stramenopiles. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Allele-specific locus binding and genome editing by CRISPR at the p16INK4a locus.
Fujita, Toshitsugu; Yuno, Miyuki; Fujii, Hodaka
2016-07-28
The clustered regularly interspaced short palindromic repeats (CRISPR) system has been adopted for a wide range of biological applications including genome editing. In some cases, dissection of genome functions requires allele-specific genome editing, but the use of CRISPR for this purpose has not been studied in detail. In this study, using the p16INK4a gene in HCT116 as a model locus, we investigated whether chromatin states, such as CpG methylation, or a single-nucleotide gap form in a target site can be exploited for allele-specific locus binding and genome editing by CRISPR in vivo. First, we showed that allele-specific locus binding and genome editing could be achieved by targeting allele-specific CpG-methylated regions, which was successful for one, but not all guide RNAs. In this regard, the molecular basis underlying the success remains elusive at this stage. Next, we demonstrated that an allele-specific single-nucleotide gap form could be employed for allele-specific locus binding and genome editing by CRISPR, although it was important to avoid CRISPR tolerance of a single nucleotide mismatch brought about by mismatched base skipping. Our results provide information that might be useful for applications of CRISPR in studies of allele-specific functions in genomes.
Mujic, Alija Bajro; Kuo, Alan; Tritt, Andrew; Lipzen, Anna; Chen, Cindy; Johnson, Jenifer; Sharma, Aditi; Barry, Kerrie; Grigoriev, Igor V.; Spatafora, Joseph W.
2017-01-01
Divergence of breeding system plays an important role in fungal speciation. Ectomycorrhizal fungi, however, pose a challenge for the study of reproductive biology because most cannot be mated under laboratory conditions. To overcome this barrier, we sequenced the draft genomes of the ectomycorrhizal sister species Rhizopogon vinicolor Smith and Zeller and R. vesiculosus Smith and Zeller (Basidiomycota, Boletales)—the first genomes available for Basidiomycota truffles—and characterized gene content and organization surrounding their mating type loci. Both species possess a pair of homeodomain transcription factor homologs at the mating type A-locus as well as pheromone receptor and pheromone precursor homologs at the mating type B-locus. Comparison of Rhizopogon genomes with genomes from Boletales, Agaricales, and Polyporales revealed synteny of the A-locus region within Boletales, but several genomic rearrangements across orders. Our findings suggest correlation between gene content at the B-locus region and breeding system in Boletales with tetrapolar species possessing more diverse gene content than bipolar species. Rhizopogon vinicolor possesses a greater number of B-locus pheromone receptor and precursor genes than R. vesiculosus, as well as a pair of isoprenyl cysteine methyltransferase genes flanking the B-locus compared to a single copy in R. vesiculosus. Examination of dikaryotic single nucleotide polymorphisms within genomes revealed greater heterozygosity in R. vinicolor, consistent with increased rates of outcrossing. Both species possess the components of a heterothallic breeding system with R. vinicolor possessing a B-locus region structure consistent with tetrapolar Boletales and R. vesiculosus possessing a B-locus region structure intermediate between bipolar and tetrapolar Boletales. PMID:28450370
Dessimoz, Christophe; Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro
2011-09-01
Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references. PMID:21712341
The Past, Present, and Future of Human Centromere Genomics
Aldrup-MacDonald, Megan E.; Sullivan, Beth A.
2014-01-01
The centromere is the chromosomal locus essential for chromosome inheritance and genome stability. Human centromeres are located at repetitive alpha satellite DNA arrays that compose approximately 5% of the genome. Contiguous alpha satellite DNA sequence is absent from the assembled reference genome, limiting current understanding of centromere organization and function. Here, we review the progress in centromere genomics spanning the discovery of the sequence to its molecular characterization and the work done during the Human Genome Project era to elucidate alpha satellite structure and sequence variation. We discuss exciting recent advances in alpha satellite sequence assembly that have provided important insight into the abundance and complex organization of this sequence on human chromosomes. In light of these new findings, we offer perspectives for future studies of human centromere assembly and function. PMID:24683489
Ma, Chun-Lei; Jin, Ji-Qiang; Li, Chun-Fang; Wang, Rong-Kai; Zheng, Hong-Kun; Yao, Ming-Zhe; Chen, Liang
2015-01-01
Genetic maps are important tools in plant genomics and breeding. The present study reports the large-scale discovery of single nucleotide polymorphisms (SNPs) for genetic map construction in tea plant. We developed a total of 6,042 valid SNP markers using specific-locus amplified fragment sequencing (SLAF-seq), and subsequently mapped them into the previous framework map. The final map contained 6,448 molecular markers, distributed across fifteen linkage groups corresponding to the number of tea plant chromosomes. The total map length was 3,965 cM, with an average inter-locus distance of 1.0 cM. This map is the first SNP-based reference map of tea plant, as well as the most saturated one developed to date. The SNP markers and map resources generated in this study provide a wealth of genetic information that can serve as a foundation for downstream genetic analyses, such as the fine mapping of quantitative trait loci (QTL), map-based cloning, marker-assisted selection, and anchoring of scaffolds to facilitate the process of whole genome sequencing projects for tea plant. PMID:26035838
BiQ Analyzer HT: locus-specific analysis of DNA methylation by high-throughput bisulfite sequencing
Lutsik, Pavlo; Feuerbach, Lars; Arand, Julia; Lengauer, Thomas; Walter, Jörn; Bock, Christoph
2011-01-01
Bisulfite sequencing is a widely used method for measuring DNA methylation in eukaryotic genomes. The assay provides single-base pair resolution and, given sufficient sequencing depth, its quantitative accuracy is excellent. High-throughput sequencing of bisulfite-converted DNA can be applied either genome wide or targeted to a defined set of genomic loci (e.g. using locus-specific PCR primers or DNA capture probes). Here, we describe BiQ Analyzer HT (http://biq-analyzer-ht.bioinf.mpi-inf.mpg.de/), a user-friendly software tool that supports locus-specific analysis and visualization of high-throughput bisulfite sequencing data. The software facilitates the shift from time-consuming clonal bisulfite sequencing to the more quantitative and cost-efficient use of high-throughput sequencing for studying locus-specific DNA methylation patterns. In addition, it is useful for locus-specific visualization of genome-wide bisulfite sequencing data. PMID:21565797
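The locus-specific readout described above rests on a simple count: after bisulfite conversion and amplification, an unmethylated cytosine is read as T while a methylated cytosine is still read as C. The following is a minimal sketch of that count, assuming reads that are already aligned to the amplicon and given as equal-length strings. The function name `methylation_level` and the toy reads are hypothetical, and this is not the algorithm used by BiQ Analyzer HT.

```python
def methylation_level(reads, position):
    """Estimate methylation at one cytosine position from aligned reads.

    Assumption: every read is aligned to the same amplicon, so `position`
    indexes the same reference cytosine in each read. A 'C' counts as
    methylated, a 'T' as unmethylated; any other base is ignored.
    """
    methylated = sum(1 for read in reads if read[position] == "C")
    unmethylated = sum(1 for read in reads if read[position] == "T")
    informative = methylated + unmethylated
    if informative == 0:
        return None  # no usable reads at this position
    return methylated / informative


# Toy data (hypothetical): four reads covering a CpG at index 1.
toy_reads = ["ACGT", "ATGT", "ACGT", "ACGT"]
level = methylation_level(toy_reads, 1)
print(f"methylation at position 1: {level:.2f}")
```

With deeper coverage the same ratio becomes a more precise estimate, which is the sense in which quantitative accuracy depends on sequencing depth.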
USDA-ARS's Scientific Manuscript database
Multi-locus genome-wide association studies have become the state-of-the-art procedure to identify quantitative trait loci (QTL) associated with traits simultaneously. However, implementation of multi-locus models is still difficult. In this study, we integrated least angle regression with empirical B...
Joha, Sami; Dauphin, Véronique; Leprêtre, Frédéric; Corm, Sélim; Nicolini, Franck E; Roumier, Christophe; Nibourel, Olivier; Grardel, Nathalie; Maguer-Satta, Véronique; Idziorek, Thierry; Figeac, Martin; Laï, Jean-Luc; Quesnel, Bruno; Etienne, Gabriel; Guilhot, François; Lippert, Eric; Preudhomme, Claude; Roche-Lestienne, Catherine
2011-04-01
To ascertain genomic alterations associated with Imatinib resistance in chronic myeloid leukaemia, we performed high resolution genomic analysis of CD34(+) cells from 25 Imatinib (IM)-resistant and 11 responder CML patients. Using patients' T-cells as reference, we found a significant association between the number of acquired cryptic copy number alterations (CNA) and disease phase (p=0.036) or loss of IM response for patients diagnosed in chronic phase (CP) (p=0.04). Recurrent cryptic losses were identified on chromosomes 7, 12 and 13. On chromosome 7, recurrent deletions of the IKZF1 locus were detected, for the first time, in 4 patients in CP. Copyright © 2010 Elsevier Ltd. All rights reserved.
Developmental Stability Covaries with Genome-Wide and Single-Locus Heterozygosity in House Sparrows
Vangestel, Carl; Mergeay, Joachim; Dawson, Deborah A.; Vandomme, Viki; Lens, Luc
2011-01-01
Fluctuating asymmetry (FA), a measure of developmental instability, has been hypothesized to increase with genetic stress. Despite numerous studies providing empirical evidence for associations between FA and genome-wide properties such as multi-locus heterozygosity, support for single-locus effects remains scant. Here we test if, and to what extent, FA co-varies with single- and multilocus markers of genetic diversity in house sparrow (Passer domesticus) populations along an urban gradient. In line with theoretical expectations, FA was inversely correlated with genetic diversity estimated at genome level. However, this relationship was largely driven by variation at a single key locus. Contrary to our expectations, relationships between FA and genetic diversity were not stronger in individuals from urban populations that experience higher nutritional stress. We conclude that loss of genetic diversity adversely affects developmental stability in P. domesticus, and more generally, that the molecular basis of developmental stability may involve complex interactions between local and genome-wide effects. Further study on the relative effects of single-locus and genome-wide effects on the developmental stability of populations with different genetic properties is therefore needed. PMID:21747940
Puttini, Stefania; Ouvrard-Pascaud, Antoine; Palais, Gael; Beggah, Ahmed T; Gascard, Philippe; Cohen-Tannoudji, Michel; Babinet, Charles; Blot-Chabaud, Marcel; Jaisser, Frederic
2005-03-16
Functional genomic analysis is a challenging step in the so-called post-genomic field. Identification of potential targets using large-scale gene expression analysis requires functional validation to identify those that are physiologically relevant. Genetically modified cell models are often used for this purpose allowing up- or down-expression of selected targets in a well-defined and if possible highly differentiated cell type. However, the generation of such models remains time-consuming and expensive. In order to alleviate this step, we developed a strategy aimed at the rapid and efficient generation of genetically modified cell lines with conditional, inducible expression of various target genes. Efficient knock-in of various constructs, called targeted transgenesis, in a locus selected for its permissibility to the tet inducible system, was obtained through the stimulation of site-specific homologous recombination by the meganuclease I-SceI. Our results demonstrate that targeted transgenesis in a reference inducible locus greatly facilitated the functional analysis of the selected recombinant cells. The efficient screening strategy we have designed makes possible automation of the transfection and selection steps. Furthermore, this strategy could be applied to a variety of highly differentiated cells.
Saraswathy, Kallur Nava; Mukhopadhyay, Rupak; Shukla, Deepti; Kaur, Harpreet; Sachdeva, Mohinder Pal; Rao, A P; Saksena, Deepti; Kalla, Aloke Kumar
2009-02-01
Dopamine receptor D2 (DRD2) is expressed in the central nervous system and has a high affinity for many antipsychotic drugs. Besides several epidemiological investigations on association of DRD2 locus polymorphism(s) with neuropsychiatric problems and addictive behavior, a few polymorphisms in this locus have also been used to understand genomic diversity and population migratory histories globally. The present study attempts to understand the genomic diversity/affinity among four endogamous groups of Andhra Pradesh (India) against the backdrop of diversity studies from other parts of India and the rest of the world, with special reference to DRD2 locus. The four population groups from Adilabad District of Andhra Pradesh, namely, Brahmin (n=50), Nayakpod (n=49), Thoti (n=52), and Kolam (n=53), were included in the study. The DRD2 markers typed for the present study are three biallelic restriction fragments, that is, TaqI A (rs1800497), TaqI B (rs1079597), and TaqI D (rs1800498). Scoring of DRD2 haplotypes with respect to the three TaqI sites shows that five out of eight possible haplotypes are shared by the four populations. Ancestral haplotype B2D2A1 is most frequent among Thotis (0.359). The results of the present study indicate a differential gene flow into South India followed by certain important demographic events resulting in diversified peopling of India.
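The count of eight possible haplotypes follows from three biallelic sites (2 × 2 × 2). A minimal sketch that enumerates them, assuming the allele labels B1/B2, D1/D2 and A1/A2 and the B, D, A ordering implied by the haplotype name B2D2A1 in the abstract; the helper name `enumerate_haplotypes` is hypothetical:

```python
from itertools import product

# Allele labels per TaqI site, ordered as in the haplotype name "B2D2A1".
# The "1"/"2" labels are an assumption based on that naming pattern.
SITES = {
    "TaqI B": ("B1", "B2"),
    "TaqI D": ("D1", "D2"),
    "TaqI A": ("A1", "A2"),
}


def enumerate_haplotypes(sites):
    """Return every combination of one allele per site as a joined string."""
    return ["".join(combo) for combo in product(*sites.values())]


haplotypes = enumerate_haplotypes(SITES)
print(len(haplotypes), "possible haplotypes:")
for name in haplotypes:
    print(" ", name)
```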
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bennetzen, Jeffrey L; Yang, Xiaohan; Ye, Chuyu
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Fuselli, S; Baptista, R P; Panziera, A; Magi, A; Guglielmi, S; Tonin, R; Benazzo, A; Bauzer, L G; Mazzoni, C J; Bertorelle, G
2018-03-24
The major histocompatibility complex (MHC) acts as an interface between the immune system and infectious diseases. Accurate characterization and genotyping of the extremely variable MHC loci are challenging, especially without a reference sequence. We designed a combination of long-range PCR, Illumina short-read, and Oxford Nanopore MinION long-read approaches to capture the genetic variation of the MHC II DRB locus in an Italian population of the Alpine chamois (Rupicapra rupicapra). We utilized long-range PCR to generate a 9 kb fragment of the DRB locus. Amplicons from six different individuals were fragmented, tagged, and simultaneously sequenced with Illumina MiSeq. One of these amplicons was sequenced with the MinION device, which produced long reads covering the entire amplified fragment. A pipeline that combines short and long reads resolved several short tandem repeats and homopolymers and produced a de novo reference, which was then used to map and genotype the short reads from all individuals. The assembled DRB locus showed a high level of polymorphism and the presence of a recombination breakpoint. Our results suggest that an amplicon-based NGS approach coupled with single-molecule MinION nanopore sequencing can efficiently achieve both the assembly and the genotyping of complex genomic regions in multiple individuals in the absence of a reference sequence.
Bogdanova, Vera S.; Zaytseva, Olga O.; Mglinets, Anatoliy V.; Shatskaya, Natalia V.; Kosterin, Oleg E.; Vasiliev, Gennadiy V.
2015-01-01
In crosses of wild and cultivated peas (Pisum sativum L.), nuclear-cytoplasmic incompatibility frequently occurs, manifested as decreased pollen fertility, male gametophyte lethality, and sporophyte lethality. High-throughput sequencing of plastid genomes of one cultivated and four wild pea accessions differing in cross-compatibility was performed. Candidate genes for involvement in the nuclear-plastid conflict were searched in the reconstructed plastid genomes. In the annotated Medicago truncatula genome, nuclear candidate genes were searched in the portion syntenic to the pea chromosome region known to harbor a locus involved in the conflict. In the plastid genomes, a substantial variability of the accD locus represented by nucleotide substitutions and indels was found to correspond to the pattern of cross-compatibility among the accessions analyzed. Amino acid substitutions in the polypeptides encoded by the alleles of a nuclear locus, designated as Bccp3, with a complementary function to accD, fitted the compatibility pattern. The accD locus in the plastid genome encoding the beta subunit of the carboxyltransferase of acetyl-CoA carboxylase and the nuclear locus Bccp3 encoding the biotin carboxyl carrier protein of the same multi-subunit enzyme were nominated as candidate genes for the main contribution to nuclear-cytoplasmic incompatibility in peas. Existence of another nuclear locus involved in the accD-mediated conflict is hypothesized. PMID:25789472
Jhunjhunwala, Suchit; van Zelm, Menno C.; Peak, Mandy M.; Cutchin, Steve; Riblet, Roy; van Dongen, Jacques J.M.; Grosveld, Frank G.; Knoch, Tobias A.; Murre, Cornelis
2009-01-01
SUMMARY The immunoglobulin heavy-chain (Igh) locus is organized into distinct regions that contain multiple variable (VH), diversity (DH), joining (JH) and constant (CH) coding elements. How the Igh locus is structured in 3D space is unknown. To probe the topography of the Igh locus, spatial distance distributions were determined between 12 genomic markers that span the entire Igh locus. Comparison of the distance distributions to computer simulations of alternative chromatin arrangements predicted that the Igh locus is organized into compartments containing clusters of loops separated by linkers. Trilateration and triple-point angle measurements indicated the mean relative 3D positions of the VH, DH, JH, and CH elements, showed compartmentalization and striking conformational changes involving VH and DH-JH elements during early B cell development. In pro-B cells, the entire repertoire of VH regions (2 Mbp) appeared to have merged and juxtaposed to the DH elements, mechanistically permitting long-range genomic interactions to occur with relatively high frequency. PMID:18423198
2011-01-01
Background: Missense mutations in three different genes encoding amyloid-β precursor protein, presenilin 1 and presenilin 2 are recognized to cause familial early-onset Alzheimer disease. Also duplications of the amyloid precursor protein gene have been shown to cause the disease. At the Dept. of Geriatric Medicine, Karolinska University Hospital, Sweden, patients are referred for mutation screening for the identification of nucleotide variations and for determining copy-number of the APP locus. Methods: We combined the method of microsatellite marker genotyping with a quantitative real-time PCR analysis to detect duplications in patients with Alzheimer disease. Results: In 22 DNA samples from individuals diagnosed with clinical Alzheimer disease, we identified one patient carrying a duplication on chromosome 21 which included the APP locus. Further mapping of the chromosomal region by array-comparative genome hybridization showed that the duplication spanned a maximal region of 1.09 Mb. Conclusions: This is the first report of an APP duplication in a Swedish Alzheimer patient and describes the use of quantitative real-time PCR as a tool for determining copy-number of the APP locus. PMID:22044463
Thonberg, Håkan; Fallström, Marie; Björkström, Jenny; Schoumans, Jacqueline; Nennesmo, Inger; Graff, Caroline
2011-11-01
Missense mutations in three different genes encoding amyloid-β precursor protein, presenilin 1 and presenilin 2 are recognized to cause familial early-onset Alzheimer disease. Also duplications of the amyloid precursor protein gene have been shown to cause the disease. At the Dept. of Geriatric Medicine, Karolinska University Hospital, Sweden, patients are referred for mutation screening for the identification of nucleotide variations and for determining copy-number of the APP locus. We combined the method of microsatellite marker genotyping with a quantitative real-time PCR analysis to detect duplications in patients with Alzheimer disease. In 22 DNA samples from individuals diagnosed with clinical Alzheimer disease, we identified one patient carrying a duplication on chromosome 21 which included the APP locus. Further mapping of the chromosomal region by array-comparative genome hybridization showed that the duplication spanned a maximal region of 1.09 Mb. This is the first report of an APP duplication in a Swedish Alzheimer patient and describes the use of quantitative real-time PCR as a tool for determining copy-number of the APP locus.
ERIC Educational Resources Information Center
Tull, Ashley; Freeman, Jerrid P.
2011-01-01
Examined in this study were the identified frames of reference and locus of control used by 478 student affairs administrators. Administrator responses were examined to identify frames of reference most commonly used and their preference order. Locus of control most commonly used and the relationship between frames of reference and locus of…
ERIC Educational Resources Information Center
Nijmeijer, Judith S.; Arias-Vásquez, Alejandro; Rommelse, Nanda N.; Altink, Marieke E.; Buschgens, Cathelijne J.; Fliers, Ellen A.; Franke, Barbara; Minderaa, Ruud B.; Sergeant, Joseph A.; Buitelaar, Jan K.; Hoekstra, Pieter J.; Hartman, Catharina A.
2014-01-01
We studied 261 ADHD probands and 354 of their siblings to assess quantitative trait loci associated with autism spectrum disorder symptoms (as measured by the Children's Social Behavior Questionnaire, CSBQ) using a genome-wide linkage approach, followed by locus-wide association analysis. A genome-wide significant locus for the CSBQ subscale…
Subspecies diversity in bacteriocin production by intestinal Lactobacillus salivarius strains
O’ Shea, Eileen F.; O’ Connor, Paula M.; Raftis, Emma J.; O’ Toole, Paul W.; Stanton, Catherine; Cotter, Paul D.; Ross, R. Paul; Hill, Colin
2012-01-01
A recent comparative genomic hybridization study in our laboratory revealed considerable plasticity within the bacteriocin locus of gastrointestinal strains of Lactobacillus salivarius. Most notably, these analyses led to the identification of two novel unmodified bacteriocins, salivaricin L and salivaricin T, produced by the neonatal isolate L. salivarius DPC6488 with immunity, regulatory and export systems analogous to those of abp118, a two-component bacteriocin produced by the well characterized reference strain L. salivarius UCC118. In this addendum we discuss the intraspecific diversity of our seven bacteriocin-producing L. salivarius isolates on a genome-wide level, and more specifically, with respect to their salivaricin loci. PMID:22892690
Pottorff, Marti; Wanamaker, Steve; Ma, Yaqin Q.; Ehlers, Jeffrey D.; Roberts, Philip A.; Close, Timothy J.
2012-01-01
Fusarium oxysporum f.sp. tracheiphilum (Fot) is a soil-borne fungal pathogen that causes vascular wilt disease in cowpea. Fot race 3 is one of the major pathogens affecting cowpea production in California. Identification of Fot race 3 resistance determinants will expedite delivery of improved cultivars by replacing time-consuming phenotypic screening with selection based on perfect markers, thereby generating successful cultivars in a shorter time period. Resistance to Fot race 3 was studied in the RIL population California Blackeye 27 (resistant) x 24-125B-1 (susceptible). Biparental mapping identified a Fot race 3 resistance locus, Fot3-1, which spanned 3.56 cM on linkage group one of the CB27 x 24-125B-1 genetic map. A marker-trait association narrowed the resistance locus to a 1.2 cM region and identified SNP marker 1_1107 as co-segregating with Fot3-1 resistance. Macro- and microsynteny were observed for the Fot3-1 locus region in Glycine max, where six disease resistance genes were observed in the two syntenic regions of soybean chromosomes 9 and 15. Fot3-1 was identified on the cowpea physical map on BAC clone CH093L18, spanning approximately 208,868 bp on BAC contig250. The Fot3-1 locus was narrowed to 0.5 cM distance on the cowpea genetic map linkage group 6, flanked by SNP markers 1_0860 and 1_1107. BAC clone CH093L18 was sequenced and four cowpea sequences with similarity to leucine-rich repeat serine/threonine protein kinases were identified and are cowpea candidate genes for the Fot3-1 locus. This study has shown how readily candidate genes can be identified for simply inherited agronomic traits when appropriate genetic stocks and integrated genomic resources are available. High co-linearity between cowpea and soybean genomes illustrated that utilizing synteny can transfer knowledge from a reference legume to legumes with less complete genomic resources.
Identification of Fot race 3 resistance genes will enable transfer into high yielding cowpea varieties using marker-assisted selection (MAS). PMID:22860000
Genetical genomics of Populus leaf shape variation
Drost, Derek R.; Puranik, Swati; Novaes, Evandro; ...
2015-06-30
Leaf morphology varies extensively among plant species and is under strong genetic control. Mutagenic screens in model systems have identified genes and established molecular mechanisms regulating leaf initiation, development, and shape. However, it is not known whether this diversity across plant species is related to naturally occurring variation at these genes. Quantitative trait locus (QTL) analysis has revealed a polygenic control for leaf shape variation in different species, suggesting that loci discovered by mutagenesis may only explain part of the naturally occurring variation in leaf shape. Here we undertook a genetical genomics study in a poplar intersectional pseudo-backcross pedigree to identify genetic factors controlling leaf shape. The approach combined QTL discovery in a genetic linkage map anchored to the Populus trichocarpa reference genome sequence and transcriptome analysis.
Zhao, Zhenqing; Gu, Honghui; Sheng, Xiaoguang; Yu, Huifang; Wang, Jiansheng; Huang, Long; Wang, Dan
2016-01-01
Molecular markers and genetic maps play an important role in plant genomics and breeding studies. Cauliflower is an important and distinctive vegetable; however, very few molecular resources have been reported for this species. In this study, a novel, specific-locus amplified fragment (SLAF) sequencing strategy was employed for large-scale single nucleotide polymorphism (SNP) discovery and high-density genetic map construction in a double-haploid, segregating population of cauliflower. A total of 12.47 Gb raw data containing 77.92 M pair-end reads were obtained after processing and 6815 polymorphic SLAFs between the two parents were detected. The average sequencing depths reached 52.66-fold for the female parent and 49.35-fold for the male parent. Subsequently, these polymorphic SLAFs were used to genotype the population and further filtered based on several criteria to construct a genetic linkage map of cauliflower. Finally, 1776 high-quality SLAF markers, including 2741 SNPs, constituted the linkage map with average data integrity of 95.68%. The final map spanned a total genetic length of 890.01 cM with an average marker interval of 0.50 cM, and covered 364.9 Mb of the reference genome. The markers and genetic map developed in this study could provide an important foundation not only for comparative genomics studies within Brassica oleracea species but also for quantitative trait loci identification and molecular breeding of cauliflower. PMID:27047515
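The summary statistics quoted for the cauliflower map can be checked with simple arithmetic. The sketch below assumes the average marker interval is total genetic length divided by marker count, which reproduces the reported 0.50 cM; the authors may have computed it per linkage group or between adjacent markers, so treat this as an approximation. The function names are hypothetical.

```python
def average_marker_interval(map_length_cm, n_markers):
    """Average genetic distance per marker, in cM (simple ratio)."""
    return map_length_cm / n_markers


def physical_per_genetic(covered_mb, map_length_cm):
    """Average physical distance covered per unit of genetic distance, in Mb/cM."""
    return covered_mb / map_length_cm


# Values taken from the abstract above.
MAP_LENGTH_CM = 890.01
N_MARKERS = 1776
COVERED_MB = 364.9

interval = average_marker_interval(MAP_LENGTH_CM, N_MARKERS)
mb_per_cm = physical_per_genetic(COVERED_MB, MAP_LENGTH_CM)
print(f"average marker interval: {interval:.2f} cM")
print(f"physical coverage per cM: {mb_per_cm:.2f} Mb/cM")
```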
Cadle-Davidson, Lance; Gadoury, David; Fresnedo-Ramírez, Jonathan; Yang, Shanshan; Barba, Paola; Sun, Qi; Demmings, Elizabeth M; Seem, Robert; Schaub, Michelle; Nowogrodzki, Anna; Kasinathan, Hema; Ledbetter, Craig; Reisch, Bruce I
2016-10-01
The genomics era brought unprecedented opportunities for genetic analysis of host resistance, but it came with the challenge that accurate and reproducible phenotypes are needed so that genomic results appropriately reflect biology. Phenotyping host resistance by natural infection in the field can produce variable results due to the uncontrolled environment, uneven distribution and genetics of the pathogen, and developmentally regulated resistance among other factors. To address these challenges, we developed highly controlled, standardized methodologies for phenotyping powdery mildew resistance in the context of a phenotyping center, receiving samples of up to 140 grapevine progeny per F1 family. We applied these methodologies to F1 families segregating for REN1- or REN2-mediated resistance and validated that some but not all bioassays identified the REN1 or REN2 locus. A point-intercept method (hyphal transects) to quantify colony density objectively at 8 or 9 days postinoculation proved to be the phenotypic response most reproducibly predicted by these resistance loci. Quantitative trait locus (QTL) mapping with genotyping-by-sequencing maps defined the REN1 and REN2 loci at relatively high resolution. In the reference PN40024 genome under each QTL, nucleotide-binding site-leucine-rich repeat candidate resistance genes were identified: one gene for REN1 and two genes for REN2. The methods described here for centralized resistance phenotyping and high-resolution genetic mapping can inform strategies for breeding resistance to powdery mildews and other pathogens on diverse, highly heterozygous hosts.
Applied Genomics in Cattle – Identification of the SLICK locus in tropically adapted cattle
USDA-ARS's Scientific Manuscript database
Over the past 3 years, ARS scientists have been working to identify the underlying genetic variants responsible for a heat tolerance phenotype in cattle associated with the SLICK locus typically found in Senepol cattle. This presentation reviews the general field of applied genomics in cattle, and ...
de Vries, Paul S; Sabater-Lleal, Maria; Chasman, Daniel I; Trompet, Stella; Ahluwalia, Tarunveer S; Teumer, Alexander; Kleber, Marcus E; Chen, Ming-Huei; Wang, Jie Jin; Attia, John R; Marioni, Riccardo E; Steri, Maristella; Weng, Lu-Chen; Pool, Rene; Grossmann, Vera; Brody, Jennifer A; Venturini, Cristina; Tanaka, Toshiko; Rose, Lynda M; Oldmeadow, Christopher; Mazur, Johanna; Basu, Saonli; Frånberg, Mattias; Yang, Qiong; Ligthart, Symen; Hottenga, Jouke J; Rumley, Ann; Mulas, Antonella; de Craen, Anton J M; Grotevendt, Anne; Taylor, Kent D; Delgado, Graciela E; Kifley, Annette; Lopez, Lorna M; Berentzen, Tina L; Mangino, Massimo; Bandinelli, Stefania; Morrison, Alanna C; Hamsten, Anders; Tofler, Geoffrey; de Maat, Moniek P M; Draisma, Harmen H M; Lowe, Gordon D; Zoledziewska, Magdalena; Sattar, Naveed; Lackner, Karl J; Völker, Uwe; McKnight, Barbara; Huang, Jie; Holliday, Elizabeth G; McEvoy, Mark A; Starr, John M; Hysi, Pirro G; Hernandez, Dena G; Guan, Weihua; Rivadeneira, Fernando; McArdle, Wendy L; Slagboom, P Eline; Zeller, Tanja; Psaty, Bruce M; Uitterlinden, André G; de Geus, Eco J C; Stott, David J; Binder, Harald; Hofman, Albert; Franco, Oscar H; Rotter, Jerome I; Ferrucci, Luigi; Spector, Tim D; Deary, Ian J; März, Winfried; Greinacher, Andreas; Wild, Philipp S; Cucca, Francesco; Boomsma, Dorret I; Watkins, Hugh; Tang, Weihong; Ridker, Paul M; Jukema, Jan W; Scott, Rodney J; Mitchell, Paul; Hansen, Torben; O'Donnell, Christopher J; Smith, Nicholas L; Strachan, David P; Dehghan, Abbas
2017-01-01
An increasing number of genome-wide association (GWA) studies are now using the higher resolution 1000 Genomes Project reference panel (1000G) for imputation, with the expectation that 1000G imputation will lead to the discovery of additional associated loci when compared to HapMap imputation. In order to assess the improvement of 1000G over HapMap imputation in identifying associated loci, we compared the results of GWA studies of circulating fibrinogen based on the two reference panels. Using both HapMap and 1000G imputation we performed a meta-analysis of 22 studies comprising the same 91,953 individuals. We identified six additional signals using 1000G imputation, while 29 loci were associated using both HapMap and 1000G imputation. One locus identified using HapMap imputation was not significant using 1000G imputation. The genome-wide significance threshold of 5×10−8 is based on the number of independent statistical tests using HapMap imputation, and 1000G imputation may lead to further independent tests that should be corrected for. When using a stricter Bonferroni correction for the 1000G GWA study (P-value < 2.5×10−8), the number of loci significant only using HapMap imputation increased to 4 while the number of loci significant only using 1000G decreased to 5. In conclusion, 1000G imputation enabled the identification of 20% more loci than HapMap imputation, although the advantage of 1000G imputation became less clear when a stricter Bonferroni correction was used. More generally, our results provide insights that are applicable to the implementation of other dense reference panels that are under development. PMID:28107422
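The counts and thresholds quoted in this abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only. It assumes the conventional reading of 5×10−8 as a Bonferroni correction of alpha = 0.05 over roughly one million independent tests (the abstract does not state the test count), and it reads "20% more loci" as the six additional 1000G signals relative to the 30 loci found with HapMap imputation. All names are hypothetical.

```python
# Worked arithmetic for the thresholds and locus counts quoted in the abstract.
ALPHA = 0.05


def bonferroni_threshold(n_independent_tests, alpha=ALPHA):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_independent_tests


# Assumption: about one million independent tests lies behind 5e-8,
# so the stricter 2.5e-8 corresponds to about two million tests.
hapmap_threshold = bonferroni_threshold(1_000_000)
strict_threshold = bonferroni_threshold(2_000_000)

# Locus counts taken directly from the abstract.
shared_loci = 29          # significant with both panels
hapmap_only_loci = 1      # significant with HapMap only
additional_1000g = 6      # additional signals with 1000G

hapmap_total = shared_loci + hapmap_only_loci      # 30
relative_gain = additional_1000g / hapmap_total    # 6 / 30

print(f"HapMap-based threshold: {hapmap_threshold:.1e}")
print(f"Stricter threshold:     {strict_threshold:.1e}")
print(f"Relative gain:          {relative_gain:.0%}")
```

Under the stricter threshold the abstract reports 5 loci unique to 1000G and 4 unique to HapMap; the shared count at that threshold is not given, so no totals are computed for it here.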
Zhang, Huamin; Wu, Junqing; Dai, Zihui; Qin, Meiling; Hao, Lingyu; Ren, Yanjing; Li, Qingfei; Zhang, Lugang
2017-03-01
In Chinese cabbage, there are two Rf loci for pol CMS and one of them was mapped to a 12.6-kb region containing a potential candidate gene encoding a PPR protein. In Chinese cabbage (Brassica rapa), polima cytoplasmic male sterility (pol CMS) is an important CMS type and is widely used for hybrid breeding. By extensive test crossing in Chinese cabbage, four restorer lines (92s105, 01s325, 00s109, and 88s148) for pol CMS were screened. By analyzing the allelism of the four restorer lines, it was found that 92s105, 01s325, and 00s109 had the same "restorers of fertility" (Rf) locus (designated as BrRfp1), but 88s148 had a different Rf locus (designated as BrRfp2). For fine mapping the BrRfp1 locus of 92s105, a BC1F1 population with 487 individuals and a BC1F2 population with 2485 individuals were successively constructed. Using simple sequence repeat (SSR) markers developed from the Brassica rapa reference genome and InDel markers derived from whole-genome resequencing data of 94c9 and 92s105, BrRfp1 was mapped to a 12.6-kb region containing a potential candidate gene encoding a pentatricopeptide repeat-containing protein. Based on the nucleotide polymorphisms of the candidate gene sequence between the restoring and nonrestoring alleles, a co-segregating marker SC718 was developed, which would be helpful for hybrid breeding by marker-assisted screening and for detecting new restorer lines.
Variants in ZFHX3 are associated with atrial fibrillation in individuals of European ancestry
Benjamin, Emelia J.; Rice, Kenneth M.; Arking, Dan E.; Pfeufer, Arne; van Noord, Charlotte; Smith, Albert V.; Schnabel, Renate B.; Bis, Joshua C.; Boerwinkle, Eric; Sinner, Moritz F.; Dehghan, Abbas; Lubitz, Steven A.; D’Agostino, Ralph B.; Lumley, Thomas; Ehret, Georg B.; Heeringa, Jan; Aspelund, Thor; Newton-Cheh, Christopher; Larson, Martin G.; Marciante, Kristin D.; Soliman, Elsayed Z.; Rivadeneira, Fernando; Wang, Thomas J.; Eiriksdottir, Gudny; Levy, Daniel; Psaty, Bruce M.; Li, Man; Chamberlain, Alanna M.; Hofman, Albert; Vasan, Ramachandran S.; Harris, Tamara B.; Rotter, Jerome I.; Kao, W.H. Linda; Agarwal, Sunil K.; Ch. Stricker, Bruno H.; Wang, Ke; Launer, Lenore J.; Smith, Nicholas L.; Chakravarti, Aravinda; Uitterlinden, Andre G.; Wolf, Philip A; Sotoodehnia, Nona; Kottgen, Anna; van Duijn, Cornelia M.; Lunetta, Kathryn L.; Heckbert, Susan R.; Gudnason, Vilmundur; Alonso, Alvaro; Kaab, Stefan; Ellinor, Patrick T.; Witteman, Jacqueline C.
2009-01-01
We conducted meta-analyses of genome-wide association studies (GWAS) for atrial fibrillation (AF) in participants from five community-based cohorts. Meta-analyses of 896 prevalent (15,768 referents) and 2,517 incident (21,337 referents) AF cases identified a novel locus for AF (ZFHX3, rs2106261, risk ratio [RR]=1.19; P=2.3×10−7), an association that was replicated in the German AF Network (odds ratio=1.44; P=1.6×10−11). Combining the discovery and replication results, rs2106261 was significantly associated with AF (RR=1.25; P=1.8×10−15). PMID:19597492
Population genomics of parallel hybrid zones in the mimetic butterflies, H. melpomene and H. erato
Ruiz, Mayté; Salazar, Patricio; Counterman, Brian; Medina, Jose Alejandro; Ortiz-Zuazaga, Humberto; Morrison, Anna; Papa, Riccardo
2014-01-01
Hybrid zones can be valuable tools for studying evolution and identifying genomic regions responsible for adaptive divergence and underlying phenotypic variation. Hybrid zones between subspecies of Heliconius butterflies can be very narrow and are maintained by strong selection acting on color pattern. The comimetic species, H. erato and H. melpomene, have parallel hybrid zones in which both species undergo a change from one color pattern form to another. We use restriction-associated DNA sequencing to obtain several thousand genome-wide sequence markers and use these to analyze patterns of population divergence across two pairs of parallel hybrid zones in Peru and Ecuador. We compare two approaches for analysis of this type of data—alignment to a reference genome and de novo assembly—and find that alignment gives the best results for species both closely (H. melpomene) and distantly (H. erato, ∼15% divergent) related to the reference sequence. Our results confirm that the color pattern controlling loci account for the majority of divergent regions across the genome, but we also detect other divergent regions apparently unlinked to color pattern differences. We also use association mapping to identify previously unmapped color pattern loci, in particular the Ro locus. Finally, we identify a new cryptic population of H. timareta in Ecuador, which occurs at relatively low altitude and is mimetic with H. melpomene malleti. PMID:24823669
Losada, Liliana; Varga, John J.; Hostetler, Jessica; Radune, Diana; Kim, Maria; Durkin, Scott; Schneewind, Olaf; Nierman, William C.
2011-01-01
Yersinia pestis is the causative agent of the plague. Y. pestis KIM 10+ strain was passaged and selected for loss of the 102 kb pgm locus, resulting in an attenuated strain, KIM D27. In this study, whole genome sequencing was performed on KIM D27 in order to identify any additional differences. Initial assemblies of 454 data were highly fragmented, and various bioinformatic tools detected between 15 and 465 SNPs and INDELs when comparing both strains, the vast majority associated with A or T homopolymer sequences. Consequently, Illumina sequencing was performed to improve the quality of the assembly. Hybrid sequence assemblies were performed and a total of 56 validated SNP/INDELs and 5 repeat differences were identified in the D27 strain relative to the published KIM 10+ sequence. However, further analysis showed that 55 of these SNP/INDELs and 3 repeats were errors in the KIM 10+ reference sequence. We conclude that both 454 and Illumina sequencing were required to obtain the most accurate and rapid sequence results for Y. pestis KIM D27. SNP and INDEL calls were most accurate when both Newbler and CLC Genomics Workbench were employed. For purposes of obtaining high quality genome sequence differences between strains, any identified differences should be verified in both the new and reference genomes. PMID:21559501
Carlier, Jorge D.; Alabaça, Claudia S.; Sousa, Nelson H.; Coelho, Paula S.; Monteiro, António A.; Paterson, Andrew H.; Leitão, José M.
2011-01-01
We describe the construction of a BAC contig and identification of a minimal tiling path that encompass the dominant and monogenically inherited downy mildew resistance locus Pp523 of Brassica oleracea L. The selection of BAC clones for construction of the physical map was carried out by screening gridded BAC libraries with DNA overgo probes derived from both genetically mapped DNA markers flanking the locus of interest and BAC-end sequences that align to Arabidopsis thaliana sequences within the previously identified syntenic region. The selected BAC clones consistently mapped to three different genomic regions of B. oleracea. Although 83 BAC clones were accurately mapped within a ∼4.6 cM region surrounding the downy mildew resistance locus Pp523, a subset of 33 BAC clones mapped to another region on chromosome C8 that was ∼60 cM away from the resistance gene, and a subset of 63 BAC clones mapped to chromosome C5. These results reflect the triplication of the Brassica genomes since their divergence from a common ancestor shared with A. thaliana, and they are consonant with recent analyses of the C genome of Brassica napus. The assembly of a minimal tiling path constituted by 13 (BoT01) BAC clones that span the Pp523 locus sets the stage for map-based cloning of this resistance gene. PMID:22384370
Haplotag: Software for Haplotype-Based Genotyping-by-Sequencing Analysis
Tinker, Nicholas A.; Bekele, Wubishet A.; Hattori, Jiro
2016-01-01
Genotyping-by-sequencing (GBS), and related methods, are based on high-throughput short-read sequencing of genomic complexity reductions followed by discovery of single nucleotide polymorphisms (SNPs) within sequence tags. This provides a powerful and economical approach to whole-genome genotyping, facilitating applications in genomics, diversity analysis, and molecular breeding. However, due to the complexity of analyzing large data sets, applications of GBS may require substantial time, expertise, and computational resources. Haplotag, the novel GBS software described here, is freely available, and operates with minimal user-investment on widely available computer platforms. Haplotag is unique in fulfilling the following set of criteria: (1) operates without a reference genome; (2) can be used in a polyploid species; (3) provides a discovery mode, and a production mode; (4) discovers polymorphisms based on a model of tag-level haplotypes within sequenced tags; (5) reports SNPs as well as haplotype-based genotypes; and (6) provides an intuitive visual “passport” for each inferred locus. Haplotag is optimized for use in a self-pollinating plant species. PMID:26818073
Babben, Steve; Perovic, Dragan; Koch, Michael; Ordon, Frank
2015-01-01
Recent declines in costs accelerated sequencing of many species with large genomes, including hexaploid wheat (Triticum aestivum L.). Although the draft sequence of bread wheat is known, it is still one of the major challenges to develop locus-specific primers suitable to be used in marker-assisted selection procedures, due to the high homology of the three genomes. In this study we describe an efficient approach for the development of locus-specific primers comprising four steps, i.e. (i) identification of genomic and coding sequences (CDS) of candidate genes, (ii) intron- and exon-structure reconstruction, (iii) identification of wheat A, B and D sub-genome sequences and primer development based on sequence differences between the three sub-genomes, and (iv) testing of primers for functionality, correct size and localisation. This approach was applied to single, low and high copy genes involved in frost tolerance in wheat. In summary, for 27 of these genes for which sequences were derived from Triticum aestivum, Triticum monococcum and Hordeum vulgare, a set of 119 primer pairs was developed and after testing on Nulli-tetrasomic (NT) lines, a set of 65 primer pairs (54.6%), corresponding to 19 candidate genes, turned out to be specific. Out of these a set of 35 fragments was selected for validation via Sanger's amplicon re-sequencing. All fragments, with the exception of one, could be assigned to the original reference sequence. The approach presented here showed a much higher specificity in primer development in comparison to techniques used so far in bread wheat and can be applied to other polyploid species with a known draft sequence. PMID:26565976
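Step (iii) of this approach, primer design from sequence differences between the sub-genomes, can be illustrated with a toy example. The sketch below is not the authors' pipeline. It assumes pre-aligned homoeologous sequences of equal length and simply reports alignment columns where the target sub-genome differs from both others, which are candidate positions for anchoring a sub-genome-specific primer. The sequences and the function name are invented for illustration. The last lines reproduce the 54.6% specificity figure from the reported counts.

```python
def subgenome_specific_sites(aligned, target):
    """Return 0-based alignment columns where `target` differs from every other sub-genome.

    `aligned` maps sub-genome name -> aligned sequence (equal lengths, '-' for gaps).
    """
    ref = aligned[target]
    others = [seq for name, seq in aligned.items() if name != target]
    return [
        i
        for i, base in enumerate(ref)
        if base != "-" and all(other[i] != base for other in others)
    ]


# Toy aligned homoeologous sequences (hypothetical, not real wheat data).
toy_alignment = {
    "A": "ATGCCGTTAGC",
    "B": "ATGCTGTTAGC",
    "D": "ATGCTGTCAGC",
}

a_sites = subgenome_specific_sites(toy_alignment, "A")
b_sites = subgenome_specific_sites(toy_alignment, "B")
d_sites = subgenome_specific_sites(toy_alignment, "D")
print("A-specific columns:", a_sites)
print("B-specific columns:", b_sites)
print("D-specific columns:", d_sites)

# Share of tested primer pairs that proved specific, from the abstract's counts.
specific_pairs, tested_pairs = 65, 119
specific_percent = round(100 * specific_pairs / tested_pairs, 1)
print(f"Specific primer pairs: {specific_percent}%")
```

In this toy alignment sub-genome B has no private site, which mirrors the practical difficulty the abstract describes: where homoeologs are highly similar, a locus-specific primer may not be placeable in every region.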
The RAG2 C-terminus and ATM protect genome integrity by controlling antigen receptor gene cleavage
Chaumeil, Julie; Micsinai, Mariann; Ntziachristos, Panagiotis; Roth, David B.; Aifantis, Iannis; Kluger, Yuval; Deriano, Ludovic; Skok, Jane A.
2013-01-01
Tight control of antigen-receptor gene rearrangement is required to preserve genome integrity and prevent the occurrence of leukemia and lymphoma. Nonetheless, mistakes can happen, leading to the generation of aberrant rearrangements, such as Tcra/d-Igh inter-locus translocations that are a hallmark of ATM deficiency. Current evidence indicates that these translocations arise from the persistence of unrepaired breaks converging at different stages of thymocyte differentiation. Here we show that a defect in feedback control of RAG2 activity gives rise to bi-locus breaks and damage on Tcra/d and Igh in the same T cell at the same developmental stage, which provides a direct mechanism for generating these inter-locus rearrangements. Both the RAG2 C-terminus and ATM prevent bi-locus RAG-mediated cleavage through modulation of 3D conformation (higher order loops) and nuclear organization of the two loci. This limits the number of potential substrates for translocation and provides an important mechanism for protecting genome stability. PMID:23900513
A TAD boundary is preserved upon deletion of the CTCF-rich Firre locus.
Barutcu, A Rasim; Maass, Philipp G; Lewandowski, Jordan P; Weiner, Catherine L; Rinn, John L
2018-04-13
The binding of the transcriptional regulator CTCF to the genome has been implicated in the formation of topologically associated domains (TADs). However, the general mechanisms of folding the genome into TADs are not fully understood. Here we test the effects of deleting a CTCF-rich locus on TAD boundary formation. Using genome-wide chromosome conformation capture (Hi-C), we focus on one TAD boundary on chromosome X harboring ~15 CTCF binding sites and located at the long non-coding RNA (lncRNA) locus Firre. Specifically, this TAD boundary is invariant across evolution, tissues, and temporal dynamics of X-chromosome inactivation. We demonstrate that neither the deletion of this locus nor the ectopic insertion of Firre cDNA or its ectopic expression is sufficient to alter TADs in a sex-specific or allele-specific manner. In contrast, Firre's deletion disrupts the chromatin super-loop formation of the inactive X-chromosome. Collectively, our findings suggest that apart from CTCF binding, additional mechanisms may play roles in establishing TAD boundary formation.
Hoffmann, Katrin; Planitz, Christian; Rüschendorf, Franz; Müller-Myhsok, Bertram; Stassen, Hans H; Lucke, Barbara; Mattheisen, Manuel; Stumvoll, Michael; Bochmann, Rolf; Zschornack, Martin; Wienker, Thomas F; Nürnberg, Peter; Reis, André; Luft, Friedrich C; Lindner, Tom H
2009-05-01
Genome-wide linkage studies and genome-wide association studies have not as yet identified major genes contributing to primary hypertension in the general population. This state-of-affairs suggests considerable heterogeneity with small contributing effects for primary hypertension, or other complex genetic traits, in outbred populations. Isolated populations, as recent data from Iceland and French Canada suggest, could offer a solution to this problem. We studied a Slavic isolate in Germany, the Sorbs, and genotyped 1040 polymorphic microsatellite markers in 87 multigeneration families. Our genome-wide linkage scan revealed a locus on chromosome 1p36.13 at D1S3669-D1S2826 (40.95 cM Marshfield coordinates; logarithm of the odds = 3.45, nominal P = 0.00003) that reached genome-wide significance (P = 0.004), indicating the increased power in isolated populations. The chromosome 1 locus maps to a region in which traits such as diabetes, hyperlipidemia, obesity and BMI cluster. Our results suggest that this locus contributes to the metabolic syndrome, and that further attention in this and other populations is warranted.
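The relationship between the reported LOD score and the nominal P value can be illustrated numerically. The abstract does not say how the P value was obtained; the sketch below assumes the standard large-sample conversion, in which 2·ln(10)·LOD is treated as a chi-square statistic with one degree of freedom and the test is one-sided. The function name is hypothetical.

```python
import math


def lod_to_nominal_p(lod):
    """Approximate one-sided pointwise P value for a LOD score.

    Assumes 2 * ln(10) * LOD follows a chi-square distribution with 1 df
    under the null hypothesis of no linkage.
    """
    chi2 = 2.0 * math.log(10.0) * lod
    # The upper tail of chi-square(1 df) at x equals erfc(sqrt(x / 2)).
    # Halve it because linkage is tested in one direction only.
    return 0.5 * math.erfc(math.sqrt(chi2 / 2.0))


nominal_p = lod_to_nominal_p(3.45)
print(f"LOD 3.45 -> nominal P of about {nominal_p:.1e}")
```

Under these assumptions the result is on the order of 3×10−5, consistent with the nominal P = 0.00003 in the abstract. The genome-wide P = 0.004 additionally accounts for testing many positions across the genome and cannot be derived from the LOD score alone.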
Philippe, Claude; Vargas-Landin, Dulce B; Doucet, Aurélien J; van Essen, Dominic; Vera-Otarola, Jorge; Kuciak, Monika; Corbin, Antoine; Nigumann, Pilvi; Cristofari, Gaël
2016-01-01
LINE-1 (L1) retrotransposons represent approximately one sixth of the human genome, but only the human-specific L1HS-Ta subfamily acts as an endogenous mutagen in modern humans, reshaping both somatic and germline genomes. Due to their high levels of sequence identity and the existence of many polymorphic insertions absent from the reference genome, the transcriptional activation of individual genomic L1HS-Ta copies remains poorly understood. Here we comprehensively mapped fixed and polymorphic L1HS-Ta copies in 12 commonly-used somatic cell lines, and identified transcriptional and epigenetic signatures allowing the unambiguous identification of active L1HS-Ta copies in their genomic context. Strikingly, only a very restricted subset of L1HS-Ta loci - some being polymorphic among individuals - significantly contributes to the bulk of L1 expression, and these loci are differentially regulated among distinct cell lines. Thus, our data support a local model of L1 transcriptional activation in somatic cells, governed by individual-, locus-, and cell-type-specific determinants. DOI: http://dx.doi.org/10.7554/eLife.13926.001 PMID:27016617
Chau, John H; Rahfeldt, Wolfgang A; Olmstead, Richard G
2018-03-01
Targeted sequence capture can be used to efficiently gather sequence data for large numbers of loci, such as single-copy nuclear loci. Most published studies in plants have used taxon-specific locus sets developed individually for a clade using multiple genomic and transcriptomic resources. General locus sets can also be developed from loci that have been identified as single-copy and have orthologs in large clades of plants. We identify and compare a taxon-specific locus set and three general locus sets (conserved ortholog set [COSII], shared single-copy nuclear [APVO SSC] genes, and pentatricopeptide repeat [PPR] genes) for targeted sequence capture in Buddleja (Scrophulariaceae) and outgroups. We evaluate their performance in terms of assembly success, sequence variability, and resolution and support of inferred phylogenetic trees. The taxon-specific locus set had the most target loci. Assembly success was high for all locus sets in Buddleja samples. For outgroups, general locus sets had greater assembly success. Taxon-specific and PPR loci had the highest average variability. The taxon-specific data set produced the best-supported tree, but all data sets showed improved resolution over previous non-sequence capture data sets. General locus sets can be a useful source of sequence capture targets, especially if multiple genomic resources are not available for a taxon.
Hulse-Kemp, Amanda M; Maheshwari, Shamoni; Stoffel, Kevin; Hill, Theresa A; Jaffe, David; Williams, Stephen R; Weisenfeld, Neil; Ramakrishnan, Srividya; Kumar, Vijay; Shah, Preyas; Schatz, Michael C; Church, Deanna M; Van Deynze, Allen
2018-01-01
Linked-Read sequencing technology has recently been employed successfully for de novo assembly of human genomes; however, the utility of this technology for complex plant genomes is unproven. We evaluated the technology for this purpose by sequencing the 3.5-gigabase (Gb) diploid pepper (Capsicum annuum) genome with a single Linked-Read library. Plant genomes, including pepper, are characterized by long, highly similar repetitive sequences. Accordingly, significant effort is used to ensure that the sequenced plant is highly homozygous and the resulting assembly is a haploid consensus. With a phased assembly approach, we targeted a heterozygous F1 derived from a wide cross to assess the ability to derive both haplotypes and characterize a pungency gene with a large insertion/deletion. The Supernova software generated a highly ordered, more contiguous sequence assembly than all currently available C. annuum reference genomes. Over 83% of the final assembly was anchored and oriented using four publicly available de novo linkage maps. A comparison of the annotation of conserved eukaryotic genes indicated the completeness of assembly. The validity of the phased assembly is further demonstrated with the complete recovery of both 2.5-Kb insertion/deletion haplotypes of the PUN1 locus in the F1 sample that represents pungent and nonpungent peppers, as well as nearly full recovery of the BUSCO2 gene set within each of the two haplotypes. The most contiguous pepper genome assembly to date has been generated, which demonstrates that Linked-Read library technology provides a tool to de novo assemble complex, highly repetitive, heterozygous plant genomes. This technology can provide an opportunity to cost-effectively develop high-quality genome assemblies for other complex plants and compare structural and gene differences through accurate haplotype reconstruction.
Cholley, Pascal; Stojanov, Milos; Hocquet, Didier; Thouverez, Michelle; Bertrand, Xavier; Blanc, Dominique S
2015-08-01
Reliable molecular typing methods are necessary to investigate the epidemiology of bacterial pathogens. Reference methods such as multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE) are costly and time consuming. Here, we compared our newly developed double-locus sequence typing (DLST) method for Pseudomonas aeruginosa to MLST and PFGE on a collection of 281 isolates. DLST was as discriminatory as MLST and was able to recognize "high-risk" epidemic clones. Both methods were highly congruent. Not surprisingly, a higher discriminatory power was observed with PFGE. In conclusion, being a simple method (single-strand sequencing of only 2 loci), DLST is valuable as a first-line typing tool for epidemiological investigations of P. aeruginosa. Coupled to a more discriminant method like PFGE or whole genome sequencing, it might represent an efficient typing strategy to investigate or prevent outbreaks. Copyright © 2015 Elsevier Inc. All rights reserved.
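Discriminatory power in typing studies is commonly summarized with Simpson's index of diversity in the form given by Hunter and Gaston, D = 1 − Σ n_j(n_j − 1) / (N(N − 1)), where n_j is the number of isolates of type j and N is the total number of isolates. The abstract does not name the index used, so applying it here is an assumption. The isolate labels below are invented toy data, not the study's 281 isolates, and the function name is hypothetical.

```python
from collections import Counter


def discriminatory_index(type_labels):
    """Hunter-Gaston form of Simpson's index of diversity.

    Returns the probability that two isolates drawn at random
    without replacement belong to different types.
    """
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(type_labels)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Toy typing results for the same six isolates under two methods (hypothetical).
toy_coarse_method = ["t1", "t1", "t2", "t3", "t3", "t3"]
toy_fine_method = ["p1", "p2", "p3", "p4", "p4", "p5"]

d_coarse = discriminatory_index(toy_coarse_method)
d_fine = discriminatory_index(toy_fine_method)
print(f"Coarser method D = {d_coarse:.3f}")
print(f"Finer method D   = {d_fine:.3f}")
```

A method that splits the isolates into more, smaller groups scores closer to 1, which is the sense in which the abstract describes PFGE as having higher discriminatory power than DLST or MLST.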
Nakano, Michiharu; Shimada, Takehiko; Endo, Tomoko; Fujii, Hiroshi; Nesumi, Hirohisa; Kita, Masayuki; Ebina, Masumi; Shimizu, Tokurou; Omura, Mitsuo
2012-02-01
Polyembryony, in which multiple somatic nucellar cell-derived embryos develop in addition to the zygotic embryo in a seed, is common in the genus Citrus. Previous genetic studies indicated polyembryony is mainly determined by a single locus, but the underlying molecular mechanism is still unclear. As a step towards identification and characterization of the gene or genes responsible for nucellar embryogenesis in Citrus, haplotype-specific physical maps around the polyembryony locus were constructed. By sequencing three BAC clones aligned on the polyembryony haplotype, a single contiguous draft sequence consisting of 380 kb containing 70 predicted open reading frames (ORFs) was reconstructed. Single nucleotide polymorphism genotypes detected in the sequenced genomic region showed strong association with embryo type in Citrus, indicating a common polyembryony locus is shared among widely diverse Citrus cultivars and species. The arrangement of the predicted ORFs in the characterized genomic region showed high collinearity to the genomic sequence of chromosome 4 of Vitis vinifera and linkage group VI of Populus trichocarpa, suggesting that the syntenic relationship among these species is conserved even though V. vinifera and P. trichocarpa are non-apomictic species. This is the first study to characterize in detail the genomic structure of an apomixis locus determining adventitious embryony. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Fast and Efficient Drosophila melanogaster Gene Knock-Ins Using MiMIC Transposons
Vilain, Sven; Vanhauwaert, Roeland; Maes, Ine; Schoovaerts, Nils; Zhou, Lujia; Soukup, Sandra; da Cunha, Raquel; Lauwers, Elsa; Fiers, Mark; Verstreken, Patrik
2014-01-01
Modern molecular genetics studies necessitate the manipulation of genes in their endogenous locus, but most of the current methodologies require an inefficient donor-dependent homologous recombination step to locally modify the genome. Here we describe a methodology to efficiently generate Drosophila knock-in alleles by capitalizing on the availability of numerous genomic MiMIC transposon insertions carrying recombinogenic attP sites. Our methodology entails the efficient PhiC31-mediated integration of a recombination cassette flanked by unique I-SceI and/or I-CreI restriction enzyme sites into an attP-site. These restriction enzyme sites allow for double-strand break-mediated removal of unwanted flanking transposon sequences, while leaving the desired genomic modifications or recombination cassettes. As a proof-of-principle, we mutated LRRK, tau, and sky by using different MiMIC elements. We replaced 6 kb of genomic DNA encompassing the tau locus and 35 kb encompassing the sky locus with a recombination cassette that permits easy integration of DNA at these loci, and we also generated a functional LRRK(HA) knock-in allele. Given that ~92% of the Drosophila genes are located within the vicinity (<35 kb) of a MiMIC element, our methodology enables the efficient manipulation of nearly every locus in the fruit fly genome without the need for inefficient donor-dependent homologous recombination events. PMID:25298537
Structural forms of the human amylase locus and their relationships to SNPs, haplotypes, and obesity
Usher, Christina L; Handsaker, Robert E; Esko, Tõnu; Tuke, Marcus A; Weedon, Michael N; Hastie, Alex R; Cao, Han; Moon, Jennifer E; Kashin, Seva; Fuchsberger, Christian; Metspalu, Andres; Pato, Carlos N; Pato, Michele T; McCarthy, Mark I; Boehnke, Michael; Altshuler, David M; Frayling, Timothy M; Hirschhorn, Joel N; McCarroll, Steven A
2016-01-01
Hundreds of genes reside in structurally complex, poorly understood regions of the human genome [1-3]. One such region contains the three amylase genes (AMY2B, AMY2A, and AMY1) responsible for digesting starch into sugar. The copy number of AMY1 is reported to be the genome’s largest influence on obesity [4], though genome-wide association studies for obesity have found this locus unremarkable. Using whole genome sequence analysis [3,5], droplet digital PCR [6], and genome mapping [7], we identified eight common structural haplotypes of the amylase locus that suggest its mutational history. We found that AMY1 copy number in individuals’ genomes is generally even (rather than odd) and partially correlates with nearby SNPs, which do not associate with BMI. We measured amylase gene copy number in 1,000 obese or lean Estonians and in two other cohorts totaling ~3,500 individuals. We had 99% power to detect the lower bound of the reported effects on BMI [4], yet found no association. PMID:26098870
Retter, Ida; Chevillard, Christophe; Scharfe, Maren; Conrad, Ansgar; Hafner, Martin; Im, Tschong-Hun; Ludewig, Monika; Nordsiek, Gabriele; Severitt, Simone; Thies, Stephanie; Mauhar, America; Blöcker, Helmut; Müller, Werner; Riblet, Roy
2009-01-01
Although the entire mouse genome has been sequenced, there remain challenges concerning the elucidation of particular complex and polymorphic genomic loci. In the murine Igh locus, different haplotypes exist in different inbred mouse strains. For example, the Ighb haplotype sequence of the Mouse Genome Project strain C57BL/6 differs considerably from the Igha haplotype of BALB/c, which has been widely used in the analyses of Ab responses. We have sequenced and annotated the 3′ half of the Igha locus of 129S1/SvImJ, covering the CH region and approximately half of the VH region. This sequence comprises 128 VH genes, of which 49 are judged to be functional. The comparison of the Igha sequence with the homologous Ighb region from C57BL/6 revealed two major expansions in the germline repertoire of Igha. In addition, we found smaller haplotype-specific differences like the duplication of five VH genes in the Igha locus. We generated a VH allele table by comparing the individual VH genes of both haplotypes. Surprisingly, the number and position of DH genes in the 129S1 strain differ not only from the sequence of C57BL/6 but also from the map published for BALB/c. Taken together, the contiguous genomic sequence of the 3′ part of the Igha locus allows a detailed view of the recent evolution of this highly dynamic locus in the mouse. PMID:17675503
Takahata, Satoshi; Yago, Takumi; Iwabuchi, Keisuke; Hirakawa, Hideki; Suzuki, Yutaka; Onodera, Yasuyuki
2016-01-01
Spinach (Spinacia oleracea, 2n = 12) and sugar beet (Beta vulgaris, 2n = 18) are important crop members of the family Chenopodiaceae s.s. Sugar beet has a basic chromosome number of 9 and a cosexual breeding system, as do most members of the Chenopodiaceae s.s. By contrast, spinach has a basic chromosome number of 6 and, although certain cultivars and genotypes produce monoecious plants, is considered to be a dioecious species. The loci determining male and monoecious sexual expression were mapped to different positions on the spinach sex chromosomes. In this study, a linkage map with 46 mapped protein-coding sequences was constructed for the spinach sex chromosomes. Comparison of the linkage map with a reference genome sequence of sugar beet revealed that the spinach sex chromosomes exhibited extensive synteny with sugar beet chromosomes 4 and 9. Protein-coding genes tightly linked to the male-determining locus in spinach corresponded to genes located in or around the putative pericentromeric and centromeric regions of sugar beet chromosomes 4 and 9, supporting the observation that recombination rates were low in the vicinity of the male-determining locus. The locus for monoecism was confined to a chromosomal segment corresponding to a region of approximately 1.7 Mb on sugar beet chromosome 9, which may facilitate future positional cloning of the locus. © The American Genetic Association 2016.
Yuan, Bo; Liu, Pengfei; Gupta, Aditya; Beck, Christine R.; Tejomurtula, Anusha; Campbell, Ian M.; Gambin, Tomasz; Simmons, Alexandra D.; Withers, Marjorie A.; Harris, R. Alan; Rogers, Jeffrey; Schwartz, David C.; Lupski, James R.
2015-01-01
Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. PMID:26641089
Variation block-based genomics method for crop plants.
Kim, Yul Ho; Park, Hyang Mi; Hwang, Tae-Young; Lee, Seuk Ki; Choi, Man Soo; Jho, Sungwoong; Hwang, Seungwoo; Kim, Hak-Min; Lee, Dongwoo; Kim, Byoung-Chul; Hong, Chang Pyo; Cho, Yun Sung; Kim, Hyunmin; Jeong, Kwang Ho; Seo, Min Jung; Yun, Hong Tai; Kim, Sun Lim; Kwon, Young-Up; Kim, Wook Han; Chun, Hye Kyung; Lim, Sang Jong; Shin, Young-Ah; Choi, Ik-Young; Kim, Young Sun; Yoon, Ho-Sung; Lee, Suk-Ha; Lee, Sunghoon
2014-06-15
In contrast with wild species, cultivated crop genomes consist of reshuffled recombination blocks, which arise through crossing and selection. Accordingly, recombination block-based genomics analysis can be an effective approach for the screening of target loci for agricultural traits. We propose the variation block method, which is a three-step process for recombination block detection and comparison. The first step is to detect variations by comparing the short-read DNA sequences of the cultivar to the reference genome of the target crop. Next, sequence blocks with variation patterns are examined and defined. The boundaries between the variation-containing sequence blocks are regarded as recombination sites. All the assumed recombination sites in the cultivar set are used to split the genomes, and the resulting sequence regions are termed variation blocks. Finally, the genomes are compared using the variation blocks. The variation block method identified recurring recombination blocks accurately and successfully represented block-level diversities in the publicly available genomes of 31 soybean and 23 rice accessions. The practicality of this approach was demonstrated by the identification of a putative locus determining soybean hilum color. We suggest that the variation block method is an efficient genomics method for the recombination block-level comparison of crop genomes. We expect that this method will facilitate the development of crop genomics by bringing genomics technologies to the field of crop breeding.
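The three steps described in this abstract can be sketched roughly as follows. This is a simplified illustration, not the published implementation: it assumes variants have already been called as 0-based positions, uses fixed-size bins and a hypothetical density threshold (`bin_size`, `min_variants`) to decide whether a stretch is variation-containing, and all function names are invented for the example.

```python
def bin_states(variant_positions, genome_length, bin_size, min_variants=1):
    """Step 2 (simplified): mark each fixed-size bin True if it holds enough variants."""
    n_bins = -(-genome_length // bin_size)  # ceiling division
    counts = [0] * n_bins
    for pos in variant_positions:
        counts[pos // bin_size] += 1
    return [c >= min_variants for c in counts]

def recombination_sites(states, bin_size):
    """Bin boundaries where a cultivar switches between variant-containing and variant-free."""
    return [i * bin_size for i in range(1, len(states)) if states[i] != states[i - 1]]

def variation_blocks(cultivars, genome_length, bin_size):
    """Step 3: split the genome at the pooled sites of all cultivars and label each block.

    cultivars maps a cultivar name to its list of variant positions (step 1 output).
    Returns the block intervals and, per cultivar, one True/False label per block.
    """
    states = {name: bin_states(pos, genome_length, bin_size)
              for name, pos in cultivars.items()}
    sites = sorted({s for st in states.values()
                    for s in recombination_sites(st, bin_size)})
    bounds = [0] + sites + [genome_length]
    blocks = list(zip(bounds[:-1], bounds[1:]))
    # A cultivar's state is constant inside every block, so the first bin suffices.
    labels = {name: [st[start // bin_size] for start, _ in blocks]
              for name, st in states.items()}
    return blocks, labels
```

Comparing cultivars then reduces to comparing their label lists block by block, which is where recurring blocks across accessions would show up.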
Keaton, Jacob M; Gao, Chuan; Guan, Meijian; Hellwege, Jacklyn N; Palmer, Nicholette D; Pankow, James S; Fornage, Myriam; Wilson, James G; Correa, Adolfo; Rasmussen-Torvik, Laura J; Rotter, Jerome I; Chen, Yii-Der I; Taylor, Kent D; Rich, Stephen S; Wagenknecht, Lynne E; Freedman, Barry I; Ng, Maggie C Y; Bowden, Donald W
2018-04-24
Although type 2 diabetes (T2D) results from metabolic defects in insulin secretion and insulin sensitivity, most of the genetic risk loci identified to date relate to insulin secretion. We reported that T2D loci influencing insulin sensitivity may be identified through interactions with insulin secretion loci, thereby leading to T2D. Here, we hypothesize that joint testing of variant main effects and interaction effects with an insulin secretion locus increases power to identify genetic interactions leading to T2D. We tested this hypothesis with an intronic MTNR1B SNP, rs10830963, which is associated with acute insulin response to glucose, a dynamic measure of insulin secretion. rs10830963 was tested for interaction and joint (main + interaction) effects with genome-wide data in African Americans (2,452 cases and 3,772 controls) from five cohorts. Genome-wide genotype data (Affymetrix Human Genome 6.0 array) were imputed to a 1000 Genomes Project reference panel. T2D risk was modeled using logistic regression with rs10830963 dosage, age, sex, and principal component as predictors. Joint effects were captured using the Kraft two degrees of freedom test. Genome-wide significant (P < 5 × 10^-8) interaction with MTNR1B and joint effects were detected for CMIP intronic SNP rs17197883 (P_interaction = 1.43 × 10^-8; P_joint = 4.70 × 10^-8). CMIP variants have been nominally associated with T2D, fasting glucose, and adiponectin in individuals of East Asian ancestry, with high-density lipoprotein, and with waist-to-hip ratio adjusted for body mass index in Europeans. These data support the hypothesis that additional genetic factors contributing to T2D risk, including insulin sensitivity loci, can be identified through interactions with insulin secretion loci. © 2018 WILEY PERIODICALS, INC.
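A two-degrees-of-freedom joint test asks whether a variant's main effect and its interaction term are both zero. One common way to carry it out is a likelihood-ratio comparison of two nested logistic models, sketched below. The abstract does not say whether a likelihood-ratio or Wald form was used, so the likelihood-ratio form, the function names, and the log-likelihood inputs are all assumptions made for illustration.

```python
import math

def joint_2df_pvalue(loglik_null, loglik_full):
    """Likelihood-ratio joint test of main effect + interaction (2 df).

    loglik_null: log-likelihood of the model without the tested variant
                 (covariates and the rs10830963 dosage only, by assumption).
    loglik_full: log-likelihood after adding the variant and variant x rs10830963.
    For 2 df the chi-square survival function has the closed form exp(-x / 2).
    """
    stat = max(2.0 * (loglik_full - loglik_null), 0.0)
    return stat, math.exp(-stat / 2.0)

def statistic_for_pvalue(p):
    """Chi-square (2 df) statistic that corresponds to a given p-value."""
    return -2.0 * math.log(p)

# The genome-wide threshold P < 5e-8 corresponds to a statistic of about 33.6.
threshold_stat = statistic_for_pvalue(5e-8)
```

The closed form avoids any dependency on a statistics library; with other degrees of freedom a chi-square survival function would be needed instead.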
Tikkanen, Tuomas; Leroy, Bernard; Fournier, Jean Louis; Risques, Rosa Ana; Malcikova, Jitka; Soussi, Thierry
2018-07-01
Accurate annotation of genomic variants in human diseases is essential to allow personalized medicine. Assessment of somatic and germline TP53 alterations has now reached the clinic and is required in several circumstances such as the identification of the most effective cancer therapy for patients with chronic lymphocytic leukemia (CLL). Here, we present Seshat, a Web service for annotating TP53 information derived from sequencing data. A flexible framework allows the use of standard file formats such as Mutation Annotation Format (MAF) or Variant Call Format (VCF), as well as common TXT files. Seshat performs accurate variant annotations using the Human Genome Variation Society (HGVS) nomenclature and the stable TP53 genomic reference provided by the Locus Reference Genomic (LRG). In addition, using the 2017 release of the UMD_TP53 database, Seshat provides several kinds of statistical information for each TP53 variant, including database frequency, functional activity, and pathogenicity. The information is delivered in standardized output tables that minimize errors and facilitate comparison of mutational data across studies. Seshat is a beneficial tool to interpret the ever-growing TP53 sequencing data generated by multiple sequencing platforms and it is freely available via the TP53 Website, http://p53.fr or directly at http://vps338341.ovh.net/. © 2018 Wiley Periodicals, Inc.
Cheung, Gordon Y C; Villaruz, Amer E; Joo, Hwang-Soo; Duong, Anthony C; Yeh, Anthony J; Nguyen, Thuan H; Sturdevant, Daniel E; Queck, S Y; Otto, M
2014-07-01
Several methicillin resistance (SCCmec) clusters characteristic of hospital-associated methicillin-resistant Staphylococcus aureus (MRSA) strains harbor the psm-mec locus. In addition to encoding the cytolysin, phenol-soluble modulin (PSM)-mec, this locus has been attributed gene regulatory functions. Here we employed genome-wide transcriptional profiling to define the regulatory function of the psm-mec locus. The immune evasion factor protein A emerged as the primary conserved and strongly regulated target of psm-mec, an effect we show is mediated by the psm-mec RNA. Furthermore, the psm-mec locus exerted regulatory effects that were more moderate in extent. For example, expression of PSM-mec limited expression of mecA, thereby decreasing methicillin resistance. Our study shows that the psm-mec locus has a rare dual regulatory RNA and encoded cytolysin function. Furthermore, our findings reveal a specific mechanism underscoring the recently emerging concept that S. aureus strains balance pronounced virulence and high expression of antibiotic resistance. Published by Elsevier GmbH.
Efficient high-throughput sequencing of a laser microdissected chromosome arm
2013-01-01
Background: Genomic sequence assemblies are key tools for a broad range of gene function and evolutionary studies. The diploid amphibian Xenopus tropicalis plays a pivotal role in these fields due to its combination of experimental flexibility, diploid genome, and early-branching tetrapod taxonomic position, having diverged from the amniote lineage ~360 million years ago. A genome assembly and a genetic linkage map have recently been made available. Unfortunately, large gaps in the linkage map attenuate long-range integrity of the genome assembly. Results: We laser dissected the short arm of X. tropicalis chromosome 7 for next generation sequencing and computational mapping to the reference genome. This arm is of particular interest as it encodes the sex determination locus, but its genetic map contains large gaps which undermine available genome assemblies. Whole genome amplification of 15 laser-microdissected 7p arms followed by next generation sequencing yielded ~35 million reads, over four million of which uniquely mapped to the X. tropicalis genome. Our analysis placed more than 200 previously unmapped scaffolds on the analyzed chromosome arm, providing valuable low-resolution physical map information for de novo genome assembly. Conclusion: We present a new approach for improving and validating genetic maps and sequence assemblies. Whole genome amplification of 15 microdissected chromosome arms provided sufficient high-quality material for localizing previously unmapped scaffolds and genes as well as recognizing mislocalized scaffolds. PMID:23714049
Chen, J W; Wang, L; Pang, X F; Pan, Q H
2006-04-01
Genetic analysis and fine mapping of a resistance gene against brown planthopper (BPH) biotype 2 in rice was performed using two F(2) populations derived from two crosses between a resistant indica cultivar (cv.), AS20-1, and two susceptible japonica cvs., Aichi Asahi and Lijiangxintuanheigu. Insect resistance was evaluated using F(1) plants and the two F(2) populations. The results showed that a single recessive gene, tentatively designated as bph19(t), conditioned the resistance in AS20-1. A linkage analysis, mainly employing microsatellite markers, was carried out in the two F(2) populations through bulked segregant analysis and recessive class analysis (RCA), in combination with bioinformatics analysis (BIA). The resistance gene locus bph19(t) was finely mapped to a region of about 1.0 cM on the short arm of chromosome 3, flanked by markers RM6308 and RM3134, where one known marker RM1022, and four new markers, b1, b2, b3 and b4, developed in the present study were co-segregating with the locus. To physically map this locus, the bph19(t)-linked markers were landed on bacterial artificial chromosome or P1 artificial chromosome clones of the reference cv., Nipponbare, released by the International Rice Genome Sequencing Project. Sequence information of these clones was used to construct a physical map of the bph19(t) locus, in silico, by BIA. The bph19(t) locus was physically defined to an interval of about 60 kb. The detailed genetic and physical maps of the bph19(t) locus will facilitate marker-assisted gene pyramiding and cloning.
Conservatism and novelty in the genetic architecture of adaptation in Heliconius butterflies.
Huber, B; Whibley, A; Poul, Y L; Navarro, N; Martin, A; Baxter, S; Shah, A; Gilles, B; Wirth, T; McMillan, W O; Joron, M
2015-05-01
Understanding the genetic architecture of adaptive traits has been at the centre of modern evolutionary biology since Fisher; however, evaluating how the genetic architecture of ecologically important traits influences their diversification has been hampered by the scarcity of empirical data. Now, high-throughput genomics facilitates the detailed exploration of variation in the genome-to-phenotype map among closely related taxa. Here, we investigate the evolution of wing pattern diversity in Heliconius, a clade of neotropical butterflies that have undergone an adaptive radiation for wing-pattern mimicry and are influenced by distinct selection regimes. Using crosses between natural wing-pattern variants, we used genome-wide restriction site-associated DNA (RAD) genotyping, traditional linkage mapping and multivariate image analysis to study the evolution of the architecture of adaptive variation in two closely related species: Heliconius hecale and H. ismenius. We implemented a new morphometric procedure for the analysis of whole-wing pattern variation, which allows visualising spatial heatmaps of genotype-to-phenotype association for each quantitative trait locus separately. We used the H. melpomene reference genome to fine-map variation for each major wing-patterning region uncovered, evaluated the role of candidate genes and compared genetic architectures across the genus. Our results show that, although the loci responding to mimicry selection are highly conserved between species, their effect size and phenotypic action vary throughout the clade. Multilocus architecture is ancestral and maintained across species under directional selection, whereas the single-locus (supergene) inheritance controlling polymorphism in H. numata appears to have evolved only once. Nevertheless, the conservatism in the wing-patterning toolkit found throughout the genus does not appear to constrain phenotypic evolution towards local adaptive optima.
Iskow, Rebecca C.; Austermann, Christian; Scharer, Christopher D.; Raj, Towfique; Boss, Jeremy M.; Sunyaev, Shamil; Price, Alkes; Stranger, Barbara; Simon, Viviana; Lee, Charles
2013-01-01
Ancient population structure shaping contemporary genetic variation has been recently appreciated and has important implications regarding our understanding of the structure of modern human genomes. We identified a ∼36-kb DNA segment in the human genome that displays an ancient substructure. The variation at this locus exists primarily as two highly divergent haplogroups. One of these haplogroups (the NE1 haplogroup) aligns with the Neandertal haplotype and contains a 4.6-kb deletion polymorphism in perfect linkage disequilibrium with 12 single nucleotide polymorphisms (SNPs) across diverse populations. The other haplogroup, which does not contain the 4.6-kb deletion, aligns with the chimpanzee haplotype and is likely ancestral. Africans have higher overall pairwise differences with the Neandertal haplotype than Eurasians do for this NE1 locus (p < 10^-15). Moreover, the nucleotide diversity at this locus is higher in Eurasians than in Africans. These results mimic signatures of recent Neandertal admixture contributing to this locus. However, an in-depth assessment of the variation in this region across multiple populations reveals that African NE1 haplotypes, albeit rare, harbor more sequence variation than NE1 haplotypes found in Europeans, indicating an ancient African origin of this haplogroup and refuting recent Neandertal admixture. Population genetic analyses of the SNPs within each of these haplogroups, along with genome-wide comparisons revealed significant FST (p = 0.00003) and positive Tajima's D (p = 0.00285) statistics, pointing to non-neutral evolution of this locus. The NE1 locus harbors no protein-coding genes, but contains transcribed sequences as well as sequences with putative regulatory function based on bioinformatic predictions and in vitro experiments. We postulate that the variation observed at this locus predates Human–Neandertal divergence and is evolving under balancing selection, especially among European populations. PMID:23593015
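Two of the statistics named in this abstract, nucleotide diversity (mean pairwise differences) and Tajima's D, can be computed from a set of aligned haplotype sequences as sketched below. The code follows the standard textbook formula for Tajima's D; the function name and the toy input format (equal-length strings) are assumptions for illustration, and no attempt is made to reproduce the study's significance testing.

```python
import math
from collections import Counter

def tajimas_d(seqs):
    """Return (pi, S, D) for aligned, equal-length sequences; D is None when S == 0.

    pi is the mean number of pairwise differences, S the number of segregating sites.
    """
    n = len(seqs)
    if n < 4:
        raise ValueError("need at least 4 sequences (variance is zero for n < 4)")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to equal length")
    pairs = n * (n - 1) / 2
    seg_sites = 0
    total_diffs = 0.0
    for column in zip(*seqs):
        counts = Counter(column)
        if len(counts) > 1:
            seg_sites += 1
            # Number of sequence pairs that differ at this site.
            total_diffs += (n * n - sum(c * c for c in counts.values())) / 2
    pi = total_diffs / pairs
    if seg_sites == 0:
        return pi, 0, None
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return pi, seg_sites, (pi - seg_sites / a1) / math.sqrt(variance)
```

An excess of intermediate-frequency variants pushes D above zero, which is the direction the abstract reports and interprets as consistent with balancing selection; an excess of singletons pushes it below zero.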
Jiang, Jiyang; Thalamuthu, Anbupalam; Ho, Jennifer E.; Mahajan, Anubha; Ek, Weronica E.; Brown, David A.; Breit, Samuel N.; Wang, Thomas J.; Gyllensten, Ulf; Chen, Ming-Huei; Enroth, Stefan; Januzzi, James L.; Lind, Lars; Armstrong, Nicola J.; Kwok, John B.; Schofield, Peter R.; Wen, Wei; Trollor, Julian N.; Johansson, Åsa; Morris, Andrew P.; Vasan, Ramachandran S.; Sachdev, Perminder S.; Mather, Karen A.
2018-01-01
Blood levels of growth differentiation factor-15 (GDF-15), also known as macrophage inhibitory cytokine-1 (MIC-1), have been associated with various pathological processes and diseases, including cardiovascular disease and cancer. Prior studies suggest genetic factors play a role in regulating blood MIC-1/GDF-15 concentration. In the current study, we conducted the largest genome-wide association study (GWAS) to date using a sample of ∼5,400 community-based Caucasian participants, to determine the genetic variants associated with MIC-1/GDF-15 blood concentration. Conditional and joint (COJO), gene-based association, and gene-set enrichment analyses were also carried out to identify novel loci, genes, and pathways. Consistent with prior results, a locus on chromosome 19, which includes nine single nucleotide polymorphisms (SNPs) (top SNP, rs888663, p = 1.690 × 10^-35), was significantly associated with blood MIC-1/GDF-15 concentration, and explained 21.47% of its variance. COJO analysis showed evidence for two independent signals within this locus. Gene-based analysis confirmed the chromosome 19 locus association and, in addition, identified a putative locus on chromosome 1. Gene-set enrichment analyses showed that the “COPI-mediated anterograde transport” gene-set was associated with MIC-1/GDF-15 blood concentration with marginal significance after FDR correction (p = 0.067). In conclusion, a locus on chromosome 19 was associated with MIC-1/GDF-15 blood concentration with genome-wide significance, with evidence for a new locus (chromosome 1). Future studies using independent cohorts are needed to confirm the observed associations, especially for the chromosome 1 locus, and to further investigate and identify the causal SNPs that contribute to MIC-1/GDF-15 levels. PMID:29628937
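The gene-set result above is reported after FDR correction. The abstract does not name the procedure, so the sketch below assumes the widely used Benjamini-Hochberg step-up adjustment; the function name and the example p-values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices, smallest p first
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so each adjusted value is the minimum
    # of p * m / rank over its own rank and all higher ranks.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted
```

For example, raw p-values of 0.01, 0.04, 0.03 and 0.005 adjust to 0.02, 0.04, 0.04 and 0.02, so a gene set is called at a given FDR level when its adjusted value falls below that level.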
Blair, Matthew W; Prieto, Sergio; Díaz, Lucy M; Buendía, Héctor F; Cardona, César
2010-04-29
An interesting seed protein family with a role in preventing insect herbivory is the multi-gene, APA family encoding the alpha-amylase inhibitor, phytohemagglutinin and arcelin proteins of common bean (Phaseolus vulgaris). Variability for this gene family exists and has been exploited to breed for insect resistance. For example, the arcelin locus has been successfully transferred from wild to cultivated common bean genotypes to provide resistance against the bruchid species Zabrotes subfasciatus although the process has been hampered by a lack of genetic tools for and understanding about the locus. In this study, we analyzed linkage disequilibrium (LD) between microsatellite markers at the APA locus and bruchid resistance in a germplasm survey of 105 resistant and susceptible genotypes and compared this with LD in other parts of the genome. Microsatellite allele diversity was found to vary with each of the eight APA-linked markers analyzed, and two markers within the APA locus were found to be diagnostic for bruchid resistance or susceptibility and for the different arcelin alleles inherited from the wild accessions. Arc1 was found to provide higher levels of resistance than Arc5 and the markers in the APA locus were highly associated with resistance showing that introgression of this gene-family from wild beans provides resistance in cultivated beans. LD around the APA locus was found to be intermediate compared to other regions of the genome and the highest LD was found within the APA locus itself for example between the markers PV-atct001 and PV-ag004. We found the APA locus to be an important genetic determinant of bruchid resistance and also found that LD existed mostly within the APA locus but not beyond it. Moderate LD was also found for some other regions of the genome perhaps related to domestication genes. The LD pattern may reflect the introgression of arcelin from the wild into the cultivated background through breeding. 
LD and association studies for the arcelin gene, linked genes and other members of the APA family are essential for breaking linkage drag while maintaining high levels of bruchid resistance in common bean.
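Linkage disequilibrium between two markers, as analysed in this abstract, is usually summarised by D, D' and r^2. The sketch below handles only the simplest case of two biallelic markers with phased haplotypes coded 0/1; the microsatellites used in the study are multi-allelic, so this is an illustration of the quantities rather than the study's method, and the function name is hypothetical.

```python
def pairwise_ld(haplotypes):
    """Compute (D, D_prime, r2) from phased haplotypes given as (a, b) pairs of 0/1 alleles."""
    n = len(haplotypes)
    if n == 0:
        raise ValueError("no haplotypes supplied")
    p_a = sum(a for a, _ in haplotypes) / n          # frequency of allele 1 at marker A
    p_b = sum(b for _, b in haplotypes) / n          # frequency of allele 1 at marker B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    if p_a in (0.0, 1.0) or p_b in (0.0, 1.0):
        raise ValueError("both markers must be polymorphic")
    d = p_ab - p_a * p_b
    # D' scales |D| by the largest value it could take given the allele frequencies.
    if d > 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d != 0 else 0.0
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r2
```

Two markers that always travel together give D' = r^2 = 1, while markers whose alleles combine at random give values of 0.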
van Rheenen, Wouter; Shatunov, Aleksey; Dekker, Annelot M; McLaughlin, Russell L; Diekstra, Frank P; Pulit, Sara L; van der Spek, Rick A A; Võsa, Urmo; de Jong, Simone; Robinson, Matthew R; Yang, Jian; Fogh, Isabella; van Doormaal, Perry TC; Tazelaar, Gijs H P; Koppers, Max; Blokhuis, Anna M; Sproviero, William; Jones, Ashley R; Kenna, Kevin P; van Eijk, Kristel R; Harschnitz, Oliver; Schellevis, Raymond D; Brands, William J; Medic, Jelena; Menelaou, Androniki; Vajda, Alice; Ticozzi, Nicola; Lin, Kuang; Rogelj, Boris; Vrabec, Katarina; Ravnik-Glavač, Metka; Koritnik, Blaž; Zidar, Janez; Leonardis, Lea; Grošelj, Leja Dolenc; Millecamps, Stéphanie; Salachas, François; Meininger, Vincent; de Carvalho, Mamede; Pinto, Susana; Mora, Jesus S; Rojas-García, Ricardo; Polak, Meraida; Chandran, Siddharthan; Colville, Shuna; Swingler, Robert; Morrison, Karen E; Shaw, Pamela J; Hardy, John; Orrell, Richard W; Pittman, Alan; Sidle, Katie; Fratta, Pietro; Malaspina, Andrea; Topp, Simon; Petri, Susanne; Abdulla, Susanne; Drepper, Carsten; Sendtner, Michael; Meyer, Thomas; Ophoff, Roel A; Staats, Kim A; Wiedau-Pazos, Martina; Lomen-Hoerth, Catherine; Van Deerlin, Vivianna M; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Basak, A Nazli; Tunca, Ceren; Hamzeiy, Hamid; Parman, Yesim; Meitinger, Thomas; Lichtner, Peter; Radivojkov-Blagojevic, Milena; Andres, Christian R; Maurel, Cindy; Bensimon, Gilbert; Landwehrmeyer, Bernhard; Brice, Alexis; Payan, Christine A M; Saker-Delye, Safaa; Dürr, Alexandra; Wood, Nicholas W; Tittmann, Lukas; Lieb, Wolfgang; Franke, Andre; Rietschel, Marcella; Cichon, Sven; Nöthen, Markus M; Amouyel, Philippe; Tzourio, Christophe; Dartigues, Jean-François; Uitterlinden, Andre G; Rivadeneira, Fernando; Estrada, Karol; Hofman, Albert; Curtis, Charles; Blauw, Hylke M; van der Kooi, Anneke J; de Visser, Marianne; Goris, An; Weber, Markus; Shaw, Christopher E; Smith, Bradley N; Pansarasa, Orietta; Cereda, Cristina; Bo, Roberto Del; Comi, Giacomo P; D’Alfonso, 
Sandra; Bertolin, Cinzia; Sorarù, Gianni; Mazzini, Letizia; Pensato, Viviana; Gellera, Cinzia; Tiloca, Cinzia; Ratti, Antonia; Calvo, Andrea; Moglia, Cristina; Brunetti, Maura; Arcuti, Simona; Capozzo, Rosa; Zecca, Chiara; Lunetta, Christian; Penco, Silvana; Riva, Nilo; Padovani, Alessandro; Filosto, Massimiliano; Muller, Bernard; Stuit, Robbert Jan; Blair, Ian; Zhang, Katharine; McCann, Emily P; Fifita, Jennifer A; Nicholson, Garth A; Rowe, Dominic B; Pamphlett, Roger; Kiernan, Matthew C; Grosskreutz, Julian; Witte, Otto W; Ringer, Thomas; Prell, Tino; Stubendorff, Beatrice; Kurth, Ingo; Hübner, Christian A; Leigh, P Nigel; Casale, Federico; Chio, Adriano; Beghi, Ettore; Pupillo, Elisabetta; Tortelli, Rosanna; Logroscino, Giancarlo; Powell, John; Ludolph, Albert C; Weishaupt, Jochen H; Robberecht, Wim; Van Damme, Philip; Franke, Lude; Pers, Tune H; Brown, Robert H; Glass, Jonathan D; Landers, John E; Hardiman, Orla; Andersen, Peter M; Corcia, Philippe; Vourc’h, Patrick; Silani, Vincenzo; Wray, Naomi R; Visscher, Peter M; de Bakker, Paul I W; van Es, Michael A; Pasterkamp, R Jeroen; Lewis, Cathryn M; Breen, Gerome; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan H
2017-01-01
To elucidate the genetic architecture of amyotrophic lateral sclerosis (ALS) and find associated loci, we assembled a custom imputation reference panel from whole-genome-sequenced patients with ALS and matched controls (n = 1,861). Through imputation and mixed-model association analysis in 12,577 cases and 23,475 controls, combined with 2,579 cases and 2,767 controls in an independent replication cohort, we fine-mapped a new risk locus on chromosome 21 and identified C21orf2 as a gene associated with ALS risk. In addition, we identified MOBP and SCFD1 as new associated risk loci. We established evidence of ALS being a complex genetic trait with a polygenic architecture. Furthermore, we estimated the SNP-based heritability at 8.5%, with a distinct and important role for low-frequency variants (frequency 1–10%). This study motivates the interrogation of larger samples with full genome coverage to identify rare causal variants that underpin ALS risk. PMID:27455348
Duncan, Laramie; Yilmaz, Zeynep; Gaspar, Helena; Walters, Raymond; Goldstein, Jackie; Anttila, Verneri; Bulik-Sullivan, Brendan; Ripke, Stephan; Thornton, Laura; Hinney, Anke; Daly, Mark; Sullivan, Patrick F; Zeggini, Eleftheria; Breen, Gerome; Bulik, Cynthia M
2017-09-01
The authors conducted a genome-wide association study of anorexia nervosa and calculated genetic correlations with a series of psychiatric, educational, and metabolic phenotypes. Following uniform quality control and imputation procedures using the 1000 Genomes Project (phase 3) in 12 case-control cohorts comprising 3,495 anorexia nervosa cases and 10,982 controls, the authors performed standard association analysis followed by a meta-analysis across cohorts. Linkage disequilibrium score regression was used to calculate genome-wide common variant heritability (single-nucleotide polymorphism [SNP]-based heritability [h²SNP]), partitioned heritability, and genetic correlations (rg) between anorexia nervosa and 159 other phenotypes. Results were obtained for 10,641,224 SNPs and insertion-deletion variants with minor allele frequencies >1% and imputation quality scores >0.6. The h²SNP of anorexia nervosa was 0.20 (SE=0.02), suggesting that a substantial fraction of the twin-based heritability arises from common genetic variation. The authors identified one genome-wide significant locus on chromosome 12 (rs4622308) in a region harboring a previously reported type 1 diabetes and autoimmune disorder locus. Significant positive genetic correlations were observed between anorexia nervosa and schizophrenia, neuroticism, educational attainment, and high-density lipoprotein cholesterol, and significant negative genetic correlations were observed between anorexia nervosa and body mass index, insulin, glucose, and lipid phenotypes. Anorexia nervosa is a complex heritable phenotype for which this study has uncovered the first genome-wide significant locus. Anorexia nervosa also has large and significant genetic correlations with both psychiatric phenotypes and metabolic traits. The study results encourage a reconceptualization of this frequently lethal disorder as one with both psychiatric and metabolic etiology.
The unusual S locus of Leavenworthia is composed of two sets of paralogous loci.
Chantha, Sier-Ching; Herman, Adam C; Castric, Vincent; Vekemans, Xavier; Marande, William; Schoen, Daniel J
2017-12-01
The Leavenworthia self-incompatibility locus (S locus) consists of paralogs (Lal2, SCRL) of the canonical Brassicaceae S locus genes (SRK, SCR), and is situated in a genomic position that differs from the ancestral one in the Brassicaceae. Unexpectedly, in a small number of Leavenworthia alabamica plants examined, sequences closely resembling exon 1 of SRK have been found, but the function of these has remained unclear. BAC cloning and expression analyses were employed to characterize these SRK-like sequences. An SRK-positive Bacterial Artificial Chromosome clone was found to contain complete SRK and SCR sequences located close by one another in the derived genomic position of the Leavenworthia S locus, and in place of the more typical Lal2 and SCRL sequences. These sequences are expressed in stigmas and anthers, respectively, and crossing data show that the SRK/SCR haplotype is functional in self-incompatibility. Population surveys indicate that < 5% of Leavenworthia S loci possess such alleles. An ancestral translocation or recombination event involving SRK/SCR and Lal2/SCRL likely occurred, together with neofunctionalization of Lal2/SCRL, and both haplotype groups now function as Leavenworthia S locus alleles. These findings suggest that S locus alleles can have distinctly different evolutionary origins. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Gianfrancesco, Olympia; Griffiths, Daniel; Myers, Paul; Collier, David A; Bubb, Vivien J; Quinn, John P
2016-10-01
Genome-wide association studies (GWAS) have identified a region at chromosome 1p21.3, containing the microRNA MIR137, to be among the most significant associations for schizophrenia. However, the mechanism by which genetic variation at this locus increases risk of schizophrenia is unknown. Identifying key regulatory regions around MIR137 is crucial to understanding the potential role of this gene in the aetiology of psychiatric disorders. Through alignment of vertebrate genomes, we identified seven non-coding evolutionarily conserved regions (ECRs) at the MIR137 locus with conservation comparable to exons (>70%). Bioinformatic analysis using the Psychiatric Genomics Consortium GWAS dataset for schizophrenia showed five of the ECRs to have genome-wide significant SNPs in or adjacent to their sequence. Analysis of available datasets on chromatin marks and histone modification data showed that three of the ECRs were predicted to be functional in the human brain, and three in development. In vitro analysis of ECR activity using reporter gene assays showed that all seven of the selected ECRs displayed transcriptional regulatory activity in the SH-SY5Y neuroblastoma cell line. These data suggest a regulatory role in the developing and adult brain for these highly conserved regions at the MIR137 schizophrenia-associated locus and, further, that these domains could act individually or synergistically to regulate levels of MIR137 expression.
Douvris, Adrianna; Soubeyrand, Sébastien; Naing, Thet; Martinuk, Amy; Nikpay, Majid; Williams, Andrew; Buick, Julie; Yauk, Carole; McPherson, Ruth
2014-06-03
The TRIB1 locus has been linked to hepatic triglyceride metabolism in mice and to plasma triglycerides and coronary artery disease in humans. The lipid-associated single nucleotide polymorphisms (SNPs), identified by genome-wide association studies, are located ≈30 kb downstream from TRIB1, suggesting complex regulatory effects on genes or pathways relevant to hepatic triglyceride metabolism. The goal of this study was to investigate the functional relationship between common SNPs at the TRIB1 locus and plasma lipid traits. Characterization of the risk locus reveals that it encompasses a gene, TRIB1-associated locus (TRIBAL), composed of a well-conserved promoter region and an alternatively spliced transcript. Bioinformatic analysis and resequencing identified a single SNP, rs2001844, within the promoter region that associates with increased plasma triglycerides and reduced high-density lipoprotein cholesterol and coronary artery disease risk. Further, correction for triglycerides as a covariate indicated that the genome-wide association signal is largely dependent on triglycerides. In addition, we show that rs2001844 is an expression quantitative trait locus (eQTL) for TRIB1 expression in blood and alters TRIBAL promoter activity in a reporter assay model. The TRIBAL transcript has features typical of long noncoding RNAs, including poor sequence conservation. Modulation of TRIBAL expression had limited impact on the mRNA levels of either TRIB1 or lipid regulatory genes in human hepatocyte models. In contrast, TRIB1 knockdown markedly increased TRIBAL expression in HepG2 cells and primary human hepatocytes. These studies demonstrate an interplay between a novel locus, TRIBAL, and TRIB1. TRIBAL is located in the risk locus identified by genome-wide association studies, responds to altered expression of TRIB1, harbors a risk SNP that is an eQTL for TRIB1 expression, and associates with plasma triglyceride concentrations. © 2014 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley Blackwell.
2013-01-01
Background: In this report we have explored the genomic and microbiological basis for a sustained increase in bloodstream infections at a major Australian hospital caused by Enterococcus faecium multi-locus sequence type (ST) 203, an outbreak strain that has largely replaced a predecessor ST17 sequence type. Results: To establish a ST203 reference sequence we fully assembled and annotated the genome of Aus0085, a 2009 vancomycin-resistant Enterococcus faecium (VREfm) bloodstream isolate, and the first example of a completed ST203 genome. Aus0085 has a 3.2 Mb genome, comprising a 2.9 Mb circular chromosome and six circular plasmids (2 kb–130 kb). Twelve percent of the 3222 coding sequences (CDS) in Aus0085 are not present in ST17 E. faecium Aus0004 and ST18 E. faecium TX16. Extending this comparison to an additional 12 ST17 and 14 ST203 E. faecium hospital isolate genomes revealed only six genomic regions spanning 41 kb that were present in all ST203 and absent from all ST17 genomes. The 40 CDS have predicted functions that include ion transport, riboflavin metabolism and two phosphotransferase systems. Comparison of the vancomycin resistance-conferring Tn1549 transposon between Aus0004 and Aus0085 revealed differences in transposon length and insertion site, and van locus sequence variation that correlated with a higher vancomycin MIC in Aus0085. Additional phenotype comparisons between ST17 and ST203 isolates showed that while there were no differences in biofilm-formation and killing of Galleria mellonella, ST203 isolates grew significantly faster and out-competed ST17 isolates in growth assays. Conclusions: Here we have fully assembled and annotated the first ST203 genome, and then characterized the genomic differences between ST17 and ST203 E. faecium. We also show that ST203 E. faecium are faster growing and can out-compete ST17 E. faecium. While a causal genetic basis for these phenotype differences is not provided here, this study revealed conserved genetic differences between the two clones, differences that can now be tested to explain the molecular basis for the success and emergence of ST203 E. faecium. PMID:24004955
Gupta, A; Morby, A P; Turner, J S; Whitton, B A; Robinson, N J
1993-01-01
Genomic rearrangements involving amplification of metallothionein (MT) genes have been reported in metal-tolerant eukaryotes. Similarly, we have recently observed amplification and rearrangement of a prokaryotic MT locus, smt, in cells of Synechococcus PCC 6301 selected for Cd tolerance. Following the characterization of this locus, the altered smt region has now been isolated from a Cd-tolerant cell line, C3.2, and its nucleotide sequence determined. This has identified a deletion within smtB, which encodes a trans-acting repressor of smt transcription. Two identical palindromic octanucleotides (5'-GCGATCGC-3') traverse both borders of the excised element. This palindromic sequence is highly represented in the smt locus (7 occurrences in 1326 nucleotides) and analysis of the GenBank/EMBL/DDBJ DNA Nucleotide Sequence Data Libraries reveals that this is a highly iterated palindrome (HIP1) in other known sequences from Synechococcus strains (estimated to occur at an average frequency of once every c. 664 bp). HIP1 is also abundant in the genomes of other cyanobacteria. The functional significance of smtB deletion and the possible role of HIP1 in genome plasticity and adaptation in cyanobacteria are discussed.
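As a rough illustration of the HIP1 figures quoted in the abstract above, the following Python sketch checks that the eight-base HIP1 sequence GCGATCGC is its own reverse complement and converts an occurrence count into an average spacing. The function names are hypothetical and the toy sequence is invented for illustration; this is not the authors' analysis.

```python
# Minimal sketch: palindrome check and motif spacing for HIP1.
# All names here are illustrative assumptions, not from the paper.

HIP1 = "GCGATCGC"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_palindromic(seq: str) -> bool:
    """A DNA palindrome reads the same on both strands (5'->3')."""
    return seq.upper() == reverse_complement(seq)


def count_motif(seq: str, motif: str = HIP1) -> int:
    """Count occurrences of motif in seq, allowing overlapping matches."""
    seq, motif = seq.upper(), motif.upper()
    count = 0
    start = seq.find(motif)
    while start != -1:
        count += 1
        start = seq.find(motif, start + 1)
    return count


def mean_spacing(length_bp: int, occurrences: int) -> float:
    """Average number of base pairs per occurrence of a motif."""
    if occurrences <= 0:
        raise ValueError("occurrences must be positive")
    return length_bp / occurrences


if __name__ == "__main__":
    toy = "AAGCGATCGCTTGCGATCGC"  # invented toy sequence with two HIP1 sites
    print(is_palindromic(HIP1))   # True
    print(count_motif(toy))       # 2
    # 7 occurrences in 1326 nucleotides, as reported for the smt locus
    print(round(mean_spacing(1326, 7), 1))  # 189.4
```

On the reported numbers, HIP1 occurs about once every 189 bp in the smt locus, roughly 3.5 times denser than the c. 664 bp average quoted for other Synechococcus sequences.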
Lefèvre, Christophe M; Sharp, Julie A; Nicholas, Kevin R
2009-01-01
Using a milk-cell cDNA sequencing approach we characterised milk-protein sequences from two monotreme species, platypus (Ornithorhynchus anatinus) and echidna (Tachyglossus aculeatus) and found a full set of caseins and casein variants. The genomic organisation of the platypus casein locus is compared with other mammalian genomes, including the marsupial opossum and several eutherians. Physical linkage of casein genes has been seen in the casein loci of all mammalian genomes examined and we confirm that this is also observed in platypus. However, we show that a recent duplication of beta-casein occurred in the monotreme lineage, as opposed to more ancient duplications of alpha-casein in the eutherian lineage, while marsupials possess only single copies of alpha- and beta-caseins. Despite this variability, the close proximity of the main alpha- and beta-casein genes in an inverted tail-tail orientation and the relative orientation of the more distant kappa-casein genes are similar in all mammalian genome sequences so far available. Overall, the conservation of the genomic organisation of the caseins indicates the early, pre-monotreme development of the fundamental role of caseins during lactation. In contrast, the lineage-specific gene duplications that have occurred within the casein locus of monotremes and eutherians but not marsupials, which may have lost part of the ancestral casein locus, emphasises the independent selection on milk provision strategies to the young, most likely linked to different developmental strategies. The monotremes therefore provide insight into the ancestral drivers for lactation and how these have adapted in different lineages.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rorsman, F.; Bywater, M.; Knott, T.J.
The human platelet-derived growth factor (PDGF) A-chain locus was characterized by restriction endonuclease analysis, and the nucleotide sequence of its exons was determined. Seven exons were identified, spanning approximately 22 kilobase pairs of genomic DNA. Alternative exon usage, identified by cDNA cloning, occurs in a human glioblastoma cell line and may give rise to two types of A-chain precursors with different C termini. The exon-intron arrangement was similar to that of the PDGF B-chain/sis locus and seemed to divide the precursor proteins into functional domains. Southern blot analysis of genomic DNA showed that a single PDGF A-chain gene was present in the human genome.
Juneja, Punita; Ariani, Cristina V.; Ho, Yung Shwen; Akorli, Jewelna; Palmer, William J.; Pain, Arnab; Jiggins, Francis M.
2015-01-01
Many mosquito species are naturally polymorphic for their abilities to transmit parasites, a feature which is of great interest for controlling vector-borne disease. Aedes aegypti, the primary vector of dengue and yellow fever and a laboratory model for studying lymphatic filariasis, is genetically variable for its capacity to harbor the filarial nematode Brugia malayi. The genome of Ae. aegypti is large and repetitive, making genome resequencing difficult and expensive. We designed exome captures to target protein-coding regions of the genome, and used association mapping in a wild Kenyan population to identify a single, dominant, sex-linked locus underlying resistance. This falls in a region of the genome where a resistance locus was previously mapped in a line established in 1936, suggesting that this polymorphism has been maintained in the wild for at least 80 years. We then crossed resistant and susceptible mosquitoes to place both alleles of the gene into a common genetic background, and used RNA-seq to measure the effect of this locus on gene expression. We found evidence for Toll, IMD, and JAK-STAT pathway activity in response to early stages of B. malayi infection when the parasites are beginning to die in the resistant genotype. We also found that resistant mosquitoes express anti-microbial peptides at the time of parasite-killing, and that this expression is suppressed in susceptible mosquitoes. Together, we have found that a single resistance locus leads to a higher immune response in resistant mosquitoes, and we identify genes in this region that may be responsible for this trait. PMID:25815506
DNA and RNA editing of retrotransposons accelerate mammalian genome evolution.
Knisbacher, Binyamin A; Levanon, Erez Y
2015-04-01
Genome evolution is commonly viewed as a gradual process that is driven by random mutations that accumulate over time. However, DNA- and RNA-editing enzymes have been identified that can accelerate evolution by actively modifying the genomically encoded information. The apolipoprotein B mRNA editing enzymes, catalytic polypeptide-like (APOBECs) are potent restriction factors that can inhibit retroelements by cytosine-to-uridine editing of retroelement DNA after reverse transcription. In some cases, a retroelement may successfully integrate into the genome despite being hypermutated. Such events introduce unique sequences into the genome and are thus a source of genomic innovation. Adenosine deaminases that act on RNA (ADARs) catalyze adenosine-to-inosine editing in double-stranded RNA, commonly formed by oppositely oriented retroelements. The RNA editing confers plasticity to the transcriptome by generating many transcript variants from a single genomic locus. If the editing produces a beneficial variant, the genome may maintain the locus that produces the RNA-edited transcript for its novel function. Here, we discuss how these two powerful editing mechanisms, which both target inserted retroelements, facilitate expedited genome evolution. © 2015 New York Academy of Sciences.
Ferreira de Carvalho, J; Poulain, J; Da Silva, C; Wincker, P; Michon-Coudouel, S; Dheilly, A; Naquin, D; Boutte, J; Salmon, A; Ainouche, M
2013-01-01
Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations. PMID:23149455
Pandey, Manish K.; Upadhyaya, Hari D.; Rathore, Abhishek; Vadez, Vincent; Sheshshayee, M. S.; Sriswathi, Manda; Govil, Mansee; Kumar, Ashish; Gowda, M. V. C.; Sharma, Shivali; Hamidou, Falalou; Kumar, V. Anil; Khera, Pawan; Bhat, Ramesh S.; Khan, Aamir W.; Singh, Sube; Li, Hongjie; Monyo, Emmanuel; Nadaf, H. L.; Mukri, Ganapati; Jackson, Scott A.; Guo, Baozhu; Liang, Xuanqiang; Varshney, Rajeev K.
2014-01-01
Peanut is an important and nutritious agricultural commodity and a livelihood for many small-holder farmers in the semi-arid tropics (SAT) of the world, which are facing serious production threats. Integration of genomics tools with on-going genetic improvement approaches is expected to facilitate accelerated development of improved cultivars. Therefore, high-resolution genotyping and multiple season phenotyping data for 50 important agronomic, disease and quality traits were generated on the ‘reference set’ of peanut. This study reports comprehensive analyses of allelic diversity, population structure, linkage disequilibrium (LD) decay and marker-trait association (MTA) in peanut. Distinctness of all the genotypes can be established by using either a unique allele detected by a single SSR or a combination of unique alleles from two or more SSR markers. As expected, DArT features (2.0 alleles/locus, 0.125 PIC) showed lower allele frequency and polymorphic information content (PIC) than SSRs (22.21 alleles/locus, 0.715 PIC). Both marker types clearly differentiated the genotypes of diploids from tetraploids. Multi-allelic SSRs identified three sub-groups (K = 3) while the LD simulation trend line based on squared-allele frequency correlations (r²) predicted LD decay of 15–20 cM in the peanut genome. Detailed analysis identified a total of 524 highly significant MTAs (p-value >2.1×10⁻⁶) with a wide phenotypic variance (PV) range (5.81–90.09%) for 36 traits. These MTAs, after validation, may be deployed in improving biotic resistance, oil/seed/nutritional quality, drought tolerance related traits, and yield/yield components. PMID:25140620
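The polymorphic information content (PIC) values quoted in the abstract above can be made concrete with a small Python sketch. The abstract does not say which estimator was used; the sketch assumes the widely used Botstein et al. (1980) formula, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2, and the function name and example frequencies are hypothetical.

```python
# Minimal sketch of the standard PIC formula (an assumption; the abstract
# does not state the estimator). Names and inputs are illustrative only.
from itertools import combinations


def pic(allele_freqs):
    """Polymorphic information content for one locus.

    allele_freqs: iterable of allele frequencies that sum to 1.
    """
    freqs = list(allele_freqs)
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    # Correction term for matings that are uninformative despite heterozygosity
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross


if __name__ == "__main__":
    print(pic([0.5, 0.5]))   # 0.375, the maximum for a biallelic marker
    print(pic([1.0]))        # 0.0, a monomorphic locus carries no information
    print(pic([0.25] * 4) > pic([0.5, 0.5]))  # True: more alleles, higher PIC
```

Under this formula a biallelic marker cannot exceed a PIC of 0.375, which is consistent with the lower PIC reported for the two-allele DArT features than for the multi-allelic SSRs.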
Knudsen, Gitte M; Nielsen, Jesper Boye; Marvig, Rasmus L; Ng, Yin; Worning, Peder; Westh, Henrik; Gram, Lone
2017-08-01
Whole genome sequencing is increasingly used in epidemiology, e.g. for tracing outbreaks of food-borne diseases. This requires in-depth understanding of pathogen emergence, persistence and genomic diversity along the food production chain, including in food processing plants. We sequenced the genomes of 80 isolates of Listeria monocytogenes sampled from Danish food processing plants over a time-period of 20 years, and analysed the sequences together with 10 publicly available reference genomes to advance our understanding of interplant and intraplant genomic diversity of L. monocytogenes. Except for three persisting sequence types (STs) based on multi-locus sequence typing, namely ST7, ST8 and ST121, long-term persistence of clonal groups was limited, and new clones were introduced continuously, potentially from raw materials. No particular gene could be linked to the persistence phenotype. Using time-based phylogenetic analyses of the persistent STs, we estimate the L. monocytogenes evolutionary rate to be 0.18-0.35 single nucleotide polymorphisms/year, suggesting that the persistent STs emerged approximately 100 years ago, which correlates with the onset of industrialization and globalization of the food market. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
High-resolution genetic maps of Eucalyptus improve Eucalyptus grandis genome assembly.
Bartholomé, Jérôme; Mandrou, Eric; Mabiala, André; Jenkins, Jerry; Nabihoudine, Ibouniyamine; Klopp, Christophe; Schmutz, Jeremy; Plomion, Christophe; Gion, Jean-Marc
2015-06-01
Genetic maps are key tools in genetic research as they constitute the framework for many applications, such as quantitative trait locus analysis, and support the assembly of genome sequences. The resequencing of the two parents of a cross between Eucalyptus urophylla and Eucalyptus grandis was used to design a single nucleotide polymorphism (SNP) array of 6000 markers evenly distributed along the E. grandis genome. The genotyping of 1025 offspring enabled the construction of two high-resolution genetic maps containing 1832 and 1773 markers with an average marker interval of 0.45 and 0.5 cM for E. grandis and E. urophylla, respectively. The comparison between genetic maps and the reference genome highlighted 85% of collinear regions. A total of 43 noncollinear regions and 13 nonsyntenic regions were detected and corrected in the new genome assembly. This improved version contains 4943 scaffolds totalling 691.3 Mb of which 88.6% were captured by the 11 chromosomes. The mapping data were also used to investigate the effect of population size and number of markers on linkage mapping accuracy. This study provides the most reliable linkage maps for Eucalyptus and version 2.0 of the E. grandis genome. © 2014 CIRAD. New Phytologist © 2014 New Phytologist Trust.
The genetic structure of the A mating-type locus of Lentinula edodes.
Au, Chun Hang; Wong, Man Chun; Bao, Dapeng; Zhang, Meiyan; Song, Chunyan; Song, Wenhua; Law, Patrick Tik Wan; Kües, Ursula; Kwan, Hoi Shan
2014-02-10
The Shiitake mushroom, Lentinula edodes (Berk.) Pegler is a tetrapolar basidiomycete with two unlinked mating-type loci, commonly called the A and B loci. Identifying the mating-types in shiitake is important for enhancing the breeding and cultivation of this economically-important edible mushroom. Here, we identified the A mating-type locus from the first draft genome sequence of L. edodes and characterized multiple alleles from different monokaryotic strains. Two intron-length polymorphism markers were developed to facilitate rapid molecular determination of A mating-type. L. edodes sequences were compared with those of known tetrapolar and bipolar basidiomycete species. The A mating-type genes are conserved at the homeodomain region across the order Agaricales. However, we observed unique genomic organization of the locus in L. edodes which exhibits atypical gene order and multiple repetitive elements around its A locus. To our knowledge, this is the first known exception among Homobasidiomycetes, in which the mitochondrial intermediate peptidase (mip) gene is not closely linked to A locus. Copyright © 2013 Elsevier B.V. All rights reserved.
Functional and genetic analysis of haplotypic sequence variation at the nicastrin genomic locus
Hamilton, Gillian; Killick, Richard; Lambert, Jean-Charles; Amouyel, Philippe; Carrasquillo, Minerva M.; Pankratz, V. Shane; Graff-Radford, Neill R.; Dickson, Dennis W.; Petersen, Ronald C.; Younkin, Steven G.; Powell, John F.; Wade-Martins, Richard
2013-01-01
Nicastrin (NCSTN) is a component of the γ-secretase complex and therefore potentially a candidate risk gene for Alzheimer's disease. Here, we have developed a novel functional genomics methodology to express common locus haplotypes to assess functional differences. DNA recombination was used to engineer 5 bacterial artificial chromosomes (BACs) to each express a different haplotype of the NCSTN locus. Each NCSTN-BAC was delivered to knockout nicastrin (Ncstn−/−) cells and clonal NCSTN-BAC+/Ncstn−/− cell lines were created for functional analyses. We showed that all NCSTN-BAC haplotypes expressed nicastrin protein and rescued γ-secretase activity and amyloid beta (Aβ) production in NCSTN-BAC+/Ncstn−/− lines. We then showed that genetic variation at the NCSTN locus affected alternative splicing in human postmortem brain tissue. However, there was no robust functional difference between clonal cell lines rescued by each of the 5 different haplotypes. Finally, there was no statistically significant association of NCSTN with disease risk in the 4 cohorts. We therefore conclude that it is unlikely that common variation at the NCSTN locus is a risk factor for Alzheimer's disease. PMID:22405046
Locus-specific gene repositioning in prostate cancer
Leshner, Marc; Devine, Michelle; Roloff, Gregory W.; True, Lawrence D.; Misteli, Tom; Meaburn, Karen J.
2016-01-01
Genes occupy preferred spatial positions within interphase cell nuclei. However, positioning patterns are not an innate feature of a locus, and genes can alter their localization in response to physiological and pathological changes. Here we screen the radial positioning patterns of 40 genes in normal, hyperplasic, and malignant human prostate tissues. We find that the overall spatial organization of the genome in prostate tissue is largely conserved among individuals. We identify three genes whose nuclear positions are robustly altered in neoplastic prostate tissues. FLI1 and MMP9 position differently in prostate cancer than in normal tissue and prostate hyperplasia, whereas MMP2 is repositioned in both prostate cancer and hyperplasia. Our data point to locus-specific reorganization of the genome during prostate disease. PMID:26564800
Saunders, Edward J; Dadaev, Tokhir; Leongamornlert, Daniel A; Al Olama, Ali Amin; Benlloch, Sara; Giles, Graham G; Wiklund, Fredrik; Gronberg, Henrik; Haiman, Christopher A; Schleutker, Johanna; Nordestgaard, Borge G; Travis, Ruth C; Neal, David; Pasayan, Nora; Khaw, Kay-Tee; Stanford, Janet L; Blot, William J; Thibodeau, Stephen N; Maier, Christiane; Kibel, Adam S; Cybulski, Cezary; Cannon-Albright, Lisa; Brenner, Hermann; Park, Jong Y; Kaneva, Radka; Batra, Jyotsna; Teixeira, Manuel R; Pandha, Hardev; Govindasami, Koveela; Muir, Ken; Easton, Douglas F; Eeles, Rosalind A; Kote-Jarai, Zsofia
2016-04-12
Germline mutations within DNA-repair genes are implicated in susceptibility to multiple forms of cancer. For prostate cancer (PrCa), rare mutations in BRCA2 and BRCA1 give rise to moderately elevated risk, whereas two of the ~100 common, low-penetrance PrCa susceptibility variants identified so far by genome-wide association studies implicate RAD51B and RAD23B. Genotype data from the iCOGS array were imputed to the 1000 genomes phase 3 reference panel for 21 780 PrCa cases and 21 727 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. We subsequently performed single variant, gene and pathway-level analyses using 81 303 SNPs within 20 Kb of a panel of 179 DNA-repair genes. Single SNP analyses identified only the previously reported association with RAD51B. Gene-level analyses using the SKAT-C test from the SNP-set (Sequence) Kernel Association Test (SKAT) identified a significant association with PrCa for MSH5. Pathway-level analyses suggested a possible role for the translesion synthesis pathway in PrCa risk and Homologous recombination/Fanconi Anaemia pathway for PrCa aggressiveness, even though after adjustment for multiple testing these did not remain significant. MSH5 is a novel candidate gene warranting additional follow-up as a prospective PrCa-risk locus. MSH5 has previously been reported as a pleiotropic susceptibility locus for lung, colorectal and serous ovarian cancers.
Evidence for the sexual origin of heterokaryosis in arbuscular mycorrhizal fungi.
Ropars, Jeanne; Toro, Kinga Sędzielewska; Noel, Jessica; Pelin, Adrian; Charron, Philippe; Farinelli, Laurent; Marton, Timea; Krüger, Manuela; Fuchs, Jörg; Brachmann, Andreas; Corradi, Nicolas
2016-03-21
Sexual reproduction is ubiquitous among eukaryotes, and fully asexual lineages are extremely rare. Prominent among ancient asexual lineages are the arbuscular mycorrhizal fungi (AMF), a group of plant symbionts with a multinucleate cytoplasm. Genomic divergence among co-existing nuclei was proposed to drive the evolutionary success of AMF in the absence of sex(1), but this hypothesis has been contradicted by recent genome analyses that failed to find significant genetic diversity within an AMF isolate(2,3). Here, we set out to resolve issues surrounding the genome organization and sexual potential of AMF by exploring the genomes of five isolates of Rhizophagus irregularis, a model AMF. We find that genetic diversity in this species varies among isolates and is structured in a homo-dikaryon-like manner usually linked with the existence of a sexual life cycle. We also identify a putative AMF mating-type locus, containing two genes with structural and evolutionary similarities with the mating-type locus of some Dikarya. Our analyses suggest that this locus may be multi-allelic and that AMF could be heterothallic and bipolar. These findings reconcile opposing views on the genome organization of these ubiquitous plant symbionts and open avenues for strain improvement and environmental application of these organisms.
Genome Analysis of the Domestic Dog (Korean Jindo) by Massively Parallel Sequencing
Kim, Ryong Nam; Kim, Dae-Soo; Choi, Sang-Haeng; Yoon, Byoung-Ha; Kang, Aram; Nam, Seong-Hyeuk; Kim, Dong-Wook; Kim, Jong-Joo; Ha, Ji-Hong; Toyoda, Atsushi; Fujiyama, Asao; Kim, Aeri; Kim, Min-Young; Park, Kun-Hyang; Lee, Kang Seon; Park, Hong-Seog
2012-01-01
Although pioneering sequencing projects have shed light on the boxer and poodle genomes, a number of challenges need to be met before the sequencing and annotation of the dog genome can be considered complete. Here, we present the DNA sequence of the Jindo dog genome, sequenced to 45-fold average coverage using Illumina massively parallel sequencing technology. A comparison of the sequence to the reference boxer genome led to the identification of 4 675 437 single nucleotide polymorphisms (SNPs, including 3 346 058 novel SNPs), 71 642 indels and 8131 structural variations. Of these, 339 non-synonymous SNPs and 3 indels are located within coding sequences (CDS). In particular, 3 non-synonymous SNPs and a 26-bp deletion occur in the TCOF1 locus, implying that the difference observed in cranial facial morphology between Jindo and boxer dogs might be influenced by those variations. Through the annotation of the Jindo olfactory receptor gene family, we found 2 unique olfactory receptor genes and 236 olfactory receptor genes harbouring non-synonymous homozygous SNPs that are likely to affect smelling capability. In addition, we determined the DNA sequence of the Jindo dog mitochondrial genome and identified Jindo dog-specific mtDNA genotypes. This Jindo genome data upgrade our understanding of dog genomic architecture and will be a very valuable resource for investigating not only dog genetics and genomics but also human and dog disease genetics and comparative genomics. PMID:22474061
Picotti, Paola; Clement-Ziza, Mathieu; Lam, Henry; Campbell, David S.; Schmidt, Alexander; Deutsch, Eric W.; Röst, Hannes; Sun, Zhi; Rinner, Oliver; Reiter, Lukas; Shen, Qin; Michaelson, Jacob J.; Frei, Andreas; Alberti, Simon; Kusebauch, Ulrike; Wollscheid, Bernd; Moritz, Robert; Beyer, Andreas; Aebersold, Ruedi
2013-01-01
Complete reference maps or datasets, like the genomic map of an organism, are highly beneficial tools for biological and biomedical research. Attempts to generate such reference datasets for a proteome have so far failed to reach complete proteome coverage, with saturation apparent at approximately two thirds of the proteomes tested, even for the most thoroughly characterized proteomes. Here, we used a strategy based on high-throughput peptide synthesis and mass spectrometry to generate a close to complete reference map (97% of the genome-predicted proteins) of the S. cerevisiae proteome. We generated two versions of this mass spectrometric map, one supporting discovery-driven (shotgun) and the other hypothesis-driven (targeted) proteomic measurements. The two versions of the map, therefore, constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. The reference libraries can be browsed via a web-based repository and associated navigation tools. To demonstrate the utility of the reference libraries, we applied them to a protein quantitative trait locus (pQTL) analysis, which requires measurement of the same peptides over a large number of samples with high precision. Protein measurements over a set of 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, impacting on the levels of related proteins. Our results suggest that selective pressure favors the acquisition of sets of polymorphisms that maintain the stoichiometry of protein complexes and pathways. PMID:23334424
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zonana, J.; Gault, J.; Jones, M.
1993-01-01
X-linked hypohidrotic ectodermal dysplasia (EDA) has been localized to the Xq12-q13.1. A panel of genomic DNA samples from 80 unrelated males with EDA has been screened for deletions at seven genetic loci within the Xq12-13 region. A single individual was identified with a deletion at the DXS732 locus by hybridization with the mouse genomic probe pcos169E/4. This highly conserved DNA probe is from locus DXCrc169, which is tightly linked to the Ta locus, the putative mouse homologue of EDA. The proband had the classical phenotype of EDA, with no other phenotypic abnormalities, and a normal cytogenetic analysis. A human genomic DNA clone, homologous to pcos169E/4, was isolated from a human X-chromosome cosmid library. On hybridization with the cosmid, the proband was found to be only partially deleted at the DXS732 locus, with a unique junctional fragment identified in the proband and in three of his maternal relatives. This is the first determination of carrier status for EDA in females, by direct mutation analysis. Failure to detect deletion of the other loci tested in the proband suggests that the DXS732 locus is the closest known locus to the EDA gene. Since the DXS732 locus contains a highly conserved sequence, it must be considered to be a candidate locus for the EDA gene itself. 18 refs., 3 figs., 1 tab.
Marenholz, Ingo; Grosche, Sarah; Kalb, Birgit; Rüschendorf, Franz; Blümchen, Katharina; Schlags, Rupert; Harandi, Neda; Price, Mareike; Hansen, Gesine; Seidenberg, Jürgen; Röblitz, Holger; Yürek, Songül; Tschirner, Sebastian; Hong, Xiumei; Wang, Xiaobin; Homuth, Georg; Schmidt, Carsten O; Nöthen, Markus M; Hübner, Norbert; Niggemann, Bodo; Beyer, Kirsten; Lee, Young-Ae
2017-10-20
Genetic factors and mechanisms underlying food allergy are largely unknown. Due to heterogeneity of symptoms a reliable diagnosis is often difficult to make. Here, we report a genome-wide association study on food allergy diagnosed by oral food challenge in 497 cases and 2387 controls. We identify five loci at genome-wide significance, the clade B serpin (SERPINB) gene cluster at 18q21.3, the cytokine gene cluster at 5q31.1, the filaggrin gene, the C11orf30/LRRC32 locus, and the human leukocyte antigen (HLA) region. Stratifying the results for the causative food demonstrates that association of the HLA locus is peanut allergy-specific whereas the other four loci increase the risk for any food allergy. Variants in the SERPINB gene cluster are associated with SERPINB10 expression in leukocytes. Moreover, SERPINB genes are highly expressed in the esophagus. All identified loci are involved in immunological regulation or epithelial barrier function, emphasizing the role of both mechanisms in food allergy.
Measurement of locus copy number by hybridisation with amplifiable probes
Armour, John A. L.; Sismani, Carolina; Patsalis, Philippos C.; Cross, Gareth
2000-01-01
Despite its fundamental importance in genome analysis, it is only recently that systematic approaches have been developed to assess copy number at specific genetic loci, or to examine genomic DNA for submicroscopic deletions of unknown location. In this report we show that short probes can be recovered and amplified quantitatively following hybridisation to genomic DNA. This simple observation forms the basis of a new approach to determining locus copy number in complex genomes. The power and specificity of multiplex amplifiable probe hybridisation is demonstrated by the simultaneous assessment of copy number at a set of 40 human loci, including detection of deletions causing Duchenne muscular dystrophy and Prader–Willi/Angelman syndromes. Assembly of other probe sets will allow novel, technically simple approaches to a wide variety of genetic analyses, including the potential for extension to high resolution genome-wide screens for deletions and amplifications. PMID:10606661
Measurement of locus copy number by hybridisation with amplifiable probes.
Armour, J A; Sismani, C; Patsalis, P C; Cross, G
2000-01-15
Despite its fundamental importance in genome analysis, it is only recently that systematic approaches have been developed to assess copy number at specific genetic loci, or to examine genomic DNA for submicroscopic deletions of unknown location. In this report we show that short probes can be recovered and amplified quantitatively following hybridisation to genomic DNA. This simple observation forms the basis of a new approach to determining locus copy number in complex genomes. The power and specificity of multiplex amplifiable probe hybridisation is demonstrated by the simultaneous assessment of copy number at a set of 40 human loci, including detection of deletions causing Duchenne muscular dystrophy and Prader-Willi/Angelman syndromes. Assembly of other probe sets will allow novel, technically simple approaches to a wide variety of genetic analyses, including the potential for extension to high resolution genome-wide screens for deletions and amplifications.
Enhancer scanning to locate regulatory regions in genomic loci
Buckley, Melissa; Gjyshi, Anxhela; Mendoza-Fandiño, Gustavo; Baskin, Rebekah; Carvalho, Renato S.; Carvalho, Marcelo A.; Woods, Nicholas T.; Monteiro, Alvaro N.A.
2016-01-01
The present protocol provides a rapid, streamlined and scalable strategy to systematically scan genomic regions for the presence of transcriptional regulatory regions active in a specific cell type. It creates genomic tiles spanning a region of interest that are subsequently cloned by recombination into a luciferase reporter vector containing the Simian Virus 40 promoter. Tiling clones are transfected into specific cell types to test for the presence of transcriptional regulatory regions. The protocol includes testing of different SNP (single nucleotide polymorphism) alleles to determine their effect on regulatory activity. This procedure provides a systematic framework to identify candidate functional SNPs within a locus during functional analysis of genome-wide association studies. This protocol adapts and combines previous well-established molecular biology methods to provide a streamlined strategy, based on automated primer design and recombinational cloning to rapidly go from a genomic locus to a set of candidate functional SNPs in eight weeks. PMID:26658467
USDA-ARS's Scientific Manuscript database
The ARS Microbial Genome Sequence Database (http://199.133.98.43), a web-based database server, was established utilizing the BIGSdb (Bacterial Isolate Genomics Sequence Database) software package, developed at Oxford University, as a tool to manage multi-locus sequence data for the family Streptomy...
A Genome-Wide Association Study for Regulators of Micronucleus Formation in Mice.
McIntyre, Rebecca E; Nicod, Jérôme; Robles-Espinoza, Carla Daniela; Maciejowski, John; Cai, Na; Hill, Jennifer; Verstraten, Ruth; Iyer, Vivek; Rust, Alistair G; Balmus, Gabriel; Mott, Richard; Flint, Jonathan; Adams, David J
2016-08-09
In mammals the regulation of genomic instability plays a key role in tumor suppression and also controls genome plasticity, which is important for recombination during the processes of immunity and meiosis. Most studies to identify regulators of genomic instability have been performed in cells in culture or in systems that report on gross rearrangements of the genome, yet subtle differences in the level of genomic instability can contribute to whole organism phenotypes such as tumor predisposition. Here we performed a genome-wide association study in a population of 1379 outbred Crl:CFW(SW)-US_P08 mice to dissect the genetic landscape of micronucleus formation, a biomarker of chromosomal breaks, whole chromosome loss, and extranuclear DNA. Variation in micronucleus levels is a complex trait with a genome-wide heritability of 53.1%. We identify seven loci influencing micronucleus formation (false discovery rate <5%), and define candidate genes at each locus. Intriguingly at several loci we find evidence for sexual dimorphism in micronucleus formation, with a locus on chromosome 11 being specific to males. Copyright © 2016 McIntyre et al.
Gene amplification of the Hps locus in Glycine max
Gijzen, Mark; Kuflu, Kuflom; Moy, Pat
2006-01-01
Background Hydrophobic protein from soybean (HPS) is an 8 kD cysteine-rich polypeptide that causes asthma in persons allergic to soybean dust. HPS is synthesized in the pod endocarp and deposited on the seed surface during development. Past evidence suggests that the protein may mediate the adherence or dehiscence of endocarp tissues during maturation and affect the lustre, or glossiness of the seed surface. Results A comparison of soybean germplasm by genomic DNA blot hybridization shows that the copy number and structure of the Hps locus is polymorphic among soybean cultivars and related species. Changes in Hps gene copy number were also detected by comparative genomic DNA hybridization using cDNA microarrays. The Hps copy number polymorphisms co-segregated with seed lustre phenotype and HPS surface protein in a cross between dull- and shiny-seeded soybeans. In soybean cultivar Harosoy 63, a minimum of 27 ± 5 copies of the Hps gene were estimated to be present in each haploid genome. The isolation and analysis of genomic clones indicates that the core Hps locus is comprised of a tandem array of reiterated units, with each 8.6 kb unit containing a single HPS open reading frame. Conclusion This study shows that polymorphisms at the Hps locus arise from changes in the gene copy number via gene amplification. We present a model whereby Hps copy number modulates protein expression levels and seed lustre, and we suggest that gene amplification may result from selection pressures imposed on crop plants. PMID:16536872
Evolution and selection of Rhg1, a copy-number variant nematode-resistance locus
Lee, Tong Geon; Kumar, Indrajit; Diers, Brian W; Hudson, Matthew E
2015-01-01
The soybean cyst nematode (SCN) resistance locus Rhg1 is a tandem repeat of a 31.2 kb unit of the soybean genome. Each 31.2-kb unit contains four genes. One allele of Rhg1, Rhg1-b, is responsible for protecting most US soybean production from SCN. Whole-genome sequencing was performed, and PCR assays were developed to investigate allelic variation in sequence and copy number of the Rhg1 locus across a population of soybean germplasm accessions. Four distinct sequences of the 31.2-kb repeat unit were identified, and some Rhg1 alleles carry up to three different types of repeat unit. The total number of copies of the repeat varies from 1 to 10 per haploid genome. Both copy number and sequence of the repeat correlate with the resistance phenotype, and the Rhg1 locus shows strong signatures of selection. Significant linkage disequilibrium in the genome outside the boundaries of the repeat allowed the Rhg1 genotype to be inferred using high-density single nucleotide polymorphism genotyping of 15 996 accessions. Over 860 germplasm accessions were found likely to possess Rhg1 alleles. The regions surrounding the repeat show indications of non-neutral evolution and high genetic variability in populations from different geographic locations, but without evidence of fixation of the resistant genotype. A compelling explanation of these results is that balancing selection is in operation at Rhg1. PMID:25735447
Peter, Beate; Matsushita, Mark; Raskind, Wendy H
2012-10-01
The aim of this pilot study was to investigate a measure of motor sequencing deficit as a potential endophenotype of speech sound disorder (SSD) in a multigenerational family with evidence of familial SSD. In a multigenerational family with evidence of a familial motor-based SSD, affectation status and a measure of motor sequencing during oral motor testing were obtained. To further investigate the role of motor sequencing as an endophenotype for genetic studies, parametric and nonparametric linkage analyses were carried out using a genome-wide panel of 404 microsatellites. In seven of the 10 family members with available data, SSD affectation status and motor sequencing status coincided. Linkage analysis revealed four regions of interest, 6p21, 7q32, 7q36, and 8q24, primarily identified with the measure of motor sequencing ability. The 6p21 region overlaps with a locus implicated in rapid alternating naming in a recent genome-wide dyslexia linkage study. The 7q32 locus contains a locus implicated in dyslexia. The 7q36 locus borders on a gene known to affect the component traits of language impairment. The results are consistent with a motor-based endophenotype of SSD that would be informative for genetic studies. The linkage results in this first genome-wide study in a multigenerational family with SSD warrant follow-up in additional families and with fine mapping or next-generation approaches to gene identification.
Peter, Beate; Matsushita, Mark; Raskind, Wendy H.
2012-01-01
Objectives The purpose of this pilot study was to investigate a measure of motor sequencing deficit as a potential endophenotype of speech sound disorder (SSD) in a multigenerational family with evidence of familial SSD. Methods In a multigenerational family with evidence of a familial motor-based SSD, affectation status and a measure of motor sequencing during oral motor testing were obtained. To further investigate the role of motor sequencing as an endophenotype for genetic studies, parametric and nonparametric linkage analyses were conducted using a genome-wide panel of 404 microsatellites. Results In seven of the ten family members with available data, SSD affectation status and motor sequencing status coincided. Linkage analysis revealed four regions of interest, 6p21, 7q32, 7q36, and 8q24, primarily identified with the measure of motor sequencing ability. The 6p21 region overlaps with a locus implicated in rapid alternating naming in a recent genome-wide dyslexia linkage study. The 7q32 locus contains a locus implicated in dyslexia. The 7q36 locus borders on a gene known to affect component traits of language impairment. Conclusions Results are consistent with a motor-based endophenotype of SSD that would be informative for genetic studies. The linkage results in this first genome-wide study in a multigenerational family with SSD warrant follow-up in additional families and with fine mapping or next-generation approaches to gene identification. PMID:22517379
Identification of the genomic locus for the human Rieske Fe-S Protein gene on Chromosome 19q12
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pennacchio, L.A.
1994-05-06
We have identified the chromosomal location of the human Rieske Iron-Sulfur Protein (UQCRFS1) gene. Mapping by hybridization to a panel of monochromosomal hybrid cell lines indicated that the gene was either on chromosome 19 or 22. By screening a human chromosome 19 specific genomic cosmid library with an oligonucleotide probe made from the published Rieske cDNA sequence, we identified a corresponding cosmid. Portions of this cosmid were sequenced directly. The exon, exon:intron junction, and flanking sequences verified that this cosmid contains the genomic locus. Fluorescent in situ hybridization (FISH) was performed to localize this cosmid to chromosome band 19q12.
Germier, Thomas; Sylvain, Audibert; Silvia, Kocanova; David, Lane; Kerstin, Bystricky
2018-06-01
Spatio-temporal organization of the cell nucleus adapts to and regulates genomic processes. Microscopy approaches that enable direct monitoring of specific chromatin sites in single cells and in real time are needed to better understand the dynamics involved. In this chapter, we describe the principle and development of ANCHOR, a novel tool for DNA labelling in eukaryotic cells. Protocols for use of ANCHOR to visualize a single genomic locus in eukaryotic cells are presented. We describe an approach for live cell imaging of a DNA locus during the entire cell cycle in human breast cancer cells. Copyright © 2018 Elsevier Inc. All rights reserved.
Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering.
Guo, Xuan; Meng, Yu; Yu, Ning; Pan, Yi
2014-04-10
Taking advantage of high-throughput single nucleotide polymorphism (SNP) genotyping technology, large genome-wide association studies (GWASs) have been considered to hold promise for unravelling complex relationships between genotype and phenotype. At present, traditional single-locus-based methods are insufficient to detect interactions involving multiple loci, which are common in complex traits. In addition, statistical tests for high-order epistatic interactions involving more than 2 SNPs pose computational and analytical challenges because the computation increases exponentially as the cardinality of SNP combinations gets larger. In this paper, we provide a simple, fast and powerful method using dynamic clustering and cloud computing to detect genome-wide multi-locus epistatic interactions. We have constructed systematic experiments to compare its power against some recently proposed algorithms, including TEAM, SNPRuler, EDCF and BOOST. Furthermore, we have applied our method to two real GWAS datasets, the age-related macular degeneration (AMD) and rheumatoid arthritis (RA) datasets, where we find some novel potential disease-related genetic factors that do not show up in detections of 2-locus epistatic interactions. Experimental results on simulated data demonstrate that our method is more powerful than some recently proposed methods on both two- and three-locus disease models. Our method has discovered many novel high-order associations that are significantly enriched in cases from two real GWAS datasets. Moreover, the running times of the cloud implementation of our method on the AMD and RA datasets are roughly 2 hours and 50 hours, respectively, on a cluster of forty small virtual machines for detecting two-locus interactions. Therefore, we believe that our method is suitable and effective for the full-scale analysis of multiple-locus epistatic interactions in GWAS.
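The computational challenge described in this abstract can be illustrated with a short sketch of how the number of candidate SNP sets grows with the interaction order. This is not the authors' dynamic-clustering method; it only counts unordered SNP combinations, and the panel size used in the demonstration is a hypothetical value chosen for illustration.

```python
from math import comb


def n_snp_combinations(n_snps: int, order: int) -> int:
    """Count the distinct unordered SNP sets of size `order` in a panel of `n_snps`."""
    return comb(n_snps, order)


if __name__ == "__main__":
    # Hypothetical panel size for illustration; not taken from the abstract.
    panel_size = 100_000
    for order in (1, 2, 3):
        total = n_snp_combinations(panel_size, order)
        print(f"order {order}: {total:,} candidate SNP sets")
```

An exhaustive scan has to evaluate every one of these sets, which is why each step up in interaction order multiplies the work by roughly the panel size.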
Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering
2014-01-01
Background Taking advantage of high-throughput single nucleotide polymorphism (SNP) genotyping technology, large genome-wide association studies (GWASs) have been considered to hold promise for unravelling complex relationships between genotype and phenotype. At present, traditional single-locus-based methods are insufficient to detect interactions involving multiple loci, which are common in complex traits. In addition, statistical tests for high-order epistatic interactions involving more than 2 SNPs pose computational and analytical challenges because the computation increases exponentially as the cardinality of SNP combinations gets larger. Results In this paper, we provide a simple, fast and powerful method using dynamic clustering and cloud computing to detect genome-wide multi-locus epistatic interactions. We have constructed systematic experiments to compare its power against some recently proposed algorithms, including TEAM, SNPRuler, EDCF and BOOST. Furthermore, we have applied our method to two real GWAS datasets, the age-related macular degeneration (AMD) and rheumatoid arthritis (RA) datasets, where we find some novel potential disease-related genetic factors that do not show up in detections of 2-locus epistatic interactions. Conclusions Experimental results on simulated data demonstrate that our method is more powerful than some recently proposed methods on both two- and three-locus disease models. Our method has discovered many novel high-order associations that are significantly enriched in cases from two real GWAS datasets. Moreover, the running times of the cloud implementation of our method on the AMD and RA datasets are roughly 2 hours and 50 hours, respectively, on a cluster of forty small virtual machines for detecting two-locus interactions. Therefore, we believe that our method is suitable and effective for the full-scale analysis of multiple-locus epistatic interactions in GWAS. PMID:24717145
Liu, Guangjin; Zhang, Wei; Lu, Chengping
2013-11-11
Streptococcus agalactiae, also referred to as Group B Streptococcus (GBS), is a frequent resident of the rectovaginal tract in humans, and a major cause of neonatal infection. In addition, S. agalactiae is a known fish pathogen, which compromises food safety and represents a zoonotic hazard. The complete genome sequence of the piscine S. agalactiae isolate GD201008-001 was compared with 14 other piscine, human and bovine strains to explore their virulence determinants, evolutionary relationships and the genetic basis of host tropism in S. agalactiae. The pan-genome of S. agalactiae is open and its size increases with the addition of newly sequenced genomes. The core genes shared by all isolates account for 50–70% of any single genome. The Chinese piscine isolates GD201008-001 and ZQ0910 are phylogenetically distinct from the Latin American piscine isolates SA20-06 and STIR-CD-17, but are closely related to the human strain A909, in the context of the clustered regularly interspaced short palindromic repeats (CRISPRs), prophage, virulence-associated genes and phylogenetic relationships. We identified a unique 10 kb gene locus in Chinese piscine strains. Isolates from cultured tilapia in China have a close genomic relationship with the human strain A909. Our findings provide insight into the pathogenesis and host-associated genome content of piscine S. agalactiae isolated in China.
Hybrid Sterility Locus on Chromosome X Controls Meiotic Recombination Rate in Mouse
Balcova, Maria; Faltusova, Barbora; Gergelits, Vaclav; Bhattacharyya, Tanmoy; Mihola, Ondrej; Trachtulec, Zdenek; Knopf, Corinna; Fotopulosova, Vladana; Chvatalova, Irena; Gregorova, Sona; Forejt, Jiri
2016-01-01
Meiotic recombination safeguards proper segregation of homologous chromosomes into gametes, affects genetic variation within species, and contributes to meiotic chromosome recognition, pairing and synapsis. The Prdm9 gene has a dual role, it controls meiotic recombination by determining the genomic position of crossover hotspots and, in infertile hybrids of house mouse subspecies Mus m. musculus (Mmm) and Mus m. domesticus (Mmd), it further functions as the major hybrid sterility gene. In the latter role Prdm9 interacts with the hybrid sterility X 2 (Hstx2) genomic locus on Chromosome X (Chr X) by a still unknown mechanism. Here we investigated the meiotic recombination rate at the genome-wide level and its possible relation to hybrid sterility. Using immunofluorescence microscopy we quantified the foci of MLH1 DNA mismatch repair protein, the cytological counterparts of reciprocal crossovers, in a panel of inter-subspecific chromosome substitution strains. Two autosomes, Chr 7 and Chr 11, significantly modified the meiotic recombination rate, yet the strongest modifier, designated meiotic recombination 1, Meir1, emerged in the 4.7 Mb Hstx2 genomic locus on Chr X. The male-limited transgressive effect of Meir1 on recombination rate parallels the male-limited transgressive role of Hstx2 in hybrid male sterility. Thus, both genetic factors, the Prdm9 gene and the Hstx2/Meir1 genomic locus, indicate a link between meiotic recombination and hybrid sterility. A strong female-specific modifier of meiotic recombination rate with the effect opposite to Meir1 was localized on Chr X, distally to Meir1. Mapping Meir1 to a narrow candidate interval on Chr X is an important first step towards positional cloning of the respective gene(s) responsible for variation in the global recombination rate between closely related mouse subspecies. PMID:27104744
Hybrid Sterility Locus on Chromosome X Controls Meiotic Recombination Rate in Mouse.
Balcova, Maria; Faltusova, Barbora; Gergelits, Vaclav; Bhattacharyya, Tanmoy; Mihola, Ondrej; Trachtulec, Zdenek; Knopf, Corinna; Fotopulosova, Vladana; Chvatalova, Irena; Gregorova, Sona; Forejt, Jiri
2016-04-01
Meiotic recombination safeguards proper segregation of homologous chromosomes into gametes, affects genetic variation within species, and contributes to meiotic chromosome recognition, pairing and synapsis. The Prdm9 gene has a dual role, it controls meiotic recombination by determining the genomic position of crossover hotspots and, in infertile hybrids of house mouse subspecies Mus m. musculus (Mmm) and Mus m. domesticus (Mmd), it further functions as the major hybrid sterility gene. In the latter role Prdm9 interacts with the hybrid sterility X 2 (Hstx2) genomic locus on Chromosome X (Chr X) by a still unknown mechanism. Here we investigated the meiotic recombination rate at the genome-wide level and its possible relation to hybrid sterility. Using immunofluorescence microscopy we quantified the foci of MLH1 DNA mismatch repair protein, the cytological counterparts of reciprocal crossovers, in a panel of inter-subspecific chromosome substitution strains. Two autosomes, Chr 7 and Chr 11, significantly modified the meiotic recombination rate, yet the strongest modifier, designated meiotic recombination 1, Meir1, emerged in the 4.7 Mb Hstx2 genomic locus on Chr X. The male-limited transgressive effect of Meir1 on recombination rate parallels the male-limited transgressive role of Hstx2 in hybrid male sterility. Thus, both genetic factors, the Prdm9 gene and the Hstx2/Meir1 genomic locus, indicate a link between meiotic recombination and hybrid sterility. A strong female-specific modifier of meiotic recombination rate with the effect opposite to Meir1 was localized on Chr X, distally to Meir1. Mapping Meir1 to a narrow candidate interval on Chr X is an important first step towards positional cloning of the respective gene(s) responsible for variation in the global recombination rate between closely related mouse subspecies.
LOLAweb: a containerized web server for interactive genomic locus overlap enrichment analysis.
Nagraj, V P; Magee, Neal E; Sheffield, Nathan C
2018-06-06
The past few years have seen an explosion of interest in understanding the role of regulatory DNA. This interest has driven large-scale production of functional genomics data and analytical methods. One popular analysis is to test for enrichment of overlaps between a query set of genomic regions and a database of region sets. In this way, new genomic data can be easily connected to annotations from external data sources. Here, we present an interactive interface for enrichment analysis of genomic locus overlaps using a web server called LOLAweb. LOLAweb accepts a set of genomic ranges from the user and tests it for enrichment against a database of region sets. LOLAweb renders results in an R Shiny application to provide interactive visualization features, enabling users to filter, sort, and explore enrichment results dynamically. LOLAweb is built and deployed in a Linux container, making it scalable to many concurrent users on our servers and also enabling users to download and run LOLAweb locally.
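The overlap enrichment test described in this abstract can be sketched as follows. The abstract does not name the statistic, so this sketch assumes a one-sided Fisher's exact test computed from the hypergeometric distribution. The function names, the half-open interval convention, and the requirement that the query be a subset of a user-supplied universe are all illustrative assumptions, not LOLAweb's actual interface.

```python
from math import comb
from typing import Sequence, Tuple

# (chromosome, start, end), treated as a half-open interval [start, end)
Region = Tuple[str, int, int]


def overlaps(a: Region, b: Region) -> bool:
    """True if two regions lie on the same chromosome and share at least one base."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def hits(region: Region, region_set: Sequence[Region]) -> bool:
    """True if `region` overlaps any member of `region_set`."""
    return any(overlaps(region, other) for other in region_set)


def enrichment_p(query: Sequence[Region],
                 universe: Sequence[Region],
                 db_set: Sequence[Region]) -> float:
    """One-sided hypergeometric p-value for over-representation of overlaps.

    Assumes every query region is also a member of the universe.
    """
    n_universe = len(universe)
    n_query = len(query)
    k_universe = sum(hits(r, db_set) for r in universe)  # overlapping regions in universe
    k_query = sum(hits(r, db_set) for r in query)        # overlapping regions in query
    upper = min(n_query, k_universe)
    tail = sum(
        comb(k_universe, i) * comb(n_universe - k_universe, n_query - i)
        for i in range(k_query, upper + 1)
    )
    return tail / comb(n_universe, n_query)
```

A small p-value indicates that the query regions overlap the database region set more often than a random draw of the same size from the universe would. In practice this test would be repeated for every region set in the database, with results ranked across sets.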
A local duplication of the Melanocortin receptor 1 locus in Astyanax
Gross, Joshua B.; Weagley, James; Stahl, Bethany A.; Ma, Li; Espinasa, Luis; McGaugh, Suzanne E.
2017-01-01
In this study, we report evidence of a novel duplication of Melanocortin receptor 1 (Mc1r) in the cavefish genome. This locus was discovered following the observation of excessive allelic diversity in a ~820 bp fragment of Mc1r amplified via degenerate PCR from a natural population of Astyanax aeneus fish from Guerrero, Mexico. The cavefish genome reveals the presence of two closely related Mc1r open reading frames separated by a 1.46 kb intergenic region. One open reading frame corresponds to the previously reported Mc1r receptor, and the other open reading frame (duplicate copy) is 975 bp in length, encoding a receptor of 325 amino acids. Sequence similarity analyses position both copies in the syntenic region of the single Mc1r locus in 16 representative craniate genomes spanning bony fish (including Astyanax) to mammals, suggesting we discovered tandem duplicates of this important gene. The two Mc1r copies share ~89% sequence similarity, and, within Astyanax, are more similar to one another compared to other melanocortin family members. Future studies will inform the precise functional significance of the duplicated Mc1r locus, and if this novel copy number variant may have adaptive significance for the Astyanax lineage. PMID:28738163
Dilthey, Alexander T; Gourraud, Pierre-Antoine; Mentzer, Alexander J; Cereb, Nezih; Iqbal, Zamin; McVean, Gil
2016-10-01
Genetic variation at the Human Leucocyte Antigen (HLA) genes is associated with many autoimmune and infectious disease phenotypes, is an important element of the immunological distinction between self and non-self, and shapes immune epitope repertoires. Determining the allelic state of the HLA genes (HLA typing) as a by-product of standard whole-genome sequencing data would therefore be highly desirable and enable the immunogenetic characterization of samples in currently ongoing population sequencing projects. Extensive hyperpolymorphism and sequence similarity between the HLA genes, however, pose problems for accurate read mapping and make HLA type inference from whole-genome sequencing data a challenging problem. We describe how to address these challenges in a Population Reference Graph (PRG) framework. First, we construct a PRG for 46 (mostly HLA) genes and pseudogenes, their genomic context and their characterized sequence variants, integrating a database of over 10,000 known allele sequences. Second, we present a sequence-to-PRG paired-end read mapping algorithm that enables accurate read mapping for the HLA genes. Third, we infer the most likely pair of underlying alleles at G group resolution from the IMGT/HLA database at each locus, employing a simple likelihood framework. We show that HLA*PRG, our algorithm, outperforms existing methods by a wide margin. We evaluate HLA*PRG on six classical class I and class II HLA genes (HLA-A, -B, -C, -DQA1, -DQB1, -DRB1) and on a set of 14 samples (3 samples with 2 x 100bp, 11 samples with 2 x 250bp Illumina HiSeq data). Of 158 alleles tested, we correctly infer 157 alleles (99.4%). We also identify and re-type two erroneous alleles in the original validation data. 
We conclude that HLA*PRG for the first time achieves accuracies comparable to gold-standard reference methods from standard whole-genome sequencing data, though high computational demands (currently ~30-250 CPU hours per sample) remain a significant challenge to practical application.
High-Accuracy HLA Type Inference from Whole-Genome Sequencing Data Using Population Reference Graphs
Dilthey, Alexander T.; Gourraud, Pierre-Antoine; McVean, Gil
2016-01-01
Genetic variation at the Human Leucocyte Antigen (HLA) genes is associated with many autoimmune and infectious disease phenotypes, is an important element of the immunological distinction between self and non-self, and shapes immune epitope repertoires. Determining the allelic state of the HLA genes (HLA typing) as a by-product of standard whole-genome sequencing data would therefore be highly desirable and enable the immunogenetic characterization of samples in currently ongoing population sequencing projects. Extensive hyperpolymorphism and sequence similarity between the HLA genes, however, pose problems for accurate read mapping and make HLA type inference from whole-genome sequencing data a challenging problem. We describe how to address these challenges in a Population Reference Graph (PRG) framework. First, we construct a PRG for 46 (mostly HLA) genes and pseudogenes, their genomic context and their characterized sequence variants, integrating a database of over 10,000 known allele sequences. Second, we present a sequence-to-PRG paired-end read mapping algorithm that enables accurate read mapping for the HLA genes. Third, we infer the most likely pair of underlying alleles at G group resolution from the IMGT/HLA database at each locus, employing a simple likelihood framework. We show that HLA*PRG, our algorithm, outperforms existing methods by a wide margin. We evaluate HLA*PRG on six classical class I and class II HLA genes (HLA-A, -B, -C, -DQA1, -DQB1, -DRB1) and on a set of 14 samples (3 samples with 2 x 100bp, 11 samples with 2 x 250bp Illumina HiSeq data). Of 158 alleles tested, we correctly infer 157 alleles (99.4%). We also identify and re-type two erroneous alleles in the original validation data. 
We conclude that HLA*PRG for the first time achieves accuracies comparable to gold-standard reference methods from standard whole-genome sequencing data, though high computational demands (currently ~30–250 CPU hours per sample) remain a significant challenge to practical application. PMID:27792722
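To illustrate the kind of "simple likelihood framework" the abstract above mentions, the following Python sketch picks the most likely pair of alleles at a single locus. It is not the HLA*PRG implementation: the function name, the equal-weight two-chromosome mixture model, the allele labels, and the toy per-read likelihoods are all assumptions made for illustration.

```python
import math
from itertools import combinations_with_replacement


def best_allele_pair(read_likelihoods):
    """Return the allele pair maximizing a simple diploid log-likelihood.

    read_likelihoods maps an allele name to a list of P(read | allele),
    one entry per read (hypothetical input; all lists have equal length
    and no entry is zero). Each read is assumed to originate from either
    chromosome with probability 0.5.
    """
    alleles = sorted(read_likelihoods)
    n_reads = len(read_likelihoods[alleles[0]])
    best_pair, best_ll = None, -math.inf
    # Enumerate unordered pairs, including homozygous genotypes.
    for a1, a2 in combinations_with_replacement(alleles, 2):
        ll = 0.0
        for i in range(n_reads):
            p = 0.5 * read_likelihoods[a1][i] + 0.5 * read_likelihoods[a2][i]
            ll += math.log(p)
        if ll > best_ll:
            best_pair, best_ll = (a1, a2), ll
    return best_pair, best_ll


# Toy data: four reads, three candidate alleles (made-up numbers).
toy = {
    "A*01": [0.90, 0.80, 0.01, 0.02],
    "A*02": [0.01, 0.02, 0.90, 0.85],
    "A*03": [0.10, 0.10, 0.10, 0.10],
}
pair, loglik = best_allele_pair(toy)
```

In this toy case two reads fit one allele and two fit another, so the heterozygous pair scores higher than any homozygous genotype. A real caller would also need the read-to-graph mapping step that the abstract describes, which this sketch leaves out entirely.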
Vogl, Thomas; Gebbie, Leigh; Palfreyman, Robin W; Speight, Robert
2018-03-15
Pichia pastoris (syn. Komagataella phaffii) is one of the most common eukaryotic expression systems for heterologous protein production. Expression cassettes are typically integrated in the genome to obtain stable expression strains. In contrast to Saccharomyces cerevisiae, where short overhangs are sufficient to target highly specific integration, long overhangs are more efficient in P. pastoris and ectopic integration of foreign DNA can occur. Here, we aimed to elucidate the influence of ectopic integration by high-throughput screening of >700 transformants and whole-genome sequencing of 27 transformants. Different vector designs and linearization approaches were used to mimic the most common integration events targeted in P. pastoris. Fluorescence of an enhanced green fluorescent protein (eGFP) reporter protein was highly uniform among transformants when the expression cassettes were correctly integrated in the targeted locus. Surprisingly, most nonspecifically integrated transformants showed highly uniform expression that was comparable to specific integration, suggesting that nonspecific integration does not necessarily influence expression. However, a few clones (<10%) harboring ectopically integrated cassettes showed a greater variation spanning a 25-fold range, surpassing specifically integrated reference strains up to 6-fold. High-expression strains showed a correlation between increased gene copy numbers and high reporter protein fluorescence levels. Our results suggest that for comparing expression levels between strains, the integration locus can be neglected as long as a sufficient number of transformed strains are compared. For expression optimization of highly expressible proteins, increasing copy number appears to be the dominant positive influence rather than the integration locus, genomic rearrangements, deletions, or single-nucleotide polymorphisms (SNPs).
IMPORTANCE Yeasts are commonly used as biotechnological production hosts for proteins and metabolites. In the yeast Saccharomyces cerevisiae, expression cassettes carrying foreign genes integrate highly specifically at the targeted sites in the genome. In contrast, cassettes often integrate at random genomic positions in nonconventional yeasts, such as Pichia pastoris (syn. Komagataella phaffii). Hence, cells from the same transformation event often behave differently, with significant clonal variation necessitating the screening of large numbers of strains. The importance of this study is that we systematically investigated the influence of integration events in more than 700 strains. Our findings provide novel insight into clonal variation in P. pastoris and, thus, how to avoid pitfalls and obtain reliable results. The underlying mechanisms may also play a role in other yeasts and hence could be generally relevant for recombinant yeast protein production strains. Copyright © 2018 American Society for Microbiology.
Exploring new alleles for frost tolerance in winter rye.
Erath, Wiltrud; Bauer, Eva; Fowler, D Brian; Gordillo, Andres; Korzun, Viktor; Ponomareva, Mira; Schmidt, Malthe; Schmiedchen, Brigitta; Wilde, Peer; Schön, Chris-Carolin
2017-10-01
Rye genetic resources provide a valuable source of new alleles for the improvement of frost tolerance in rye breeding programs. Frost tolerance is a must-have trait for winter cereal production in northern and continental cropping areas. Genetic resources should harbor promising alleles for the improvement of frost tolerance of winter rye elite lines. For frost tolerance breeding, the identification of quantitative trait loci (QTL) and the choice of optimum genome-based selection methods are essential. We identified genomic regions involved in frost tolerance of winter rye by QTL mapping in a biparental population derived from a highly frost tolerant selection from the Canadian cultivar Puma and the European elite line Lo157. Lines per se and their testcrosses were phenotyped in a controlled freeze test and in multi-location field trials in Russia and Canada. Three QTL on chromosomes 4R, 5R, and 7R were consistently detected across environments. The QTL on 5R is congruent with the genomic region harboring the Frost resistance locus 2 (Fr-2) in Triticeae. The Puma allele at the Fr-R2 locus was found to significantly increase frost tolerance. A comparison of the predictive ability of the QTL-based model with that of different whole-genome prediction models revealed that, in addition to a few large QTL effects, small QTL effects also contribute to the genomic variance of frost tolerance in rye. Genomic prediction models assigning a high weight to the Fr-R2 locus allow increasing the selection intensity for frost tolerance by genome-based pre-selection of promising candidates.
Draye, Xavier; Lin, Yann-Rong; Qian, Xiao-yin; Bowers, John E.; Burow, Gloria B.; Morrell, Peter L.; Peterson, Daniel G.; Presting, Gernot G.; Ren, Shu-xin; Wing, Rod A.; Paterson, Andrew H.
2001-01-01
The small genome of sorghum (Sorghum bicolor L. Moench.) provides an important template for study of closely related large-genome crops such as maize (Zea mays) and sugarcane (Saccharum spp.), and is a logical complement to distantly related rice (Oryza sativa) as a “grass genome model.” Using a high-density RFLP map as a framework, a robust physical map of sorghum is being assembled by integrating hybridization and fingerprint data with comparative data from related taxa such as rice and using new methods to resolve genomic duplications into locus-specific groups. By taking advantage of allelic variation revealed by heterologous probes, the positions of corresponding loci on the wheat (Triticum aestivum), rice, maize, sugarcane, and Arabidopsis genomes are being interpolated on the sorghum physical map. Bacterial artificial chromosomes for the small genome of rice are shown to close several gaps in the sorghum contigs; the emerging rice physical map and assembled sequence will further accelerate progress. An important motivation for developing genomic tools is to relate molecular level variation to phenotypic diversity. “Diversity maps,” which depict the levels and patterns of variation in different gene pools, shed light on relationships of allelic diversity with chromosome organization, and suggest possible locations of genomic regions that are under selection due to major gene effects (some of which may be revealed by quantitative trait locus mapping). Both physical maps and diversity maps suggest interesting features that may be integrally related to the chromosomal context of DNA—progress in cytology promises to provide a means to elucidate such relationships. 
We seek to provide a detailed picture of the structure, function, and evolution of the genome of sorghum and its relatives, together with molecular tools such as locus-specific sequence-tagged site DNA markers and bacterial artificial chromosome contigs that will have enduring value for many aspects of genome analysis. PMID:11244113
Delta: a new web-based 3D genome visualization and analysis platform.
Tang, Bixia; Li, Feifei; Li, Jing; Zhao, Wenming; Zhang, Zhihua
2018-04-15
Delta is an integrative visualization and analysis platform to facilitate visually annotating and exploring the 3D physical architecture of genomes. Delta takes a Hi-C or ChIA-PET contact matrix as input and predicts the topologically associating domains and chromatin loops in the genome. It then generates a physical 3D model which represents the plausible consensus 3D structure of the genome. Delta features a highly interactive visualization tool which enhances the integration of genome topology/physical structure with extensive genome annotation by juxtaposing the 3D model with diverse genomic assay outputs. Finally, by visually comparing the 3D model of the β-globin gene locus and its annotation, we hypothesized a plausible transitory interaction pattern in the locus. A literature survey found experimental evidence supporting this hypothesis. This served as an example of intuitive hypothesis testing with the help of Delta. Delta is freely accessible from http://delta.big.ac.cn, and the source code is available at https://github.com/zhangzhwlab/delta. Contact: zhangzhihua@big.ac.cn. Supplementary data are available at Bioinformatics online.
Zuriaga, Elena; Molina, Laura; Badenes, María Luisa; Romero, Carlos
2012-06-01
S-locus products (S-RNase and F-box proteins) are essential for the gametophytic self-incompatibility (GSI) specific recognition in Prunus. However, accumulated genetic evidence suggests that other S-locus unlinked factors are also required for GSI. For instance, GSI breakdown was associated with a pollen-part mutation unlinked to the S-locus in the apricot (Prunus armeniaca L.) cv. 'Canino'. Fine-mapping of this mutated modifier gene (M-locus) and the synteny analysis of the M-locus within the Rosaceae are reported here. A segregation distortion loci mapping strategy, based on a selectively genotyped population, was used to map the M-locus. In addition, a bacterial artificial chromosome (BAC) contig was constructed for this region using overlapping oligonucleotide probes, and BAC-end sequences (BES) were blasted against Rosaceae genomes to perform micro-synteny analysis. The M-locus was mapped to the distal part of chr.3 flanked by two SSR markers within an interval of 1.8 cM corresponding to ~364 Kb in the peach (Prunus persica L. Batsch) genome. In the integrated genetic-physical map of this region, BES were mapped against the peach scaffold_3 and BACs were anchored to the apricot map. Micro-syntenic blocks were detected in apple (Malus × domestica Borkh.) LG17/9 and strawberry (Fragaria vesca L.) FG6 chromosomes. The M-locus fine-scale mapping provides a solid basis for self-compatibility marker-assisted selection and for positional cloning of the underlying gene, a necessary goal to elucidate the pollen rejection mechanism in Prunus. In a wider context, the syntenic regions identified in peach, apple and strawberry might be useful to interpret GSI evolution in Rosaceae.
Comparative population genetics of a mimicry locus among hybridizing Heliconius butterfly species.
Chamberlain, N L; Hill, R I; Baxter, S W; Jiggins, C D; Kronforst, M R
2011-09-01
The comimetic Heliconius butterfly species pair, H. erato and H. melpomene, appear to use a conserved Mendelian switch locus to generate their matching red wing patterns. Here we investigate whether H. cydno and H. pachinus, species closely related to H. melpomene, use this same switch locus to generate their highly divergent red and brown color pattern elements. Using an F2 intercross between H. cydno and H. pachinus, we first map the genomic positions of two novel red/brown wing pattern elements; the G locus, which controls the presence of red vs brown at the base of the ventral wings, and the Br locus, which controls the presence vs absence of a brown oval pattern on the ventral hind wing. The results reveal that the G locus is tightly linked to markers in the genomic interval that controls red wing pattern elements of H. erato and H. melpomene. Br is on the same linkage group but approximately 26 cM away. Next, we analyze fine-scale patterns of genetic differentiation and linkage disequilibrium throughout the G locus candidate interval in H. cydno, H. pachinus and H. melpomene, and find evidence for elevated differentiation between H. cydno and H. pachinus, but no localized signature of association. Overall, these results indicate that the G locus maps to the same interval as the locus controlling red patterning in H. melpomene and H. erato. This, in turn, suggests that the genes controlling red pattern elements may be homologous across Heliconius, supporting the hypothesis that Heliconius butterflies use a limited suite of conserved genetic switch loci to generate both convergent and divergent wing patterns.
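The abstract above reports that Br lies approximately 26 cM from the G locus. As a worked illustration of what such a map distance implies, the Python sketch below converts centimorgans to an expected recombination fraction using the Haldane and Kosambi map functions. The abstract does not say which map function the authors used, so both are shown as assumptions, and the function names are hypothetical.

```python
import math


def haldane_recombination_fraction(centimorgans):
    """Haldane map function (assumes no crossover interference)."""
    d = centimorgans / 100.0  # convert cM to Morgans
    return 0.5 * (1.0 - math.exp(-2.0 * d))


def kosambi_recombination_fraction(centimorgans):
    """Kosambi map function (allows for some crossover interference)."""
    d = centimorgans / 100.0
    return 0.5 * math.tanh(2.0 * d)


r_haldane = haldane_recombination_fraction(26.0)
r_kosambi = kosambi_recombination_fraction(26.0)
```

Under either function the expected recombination fraction at 26 cM is well below 0.5, which is consistent with the abstract's statement that Br sits on the same linkage group as G while not being tightly linked to it.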
USDA-ARS's Scientific Manuscript database
Insulin-like growth factor 2 (IGF2) is a peptide hormone regulating various cellular processes such as proliferation and apoptosis. IGF2 is vital to embryo development. The IGF2 locus covers approximately 150-kb genomic region on human chromosome 11, containing two imprinted genes, IGF2 and H19, sha...
L'Homme, Y; Brown, G G
1993-01-01
Comparison of the physical maps of male fertile (cam) and male sterile (pol) mitochondrial genomes of Brassica napus indicates that structural differences between the two mtDNAs are confined to a region immediately upstream of the atp6 gene. Relative to cam mtDNA, pol mtDNA possesses a 4.5 kb segment at this locus that includes a chimeric gene that is cotranscribed with atp6 and lacks an approximately 1 kb region located upstream of the cam atp6 gene. The 4.5 kb pol segment is present and similarly organized in the mitochondrial genome of the common nap B. napus cytoplasm; however, the nap and pol DNA regions flanking this segment are different and the nap sequences are not expressed. The 4.5 kb CMS-associated pol segment has thus apparently undergone transposition during the evolution of the nap and pol cytoplasms and has been lost in the cam genome subsequent to the pol-cam divergence. This 4.5 kb segment comprises the single DNA region that is expressed differently in fertile, pol CMS and fertility restored pol cytoplasm plants. The finding that this locus is part of the single mtDNA region organized differently in the fertile and male sterile mitochondrial genomes provides strong support for the view that it specifies the pol CMS trait. PMID:8388101
Aii, Jotaro; Abe, Tomoko; Matsumoto, Daiki; Sato, Shingo; Hayashi, Yoriko; Ohnishi, Ohmi; Ota, Tatsuya
2012-01-01
The different forms of flowers in a species have attracted the attention of many evolutionary biologists, including Charles Darwin. In Fagopyrum esculentum (common buckwheat), the occurrence of dimorphic flowers, namely short-styled and long-styled flowers, is associated with a type of self-incompatibility (SI) called heteromorphic SI. The floral morphology and intra-morph incompatibility are both determined by a single genetic locus named the S-locus. Plants with short-styled flowers are heterozygous (S/s) and plants with long-styled flowers are homozygous recessive (s/s) at the S-locus. Despite recent progress in our understanding of the molecular basis of flower development and plant SI systems, the molecular mechanisms underlying heteromorphic SI remain unresolved. By examining differentially expressed genes from the styles of the two floral morphs, we identified a gene that is expressed only in short-styled plants. The novel gene identified was completely linked to the S-locus in a linkage analysis of 1,373 plants and had homology to EARLY FLOWERING 3. We named this gene S-LOCUS EARLY FLOWERING 3 (S-ELF3). In an ion-beam-induced mutant that harbored a deletion in the genomic region spanning S-ELF3, a phenotype shift from short-styled flowers to long-styled flowers was observed. Furthermore, S-ELF3 was present in the genome of short-styled plants and absent from that of long-styled plants both in world-wide landraces of buckwheat and in two distantly related Fagopyrum species that exhibit heteromorphic SI. Moreover, independent disruptions of S-ELF3 were detected in a recently emerged self-compatible Fagopyrum species and a self-compatible line of buckwheat. The nonessential role of S-ELF3 in the survival of individuals and the prolonged evolutionary presence only in the genomes of short-styled plants exhibiting heteromorphic SI suggests that S-ELF3 is a suitable candidate gene for the control of the short-styled phenotype of buckwheat plants. PMID:22312442
Designing Epigenome Editors: Considerations of Biochemical and Locus Specificities.
Sen, Dilara; Keung, Albert J
2018-01-01
The advent of locus-specific protein recruitment technologies has enabled a new class of studies in chromatin biology. Epigenome editors enable biochemical modifications of chromatin at almost any specific endogenous locus. Their locus specificity unlocks unique information including the functional roles of distinct modifications at specific genomic loci. Given the growing interest in using these tools for biological and translational studies, there are many specific design considerations depending on the scientific question or clinical need. Here we present and discuss important design considerations and challenges regarding the biochemical and locus specificities of epigenome editors. These include how to account for the complex biochemical diversity of chromatin; control for potential interdependency of epigenome editors and their resultant modifications; avoid sequestration effects; quantify the locus specificity of epigenome editors; and improve locus specificity by considering concentration, affinity, avidity, and sequestration effects.
Short-read, high-throughput sequencing technology for STR genotyping
Bornman, Daniel M.; Hester, Mark E.; Schuetter, Jared M.; Kasoji, Manjula D.; Minard-Smith, Angela; Barden, Curt A.; Nelson, Scott C.; Godbold, Gene D.; Baker, Christine H.; Yang, Boyu; Walther, Jacquelyn E.; Tornes, Ivan E.; Yan, Pearlly S.; Rodriguez, Benjamin; Bundschuh, Ralf; Dickens, Michael L.; Young, Brian A.; Faith, Seth A.
2013-01-01
DNA-based methods for human identification principally rely upon genotyping of short tandem repeat (STR) loci. Electrophoretic-based techniques for variable-length classification of STRs are universally utilized, but are limited in that they have relatively low throughput and do not yield nucleotide sequence information. High-throughput sequencing technology may provide a more powerful instrument for human identification, but is not currently validated for forensic casework. Here, we present a systematic method to perform high-throughput genotyping analysis of the Combined DNA Index System (CODIS) STR loci using short-read (150 bp) massively parallel sequencing technology. Open source reference alignment tools were optimized to evaluate PCR-amplified STR loci using a custom designed STR genome reference. Evaluation of this approach demonstrated that the 13 CODIS STR loci and amelogenin (AMEL) locus could be accurately called from individual and mixture samples. Sensitivity analysis showed that as few as 18,500 reads, aligned to an in silico referenced genome, were required to genotype an individual (>99% confidence) for the CODIS loci. The power of this technology was further demonstrated by identification of variant alleles containing single nucleotide polymorphisms (SNPs) and the development of quantitative measurements (reads) for resolving mixed samples. PMID:25621315
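The core idea of sequence-based STR genotyping described above, namely calling alleles by the number of repeat units observed in reads, can be sketched in a few lines of Python. This is a deliberately simplified stand-in and not the authors' pipeline: it replaces alignment against a custom STR reference with direct motif counting, and the function names, the `min_fraction` threshold, the motif, and the toy reads are all hypothetical.

```python
from collections import Counter


def longest_repeat_run(read, motif):
    """Number of repeat units in the longest uninterrupted run of motif."""
    best = 0
    k = len(motif)
    for start in range(len(read)):
        count = 0
        pos = start
        # Walk forward one motif length at a time while the motif repeats.
        while read[pos:pos + k] == motif:
            count += 1
            pos += k
        best = max(best, count)
    return best


def call_str_genotype(reads, motif, min_fraction=0.2):
    """Call up to two alleles (repeat counts) from read-level repeat counts.

    An allele is kept only if at least min_fraction of the informative
    reads support it (an assumed, crude filter for stutter and noise).
    """
    counts = Counter(longest_repeat_run(r, motif) for r in reads)
    counts.pop(0, None)  # ignore reads with no copy of the motif
    total = sum(counts.values())
    if total == 0:
        return ()
    alleles = [n for n, c in counts.most_common(2) if c / total >= min_fraction]
    return tuple(sorted(alleles))


# Toy reads for a heterozygous 7/9 individual plus one stutter read (8).
toy_reads = (
    ["TTC" + "GATA" * 7 + "CCA"] * 5
    + ["TTC" + "GATA" * 9 + "CCA"] * 4
    + ["TTC" + "GATA" * 8 + "CCA"]
)
genotype = call_str_genotype(toy_reads, "GATA")
```

The read counts per allele are also the kind of quantitative measurement the abstract mentions for resolving mixtures, although a realistic mixture analysis would need flanking-sequence checks and a proper stutter model.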
Slack, Andrew T; Dohnt, Michael F; Symonds, Meegan L; Smythe, Lee D
2005-01-01
Background: Leptospirosis is a zoonotic disease caused by bacteria of the genus Leptospira. Leptospira interrogans is the most common genomospecies implicated in the disease. Epidemiological investigations are needed to distinguish outbreak situations or to trace reservoirs of the organisms. Current methodologies used for typing Leptospira have significant drawbacks. An easy-to-perform yet high-resolution method is needed for this organism. Methods: In this study we searched the available genomic sequence of L. interrogans serovar Copenhageni strain Fiocruz L1-130 for the presence of tandem repeats [1]. These repeats were evaluated against reference strains for diversity. Six loci were selected to create a Multiple Locus Variable Number of Tandem Repeats (VNTR) Analysis (MLVA) to explore the genetic diversity within L. interrogans serovar Australis clinical isolates from Far North Queensland. Results: The 39 reference strains used for the development of the method displayed 39 distinct patterns. Diversity indexes for the loci varied between 0.80 and 0.93, and the number of repeat units at each locus varied from less than one to 52. When the MLVA was applied to serovar Australis isolates, three large clusters were distinguishable, each comprising various hosts including Rattus species, humans and canines. Conclusion: The MLVA described in this report was easy to perform and analyse and was reproducible. The loci selected had high diversity, allowing discrimination between serovars and also between strains within a serovar. This method provides a starting point on which improvements to the method and comparisons to other techniques can be made. PMID:15987533
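The abstract above reports per-locus diversity indexes without naming the formula. One index commonly used for typing schemes is the Hunter-Gaston (Simpson-type) index of discrimination, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of strains of type j and N is the total number of strains. The Python sketch below assumes that index purely for illustration; the function name and the toy repeat counts are hypothetical and are not data from the study.

```python
from collections import Counter


def hunter_gaston_index(types):
    """Probability that two strains drawn without replacement differ in type."""
    n = len(types)
    if n < 2:
        raise ValueError("need at least two strains")
    counts = Counter(types).values()
    return 1.0 - sum(c * (c - 1) for c in counts) / (n * (n - 1))


# Toy example: repeat-unit counts at one VNTR locus in ten strains.
locus_alleles = [3, 3, 4, 5, 5, 5, 7, 8, 9, 12]
d_locus = hunter_gaston_index(locus_alleles)

# If every strain has its own pattern, as with 39 distinct patterns
# among 39 strains, the index reaches its maximum of 1.0.
d_all_distinct = hunter_gaston_index(list(range(39)))
```

Applied per locus, an index like this shows which loci contribute most to discrimination; applied to whole multi-locus patterns, it summarizes the resolving power of the scheme.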
Zinc-finger nucleases-based genome engineering to generate isogenic human cell lines.
Dreyer, Anne-Kathrin; Cathomen, Toni
2012-01-01
Customized zinc-finger nucleases (ZFNs) have developed into a promising technology to precisely alter mammalian genomes for biomedical research, biotechnology, or human gene therapy. In the context of synthetic biology, the targeted integration of a transgene or reporter cassette into a "neutral site" of the human genome, such as the AAVS1 locus, permits the generation of isogenic human cell lines with two major advantages over standard genetic manipulation techniques: minimal integration site-dependent effects on the transgene and, vice versa, no functional perturbation of the host-cell transcriptome. Here we describe in detail how ZFNs can be employed to target integration of a transgene cassette into the AAVS1 locus and how to characterize the targeted cells by PCR-based genotyping.
The Gpr1/Zdbf2 locus provides new paradigms for transient and dynamic genomic imprinting in mammals
Duffié, Rachel; Ajjan, Sophie; Greenberg, Maxim V.; Zamudio, Natasha; Escamilla del Arenal, Martin; Iranzo, Julian; Okamoto, Ikuhiro; Barbaux, Sandrine; Fauque, Patricia; Bourc'his, Déborah
2014-01-01
Many loci maintain parent-of-origin DNA methylation only briefly after fertilization during mammalian development: Whether this form of transient genomic imprinting can impact the early embryonic transcriptome or even have life-long consequences on genome regulation and possibly phenotypes is currently unknown. Here, we report a maternal germline differentially methylated region (DMR) at the mouse Gpr1/Zdbf2 (DBF-type zinc finger-containing protein 2) locus, which controls the paternal-specific expression of long isoforms of Zdbf2 (Liz) in the early embryo. This DMR loses parental specificity by gain of DNA methylation at implantation in the embryo but is maintained in extraembryonic tissues. As a consequence of this transient, tissue-specific maternal imprinting, Liz expression is restricted to the pluripotent embryo, extraembryonic tissues, and pluripotent male germ cells. We found that Liz potentially functions as both Zdbf2-coding RNA and cis-regulatory RNA. Importantly, Liz-mediated events allow a switch from maternal to paternal imprinted DNA methylation and from Liz to canonical Zdbf2 promoter use during embryonic differentiation, which are stably maintained through somatic life and conserved in humans. The Gpr1/Zdbf2 locus lacks classical imprinting histone modifications, but analysis of mutant embryonic stem cells reveals fine-tuned regulation of Zdbf2 dosage through DNA and H3K27 methylation interplay. Together, our work underlines the developmental and evolutionary need to ensure proper Liz/Zdbf2 dosage as a driving force for dynamic genomic imprinting at the Gpr1/Zdbf2 locus. PMID:24589776
Cheng, Yu-Ching; Stanne, Tara M; Giese, Anne-Katrin; Ho, Weang Kee; Traylor, Matthew; Amouyel, Philippe; Holliday, Elizabeth G; Malik, Rainer; Xu, Huichun; Kittner, Steven J; Cole, John W; O'Connell, Jeffrey R; Danesh, John; Rasheed, Asif; Zhao, Wei; Engelter, Stefan; Grond-Ginsbach, Caspar; Kamatani, Yoichiro; Lathrop, Mark; Leys, Didier; Thijs, Vincent; Metso, Tiina M; Tatlisumak, Turgut; Pezzini, Alessandro; Parati, Eugenio A; Norrving, Bo; Bevan, Steve; Rothwell, Peter M; Sudlow, Cathie; Slowik, Agnieszka; Lindgren, Arne; Walters, Matthew R; Jannes, Jim; Shen, Jess; Crosslin, David; Doheny, Kimberly; Laurie, Cathy C; Kanse, Sandip M; Bis, Joshua C; Fornage, Myriam; Mosley, Thomas H; Hopewell, Jemma C; Strauch, Konstantin; Müller-Nurasyid, Martina; Gieger, Christian; Waldenberger, Melanie; Peters, Annette; Meisinger, Christine; Ikram, M Arfan; Longstreth, W T; Meschia, James F; Seshadri, Sudha; Sharma, Pankaj; Worrall, Bradford; Jern, Christina; Levi, Christopher; Dichgans, Martin; Boncoraglio, Giorgio B; Markus, Hugh S; Debette, Stephanie; Rolfs, Arndt; Saleheen, Danish; Mitchell, Braxton D
2016-02-01
Although a genetic contribution to ischemic stroke is well recognized, only a handful of stroke loci have been identified by large-scale genetic association studies to date. Hypothesizing that genetic effects might be stronger for early- versus late-onset stroke, we conducted a 2-stage meta-analysis of genome-wide association studies, focusing on stroke cases with an age of onset <60 years. The discovery stage of our genome-wide association studies included 4505 cases and 21 968 controls of European, South-Asian, and African ancestry, drawn from 6 studies. In Stage 2, we selected the lead genetic variants at loci with association P<5×10(-6) and performed in silico association analyses in an independent sample of ≤1003 cases and 7745 controls. One stroke susceptibility locus at 10q25 reached genome-wide significance in the combined analysis of all samples from the discovery and follow-up stages (rs11196288; odds ratio =1.41; P=9.5×10(-9)). The associated locus is in an intergenic region between TCF7L2 and HABP2. In a further analysis in an independent sample, we found that 2 single nucleotide polymorphisms in high linkage disequilibrium with rs11196288 were significantly associated with total plasma factor VII-activating protease levels, a product of HABP2. HABP2, which encodes an extracellular serine protease involved in coagulation, fibrinolysis, and inflammatory pathways, may be a genetic susceptibility locus for early-onset stroke. © 2016 American Heart Association, Inc.
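Combining discovery and follow-up stages, as described above, is often done with a fixed-effect inverse-variance meta-analysis of per-study log odds ratios. The abstract does not state which method was used, so the Python sketch below is an assumption-laden illustration only: the function name is hypothetical, the two input studies are invented numbers, and 5 × 10^-8 is the conventional genome-wide significance threshold rather than a value taken from the abstract.

```python
import math


def fixed_effect_meta(odds_ratios, standard_errors):
    """Inverse-variance fixed-effect meta-analysis of odds ratios.

    standard_errors are on the log-odds scale. Returns the combined odds
    ratio, its standard error (log scale), and a two-sided p-value from
    the normal approximation.
    """
    weights = [1.0 / se ** 2 for se in standard_errors]
    betas = [math.log(o) for o in odds_ratios]
    beta = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
    se = math.sqrt(1.0 / sum(weights))
    z = beta / se
    p_value = math.erfc(abs(z) / math.sqrt(2.0))  # equals 2 * (1 - Phi(|z|))
    return math.exp(beta), se, p_value


# Invented inputs: a discovery stage and a smaller follow-up stage.
combined_or, combined_se, p_value = fixed_effect_meta(
    odds_ratios=[1.45, 1.30],
    standard_errors=[0.07, 0.12],
)
GENOME_WIDE_THRESHOLD = 5e-8  # conventional cutoff, assumed here
is_genome_wide_significant = p_value < GENOME_WIDE_THRESHOLD
```

Because each study is weighted by the inverse of its variance, the larger discovery stage dominates the combined estimate, and the pooled standard error is smaller than that of either stage alone.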
A radiation hybrid map of chromosome 1D reveals synteny conservation at a wheat speciation locus.
USDA-ARS's Scientific Manuscript database
The species cytoplasm specific (scs) genes affect nuclear-cytoplasmic interactions in interspecific hybrids. A radiation hybrid (RH) mapping population of 188 individuals was employed to refine the location of the scsae locus of Triticum aestivum chromosome 1D. ‘Wheat Zapper’, a comparative genomic...
Three invariant Hi-C interaction patterns: Applications to genome assembly.
Oddes, Sivan; Zelig, Aviv; Kaplan, Noam
2018-06-01
Assembly of reference-quality genomes from next-generation sequencing data is a key challenge in genomics. Recently, we and others have shown that Hi-C data can be used to address several outstanding challenges in the field of genome assembly. This principle has since been developed in academia and industry, and has been used in the assembly of several major genomes. In this paper, we explore the central principles underlying Hi-C-based assembly approaches, by quantitatively defining and characterizing three invariant Hi-C interaction patterns on which these approaches can build: Intrachromosomal interaction enrichment, distance-dependent interaction decay and local interaction smoothness. Specifically, we evaluate to what degree each invariant pattern holds on a single locus level in different species, cell types and Hi-C map resolutions. We find that these patterns are generally consistent across species and cell types but are affected by sequencing depth, and that matrix balancing improves consistency of loci with all three invariant patterns. Finally, we overview current Hi-C-based assembly approaches in light of these invariant patterns and demonstrate how local interaction smoothness can be used to easily detect scaffolding errors in extremely sparse Hi-C maps. We suggest that simultaneously considering all three invariant patterns may lead to better Hi-C-based genome assembly methods. Copyright © 2018 Elsevier Inc. All rights reserved.
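Of the three invariant patterns named above, distance-dependent interaction decay is the easiest to make concrete: the average contact count should fall as the genomic distance between two bins grows. The Python sketch below computes the mean contact count along each diagonal of a contact matrix and checks that it does not increase with distance. It is a toy illustration, not the authors' quantitative definition; the function names and the synthetic matrix are assumptions.

```python
def mean_contacts_by_distance(matrix):
    """Mean contact count on each diagonal of a square symmetric matrix.

    Entry d of the result averages matrix[i][i + d] over all valid i,
    i.e. over all bin pairs separated by d bins.
    """
    n = len(matrix)
    return [
        sum(matrix[i][i + d] for i in range(n - d)) / (n - d)
        for d in range(n)
    ]


def decays_with_distance(matrix):
    """True if the mean contact count never increases with bin distance."""
    means = mean_contacts_by_distance(matrix)
    return all(a >= b for a, b in zip(means, means[1:]))


# Synthetic 5-bin matrix whose counts shrink with |i - j|.
n_bins = 5
toy_matrix = [
    [100 // (1 + abs(i - j)) for j in range(n_bins)]
    for i in range(n_bins)
]
decay_profile = mean_contacts_by_distance(toy_matrix)
```

On real Hi-C data the same profile is usually computed per chromosome after matrix balancing, which the abstract reports improves the consistency of these patterns across loci.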
A Nomadic Subtelomeric Disease Resistance Gene Cluster in Common Bean
David, Perrine; Chen, Nicolas W.G.; Pedrosa-Harand, Andrea; Thareau, Vincent; Sévignac, Mireille; Cannon, Steven B.; Debouck, Daniel; Langin, Thierry; Geffroy, Valérie
2009-01-01
The B4 resistance (R) gene cluster is one of the largest clusters known in common bean (Phaseolus vulgaris [Pv]). It is located in a peculiar genomic environment in the subtelomeric region of the short arm of chromosome 4, adjacent to two heterochromatic blocks (knobs). We sequenced 650 kb spanning this locus and annotated 97 genes, 26 of which correspond to Coiled-Coil-Nucleotide-Binding-Site-Leucine-Rich-Repeat (CNL). Conserved microsynteny was observed between the Pv B4 locus and corresponding regions of Medicago truncatula and Lotus japonicus in chromosomes Mt6 and Lj2, respectively. The notable exception was the CNL sequences, which were completely absent in these regions. The origin of the Pv B4-CNL sequences was investigated through phylogenetic analysis, which revealed that, in the Pv genome, paralogous CNL genes are shared among nonhomologous chromosomes (4 and 11). Together, our results suggest that Pv B4-CNL was derived from CNL sequences from another cluster, the Co-2 cluster, through an ectopic recombination event. Integration of the soybean (Glycine max) genome data enables us to date this event more precisely and also to infer that a single CNL moved from the Co-2 to the B4 cluster. Moreover, we identified a new 528-bp satellite repeat, referred to as khipu, specific to the Phaseolus genus, present both between B4-CNL sequences and in the two knobs identified at the B4 R gene cluster. The khipu repeat is present on most chromosomal termini, indicating the existence of frequent ectopic recombination events in Pv subtelomeric regions. Our results highlight the importance of ectopic recombination in R gene evolution. PMID:19776165
Thomson, P A; Parla, J S; McRae, A F; Kramer, M; Ramakrishnan, K; Yao, J; Soares, D C; McCarthy, S; Morris, S W; Cardone, L; Cass, S; Ghiban, E; Hennah, W; Evans, K L; Rebolini, D; Millar, J K; Harris, S E; Starr, J M; MacIntyre, D J; McIntosh, A M; Watson, J D; Deary, I J; Visscher, P M; Blackwood, D H; McCombie, W R; Porteous, D J
2014-06-01
A balanced t(1;11) translocation that transects the Disrupted in schizophrenia 1 (DISC1) gene shows genome-wide significant linkage for schizophrenia and recurrent major depressive disorder (rMDD) in a single large Scottish family, but genome-wide and exome sequencing-based association studies have not supported a role for DISC1 in psychiatric illness. To explore DISC1 in more detail, we sequenced 528 kb of the DISC1 locus in 653 cases and 889 controls. We report 2718 validated single-nucleotide polymorphisms (SNPs) of which 2010 have a minor allele frequency of <1%. Only 38% of these variants are reported in the 1000 Genomes Project European subset. This suggests that many DISC1 SNPs remain undiscovered and are essentially private. Rare coding variants identified exclusively in patients were found in likely functional protein domains. Significant region-wide association was observed between rs16856199 and rMDD (P=0.026, unadjusted P=6.3 × 10(-5), OR=3.48). This was not replicated in additional recurrent major depression samples (replication P=0.11). Combined analysis of both the original and replication set supported the original association (P=0.0058, OR=1.46). Evidence for segregation of this variant with disease in families was limited to those of rMDD individuals referred from primary care. Burden analysis for coding and non-coding variants gave nominal associations with diagnosis and measures of mood and cognition. Together, these observations are likely to generalise to other candidate genes for major mental illness and may thus provide guidelines for the design of future studies.
Identification of Novel Genetic Markers of Breast Cancer Survival
Guo, Qi; Schmidt, Marjanka K.; Kraft, Peter; Canisius, Sander; Chen, Constance; Khan, Sofia; Tyrer, Jonathan; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Michailidou, Kyriaki; Lush, Michael; Kar, Siddhartha; Beesley, Jonathan; Dunning, Alison M.; Shah, Mitul; Czene, Kamila; Darabi, Hatef; Eriksson, Mikael; Lambrechts, Diether; Weltens, Caroline; Leunen, Karin; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Blomqvist, Carl; Aittomäki, Kristiina; Fagerholm, Rainer; Muranen, Taru A.; Couch, Fergus J.; Olson, Janet E.; Vachon, Celine; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Mulligan, Anna Marie; Broeks, Annegien; Hogervorst, Frans B.; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Hopper, John L.; Tsimiklis, Helen; Apicella, Carmel; Southey, Melissa C.; Cox, Angela; Cross, Simon S.; Reed, Malcolm W. R.; Giles, Graham G.; Milne, Roger L.; McLean, Catriona; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Hooning, Maartje J.; Hollestelle, Antoinette; Martens, John W. M.; van den Ouweland, Ans M. W.; Marme, Federik; Schneeweiss, Andreas; Yang, Rongxi; Burwinkel, Barbara; Figueroa, Jonine; Chanock, Stephen J.; Lissowska, Jolanta; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Holleczek, Bernd; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Li, Jingmei; Brand, Judith S.; Humphreys, Keith; Devilee, Peter; Tollenaar, Rob A. E. M.; Seynaeve, Caroline; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Mariani, Paolo; Fasching, Peter A.; Beckmann, Matthias W.; Hein, Alexander; Ekici, Arif B.; Chenevix-Trench, Georgia; Balleine, Rosemary; Phillips, Kelly-Anne; Benitez, Javier; Zamora, M. 
Pilar; Arias Perez, Jose Ignacio; Menéndez, Primitiva; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Hamann, Ute; Kabisch, Maria; Ulmer, Hans Ulrich; Rüdiger, Thomas; Margolin, Sara; Kristensen, Vessela; Nord, Silje; Evans, D. Gareth; Abraham, Jean E.; Earl, Helena M.; Hiller, Louise; Dunn, Janet A.; Bowden, Sarah; Berg, Christine; Campa, Daniele; Diver, W. Ryan; Gapstur, Susan M.; Gaudet, Mia M.; Hankinson, Susan E.; Hoover, Robert N.; Hüsing, Anika; Kaaks, Rudolf; Machiela, Mitchell J.; Willett, Walter; Barrdahl, Myrto; Canzian, Federico; Chin, Suet-Feung; Caldas, Carlos; Hunter, David J.; Lindstrom, Sara; García-Closas, Montserrat; Hall, Per; Easton, Douglas F.; Eccles, Diana M.; Rahman, Nazneen; Nevanlinna, Heli; Pharoah, Paul D. P.
2015-01-01
Background: Survival after a diagnosis of breast cancer varies considerably between patients, and some of this variation may be because of germline genetic variation. We aimed to identify genetic markers associated with breast cancer–specific survival. Methods: We conducted a large meta-analysis of studies in populations of European ancestry, including 37954 patients with 2900 deaths from breast cancer. Each study had been genotyped for between 200000 and 900000 single nucleotide polymorphisms (SNPs) across the genome; genotypes for nine million common variants were imputed using a common reference panel from the 1000 Genomes Project. We also carried out subtype-specific analyses based on 6881 estrogen receptor (ER)–negative patients (920 events) and 23059 ER-positive patients (1333 events). All statistical tests were two-sided. Results: We identified one new locus (rs2059614 at 11q24.2) associated with survival in ER-negative breast cancer cases (hazard ratio [HR] = 1.95, 95% confidence interval [CI] = 1.55 to 2.47, P = 1.91 × 10(-8)). Genotyping a subset of 2113 case patients, of which 300 were ER negative, provided supporting evidence for the quality of the imputation. The association in this set of case patients was stronger for the observed genotypes than for the imputed genotypes. A second locus (rs148760487 at 2q24.2) was associated at genome-wide statistical significance in initial analyses; the association was similar in ER-positive and ER-negative case patients. Here the results of genotyping suggested that the finding was less robust. Conclusions: This is currently the largest study investigating genetic variation associated with breast cancer survival. Our results have potential clinical implications, as they confirm that germline genotype can provide prognostic information in addition to standard tumor prognostic factors. PMID:25890600
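The hazard ratio, confidence interval and P-value reported for rs2059614 can be cross-checked against each other with a simple Wald approximation on the log scale. The sketch below is only a consistency check under the assumption that the interval is symmetric in log(HR) and that a normal approximation applies; it is not the authors' analysis, and the function name is hypothetical.

```python
import math


def wald_p_from_ci(estimate, lower, upper, z_crit=1.959964):
    """Approximate SE, z and two-sided P for a ratio estimate from its 95% CI.

    Assumes the CI is symmetric on the log scale (an assumption, not a
    statement about how the study computed its statistics).
    """
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(estimate) / se
    # Two-sided tail probability of a standard normal variate.
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p


se, z, p = wald_p_from_ci(1.95, 1.55, 2.47)
print(f"SE(log HR) = {se:.3f}, z = {z:.2f}, P = {p:.2e}")
```

Under these assumptions the approximate P-value is of the order of 10(-8), in line with the value given in the abstract.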
A common variant mapping to CACNA1A is associated with susceptibility to exfoliation syndrome.
Aung, Tin; Ozaki, Mineo; Mizoguchi, Takanori; Allingham, R Rand; Li, Zheng; Haripriya, Aravind; Nakano, Satoko; Uebe, Steffen; Harder, Jeffrey M; Chan, Anita S Y; Lee, Mei Chin; Burdon, Kathryn P; Astakhov, Yury S; Abu-Amero, Khaled K; Zenteno, Juan C; Nilgün, Yildirim; Zarnowski, Tomasz; Pakravan, Mohammad; Safieh, Leen Abu; Jia, Liyun; Wang, Ya Xing; Williams, Susan; Paoli, Daniela; Schlottmann, Patricio G; Huang, Lulin; Sim, Kar Seng; Foo, Jia Nee; Nakano, Masakazu; Ikeda, Yoko; Kumar, Rajesh S; Ueno, Morio; Manabe, Shin-ichi; Hayashi, Ken; Kazama, Shigeyasu; Ideta, Ryuichi; Mori, Yosai; Miyata, Kazunori; Sugiyama, Kazuhisa; Higashide, Tomomi; Chihara, Etsuo; Inoue, Kenji; Ishiko, Satoshi; Yoshida, Akitoshi; Yanagi, Masahide; Kiuchi, Yoshiaki; Aihara, Makoto; Ohashi, Tsutomu; Sakurai, Toshiya; Sugimoto, Takako; Chuman, Hideki; Matsuda, Fumihiko; Yamashiro, Kenji; Gotoh, Norimoto; Miyake, Masahiro; Astakhov, Sergei Y; Osman, Essam A; Al-Obeidan, Saleh A; Owaidhah, Ohoud; Al-Jasim, Leyla; Al Shahwan, Sami; Fogarty, Rhys A; Leo, Paul; Yetkin, Yaz; Oğuz, Çilingir; Kanavi, Mozhgan Rezaei; Beni, Afsaneh Nederi; Yazdani, Shahin; Akopov, Evgeny L; Toh, Kai-Yee; Howell, Gareth R; Orr, Andrew C; Goh, Yufen; Meah, Wee Yang; Peh, Su Qin; Kosior-Jarecka, Ewa; Lukasik, Urszula; Krumbiegel, Mandy; Vithana, Eranga N; Wong, Tien Yin; Liu, Yutao; Koch, Allison E Ashley; Challa, Pratap; Rautenbach, Robyn M; Mackey, David A; Hewitt, Alex W; Mitchell, Paul; Wang, Jie Jin; Ziskind, Ari; Carmichael, Trevor; Ramakrishnan, Rangappa; Narendran, Kalpana; Venkatesh, Rangaraj; Vijayan, Saravanan; Zhao, Peiquan; Chen, Xueyi; Guadarrama-Vallejo, Dalia; Cheng, Ching Yu; Perera, Shamira A; Husain, Rahat; Ho, Su-Ling; Welge-Luessen, Ulrich-Christoph; Mardin, Christian; Schloetzer-Schrehardt, Ursula; Hillmer, Axel M; Herms, Stefan; Moebus, Susanne; Nöthen, Markus M; Weisschuh, Nicole; Shetty, Rohit; Ghosh, Arkasubhra; Teo, Yik Ying; Brown, Matthew A; Lischinsky, Ignacio; Crowston, Jonathan G; 
Coote, Michael; Zhao, Bowen; Sang, Jinghong; Zhang, Nihong; You, Qisheng; Vysochinskaya, Vera; Founti, Panayiota; Chatzikyriakidou, Anthoula; Lambropoulos, Alexandros; Anastasopoulos, Eleftherios; Coleman, Anne L; Wilson, M Roy; Rhee, Douglas J; Kang, Jae Hee; May-Bolchakova, Inna; Heegaard, Steffen; Mori, Kazuhiko; Alward, Wallace L M; Jonas, Jost B; Xu, Liang; Liebmann, Jeffrey M; Chowbay, Balram; Schaeffeler, Elke; Schwab, Matthias; Lerner, Fabian; Wang, Ningli; Yang, Zhenglin; Frezzotti, Paolo; Kinoshita, Shigeru; Fingert, John H; Inatani, Masaru; Tashiro, Kei; Reis, André; Edward, Deepak P; Pasquale, Louis R; Kubota, Toshiaki; Wiggs, Janey L; Pasutto, Francesca; Topouzis, Fotis; Dubina, Michael; Craig, Jamie E; Yoshimura, Nagahisa; Sundaresan, Periasamy; John, Simon W M; Ritch, Robert; Hauser, Michael A; Khor, Chiea-Chuen
2015-04-01
Exfoliation syndrome (XFS) is the most common recognizable cause of open-angle glaucoma worldwide. To better understand the etiology of XFS, we conducted a genome-wide association study (GWAS) of 1,484 cases and 1,188 controls from Japan and followed up the most significant findings in a further 6,901 cases and 20,727 controls from 17 countries across 6 continents. We discovered a genome-wide significant association between a new locus (CACNA1A rs4926244) and increased susceptibility to XFS (odds ratio (OR) = 1.16, P = 3.36 × 10(-11)). Although we also confirmed overwhelming association at the LOXL1 locus, the key SNP marker (LOXL1 rs4886776) demonstrated allelic reversal depending on the ancestry group (Japanese: OR(A allele) = 9.87, P = 2.13 × 10(-217); non-Japanese: OR(A allele) = 0.49, P = 2.35 × 10(-31)). Our findings represent the first genetic locus outside of LOXL1 surpassing genome-wide significance for XFS and provide insight into the biology and pathogenesis of the disease.
The immunoglobulin heavy chain locus in the platypus (Ornithorhynchus anatinus).
Gambón-Deza, F; Sánchez-Espinel, C; Magadán-Mompó, S
2009-08-01
Immunoglobulin loci in mammals are well known to be organized within a translocon; however, their origin remains unresolved. Four of the five classes of immunoglobulins described in humans and rodents (immunoglobulins M, G, E and A: IgM, IgG, IgE and IgA) were found in marsupials and monotremes (immunoglobulin D, IgD, was not found), thus showing that the genomic structure of antibodies in mammals has remained constant since its origin. We have recently described the genomic organization of the immunoglobulin heavy chain locus in reptiles (IGHM, IGHD and IGHY). These data, together with the characterization of the IGH locus in platypus (Ornithorhynchus anatinus), allow us to elucidate the changes that took place in this genomic region during evolution from reptile to mammal. Thus, by using available genome data, we were able to detect that the platypus IGH locus contains reptilian and mammalian genes. Besides having an IGHD that is very similar to the one in reptiles and an IGHY, it also presents the mammal-specific antibody genes IGHG and IGHE, in addition to IGHA. We also detected a pseudogene that originated by recombination between the IGHD and the IGHM (similar to the IGHD2 found in Eublepharis macularius). The analysis of the IGH locus in platypus shows that IGHY was duplicated, first evolving into IGHE and then into IGHG. The IGHA of the platypus has a complex origin, and probably arose by a process of recombination between the IGHM and the IGHY. We detected about 44 VH genes (25 were already described), most of which comprise a single group. When we compared these VH genes with those described in Anolis carolinensis, we found that there is an evolutionary relationship between the VH genes of platypus and the reptilian Group III genes. These results suggest that a fast VH turnover took place in platypus and that this gave rise to a family with a high VH gene number and the disappearance of the earlier VH families.
Henson, Kerstin; Luzader, Angelina; Lindstrom, Merle; Spooner, Muriel; Steffy, Brian M.; Suzuki, Oscar; Janse, Chris; Waters, Andrew P.; Zhou, Yingyao; Wiltshire, Tim; Winzeler, Elizabeth A.
2010-01-01
The genetic background of a patient determines in part if a person develops a mild form of malaria and recovers, or develops a severe form and dies. We have used a mouse model to detect genes involved in the resistance or susceptibility to Plasmodium berghei malaria infection. To this end we first characterized 32 different mouse strains infected with P. berghei and identified survival as the best trait to discriminate between the strains. We found a locus on chromosome 6 by linking the survival phenotypes of the mouse strains to their genetic variations using genome wide analyses such as haplotype associated mapping and the efficient mixed-model for association. This new locus involved in malaria resistance contains only two genes and confirms the importance of Ppar-γ in malaria infection. PMID:20531941
Southern Analysis of Genomic Alterations in Gamma-Ray-Induced Aprt- Hamster Cell Mutants
Grosovsky, Andrew J.; Drobetsky, Elliot A.; deJong, Pieter J.; Glickman, Barry W.
1986-01-01
The role of genomic alterations in mutagenesis induced by ionizing radiation has been the subject of considerable speculation. By Southern blotting analysis we show here that 9 of 55 (approximately 1/6) gamma-ray-induced mutants at the adenine phosphoribosyl transferase (aprt) locus of Chinese hamster ovary (CHO) cells have a detectable genomic rearrangement. These fall into two classes: intragenic deletions and chromosomal rearrangements. In contrast, no major genomic alterations were detected among 67 spontaneous mutants, although two restriction site loss events were observed. Three gamma-ray-induced mutants were found to be intragenic deletions; all may have identical break-points. The remaining six gamma-ray-induced mutants demonstrating a genomic alteration appear to be the result of chromosomal rearrangements, possibly translocation or inversion events. None of the remaining gamma-ray-induced mutants showed any observable alteration in blotting pattern indicating a substantial role for point mutation in gamma-ray-induced mutagenesis at the aprt locus. PMID:3013724
Improvements of the Ray-Tracing Based Method Calculating Hypocentral Loci for Earthquake Location
NASA Astrophysics Data System (ADS)
Zhao, A. H.
2014-12-01
Hypocentral loci are very useful for reliable and visual earthquake location. However, they can hardly be expressed analytically when the velocity model is complex. One method for calculating them numerically is based on a minimum traveltime tree algorithm for tracing rays: a focal locus is represented in terms of ray paths in its residual field from the minimum point (namely the initial point) to low-residual points (referred to as reference points of the focal locus). The method places no restrictions on the complexity of the velocity model but still cannot correctly handle multi-segment loci. Additionally, it is rather laborious to set calculation parameters that yield loci of satisfactory completeness and fineness. In this study, we improve the ray-tracing based numerical method to overcome these disadvantages. (1) Reference points of a hypocentral locus are selected from nodes of the model cells that it goes through, by means of a so-called peeling method. (2) The calculation domain of a hypocentral locus is defined as a low-residual area whose connected regions each include one segment of the locus; all the focal locus segments are then calculated in turn with the minimum traveltime tree algorithm for tracing rays, by repeatedly assigning as the initial point the minimum-residual reference point among those that have not yet been traced. (3) Short ray paths without branching are removed to make the calculated locus finer. Numerical tests show that the improved method can efficiently calculate complete and fine hypocentral loci of earthquakes in a complex model.
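The abstract does not spell out the minimum traveltime tree algorithm itself. As a minimal sketch, assuming it can be approximated by a Dijkstra-style shortest-path tree on a regular grid, the code below grows such a tree from an initial point and then reads a ray path back along the parent links. The grid, the four-neighbour connectivity, the edge cost (mean slowness times node spacing) and all function names are illustrative assumptions, not the author's implementation, and the peeling and multi-segment steps are not modelled.

```python
import heapq


def minimum_traveltime_tree(slowness, source, spacing=1.0):
    """Grow a shortest-traveltime tree over a 2-D grid from `source`.

    `slowness` is a list of rows (reciprocal velocity per node); the cost of
    moving between adjacent nodes is assumed to be the mean slowness of the
    two nodes times the node spacing.
    """
    rows, cols = len(slowness), len(slowness[0])
    times = {source: 0.0}
    parent = {source: None}
    heap = [(0.0, source)]
    while heap:
        t, (r, c) = heapq.heappop(heap)
        if t > times[(r, c)]:
            continue  # stale heap entry; a shorter time was already found
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols:
                step = spacing * 0.5 * (slowness[r][c] + slowness[nr][nc])
                candidate = t + step
                if candidate < times.get((nr, nc), float("inf")):
                    times[(nr, nc)] = candidate
                    parent[(nr, nc)] = (r, c)
                    heapq.heappush(heap, (candidate, (nr, nc)))
    return times, parent


def ray_path(parent, node):
    """Trace the path from the tree root to `node` via parent links."""
    path = []
    while node is not None:
        path.append(node)
        node = parent[node]
    return path[::-1]


# Hypothetical uniform 3 x 3 model (slowness 1.0 everywhere).
model = [[1.0] * 3 for _ in range(3)]
times, parent = minimum_traveltime_tree(model, (0, 0))
print(times[(2, 2)], ray_path(parent, (2, 2)))
```

In the method described above the tree would be grown in the residual field rather than in a slowness grid, with reference points taking the role of the path endpoints.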
CTLA-4 as a genetic determinant in autoimmune Addison's disease.
Wolff, A S B; Mitchell, A L; Cordell, H J; Short, A; Skinningsrud, B; Ollier, W; Badenhoop, K; Meyer, G; Falorni, A; Kampe, O; Undlien, D; Pearce, S H S; Husebye, E S
2015-09-01
In common with several other autoimmune diseases, autoimmune Addison's disease (AAD) is thought to be caused by a combination of deleterious susceptibility polymorphisms in several genes, together with undefined environmental factors and stochastic events. To date, the strongest genomic association with AAD has been with alleles at the HLA locus, DR3-DQ2 and DR4. The contribution of other genetic variants has been inconsistent. We have studied the association of 16 single-nucleotide polymorphisms (SNPs) within the CD28-CTLA-4-ICOS genomic locus, in a cohort comprising 691 AAD patients of Norwegian and UK origin with matched controls. We have also performed a meta-analysis including 1002 patients from European countries. The G-allele of SNP rs231775 in CTLA-4 is associated with AAD in Norwegian patients (odds ratio (OR)=1.35 (confidence interval (CI) 1.10-1.66), P=0.004), but not in UK patients. The same allele is associated with AAD in the total European population (OR=1.37 (CI 1.13-1.66), P=0.002). A three-marker haplotype, comprising PROMOTER_1661, rs231726 and rs1896286 was found to be associated with AAD in the Norwegian cohort only (OR 2.43 (CI 1.68-3.51), P=0.00013). This study points to the CTLA-4 gene as a susceptibility locus for the development of AAD, and refines its mapping within the wider genomic locus.
Inferring mechanisms of copy number change from haplotype structures at the human DEFA1A3 locus.
Black, Holly A; Khan, Fayeza F; Tyson, Jess; Armour, John AL
2014-07-21
The determination of structural haplotypes at copy number variable regions can indicate the mechanisms responsible for changes in copy number, as well as explain the relationship between gene copy number and expression. However, obtaining spatial information at regions displaying extensive copy number variation, such as the DEFA1A3 locus, is complex, because of the difficulty in the phasing and assembly of these regions. The DEFA1A3 locus is intriguing in that it falls within a region of high linkage disequilibrium, despite its high variability in copy number (n = 3-16); hence, the mechanisms responsible for changes in copy number at this locus are unclear. In this study, a region flanking the DEFA1A3 locus was sequenced across 120 independent haplotypes with European ancestry, identifying five common classes of DEFA1A3 haplotype. Assigning DEFA1A3 class to haplotypes within the 1000 Genomes project highlights a significant difference in DEFA1A3 class frequencies between populations with different ancestry. The features of each DEFA1A3 class, for example, the associated DEFA1A3 copy numbers, were initially assessed in a European cohort (n = 599) and replicated in the 1000 Genomes samples, showing within-class similarity, but between-class and between-population differences in the features of the DEFA1A3 locus. Emulsion haplotype fusion-PCR was used to generate 61 structural haplotypes at the DEFA1A3 locus, showing a high within-class similarity in structure. Structural haplotypes across the DEFA1A3 locus indicate that intra-allelic rearrangement is the predominant mechanism responsible for changes in DEFA1A3 copy number, explaining the conservation of linkage disequilibrium across the locus. The identification of common structural haplotypes at the DEFA1A3 locus could aid studies into how DEFA1A3 copy number influences expression, which is currently unclear.
Hit and go CAS9 delivered through a lentiviral based self-limiting circuit.
Petris, Gianluca; Casini, Antonio; Montagna, Claudia; Lorenzin, Francesca; Prandi, Davide; Romanel, Alessandro; Zasso, Jacopo; Conti, Luciano; Demichelis, Francesca; Cereseto, Anna
2017-05-22
In vivo application of the CRISPR-Cas9 technology is still limited by unwanted Cas9 genomic cleavages. Long-term expression of Cas9 increases the number of genomic loci non-specifically cleaved by the nuclease. Here we develop a Self-Limiting Cas9 circuit for Enhanced Safety and specificity (SLiCES) which consists of an expression unit for Streptococcus pyogenes Cas9 (SpCas9), a self-targeting sgRNA and a second sgRNA targeting a chosen genomic locus. The self-limiting circuit results in increased genome editing specificity by controlling Cas9 levels. For its in vivo utilization, we next integrate SLiCES into a lentiviral delivery system (lentiSLiCES) via circuit inhibition to achieve viral particle production. Upon delivery into target cells, the lentiSLiCES circuit switches on to edit the intended genomic locus while simultaneously stepping up its own neutralization through SpCas9 inactivation. By preserving target cells from residual nuclease activity, our hit and go system increases safety margins for genome editing.
Identifying Epigenetic Modulators of Resistance to ERK Signaling Inhibitors
2015-08-01
1.8 cal months 07/01/2014 - 05/31/2016 Bayer Hemophilia Award Program Targeted correction of hemophilia A using CRISPR-mediated editing The Specific...Aims of the project are to: (1) Insert a human FVIII cDNA into the Rosa26 locus of the mouse genome using the CRISPR-Cas9 system, and (2) Insert a...human FVIII cDNA into the AAVS1 locus of the human genome using the CRISPR-Cas9 system. Role: PI RGP009/2014 (Brown) 1.8 cal months 06/01/2014 - 05
Beta-defensin genomic copy number is not a modifier locus for cystic fibrosis
Hollox, Edward J; Davies, Jane; Griesenbach, Uta; Burgess, Juliana; Alton, Eric WFW; Armour, John AL
2005-01-01
Human beta-defensin 2 (DEFB4, also known as DEFB2 or hBD-2) is a salt-sensitive antimicrobial protein that is expressed in lung epithelia. Previous work has shown that it is encoded in a cluster of beta-defensin genes at 8p23.1, which varies in copy number between 2 and 12 in different individuals. We determined the copy number of this locus in 355 patients with cystic fibrosis (CF), and tested for correlation between beta-defensin cluster genomic copy number and lung disease associated with CF. No significant association was found. PMID:16336654
Three Infectious Viral Species Lying in Wait in the Banana Genome
Chabannes, Matthieu; Baurens, Franc-Christophe; Duroy, Pierre-Olivier; Bocs, Stéphanie; Vernerey, Marie-Stéphanie; Rodier-Goud, Marguerite; Barbe, Valérie; Gayral, Philippe
2013-01-01
Plant pararetroviruses integrate serendipitously into their host genomes. The banana genome harbors integrated copies of banana streak virus (BSV) named endogenous BSV (eBSV) that are able to release infectious pararetrovirus. In this investigation, we characterized integrants of three BSV species—Goldfinger (eBSGFV), Imove (eBSImV), and Obino l'Ewai (eBSOLV)—in the seedy Musa balbisiana Pisang klutuk wulung (PKW) by studying their molecular structure, genomic organization, genomic landscape, and infectious capacity. All eBSVs exhibit extensive viral genome duplications and rearrangements. eBSV segregation analysis on an F1 population of PKW combined with fluorescent in situ hybridization analysis showed that eBSImV, eBSOLV, and eBSGFV are each present at a single locus. eBSOLV and eBSGFV contain two distinct alleles, whereas eBSImV has two structurally identical alleles. Genotyping of both eBSV and viral particles expressed in the progeny demonstrated that only one allele for each species is infectious. The infectious allele of eBSImV could not be identified since the two alleles are identical. Finally, we demonstrate that eBSGFV and eBSOLV are located on chromosome 1 and eBSImV is located on chromosome 2 of the reference Musa genome published recently. The structure and evolution of eBSVs suggest sequential integration into the plant genome, and haplotype divergence analysis confirms that the three loci display differential evolution. Based on our data, we propose a model for BSV integration and eBSV evolution in the Musa balbisiana genome. The mutual benefits of this unique host-pathogen association are also discussed. PMID:23720724
2013-01-01
Background: Streptococcus agalactiae, also referred to as Group B Streptococcus (GBS), is a frequent resident of the rectovaginal tract in humans, and a major cause of neonatal infection. In addition, S. agalactiae is a known fish pathogen, which compromises food safety and represents a zoonotic hazard. The complete genome sequence of the piscine S. agalactiae isolate GD201008-001 was compared with 14 other piscine, human and bovine strains to explore their virulence determinants, evolutionary relationships and the genetic basis of host tropism in S. agalactiae. Results: The pan-genome of S. agalactiae is open and its size increases with the addition of newly sequenced genomes. The core genes shared by all isolates account for 50-70% of any single genome. The Chinese piscine isolates GD201008-001 and ZQ0910 are phylogenetically distinct from the Latin American piscine isolates SA20-06 and STIR-CD-17, but are closely related to the human strain A909, in the context of the clustered regularly interspaced short palindromic repeats (CRISPRs), prophage, virulence-associated genes and phylogenetic relationships. We identified a unique 10 kb gene locus in Chinese piscine strains. Conclusions: Isolates from cultured tilapia in China have a close genomic relationship with the human strain A909. Our findings provide insight into the pathogenesis and host-associated genome content of piscine S. agalactiae isolated in China. PMID:24215651
Genomic analysis reveals candidate genes for PPV resistance in apricot (Prunus armeniaca L.)
USDA-ARS's Scientific Manuscript database
Sharka disease, caused by Plum pox virus (PPV), is the most important disease affecting Prunus species. A major PPV resistance locus (PPVres) was previously mapped to the upper part of apricot (Prunus armeniaca) linkage group 1. In this study, a physical map of the PPVres locus in the PPV resistan...
USDA-ARS's Scientific Manuscript database
Natural antisense transcripts (NATs) are transcripts of the opposite DNA strand to the sense-strand either at the same locus (cis-encoded) or a different locus (trans-encoded). They can affect gene expression at multiple stages including transcription, RNA processing and transport, and translation....
Zhou, Yanrong; Lin, Yanli; Wu, Xiaojie; Xiong, Fuyin; Lv, Yuemeng; Zheng, Tao; Huang, Peitang; Chen, Hongxing
2012-02-01
Transgene expression for the mammary gland bioreactor aimed at producing recombinant proteins requires optimized expression vector construction. Previously we presented a hybrid gene locus strategy, which was originally tested with human lactoferrin (hLF) as the target transgene and achieved an extremely high expression level of rhLF, up to 29.8 g/l in mouse milk. Here, to demonstrate the broad applicability of this strategy, another hybrid gene locus, the 38.4-kb mWAP-htPA locus, was constructed, in which the 3-kb genomic coding sequence in the 24-kb mouse whey acidic protein (mWAP) gene locus was replaced by the 17.4-kb genomic coding sequence of human tissue plasminogen activator (htPA), exactly from the start codon to the stop codon. Five corresponding transgenic mouse lines were generated, and the highest expression level of rhtPA in the milk reached 3.3 g/l. Our strategy will provide a universal way for the large-scale production of pharmaceutical proteins in the mammary gland of transgenic animals.
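The sizes quoted for the hybrid locus are mutually consistent: removing the 3-kb coding sequence from the 24-kb mWAP locus and inserting the 17.4-kb htPA coding sequence gives the stated 38.4 kb. A small worked check, with a hypothetical helper name, is sketched here.

```python
def hybrid_locus_size_kb(host_locus_kb, removed_kb, inserted_kb):
    """Size of a hybrid locus after swapping one coding sequence for another."""
    return host_locus_kb - removed_kb + inserted_kb


# Values taken from the abstract: 24-kb mWAP locus, 3-kb mWAP coding
# sequence removed, 17.4-kb htPA coding sequence inserted.
size = hybrid_locus_size_kb(24.0, 3.0, 17.4)
print(f"{size:.1f} kb")
```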
Comparative fine mapping of the Wax 1 (W1) locus in hexaploid wheat.
Lu, Ping; Qin, Jinxia; Wang, Guoxin; Wang, Lili; Wang, Zhenzhong; Wu, Qiuhong; Xie, Jingzhong; Liang, Yong; Wang, Yong; Zhang, Deyun; Sun, Qixin; Liu, Zhiyong
2015-08-01
By applying comparative genomics analyses, a high-density genetic linkage map of the Wax 1 (W1) locus was constructed as a framework for map-based cloning. Glaucousness is described as the scattering effect of visible light from wax deposited on the cuticle of plant aerial organs. In wheat, the wax on leaves and stems is mainly controlled by two sets of genes: glaucousness loci (W1 and W2) and non-glaucousness loci (Iw1 and Iw2). Bulked segregant analysis (BSA) and simple sequence repeat (SSR) mapping showed that Wax 1 (W1) is located on chromosome arm 2BS between markers Xgwm210 and Xbarc35. By applying comparative genomics analyses, genomic regions collinear with the W1 locus on wheat 2BS were identified in Brachypodium distachyon chromosome 5, rice chromosome 4 and sorghum chromosome 6. Four STS markers were developed using the Triticum aestivum cv. Chinese Spring 454 contig sequences and the International Wheat Genome Sequencing Consortium (IWGSC) survey sequences. W1 was mapped into a 0.93 cM genetic interval flanked by markers XWGGC3197 and XWGGC2484, which has synteny with genomic regions of 56.5 kb in Brachypodium, 390 kb in rice and 31.8 kb in sorghum. The fine genetic map can serve as a framework for chromosome landing, physical mapping and map-based cloning of W1 in wheat.
Demirci, F. Yesim; Wang, Xingbin; Kelly, Jennifer A.; Morris, David L.; Barmada, M. Michael; Feingold, Eleanor; Kao, Amy H.; Sivils, Kathy L.; Bernatsky, Sasha; Pineau, Christian; Clarke, Ann; Ramsey-Goldman, Rosalind; Vyse, Timothy J.; Gaffney, Patrick M.; Manzi, Susan; Kamboh, M. Ilyas
2016-01-01
Objective: Genome-wide association studies (GWASs) in individuals of European ancestry identified a number of systemic lupus erythematosus (SLE) susceptibility loci using earlier versions of high-density genotyping platforms. Follow-up studies on suggestive GWAS regions using larger samples and more markers identified additional SLE loci in European-descent subjects. Here we report the results of a multi-stage study that we performed to identify novel SLE loci. Methods: In Stage 1, we conducted a new GWAS of SLE in a North American case-control sample of European ancestry (n=1,166) genotyped on Affymetrix Genome-Wide Human SNP Array 6.0. In Stage 2, we further investigated top new suggestive GWAS hits by in silico evaluation and meta-analysis using an additional dataset of European-descent subjects (>2,500 individuals), followed by replication of top meta-analysis findings in another dataset of European-descent subjects (>10,000 individuals) in Stage 3. Results: As expected, our GWAS revealed the most significant associations at the major histocompatibility complex locus (6p21), which easily surpassed the genome-wide significance threshold (P<5×10(-8)). Several other SLE signals/loci previously implicated in Caucasians and/or Asians were also supported in the Stage 1 discovery sample, and the strongest signals were observed at 2q32/STAT4 (P=3.6×10(-7)) and at 8p23/BLK (P=8.1×10(-6)). Stage 2 meta-analyses identified a new genome-wide significant SLE locus at 12q12 (meta P=3.1×10(-8)), which was replicated in Stage 3. Conclusion: Our multi-stage study identified and replicated a new SLE locus that warrants further follow-up in additional studies. Publicly available databases suggest that this new SLE signal falls within a functionally relevant genomic region and near biologically important genes. PMID:26316170
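The genome-wide significance threshold of 5 × 10(-8) used above is conventionally motivated as a Bonferroni correction of a 0.05 error rate over roughly one million independent tests. The sketch below applies that threshold to the P-values quoted in the abstract; the dictionary labels and function name are hypothetical, and the one-million figure is the usual convention rather than something stated in the source.

```python
GENOME_WIDE_ALPHA = 5e-8  # threshold quoted in the abstract


def bonferroni_threshold(alpha=0.05, n_tests=1_000_000):
    """Per-test threshold for a family-wise error rate `alpha`.

    The default of one million tests is the conventional assumption
    behind 5e-8, not a value taken from this study.
    """
    return alpha / n_tests


# P-values as reported in the abstract.
reported = {
    "2q32/STAT4 (Stage 1)": 3.6e-7,
    "8p23/BLK (Stage 1)": 8.1e-6,
    "12q12 (Stage 2 meta-analysis)": 3.1e-8,
}

significant = {locus: p < GENOME_WIDE_ALPHA for locus, p in reported.items()}
for locus, passed in significant.items():
    print(f"{locus}: {'genome-wide significant' if passed else 'suggestive'}")
```

Only the 12q12 meta-analysis signal falls below the threshold, which matches the abstract's description of the two Stage 1 signals as supported but not genome-wide significant.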
FISH-mapping of the 5S rDNA locus in chili peppers (Capsicum-Solanaceae).
Aguilera, Patricia M; Debat, Humberto J; Scaldaferro, Marisel A; Martí, Dardo A; Grabiele, Mauro
2016-03-01
We present here the physical mapping of the 5S rDNA locus in six wild and five cultivated taxa of Capsicum by means of a genus-specific FISH probe. In all taxa, a single 5S locus per haploid genome that persistently mapped onto the short arm of a unique metacentric chromosome pair at intercalar position, was found. 5S FISH signals of almost the same size and brightness intensity were observed in all the analyzed taxa. This is the first cytological characterization of the 5S in wild taxa of Capsicum by using a genus-derived probe, and the most exhaustive and comprehensive in the chili peppers up to now. The information provided here will aid the cytomolecular characterization of pepper germplasm to evaluate variability and can be instrumental to integrate physical, genetic and genomic maps already generated in the genus.
De Franceschi, Paolo; Bianco, Luca; Cestaro, Alessandro; Dondini, Luca; Velasco, Riccardo
2018-06-01
Data obtained from Illumina resequencing of 63 apple cultivars were used to obtain full-length S-RNase sequences using a strategy based on both alignment and de novo assembly of reads. The reproductive biology of apple is regulated by the S-RNase-based gametophytic self-incompatibility system, which is genetically controlled by the single, multi-genic and multi-allelic S locus. Resequencing of apple cultivars provided a huge amount of genetic data, which can be aligned to the reference genome in order to characterize variation at a genome-wide level. However, this approach is not immediately adaptable to the S-locus, due to some peculiar features such as the high degree of polymorphism, lack of colinearity between haplotypes and extensive presence of repetitive elements. In this study we describe a dedicated procedure aimed at characterizing S-RNase alleles from resequenced cultivars. The S-genotype of 63 apple accessions is reported; the full length coding sequence was determined for the 25 S-RNase alleles present in the 63 resequenced cultivars; these included 10 previously incomplete sequences (S5, S6a, S6b, S8, S11, S23, S39, S46, S50 and S58). Moreover, sequence divergence clearly suggests that alleles S6a and S6b, proposed to be neutral variants of the same allele, should instead be considered different specificities. The promoter sequences have also been analyzed, highlighting regions of homology conserved among all the alleles.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Boespflug-Tanguy, O.; Mimault, C.; Cavagna, A.
1994-09-01
Among the numerous leukodystrophies that have an early onset and no biochemical markers, Pelizaeus-Merzbacher disease (PMD) is one that can be identified using strict clinical criteria and demonstrating an abnormal formation of myelin that is restricted to the CNS in electrophysiological studies and brain magnetic resonance imaging (MRI). In PMD, 12 different base substitutions and one total deletion of the genomic region containing the PLP gene have been reported, but, despite extensive analysis, PLP exon mutations have been found in only 10%-25% of the families analyzed. To test the genetic homogeneity of this disease, the authors have carried out linkage analysis with polymorphic markers of the PLP genomic region in 16 families selected on strict diagnostic criteria of PMD. They observed a tight linkage of the PMD locus with markers of the PLP gene (cDNA PLP, exon IV polymorphism) and of the Xq22 region (DXS17, DXS94, and DXS287), whereas the markers located more proximally (DXYS1X and DXS3) or distally (DXS11) were not linked to the PMD locus. Multipoint analysis gave a maximal location score for the PMD locus (13.98) and the PLP gene (8.32) in the same interval between DXS94 and DXS287, suggesting that in all families PMD is linked to the PLP locus. Mutations of the extraexonic PLP gene sequences or of another unknown close gene could be involved in PMD. In an attempt to identify molecular defects of this genomic region that are responsible for PMD, these results meant that RFLP analysis could be used to improve genetic counseling for the numerous affected families in which a PLP exon mutation could not be demonstrated. 39 refs., 2 figs., 2 tabs.
Shinkai, Yoichi; Kuramochi, Masahiro; Doi, Motomichi
2018-05-03
Recently, advances in next-generation sequencing technologies have enabled genome-wide analyses of epigenetic modifications; however, it remains difficult to analyze the states of histone modifications at a single-cell resolution in living multicellular organisms because of the heterogeneity within cellular populations. Here we describe a simple method to visualize histone modifications on the specific sequence of target locus at a single-cell resolution in living Caenorhabditis elegans, by combining the LacO/LacI system and a genetically-encoded H4K20me1-specific probe, "mintbody". We demonstrate that Venus-labeled mintbody and mTurquoise2-labeled LacI can co-localize on an artificial chromosome carrying both the target locus and LacO sequences, where H4K20me1 marks the target locus. We demonstrate that our visualization method can precisely detect H4K20me1 depositions on the her-1 gene sequences on the artificial chromosome, to which the dosage compensation complex binds to regulate sex determination. The degree of H4K20me1 deposition on the her-1 sequences on the artificial chromosome correlated strongly with sex, suggesting that, using the artificial chromosome, this method can reflect context-dependent changes of H4K20me1 on endogenous genomes. Furthermore, we demonstrate live imaging of H4K20me1 depositions on the artificial chromosome. Combined with ChIP assays, this mintbody-LacO/LacI visualization method will enable analysis of developmental and context-dependent alterations of locus-specific histone modifications in specific cells and elucidation of the underlying molecular mechanisms. Copyright © 2018, G3: Genes, Genomes, Genetics.
Figure 2 from Integrative Genomics Viewer: Visualizing Big Data | Office of Cancer Genomics
Grouping and sorting genomic data in IGV. The IGV user interface displaying 202 glioblastoma samples from TCGA. Samples are grouped by tumor subtype (second annotation column) and data type (first annotation column) and sorted by copy number of the EGFR locus (middle column). Adapted from Figure 1; Robinson et al. 2011
Conservation in the face of diversity: multistrain analysis of an intracellular bacterium
USDA-ARS?s Scientific Manuscript database
Comparisons of multiple strains revealed that A. marginale has a closed-core genome with few highly plastic regions, which include the msp2 and msp3 genes, as well as the aaap locus. Comparison of the Florida and St. Maries genome sequences found that SNPs comprise 0.8% of the longer Florida genome,...
Genomic Model with Correlation Between Additive and Dominance Effects.
Xiang, Tao; Christensen, Ole Fredslund; Vitezica, Zulma Gladis; Legarra, Andres
2018-05-09
Dominance genetic effects are rarely included in pedigree-based genetic evaluation. With the availability of single nucleotide polymorphism markers and the development of genomic evaluation, estimates of dominance genetic effects have become feasible using genomic best linear unbiased prediction (GBLUP). Usually, studies involving additive and dominance genetic effects ignore possible relationships between them. It has been often suggested that the magnitude of functional additive and dominance effects at the quantitative trait loci are related, but there is no existing GBLUP-like approach accounting for such correlation. Wellmann and Bennewitz showed two ways of considering directional relationships between additive and dominance effects, which they estimated in a Bayesian framework. However, these relationships cannot be fitted at the level of individuals instead of loci in a mixed model and are not compatible with standard animal or plant breeding software. This comes from a fundamental ambiguity in assigning the reference allele at a given locus. We show that, if there has been selection, assigning the most frequent as the reference allele orients the correlation between functional additive and dominance effects. As a consequence, the most frequent reference allele is expected to have a positive value. We also demonstrate that selection creates negative covariance between genotypic additive and dominance genetic values. For parameter estimation, it is possible to use a combined additive and dominance relationship matrix computed from marker genotypes, and to use standard restricted maximum likelihood (REML) algorithms based on an equivalent model. Through a simulation study, we show that such correlations can easily be estimated by mixed model software and accuracy of prediction for genetic values is slightly improved if such correlations are used in GBLUP. 
However, a model assuming uncorrelated effects and fitting orthogonal breeding values and dominant deviations performed similarly for prediction. Copyright © 2018, Genetics.
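The abstract above relies on relationship matrices computed from marker genotypes. As a point of reference, here is a minimal sketch of the widely used additive genomic relationship matrix (the VanRaden-style construction, G = ZZ'/(2Σp(1−p))), which is one ingredient of GBLUP. It is not the combined additive and dominance matrix proposed in the paper, and the toy genotypes and function names are hypothetical.

```python
from typing import List


def allele_frequencies(genotypes: List[List[int]]) -> List[float]:
    """Frequency of the counted allele per marker; genotypes are coded 0/1/2."""
    n = len(genotypes)
    n_markers = len(genotypes[0])
    return [sum(row[j] for row in genotypes) / (2.0 * n) for j in range(n_markers)]


def additive_relationship_matrix(genotypes: List[List[int]]) -> List[List[float]]:
    """G = Z Z' / (2 * sum_j p_j (1 - p_j)), with Z the genotypes centred by 2 p_j."""
    p = allele_frequencies(genotypes)
    scale = 2.0 * sum(pj * (1.0 - pj) for pj in p)
    if scale == 0.0:
        raise ValueError("all markers are monomorphic; G is undefined")
    z = [[g - 2.0 * pj for g, pj in zip(row, p)] for row in genotypes]
    n = len(z)
    return [
        [sum(a * b for a, b in zip(z[i], z[k])) / scale for k in range(n)]
        for i in range(n)
    ]


if __name__ == "__main__":
    # Hypothetical toy data: 3 individuals x 2 markers.
    toy = [[0, 1], [1, 2], [2, 0]]
    for row in additive_relationship_matrix(toy):
        print(row)
```

Note that which allele is counted (the reference allele) changes the sign of the centred genotypes but not G itself; the abstract's point is that this choice does matter once additive and dominance effects are allowed to correlate.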
Sepúlveda, Nuno; Campino, Susana G; Assefa, Samuel A; Sutherland, Colin J; Pain, Arnab; Clark, Taane G
2013-02-26
The advent of next generation sequencing technology has accelerated efforts to map and catalogue copy number variation (CNV) in genomes of important micro-organisms for public health. A typical analysis of the sequence data involves mapping reads onto a reference genome, calculating the respective coverage, and detecting regions with too-low or too-high coverage (deletions and amplifications, respectively). Current CNV detection methods rely on statistical assumptions (e.g., a Poisson model) that may not hold in general, or require fine-tuning the underlying algorithms to detect known hits. We propose a new CNV detection methodology based on two Poisson hierarchical models, the Poisson-Gamma and Poisson-Lognormal, with the advantage of being sufficiently flexible to describe different data patterns, whilst robust against deviations from the often assumed Poisson model. Using sequence coverage data of 7 Plasmodium falciparum malaria genomes (3D7 reference strain, HB3, DD2, 7G8, GB4, OX005, and OX006), we showed that empirical coverage distributions are intrinsically asymmetric and overdispersed in relation to the Poisson model. We also demonstrated a low baseline false positive rate for the proposed methodology using 3D7 resequencing data and simulation. When applied to the non-reference isolate data, our approach detected known CNV hits, including an amplification of the PfMDR1 locus in DD2 and a large deletion in the CLAG3.2 gene in GB4, and putative novel CNV regions. When compared to the recently available FREEC and cn.MOPS approaches, our findings were more concordant with putative hits from the highest quality array data for the 7G8 and GB4 isolates. In summary, the proposed methodology brings an increase in flexibility, robustness, accuracy and statistical rigour to CNV detection using sequence coverage data.
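The overdispersion argument in the abstract above can be illustrated with a small simulation. This is a minimal sketch, assuming a mean coverage of 30 and a Gamma shape of 5 (both hypothetical values), and it is not the authors' detection method. It only shows that a Poisson-Gamma mixture, whose marginal distribution is negative binomial, has a variance-to-mean ratio well above the value of 1 expected under a plain Poisson model.

```python
import math
import random
from statistics import mean, variance
from typing import List


def sample_poisson(lam: float, rng: random.Random) -> int:
    """Knuth's multiplication method; adequate for the moderate rates used here."""
    limit = math.exp(-lam)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def sample_poisson_gamma(mean_cov: float, shape: float, rng: random.Random) -> int:
    """Draw a rate from Gamma(shape, scale=mean_cov/shape), then a Poisson count."""
    lam = rng.gammavariate(shape, mean_cov / shape)  # E[lam] = mean_cov
    return sample_poisson(lam, rng)


def dispersion_index(counts: List[int]) -> float:
    """Variance-to-mean ratio: about 1 for Poisson data, above 1 if overdispersed."""
    return variance(counts) / mean(counts)


if __name__ == "__main__":
    rng = random.Random(1)
    poisson_cov = [sample_poisson(30.0, rng) for _ in range(2000)]
    mixed_cov = [sample_poisson_gamma(30.0, 5.0, rng) for _ in range(2000)]
    print("Poisson dispersion index:", dispersion_index(poisson_cov))
    print("Poisson-Gamma dispersion index:", dispersion_index(mixed_cov))
```

For the mixture the theoretical variance is mean + mean²/shape, so with these assumed parameters the expected index is about 7. A coverage threshold calibrated on a pure Poisson model would therefore flag many ordinary windows as amplifications or deletions.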
Wan, KangKang; Zhang, Zhong; Pang, Xiaoming; Yin, Xiao; Bai, Yang; Sun, Xiaoqing; Gao, Lizhi; Li, Ruiqiang; Zhang, Jinbo
2016-01-01
Jujube (Ziziphus jujuba Mill.) belongs to the Rhamnaceae family and is a popular fruit tree species with immense economic and nutritional value. Here, we report a draft genome of the dry jujube cultivar ‘Junzao’ and the genome resequencing of 31 geographically diverse accessions of cultivated and wild jujubes (Ziziphus jujuba var. spinosa). Comparative analysis revealed that the genome of ‘Dongzao’, a fresh jujube, was ~86.5 Mb larger than that of the ‘Junzao’, partially due to the recent insertions of transposable elements in the ‘Dongzao’ genome. We constructed eight proto-chromosomes of the common ancestor of Rhamnaceae and Rosaceae, two sister families in the order Rosales, and elucidated the evolutionary processes that have shaped the genome structures of modern jujubes. Population structure analysis revealed the complex genetic background of jujubes resulting from extensive hybridizations between jujube and its wild relatives. Notably, several key genes that control fruit organic acid metabolism and sugar content were identified in the selective sweep regions. We also identified S-locus genes controlling gametophytic self-incompatibility and investigated haplotype patterns of the S locus in the jujube genomes, which would provide a guideline for parent selection for jujube crossbreeding. This study provides valuable genomic resources for jujube improvement, and offers insights into jujube genome evolution and its population structure and domestication. PMID:28005948
Huang, Jian; Zhang, Chunmei; Zhao, Xing; Fei, Zhangjun; Wan, KangKang; Zhang, Zhong; Pang, Xiaoming; Yin, Xiao; Bai, Yang; Sun, Xiaoqing; Gao, Lizhi; Li, Ruiqiang; Zhang, Jinbo; Li, Xingang
2016-12-01
Jujube (Ziziphus jujuba Mill.) belongs to the Rhamnaceae family and is a popular fruit tree species with immense economic and nutritional value. Here, we report a draft genome of the dry jujube cultivar 'Junzao' and the genome resequencing of 31 geographically diverse accessions of cultivated and wild jujubes (Ziziphus jujuba var. spinosa). Comparative analysis revealed that the genome of 'Dongzao', a fresh jujube, was ~86.5 Mb larger than that of the 'Junzao', partially due to the recent insertions of transposable elements in the 'Dongzao' genome. We constructed eight proto-chromosomes of the common ancestor of Rhamnaceae and Rosaceae, two sister families in the order Rosales, and elucidated the evolutionary processes that have shaped the genome structures of modern jujubes. Population structure analysis revealed the complex genetic background of jujubes resulting from extensive hybridizations between jujube and its wild relatives. Notably, several key genes that control fruit organic acid metabolism and sugar content were identified in the selective sweep regions. We also identified S-locus genes controlling gametophytic self-incompatibility and investigated haplotype patterns of the S locus in the jujube genomes, which would provide a guideline for parent selection for jujube crossbreeding. This study provides valuable genomic resources for jujube improvement, and offers insights into jujube genome evolution and its population structure and domestication.
Trower, M K; Orton, S M; Purvis, I J; Sanseau, P; Riley, J; Christodoulou, C; Burt, D; See, C G; Elgar, G; Sherrington, R; Rogaev, E I; St George-Hyslop, P; Brenner, S; Dykes, C W
1996-02-20
The genome of the pufferfish (Fugu rubripes) (400 Mb) is approximately 7.5 times smaller than the human genome, but it has a similar gene repertoire to that of man. If regions of the two genomes exhibited conservation of gene order (i.e., were syntenic), it should be possible to reduce dramatically the effort required for identification of candidate genes in human disease loci by sequencing syntenic regions of the compact Fugu genome. We have demonstrated that three genes (dihydrolipoamide succinyltransferase, S31iii125, and S20i15), which are linked to FOS in the familial Alzheimer disease focus (AD3) on human chromosome 14, have homologues in the Fugu genome adjacent to Fugu cFOS. The relative gene order of cFOS, S31iii125, and S20i15 was the same in both genomes, but in Fugu these three genes lay within a 12.4-kb region, compared to >600 kb in the human AD3 locus. These results demonstrate the conservation of synteny between the genomes of Fugu and man and highlight the utility of this approach for sequence-based identification of genes in human disease loci.
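The figures quoted in the abstract above can be checked with a few lines of arithmetic. This is a minimal sketch using only the numbers given in the text; the variable names are illustrative, and the human region size is a lower bound because it is reported as ">600 kb".

```python
# Figures quoted in the abstract.
FUGU_GENOME_MB = 400.0
GENOME_SIZE_RATIO = 7.5        # human genome is ~7.5 times larger than Fugu
FUGU_REGION_KB = 12.4          # cFOS, S31iii125 and S20i15 in Fugu
HUMAN_REGION_KB_MIN = 600.0    # reported as ">600 kb", so a lower bound

# Human genome size implied by the stated ratio.
implied_human_genome_mb = FUGU_GENOME_MB * GENOME_SIZE_RATIO

# Minimum local compaction of the AD3 region in Fugu relative to human.
min_local_compaction = HUMAN_REGION_KB_MIN / FUGU_REGION_KB

if __name__ == "__main__":
    print(f"Implied human genome size: {implied_human_genome_mb:.0f} Mb")
    print(f"Local compaction at AD3: at least {min_local_compaction:.1f}-fold")
```

On these numbers the region is compacted at least 48-fold in Fugu, considerably more than the 7.5-fold genome-wide average, which is what makes sequencing the syntenic Fugu interval attractive for candidate-gene identification.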
Genome engineering in cattle: recent technological advancements.
Wang, Zhongde
2015-02-01
Great strides in technological advancements have been made in the past decade in cattle genome engineering. First, the success of cloning cattle by somatic cell nuclear transfer (SCNT) or chromatin transfer (CT) is a significant advancement that has made obsolete the need for using embryonic stem (ES) cells to conduct cell-mediated genome engineering, whereby site-specific genetic modifications can be conducted in bovine somatic cells via DNA homologous recombination (HR) and whereby genetically engineered cattle can subsequently be produced by animal cloning from the genetically modified cells. With this approach, a chosen bovine genomic locus can be precisely modified in somatic cells, such as to knock out (KO) or knock in (KI) a gene via HR, a gene-targeting strategy that had almost exclusively been used in mouse ES cells. Furthermore, by the creative application of embryonic cloning to rejuvenate somatic cells, cattle genome can be sequentially modified in the same line of somatic cells and complex genetic modifications have been achieved in cattle. Very recently, the development of designer nucleases-such as zinc finger nucleases (ZFNs) and transcription activator-like effector nuclease (TALENs), and clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9)-has enabled highly efficient and more facile genome engineering in cattle. Most notably, by employing such designer nucleases, genomes can be engineered at single-nucleotide precision; this process is now often referred to as genome or gene editing. The above achievements are a drastic departure from the traditional methods of creating genetically modified cattle, where foreign DNAs are randomly integrated into the animal genome, most often along with the integrations of bacterial or viral DNAs. 
Here, I review the most recent technological developments in cattle genome engineering by highlighting some of the major achievements in creating genetically engineered cattle for agricultural and biomedical applications.
Genomic analysis reveals extensive gene duplication within the bovine TRB locus
Connelley, Timothy; Aerts, Jan; Law, Andy; Morrison, W Ivan
2009-01-01
Background Diverse TR and IG repertoires are generated by V(D)J somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21, which contain 40, 35 and 16 members respectively. Similarly, duplication has led to the generation of a third DJC cluster. Analyses of cDNA data confirm the diversity of the TRBV genes and, in addition, identify a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly. 
The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically diverse functional TRBV genes, which is substantially larger than that described for humans and mice. Conclusion The analyses completed in this study reveal that, although the gene content and organization of the bovine TRB locus are broadly similar to that of humans and mice, multiple duplication events have led to a marked expansion in the number of TRB genes. Similar expansions in other ruminant TR loci suggest strong evolutionary pressures in this lineage have selected for the development of enlarged sets of TR genes that can contribute to diverse TR repertoires. PMID:19393068
Leushkin, Evgeny V; Logacheva, Maria D; Penin, Aleksey A; Sutormin, Roman A; Gerasimov, Evgeny S; Kochkina, Galina A; Ivanushkina, Natalia E; Vasilenko, Oleg V; Kondrashov, Alexey S; Ozerskaya, Svetlana M
2015-05-21
Pseudogymnoascus spp. is a wide group of fungi lineages in the family Pseudorotiaceae including an aggressive pathogen of bats P. destructans. Although several lineages of P. spp. were shown to produce ascospores in culture, the vast majority of P. spp. demonstrates no evidence of sexual reproduction. P. spp. can tolerate a wide range of different temperatures and salinities and can survive even in permafrost layer. Adaptability of P. spp. to different environments is accompanied by extremely variable morphology and physiology. We sequenced genotypes of 14 strains of P. spp., 5 of which were extracted from permafrost, 1 from a cryopeg, a layer of unfrozen ground in permafrost, and 8 from temperate surface environments. All sequenced genotypes are haploid. Nucleotide diversity among these genomes is very high, with a typical evolutionary distance at synonymous sites dS ≈ 0.5, suggesting that the last common ancestor of these strains lived >50 Mya. The strains extracted from permafrost do not form a separate clade. Instead, each permafrost strain has close relatives from temperate environments. We observed a strictly clonal population structure with no conflicting topologies for ~99% of genome sequences. However, there is a number of short (~100-10,000 nt) genomic segments with the total length of 67.6 Kb which possess phylogenetic patterns strikingly different from the rest of the genome. The most remarkable case is a MAT-locus, which has 2 distinct alleles interspersed along the whole-genome phylogenetic tree. Predominantly clonal structure of genome sequences is consistent with the observations that sexual reproduction is rare in P. spp. Small number of regions with noncanonical phylogenies seem to arise due to some recombination events between derived lineages of P. spp., with MAT-locus being transferred on multiple occasions. All sequenced strains have heterothallic configuration of MAT-locus.
Inter-chromosomal Contact Properties in Live-Cell Imaging and in Hi-C.
Maass, Philipp G; Barutcu, A Rasim; Weiner, Catherine L; Rinn, John L
2018-03-15
Imaging (fluorescence in situ hybridization [FISH]) and genome-wide chromosome conformation capture (Hi-C) are two major approaches to the study of higher-order genome organization in the nucleus. Intra-chromosomal and inter-chromosomal interactions (referred to as non-homologous chromosomal contacts [NHCCs]) have been observed by several FISH-based studies, but locus-specific NHCCs have not been detected by Hi-C. Due to crosslinking, neither of these approaches assesses spatiotemporal properties. Toward resolving the discrepancies between imaging and Hi-C, we sought to understand the spatiotemporal properties of NHCCs in living cells by CRISPR/Cas9 live-cell imaging (CLING). In mammalian cells, we find that NHCCs are stable and occur as frequently as intra-chromosomal interactions, but NHCCs occur at farther spatial distance that could explain their lack of detection in Hi-C. By revealing the spatiotemporal properties in living cells, our study provides fundamental insights into the biology of NHCCs. Copyright © 2018 Elsevier Inc. All rights reserved.
GWAS meta-analysis of 16 852 women identifies new susceptibility locus for endometrial cancer
Chen, Maxine M.; O'Mara, Tracy A.; Thompson, Deborah J.; Painter, Jodie N.; Attia, John; Black, Amanda; Brinton, Louise; Chanock, Stephen; Chen, Chu; Cheng, Timothy HT; Cook, Linda S.; Crous-Bou, Marta; Doherty, Jennifer; Friedenreich, Christine M.; Garcia-Closas, Montserrat; Gaudet, Mia M.; Gorman, Maggie; Haiman, Christopher; Hankinson, Susan E.; Hartge, Patricia; Henderson, Brian E.; Hodgson, Shirley; Holliday, Elizabeth G.; Horn-Ross, Pamela L.; Hunter, David J.; Le Marchand, Loic; Liang, Xiaolin; Lissowska, Jolanta; Long, Jirong; Lu, Lingeng; Magliocco, Anthony M.; Martin, Lynn; McEvoy, Mark; Olson, Sara H.; Orlow, Irene; Pooler, Loreall; Prescott, Jennifer; Rastogi, Radhai; Rebbeck, Timothy R.; Risch, Harvey; Sacerdote, Carlotta; Schumacher, Frederick; Wendy Setiawan, Veronica; Scott, Rodney J.; Sheng, Xin; Shu, Xiao-Ou; Turman, Constance; Van Den Berg, David; Wang, Zhaoming; Weiss, Noel S.; Wentzensen, Nicholas; Xia, Lucy; Xiang, Yong-Bing; Yang, Hannah P.; Yu, Herbert; Zheng, Wei; Pharoah, Paul D.P.; Dunning, Alison M.; Tomlinson, Ian; Easton, Douglas F.; Kraft, Peter; Spurdle, Amanda B.; De Vivo, Immaculata
2016-01-01
Endometrial cancer is the most common gynecological malignancy in the developed world. Although there is evidence of genetic predisposition to the disease, most of the genetic risk remains unexplained. We present the meta-analysis results of four genome-wide association studies (4907 cases and 11 945 controls total) in women of European ancestry. We describe one new locus reaching genome-wide significance (P < 5 × 10−8) at 6p22.3 (rs1740828; P = 2.29 × 10−8, OR = 1.20), providing evidence of an additional region of interest for genetic susceptibility to endometrial cancer. PMID:27008869
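Combining per-study association estimates, as in the four-study meta-analysis above (4907 cases plus 11 945 controls gives the 16 852 women in the title), is often done with a fixed-effect inverse-variance model. The sketch below shows that standard calculation on hypothetical per-study log odds ratios; the abstract does not state which meta-analysis model was used, so this is an assumption for illustration only.

```python
import math
from typing import Sequence, Tuple


def fixed_effect_meta(betas: Sequence[float], ses: Sequence[float]) -> Tuple[float, float, float]:
    """Inverse-variance pooled estimate.

    betas: per-study log odds ratios; ses: their standard errors.
    Returns (pooled_beta, pooled_se, two_sided_p) under a normal approximation.
    """
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = math.sqrt(1.0 / total)
    z = pooled_beta / pooled_se
    p_value = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal tail
    return pooled_beta, pooled_se, p_value


if __name__ == "__main__":
    # Hypothetical per-study estimates, not data from the paper.
    study_betas = [0.20, 0.15, 0.22, 0.17]
    study_ses = [0.06, 0.07, 0.08, 0.05]
    beta, se, p = fixed_effect_meta(study_betas, study_ses)
    print(f"Pooled OR = {math.exp(beta):.2f}, SE = {se:.3f}, P = {p:.2e}")
```

The pooled standard error shrinks as studies are added, which is why a locus can reach genome-wide significance in the meta-analysis without doing so in any single contributing study.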
Multi-ethnic genome-wide association study identifies novel locus for type 2 diabetes susceptibility
Cook, James P; Morris, Andrew P
2016-01-01
Genome-wide association studies (GWAS) have traditionally been undertaken in homogeneous populations from the same ancestry group. However, with the increasing availability of GWAS in large-scale multi-ethnic cohorts, we have evaluated a framework for detecting association of genetic variants with complex traits, allowing for population structure, and developed a powerful test of heterogeneity in allelic effects between ancestry groups. We have applied the methodology to identify and characterise loci associated with susceptibility to type 2 diabetes (T2D) using GWAS data from the Resource for Genetic Epidemiology on Adult Health and Aging, a large multi-ethnic population-based cohort, created for investigating the genetic and environmental basis of age-related diseases. We identified a novel locus for T2D susceptibility at genome-wide significance (P<5 × 10−8) that maps to TOMM40-APOE, a region previously implicated in lipid metabolism and Alzheimer's disease. We have also confirmed previous reports that single-nucleotide polymorphisms at the TCF7L2 locus demonstrate the greatest extent of heterogeneity in allelic effects between ethnic groups, with the lowest risk observed in populations of East Asian ancestry. PMID:27189021
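Heterogeneity in allelic effects between groups, as discussed in the abstract above, is conventionally summarised with Cochran's Q and the I² statistic. The sketch below computes those standard quantities from hypothetical per-ancestry effect estimates. It is a stand-in for illustration and is not the heterogeneity test developed in the paper.

```python
from typing import Sequence, Tuple


def cochran_q(betas: Sequence[float], ses: Sequence[float]) -> Tuple[float, int, float]:
    """Return (Q, degrees_of_freedom, I_squared) for per-group effect estimates.

    Q is the inverse-variance weighted sum of squared deviations from the
    pooled estimate; I^2 = max(0, (Q - df) / Q) is the share of variation
    attributed to heterogeneity rather than sampling error.
    """
    weights = [1.0 / se ** 2 for se in ses]
    pooled = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, betas))
    df = len(betas) - 1
    i_squared = max(0.0, (q - df) / q) if q > 0.0 else 0.0
    return q, df, i_squared


if __name__ == "__main__":
    # Hypothetical log odds ratios for one variant in three ancestry groups.
    q, df, i2 = cochran_q([0.35, 0.30, 0.10], [0.04, 0.05, 0.06])
    print(f"Q = {q:.2f} on {df} df, I^2 = {100 * i2:.0f}%")
```

Under the null hypothesis of a common effect, Q is compared against a chi-square distribution with df degrees of freedom; a large Q for a variant such as those at TCF7L2 would indicate effects that differ between ancestry groups.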
A trait stacking system via intra-genomic homologous recombination.
Kumar, Sandeep; Worden, Andrew; Novak, Stephen; Lee, Ryan; Petolino, Joseph F
2016-11-01
A gene targeting method has been developed, which allows the conversion of 'breeding stacks', containing unlinked transgenes into a 'molecular stack' and thereby circumventing the breeding challenges associated with transgene segregation. A gene targeting method has been developed for converting two unlinked trait loci into a single locus transgene stack. The method utilizes intra-genomic homologous recombination (IGHR) between stably integrated target and donor loci which share sequence homology and nuclease cleavage sites whereby the donor contains a promoterless herbicide resistance transgene. Upon crossing with a zinc finger nuclease (ZFN)-expressing plant, double-strand breaks (DSB) are created in both the stably integrated target and donor loci. DSBs flanking the donor locus result in intra-genomic mobilization of a promoterless selectable marker-containing donor sequence, which can be utilized as a template for homology-directed repair of a concomitant DSB at the target locus resulting in a functional selectable marker via nuclease-mediated cassette exchange (NMCE). The method was successfully demonstrated in maize using a glyphosate tolerance gene as a donor whereby up to 3.3 % of the resulting progeny embryos cultured on selection medium regenerated plants with the donor sequence integrated into the target locus. The process could be extended to multiple cycles of trait stacking by virtue of a unique intron sequence homology for NMCE between the target and the donor loci. This is the first report that describes NMCE via IGHR, thereby enabling trait stacking using conventional crossing.
A Genome-Wide Association Study of Circulating Galectin-3
van Veldhuisen, Dirk J.; Westra, Harm-Jan; Bakker, Stephan J. L.; Gansevoort, Ron T.; Muller Kobold, Anneke C.; van Gilst, Wiek H.; Franke, Lude
2012-01-01
Galectin-3 is a lectin involved in fibrosis, inflammation and proliferation. Increased circulating levels of galectin-3 have been associated with various diseases, including cancer, immunological disorders, and cardiovascular disease. To enhance our knowledge on galectin-3 biology we performed the first genome-wide association study (GWAS) using the Illumina HumanCytoSNP-12 array imputed with the HapMap 2 CEU panel on plasma galectin-3 levels in 3,776 subjects and follow-up genotyping in an additional 3,516 subjects. We identified 2 genome wide significant loci associated with plasma galectin-3 levels. One locus harbours the LGALS3 gene (rs2274273; P = 2.35×10−188) and the other locus the ABO gene (rs644234; P = 3.65×10−47). The variance explained by the LGALS3 locus was 25.6% and by the ABO locus 3.8% and jointly they explained 29.2%. Rs2274273 lies in high linkage disequilibrium with two non-synonymous SNPs (rs4644; r2 = 1.0, and rs4652; r2 = 0.91) and wet lab follow-up genotyping revealed that both are strongly associated with galectin-3 levels (rs4644; P = 4.97×10−465 and rs4652 P = 1.50×10−421) and were also associated with LGALS3 gene-expression. The origins of our associations should be further validated by means of functional experiments. PMID:23056639
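The linkage disequilibrium values quoted above (r2 = 1.0 and r2 = 0.91) use the standard squared-correlation measure between two biallelic sites. A minimal sketch of that formula follows, with hypothetical haplotype and allele frequencies; none of the input numbers come from the study.

```python
def ld_r_squared(p_ab: float, p_a: float, p_b: float) -> float:
    """Squared correlation between alleles A and B at two biallelic sites.

    p_ab: frequency of the haplotype carrying both A and B.
    p_a, p_b: marginal frequencies of alleles A and B.
    r^2 = D^2 / (p_a (1 - p_a) p_b (1 - p_b)), with D = p_ab - p_a * p_b.
    """
    denom = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    if denom == 0.0:
        raise ValueError("r^2 is undefined when a site is monomorphic")
    d = p_ab - p_a * p_b
    return d * d / denom


if __name__ == "__main__":
    # Hypothetical frequencies: alleles always travel together (perfect LD).
    print(ld_r_squared(p_ab=0.30, p_a=0.30, p_b=0.30))
    # Hypothetical frequencies: alleles are independent (no LD).
    print(ld_r_squared(p_ab=0.12, p_a=0.30, p_b=0.40))
```

An r2 of 1.0 means the lead SNP and the non-synonymous SNP are statistically interchangeable in this sample, so association data alone cannot say which one is causal. That is consistent with the abstract's call for functional follow-up.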
Wang, Hansong; Burnett, Terrilea; Kono, Suminori; Haiman, Christopher A.; Iwasaki, Motoki; Wilkens, Lynne R.; Loo, Lenora W.M.; Berg, David Van Den; Kolonel, Laurence N.; Henderson, Brian E.; Keku, Temitope O.; Sandler, Robert S.; Signorello, Lisa B.; Blot, William J.; Newcomb, Polly A.; Pande, Mala; Amos, Christopher I.; West, Dee W.; Bézieau, Stéphane; Berndt, Sonja I.; Zanke, Brent W.; Hsu, Li; Lindor, Noralane M.; Haile, Robert W.; Hopper, John L.; Jenkins, Mark A.; Gallinger, Steven; Casey, Graham; Stenzel, Stephanie L.; Schumacher, Fredrick R.; Peters, Ulrike; Gruber, Stephen B.; Tsugane, Shoichiro; Stram, Daniel O.; Marchand, Loïc Le
2014-01-01
The genetic basis of sporadic colorectal cancer (CRC) is not well explained by known risk polymorphisms. Here we perform a meta-analysis of two genome-wide association studies in 2,627 cases and 3,797 controls of Japanese ancestry and 1,894 cases and 4,703 controls of African ancestry, to identify genetic variants that contribute to CRC susceptibility. We replicate genome-wide statistically significant associations (P < 5×10−8) in 16,823 cases and 18,211 controls of European ancestry. This study reveals a new pan-ethnic CRC risk locus at 10q25 (rs12241008, intronic to VTI1A; P=1.4×10−9), providing additional insight into the etiology of CRC and highlighting the value of association mapping in diverse populations. PMID:25105248
Wang, Hansong; Burnett, Terrilea; Kono, Suminori; Haiman, Christopher A; Iwasaki, Motoki; Wilkens, Lynne R; Loo, Lenora W M; Van Den Berg, David; Kolonel, Laurence N; Henderson, Brian E; Keku, Temitope O; Sandler, Robert S; Signorello, Lisa B; Blot, William J; Newcomb, Polly A; Pande, Mala; Amos, Christopher I; West, Dee W; Bézieau, Stéphane; Berndt, Sonja I; Zanke, Brent W; Hsu, Li; Lindor, Noralane M; Haile, Robert W; Hopper, John L; Jenkins, Mark A; Gallinger, Steven; Casey, Graham; Stenzel, Stephanie L; Schumacher, Fredrick R; Peters, Ulrike; Gruber, Stephen B; Tsugane, Shoichiro; Stram, Daniel O; Le Marchand, Loïc
2014-08-08
The genetic basis of sporadic colorectal cancer (CRC) is not well explained by known risk polymorphisms. Here we perform a meta-analysis of two genome-wide association studies in 2,627 cases and 3,797 controls of Japanese ancestry and 1,894 cases and 4,703 controls of African ancestry, to identify genetic variants that contribute to CRC susceptibility. We replicate genome-wide statistically significant associations (P<5 × 10(-8)) in 16,823 cases and 18,211 controls of European ancestry. This study reveals a new pan-ethnic CRC risk locus at 10q25 (rs12241008, intronic to VTI1A; P=1.4 × 10(-9)), providing additional insight into the aetiology of CRC and highlighting the value of association mapping in diverse populations.
Peng, Wenzhu; Xu, Jian; Zhang, Yan; Feng, Jianxin; Dong, Chuanju; Jiang, Likun; Feng, Jingyan; Chen, Baohua; Gong, Yiwen; Chen, Lin; Xu, Peng
2016-01-01
High-density genetic linkage maps are essential for QTL fine mapping, comparative genomics and high-quality genome sequence assembly. In this study, we constructed a high-density and high-resolution genetic linkage map with 28,194 SNP markers on 14,146 distinct loci for common carp based on high-throughput genotyping with the carp 250 K single nucleotide polymorphism (SNP) array in a mapping family. The genetic length of the consensus map was 10,595.94 cM with an average locus interval of 0.75 cM and an average marker interval of 0.38 cM. Comparative genomic analysis revealed a high level of conserved synteny between common carp and the closely related model species zebrafish and medaka. The genome scaffolds were anchored to the high-density linkage map, spanning 1,357 Mb of the common carp reference genome. QTL mapping and association analysis identified 22 QTLs for growth-related traits and 7 QTLs for sex dimorphism. Candidate genes underlying growth-related traits were identified, including important regulators such as KISS2, IGF1, SMTLB, NPFFR1 and CPE. Candidate genes associated with sex dimorphism were also identified, including 3KSR and DMRT2b. The high-density and high-resolution genetic linkage map provides an important tool for QTL fine mapping and positional cloning of economically important traits, and for improving the common carp genome assembly. PMID:27225429
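The interval figures quoted in this abstract can be checked with simple arithmetic. The sketch below assumes that each average interval is just the consensus map length divided by the number of loci or markers; the authors may have computed it per linkage group, so this is only a plausibility check, and the variable names are illustrative.

```python
# Figures quoted in the abstract above.
map_length_cm = 10595.94   # genetic length of the consensus map, in cM
n_markers = 28194          # SNP markers placed on the map
n_loci = 14146             # distinct loci (several markers can share a locus)

# Assumption: average interval = total length / count (not the authors' stated method).
locus_interval = map_length_cm / n_loci
marker_interval = map_length_cm / n_markers

print(round(locus_interval, 2), round(marker_interval, 2))
```

Under that assumption both values round to the reported 0.75 cM and 0.38 cM.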
Tripathi, Charu; Mishra, Harshita; Khurana, Himani; Dwivedi, Vatsala; Kamra, Komal; Negi, Ram K.; Lal, Rup
2017-01-01
Thermophilic environments represent an interesting niche. Among thermophiles, the genus Thermus is among the most studied genera. In this study, we have sequenced the genome of Thermus parvatiensis strain RL, a thermophile isolated from Himalayan hot water springs (temperature >96°C), using the PacBio RSII SMRT technique. The small genome (2.01 Mbp) comprises a chromosome (1.87 Mbp) and a plasmid (143 Kbp), designated in this study as pTP143. Annotation revealed a high number of repair genes and a compact genome, but a highly plastic plasmid carrying transposases, integrases, mobile elements and hypothetical proteins (44%). We performed a comparative genomic study of the group Thermus with the aim of analysing the phylogenetic relatedness as well as niche-specific attributes prevalent among the group. We compared the reference genome RL with 16 Thermus genomes to assess their phylogenetic relationships based on 16S rRNA gene sequences, average nucleotide identity (ANI), conserved marker genes (31 and 400), pan genome and tetranucleotide frequency. The core genome of the analyzed genomes contained 1,177 core genes and many singleton genes were detected in individual genomes, reflecting a conserved core but adaptive pan repertoire. We demonstrated the presence of metagenomic islands (chromosome: 5, plasmid: 5) by recruiting raw metagenomic data (from the same niche) against the genomic replicons of T. parvatiensis. We also dissected the CRISPR loci across all genomes and found widespread presence of this system across Thermus genomes. Additionally, we performed a comparative analysis of competence loci across Thermus genomes and found evidence for recent horizontal acquisition of the locus and continued dispersal among members, reflecting that natural competence is a beneficial survival trait among Thermus members and that its acquisition depicts unending evolution in order to accomplish optimal fitness. PMID:28798737
2013-01-01
Background: The narrow-leafed lupin, Lupinus angustifolius L., is a grain legume species with a relatively compact genome. The species has 2n = 40 chromosomes and its genome size is 960 Mbp/1C. During the last decade, L. angustifolius genomic studies have achieved several milestones, such as molecular-marker development, linkage maps, and bacterial artificial chromosome (BAC) libraries. Here, these resources were integratively used to identify and sequence two gene-rich regions (GRRs) of the genome. Results: The genome was screened with a probe representing the sequence of a microsatellite fragment length polymorphism (MFLP) marker linked to Phomopsis stem blight resistance. BAC clones selected by hybridization were subjected to restriction fingerprinting and contig assembly, and 232 BAC-ends were sequenced and annotated. BAC fluorescence in situ hybridization (BAC-FISH) identified eight single-locus clones. Based on physical mapping, cytogenetic localization, and BAC-end annotation, five clones were chosen for sequencing. Within the sequences of clones that hybridized in FISH to a single locus, two large GRRs were identified. The GRRs showed strong and conserved synteny to Glycine max duplicated genome regions, illustrated by both identical gene order and parallel orientation. In contrast, in the clones with dispersed FISH signals, more than one-third of sequences were transposable elements. Sequenced, single-locus clones were used to develop 12 genetic markers, increasing the number of L. angustifolius chromosomes linked to appropriate linkage groups by five pairs. Conclusions: In general, probes originating from MFLP sequences can assist genome screening and gene discovery. However, such probes are not useful for positional cloning, because they tend to hybridize to numerous loci. GRRs identified in L. angustifolius contained a low number of interspersed repeats and had a high level of synteny to the genome of the model legume G. max. Our results showed that not only was the gene nucleotide sequence conserved between soybean and lupin GRRs, but the order and orientation of particular genes in syntenic blocks was homologous, as well. These findings will be valuable to the forthcoming sequencing of the lupin genome. PMID:23379841
Gompert, Zachariah; Lucas, Lauren K; Nice, Chris C; Fordyce, James A; Forister, Matthew L; Buerkle, C Alex
2012-07-01
Speciation is the process by which reproductively isolated lineages arise, and is one of the fundamental means by which the diversity of life increases. Whereas numerous studies have documented an association between ecological divergence and reproductive isolation, relatively little is known about the role of natural selection in genome divergence during the process of speciation. Here, we use genome-wide DNA sequences and Bayesian models to test the hypothesis that loci under divergent selection between two butterfly species (Lycaeides idas and L. melissa) also affect fitness in an admixed population. Locus-specific measures of genetic differentiation between L. idas and L. melissa and genomic introgression in hybrids varied across the genome. The most differentiated genetic regions were characterized by elevated L. idas ancestry in the admixed population, which occurs in L. idas-like habitat, consistent with the hypothesis that local adaptation contributes to speciation. Moreover, locus-specific measures of genetic differentiation (a metric of divergent selection) were positively associated with extreme genomic introgression (a metric of hybrid fitness). Interestingly, concordance of differentiation and introgression was only partial. We discuss multiple, complementary explanations for this partial concordance. © 2012 The Author(s).
Besnard, Fabrice; Koutsovoulos, Georgios; Dieudonné, Sana; Blaxter, Mark; Félix, Marie-Anne
2017-01-01
Mapping-by-sequencing has become a standard method to map and identify phenotype-causing mutations in model species. Here, we show that a fragmented draft assembly is sufficient to perform mapping-by-sequencing in nonmodel species. We generated a draft assembly and annotation of the genome of the free-living nematode Oscheius tipulae, a distant relative of the model Caenorhabditis elegans. We used this draft to identify the likely causative mutations at the O. tipulae cov-3 locus, which affect vulval development. The cov-3 locus encodes the O. tipulae ortholog of C. elegans mig-13, and we further show that Cel-mig-13 mutants also have an unsuspected vulval-development phenotype. In a virtuous circle, we were able to use the linkage information collected during mutant mapping to improve the genome assembly. These results showcase the promise of genome-enabled forward genetics in nonmodel species. PMID:28630114
Sequence and Analysis of the Tomato JOINTLESS Locus
Mao, Long; Begum, Dilara; Goff, Stephen A.; Wing, Rod A.
2001-01-01
A 119-kb bacterial artificial chromosome from the JOINTLESS locus on the tomato (Lycopersicon esculentum) chromosome 11 contained 15 putative genes. Repetitive sequences in this region include one copia-like LTR retrotransposon, 13 simple sequence repeats, three copies of a novel type III foldback transposon, and four putative short DNA repeats. Database searches showed that the foldback transposon and the short DNA repeats seemed to be associated preferably with genes. The predicted tomato genes were compared with the complete Arabidopsis genome. Eleven out of 15 tomato open reading frames were found to be colinear with segments on five Arabidopsis bacterial artificial chromosome/P1-derived artificial chromosome clones. The synteny patterns, however, did not reveal duplicated segments in Arabidopsis, where over half of the genome is duplicated. Our analysis indicated that the microsynteny between the tomato and Arabidopsis genomes was still conserved at a very small scale but was complicated by the large number of gene families in the Arabidopsis genome. PMID:11457984
Besnard, Fabrice; Koutsovoulos, Georgios; Dieudonné, Sana; Blaxter, Mark; Félix, Marie-Anne
2017-08-01
Mapping-by-sequencing has become a standard method to map and identify phenotype-causing mutations in model species. Here, we show that a fragmented draft assembly is sufficient to perform mapping-by-sequencing in nonmodel species. We generated a draft assembly and annotation of the genome of the free-living nematode Oscheius tipulae, a distant relative of the model Caenorhabditis elegans. We used this draft to identify the likely causative mutations at the O. tipulae cov-3 locus, which affect vulval development. The cov-3 locus encodes the O. tipulae ortholog of C. elegans mig-13, and we further show that Cel-mig-13 mutants also have an unsuspected vulval-development phenotype. In a virtuous circle, we were able to use the linkage information collected during mutant mapping to improve the genome assembly. These results showcase the promise of genome-enabled forward genetics in nonmodel species. Copyright © 2017 by the Genetics Society of America.
2016-10-27
Institute of Infectious Diseases, Fort Detrick, Frederick, Maryland, USA. Running head: Complete Genome Sequence of Y. pestis strain Cadman...Complete Genome Sequence of Pigmentation Negative Yersinia pestis strain Cadman. Sean Lovett, Kitty Chase, Galina Koroleva, Gustavo...we report the genome sequence of Yersinia pestis strain Cadman, an attenuated strain lacking the pgm locus. Y. pestis is the causative agent of
Castaldi, Peter J; Cho, Michael H; Litonjua, Augusto A; Bakke, Per; Gulsvik, Amund; Lomas, David A; Anderson, Wayne; Beaty, Terri H; Hokanson, John E; Crapo, James D; Laird, Nan; Silverman, Edwin K
2011-12-01
Two recent metaanalyses of genome-wide association studies conducted by the CHARGE and SpiroMeta consortia identified novel loci yielding evidence of association at or near genome-wide significance (GWS) with FEV(1) and FEV(1)/FVC. We hypothesized that a subset of these markers would also be associated with chronic obstructive pulmonary disease (COPD) susceptibility. Thirty-two single-nucleotide polymorphisms (SNPs) in or near 17 genes in 11 previously identified GWS spirometric genomic regions were tested for association with COPD status in four COPD case-control study samples (NETT/NAS, the Norway case-control study, ECLIPSE, and the first 1,000 subjects in COPDGene; total sample size, 3,456 cases and 1,906 controls). In addition to testing the 32 spirometric GWS SNPs, we tested a dense panel of imputed HapMap2 SNP markers from the 17 genes located near the 32 GWS SNPs and in a set of 21 well studied COPD candidate genes. Of the previously identified GWS spirometric genomic regions, three loci harbored SNPs associated with COPD susceptibility at a 5% false discovery rate: the 4q24 locus including FLJ20184/INTS12/GSTCD/NPNT, the 6p21 locus including AGER and PPT2, and the 5q33 locus including ADAM19. In conclusion, markers previously associated at or near GWS with spirometric measures were tested for association with COPD status in data from four COPD case-control studies, and three loci showed evidence of association with COPD susceptibility at a 5% false discovery rate.
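The abstract reports associations "at a 5% false discovery rate" without naming the procedure. A common way to control the FDR is the Benjamini-Hochberg step-up rule, sketched below purely as an illustration. The function name and the p-values are hypothetical and are not taken from the study, and the authors may have used a different FDR method.

```python
def benjamini_hochberg(pvalues, fdr=0.05):
    """Return a list of booleans: True where the test is declared significant.

    Step-up rule: find the largest rank k with p_(k) <= fdr * k / m,
    then reject every hypothesis ranked 1..k.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= fdr * rank / m:
            max_rank = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected


# Hypothetical p-values for five SNPs (illustrative only).
example_pvalues = [0.001, 0.008, 0.035, 0.039, 0.6]
print(benjamini_hochberg(example_pvalues))
```

In this made-up example the third p-value (0.035) misses its own cutoff of 0.03, but it is still rejected because the fourth (0.039) meets its cutoff of 0.04. That is the step-up behaviour that distinguishes this rule from a fixed per-test threshold.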
Pathogenetics of alveolar capillary dysplasia with misalignment of pulmonary veins.
Szafranski, Przemyslaw; Gambin, Tomasz; Dharmadhikari, Avinash V; Akdemir, Kadir Caner; Jhangiani, Shalini N; Schuette, Jennifer; Godiwala, Nihal; Yatsenko, Svetlana A; Sebastian, Jessica; Madan-Khetarpal, Suneeta; Surti, Urvashi; Abellar, Rosanna G; Bateman, David A; Wilson, Ashley L; Markham, Melinda H; Slamon, Jill; Santos-Simarro, Fernando; Palomares, María; Nevado, Julián; Lapunzina, Pablo; Chung, Brian Hon-Yin; Wong, Wai-Lap; Chu, Yoyo Wing Yiu; Mok, Gary Tsz Kin; Kerem, Eitan; Reiter, Joel; Ambalavanan, Namasivayam; Anderson, Scott A; Kelly, David R; Shieh, Joseph; Rosenthal, Taryn C; Scheible, Kristin; Steiner, Laurie; Iqbal, M Anwar; McKinnon, Margaret L; Hamilton, Sara Jane; Schlade-Bartusiak, Kamilla; English, Dawn; Hendson, Glenda; Roeder, Elizabeth R; DeNapoli, Thomas S; Littlejohn, Rebecca Okashah; Wolff, Daynna J; Wagner, Carol L; Yeung, Alison; Francis, David; Fiorino, Elizabeth K; Edelman, Morris; Fox, Joyce; Hayes, Denise A; Janssens, Sandra; De Baere, Elfride; Menten, Björn; Loccufier, Anne; Vanwalleghem, Lieve; Moerman, Philippe; Sznajer, Yves; Lay, Amy S; Kussmann, Jennifer L; Chawla, Jasneek; Payton, Diane J; Phillips, Gael E; Brosens, Erwin; Tibboel, Dick; de Klein, Annelies; Maystadt, Isabelle; Fisher, Richard; Sebire, Neil; Male, Alison; Chopra, Maya; Pinner, Jason; Malcolm, Girvan; Peters, Gregory; Arbuckle, Susan; Lees, Melissa; Mead, Zoe; Quarrell, Oliver; Sayers, Richard; Owens, Martina; Shaw-Smith, Charles; Lioy, Janet; McKay, Eileen; de Leeuw, Nicole; Feenstra, Ilse; Spruijt, Liesbeth; Elmslie, Frances; Thiruchelvam, Timothy; Bacino, Carlos A; Langston, Claire; Lupski, James R; Sen, Partha; Popek, Edwina; Stankiewicz, Paweł
2016-05-01
Alveolar capillary dysplasia with misalignment of pulmonary veins (ACDMPV) is a lethal lung developmental disorder caused by heterozygous point mutations or genomic deletion copy-number variants (CNVs) of FOXF1 or its upstream enhancer involving fetal lung-expressed long noncoding RNA genes LINC01081 and LINC01082. Using custom-designed array comparative genomic hybridization, Sanger sequencing, whole exome sequencing (WES), and bioinformatic analyses, we studied 22 new unrelated families (20 postnatal and two prenatal) with clinically diagnosed ACDMPV. We describe novel deletion CNVs at the FOXF1 locus in 13 unrelated ACDMPV patients. Together with the previously reported cases, all 31 genomic deletions in 16q24.1, pathogenic for ACDMPV, for which parental origin was determined, arose de novo with 30 of them occurring on the maternally inherited chromosome 16, strongly implicating genomic imprinting of the FOXF1 locus in human lungs. Surprisingly, we have also identified four ACDMPV families with the pathogenic variants in the FOXF1 locus that arose on paternal chromosome 16. Interestingly, a combination of the severe cardiac defects, including hypoplastic left heart, and single umbilical artery were observed only in children with deletion CNVs involving FOXF1 and its upstream enhancer. Our data demonstrate that genomic imprinting at 16q24.1 plays an important role in variable ACDMPV manifestation likely through long-range regulation of FOXF1 expression, and may be also responsible for key phenotypic features of maternal uniparental disomy 16. Moreover, in one family, WES revealed a de novo missense variant in ESRP1, potentially implicating FGF signaling in the etiology of ACDMPV.
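To give a sense of how skewed "30 of 31 deletions on the maternal chromosome" is, the sketch below computes an exact binomial tail probability. It assumes a null hypothesis in which each de novo deletion is equally likely to arise on either parental chromosome and the events are independent. This is an illustrative calculation, not an analysis reported in the abstract.

```python
from math import comb

n_total = 31      # deletions with parental origin determined (from the abstract)
n_maternal = 30   # of those, deletions on the maternally inherited chromosome 16

# Number of outcomes at least as extreme as 30 maternal out of 31.
tail_count = sum(comb(n_total, k) for k in range(n_maternal, n_total + 1))

# Assumption: null of p = 0.5 per event, independent events.
p_one_sided = tail_count / 2 ** n_total
p_two_sided = min(1.0, 2 * p_one_sided)

print(tail_count, p_one_sided, p_two_sided)
```

Only 32 of the 2^31 equally likely outcomes are this extreme, giving a one-sided probability of about 1.5 × 10^-8 under the stated null.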
Pathogenetics of Alveolar Capillary Dysplasia with Misalignment of Pulmonary Veins
Szafranski, Przemyslaw; Gambin, Tomasz; Dharmadhikari, Avinash V.; Akdemir, Kadir Caner; Jhangiani, Shalini N.; Schuette, Jennifer; Godiwala, Nihal; Yatsenko, Svetlana A.; Sebastian, Jessica; Madan-Khetarpal, Suneeta; Surti, Urvashi; Abellar, Rosanna G.; Bateman, David A.; Wilson, Ashley L.; Markham, Melinda H.; Slamon, Jill; Santos-Simarro, Fernando; Palomares, María; Nevado, Julián; Lapunzina, Pablo; Chung, Brian Hon-Yin; Wong, Wai-Lap; Chu, Yoyo Wing Yiu; Mok, Gary Tsz Kin; Kerem, Eitan; Reiter, Joel; Ambalavanan, Namasivayam; Anderson, Scott A.; Kelly, David R.; Shieh, Joseph; Rosenthal, Taryn C.; Scheible, Kristin; Steiner, Laurie; Iqbal, M. Anwar; McKinnon, Margaret; Hamilton, Sara Jane; Schlade-Bartusiak, Kamilla; English, Dawn; Hendson, Glenda; Roeder, Elizabeth R.; DeNapoli, Thomas S.; Littlejohn, Rebecca Okashah; Wolff, Daynna J.; Wagner, Carol L.; Yeung, Alison; Francis, David; Fiorino, Elizabeth K.; Edelman, Morris; Fox, Joyce; Hayes, Denise A.; Janssens, Sandra; De Baere, Elfride; Menten, Bjorn; Loccufier, Anne; Van Walleghem, Lieve; Moerman, Philippe; Sznajer, Yves; Lay, Amy S.; Kussmann, Jennifer L.; Chawla, Jasneek; Payton, Diane J.; Phillips, Gael E.; Brosens, Erwin; Tibboel, Dick; de Klein, Annelies; Maystadt, Isabelle; Fisher, Richard; Sebire, Neil; Male, Alison; Chopra, Maya; Pinner, Jason; Malcolm, Girvan; Peters, Gregory; Arbuckle, Susan; Lees, Melissa; Mead, Zoe; Quarrell, Oliver; Sayers, Richard; Owens, Martina; Shaw-Smith, Charles; Lioy, Janet; McKay, Eileen; de Leeuw, Nicole; Feenstra, Ilse; Spruijt, Liesbeth; Elmslie, Frances; Thiruchelvam, Timothy; Bacino, Carlos A.; Langston, Claire; Lupski, James R.; Sen, Partha; Popek, Edwina; Stankiewicz, Paweł
2017-01-01
Alveolar capillary dysplasia with misalignment of pulmonary veins (ACDMPV) is a lethal lung developmental disorder caused by heterozygous point mutations or genomic deletion copy-number variants (CNVs) of FOXF1 or its upstream enhancer involving fetal lung-expressed long noncoding RNA genes LINC01081 and LINC01082. Using custom-designed array comparative genomic hybridization, Sanger sequencing, whole exome sequencing (WES), and bioinformatic analyses, we studied 22 new unrelated families (20 postnatal and two prenatal) with clinically diagnosed ACDMPV. We describe novel deletion CNVs at the FOXF1 locus in 13 unrelated ACDMPV patients. Together with the previously reported cases, all 31 genomic deletions in 16q24.1, pathogenic for ACDMPV, for which parental origin was determined, arose de novo with 30 of them occurring on the maternally inherited chromosome 16, strongly implicating genomic imprinting of the FOXF1 locus in human lungs. Surprisingly, we have also identified four ACDMPV families with the pathogenic variants in the FOXF1 locus that arose on paternal chromosome 16. Interestingly, a combination of the severe cardiac defects, including hypoplastic left heart, and single umbilical artery were observed only in children with deletion CNVs involving FOXF1 and its upstream enhancer. Our data demonstrate that genomic imprinting at 16q24.1 plays an important role in variable ACDMPV manifestation likely through long-range regulation of FOXF1 expression, and may be also responsible for key phenotypic features of maternal uniparental disomy 16. Moreover, in one family, WES revealed a de novo missense variant in ESRP1, potentially implicating FGF signaling in the etiology of ACDMPV. PMID:27071622
Genome-wide association analysis of ischemic stroke in young adults.
Cheng, Yu-Ching; O'Connell, Jeffrey R; Cole, John W; Stine, O Colin; Dueker, Nicole; McArdle, Patrick F; Sparks, Mary J; Shen, Jess; Laurie, Cathy C; Nelson, Sarah; Doheny, Kimberly F; Ling, Hua; Pugh, Elizabeth W; Brott, Thomas G; Brown, Robert D; Meschia, James F; Nalls, Michael; Rich, Stephen S; Worrall, Bradford; Anderson, Christopher D; Biffi, Alessandro; Cortellini, Lynelle; Furie, Karen L; Rost, Natalia S; Rosand, Jonathan; Manolio, Teri A; Kittner, Steven J; Mitchell, Braxton D
2011-11-01
Ischemic stroke (IS) is among the leading causes of death in Western countries. There is a significant genetic component to IS susceptibility, especially among young adults. To date, research to identify genetic loci predisposing to stroke has met only with limited success. We performed a genome-wide association (GWA) analysis of early-onset IS to identify potential stroke susceptibility loci. The GWA analysis was conducted by genotyping 1 million SNPs in a biracial population of 889 IS cases and 927 controls, ages 15-49 years. Genotypes were imputed using the HapMap3 reference panel to provide 1.4 million SNPs for analysis. Logistic regression models adjusting for age, recruitment stages, and population structure were used to determine the association of IS with individual SNPs. Although no single SNP reached genome-wide significance (P < 5 × 10(-8)), we identified two SNPs in chromosome 2q23.3, rs2304556 (in FMNL2; P = 1.2 × 10(-7)) and rs1986743 (in ARL6IP6; P = 2.7 × 10(-7)), strongly associated with early-onset stroke. These data suggest that a novel locus on human chromosome 2q23.3 may be associated with IS susceptibility among young adults.
Togashi, Yuki; Dobashi, Akito; Sakata, Seiji; Sato, Yukiko; Baba, Satoko; Seto, Akira; Mitani, Hiroki; Kawabata, Kazuyoshi; Takeuchi, Kengo
2018-02-06
MYB-NFIB and MYBL1-NFIB have been reported in ~60% of adenoid cystic carcinoma cases, but driver alterations in the remaining ~40% of adenoid cystic carcinoma remain unclear. We examined 100 adenoid cystic carcinoma cases for MYB and MYBL1 locus rearrangements by fluorescence in situ hybridization (FISH) with originally designed probe sets using formalin-fixed paraffin-embedded materials. Approximately one-third of samples were also analyzed by fusion transcript-specific RT-PCR and capture RNA sequencing. In the 27 cases with frozen materials, MYB-NFIB and MYBL1-NFIB fusion transcripts were detected in 9 (33%) and 6 cases (22%) by RT-PCR, respectively. Meanwhile, high expression of MYB (18 cases, 67%) or MYBL1 (9 cases, 33%) was detected in all 27 cases in a mutually exclusive manner, regardless of its form (full-length, truncation, or fusion transcript). Interestingly, genomic rearrangements around the corresponding highly-expressed gene were observed in all 27 cases by FISH, suggesting a causative relationship between genomic rearrangements and gene expression. Among the 100 cases, including additional 73 cases, 97 harbored genomic rearrangements in the MYB (73 cases) or MYBL1 locus (24 cases) including 10 cases with atypical FISH patterns undetectable through ordinary split FISH approaches: breakpoints far distant from MYB (5 cases) and a small NFIB locus insertion into the MYB (3 cases) or MYBL1 locus (2 cases). In clinicopathological analyses, histological grade, primary tumor size, and lymph node metastasis were identified as prognostic factors, whereas MYB/MYBL1 rearrangements were not, but were associated with histological grade. In the present study, MYB or MYBL1 locus rearrangement was detected in nearly all adenoid cystic carcinoma cases, and therefore it would be a good diagnostic marker for adenoid cystic carcinoma. 
However, fusion transcript-specific RT-PCR for MYB-NFIB and MYBL1-NFIB and ordinary split FISH assays for MYB and MYBL1 were less sensitive, and thus detection methods should be judiciously designed because of the diversity of rearrangement modes in adenoid cystic carcinoma.
Reitzel, A M; Herrera, S; Layden, M J; Martindale, M Q; Shank, T M
2013-06-01
Characterization of large numbers of single-nucleotide polymorphisms (SNPs) throughout a genome has the power to refine the understanding of population demographic history and to identify genomic regions under selection in natural populations. To this end, population genomic approaches that harness the power of next-generation sequencing to understand the ecology and evolution of marine invertebrates represent a boon to test long-standing questions in marine biology and conservation. We employed restriction-site-associated DNA sequencing (RAD-seq) to identify SNPs in natural populations of the sea anemone Nematostella vectensis, an emerging cnidarian model with a broad geographic range in estuarine habitats in North and South America, and portions of England. We identified hundreds of SNP-containing tags in thousands of RAD loci from 30 barcoded individuals inhabiting four locations from Nova Scotia to South Carolina. Population genomic analyses using high-confidence SNPs resulted in a highly-resolved phylogeography, a result not achieved in previous studies using traditional markers. Plots of locus-specific FST against heterozygosity suggest that a majority of polymorphic sites are neutral, with a smaller proportion suggesting evidence for balancing selection. Loci inferred to be under balancing selection were mapped to the genome, where 90% were located in gene bodies, indicating potential targets of selection. The results from analyses with and without a reference genome supported similar conclusions, further highlighting RAD-seq as a method that can be efficiently applied to species lacking existing genomic resources. We discuss the utility of RAD-seq approaches in burgeoning Nematostella research as well as in other cnidarian species, particularly corals and jellyfishes, to determine phylogeographic relationships of populations and identify regions of the genome undergoing selection. © 2013 John Wiley & Sons Ltd.
Reitzel, A.M.; Herrera, S.; Layden, M.J.; Martindale, M.Q.; Shank, T.M.
2013-01-01
Characterization of large numbers of single nucleotide polymorphisms (SNPs) throughout a genome has the power to refine the understanding of population demographic history and to identify genomic regions under selection in natural populations. To this end, population genomic approaches that harness the power of next-generation sequencing to understand the ecology and evolution of marine invertebrates represent a boon to test long-standing questions in marine biology and conservation. We employed restriction-site-associated DNA sequencing (RAD-seq) to identify SNPs in natural populations of the sea anemone Nematostella vectensis, an emerging cnidarian model with a broad geographic range in estuarine habitats in North and South America, and portions of England. We identified hundreds of SNP-containing tags in thousands of RAD loci from 30 barcoded individuals inhabiting four locations from Nova Scotia to South Carolina. Population genomic analyses using high-confidence SNPs resulted in a highly-resolved phylogeography, a result not achieved in previous studies using traditional markers. Plots of locus-specific FST against heterozygosity suggest that a majority of polymorphic sites are neutral, with a smaller proportion suggesting evidence for balancing selection. Loci inferred to be under balancing selection were mapped to the genome, where 90% were located in gene bodies, indicating potential targets of selection. Results from analyses with and without a reference genome supported similar conclusions, further supporting RAD-seq as a method that can be efficiently applied to species lacking existing genomic resources. We discuss the utility of RAD-seq approaches in burgeoning Nematostella research as well as in other cnidarian species, particularly corals, to determine phylogeographic relationships of populations and identify regions of the genome undergoing selection. PMID:23473066
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gerasimov, V.A.; Yanenko, A.S.; Akhverdyan, V.Z.
1986-04-01
Bacteriophage D3112 forms two types of PAO1 (D3112) lysogens: those that partially, or completely, limit the growth of the related heteroimmune phage B39. DNA/DNA hybridization has shown that the lysogens of the first type always contain one copy of prophage D3112 (monolysogens), and the lysogens of the second type contain two or more copies of prophage D3112. Limitation of the growth of phage B39 on PAO1 (D3112) lysogens is associated with the functioning of the locus of prophage D3112, designated as cip (control of interaction of phages). Using deletion derivatives of plasmid RP4::D3112, the cip locus was mapped at an interval of 1.3-2.45 kb of the D3112 genome. The expression of the cip locus occurs only if the D3112 genome is at the prophage state. The function of the Cip prophage of D3112 exerts an influence on early stages of development of phage B39, decreasing the efficiency of the integration and transposition processes of phage B39.
Coruzzi, G; Trembath, M K; Tzagoloff, A
1978-12-01
Two mutants of Saccharomyces cerevisiae which show a loss of mitochondrial rutamycin-sensitive ATPase activity are described. Although phenotypically similar to mutants of the mitochondrial locus pho1 [F. Foury and A. Tzagoloff (1976) Eur. J. Biochem. 68, 113-119], these mutants define a second ATPase locus on the mitochondrial DNA (designated pho2), which is genetically unlinked to pho1. Analysis of recombination in crosses involving multiple antibiotic resistance markers indicates that the locus is in the segment of the genome between ery1 and oli2, very close to oli1. In fact it is proposed that the oli1 and pho2 mutations are in the same gene. Supporting evidence for this proposal includes: 1. The analysis of marker retention in petite mutants shows that the oli1 and pho2 loci were either retained or lost together in all cases. 2. Recombination frequencies of 0.05% or less are observed in crosses between the oli1 and pho2 loci. 3. When rho+ revertants are isolated from the pho2 mutants they frequently are oligomycin resistant. 4. pho2 mutants have an altered subunit 9 of the ATPase complex.
Grewal, S I; Han, B; Johnstone, K
1995-01-01
Pseudomonas tolaasii, the causal agent of brown blotch disease of Agaricus bisporus, spontaneously gives rise to morphologically distinct stable sectors, referred to as the phenotypic variant form, at the margins of the wild-type colonies. The phenotypic variant form is nonpathogenic and differs from the wild type in a range of biochemical and physiological characteristics. A genomic cosmid clone (pSISG29) from a wild-type P. tolaasii library was shown to be capable of restoring a range of characteristics of the phenotypic variant to those of the wild-type form, when present in trans. Subcloning and saturation mutagenesis analysis with Tn5lacZ localized a 3.0-kb region from pSISG29, designated the pheN locus, required for complementation of the phenotypic variant to the wild-type form. Marker exchange of the Tn5lacZ-mutagenized copy of the pheN locus into the wild-type strain demonstrated that a functional copy of the pheN gene is required to maintain the wild-type pathogenic phenotype and that loss of the pheN gene or its function results in conversion of the wild-type form to the phenotypic variant form. The pheN locus contained a 2,727-bp open reading frame encoding an 83-kDa protein. The predicted amino acid sequence of the PheN protein showed homology to the sensor and regulator domains of the conserved family of two component bacterial sensor regulator proteins. Southern hybridization analysis of pheN genes from the wild type and the phenotypic variant form revealed that DNA rearrangement occurs within the pheN locus during phenotypic variation. Analysis of pheN expression with a pheN::lacZ fusion demonstrated that expression is regulated by environmental factors. These results are related to a model for control for phenotypic variation in P. tolaasii. PMID:7642492
Raynard, Steven J; Baker, Mark D
2004-01-01
In mammalian cells, little is known about the nature of recombination-prone regions of the genome. Previously, we reported that the immunoglobulin heavy chain (IgH) mu locus behaved as a hotspot for mitotic, intrachromosomal gene conversion (GC) between repeated mu constant (Cmu) regions in mouse hybridoma cells. To investigate whether elements within the mu gene regulatory region were required for hotspot activity, gene targeting was used to delete a 9.1 kb segment encompassing the mu gene promoter (Pmu), enhancer (Emu) and switch region (Smu) from the locus. In these cell lines, GC between the Cmu repeats was significantly reduced, indicating that this 'recombination-enhancing sequence' (RES) is necessary for GC hotspot activity at the IgH locus. Importantly, the RES fragment stimulated GC when appended to the same Cmu repeats integrated at ectopic genomic sites. We also show that deletion of Emu and flanking matrix attachment regions (MARs) from the RES abolishes GC hotspot activity at the IgH locus. However, no stimulation of ectopic GC was observed with the Emu/MARs fragment alone. Finally, we provide evidence that no correlation exists between the level of transcription and GC promoted by the RES. We suggest a model whereby Emu/MARs enhances mitotic GC at the endogenous IgH mu locus by effecting chromatin modifications in adjacent DNA.
Cytogenetic features of rRNA genes across land plants: analysis of the Plant rDNA database.
Garcia, Sònia; Kovařík, Ales; Leitch, Andrew R; Garnatje, Teresa
2017-03-01
The online resource http://www.plantrdnadatabase.com/ stores information on the number, chromosomal locations and structure of the 5S and 18S-5.8S-26S (35S) ribosomal DNAs (rDNA) in plants. This resource was exploited to study relationships between rDNA locus number, distribution, the occurrence of linked (L-type) and separated (S-type) 5S and 35S rDNA units, chromosome number, genome size and ploidy level. The analyses presented summarise current knowledge on rDNA locus numbers and distribution in plants. We analysed 2949 karyotypes, from 1791 species and 86 plant families, and performed ancestral character state reconstructions. The ancestral karyotype (2n = 16) has two terminal 35S sites and two interstitial 5S sites, while the median (2n = 24) presents four terminal 35S sites and three interstitial 5S sites. Whilst 86.57% of karyotypes show S-type organisation (ancestral condition), the L-type arrangement has arisen independently several times during plant evolution. A non-terminal position of 35S rDNA was found in about 25% of single-locus karyotypes, suggesting that terminal locations are not essential for functionality and expression. Single-locus karyotypes are very common, even in polyploids. In this regard, polyploidy is followed by subsequent locus loss. This results in a decrease in locus number per monoploid genome, forming part of the diploidisation process returning polyploids to a diploid-like state over time. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
A Mendelian locus on chromosome 16 determines susceptibility to doxorubicin nephropathy in the mouse
Zheng, Zongyu; Schmidt-Ott, Kai M.; Chua, Streamson; Foster, Kirk A.; Frankel, Rachelle Z.; Pavlidis, Paul; Barasch, Jonathan; D'Agati, Vivette D.; Gharavi, Ali G.
2005-01-01
The development of kidney disease is influenced by both genetic and environmental factors. Searching for models of glomerulopathy that display strong gene–environment interaction, we examined the determinants of anthracycline-induced nephropathy, a classic, strain-dependent experimental model applied to rodents in the past four decades. We produced three crosses derived from mice with contrasting susceptibility to doxorubicin (DOX) nephropathy and, surprisingly, we found that this widely studied model segregates as a single-gene defect with recessive inheritance. By genome-wide analysis of linkage, we mapped the trait locus to chromosome 16A1-B1 (DOXNPH locus) in all three crosses [peak logarithm of odds (lod) score of 92.7, P = 1 × 10^-65]; this interval represents a susceptibility locus for nephropathy. Gene expression analysis indicated that susceptibility alleles at the DOXNPH locus are associated with blunted expression of protein arginine methyltransferase 7 (Prmt7) on chromosome 8, a protein previously implicated in cellular sensitivity to chemotherapeutic agents (lod = 12.4, P = 0.0001). Therefore, Prmt7 expression serves as a molecular marker for susceptibility to DOX nephropathy. Finally, increased variation in the severity of kidney disease among affected mice motivated a second genome-wide search, identifying a locus on chromosome 9 that influences the severity and progression of nephropathy (DOXmod, peak lod score 4.3, P = 0.0018). These data provide genetic and molecular characterization of a previously unrecognized Mendelian trait. Elucidation of DOX nephropathy may simultaneously provide insight into the pathogenesis of renal failure and mechanisms of cytotoxicity induced by chemotherapeutic agents. PMID:15699352
Richardson, Kris; Schnitzler, Gavin R; Lai, Chao-Qiang; Ordovas, Jose M
2015-12-01
Cardiovascular disease and type 2 diabetes mellitus represent overlapping diseases where a large portion of the variation attributable to genetics remains unexplained. An important player in their pathogenesis is peroxisome proliferator-activated receptor γ (PPARγ), which is involved in lipid and glucose metabolism and maintenance of metabolic homeostasis. We used a functional genomics methodology to interrogate human chromatin immunoprecipitation-sequencing, genome-wide association studies, and expression quantitative trait locus data to inform selection of candidate functional single nucleotide polymorphisms (SNPs) falling in PPARγ motifs. We derived 27 328 chromatin immunoprecipitation-sequencing peaks for PPARγ in human adipocytes through meta-analysis of 3 data sets. The PPARγ consensus motif showed greatest enrichment and mapped to 8637 peaks. We identified 146 SNPs in these motifs. This number was significantly less than would be expected by chance, and Inference of Natural Selection from Interspersed Genomically coHerent elemenTs analysis indicated that these motifs are under weak negative selection. A screen of these SNPs against genome-wide association studies for cardiometabolic traits revealed significant enrichment with 16 SNPs. A screen against the MuTHER expression quantitative trait locus data revealed 8 of these were significantly associated with altered gene expression in human adipose, more than would be expected by chance. Several SNPs fall close to, or are linked by expression quantitative trait locus to, lipid-metabolism loci including CYP26A1. We demonstrated the use of functional genomics to identify SNPs of potential function. Specifically, SNPs within PPARγ motifs that bind PPARγ in adipocytes are significantly associated with cardiometabolic disease and with the regulation of transcription in adipose. This method may be used to uncover functional SNPs that do not reach significance thresholds in the agnostic approach of genome-wide association studies. © 2015 American Heart Association, Inc.
Genome Sequence of the Shiga Toxin-Producing Escherichia coli Strain NCCP15657
Kim, Byung Kwon; Song, Geun Cheol; Hong, Gun Hyong; Seong, Won-Keun; Kim, Seon-Young; Jeong, Haeyoung; Kang, Sung Gyun; Kwon, Soon-Kyeong; Lee, Choong Hoon; Song, Ju Yeon; Yu, Dong Su; Park, Mi-Sun
2012-01-01
Shiga toxin-producing Escherichia coli causes bloody diarrhea and hemolytic-uremic syndrome and serious outbreaks worldwide. Here, we report the draft genome sequence of E. coli NCCP15657 isolated from a patient. The genome has virulence genes, many in the locus of enterocyte effacement (LEE) island, encoding a metalloprotease, the Shiga toxin, and constituents of type III secretion. PMID:22740674
DNMT1-interacting RNAs block gene specific DNA methylation
Di Ruscio, Annalisa; Ebralidze, Alexander K.; Benoukraf, Touati; Amabile, Giovanni; Goff, Loyal A.; Terragni, Joylon; Figueroa, Maria Eugenia; De Figureido Pontes, Lorena Lobo; Alberich-Jorda, Meritxell; Zhang, Pu; Wu, Mengchu; D’Alò, Francesco; Melnick, Ari; Leone, Giuseppe; Ebralidze, Konstantin K.; Pradhan, Sriharsa; Rinn, John L.; Tenen, Daniel G.
2013-01-01
Summary: DNA methylation was described almost a century ago. However, the rules governing its establishment and maintenance remain elusive. Here, we present data demonstrating that active transcription regulates levels of genomic methylation. We identified a novel RNA arising from the CEBPA gene locus critical in regulating the local DNA methylation profile. This RNA binds to DNMT1 and prevents CEBPA gene locus methylation. Deep sequencing of transcripts associated with DNMT1 combined with genome-scale methylation and expression profiling extended the generality of this finding to numerous gene loci. Collectively, these results delineate the nature of DNMT1-RNA interactions and suggest strategies for gene selective demethylation of therapeutic targets in disease. PMID:24107992
Jasinska, Anna J; Zelaya, Ivette; Service, Susan K; Peterson, Christine B; Cantor, Rita M; Choi, Oi-Wa; DeYoung, Joseph; Eskin, Eleazar; Fairbanks, Lynn A; Fears, Scott; Furterer, Allison E; Huang, Yu S; Ramensky, Vasily; Schmitt, Christopher A; Svardal, Hannes; Jorgensen, Matthew J; Kaplan, Jay R; Villar, Diego; Aken, Bronwen L; Flicek, Paul; Nag, Rishi; Wong, Emily S; Blangero, John; Dyer, Thomas D; Bogomolov, Marina; Benjamini, Yoav; Weinstock, George M; Dewar, Ken; Sabatti, Chiara; Wilson, Richard K; Jentsch, J David; Warren, Wesley; Coppola, Giovanni; Woods, Roger P; Freimer, Nelson B
2017-12-01
By analyzing multitissue gene expression and genome-wide genetic variation data in samples from a vervet monkey pedigree, we generated a transcriptome resource and produced the first catalog of expression quantitative trait loci (eQTLs) in a nonhuman primate model. This catalog contains more genome-wide significant eQTLs per sample than comparable human resources and identifies sex- and age-related expression patterns. Findings include a master regulatory locus that likely has a role in immune function and a locus regulating hippocampal long noncoding RNAs (lncRNAs), whose expression correlates with hippocampal volume. This resource will facilitate genetic investigation of quantitative traits, including brain and behavioral phenotypes relevant to neuropsychiatric disorders.
The correlation between relatives on the supposition of genomic imprinting.
Spencer, Hamish G
2002-01-01
Standard genetic analyses assume that reciprocal heterozygotes are, on average, phenotypically identical. If a locus is subject to genomic imprinting, however, this assumption does not hold. We incorporate imprinting into the standard quantitative-genetic model for two alleles at a single locus, deriving expressions for the additive and dominance components of genetic variance, as well as measures of resemblance among relatives. We show that, in contrast to the case with Mendelian expression, the additive and dominance deviations are correlated. In principle, this correlation allows imprinting to be detected solely on the basis of different measures of familial resemblances, but in practice, the standard error of the estimate is likely to be too large for a test to have much statistical power. The effects of genomic imprinting will need to be incorporated into quantitative-genetic models of many traits, for example, those concerned with mammalian birthweight. PMID:12019254
Li, Xiang; Tambong, James; Yuan, Kat (Xiaoli); Chen, Wen; Xu, Huimin; Lévesque, C. André; De Boer, Solke H.
2018-01-01
Although the genus Clavibacter was originally proposed to accommodate all phytopathogenic coryneform bacteria containing B2γ diaminobutyrate in the peptidoglycan, reclassification of all but one species into other genera has resulted in the current monospecific status of the genus. The single species in the genus, Clavibacter michiganensis, has multiple subspecies, which are all highly host-specific plant pathogens. Whole genome analysis based on average nucleotide identity and digital DNA–DNA hybridization as well as multi-locus sequence analysis (MLSA) of seven housekeeping genes support raising each of the C. michiganensis subspecies to species status. On the basis of whole genome and MLSA data, we propose the establishment of two new species and three new combinations: Clavibacter capsici sp. nov., comb. nov. and Clavibacter tessellarius sp. nov., comb. nov., and Clavibacter insidiosus comb. nov., Clavibacter nebraskensis comb. nov. and Clavibacter sepedonicus comb. nov. PMID:29160202
CSGRqtl: A Comparative Quantitative Trait Locus Database for Saccharinae Grasses.
Zhang, Dong; Paterson, Andrew H
2017-01-01
Conventional biparental quantitative trait locus (QTL) mapping has led to some successes in the identification of causal genes in many organisms. QTL likelihood intervals not only provide "prior information" for finer-resolution approaches such as GWAS but also provide better statistical power than GWAS to detect variants with low/rare frequency in a natural population. Here, we describe a new element of an ongoing effort to provide online resources to facilitate study and improvement of the important Saccharinae clade. The primary goal of this new resource is the anchoring of published QTLs for this clade to the Sorghum genome. Genetic map alignments translate a wealth of genomic information from sorghum to Saccharum spp., Miscanthus spp., and other taxa. In addition, genome alignments facilitate comparison of the Saccharinae QTL sets to those of other taxa that enjoy comparable resources, exemplified herein by rice.
Site-Specific Editing of the Plasmodium falciparum Genome Using Engineered Zinc-Finger Nucleases
Straimer, Judith; Lee, Marcus CS; Lee, Andrew H; Zeitler, Bryan; Williams, April E; Pearl, Jocelynn R; Zhang, Lei; Rebar, Edward J; Gregory, Philip D; Llinás, Manuel; Urnov, Fyodor D; Fidock, David A
2013-01-01
Malaria afflicts over 200 million people worldwide and its most lethal etiologic agent, Plasmodium falciparum, is evolving to resist even the latest-generation therapeutics. Efficient tools for genome-directed investigations of P. falciparum pathogenesis, including drug resistance mechanisms, are clearly required. Here we report rapid and targeted genetic engineering of this parasite, using zinc-finger nucleases (ZFNs) that produce a double-strand break in a user-defined locus and trigger homology-directed repair. Targeting an integrated egfp locus, we obtained gene deletion parasites with unprecedented speed (two weeks), both with and without direct selection. ZFNs engineered against the endogenous parasite gene pfcrt, responsible for chloroquine treatment escape, rapidly produced parasites that carried either an allelic replacement or a panel of specified point mutations. The efficiency, versatility and precision of this method will enable a diverse array of genome editing approaches to interrogate this human pathogen. PMID:22922501
Arbab, Mandana; Srinivasan, Sharanya; Hashimoto, Tatsunori; Geijsen, Niels; Sherwood, Richard I.
2015-01-01
Summary: We present self-cloning CRISPR/Cas9 (scCRISPR), a technology that allows for CRISPR/Cas9-mediated genomic mutation and site-specific knockin transgene creation within several hours by circumventing the need to clone a site-specific single-guide RNA (sgRNA) or knockin homology construct for each target locus. We introduce a self-cleaving palindromic sgRNA plasmid and a short double-stranded DNA sequence encoding the desired locus-specific sgRNA into target cells, allowing them to produce a locus-specific sgRNA plasmid through homologous recombination. scCRISPR enables efficient generation of gene knockouts (∼88% mutation rate) at approximately one-sixth the cost of plasmid-based sgRNA construction with only 2 hr of preparation for each targeted site. Additionally, we demonstrate efficient site-specific knockin of GFP transgenes without any plasmid cloning or genome-integrated selection cassette in mouse and human embryonic stem cells (2%–4% knockin rate) through PCR-based addition of short homology arms. scCRISPR substantially lowers the bar on mouse and human transgenesis. PMID:26527385
Wunderlich, K R; Abbey, C A; Clayton, D R; Song, Y; Schein, J E; Georges, M; Coppieters, W; Adelson, D L; Taylor, J F; Davis, S L; Gill, C A
2006-12-01
The polled locus has been mapped by genetic linkage analysis to the proximal region of bovine chromosome 1. As an intermediate step in our efforts to identify the polled locus and the underlying causative mutation for the polled phenotype, we have constructed a BAC-based physical map of the interval containing the polled locus. Clones containing genes and markers in the critical interval were isolated from the TAMBT (constructed from Angus and Longhorn genomic DNA) and CHORI-240 (constructed from horned Hereford genomic DNA) BAC libraries and ordered based on fingerprinting and the presence or absence of 80 STS markers. A single contig spanning 2.5 Mb was assembled. Comparison of the physical order of STSs to the corresponding region of human chromosome 21 revealed the same order of genes within the polled critical interval. This contig of overlapping BAC clones from horned and polled breeds is a useful resource for SNP discovery and characterization of positional candidate genes.
Senís, Elena; Mockenhaupt, Stefan; Rupp, Daniel; Bauer, Tobias; Paramasivam, Nagarajan; Knapp, Bettina; Gronych, Jan; Grosse, Stefanie; Windisch, Marc P.; Schmidt, Florian; Theis, Fabian J.; Eils, Roland; Lichter, Peter; Schlesner, Matthias; Bartenschlager, Ralf; Grimm, Dirk
2017-01-01
Successful RNAi applications depend on strategies allowing robust and persistent expression of minimal gene silencing triggers without perturbing endogenous gene expression. Here, we propose a novel avenue which is integration of a promoterless shmiRNA, i.e. a shRNA embedded in a micro-RNA (miRNA) scaffold, into an engineered genomic miRNA locus. For proof-of-concept, we used TALE or CRISPR/Cas9 nucleases to site-specifically integrate an anti-hepatitis C virus (HCV) shmiRNA into the liver-specific miR-122/hcr locus in hepatoma cells, with the aim to obtain cellular clones that are genetically protected against HCV infection. Using reporter assays, Northern blotting and qRT-PCR, we confirmed anti-HCV shmiRNA expression as well as miR-122 integrity and functionality in selected cellular progeny. Moreover, we employed a comprehensive battery of PCR, cDNA/miRNA profiling and whole genome sequencing analyses to validate targeted integration of a single shmiRNA molecule at the expected position, and to rule out deleterious effects on the genomes or transcriptomes of the engineered cells. Importantly, a subgenomic HCV replicon and a full-length reporter virus, but not a Dengue virus control, were significantly impaired in the modified cells. Our original combination of DNA engineering and RNAi expression technologies benefits numerous applications, from miRNA, genome and transgenesis research, to human gene therapy. PMID:27614072
Keys, C; Kemper, S; Keim, P
2005-01-01
The Escherichia coli genome was evaluated for variable number tandem repeat (VNTR) loci in order to provide a subtyping tool with greater discrimination and more efficient capacity. Twenty-nine putative VNTR loci were identified from the E. coli genomic sequence. Their variability was validated by characterizing the number of repeats at each locus in a set of 56 E. coli O157:H7/HN and O55:H7 isolates. An optimized multiplex assay system was developed to facilitate high-capacity analysis. Locus diversity values ranged from 0.23 to 0.95, while the number of alleles ranged from two to 29. These multiple-locus VNTR analysis (MLVA) data were used to describe genetic relationships among these isolates and were compared with PFGE (pulsed-field gel electrophoresis) data from a subset of the same strains. Genetic similarity values were highly correlated between the two approaches, though MLVA was capable of discriminating among closely related isolates when PFGE similarity values were equal to 1.0. Highly variable VNTR loci exist in the E. coli O157:H7 genome and are excellent estimators of genetic relationships, in particular for closely related isolates. Escherichia coli O157:H7 MLVA offers a complementary analysis to the more traditional PFGE approach. Application of MLVA to an outbreak cluster could generate superior molecular epidemiology and result in a more effective public health response.
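As an illustration of the per-locus diversity values reported in this abstract, the following is a minimal sketch of one common way to compute such a value from repeat counts. The abstract does not say which index was used; Nei's unbiased diversity is assumed here, and the function name and example repeat counts are hypothetical.

```python
from collections import Counter


def locus_diversity(repeat_counts):
    """Nei's unbiased diversity for one locus (assumed index, not stated in the abstract).

    D = n / (n - 1) * (1 - sum(p_i ** 2)), where p_i is the frequency of
    allele i (here, a distinct repeat count) among n isolates.
    """
    n = len(repeat_counts)
    if n < 2:
        raise ValueError("need at least two isolates")
    freqs = [count / n for count in Counter(repeat_counts).values()]
    return n / (n - 1) * (1.0 - sum(p * p for p in freqs))


# Hypothetical repeat counts at a single VNTR locus in eight isolates.
example_locus = [5, 5, 6, 7, 7, 7, 9, 5]
print(f"alleles: {len(set(example_locus))}, diversity: {locus_diversity(example_locus):.3f}")
```

A locus at which every isolate carries the same repeat count scores 0, and a locus at which every isolate differs scores 1, so higher values indicate loci that discriminate better between isolates.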
Malashchuk, Igor; Lajoie, Brian R.; Mardaryev, Andrei N.; Gdula, Michal R.; Sharov, Andrey A.; Kohwi-Shigematsu, Terumi; Fessing, Michael Y.
2017-01-01
Mammalian genomes contain several dozen large (>0.5 Mbp) lineage-specific gene loci harbouring functionally related genes. However, spatial chromatin folding, organization of the enhancer-promoter networks and their relevance to Topologically Associating Domains (TADs) in these loci remain poorly understood. TADs are principal units of genome folding and represent DNA regions within which DNA interacts more frequently than it does across the TAD boundary. Here, we used Chromatin Conformation Capture Carbon Copy (5C) technology to characterize the spatial chromatin interaction network in the 3.1 Mb Epidermal Differentiation Complex (EDC) locus harbouring 61 functionally related genes that show lineage-specific activation during terminal keratinocyte differentiation in the epidermis. 5C data validated by 3D-FISH demonstrate that the EDC locus is organized into several TADs showing distinct lineage-specific chromatin interaction networks based on their transcription activity and the gene-rich or gene-poor status. Correlation of the 5C results with genome-wide studies for enhancer-specific histone modifications (H3K4me1 and H3K27ac) revealed that the majority of spatial chromatin interactions that involve the gene-rich TADs at the EDC locus in keratinocytes include both intra- and inter-TAD interaction networks, connecting gene promoters and enhancers. Compared to thymocytes, in which the EDC locus is mostly transcriptionally inactive, these interactions were found to be keratinocyte-specific. In keratinocytes, the promoter-enhancer anchoring regions in the gene-rich transcriptionally active TADs are enriched for the binding of chromatin architectural proteins CTCF, Rad21 and chromatin remodeler Brg1. In contrast to gene-rich TADs, gene-poor TADs show preferential spatial contacts with each other, do not contain active enhancers and show decreased binding of CTCF, Rad21 and Brg1 in keratinocytes.
Thus, spatial interactions between gene promoters and enhancers at the multi-TAD EDC locus in skin epithelial cells are cell type-specific and involve extensive contacts within TADs as well as between different gene-rich TADs, forming the framework for lineage-specific transcription. PMID:28863138
A major locus controls local adaptation and adaptive life history variation in a perennial plant.
Wang, Jing; Ding, Jihua; Tan, Biyue; Robinson, Kathryn M; Michelson, Ingrid H; Johansson, Anna; Nystedt, Björn; Scofield, Douglas G; Nilsson, Ove; Jansson, Stefan; Street, Nathaniel R; Ingvarsson, Pär K
2018-06-04
The initiation of growth cessation and dormancy represent critical life-history trade-offs between survival and growth and have important fitness effects in perennial plants. Such adaptive life-history traits often show strong local adaptation along environmental gradients but, despite their importance, the genetic architecture of these traits remains poorly understood. We integrate whole genome re-sequencing with environmental and phenotypic data from common garden experiments to investigate the genomic basis of local adaptation across a latitudinal gradient in European aspen (Populus tremula). A single genomic region containing the PtFT2 gene mediates local adaptation in the timing of bud set and explains 65% of the observed genetic variation in bud set. This locus is the likely target of a recent selective sweep that originated right before or during colonization of northern Scandinavia following the last glaciation. Field and greenhouse experiments confirm that variation in PtFT2 gene expression affects the phenotypic variation in bud set that we observe in wild natural populations. Our results reveal a major effect locus that determines the timing of bud set and that has facilitated rapid adaptation to shorter growing seasons and colder climates in European aspen. The discovery of a single locus explaining a substantial fraction of the variation in a key life-history trait is remarkable, given that such traits are generally considered to be highly polygenic. These findings provide a dramatic illustration of how loci of large-effect for adaptive traits can arise and be maintained over large geographical scales in natural populations.
Kim, Jae-Jung; Hong, Young Mi; Sohn, Saejung; Jang, Gi Young; Ha, Kee-Soo; Yun, Sin Weon; Han, Myung Ki; Lee, Kyung-Yil; Song, Min Seob; Lee, Hyoung Doo; Kim, Dong Soo; Lee, Jong-Eun; Shin, Eun-Soon; Jang, Ji-Hyun; Lee, Yeon-Su; Kim, Sook-Young; Lee, Jong-Young; Han, Bok-Ghee; Wu, Jer-Yuarn; Kim, Kwi-Joo; Park, Young-Mi; Seo, Eul-Joo; Park, In-Sook; Lee, Jong-Keuk
2011-05-01
Kawasaki disease (KD) is an acute self-limited vasculitis of infants and children that manifests as fever and signs of mucocutaneous inflammation. Coronary artery aneurysms develop in approximately 15-25% of untreated children. Although the etiology of KD is largely unknown, epidemiologic data suggest the importance of genetic factors in the susceptibility to KD. In order to identify genetic variants that influence KD susceptibility, we performed a genome-wide association study (GWAS) using Affymetrix SNP array 6.0 in 186 Korean KD patients and 600 healthy controls; 18 and 26 genomic regions with one or more sequence variants were associated with KD and KD with coronary artery lesions (CALs), respectively (p < 1 × 10(-5)). Of these, one locus on chromosome 1p31 (rs527409) was replicated in 266 children with KD and 600 normal controls (odds ratio [OR] = 2.90, 95% confidence interval [CI] = 1.85-4.54, P (combined) = 1.46 × 10(-6)); and a PELI1 locus on chromosome 2p13.3 (rs7604693) was replicated in 86 KD patients with CALs and 600 controls (OR = 2.70, 95% CI = 1.77-4.12, P (combined) = 2.00 × 10(-6)). These results implicate a locus in the 1p31 region and the PELI1 gene locus in the 2p13.3 region as susceptibility loci for KD and CALs, respectively.
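For readers unfamiliar with how an odds ratio and its 95% confidence interval relate to case-control counts, here is a minimal sketch using the standard log-odds (Woolf) interval on a 2×2 table. It is not the analysis used in the study, whose genotype counts and association model are not given in the abstract; the function name and the counts below are hypothetical.

```python
import math


def odds_ratio_ci(case_exposed, case_unexposed, ctrl_exposed, ctrl_unexposed, z=1.96):
    """Odds ratio with a Woolf (log-scale) confidence interval from a 2x2 table.

    z = 1.96 gives an approximate 95% interval. All four cells must be > 0.
    """
    odds_ratio = (case_exposed * ctrl_unexposed) / (case_unexposed * ctrl_exposed)
    # Standard error of log(OR) is the root of the summed reciprocal cell counts.
    se_log_or = math.sqrt(
        1 / case_exposed + 1 / case_unexposed + 1 / ctrl_exposed + 1 / ctrl_unexposed
    )
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts: carriers / non-carriers of a risk allele in cases and controls.
or_value, ci_low, ci_high = odds_ratio_ci(40, 60, 20, 80)
print(f"OR = {or_value:.2f}, 95% CI = {ci_low:.2f}-{ci_high:.2f}")
```

An interval that excludes 1, as both replicated loci in the abstract do, indicates an association that is unlikely to be due to chance at the 5% level.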
Cheng, Yu-Ching; Stanne, Tara M.; Giese, Anne-Katrin; Ho, Weang Kee; Traylor, Matthew; Amouyel, Philippe; Holliday, Elizabeth G.; Malik, Rainer; Xu, Huichun; Kittner, Steven J.; Cole, John W.; O’Connell, Jeffrey R.; Danesh, John; Rasheed, Asif; Zhao, Wei; Engelter, Stefan; Grond-Ginsbach, Caspar; Kamatani, Yoichiro; Lathrop, Mark; Leys, Didier; Thijs, Vincent; Metso, Tiina M.; Tatlisumak, Turgut; Pezzini, Alessandro; Parati, Eugenio A.; Norrving, Bo; Bevan, Steve; Rothwell, Peter M; Sudlow, Cathie; Slowik, Agnieszka; Lindgren, Arne; Walters, Matthew R; Jannes, Jim; Shen, Jess; Crosslin, David; Doheny, Kimberly; Laurie, Cathy C.; Kanse, Sandip M.; Bis, Joshua C.; Fornage, Myriam; Mosley, Thomas H.; Hopewell, Jemma C.; Strauch, Konstantin; Müller-Nurasyid, Martina; Gieger, Christian; Waldenberger, Melanie; Peters, Annette; Meisinger, Christine; Ikram, M. Arfan; Longstreth, WT; Meschia, James F.; Seshadri, Sudha; Sharma, Pankaj; Worrall, Bradford; Jern, Christina; Levi, Christopher; Dichgans, Martin; Boncoraglio, Giorgio B.; Markus, Hugh S.; Debette, Stephanie; Rolfs, Arndt; Saleheen, Danish; Mitchell, Braxton D.
2015-01-01
Background and Purpose: Although a genetic contribution to ischemic stroke is well recognized, only a handful of stroke loci have been identified by large-scale genetic association studies to date. Hypothesizing that genetic effects might be stronger for early- versus late-onset stroke, we conducted a two-stage meta-analysis of genome-wide association studies (GWAS), focusing on stroke cases with an age of onset < 60 years old. Methods: The Discovery stage of our GWAS included 4,505 cases and 21,968 controls of European, South-Asian and African ancestry, drawn from 6 studies. In Stage 2, we selected the lead genetic variants at loci with association P<5×10^−6 and performed in silico association analyses in an independent sample of up to 1,003 cases and 7,745 controls. Results: One stroke susceptibility locus at 10q25 reached genome-wide significance in the combined analysis of all samples from the Discovery and Follow-up Stages (rs11196288, OR=1.41, P=9.5×10^−9). The associated locus is in an intergenic region between TCF7L2 and HABP2. In a further analysis in an independent sample, we found that two SNPs in high linkage disequilibrium with rs11196288 were significantly associated with total plasma factor VII-activating protease levels, a product of HABP2. Conclusions: HABP2, which encodes an extracellular serine protease involved in coagulation, fibrinolysis, and inflammatory pathways, may be a genetic susceptibility locus for early-onset stroke. PMID:26732560
Whole Genome Sequencing for Genomics-Guided Investigations of Escherichia coli O157:H7 Outbreaks.
Rusconi, Brigida; Sanjar, Fatemeh; Koenig, Sara S K; Mammel, Mark K; Tarr, Phillip I; Eppinger, Mark
2016-01-01
Multi-isolate whole genome sequencing (WGS) and typing for outbreak investigations has become a reality in the post-genomics era. We applied this technology to strains from Escherichia coli O157:H7 outbreaks. These include isolates from seven North American outbreaks, as well as multiple isolates from the same patient and from different infected individuals in the same household. Customized high-resolution bioinformatics sequence typing strategies were developed to assess the core genome and mobilome plasticity. Sequence typing was performed using an in-house single nucleotide polymorphism (SNP) discovery and validation pipeline. Discriminatory power becomes of particular importance for the investigation of isolates from outbreaks in which macrogenomic techniques such as pulsed-field gel electrophoresis or multiple-locus variable-number tandem repeat analysis do not differentiate closely related organisms. We also characterized differences in the phage inventory, allowing us to identify plasticity among outbreak strains that is not detectable at the core genome level. Our comprehensive analysis of the mobilome identified multiple plasmids that have not previously been associated with this lineage. Applied phylogenomics approaches provide strong molecular evidence for exceptionally little heterogeneity of strains within outbreaks and demonstrate the value of intra-cluster comparisons, rather than basing the analysis on archetypal reference strains. Next generation sequencing and whole genome typing strategies provide the technological foundation for genomic epidemiology outbreak investigation, owing to their significantly higher sample throughput, cost efficiency, and phylogenetic relatedness accuracy. These phylogenomics approaches have major public health relevance in translating information from the sequence-based survey to support timely and informed countermeasures. 
Polymorphisms identified in this work offer robust phylogenetic signals that index both short- and long-term evolution and can complement currently employed typing schemes for outbreak ex- and inclusion, diagnostics, surveillance, and forensic studies. PMID:27446025
Grattapaglia, Dario; Mamani, Eva M C; Silva-Junior, Orzenil B; Faria, Danielle A
2015-03-01
Keystone species in their native ranges, eucalypts are ecologically and genetically very diverse, growing naturally along extensive latitudinal and altitudinal ranges and in variable environments. Besides their ecological importance, eucalypts are also the most widely planted trees for sustainable forestry in the world. We report the development of a novel collection of 535 microsatellites for species of Eucalyptus, 494 designed from ESTs and 41 from genomic libraries. A selected subset of 223 was evaluated for individual identification, parentage testing, and ancestral information content in the two most extensively studied species, Eucalyptus grandis and Eucalyptus globulus. Microsatellites showed high transferability and overlapping allele size ranges, suggesting that they arose in the common ancestor of the two species and confirming the extensive genome conservation between them. A consensus linkage map with 437 microsatellites, the most comprehensive microsatellite-only genetic map for Eucalyptus, was built by assembling segregation data from three mapping populations and anchored to the Eucalyptus genome. An overall colinearity between recombination-based and physical positioning of 84% of the mapped microsatellites was observed, with some ordering discrepancies and sporadic locus duplications, consistent with the recently described whole genome duplication events in Eucalyptus. The linkage map covered 95.2% of the 605.8-Mbp assembled genome sequence, placing one microsatellite every 1.55 Mbp on average, and an overall estimate of physical to recombination distance of 618 kbp/cM. The genetic parameter estimates, together with linkage and physical position data for this large set of microsatellites, should assist marker choice for genome-wide population genetics and comparative mapping in Eucalyptus. © 2014 John Wiley & Sons Ltd.
Polymorphism at Expressed DQ and DR Loci in Five Common Equine MHC Haplotypes
Miller, Donald; Tallmadge, Rebecca L.; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A.; Antczak, Douglas F.
2016-01-01
The polymorphism of Major Histocompatibility Complex (MHC) class II DQ and DR genes in five common Equine Leukocyte Antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine Bacterial Artificial Chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next Generation Sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse. PMID:27889800
Rotival, Maxime; Zeller, Tanja; Wild, Philipp S; Maouche, Seraya; Szymczak, Silke; Schillert, Arne; Castagné, Raphaele; Deiseroth, Arne; Proust, Carole; Brocheton, Jessy; Godefroy, Tiphaine; Perret, Claire; Germain, Marine; Eleftheriadis, Medea; Sinning, Christoph R; Schnabel, Renate B; Lubos, Edith; Lackner, Karl J; Rossmann, Heidi; Münzel, Thomas; Rendon, Augusto; Erdmann, Jeanette; Deloukas, Panos; Hengstenberg, Christian; Diemert, Patrick; Montalescot, Gilles; Ouwehand, Willem H; Samani, Nilesh J; Schunkert, Heribert; Tregouet, David-Alexandre; Ziegler, Andreas; Goodall, Alison H; Cambien, François; Tiret, Laurence; Blankenberg, Stefan
2011-12-01
One major expectation from the transcriptome in humans is to characterize the biological basis of associations identified by genome-wide association studies. So far, few cis expression quantitative trait loci (eQTLs) have been reliably related to disease susceptibility. Trans-regulating mechanisms may play a more prominent role in disease susceptibility. We analyzed 12,808 genes detected in at least 5% of circulating monocyte samples from a population-based sample of 1,490 unrelated European subjects. We applied a method for extracting expression patterns, independent component analysis, to identify sets of co-regulated genes. These patterns were then related to 675,350 SNPs to identify major trans-acting regulators. We detected three genomic regions significantly associated with co-regulated gene modules. Association of these loci with multiple expression traits was replicated in Cardiogenics, an independent study in which expression profiles of monocytes were available in 758 subjects. The locus 12q13 (lead SNP rs11171739), previously identified as a type 1 diabetes locus, was associated with a pattern including two cis eQTLs, RPS26 and SUOX, and 5 trans eQTLs, one of which (MADCAM1) is a potential candidate for mediating T1D susceptibility. The locus 12q24 (lead SNP rs653178), which has demonstrated extensive disease pleiotropy, including type 1 diabetes, hypertension, and celiac disease, was associated with a pattern strongly correlated with blood pressure level. The strongest trans eQTL in this pattern was CRIP1, a known marker of cellular proliferation in cancer. The locus 12q15 (lead SNP rs11177644) was associated with a pattern driven by two cis eQTLs, LYZ and YEATS4, and including 34 trans eQTLs, several of them tumor-related genes. 
This study shows that a method exploiting the structure of co-expressions among genes can help identify genomic regions involved in trans regulation of sets of genes and can provide clues for understanding the mechanisms linking genome-wide association loci to disease.
ERIC Educational Resources Information Center
Abu-Hilal, Maher M.
A study tested predictions of the I/E (internal/external) frame of reference model and extended this model to include locus of control. A sample of upper elementary (n=181) and junior high (n=191) students in the United Arab Emirates participated in the study. Structural equation modeling (SEM) analyses provided support for the external comparison…
Tost, Jörg
2016-01-01
DNA methylation is the most studied epigenetic modification, and altered DNA methylation patterns have been identified in cancer and more recently also in many other complex diseases. Furthermore, DNA methylation is influenced by a variety of environmental factors, and the analysis of DNA methylation patterns might allow deciphering previous exposure. Although a large number of techniques to study DNA methylation either genome-wide or at specific loci have been devised, they all are based on a limited number of principles for differentiating the methylation state, viz., methylation-specific/methylation-dependent restriction enzymes, antibodies or methyl-binding proteins, chemical-based enrichment, or bisulfite conversion. Second-generation sequencing has largely replaced microarrays as the readout platform and is also becoming more popular for locus-specific DNA methylation analysis. In this chapter, the currently used methods for both genome-wide and locus-specific analysis of 5-methylcytosine as well as its oxidative derivatives, such as 5-hydroxymethylcytosine, are reviewed in detail, and the advantages and limitations of each approach are discussed. Furthermore, emerging technologies avoiding PCR amplification and allowing a direct readout of DNA methylation are summarized, together with novel applications, such as the detection of DNA methylation in single cells or in circulating cell-free DNA.
Genome-wide association studies for multiple diseases of the German Shepherd Dog
Tsai, Kate L.; Noorai, Rooksana E.; Starr-Moss, Alison N.; Quignon, Pascale; Rinz, Caitlin J.; Ostrander, Elaine A.; Steiner, Jörg M.; Murphy, Keith E.
2012-01-01
The German Shepherd Dog (GSD) is a popular working and companion breed for which over 50 hereditary diseases have been documented. Herein, SNP profiles for 197 GSDs were generated using the Affymetrix v2 canine SNP array for a genome-wide association study to identify loci associated with four diseases: pituitary dwarfism, degenerative myelopathy (DM), congenital megaesophagus (ME), and pancreatic acinar atrophy (PAA). A locus on Chr 9 is strongly associated with pituitary dwarfism and is proximal to a plausible candidate gene, LHX3. Results for DM confirm a major locus encompassing SOD1, in which an associated point mutation was previously identified, but do not suggest modifier loci. Several SNPs on Chr 12 are associated with ME and a 4.7 Mb haplotype block is present in affected dogs. Analysis of additional ME cases for a SNP within the haplotype provides further support for this association. Results for PAA indicate more complex genetic underpinnings. Several regions on multiple chromosomes reach genome-wide significance. However, no major locus is apparent and only two associated haplotype blocks, on Chrs 7 and 12 are observed. These data suggest that PAA may be governed by multiple loci with small effects, or it may be a heterogeneous disorder. PMID:22105877
Genetics Home Reference: prostate cancer
... prostate cancer Genetic Testing Registry: Prostate cancer aggressiveness quantitative trait locus on chromosome 19 Genetic Testing Registry: ... OMIM (25 links) PROSTATE CANCER PROSTATE CANCER AGGRESSIVENESS QUANTITATIVE TRAIT LOCUS ON CHROMOSOME 19 PROSTATE CANCER ANTIGEN ...
The sh2-R allele of the maize shrunken-2 locus was caused by a complex chromosomal rearrangement.
Kramer, Vance; Shaw, Janine R; Senior, M Lynn; Hannah, L Curtis
2015-03-01
The mutant that originally defined the shrunken-2 locus of maize is shown here to be the product of a complex chromosomal rearrangement. The maize shrunken-2 gene (sh2) encodes the large subunit of the heterotetrameric enzyme adenosine diphosphate glucose pyrophosphorylase, a rate-limiting enzyme in starch biosynthesis. The sh2 gene was defined approximately 72 years ago by the isolation of a loss-of-function allele conditioning a shrunken but viable seed. In subsequent years, the realization that this allele, termed sh2-R or sh2-Reference, causes an extremely high level of sucrose to accumulate in the developing seed led to a revolution in the sweet corn industry. Now, the vast majority of sweet corns grown throughout the world contain this mutant allele. Through initial Southern analysis followed by genomic sequencing, the work reported here shows that this allele arose through a complex set of events involving at least three breaks of chromosome 3 as well as an intra-chromosomal inversion. These findings provide an explanation for some previously reported, unexpected observations concerning rates of recombination within and between genes in this region.
Keong, B P; Harikrishna, J A
2012-02-01
A preliminary screening was conducted on BC3F1 and BC4F1 backcross families developed from crossing Oryza sativa (MR219) and O. rufipogon (IRGC105491). Despite earlier results showing that O. rufipogon alleles (wild introgression) contributed to both number of panicles (qPPL-2) and tillers (qTPL-2) at loci RM250, RM208, and RM48 in line A20 of the BC2F2 population, we observed that wild introgression was lost at loci RM250 and RM208 but retained at locus RM48 in BC3F1 and BC4F1. Progeny tests conducted utilizing genotype and phenotype data on both BC4F1 and a reference population, BC2F7 (A20 line), did not show significant differences between groups having the MR219 allele and wild introgression at locus RM48. This suggests that there is no additive and transgressive effect of wild introgression in the BC3F1 and BC4F1 generated. The presence of wild introgression was largely due to gene contamination by cross-pollination during field breeding practices.
Extensive gene conversion at the PMS2 DNA mismatch repair locus.
Hayward, Bruce E; De Vos, Michel; Valleley, Elizabeth M A; Charlton, Ruth S; Taylor, Graham R; Sheridan, Eamonn; Bonthron, David T
2007-05-01
Mutations of the PMS2 DNA repair gene predispose to a characteristic range of malignancies, with either childhood onset (when both alleles are mutated) or a partially penetrant adult onset (if heterozygous). These mutations have been difficult to detect, due to interference from a family of pseudogenes located on chromosome 7. One of these, the PMS2CL pseudogene, lies within a 100-kb inverted duplication (inv dup), 700 kb centromeric to PMS2 itself on 7p22. Here, we show that the reference genomic sequences cannot be relied upon to distinguish PMS2 from PMS2CL, because of sequence transfer between the two loci. The 7p22 inv dup occurred prior to the divergence of modern ape species (15 million years ago [Mya]), but has undergone extensive sequence homogenization. This process appears to be ongoing, since there is considerable allelic diversity within the duplicated region, much of it derived from sequence exchange between PMS2 and PMS2CL. This sequence diversity can result in both false-positive and false-negative mutation analysis at this locus. Great caution is still needed in the design and interpretation of PMS2 mutation screens. 2007 Wiley-Liss, Inc.
Predictors of Parental Locus of Control in Mothers of Pre- and Early Adolescents
ERIC Educational Resources Information Center
Freed, Rachel D.; Tompson, Martha C.
2011-01-01
Parental locus of control refers to parents' perceived power and efficacy in child-rearing situations. This study explored parental locus of control and its correlates in 160 mothers of children ages 8 to 14 cross-sectionally and 1 year later. Maternal depression, maternal expressed emotion, and child internalizing and externalizing behavior were…
Ng, Maggie C Y; Graff, Mariaelisa; Lu, Yingchang; Justice, Anne E; Mudgal, Poorva; Liu, Ching-Ti; Young, Kristin; Yanek, Lisa R; Feitosa, Mary F; Wojczynski, Mary K; Rand, Kristin; Brody, Jennifer A; Cade, Brian E; Dimitrov, Latchezar; Duan, Qing; Guo, Xiuqing; Lange, Leslie A; Nalls, Michael A; Okut, Hayrettin; Tajuddin, Salman M; Tayo, Bamidele O; Vedantam, Sailaja; Bradfield, Jonathan P; Chen, Guanjie; Chen, Wei-Min; Chesi, Alessandra; Irvin, Marguerite R; Padhukasahasram, Badri; Smith, Jennifer A; Zheng, Wei; Allison, Matthew A; Ambrosone, Christine B; Bandera, Elisa V; Bartz, Traci M; Berndt, Sonja I; Bernstein, Leslie; Blot, William J; Bottinger, Erwin P; Carpten, John; Chanock, Stephen J; Chen, Yii-Der Ida; Conti, David V; Cooper, Richard S; Fornage, Myriam; Freedman, Barry I; Garcia, Melissa; Goodman, Phyllis J; Hsu, Yu-Han H; Hu, Jennifer; Huff, Chad D; Ingles, Sue A; John, Esther M; Kittles, Rick; Klein, Eric; Li, Jin; McKnight, Barbara; Nayak, Uma; Nemesure, Barbara; Ogunniyi, Adesola; Olshan, Andrew; Press, Michael F; Rohde, Rebecca; Rybicki, Benjamin A; Salako, Babatunde; Sanderson, Maureen; Shao, Yaming; Siscovick, David S; Stanford, Janet L; Stevens, Victoria L; Stram, Alex; Strom, Sara S; Vaidya, Dhananjay; Witte, John S; Yao, Jie; Zhu, Xiaofeng; Ziegler, Regina G; Zonderman, Alan B; Adeyemo, Adebowale; Ambs, Stefan; Cushman, Mary; Faul, Jessica D; Hakonarson, Hakon; Levin, Albert M; Nathanson, Katherine L; Ware, Erin B; Weir, David R; Zhao, Wei; Zhi, Degui; Arnett, Donna K; Grant, Struan F A; Kardia, Sharon L R; Oloapde, Olufunmilayo I; Rao, D C; Rotimi, Charles N; Sale, Michele M; Williams, L Keoki; Zemel, Babette S; Becker, Diane M; Borecki, Ingrid B; Evans, Michele K; Harris, Tamara B; Hirschhorn, Joel N; Li, Yun; Patel, Sanjay R; Psaty, Bruce M; Rotter, Jerome I; Wilson, James G; Bowden, Donald W; Cupples, L Adrienne; Haiman, Christopher A; Loos, Ruth J F; North, Kari E
2017-04-01
Genome-wide association studies (GWAS) have identified >300 loci associated with measures of adiposity including body mass index (BMI) and waist-to-hip ratio (adjusted for BMI, WHRadjBMI), but few have been identified through screening of the African ancestry genomes. We performed large scale meta-analyses and replications in up to 52,895 individuals for BMI and up to 23,095 individuals for WHRadjBMI from the African Ancestry Anthropometry Genetics Consortium (AAAGC) using 1000 Genomes phase 1 imputed GWAS to improve coverage of both common and low frequency variants in the low linkage disequilibrium African ancestry genomes. In the sex-combined analyses, we identified one novel locus (TCF7L2/HABP2) for WHRadjBMI and eight previously established loci at P < 5×10-8: seven for BMI, and one for WHRadjBMI in African ancestry individuals. An additional novel locus (SPRYD7/DLEU2) was identified for WHRadjBMI when combined with European GWAS. In the sex-stratified analyses, we identified three novel loci for BMI (INTS10/LPL and MLC1 in men, IRX4/IRX2 in women) and four for WHRadjBMI (SSX2IP, CASC8, PDE3B and ZDHHC1/HSD11B2 in women) in individuals of African ancestry or both African and European ancestry. For four of the novel variants, the minor allele frequency was low (<5%). In the trans-ethnic fine mapping of 47 BMI loci and 27 WHRadjBMI loci that were locus-wide significant (P < 0.05 adjusted for effective number of variants per locus) from the African ancestry sex-combined and sex-stratified analyses, 26 BMI loci and 17 WHRadjBMI loci contained ≤ 20 variants in the credible sets that jointly account for 99% posterior probability of driving the associations. The lead variants in 13 of these loci had a high probability of being causal. 
As compared to our previous HapMap imputed GWAS for BMI and WHRadjBMI including up to 71,412 and 27,350 African ancestry individuals, respectively, our results suggest that 1000 Genomes imputation showed modest improvement in identifying GWAS loci including low frequency variants. Trans-ethnic meta-analyses further improved fine mapping of putative causal variants in loci shared between the African and European ancestry populations.
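The 99% credible sets mentioned in this abstract can be illustrated with a minimal sketch. It assumes that per-variant posterior probabilities for a locus are already available and sum to 1; the function name `credible_set` and the toy numbers are hypothetical and are not taken from the study, whose exact fine-mapping method is not described here.

```python
def credible_set(posteriors, coverage=0.99):
    """Return the smallest list of variant IDs whose posterior
    probabilities sum to at least `coverage` (greedy, highest first)."""
    ranked = sorted(posteriors.items(), key=lambda kv: kv[1], reverse=True)
    chosen, total = [], 0.0
    for variant, prob in ranked:
        chosen.append(variant)
        total += prob
        if total >= coverage:
            break
    return chosen


# Hypothetical posterior probabilities for one locus (illustrative only).
toy_locus = {"var_a": 0.90, "var_b": 0.07, "var_c": 0.025, "var_d": 0.005}
print(credible_set(toy_locus))  # three variants are needed to reach 99%
```

A locus whose credible set is small (the abstract uses a threshold of 20 variants or fewer) is one where the association signal has been narrowed to a short list of candidate causal variants.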
QTL Mapping of Sex Determination Loci Supports an Ancient Pathway in Ants and Honey Bees.
Miyakawa, Misato O; Mikheyev, Alexander S
2015-11-01
Sex determination mechanisms play a central role in life-history characteristics, affecting mating systems, sex ratios, inbreeding tolerance, etc. Downstream components of sex determination pathways are highly conserved, but upstream components evolve rapidly. Evolutionary dynamics of sex determination remain poorly understood, particularly because mechanisms appear so diverse. Here we investigate the origins and evolution of complementary sex determination (CSD) in ants and bees. The honey bee has a well-characterized CSD locus, containing tandemly arranged homologs of the transformer gene [complementary sex determiner (csd) and feminizer (fem)]. Such tandem paralogs appear frequently in aculeate hymenopteran genomes. However, only comparative genomic, but not functional, data support a broader role for csd/fem in sex determination, and whether species other than the honey bee use this pathway remains controversial. Here we used a backcross to test whether csd/fem acts as a CSD locus in an ant (Vollenhovia emeryi). After sequencing and assembling the genome, we computed a linkage map, and conducted a quantitative trait locus (QTL) analysis of diploid male production using 68 diploid males and 171 workers. We found two QTLs on separate linkage groups (CsdQTL1 and CsdQTL2) that jointly explained 98.0% of the phenotypic variance. CsdQTL1 included two tandem transformer homologs. These data support the prediction that the same CSD mechanism has indeed been conserved for over 100 million years. CsdQTL2 had no similarity to CsdQTL1 and included a 236-kb region with no obvious CSD gene candidates, making it impossible to conclusively characterize it using our data. The sequence of this locus was conserved in at least one other ant genome that diverged >75 million years ago. 
By applying QTL analysis to ants for the first time, we support the hypothesis that elements of hymenopteran CSD are ancient, but also show that more remains to be learned about the diversity of CSD mechanisms.
Yazar, Seyhan; Mishra, Aniket; Ang, Wei; Kearns, Lisa S; Mountain, Jenny A; Pennell, Craig; Montgomery, Grant W; Young, Terri L; Hammond, Christopher J; Macgregor, Stuart; Mackey, David A; Hewitt, Alex W
2013-01-01
Corneal astigmatism is a common eye disorder characterized by irregularities in corneal curvature. Recently, the rs7677751 single nucleotide polymorphism (SNP) at the platelet-derived growth factor receptor alpha (PDGFRA) locus was found to be associated with corneal astigmatism in people of Asian ancestry. In the present study, we sought to replicate this finding and identify other genetic markers of corneal astigmatism in an Australian population of Northern European ancestry. Data from two cohorts were included in this study. The first cohort consisted of 1,013 individuals who were part of the Western Australian Pregnancy Cohort (Raine) Study: 20-year follow-up Eye Study. The second cohort comprised 1,788 individuals of 857 twin families who were recruited through the Twins Eye Study in Tasmania and the Brisbane Adolescent Twin Study. Corneal astigmatism was calculated as the absolute difference between the keratometry readings in two meridians, and genotype data were extracted from genome-wide arrays. Initially, each cohort was analyzed separately, before being combined for meta-analysis and subsequent genome-wide pathway analysis. Following meta-analysis, SNP rs7677751 at the PDGFRA locus had a combined p=0.32. No variant was found to be statistically significantly associated with corneal astigmatism at the genome-wide level (p<5.0×10−8). The SNP with the strongest association was rs1164064 (p=1.86×10−6) on chromosome 3q13. Gene-based pathway analysis identified a significant association with the Gene Ontology "segmentation" (GO:0035282) pathway (corrected p=0.009). Our data suggest that the PDGFRA locus does not confer a major risk of corneal astigmatism in people of Northern European ancestry. Better-powered studies are required to validate the novel putative findings of our study.
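The phenotype definition used in this abstract (the absolute difference between keratometry readings in two meridians) can be written as a one-line calculation. The sketch below assumes both readings are in the same unit (diopters are assumed here); the function name and example values are hypothetical.

```python
def corneal_astigmatism(k_meridian_1, k_meridian_2):
    """Absolute difference between two keratometry readings
    (same unit for both; diopters are assumed in this sketch)."""
    return abs(k_meridian_1 - k_meridian_2)


# Hypothetical readings for one eye, in diopters (illustrative only).
print(corneal_astigmatism(43.25, 44.00))  # 0.75
```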
Enzymatically Generated CRISPR Libraries for Genome Labeling and Screening
Lane, Andrew B.; Strzelecka, Magdalena; Ettinger, Andreas; Grenfell, Andrew W.; Wittmann, Torsten; Heald, Rebecca
2015-01-01
Summary CRISPR-based technologies have emerged as powerful tools to alter genomes and mark chromosomal loci, but an inexpensive method for generating large numbers of RNA guides for whole genome screening and labeling is lacking. Using a method that permits library construction from any source of DNA, we generated guide libraries that label repetitive loci or a single chromosomal locus in Xenopus egg extracts and show that a complex library can target the E. coli genome at high frequency. PMID:26212133
Birth and death of genes linked to chromosomal inversion
Furuta, Yoshikazu; Kawai, Mikihiko; Yahara, Koji; Takahashi, Noriko; Handa, Naofumi; Tsuru, Takeshi; Oshima, Kenshiro; Yoshida, Masaru; Azuma, Takeshi; Hattori, Masahira; Uchiyama, Ikuo; Kobayashi, Ichizo
2011-01-01
The birth and death of genes is central to adaptive evolution, yet the underlying genome dynamics remain elusive. The availability of closely related complete genome sequences helps to follow changes in gene contents and clarify their relationship to overall genome organization. Helicobacter pylori, bacteria in our stomach, are known for their extreme genome plasticity through mutation and recombination and will make a good target for such an analysis. In comparing their complete genome sequences, we found that gain and loss of genes (loci) for outer membrane proteins, which mediate host interaction, occurred at breakpoints of chromosomal inversions. Sequence comparison there revealed a unique mechanism of DNA duplication: DNA duplication associated with inversion. In this process, a DNA segment at one chromosomal locus is copied and inserted, in an inverted orientation, into a distant locus on the same chromosome, while the entire region between these two loci is also inverted. Recognition of this and three more inversion modes, which occur through reciprocal recombination between long or short sequence similarity or adjacent to a mobile element, allowed reconstruction of synteny evolution through inversion events in this species. These results will guide the interpretation of extensive DNA sequencing results for understanding long- and short-term genome evolution in various organisms and in cancer cells. PMID:21212362
Comparative Analysis of Genome Diversity in Bullmastiff Dogs
Mortlock, Sally-Anne; Khatkar, Mehar S.; Williamson, Peter
2016-01-01
Management and preservation of genomic diversity in dog breeds is a major objective for maintaining health. The present study was undertaken to characterise genomic diversity in Bullmastiff dogs using both genealogical and molecular analysis. Genealogical analysis of diversity was conducted using a database consisting of 16,378 Bullmastiff pedigrees from 1980 to 2013. Additionally, a total of 188 Bullmastiff dogs were genotyped using the 170,000 SNP Illumina CanineHD Beadchip. Genealogical parameters revealed a mean inbreeding coefficient of 0.047; 142 total founders (f); an effective number of founders (fe) of 79; an effective number of ancestors (fa) of 62; and an effective population size of the reference population of 41. Genetic diversity and the degree of genome-wide homogeneity within the breed were also investigated using molecular data. Multiple-locus heterozygosity (MLH) was equal to 0.206; runs of homozygosity (ROH), as a proportion of the genome, averaged 16.44%; and effective population size was 29.1, with an average inbreeding coefficient of 0.035, all estimated using SNP data. Fine-scale population structure was analysed using NETVIEW, a population analysis pipeline. Visualisation of the high definition network captured relationships among individuals within and between subpopulations. Effects of unequal founder use, and ancestral inbreeding and selection, were evident. While current levels of Bullmastiff heterozygosity, inbreeding and homozygosity are not unusual, a relatively small effective population size indicates that a breeding strategy to reduce the inbreeding rate may be beneficial. PMID:26824579
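As a rough illustration of the multiple-locus heterozygosity (MLH) statistic reported in this abstract, the sketch below uses one common definition: the fraction of an individual's successfully typed loci that are heterozygous. The abstract does not state the exact formula the authors used, so this definition, the 0/1/2 genotype coding (with `None` for a missing call), and the function name are assumptions.

```python
def multilocus_heterozygosity(genotypes):
    """Fraction of typed loci that are heterozygous.

    `genotypes` holds one value per SNP: 0 or 2 for the two homozygous
    classes, 1 for heterozygous, None for a missing call (assumed coding).
    """
    typed = [g for g in genotypes if g is not None]
    if not typed:
        raise ValueError("no typed loci")
    return sum(1 for g in typed if g == 1) / len(typed)


# Hypothetical genotypes for one dog at six SNPs (illustrative only).
print(multilocus_heterozygosity([0, 1, 2, 1, None, 0]))  # 0.4
```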
Heidari, Mohammad; Ghodusi, Mansureh
2016-01-01
Objective: The present research aimed to determine the relationship between self-esteem, locus of control, and quality of life during treatment stages in patients referred to drug addiction rehabilitation centers in Borujen city, Iran. Methods: The current study was a cross-sectional, descriptive-correlational study. The research sample consisted of 150 patients referred to addiction rehabilitation centers in Borujen city. For data gathering, the Rosenberg Self-esteem Scale, Rotter’s Locus of Control Scale, and the SF36 Quality of Life Questionnaire were used. Following collection of the questionnaires, the data were analyzed using SPSS/16 software. Results: According to the results, on the 12th day of treatment, 96 patients exhibited moderate self-esteem, 102 patients had an internal locus of control, and their overall quality of life score was 40.43±12.71. Furthermore, Pearson’s correlation coefficient indicated a significant positive relationship between locus of control and quality of life during the different treatment stages. Conclusion: It seems that quality of life improves during addiction treatment stages due to improvement of personality traits including locus of control and self-esteem. Therefore, counseling methods should be treated as a crucial priority in addiction rehabilitation centers by health sector authorities and managers, and can play an essential role in enhancing quality of life. PMID:27698598
Two-locus disease models with two marker loci: The power of affected-sib-pair tests
DOE Office of Scientific and Technical Information (OSTI.GOV)
Knapp, M.; Seuchter, S.A.; Bauer, M.P.
1994-11-01
Recently, Schork et al. found that two-trait-locus, two-marker-locus (parametric) linkage analysis can provide substantially more linkage information than can standard one-trait-locus, one-marker-locus methods. However, because of the increased burden of computation, Schork et al. do not expect that their approach will be applied in an initial genome scan. Further, the specification of a suitable two-locus segregation model can be crucial. Affected-sib-pair tests are computationally simple and do not require an explicit specification of the disease model. In the past, however, these tests mainly have been applied to data with a single marker locus. Here, we consider sib-pair tests that make it possible to analyze simultaneously two marker loci. The power of these tests is investigated for different (epistatic and heterogeneous) two-trait-locus models, each trait locus being linked to one of the marker loci. We compare these tests both with the test that is optimal for a certain model and with the strategy that analyzes each marker locus separately. The results indicate that a straightforward extension of the well-known mean test for two marker loci can be much more powerful than single-marker-locus analysis and that its power is only slightly inferior to the power of the optimal test. 21 refs., 5 figs., 2 tabs.
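For orientation, the single-marker mean test that this abstract builds on can be sketched as follows. It assumes the number of alleles shared identical by descent (IBD) is known for every affected sib pair; under no linkage that count is 0, 1 or 2 with probabilities 1/4, 1/2 and 1/4, so the shared proportion has mean 1/2 and variance 1/8. The function name and input format are hypothetical, and the two-marker extension studied by the authors is not reproduced here because the abstract does not specify it.

```python
import math
from statistics import NormalDist


def asp_mean_test(ibd_counts):
    """One-sided mean test for excess IBD sharing in affected sib pairs.

    ibd_counts: one value per sib pair, each 0, 1 or 2 alleles shared IBD
    (assumed to be known without error in this sketch).
    Returns (z, p) using the normal approximation.
    """
    n = len(ibd_counts)
    if n == 0:
        raise ValueError("need at least one sib pair")
    mean_proportion = sum(ibd_counts) / (2 * n)
    # Under the null, each pair's shared proportion has variance 1/8.
    z = (mean_proportion - 0.5) / math.sqrt(1.0 / (8 * n))
    p = 1.0 - NormalDist().cdf(z)
    return z, p
```

A large positive `z` indicates that affected sibs share more marker alleles than expected by chance, which is the signal of linkage the test looks for.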
Yanokura, Megumi; Banno, Kouji; Adachi, Masataka; Aoki, Daisuke; Abe, Kuniya
2017-01-01
Aberrant DNA methylation is widely observed in many cancers. Concurrent DNA methylation of multiple genes occurs in endometrial cancer and is referred to as the CpG island methylator phenotype (CIMP). However, the features and causes of CIMP-positive endometrial cancer are not well understood. To investigate DNA methylation features characteristic of CIMP-positive endometrial cancer, we first classified samples from 25 patients with endometrial cancer based on the methylation status of three genes, i.e. MLH1, CDH1 (E-cadherin) and APC: CIMP-high (CIMP-H, 2/25, 8.0%), CIMP-low (CIMP-L, 7/25, 28.0%) and CIMP-negative (CIMP(-), 16/25, 64.0%). We then selected two samples each from the CIMP-H and CIMP(-) classes, and analyzed the DNA methylation status of both normal (peripheral blood cells: PBCs) and cancer tissues by genome-wide, targeted bisulfite sequencing. Genomes of the CIMP-H cancer tissues were significantly hypermethylated compared to those of the CIMP(-). Surprisingly, in normal tissues of the CIMP-H patients, the promoter region of the miR-663a locus was hypermethylated relative to CIMP(-) samples. Consistent with this finding, miR-663a expression was lower in the CIMP-H PBCs than in the CIMP(-) PBCs. The same region of the miR-663a locus was found to be highly methylated in cancer tissues of both CIMP-H and CIMP(-) cases. This is the first report showing that aberrant DNA methylation of the miR-663a promoter can occur in normal tissue of cancer patients, suggesting a possible link between this epigenetic abnormality and endometrial cancer. This raises the possibility that the hypermethylation of the miR-663a promoter represents an epimutation associated with the CIMP-H endometrial cancers. Based on these findings, the relationship between aberrant DNA methylation and the CIMP-H phenotype is discussed. PMID:28440489
Yanokura, Megumi; Banno, Kouji; Adachi, Masataka; Aoki, Daisuke; Abe, Kuniya
2017-06-01
Aberrant DNA methylation is widely observed in many cancers. Concurrent DNA methylation of multiple genes occurs in endometrial cancer and is referred to as the CpG island methylator phenotype (CIMP). However, the features and causes of CIMP-positive endometrial cancer are not well understood. To investigate DNA methylation features characteristic of CIMP-positive endometrial cancer, we first classified samples from 25 patients with endometrial cancer based on the methylation status of three genes, i.e. MLH1, CDH1 (E-cadherin) and APC: CIMP-high (CIMP-H, 2/25, 8.0%), CIMP-low (CIMP-L, 7/25, 28.0%) and CIMP-negative (CIMP(-), 16/25, 64.0%). We then selected two samples each from the CIMP-H and CIMP(-) classes, and analyzed the DNA methylation status of both normal (peripheral blood cells: PBCs) and cancer tissues by genome-wide, targeted bisulfite sequencing. Genomes of the CIMP-H cancer tissues were significantly hypermethylated compared to those of the CIMP(-). Surprisingly, in normal tissues of the CIMP-H patients, the promoter region of the miR-663a locus was hypermethylated relative to CIMP(-) samples. Consistent with this finding, miR-663a expression was lower in the CIMP-H PBCs than in the CIMP(-) PBCs. The same region of the miR-663a locus was found to be highly methylated in cancer tissues of both CIMP-H and CIMP(-) cases. This is the first report showing that aberrant DNA methylation of the miR-663a promoter can occur in normal tissue of cancer patients, suggesting a possible link between this epigenetic abnormality and endometrial cancer. This raises the possibility that the hypermethylation of the miR-663a promoter represents an epimutation associated with the CIMP-H endometrial cancers. Based on these findings, the relationship between aberrant DNA methylation and the CIMP-H phenotype is discussed.
The Evolution of Human Handedness
McManus, I C; Davison, Angus; Armour, John A L
2013-01-01
Right- and left-handedness run in families, show greater concordance in monozygotic than dizygotic twins, and are well described by single-locus Mendelian models. Here we summarize a large genome-wide association study (GWAS) that finds no significant associations with handedness and is consistent with a meta-analysis of GWASs. The GWAS had 99% power to detect a single locus using the conventional criterion of P < 5 × 10−8 for the single locus models of McManus and Annett. The strong conclusion is that handedness is not controlled by a single genetic locus. A consideration of the genetic architecture of height, primary ciliary dyskinesia, and intelligence suggests that handedness inheritance can be explained by a multilocus variant of the McManus DC model, classical effects on family and twins being barely distinguishable from the single locus model. Based on the ENGAGE meta-analysis of GWASs, we estimate at least 40 loci are involved in determining handedness. PMID:23631511
A novel locus for dilated cardiomyopathy maps to canine chromosome 8.
Werner, Petra; Raducha, Michael G; Prociuk, Ulana; Sleeper, Meg M; Van Winkle, Thomas J; Henthorn, Paula S
2008-06-01
Dilated cardiomyopathy (DCM), the most common form of cardiomyopathy, often leads to heart failure and sudden death. While a substantial proportion of DCMs are inherited, mutations responsible for the majority of DCMs remain unidentified. A genome-wide linkage study was performed to identify the locus responsible for an autosomal recessive inherited form of juvenile DCM (JDCM) in Portuguese water dogs using 16 families segregating the disease. Results link the JDCM locus to canine chromosome 8 with two-point and multipoint lod scores of 10.8 and 14, respectively. The locus maps to a 3.9-Mb region, with complete syntenic homology to human chromosome 14, that contains no genes or loci known to be involved in the development of any type of cardiomyopathy. This discovery of a DCM locus with a previously unknown etiology will provide a new gene to examine in human DCM patients and a model for testing therapeutic approaches for heart failure.
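To give a sense of scale for the lod scores quoted in this abstract, the following is a minimal sketch of a two-point lod score in the simplest textbook setting: phase-known, fully informative meioses scored as recombinant or non-recombinant. This is a simplification and not the pedigree-likelihood computation a linkage study of 16 families would actually use; the function name and arguments are hypothetical.

```python
import math


def two_point_lod(recombinants, meioses, theta):
    """log10 likelihood ratio of linkage at recombination fraction theta
    versus free recombination (theta = 0.5), for phase-known meioses.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    non_recombinants = meioses - recombinants
    likelihood_theta = theta ** recombinants * (1.0 - theta) ** non_recombinants
    if likelihood_theta == 0.0:
        # theta = 0 is impossible once a recombinant has been observed.
        return float("-inf")
    likelihood_null = 0.5 ** meioses
    return math.log10(likelihood_theta / likelihood_null)
```

In this idealised setting each non-recombinant meiosis adds log10(2), about 0.3, to the lod score at theta = 0, so ten such meioses already pass the conventional threshold of 3.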
Zou, Zhi; Yang, Lifu; Wang, Danhua; Huang, Qixing; Mo, Yeyong; Xie, Guishui
2016-01-01
WRKY proteins comprise one of the largest transcription factor families in plants and form key regulators of many plant processes. This study presents the characterization of 58 WRKY genes from the castor bean (Ricinus communis L., Euphorbiaceae) genome. Compared with the automatic genome annotation, one more WRKY-encoding locus was identified and 20 out of the 57 predicted gene models were manually corrected. All RcWRKY genes were shown to contain at least one intron in their coding sequences. According to the structural features of the present WRKY domains, the identified RcWRKY genes were assigned to three previously defined groups (I-III). Although castor bean underwent no recent whole-genome duplication event like physic nut (Jatropha curcas L., Euphorbiaceae), comparative genomics analysis indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family. The expression of all 58 RcWRKY genes was supported by ESTs and/or RNA sequencing reads derived from roots, leaves, flowers, seeds and endosperms. Further global expression profiles with RNA sequencing data revealed diverse expression patterns among various tissues. Results obtained from this study not only provide valuable information for future functional analysis and utilization of the castor bean WRKY genes, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceous plants.
Llaurens, Violaine; Gonthier, Lucy; Billiard, Sylvain
2009-11-01
Inbreeding depression and mating systems evolution are closely linked, because the purging of deleterious mutations and the fitness of individuals may depend on outcrossing vs. selfing rates. Further, the accumulation of deleterious mutations may vary among genomic regions, especially for genes closely linked to loci under balancing selection. Sporophytic self-incompatibility (SSI) is a common genetic mechanism in angiosperm that enables hermaphrodite plants to avoid selfing and promote outcrossing. The SSI phenotype is determined by the S locus and may depend on dominance relationships among alleles. Since most individuals are heterozygous at the S locus and recombination is suppressed in the S-locus region, it has been suggested that deleterious mutations could accumulate at genes linked to the S locus, generating a "sheltered load." In this article, we first theoretically investigate the conditions generating sheltered load in SSI. We show that deleterious mutations can accumulate in linkage with specific S alleles, and particularly if those S alleles are dominant. Second, we looked for the presence of sheltered load in Arabidopsis halleri using CO(2) gas treatment to overcome self-incompatibility. By examining the segregation of S alleles and measuring the relative fitness of progeny, we found significant sheltered load associated with the most dominant S allele (S15) of three S alleles tested. This sheltered load seems to be expressed at several stages of the life cycle and to have a larger effect than genomic inbreeding depression.
HisB as novel selection marker for gene targeting approaches in Aspergillus niger.
Fiedler, Markus R M; Gensheimer, Tarek; Kubisch, Christin; Meyer, Vera
2017-03-08
For Aspergillus niger, a broad set of auxotrophic and dominant resistance markers is available. However, only a few offer targeted modification of a gene of interest into or at a genomic locus of choice, which hampers functional genomics studies. We thus aimed to extend the available set by generating a histidine auxotrophic strain with a characterized hisB locus for targeted gene integration and deletion in A. niger. A histidine-auxotrophic strain was established via disruption of the A. niger hisB gene by using the counterselectable pyrG marker. After curing, a hisB−, pyrG− strain was obtained, which served as recipient strain for further studies. We show here that both hisB orthologs from A. nidulans and A. niger can be used to reestablish histidine prototrophy in this recipient strain. Whereas the hisB gene from A. nidulans was suitable for efficient gene targeting at different loci in A. niger, the hisB gene from A. niger allowed efficient integration of a Tet-on driven luciferase reporter construct at the endogenous non-functional hisB locus. Subsequent analysis of the luciferase activity revealed that the hisB locus is tight under non-inducing conditions and allows even higher luciferase expression levels compared to the pyrG integration locus. Taken together, we provide here an alternative selection marker for A. niger, hisB, which allows efficient homologous integration rates as well as high expression levels which compare favorably to the well-established pyrG selection marker.
Lowry, David B.; Logan, Tierney L.; Santuari, Luca; Hardtke, Christian S.; Richards, James H.; DeRose-Wilson, Leah J.; McKay, John K.; Sen, Saunak; Juenger, Thomas E.
2013-01-01
The regulation of gene expression is crucial for an organism’s development and response to stress, and an understanding of the evolution of gene expression is of fundamental importance to basic and applied biology. To improve this understanding, we conducted expression quantitative trait locus (eQTL) mapping in the Tsu-1 (Tsushima, Japan) × Kas-1 (Kashmir, India) recombinant inbred line population of Arabidopsis thaliana across soil drying treatments. We then used genome resequencing data to evaluate whether genomic features (promoter polymorphism, recombination rate, gene length, and gene density) are associated with genes responding to the environment (E) or with genes with genetic variation (G) in gene expression in the form of eQTLs. We identified thousands of genes that responded to soil drying and hundreds of main-effect eQTLs. However, we identified very few statistically significant eQTLs that interacted with the soil drying treatment (GxE eQTL). Analysis of genome resequencing data revealed associations of several genomic features with G and E genes. In general, E genes had lower promoter diversity and local recombination rates. By contrast, genes with eQTLs (G) had significantly greater promoter diversity and were located in genomic regions with higher recombination. These results suggest that genomic architecture may play an important role in the evolution of gene expression. PMID:24045022
Zhang, Quan; Ye, Yuzhen
2017-02-06
The CRISPR-Cas systems in prokaryotes are RNA-guided immune systems that target and deactivate foreign nucleic acids. A typical CRISPR-Cas system consists of a CRISPR array of repeat and spacer units, and a locus of cas genes. The CRISPR and the cas locus are often located next to each other in the genomes. However, there is no quantitative estimate of the co-location. In addition, ad-hoc studies have shown that some non-CRISPR genomic elements contain repeat-spacer-like structures and are mistaken for CRISPRs. Using available genome sequences, we observed that a significant number of genomes have isolated cas loci and/or CRISPRs. We found that 11%, 22% and 28% of the type I, II and III cas loci are isolated (without CRISPRs in the same genomes at all or with CRISPRs distant in the genomes), respectively. We identified a large number of genomic elements that superficially resemble CRISPRs but don't contain diverse spacers and have no companion cas genes. We called these elements false-CRISPRs and further classified them into groups, including tandem repeats and Staphylococcus aureus repeat (STAR)-like elements. This is the first systematic study to collect and characterize false-CRISPR elements. We demonstrated that false-CRISPRs could be used to reduce the false annotation of CRISPRs, therefore showing them to be useful for improving the annotation of CRISPR-Cas systems.
Miyaoka, Yuichiro; Berman, Jennifer R; Cooper, Samantha B; Mayerl, Steven J; Chan, Amanda H; Zhang, Bin; Karlin-Neumann, George A; Conklin, Bruce R
2016-03-31
Precise genome-editing relies on the repair of sequence-specific nuclease-induced DNA nicking or double-strand breaks (DSBs) by homology-directed repair (HDR). However, nonhomologous end-joining (NHEJ), an error-prone repair, acts concurrently, reducing the rate of high-fidelity edits. The identification of genome-editing conditions that favor HDR over NHEJ has been hindered by the lack of a simple method to measure HDR and NHEJ directly and simultaneously at endogenous loci. To overcome this challenge, we developed a novel, rapid, digital PCR-based assay that can simultaneously detect one HDR or NHEJ event out of 1,000 copies of the genome. Using this assay, we systematically monitored genome-editing outcomes of CRISPR-associated protein 9 (Cas9), Cas9 nickases, catalytically dead Cas9 fused to FokI, and transcription activator-like effector nuclease at three disease-associated endogenous gene loci in HEK293T cells, HeLa cells, and human induced pluripotent stem cells. Although it is widely thought that NHEJ generally occurs more often than HDR, we found that more HDR than NHEJ was induced under multiple conditions. Surprisingly, the HDR/NHEJ ratios were highly dependent on gene locus, nuclease platform, and cell type. The new assay system, and our findings based on it, will enable mechanistic studies of genome-editing and help improve genome-editing technology.
USDA-ARS's Scientific Manuscript database
Preliminary investigations into the organization of the western corn rootworm (Diabrotica virgifera virgifera; WCR) genome have resulted in low to moderate density gender-specific maps constructed from progeny of a backcrossed, short-diapause WCR family. Maps were based upon variation at microsatel...
Green, Elaine K; Di Florio, Arianna; Forty, Liz; Gordon-Smith, Katherine; Grozeva, Detelina; Fraser, Christine; Richards, Alexander L; Moran, Jennifer L; Purcell, Shaun; Sklar, Pamela; Kirov, George; Owen, Michael J; O'Donovan, Michael C; Craddock, Nick; Jones, Lisa; Jones, Ian R
2017-12-01
Studies have suggested that Research Diagnostic Criteria for Schizoaffective Disorder Bipolar type (RDC-SABP) might identify a more genetically homogenous subgroup of bipolar disorder. Aiming to identify loci associated with RDC-SABP, we have performed a replication study using independent RDC-SABP cases (n = 144) and controls (n = 6,559), focusing on the 10 loci that reached a p-value <10⁻⁵ for RDC-SABP in the Wellcome Trust Case Control Consortium (WTCCC) bipolar disorder sample. Combining the WTCCC and replication datasets by meta-analysis (combined RDC-SABP, n = 423, controls, n = 9,494), we observed genome-wide significant association at one SNP, rs2352974, located within the intron of the gene TRAIP on chromosome 3p21.31 (p-value, 4.37 × 10⁻⁸). This locus did not reach genome-wide significance in the large Psychiatric Genomic Consortium bipolar disorder or schizophrenia datasets, suggesting that it may represent a relatively specific genetic risk for the bipolar subtype of schizoaffective disorder. © 2017 Wiley Periodicals, Inc.
A CRISPR/molecular beacon hybrid system for live-cell genomic imaging.
Wu, Xiaotian; Mao, Shiqi; Yang, Yantao; Rushdi, Muaz N; Krueger, Christopher J; Chen, Antony K
2018-04-30
The clustered regularly interspersed short palindromic repeat (CRISPR) gene-editing system has been repurposed for live-cell genomic imaging, but existing approaches rely on fluorescent protein reporters, making sensitive and continuous imaging difficult. Here, we present a fluorophore-based live-cell genomic imaging system that consists of a nuclease-deactivated mutant of the Cas9 protein (dCas9), a molecular beacon (MB), and an engineered single-guide RNA (sgRNA) harboring a unique MB target sequence (sgRNA-MTS), termed CRISPR/MB. Specifically, dCas9 and sgRNA-MTS are first co-expressed to target a specific locus in cells, followed by delivery of MBs that can then hybridize to MTS to illuminate the target locus. We demonstrated the feasibility of this approach for quantifying genomic loci, for monitoring chromatin dynamics, and for dual-color imaging when using two orthogonal MB/MTS pairs. With flexibility in selecting different combinations of fluorophore/quencher pairs and MB/MTS sequences, our CRISPR/MB hybrid system could be a promising platform for investigating chromatin activities.
A non-canonical transferred DNA insertion at the BRI1 locus in Arabidopsis thaliana.
Zhao, Zhong; Zhu, Yan; Erhardt, Mathieu; Ruan, Ying; Shen, Wen-Hui
2009-04-01
Agrobacterium-mediated transformation is widely used in transgenic plant engineering and has been proven to be a powerful tool for insertional mutagenesis of the plant genome. The transferred DNA (T-DNA) from Agrobacterium is integrated into the plant genome through illegitimate recombination between the T-DNA and the plant DNA. In contrast to canonical insertion, here we report on a locus showing a complex mutation associated with T-DNA insertion at the BRI1 gene in Arabidopsis thaliana. We obtained a mutant line, named salade for its phenotype of dwarf stature and proliferating rosette. Molecular characterization of this mutant revealed that in addition to T-DNA a non-T-DNA-localized transposon from bacteria was inserted in the Arabidopsis genome and that a region of more than 11.5 kb of the Arabidopsis genome was deleted at the insertion site. The deleted region contains the brassinosteroid receptor gene BRI1 and the transcription factor gene WRKY13. Our finding reveals non-canonical T-DNA insertion, implicating horizontal gene transfer and cautioning the use of T-DNA as a mutagen in transgenic research.
The renal urate transporter SLC17A1 locus: confirmation of association with gout.
Hollis-Moffatt, Jade E; Phipps-Green, Amanda J; Chapman, Brett; Jones, Gregory T; van Rij, Andre; Gow, Peter J; Harrison, Andrew A; Highton, John; Jones, Peter B; Montgomery, Grant W; Stamp, Lisa K; Dalbeth, Nicola; Merriman, Tony R
2012-04-27
Two major gout-causing genes have been identified, the urate transport genes SLC2A9 and ABCG2. Variation within the SLC17A1 locus, which encodes sodium-dependent phosphate transporter 1, a renal transporter of uric acid, has also been associated with serum urate concentration. However, evidence for association with gout is equivocal. We investigated the association of the SLC17A1 locus with gout in New Zealand sample sets. Five variants (rs1165196, rs1183201, rs9358890, rs3799344, rs12664474) were genotyped across a New Zealand sample set totaling 971 cases and 1,742 controls. Cases were ascertained according to American Rheumatism Association criteria. Two population groups were studied: Caucasian and Polynesian. At rs1183201 (SLC17A1), evidence for association with gout was observed in both the Caucasian (odds ratio (OR) = 0.67, P = 3.0 × 10⁻⁶) and Polynesian (OR = 0.74, P = 3.0 × 10⁻³) groups. Meta-analysis confirmed association of rs1183201 with gout at a genome-wide level of significance (OR = 0.70, P = 3.0 × 10⁻⁸). Haplotype analysis suggested the presence of a common protective haplotype. We confirm the SLC17A1 locus as the third associated with gout at a genome-wide level of significance.
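The way two group-level estimates combine into a single meta-analysis result can be sketched with an inverse-variance fixed-effect model. The abstract does not say which meta-analysis method the authors used, so this model is an assumption, as is the step of backing out each standard error from the rounded odds ratio and two-sided P value reported for the Caucasian and Polynesian groups. Function names are hypothetical.

```python
import math
from statistics import NormalDist


def se_from_or_and_p(odds_ratio, p_two_sided):
    """Approximate SE of log(OR) implied by a two-sided Wald P value."""
    z = -NormalDist().inv_cdf(p_two_sided / 2.0)
    return abs(math.log(odds_ratio)) / z


def fixed_effect_meta(studies):
    """Inverse-variance fixed-effect pooling of (odds_ratio, p) pairs.

    Returns the pooled odds ratio and its two-sided P value.
    """
    weights = []
    weighted_betas = []
    for odds_ratio, p in studies:
        se = se_from_or_and_p(odds_ratio, p)
        w = 1.0 / se ** 2
        weights.append(w)
        weighted_betas.append(w * math.log(odds_ratio))
    beta = sum(weighted_betas) / sum(weights)
    se_pooled = math.sqrt(1.0 / sum(weights))
    z = abs(beta) / se_pooled
    p_pooled = math.erfc(z / math.sqrt(2.0))  # two-sided normal tail
    return math.exp(beta), p_pooled


# Group-level values as quoted in the abstract (rounded).
pooled_or, pooled_p = fixed_effect_meta([(0.67, 3.0e-6), (0.74, 3.0e-3)])
```

Under these assumptions the pooled odds ratio lands close to the reported 0.70 and the pooled P value is of the same order as the reported one; exact agreement is not expected because the inputs are rounded.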
Lundqvist, M L; Middleton, D L; Hazard, S; Warr, G W
2001-12-14
The region of the duck IgH locus extending from upstream of the proximal diversity (D) segment to downstream of the constant gene cluster has been cloned and mapped. A sequence contig of 48,796 base pairs established that the organization of the genes is D-J(H)-mu-alpha-upsilon. No evidence for a functional homologue (or remnant) of a delta gene was found. The alpha gene is in inverted transcriptional orientation; class switch to IgA expression thus requires inversion of the approximately 27-kilobase pair region that includes both mu and alpha genes. The secreted forms of duck alpha and mu are each encoded by 4 constant region exons, and the hydrophobic C-terminal regions of the membrane receptor forms of alpha and mu are encoded by one and two transmembrane exons, respectively. Putative switch (S) regions were identified for duck mu and upsilon by comparison with chicken Smu and Supsilon sequences and for duck alpha by comparison with mouse Salpha. The duck IgH locus is rich in complex variable number tandem repeats, which occupy approximately 60% of the sequenced region, and occur at a much higher frequency in the IgH locus than in other sequenced regions of the duck genome.
Robb, L; Hilton, D J; Brook-Carter, P T; Begley, C G
1997-03-15
The interleukin-11 receptor alpha-chain, a member of the hematopoietin receptor superfamily, forms, together with gp130, a functional high-affinity receptor complex for interleukin 11. We, and others, reported the cloning of the murine interleukin 11 receptor alpha-chain cDNA (IL11Ra) and recently described the structure of the IL11Ra locus. We also described the presence of a second IL11Ra-like locus in some mouse strains. In this study we report that the second locus, designated IL11Ra2, encodes an mRNA species. The transcript was 99% identical to the IL11Ra transcript in the coding and 3'-untranslated region, but had a different 5'-untranslated region. The complete genomic organization of the IL11Ra2 locus is presented, and the two loci are shown to be located on a 200-kb NaeI genomic fragment. Comparison of the expression pattern of the IL11Ra and IL11Ra2 genes using an RT-PCR restriction fragment length polymorphism strategy revealed that while the expression of IL11Ra was widespread, expression of IL11Ra2 was restricted to testis, lymph node, and thymus.
Genome-wide meta-analysis uncovers novel loci influencing circulating leptin levels.
Kilpeläinen, Tuomas O; Carli, Jayne F Martin; Skowronski, Alicja A; Sun, Qi; Kriebel, Jennifer; Feitosa, Mary F; Hedman, Åsa K; Drong, Alexander W; Hayes, James E; Zhao, Jinghua; Pers, Tune H; Schick, Ursula; Grarup, Niels; Kutalik, Zoltán; Trompet, Stella; Mangino, Massimo; Kristiansson, Kati; Beekman, Marian; Lyytikäinen, Leo-Pekka; Eriksson, Joel; Henneman, Peter; Lahti, Jari; Tanaka, Toshiko; Luan, Jian'an; Del Greco M, Fabiola; Pasko, Dorota; Renström, Frida; Willems, Sara M; Mahajan, Anubha; Rose, Lynda M; Guo, Xiuqing; Liu, Yongmei; Kleber, Marcus E; Pérusse, Louis; Gaunt, Tom; Ahluwalia, Tarunveer S; Ju Sung, Yun; Ramos, Yolande F; Amin, Najaf; Amuzu, Antoinette; Barroso, Inês; Bellis, Claire; Blangero, John; Buckley, Brendan M; Böhringer, Stefan; I Chen, Yii-Der; de Craen, Anton J N; Crosslin, David R; Dale, Caroline E; Dastani, Zari; Day, Felix R; Deelen, Joris; Delgado, Graciela E; Demirkan, Ayse; Finucane, Francis M; Ford, Ian; Garcia, Melissa E; Gieger, Christian; Gustafsson, Stefan; Hallmans, Göran; Hankinson, Susan E; Havulinna, Aki S; Herder, Christian; Hernandez, Dena; Hicks, Andrew A; Hunter, David J; Illig, Thomas; Ingelsson, Erik; Ioan-Facsinay, Andreea; Jansson, John-Olov; Jenny, Nancy S; Jørgensen, Marit E; Jørgensen, Torben; Karlsson, Magnus; Koenig, Wolfgang; Kraft, Peter; Kwekkeboom, Joanneke; Laatikainen, Tiina; Ladwig, Karl-Heinz; LeDuc, Charles A; Lowe, Gordon; Lu, Yingchang; Marques-Vidal, Pedro; Meisinger, Christa; Menni, Cristina; Morris, Andrew P; Myers, Richard H; Männistö, Satu; Nalls, Mike A; Paternoster, Lavinia; Peters, Annette; Pradhan, Aruna D; Rankinen, Tuomo; Rasmussen-Torvik, Laura J; Rathmann, Wolfgang; Rice, Treva K; Brent Richards, J; Ridker, Paul M; Sattar, Naveed; Savage, David B; Söderberg, Stefan; Timpson, Nicholas J; Vandenput, Liesbeth; van Heemst, Diana; Uh, Hae-Won; Vohl, Marie-Claude; Walker, Mark; Wichmann, Heinz-Erich; Widén, Elisabeth; Wood, Andrew R; Yao, Jie; Zeller, Tanja; Zhang, Yiying; Meulenbelt, Ingrid; Kloppenburg, Margreet; Astrup, Arne; Sørensen, Thorkild I A; Sarzynski, Mark A; Rao, D C; Jousilahti, Pekka; Vartiainen, Erkki; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, André G; Kajantie, Eero; Osmond, Clive; Palotie, Aarno; Eriksson, Johan G; Heliövaara, Markku; Knekt, Paul B; Koskinen, Seppo; Jula, Antti; Perola, Markus; Huupponen, Risto K; Viikari, Jorma S; Kähönen, Mika; Lehtimäki, Terho; Raitakari, Olli T; Mellström, Dan; Lorentzon, Mattias; Casas, Juan P; Bandinelli, Stefanie; März, Winfried; Isaacs, Aaron; van Dijk, Ko W; van Duijn, Cornelia M; Harris, Tamara B; Bouchard, Claude; Allison, Matthew A; Chasman, Daniel I; Ohlsson, Claes; Lind, Lars; Scott, Robert A; Langenberg, Claudia; Wareham, Nicholas J; Ferrucci, Luigi; Frayling, Timothy M; Pramstaller, Peter P; Borecki, Ingrid B; Waterworth, Dawn M; Bergmann, Sven; Waeber, Gérard; Vollenweider, Peter; Vestergaard, Henrik; Hansen, Torben; Pedersen, Oluf; Hu, Frank B; Eline Slagboom, P; Grallert, Harald; Spector, Tim D; Jukema, J W; Klein, Robert J; Schadt, Erik E; Franks, Paul W; Lindgren, Cecilia M; Leibel, Rudolph L; Loos, Ruth J F
2016-02-01
Leptin is an adipocyte-secreted hormone, the circulating levels of which correlate closely with overall adiposity. Although rare mutations in the leptin (LEP) gene are well known to cause leptin deficiency and severe obesity, no common loci regulating circulating leptin levels have been uncovered. Therefore, we performed a genome-wide association study (GWAS) of circulating leptin levels from 32,161 individuals and followed up loci reaching P<10(-6) in 19,979 additional individuals. We identify five loci robustly associated (P<5 × 10(-8)) with leptin levels in/near LEP, SLC32A1, GCKR, CCNL1 and FTO. Although the association of the FTO obesity locus with leptin levels is abolished by adjustment for BMI, associations of the four other loci are independent of adiposity. The GCKR locus was found associated with multiple metabolic traits in previous GWAS and the CCNL1 locus with birth weight. Knockdown experiments in mouse adipose tissue explants show convincing evidence for adipogenin, a regulator of adipocyte differentiation, as the novel causal gene in the SLC32A1 locus influencing leptin levels. Our findings provide novel insights into the regulation of leptin production by adipose tissue and open new avenues for examining the influence of variation in leptin levels on adiposity and metabolic health.
Libioulle, Cécile; Louis, Edouard; Hansoul, Sarah; Sandor, Cynthia; Farnir, Frédéric; Franchimont, Denis; Vermeire, Séverine; Dewit, Olivier; de Vos, Martine; Dixon, Anna; Demarche, Bruno; Gut, Ivo; Heath, Simon; Foglio, Mario; Liang, Liming; Laukens, Debby; Mni, Myriam; Zelenika, Diana; Van Gossum, André; Rutgeerts, Paul; Belaiche, Jacques; Lathrop, Mark; Georges, Michel
2007-04-20
To identify novel susceptibility loci for Crohn disease (CD), we undertook a genome-wide association study with more than 300,000 SNPs characterized in 547 patients and 928 controls. We found three chromosome regions that provided evidence of disease association with p-values between 10(-6) and 10(-9). Two of these (IL23R on Chromosome 1 and CARD15 on Chromosome 16) correspond to genes previously reported to be associated with CD. In addition, a 250-kb region of Chromosome 5p13.1 was found to contain multiple markers with strongly suggestive evidence of disease association (including four markers with p < 10(-7)). We replicated the results for 5p13.1 by studying 1,266 additional CD patients, 559 additional controls, and 428 trios. Significant evidence of association (p < 4 x 10(-4)) was found in case/control comparisons with the replication data, while associated alleles were over-transmitted to affected offspring (p < 0.05), thus confirming that the 5p13.1 locus contributes to CD susceptibility. The CD-associated 250-kb region was saturated with 111 SNP markers. Haplotype analysis supports a complex locus architecture with multiple variants contributing to disease susceptibility. The novel 5p13.1 CD locus is contained within a 1.25-Mb gene desert. We present evidence that disease-associated alleles correlate with quantitative expression levels of the prostaglandin receptor EP4, PTGER4, the gene that resides closest to the associated region. Our results identify a major new susceptibility locus for CD, and suggest that genetic variants associated with disease risk at this locus could modulate cis-acting regulatory elements of PTGER4.
Libioulle, Cécile; Louis, Edouard; Hansoul, Sarah; Sandor, Cynthia; Farnir, Frédéric; Franchimont, Denis; Vermeire, Séverine; Dewit, Olivier; de Vos, Martine; Dixon, Anna; Demarche, Bruno; Gut, Ivo; Heath, Simon; Foglio, Mario; Liang, Liming; Laukens, Debby; Mni, Myriam; Zelenika, Diana; Gossum, André Van; Rutgeerts, Paul; Belaiche, Jacques; Lathrop, Mark; Georges, Michel
2007-01-01
To identify novel susceptibility loci for Crohn disease (CD), we undertook a genome-wide association study with more than 300,000 SNPs characterized in 547 patients and 928 controls. We found three chromosome regions that provided evidence of disease association with p-values between 10−6 and 10−9. Two of these (IL23R on Chromosome 1 and CARD15 on Chromosome 16) correspond to genes previously reported to be associated with CD. In addition, a 250-kb region of Chromosome 5p13.1 was found to contain multiple markers with strongly suggestive evidence of disease association (including four markers with p < 10−7). We replicated the results for 5p13.1 by studying 1,266 additional CD patients, 559 additional controls, and 428 trios. Significant evidence of association (p < 4 × 10−4) was found in case/control comparisons with the replication data, while associated alleles were over-transmitted to affected offspring (p < 0.05), thus confirming that the 5p13.1 locus contributes to CD susceptibility. The CD-associated 250-kb region was saturated with 111 SNP markers. Haplotype analysis supports a complex locus architecture with multiple variants contributing to disease susceptibility. The novel 5p13.1 CD locus is contained within a 1.25-Mb gene desert. We present evidence that disease-associated alleles correlate with quantitative expression levels of the prostaglandin receptor EP4, PTGER4, the gene that resides closest to the associated region. Our results identify a major new susceptibility locus for CD, and suggest that genetic variants associated with disease risk at this locus could modulate cis-acting regulatory elements of PTGER4. PMID:17447842
Peng, Suotang; Xu, Qun; Yuan, Xiaoping; Feng, Yue; Yu, Hanyong; Wang, Yiping; Wei, Xinghua
2014-01-01
Wild species of Oryza are extremely valuable sources of genetic material that can be used to broaden the genetic background of cultivated rice, and to increase its resistance to abiotic and biotic stresses. Until recently, there was no sequence information for the BBCC Oryza genome; therefore, no special markers had been developed for this genome type. The lack of suitable markers made it difficult to search for valuable genes in the BBCC genome. The aim of this study was to develop microsatellite markers for the BBCC genome. We obtained 13,991 SSR-containing sequences and designed 14,508 primer pairs. The most abundant was hexanucleotide (31.39%), followed by trinucleotide (27.67%) and dinucleotide (19.04%). 600 markers were selected for validation in 23 accessions of Oryza species with the BBCC genome. A set of 495 markers produced clear amplified fragments of the expected sizes. The average number of alleles per locus (Na) was 2.5, ranging from 1 to 9. The genetic diversity per locus (He) ranged from 0 to 0.844 with a mean of 0.333. The mean polymorphism information content (PIC) was 0.290, and ranged from 0 to 0.825. Of the 495 markers, 12 were only found in the BB genome, 173 were unique to the CC genome, and 198 were also present in the AA genome. These microsatellite markers could be used to evaluate the phylogenetic relationships among different Oryza genomes, and to construct a genetic linkage map for locating and identifying valuable genes in the BBCC genome, and would also be useful for marker-assisted breeding programs that include accessions with the AA genome, especially Oryza sativa. PMID:24632997
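The per-locus statistics quoted in the abstract above (He and PIC) can be illustrated with a minimal Python sketch. It assumes the commonly used definitions, Nei's uncorrected gene diversity He = 1 − Σp² and the Botstein et al. polymorphism information content; the abstract does not state which estimators or software the authors used, and the function names and the example allele calls are hypothetical.

```python
from collections import Counter
from itertools import combinations


def allele_frequencies(alleles):
    """Return allele frequencies from a flat list of observed allele calls."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return [count / total for count in counts.values()]


def expected_heterozygosity(freqs):
    """Gene diversity He = 1 - sum(p_i^2) (assumed definition, no sample-size correction)."""
    return 1.0 - sum(p * p for p in freqs)


def polymorphism_information_content(freqs):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2 (assumed definition)."""
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term


if __name__ == "__main__":
    # Hypothetical fragment sizes (bp) scored at one SSR locus.
    calls = [150, 150, 154, 154, 158, 150]
    freqs = allele_frequencies(calls)
    print(expected_heterozygosity(freqs), polymorphism_information_content(freqs))
```

Under these definitions a monomorphic locus gives He = PIC = 0, which matches the lower bound of the ranges reported in the abstract.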
GWAS meta-analysis of 16 852 women identifies new susceptibility locus for endometrial cancer.
Chen, Maxine M; O'Mara, Tracy A; Thompson, Deborah J; Painter, Jodie N; Attia, John; Black, Amanda; Brinton, Louise; Chanock, Stephen; Chen, Chu; Cheng, Timothy Ht; Cook, Linda S; Crous-Bou, Marta; Doherty, Jennifer; Friedenreich, Christine M; Garcia-Closas, Montserrat; Gaudet, Mia M; Gorman, Maggie; Haiman, Christopher; Hankinson, Susan E; Hartge, Patricia; Henderson, Brian E; Hodgson, Shirley; Holliday, Elizabeth G; Horn-Ross, Pamela L; Hunter, David J; Le Marchand, Loic; Liang, Xiaolin; Lissowska, Jolanta; Long, Jirong; Lu, Lingeng; Magliocco, Anthony M; Martin, Lynn; McEvoy, Mark; Olson, Sara H; Orlow, Irene; Pooler, Loreall; Prescott, Jennifer; Rastogi, Radhai; Rebbeck, Timothy R; Risch, Harvey; Sacerdote, Carlotta; Schumacher, Frederick; Wendy Setiawan, Veronica; Scott, Rodney J; Sheng, Xin; Shu, Xiao-Ou; Turman, Constance; Van Den Berg, David; Wang, Zhaoming; Weiss, Noel S; Wentzensen, Nicholas; Xia, Lucy; Xiang, Yong-Bing; Yang, Hannah P; Yu, Herbert; Zheng, Wei; Pharoah, Paul D P; Dunning, Alison M; Tomlinson, Ian; Easton, Douglas F; Kraft, Peter; Spurdle, Amanda B; De Vivo, Immaculata
2016-06-15
Endometrial cancer is the most common gynecological malignancy in the developed world. Although there is evidence of genetic predisposition to the disease, most of the genetic risk remains unexplained. We present the meta-analysis results of four genome-wide association studies (4907 cases and 11 945 controls total) in women of European ancestry. We describe one new locus reaching genome-wide significance (P < 5 × 10−8) at 6p22.3 (rs1740828; P = 2.29 × 10−8, OR = 1.20), providing evidence of an additional region of interest for genetic susceptibility to endometrial cancer. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Jasinska, Anna J.; Zelaya, Ivette; Service, Susan K.; Peterson, Christine B.; Cantor, Rita M.; Choi, Oi-Wa; DeYoung, Joseph; Eskin, Eleazar; Fairbanks, Lynn A.; Fears, Scott; Furterer, Allison E.; Huang, Yu S.; Ramensky, Vasily; Schmitt, Christopher A.; Svardal, Hannes; Jorgensen, Matthew J.; Kaplan, Jay R.; Villar, Diego; Aken, Bronwen L.; Flicek, Paul; Nag, Rishi; Wong, Emily S.; Blangero, John; Dyer, Thomas D.; Bogomolov, Marina; Benjamini, Yoav; Weinstock, George M.; Dewar, Ken; Sabatti, Chiara; Wilson, Richard K.; Jentsch, J. David; Warren, Wesley; Coppola, Giovanni; Woods, Roger P.; Freimer, Nelson B.
2017-01-01
By analyzing multi-tissue gene expression and genome-wide genetic variation data in samples from a vervet monkey pedigree, we generated a transcriptome resource and produced the first catalogue of expression quantitative trait loci (eQTLs) in a non-human primate model. This catalogue contains more genome-wide significant eQTLs, per sample, than comparable human resources, and reveals sex and age-related expression patterns. Findings include a master regulatory locus that likely plays a role in immune function, and a locus regulating hippocampal long non-coding RNAs (lncRNAs), whose expression correlates with hippocampal volume. This resource will facilitate genetic investigation of quantitative traits, including brain and behavioral phenotypes relevant to neuropsychiatric disorders. PMID:29083405
Wu, Zongfu; Wang, Weixue; Tang, Min; Shao, Jing; Dai, Chen; Zhang, Wei; Fan, Hongjie; Yao, Huochun; Zong, Jie; Chen, Dai; Wang, Junning; Lu, Chengping
2014-02-10
Streptococcus suis (SS) is an important swine pathogen worldwide that occasionally causes serious infections in humans. SS infection may result in meningitis in pigs and humans. The pathogenic mechanisms of SS are poorly understood. Here, we provide the complete genome sequence of S. suis serotype 2 (SS2) strain SC070731 isolated from a pig with meningitis. The chromosome is 2,138,568 bp in length. There are 1933 predicted protein coding sequences and 96.7% (57/59) of the known virulence-associated genes are present in the genome. Strain SC070731 showed virulence similar to that of SS2 virulent strains HA9801 and ZY05719, but was more virulent than SS2 virulent strain P1/7 in the zebrafish infection model. Comparative genomic analysis revealed a unique 105K genomic island in strain SC070731 that is absent in seven other sequenced SS2 strains. Further analysis of the 105K genomic island indicated that it contained a complete nisin locus similar to the nisin U locus in S. uberis strain 42, a prophage similar to S. oralis phage PH10 and several antibiotic resistance genes. Several proteins in the 105K genomic island, including nisin and the RelBE toxin-antitoxin system, contribute to bacterial fitness and virulence in other pathogenic bacteria. Further investigation of newly identified gene products, including four putative new virulence-associated surface proteins, will improve our understanding of SS pathogenesis. Copyright © 2013 Elsevier B.V. All rights reserved.
2014-06-01
Specifically, we combined the CRISPR genome editing system with a novel approach allowing efficient single cell cloning of Drosophila cells with the aim of...and culture these to produce cultures completely lacking wildtype sequence at the target locus. No robust methods existed to clone single Drosophila ...targeting all kinases and phosphatases (563 genes) in the Drosophila genome . 65 samples that displayed synthetic lethality (15 genes) or synthetic
Mapping a candidate gene (MdMYB10) for red flesh and foliage colour in apple
Chagné, David; Carlisle, Charmaine M; Blond, Céline; Volz, Richard K; Whitworth, Claire J; Oraguzie, Nnadozie C; Crowhurst, Ross N; Allan, Andrew C; Espley, Richard V; Hellens, Roger P; Gardiner, Susan E
2007-01-01
Background Integrating plant genomics and classical breeding is a challenge for both plant breeders and molecular biologists. Marker-assisted selection (MAS) is a tool that can be used to accelerate the development of novel apple varieties such as cultivars that have fruit with anthocyanin through to the core. In addition, determining the inheritance of novel alleles, such as the one responsible for red flesh, adds to our understanding of allelic variation. Our goal was to map candidate anthocyanin biosynthetic and regulatory genes in a population segregating for the red flesh phenotypes. Results We have identified the Rni locus, a major genetic determinant of the red foliage and red colour in the core of apple fruit. In a population segregating for the red flesh and foliage phenotype we have determined the inheritance of the Rni locus and DNA polymorphisms of candidate anthocyanin biosynthetic and regulatory genes. Simple Sequence Repeats (SSRs) and Single Nucleotide Polymorphisms (SNPs) in the candidate genes were also located on an apple genetic map. We have shown that the MdMYB10 gene co-segregates with the Rni locus and is on Linkage Group (LG) 09 of the apple genome. Conclusion We have performed candidate gene mapping in a fruit tree crop and have provided genetic evidence that red colouration in the fruit core as well as red foliage are both controlled by a single locus named Rni. We have shown that the transcription factor MdMYB10 may be the gene underlying Rni as there were no recombinants between the marker for this gene and the red phenotype in a population of 516 individuals. Associating markers derived from candidate genes with a desirable phenotypic trait has demonstrated the application of genomic tools in a breeding programme of a horticultural crop species. PMID:17608951
Buil, Alfonso; Souto, Juan Carlos; Saut, Noémie; Germain, Marine; Rotival, Maxime; Tiret, Laurence; Cambien, François; Lathrop, Mark; Zeller, Tanja; Alessi, Marie-Christine; Rodriguez de Cordoba, Santiago; Münzel, Thomas; Wild, Philipp; Fontcuberta, Jordi; Gagnon, France; Emmerich, Joseph; Almasy, Laura; Blankenberg, Stefan; Soria, José-Manuel; Morange, Pierre-Emmanuel
2010-01-01
Through its binding with protein S (PS), a key element of the coagulation/fibrinolysis cascade, the C4b-binding protein (C4BP) has been hypothesized to be involved in the susceptibility to venous thrombosis (VT). To identify genetic factors that may influence the plasma levels of the 3 C4BP existing isoforms, α7β1, α6β1, and α7β0, we conducted a genome-wide association study by analyzing 283 437 single nucleotide polymorphisms (SNPs) in the Genetic Analysis of Idiopathic Thrombophilia (GAIT) study composed of 352 persons. Three SNPs at the C4BPB/C4BPA locus were found genome-wide significantly associated with α7β0 levels. One of these SNPs was further found to explain approximately 11% of the variability of mRNA C4BPA expression in the Gutenberg Heart Study composed of 1490 persons, with no effect on C4BPB mRNA expression. The allele associated with increased α7β0 plasma levels and increased C4BPA expression was further found associated with increased risk of VT (odds ratio [OR] = 1.24 [1.03-1.53]) in 2 independent case-control studies (MARseille THrombosis Association study [MARTHA] and FActeurs de RIsque et de récidives de la maladie thromboembolique VEineuse [FARIVE]) gathering 1706 cases and 1379 controls. This SNP was not associated with free PS or total PS. In conclusion, we observed strong evidence that the C4BPB/C4BPA locus is a new susceptibility locus for VT through a PS-independent mechanism that remains to be elucidated. PMID:20212171
Regulatory Features for Odorant Receptor Genes in the Mouse Genome.
Degl'Innocenti, Andrea; D'Errico, Anna
2017-01-01
The odorant receptor genes, seven transmembrane receptor genes constituting the vastest mammalian gene multifamily, are expressed monogenically and monoallelically in each sensory neuron in the olfactory epithelium. This characteristic, often referred to as the one neuron-one receptor rule, is driven by mostly uncharacterized molecular dynamics, generally named odorant receptor gene choice. Much attention has been paid by the scientific community to the identification of sequences regulating the expression of odorant receptor genes within their loci, where related genes are usually arranged in genomic clusters. A number of studies identified transcription factor binding sites on odorant receptor promoter sequences. Similar binding sites were also found on a number of enhancers that regulate in cis their transcription, but have been proposed to form interchromosomal networks. Odorant receptor gene choice seems to occur via the local removal of strongly repressive epigenetic markings, put in place during the maturation of the sensory neuron on each odorant receptor locus. Here we review the fast-changing state of the art for the study of regulatory features for odorant receptor genes.
Genome complexity in the coelacanth is reflected in its adaptive immune system
Saha, Nil Ratan; Ota, Tatsuya; Litman, Gary W.; Hansen, John; Parra, Zuly; Hsu, Ellen; Buonocore, Francesco; Canapa, Adriana; Cheng, Jan-Fang; Amemiya, Chris T.
2014-01-01
We have analyzed the available genome and transcriptome resources from the coelacanth in order to characterize genes involved in adaptive immunity. Two highly distinctive IgW-encoding loci have been identified that exhibit a unique genomic organization, including a multiplicity of tandemly repeated constant region exons. The overall organization of the IgW loci precludes typical heavy chain class switching. A locus encoding IgM could not be identified either computationally or by using several different experimental strategies. Four distinct sets of genes encoding Ig light chains were identified. This includes a variant sigma-type Ig light chain previously identified only in cartilaginous fishes and which is now provisionally denoted sigma-2. Genes encoding α/β and γ/δ T-cell receptors, and CD3, CD4, and CD8 co-receptors also were characterized. Ig heavy chain variable region genes and TCR components are interspersed within the TCR α/δ locus; this organization previously was reported only in tetrapods and raises questions regarding evolution and functional cooption of genes encoding variable regions. The composition, organization and syntenic conservation of the major histocompatibility complex locus have been characterized. We also identified large numbers of genes encoding cytokines and their receptors, and other genes associated with adaptive immunity. In terms of sequence identity and organization, the adaptive immune genes of the coelacanth more closely resemble orthologous genes in tetrapods than those in teleost fishes, consistent with current phylogenomic interpretations. Overall, the work described herein highlights the complexity inherent in the coelacanth genome and provides a rich catalog of immune genes for future investigations.
Divergence with gene flow across a speciation continuum of Heliconius butterflies.
Supple, Megan A; Papa, Riccardo; Hines, Heather M; McMillan, W Owen; Counterman, Brian A
2015-09-24
A key to understanding the origins of species is determining the evolutionary processes that drive the patterns of genomic divergence during speciation. New genomic technologies enable the study of high-resolution genomic patterns of divergence across natural speciation continua, where taxa pairs with different levels of reproductive isolation can be used as proxies for different stages of speciation. Empirical studies of these speciation continua can provide valuable insights into how genomes diverge during speciation. We examine variation across a handful of genomic regions in parapatric and allopatric populations of Heliconius butterflies with varying levels of reproductive isolation. Genome sequences were mapped to 2.2-Mb of the H. erato genome, including 1-Mb across the red color pattern locus and multiple regions unlinked to color pattern variation. Phylogenetic analyses reveal a speciation continuum of pairs of hybridizing races and incipient species in the Heliconius erato clade. Comparisons of hybridizing pairs of divergently colored races and incipient species reveal that genomic divergence increases with ecological and reproductive isolation, not only across the locus responsible for adaptive variation in red wing coloration, but also at genomic regions unlinked to color pattern. We observe high levels of divergence between the incipient species H. erato and H. himera, suggesting that divergence may accumulate early in the speciation process. Comparisons of genomic divergence between the incipient species and allopatric races suggest that limited gene flow cannot account for the observed high levels of divergence between the incipient species. Our results provide a reconstruction of the speciation continuum across the H. erato clade and provide insights into the processes that drive genomic divergence during speciation, establishing the H. erato clade as a powerful framework for the study of speciation.
Genome-wide significant risk associations for mucinous ovarian carcinoma
Kelemen, Linda E.; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; M. Lee, Janet; Seo, Ji-Heui; Phelan, Catherine M.; Beesley, Jonathan; Chen, Xiaoqin; Spindler, Tassja J.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chen, Y. Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Engelholm, Svend Aage; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kjaer, Susanne K.; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moes-Sosnowska, Joanna; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste Leigh; Pejovic, 
Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Wlodzimierz, Sawicki; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Freedman, Matthew L.; Chenevix-Trench, Georgia; Pharoah, Paul D.; Gayther, Simon A.; Berchuck, Andrew
2015-01-01
Genome-wide association studies have identified several risk associations for ovarian carcinomas (OC) but not for mucinous ovarian carcinomas (MOC). Genotypes from OC cases and controls were imputed into the 1000 Genomes Project reference panel. Analysis of 1,644 MOC cases and 21,693 controls identified three novel risk associations: rs752590 at 2q13 (P = 3.3 × 10−8), rs711830 at 2q31.1 (P = 7.5 × 10−12) and rs688187 at 19q13.2 (P = 6.8 × 10−13). Expression Quantitative Trait Locus (eQTL) analysis in ovarian and colorectal tumors (which are histologically similar to MOC) identified significant eQTL associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10−4, FDR = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors, and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease. PMID:26075790
2013-01-01
Background Cucumber is an important vegetable crop that is susceptible to many pathogens, but no disease resistance (R) genes have been cloned. The availability of whole genome sequences provides an excellent opportunity for systematic identification and characterization of the nucleotide binding and leucine-rich repeat (NB-LRR) type R gene homolog (RGH) sequences in the genome. Cucumber has a very narrow genetic base making it difficult to construct high-density genetic maps. Development of a consensus map by synthesizing information from multiple segregating populations is a method of choice to increase marker density. As such, the objectives of the present study were to identify and characterize NB-LRR type RGHs, and to develop a high-density, integrated cucumber genetic-physical map anchored with RGH loci. Results From the Gy14 draft genome, 70 NB-containing RGHs were identified and characterized. Most RGHs were in clusters with uneven distribution across seven chromosomes. In silico analysis indicated that all 70 RGHs had EST support for gene expression. Phylogenetic analysis classified 58 RGHs into two clades: CNL and TNL. Comparative analysis revealed high-degree sequence homology and synteny in chromosomal locations of these RGH members between the cucumber and melon genomes. Fifty-four molecular markers were developed to delimit 67 of the 70 RGHs, which were integrated into a genetic map through linkage analysis. A 1,681-locus cucumber consensus map including 10 gene loci and spanning 730.0 cM in seven linkage groups was developed by integrating three component maps with a bin-mapping strategy. Physically, 308 scaffolds with 193.2 Mbp total DNA sequences were anchored onto this consensus map that covered 52.6% of the 367 Mbp cucumber genome. Conclusions Cucumber contains relatively few NB-LRR RGHs that are clustered and unevenly distributed in the genome. 
All RGHs seem to be transcribed and shared significant sequence homology and synteny with the melon genome suggesting conservation of these RGHs in the Cucumis lineage. The 1,681-locus consensus genetic-physical map developed and the RGHs identified and characterized herein are valuable genomics resources that may have many applications such as quantitative trait loci identification, map-based gene cloning, association mapping, marker-assisted selection, as well as assembly of a more complete cucumber genome. PMID:23531125
Chen, Xiaoping; Wang, Haijian; Zhou, Gangqiao; Zhang, Xiumei; Dong, Xiaojia; Zhi, Lianteng; Jin, Li; He, Fuchu
2009-01-01
Background The human CYP3A gene cluster codes for cytochrome P450 (CYP) subfamily enzymes that catalyze the metabolism of various exogenous and endogenous chemicals and is an obvious candidate for evolutionary and environmental genomic study. Functional variants in the CYP3A locus may have undergone a selective sweep in response to various environmental conditions. Objective The goal of this study was to profile the allelic structure across the human CYP3A locus and investigate natural selection on that locus. Methods From the CYP3A locus spanning 231 kb, we resequenced 54 genomic DNA fragments (a total of 43,675 bases) spanning four genes (CYP3A4, CYP3A5, CYP3A7, and CYP3A43) and two pseudogenes (CYP3AP1 and CYP3AP2), and randomly selected intergenic regions at the CYP3A locus in Africans (24 individuals), Caucasians (24 individuals), and Chinese (29 individuals). We comprehensively investigated the nucleotide diversity and haplotype structure and examined the possible role of natural selection in shaping the sequence variation throughout the gene cluster. Results Neutrality tests with Tajima’s D, Fu and Li’s D* and F*, and Fay and Wu’s H indicated possible roles of positive selection on the entire CYP3A locus in non-Africans. Sliding-window analyses of nucleotide diversity and frequency spectrum, as well as haplotype diversity and phylogenetically inferred haplotype structure, revealed that CYP3A4 and CYP3A7 had recently undergone or were undergoing a selective sweep in all three populations, whereas CYP3A43 and CYP3A5 were undergoing a selective sweep in non-Africans and Caucasians, respectively. Conclusion The refined allelic architecture and selection spectrum for the human CYP3A locus highlight that evolutionary dynamics of molecular adaptation may underlie the phenotypic variation of the xenobiotic disposition system and varied predisposition to complex disorders in which xenobiotics play a role. PMID:20019904
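One of the neutrality tests named in the abstract above, Tajima's D, can be sketched in Python as follows. This is a minimal illustration using the standard textbook formula on a toy alignment, not the authors' pipeline; the function name and the example sequences are hypothetical, and gaps or missing data are not handled.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(seqs):
    """Tajima's D for equal-length aligned sequences.

    Returns None when there are no segregating sites.
    """
    n = len(seqs)
    if n < 4:
        # The variance term is zero for n < 4, so D is undefined.
        raise ValueError("need at least 4 sequences")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")

    # S: number of alignment columns with more than one distinct state.
    seg_sites = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    if seg_sites == 0:
        return None

    # pi: mean number of pairwise differences.
    pairs = list(combinations(seqs, 2))
    pi = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs) / len(pairs)

    # Constants of the standard formula.
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / (i * i) for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / (a1 * a1)
    e1 = c1 / a1
    e2 = c2 / (a1 * a1 + a2)

    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (pi - seg_sites / a1) / sqrt(variance)


if __name__ == "__main__":
    # Toy alignment in which every variant is a singleton (excess of rare alleles).
    print(tajimas_d(["AAAA", "TAAA", "ATAA", "AATA"]))
```

An excess of rare variants gives a negative D, as is often seen after a selective sweep or population expansion, while an excess of intermediate-frequency variants gives a positive D.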
Matana, Antonela; Popović, Marijana; Boutin, Thibaud; Torlak, Vesela; Brdar, Dubravka; Gunjača, Ivana; Kolčić, Ivana; Boraska Perica, Vesna; Punda, Ante; Polašek, Ozren; Hayward, Caroline; Barbalić, Maja; Zemunik, Tatijana
2018-04-18
Autoimmune thyroid diseases (AITD) are multifactorial endocrine diseases most frequently accompanied by Tg and TPO autoantibodies. Both antibodies have a higher prevalence in females and act under a strong genetic influence. To identify novel variants underlying thyroid antibody levels, we performed GWAS meta-analysis on the plasma levels of TgAb and TPOAb in three Croatian cohorts, as well as gender-specific GWAS and a bivariate analysis. No significant association was detected with the level of TgAb and TPOAb in the meta-analysis of GWAS or bivariate results for all individuals. The bivariate analysis in females only revealed a genome-wide significant association for the locus near GRIN3A (rs4457391, P = 7.76 × 10−9). The same locus had borderline association with TPOAb levels in females (rs1935377, P = 8.58 × 10−8). In conclusion, we identified a novel gender-specific locus associated with TgAb and TPOAb levels. Our findings provide a novel insight into genetic and gender differences associated with thyroid antibodies. Copyright © 2018 Elsevier Inc. All rights reserved.
Yoo, Eung Jae; Cajiao, Isabela; Kim, Jeong-Seon; Kimura, Atsushi P.; Zhang, Aiwen; Cooke, Nancy E.; Liebhaber, Stephen A.
2006-01-01
Random assortment within mammalian genomes juxtaposes genes with distinct expression profiles. This organization, along with the prevalence of long-range regulatory controls, generates a potential for aberrant transcriptional interactions. The human CD79b/GH locus contains six tightly linked genes with three mutually exclusive tissue specificities and interdigitated control elements. One consequence of this compact organization is that the pituitary cell-specific transcriptional events that activate hGH-N also trigger ectopic activation of CD79b. However, the B-cell-specific events that activate CD79b do not trigger reciprocal activation of hGH-N. Here we utilized DNase I hypersensitive site mapping, chromatin immunoprecipitation, and transgenic models to explore the basis for this asymmetric relationship. The results reveal tissue-specific patterns of chromatin structures and transcriptional controls at the CD79b/GH locus in B cells distinct from those in the pituitary gland and placenta. These three unique transcriptional environments suggest a set of corresponding gene expression pathways and transcriptional interactions that are likely to be found juxtaposed at multiple sites within the eukaryotic genome. PMID:16847312
A common variant mapping to CACNA1A is associated with susceptibility to Exfoliation syndrome
Aung, Tin; Ozaki, Mineo; Mizoguchi, Takanori; Allingham, R Rand; Li, Zheng; Haripriya, Aravind; Nakano, Satoko; Uebe, Steffen; Harder, Jeffrey M.; Chan, Anita S.Y.; Lee, Mei Chin; Burdon, Kathryn P.; Astakhov, Yury S.; Abu-Amero, Khaled K.; Zenteno, Juan C.; Nilgün, Yildirim; Zarnowski, Tomasz; Pakravan, Mohammad; Safieh, Leen Abu; Jia, Liyun; Wang, Ya Xing; Williams, Susan; Paoli, Daniela; Schlottmann, Patricio G; Huang, Lulin; Sim, Kar Seng; Foo, Jia Nee; Nakano, Masakazu; Ikeda, Yoko; Kumar, Rajesh S; Ueno, Morio; Manabe, Shin-ichi; Hayashi, Ken; Kazama, Shigeyasu; Ideta, Ryuichi; Mori, Yosai; Miyata, Kazunori; Sugiyama, Kazuhisa; Higashide, Tomomi; Chihara, Etsuo; Inoue, Kenji; Ishiko, Satoshi; Yoshida, Akitoshi; Yanagi, Masahide; Kiuchi, Yoshiaki; Aihara, Makoto; Ohashi, Tsutomu; Sakurai, Toshiya; Sugimoto, Takako; Chuman, Hideki; Matsuda, Fumihiko; Yamashiro, Kenji; Gotoh, Norimoto; Miyake, Masahiro; Astakhov, Sergei Y.; Osman, Essam A.; Al-Obeidan, Saleh A.; Owaidhah, Ohoud; Al-Jasim, Leyla; Al Shahwan, Sami; Fogarty, Rhys A.; Leo, Paul; Yetkin, Yaz; Oğuz, Çilingir; Kanavi, Mozhgan Rezaei; Beni, Afsaneh Naderi; Yazdani, Shahin; Akopov, Evgeny L.; Toh, Kai-Yee; Howell, Gareth R; Orr, Andrew C.; Goh, Yufen; Meah, Wee Yang; Peh, Su Qin; Kosior-Jarecka, Ewa; Lukasik, Urszula; Krumbiegel, Mandy; Vithana, Eranga N; Wong, Tien Yin; Liu, Yutao; Ashley Koch, Allison E.; Challa, Pratap; Rautenbach, Robyn M; Mackey, David A.; Hewitt, Alex W; Mitchell, Paul; Wang, Jie Jin; Ziskind, Ari; Carmichael, Trevor; Ramakrishnan, Rangappa; Narendran, Kalpana; Venkatesh, Rangaraj; Vijayan, Saravanan; Zhao, Peiquan; Chen, Xueyi; Guadarrama-Vallejo, Dalia; Cheng, Ching Yu; Perera, Shamira A; Husain, Rahat; Ho, Su-Ling; Welge-Luessen, Ulrich-Christoph; Mardin, Christian; Schloetzer-Schrehardt, Ursula; Hillmer, Axel M.; Herms, Stefan; Moebus, Susanne; Nöthen, Markus M.; Weisschuh, Nicole; Shetty, Rohit; Ghosh, Arkasubhra; Teo, Yik Ying; Brown, Matthew A; Lischinsky, Ignacio; Crowston, 
Jonathan G; Coote, Michael; Zhao, Bowen; Sang, Jinghong; Zhang, Nihong; You, Qisheng; Vysochinskaya, Vera; Founti, Panayiota; Chatzikyriakidou, Anthoula; Lambropoulos, Alexandros; Anastasopoulos, Eleftherios; Coleman, Anne L; Wilson, M Roy; Rhee, Douglas J; Kang, Jae Hee; May-Bolchakova, Inna; Heegaard, Steffen; Mori, Kazuhiko; Alward, Wallace L.M.; Jonas, Jost B; Xu, Liang; Liebmann, Jeffrey M; Chowbay, Balram; Schaeffeler, Elke; Schwab, Matthias; Lerner, Fabian; Wang, Ningli; Yang, Zhenglin; Frezzotti, Paolo; Kinoshita, Shigeru; Fingert, John H.; Inatani, Masaru; Tashiro, Kei; Reis, André; Edward, Deepak P.; Pasquale, Louis R.; Kubota, Toshiaki; Wiggs, Janey L.; Pasutto, Francesca; Topouzis, Fotis; Dubina, Michael; Craig, Jamie E.; Yoshimura, Nagahisa; Sundaresan, Periasamy; John, Simon W.M.; Ritch, Robert; Hauser, Michael A; Khor, Chiea-Chuen
2015-01-01
Exfoliation syndrome (XFS) is the commonest recognizable cause of open angle glaucoma world-wide. To better understand the etiology of XFS, we conducted a genome-wide association study (GWAS) on 1,484 patients and 1,188 controls from Japan, and followed up the most significant findings on a further 6,901 patients and 20,727 controls from 17 countries across 6 continents. We discovered a significant association between a new locus (CACNA1A rs4926244) and increased susceptibility to XFS (odds ratio [OR] = 1.16, P = 3.36 × 10^−11). Although overwhelming association at the LOXL1 locus was confirmed, the key SNP marker (LOXL1 rs4886776) demonstrated allelic reversal depending on ethnic grouping (in Japanese: OR for the A allele = 9.87, P = 2.13 × 10^−217; in non-Japanese: OR for the A allele = 0.49, P = 2.35 × 10^−31). Our findings represent the first genetic locus outside of LOXL1 that surpasses genome-wide significance for XFS, and provide insight into the biology and pathogenesis of the disease. PMID:25706626
Brewer, Megan H.; Chaudhry, Rabia; Qi, Jessica; Kidambi, Aditi; Drew, Alexander P.; Ryan, Monique M.; Subramanian, Gopinath M.; Young, Helen K.; Zuchner, Stephan; Reddel, Stephen W.; Nicholson, Garth A.; Kennerson, Marina L.
2016-01-01
With the advent of whole exome sequencing, cases where no pathogenic coding mutations can be found are increasingly being observed in many diseases. In two large, distantly-related families that mapped to the Charcot-Marie-Tooth neuropathy CMTX3 locus at chromosome Xq26.3-q27.3, all coding mutations were excluded. Using whole genome sequencing we found a large DNA interchromosomal insertion within the CMTX3 locus. The 78 kb insertion originates from chromosome 8q24.3, segregates fully with the disease in the two families, and is absent from the general population as well as 627 neurologically normal chromosomes from in-house controls. Large insertions into chromosome Xq27.1 are known to cause a range of diseases and this is the first neuropathy phenotype caused by an interchromosomal insertion at this locus. The CMTX3 insertion represents an understudied pathogenic structural variation mechanism for inherited peripheral neuropathies. Our finding highlights the importance of considering all structural variation types when studying unsolved inherited peripheral neuropathy cases with no pathogenic coding mutations. PMID:27438001
Kawabe, Yoshinori; Komatsu, Shinya; Komatsu, Shodai; Murakami, Mai; Ito, Akira; Sakuma, Tetsushi; Nakamura, Takahiro; Yamamoto, Takashi; Kamihira, Masamichi
2018-05-01
Chinese hamster ovary (CHO) cells have been used as host cells for the production of pharmaceutical proteins. For the high and stable production of target proteins, the transgene should be integrated into a suitable genomic locus of host cells. Here, we generated knock-in CHO cells, in which transgene cassettes without a vector backbone sequence were integrated into the hprt locus of the CHO genome using clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 and CRISPR-mediated precise integration into target chromosome (CRIS-PITCh) systems. We investigated the efficiency of targeted knock-in of transgenes using these systems. As a practical example, we generated knock-in CHO cells producing an scFv-Fc antibody using the CRIS-PITCh system mediated by microhomology sequences for targeting. We found that the CRIS-PITCh system can facilitate targeted knock-in for CHO cell engineering. Copyright © 2017 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Takahashi, Yuji; Shomura, Ayahiko; Sasaki, Takuji; Yano, Masahiro
2001-01-01
Hd6 is a quantitative trait locus involved in rice photoperiod sensitivity. It was detected in backcross progeny derived from a cross between the japonica variety Nipponbare and the indica variety Kasalath. To isolate a gene at Hd6, we used a large segregating population for the high-resolution and fine-scale mapping of Hd6 and constructed genomic clone contigs around the Hd6 region. Linkage analysis with P1-derived artificial chromosome clone-derived DNA markers delimited Hd6 to a 26.4-kb genomic region. We identified a gene encoding the α subunit of protein kinase CK2 (CK2α) in this region. The Nipponbare allele of CK2α contains a premature stop codon, and the resulting truncated product is undoubtedly nonfunctional. Genetic complementation analysis revealed that the Kasalath allele of CK2α increases days-to-heading. Map-based cloning with advanced backcross progeny enabled us to identify a gene underlying a quantitative trait locus even though it exhibited a relatively small effect on the phenotype. PMID:11416158
Fehringer, Gordon; Kraft, Peter; Pharoah, Paul D.; Eeles, Rosalind A.; Chatterjee, Nilanjan; Schumacher, Fred; Schildkraut, Joellen; Lindström, Sara; Brennan, Paul; Bickeböller, Heike; Houlston, Richard S.; Landi, Maria Teresa; Caporaso, Neil; Risch, Angela; Olama, Ali Amin Al; Berndt, Sonja I; Giovannucci, Edward; Grönberg, Henrik; Kote-Jarai, Zsofia; Ma, Jing; Muir, Kenneth; Stampfer, Meir; Stevens, Victoria L.; Wiklund, Fredrik; Willett, Walter; Goode, Ellen L.; Permuth, Jennifer; Risch, Harvey A.; Reid, Brett M.; Bezieau, Stephane; Brenner, Hermann; Chan, Andrew T.; Chang-Claude, Jenny; Hudson, Thomas J.; Kocarnik, Jonathan K.; Newcomb, Polly A.; Schoen, Robert E.; Slattery, Martha L.; White, Emily; Adank, Muriel A.; Ahsan, Habibul; Aittomäki, Kristiina; Baglietto, Laura; Blomquist, Carl; Canzian, Federico; Czene, Kamila; dos-Santos-Silva, Isabel; Eliassen, A. Heather; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Garcia-Closas, Montserrat; Gaudet, Mia M.; Johnson, Nichola; Hall, Per; Hazra, Aditi; Hein, Rebecca; Hofman, Albert; Hopper, John L.; Irwanto, Astrid; Johansson, Mattias; Kaaks, Rudolf; Kibriya, Muhammad G.; Lichtner, Peter; Liu, Jianjun; Lund, Eiliv; Makalic, Enes; Meindl, Alfons; Müller-Myhsok, Bertram; Muranen, Taru A.; Nevanlinna, Heli; Peeters, Petra H.; Peto, Julian; Prentice, Ross L.; Rahman, Nazneen; Sanchez, Maria Jose; Schmidt, Daniel F.; Schmutzler, Rita K.; Southey, Melissa C.; Tamimi, Rulla; Travis, Ruth C.; Turnbull, Clare; Uitterlinden, Andre G.; Wang, Zhaoming; Whittemore, Alice S.; Yang, Xiaohong R.; Zheng, Wei; Rafnar, Thorunn; Gudmundsson, Julius; Stacey, Simon N.; Stefansson, Kari; Sulem, Patrick; Chen, Y. 
Ann; Tyrer, Jonathan P.; Christiani, David C.; Wei, Yongyue; Shen, Hongbing; Hu, Zhibin; Shu, Xiao-Ou; Shiraishi, Kouya; Takahashi, Atsushi; Bossé, Yohan; Obeidat, Ma’en; Nickle, David; Timens, Wim; Freedman, Matthew L.; Li, Qiyuan; Seminara, Daniela; Chanock, Stephen J.; Gong, Jian; Peters, Ulrike; Gruber, Stephen B.; Amos, Christopher I.; Sellers, Thomas A.; Easton, Douglas F.; Hunter, David J.; Haiman, Christopher A.; Henderson, Brian E.; Hung, Rayjean J.
2016-01-01
Identifying genetic variants with pleiotropic associations can uncover common pathways influencing multiple cancers. We took a two-staged approach to conduct genome-wide association studies for lung, ovary, breast, prostate and colorectal cancer from the GAME-ON/GECCO Network (61,851 cases, 61,820 controls) to identify pleiotropic loci. Findings were replicated in independent association studies (55,789 cases, 330,490 controls). We identified a novel pleiotropic association at 1q22 involving breast and lung squamous cell carcinoma, with eQTL analysis showing an association with ADAM15/THBS3 gene expression in lung. We also identified a known breast cancer locus CASP8/ALS2CR12 associated with prostate cancer, a known cancer locus at CDKN2B-AS1 with different variants associated with lung adenocarcinoma and prostate cancer and confirmed the associations of a breast BRCA2 locus with lung and serous ovarian cancer. This is the largest study to date examining pleiotropy across multiple cancer-associated loci, identifying common mechanisms of cancer development and progression. PMID:27197191
The complete mitochondrial genome of Octopus bimaculatus Verrill, 1883 from the Gulf of California.
Domínguez-Contreras, José Francisco; Munguia-Vega, Adrian; Ceballos-Vázquez, Bertha Patricia; García-Rodriguez, Francisco Javier; Arellano-Martinez, Marcial
2016-11-01
The complete mitochondrial genome of Octopus bimaculatus is 16 085 bp in length and includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and a control region. The base composition of the genome is A (40.9%), T (34.7%), C (16.9%), and G (7.5%). The control region of O. bimaculatus contains a VNTR locus not present in the genomes of other octopus species. A phylogenetic analysis shows a closer relationship between the mitogenomes of O. bimaculatus and O. vulgaris.
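Base composition figures such as those reported above are percentages of each nucleotide over the sequence length. The following is a minimal sketch of that calculation, assuming a plain nucleotide string as input; the function name `base_composition` and the toy sequence are hypothetical and are not taken from the record.

```python
from collections import Counter


def base_composition(seq: str) -> dict[str, float]:
    """Return the percentage of A, T, C and G among unambiguous bases."""
    counts = Counter(seq.upper())
    total = sum(counts[base] for base in "ATCG")
    if total == 0:
        raise ValueError("sequence contains no A/T/C/G bases")
    return {base: 100.0 * counts[base] / total for base in "ATCG"}


# Hypothetical toy sequence, not the O. bimaculatus mitogenome.
toy = "AAAATTTCCG"
comp = base_composition(toy)
at_content = comp["A"] + comp["T"]
```

Ambiguous symbols such as N are ignored in the denominator here; that is a design choice of this sketch rather than something the abstract specifies.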
Enzymatically Generated CRISPR Libraries for Genome Labeling and Screening.
Lane, Andrew B; Strzelecka, Magdalena; Ettinger, Andreas; Grenfell, Andrew W; Wittmann, Torsten; Heald, Rebecca
2015-08-10
CRISPR-based technologies have emerged as powerful tools to alter genomes and mark chromosomal loci, but an inexpensive method for generating large numbers of RNA guides for whole genome screening and labeling is lacking. Using a method that permits library construction from any source of DNA, we generated guide libraries that label repetitive loci or a single chromosomal locus in Xenopus egg extracts and show that a complex library can target the E. coli genome at high frequency. Copyright © 2015 Elsevier Inc. All rights reserved.
Kohn, Michael H; Pelz, Hans-Joachim; Wayne, Robert K
2003-01-01
Populations may diverge at fitness-related genes as a result of adaptation to local conditions. The ability to detect this divergence by marker-based genomic scans depends on the relative magnitudes of selection, recombination, and migration. We survey rat (Rattus norvegicus) populations to assess the effect that local selection with anticoagulant rodenticides has had on microsatellite marker variation and differentiation at the warfarin resistance gene (Rw) relative to the effect on the genomic background. Initially, using a small sample of 16 rats, we demonstrate tight linkage of microsatellite D1Rat219 to Rw by association mapping of genotypes expressing an anticoagulant-rodenticide-insensitive vitamin K 2,3-epoxide reductase (VKOR). Then, using allele frequencies at D1Rat219, we show that predicted and observed resistance levels in 27 populations correspond, suggesting intense and recent selection for resistance. A contrast of F(ST) values between D1Rat219 and the genomic background revealed that rodenticide selection has overwhelmed drift-mediated population structure only at Rw. A case-controlled design distinguished these locus-specific effects of selection at Rw from background levels of differentiation more effectively than a population-controlled approach. Our results support the notion that an analysis of locus-specific population genetic structure may assist the discovery and mapping of novel candidate loci that are the object of selection or may provide supporting evidence for previously identified loci. PMID:12871915
Ren, Wen-Long; Wen, Yang-Jun; Dunwell, Jim M; Zhang, Yuan-Ming
2018-03-01
Although nonparametric methods in genome-wide association studies (GWAS) are robust in quantitative trait nucleotide (QTN) detection, the absence of polygenic background control in single-marker association in genome-wide scans results in a high false positive rate. To overcome this issue, we proposed an integrated nonparametric method for multi-locus GWAS. First, a new model transformation was used to whiten the covariance matrix of polygenic matrix K and environmental noise. Using the transformed model, the Kruskal-Wallis test along with least angle regression was then used to select all the markers that were potentially associated with the trait. Finally, all the selected markers were placed into a multi-locus model, their effects were estimated by empirical Bayes, and all the nonzero effects were further identified by a likelihood ratio test for true QTN detection. This method, named pKWmEB, was validated by a series of Monte Carlo simulation studies. As a result, pKWmEB effectively controlled the false positive rate, although a less stringent significance criterion was adopted. More importantly, pKWmEB retained the high power of the Kruskal-Wallis test and provided QTN effect estimates. To further validate pKWmEB, we re-analyzed four flowering time related traits in Arabidopsis thaliana, and detected some previously reported genes that were not identified by the other methods.
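The single-marker screening step mentioned above relies on the Kruskal-Wallis test, which compares phenotype ranks across genotype classes at one marker. The sketch below computes only the tie-corrected H statistic for one marker in pure Python. It does not implement the model whitening, least angle regression, or empirical Bayes stages of pKWmEB, and the function names and toy data are hypothetical.

```python
from collections import Counter, defaultdict


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def kruskal_wallis_h(phenotypes, genotypes):
    """Tie-corrected Kruskal-Wallis H for one marker's genotype classes."""
    n = len(phenotypes)
    if n != len(genotypes) or len(set(genotypes)) < 2:
        raise ValueError("need equal-length inputs and at least two classes")
    ranks = average_ranks(phenotypes)
    rank_sums = defaultdict(float)
    sizes = defaultdict(int)
    for rank, genotype in zip(ranks, genotypes):
        rank_sums[genotype] += rank
        sizes[genotype] += 1
    h = 12.0 / (n * (n + 1)) * sum(
        rank_sums[g] ** 2 / sizes[g] for g in sizes
    ) - 3 * (n + 1)
    ties = Counter(phenotypes).values()
    correction = 1 - sum(t ** 3 - t for t in ties) / (n ** 3 - n)
    if correction == 0:
        raise ValueError("all phenotype values are identical")
    return h / correction


# Hypothetical toy data: six individuals, two homozygous genotype classes.
toy_phenotypes = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
toy_genotypes = [0, 0, 0, 2, 2, 2]
h_stat = kruskal_wallis_h(toy_phenotypes, toy_genotypes)
```

Under the null hypothesis H is approximately chi-square distributed with (number of classes − 1) degrees of freedom, so a genome scan would convert each marker's H to a P value and rank markers accordingly.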
Breaux, Breanna; Hunter, Margaret; Cruz-Schneider, Maria Paula; Sena, Leonardo; Bonde, Robert K.; Criscitiello, Michael F.
2018-01-01
The Florida manatee (Trichechus manatus latirostris) has limited diversity in the immunoglobulin heavy chain. We therefore investigated the antigen receptor loci of the other arm of the adaptive immune system: the T cell receptor. Manatees are the first species from Afrotheria, a basal eutherian superorder, to have an in-depth characterization of all T cell receptor loci. By annotating the genome and expressed transcripts, we found that each chain has distinct features that correlate with its individual function. The genomic organization also plays a role in modulating sequence conservation between species. There were extensive V subgroup synteny blocks in the TRA and TRB loci between T. m. latirostris and human. Increased genomic locus complexity correlated with increased locus synteny. We also identified evidence for a VHD pseudogene for the first time in a eutherian mammal. These findings emphasize the value of including species within this basal eutherian radiation in comparative studies.
Genomic amplification of the caprine EDNRA locus might lead to a dose dependent loss of pigmentation
Menzi, Fiona; Keller, Irene; Reber, Irene; Beck, Julia; Brenig, Bertram; Schütz, Ekkehard; Leeb, Tosso; Drögemüller, Cord
2016-01-01
The South African Boer goat displays a characteristic white spotting phenotype, in which the pigment is limited to the head. Exploiting the existing phenotype variation within the breed, we mapped the locus causing this white spotting phenotype to chromosome 17 by genome-wide association. Subsequent whole genome sequencing identified a 1 Mb copy number variant (CNV) harboring 5 genes including EDNRA. The analysis of 358 Boer goats revealed 3 alleles with one, two, and three copies of this CNV. The copy number is correlated with the degree of white spotting in goats. We propose a hypothesis that ectopic overexpression of a mutant EDNRA scavenges EDN3 required for EDNRB signaling and normal melanocyte development and thus likely leads to an absence of melanocytes in the non-pigmented body areas of Boer goats. Our findings demonstrate the value of domestic animals as a reservoir of unique mutants and for identifying a precisely defined functional CNV. PMID:27329507
Berg, Ingrid L; Neumann, Rita; Lam, Kwan-Wood G; Sarbajna, Shriparna; Odenthal-Hesse, Linda; May, Celia A; Jeffreys, Alec J
2010-10-01
PRDM9 has recently been identified as a likely trans regulator of meiotic recombination hot spots in humans and mice. PRDM9 contains a zinc finger array that, in humans, can recognize a short sequence motif associated with hot spots, with binding to this motif possibly triggering hot-spot activity via chromatin remodeling. We now report that human genetic variation at the PRDM9 locus has a strong effect on sperm hot-spot activity, even at hot spots lacking the sequence motif. Subtle changes within the zinc finger array can create hot-spot nonactivating or enhancing variants and can even trigger the appearance of a new hot spot, suggesting that PRDM9 is a major global regulator of hot spots in humans. Variation at the PRDM9 locus also influences aspects of genome instability (specifically, a megabase-scale rearrangement underlying two genomic disorders as well as minisatellite instability), implicating PRDM9 as a risk factor for some pathological genome rearrangements.
Breaux, Breanna; Hunter, Margaret E; Cruz-Schneider, Maria Paula; Sena, Leonardo; Bonde, Robert K; Criscitiello, Michael F
2018-08-01
The Florida manatee (Trichechus manatus latirostris) has limited diversity in the immunoglobulin heavy chain. We therefore investigated the antigen receptor loci of the other arm of the adaptive immune system: the T cell receptor. Manatees are the first species from Afrotheria, a basal eutherian superorder, to have an in-depth characterization of all T cell receptor loci. By annotating the genome and expressed transcripts, we found that each chain has distinct features that correlate with its individual function. The genomic organization also plays a role in modulating sequence conservation between species. There were extensive V subgroup synteny blocks in the TRA and TRB loci between T. m. latirostris and human. Increased genomic locus complexity correlated with increased locus synteny. We also identified evidence for a VHD pseudogene for the first time in a eutherian mammal. These findings emphasize the value of including species within this basal eutherian radiation in comparative studies. Copyright © 2018. Published by Elsevier Ltd.
de Groot, G. Arjen; During, Heinjo J.; Maas, Jan W.; Schneider, Harald; Vogel, Johannes C.; Erkens, Roy H. J.
2011-01-01
Although consensus has now been reached on a general two-locus DNA barcode for land plants, the selected combination of markers (rbcL + matK) is not applicable for ferns at the moment. Yet especially for ferns, DNA barcoding is potentially of great value, since fern gametophytes, while playing an essential role in fern colonization and reproduction, generally lack the morphological complexity for morphology-based identification and have therefore been underappreciated in ecological studies. We evaluated the potential of a combination of rbcL with a noncoding plastid marker, trnL-F, to obtain DNA-identifications for fern species. A regional approach was adopted, by creating a reference database of trusted rbcL and trnL-F sequences for the wild-occurring homosporous ferns of NW-Europe. A combination of parsimony analyses and distance-based analyses was performed to evaluate the discriminatory power of the two-region barcode. DNA was successfully extracted from 86 tiny fern gametophytes and was used as a test case for the performance of DNA-based identification. Primer universality proved high for both markers. Based on the combined rbcL + trnL-F dataset, all genera as well as all species with non-equal chloroplast genomes formed their own well supported monophyletic clade, indicating a high discriminatory power. Interspecific distances were larger than intraspecific distances for all tested taxa. Identification tests on gametophytes showed a comparable result. All test samples could be identified to genus level, and species identification was possible unless the samples belonged to a pair of Dryopteris species with completely identical chloroplast genomes. Our results suggest a high potential of the combined use of rbcL and trnL-F as a two-locus cpDNA barcode for identification of fern species. A regional approach may be preferred for ecological tests. We here offer such a ready-to-use barcoding approach for ferns, which opens the way for answering a whole range of questions previously unaddressed in fern gametophyte ecology. PMID:21298108
U'Ren, Jana M; Schupp, James M; Pearson, Talima; Hornstra, Heidie; Friedman, Christine L Clark; Smith, Kimothy L; Daugherty, Rebecca R Leadem; Rhoton, Shane D; Leadem, Ben; Georgia, Shalamar; Cardon, Michelle; Huynh, Lynn Y; DeShazer, David; Harvey, Steven P; Robison, Richard; Gal, Daniel; Mayo, Mark J; Wagner, David; Currie, Bart J; Keim, Paul
2007-03-30
The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays was subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominantly located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B. mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 to 0.94. Mutation rates for these loci are comparable (>10^-5 per locus per generation) to those of the most diverse tandemly repeated regions found in other less diverse bacteria. The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were identical using previous typing methods. Given the health threat to humans and livestock and the potential for B. pseudomallei to be released intentionally, MLVA could prove to be an important tool for fine-scale epidemiological or forensic tracking of this increasingly important environmental pathogen.
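Nei's diversity for a single VNTR locus is one minus the sum of squared allele frequencies, so values near 1 indicate many alleles at similar frequencies. The sketch below shows that calculation, assuming allele calls are given as one repeat count per isolate. The abstract does not say whether the small-sample correction was applied, so it is offered here as an optional flag; the function name and toy data are hypothetical.

```python
from collections import Counter


def nei_diversity(alleles, unbiased=False):
    """Nei's gene diversity, 1 - sum(p_i^2), for one locus.

    With unbiased=True the estimate is scaled by n / (n - 1).
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("need at least two allele calls")
    freqs = [count / n for count in Counter(alleles).values()]
    h = 1.0 - sum(p * p for p in freqs)
    return h * n / (n - 1) if unbiased else h


# Hypothetical repeat counts at one locus for four isolates.
toy_alleles = [7, 7, 8, 9]
h_plain = nei_diversity(toy_alleles)
h_unbiased = nei_diversity(toy_alleles, unbiased=True)
```

A locus fixed for a single allele gives a diversity of 0, which is why only polymorphic loci are useful for typing.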
2013-01-01
Background The advent of next generation sequencing technology has accelerated efforts to map and catalogue copy number variation (CNV) in genomes of important micro-organisms for public health. A typical analysis of the sequence data involves mapping reads onto a reference genome, calculating the respective coverage, and detecting regions with too-low or too-high coverage (deletions and amplifications, respectively). Current CNV detection methods rely on statistical assumptions (e.g., a Poisson model) that may not hold in general, or require fine-tuning the underlying algorithms to detect known hits. We propose a new CNV detection methodology based on two Poisson hierarchical models, the Poisson-Gamma and Poisson-Lognormal, with the advantage of being sufficiently flexible to describe different data patterns, whilst robust against deviations from the often assumed Poisson model. Results Using sequence coverage data of 7 Plasmodium falciparum malaria genomes (3D7 reference strain, HB3, DD2, 7G8, GB4, OX005, and OX006), we showed that empirical coverage distributions are intrinsically asymmetric and overdispersed in relation to the Poisson model. We also demonstrated a low baseline false positive rate for the proposed methodology using 3D7 resequencing data and simulation. When applied to the non-reference isolate data, our approach detected known CNV hits, including an amplification of the PfMDR1 locus in DD2 and a large deletion in the CLAG3.2 gene in GB4, and putative novel CNV regions. When compared to the recently available FREEC and cn.MOPS approaches, our findings were more concordant with putative hits from the highest quality array data for the 7G8 and GB4 isolates. Conclusions In summary, the proposed methodology brings an increase in flexibility, robustness, accuracy and statistical rigour to CNV detection using sequence coverage data. PMID:23442253
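The claim that coverage is overdispersed relative to the Poisson model can be checked with a simple diagnostic: under a Poisson distribution the variance equals the mean, so a variance-to-mean ratio well above 1 signals overdispersion. The sketch below computes that ratio for a list of per-window read counts. It is only a diagnostic under that assumption, not the Poisson-Gamma or Poisson-Lognormal fitting described in the abstract, and the function name and toy counts are hypothetical.

```python
from statistics import mean, variance


def dispersion_index(counts):
    """Sample variance divided by mean; about 1 for Poisson-like counts."""
    if len(counts) < 2:
        raise ValueError("need at least two counts")
    m = mean(counts)
    if m == 0:
        raise ValueError("mean coverage is zero")
    return variance(counts) / m


# Hypothetical per-window read counts, not data from the study.
even_coverage = [2, 4, 4, 6, 4]
bursty_coverage = [0, 0, 20, 0, 20]
even_index = dispersion_index(even_coverage)
bursty_index = dispersion_index(bursty_coverage)
```

A ratio far above 1, as in the second toy example, is the situation in which a hierarchical count model with an extra variance parameter is typically preferred over a plain Poisson.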
Generation of Knock-in Mouse by Genome Editing.
Fujii, Wataru
2017-01-01
Knock-in mice are useful for evaluating endogenous gene expressions and functions in vivo. Instead of the conventional gene-targeting method using embryonic stem cells, an exogenous DNA sequence can be inserted into the target locus in the zygote using genome editing technology. In this chapter, I describe the generation of epitope-tagged mice using engineered endonuclease and single-stranded oligodeoxynucleotide through the mouse zygote as an example of how to generate a knock-in mouse by genome editing.
Li, Wanlong; Huang, Li; Gill, Bikram S.
2008-01-01
Polyploidy is known to induce numerous genetic and epigenetic changes but little is known about their physiological bases. In wheat, grain texture is mainly determined by the Hardness (Ha) locus consisting of genes Puroindoline a (Pina) and b (Pinb). These genes are conserved in diploid progenitors but were deleted from the A and B genomes of tetraploid Triticum turgidum (AB). We now report the recurrent deletions of Pina-Pinb in other lineages of polyploid wheat. We analyzed the Ha haplotype structure in 90 diploid and 300 polyploid accessions of Triticum and Aegilops spp. Pin genes were conserved in all diploid species and deletion haplotypes were detected in all polyploid Triticum and most of the polyploid Aegilops spp. Two Pina-Pinb deletion haplotypes were found in hexaploid wheat (Triticum aestivum; ABD). Pina and Pinb were eliminated from the G genome, but maintained in the A genome of tetraploid Triticum timopheevii (AG). Subsequently, Pina and Pinb were deleted from the A genome but retained in the Am genome of hexaploid Triticum zhukovskyi (AmAG). Comparison of deletion breakpoints demonstrated that the Pina-Pinb deletion occurred independently and recurrently in the four polyploid wheat species. The implications of Pina-Pinb deletions for polyploid-driven evolution of gene and genome and its possible physiological significance are discussed. PMID:18024553
Multiple invasions of an infectious retrovirus in cat genomes
Shimode, Sayumi; Nakagawa, So; Miyazawa, Takayuki
2015-01-01
Endogenous retroviruses (ERVs) are remnants of ancient retroviral infections of host germ-line cells. While most ERVs are defective, some are active and express viral proteins. The RD-114 virus is a replication-competent feline ERV, and several feline cell lines produce infectious RD-114 viral particles. All domestic cats are considered to have an ERV locus encoding a replication-competent RD-114 virus in their genomes; however, the locus has not been identified. In this study, we investigated RD-114 virus-related proviral loci in genomes of domestic cats, and found that none were capable of producing infectious viruses. We also found that all domestic cats have an RD-114 virus-related sequence on chromosome C2, termed RDRS C2a, but populations of the other RDRSs are different depending on the regions where cats live or breed. Our results indicate that RDRS C2a, the oldest RD-114-related provirus, entered the host genome before an ancestor of domestic cats started diverging and the other new RDRSs might have integrated into migrating cats in Europe. We also show that infectious RD-114 virus can be resurrected by the recombination between two non-infectious RDRSs. From these data, we conclude that cats do not harbor infectious RD-114 viral loci in their genomes and RD-114-related viruses invaded cat genomes multiple times. PMID:25641657
Blumer-Schuette, Sara E.; Giannone, Richard J.; Zurawski, Jeffrey V.; Ozdemir, Inci; Ma, Qin; Yin, Yanbin; Xu, Ying; Kataeva, Irina; Poole, Farris L.; Adams, Michael W. W.; Hamilton-Brehm, Scott D.; Elkins, James G.; Larimer, Frank W.; Land, Miriam L.; Hauser, Loren J.; Cottingham, Robert W.; Hettich, Robert L.
2012-01-01
Extremely thermophilic bacteria of the genus Caldicellulosiruptor utilize carbohydrate components of plant cell walls, including cellulose and hemicellulose, facilitated by a diverse set of glycoside hydrolases (GHs). From a biofuel perspective, this capability is crucial for deconstruction of plant biomass into fermentable sugars. While all species from the genus grow on xylan and acid-pretreated switchgrass, growth on crystalline cellulose is variable. The basis for this variability was examined using microbiological, genomic, and proteomic analyses of eight globally diverse Caldicellulosiruptor species. The open Caldicellulosiruptor pangenome (4,009 open reading frames [ORFs]) encodes 106 GHs, representing 43 GH families, but only 26 GHs from 17 families are included in the core (noncellulosic) genome (1,543 ORFs). Differentiating the strongly cellulolytic Caldicellulosiruptor species from the others is a specific genomic locus that encodes multidomain cellulases from GH families 9 and 48, which are associated with cellulose-binding modules. This locus also encodes a novel adhesin associated with type IV pili, which was identified in the exoproteome bound to crystalline cellulose. Taking into account the core genomes, pangenomes, and individual genomes, the ancestral Caldicellulosiruptor was likely cellulolytic and evolved, in some cases, into species that lost the ability to degrade crystalline cellulose while maintaining the capacity to hydrolyze amorphous cellulose and hemicellulose. PMID:22636774
Genome-wide association study on serum alkaline phosphatase levels in a Chinese population
2013-01-01
Background Serum alkaline phosphatase (ALP) is a complex phenotype influenced by both genetic and environmental factors. Recent Genome-Wide Association Studies (GWAS) have identified several loci affecting ALP levels; however, such studies in Chinese populations are limited. We performed a GWAS analyzing the association between 658,288 autosomal SNPs and serum ALP in 1,461 subjects, and replicated the top SNPs in an additional 8,830 healthy Chinese Han individuals. The interactions between the significant locus and environmental factors on serum ALP levels were further investigated. Results The association between the ABO locus and serum ALP levels was replicated (P = 2.50 × 10^-21, 1.12 × 10^-56 and 2.82 × 10^-27 for SNPs rs8176720, rs651007 and rs7025162 at the ABO locus, respectively). SNP rs651007 accounted for 2.15% of the total variance of serum ALP levels independently of the other 2 SNPs. When comparing our findings with previously published studies, ethnic differences were observed across populations. A significant interaction between ABO rs651007 and overweight and obesity was observed (FDR for interaction was 0.036): among individuals with the GG genotype, those with normal weight and those who were overweight or obese had similar serum ALP concentrations; the minor allele A of rs651007 remarkably reduced serum ALP levels, but this effect was attenuated in overweight and obese individuals. Conclusions Our findings indicate that the ABO locus is a major determinant of serum ALP levels in the Chinese Han population. Overweight and obesity modify the effect of the ABO locus on serum ALP concentrations. PMID:24094242
Natural history of the ERVWE1 endogenous retroviral locus
Bonnaud, Bertrand; Beliaeff, Jean; Bouton, Olivier; Oriol, Guy; Duret, Laurent; Mallet, François
2005-01-01
Background The human HERV-W multicopy family includes a unique proviral locus, termed ERVWE1, whose full-length envelope ORF was preserved through evolution by the action of a selective pressure. The encoded Env protein (Syncytin) is involved in hominoid placental physiology. Results In order to infer the natural history of this domestication process, a comparative genomic analysis of the human 7q21.2 syntenic regions in eutherians was performed. In primates, this region was progressively colonized by LTR-elements, leading to two different evolutionary pathways in Cercopithecidae and Hominidae, a genetic drift versus a domestication, respectively. Conclusion The preservation in hominoids of a genomic structure consisting of the juxtaposition of a retrotransposon-derived MaLR LTR and the ERVWE1 provirus suggests a functional link between both elements. PMID:16176588
Lepp, Dion; Roxas, Bryan; Parreira, Valeria R.; Marri, Pradeep R.; Rosey, Everett L.; Gong, Joshua; Songer, J. Glenn; Vedantam, Gayatri; Prescott, John F.
2010-05-24
Type A Clostridium perfringens causes poultry necrotic enteritis (NE), an enteric disease of considerable economic importance, yet can also exist as a member of the normal intestinal microbiota. A recently discovered pore-forming toxin, NetB, is associated with pathogenesis in most, but not all, NE isolates. This finding suggested that NE-causing strains may possess other virulence gene(s) not present in commensal type A isolates. We used high-throughput sequencing (HTS) technologies to generate draft genome sequences of seven unrelated C. perfringens poultry NE isolates and one isolate from a healthy bird, and identified additional novel NE-associated genes by comparison with nine publicly available reference genomes. Thirty-one open reading frames (ORFs) were unique to all NE strains and formed the basis for three highly conserved NE-associated loci that we designated NELoc-1 (42 kb), NELoc-2 (11.2 kb) and NELoc-3 (5.6 kb). The largest locus, NELoc-1, consisted of netB and 36 additional genes, including those predicted to encode two leukocidins, an internalin-like protein and a ricin-domain protein. Pulsed-field gel electrophoresis (PFGE) and Southern blotting revealed that the NE strains each carried 2 to 5 large plasmids, and that NELoc-1 and -3 were localized on distinct plasmids of sizes ∼85 and ∼70 kb, respectively. Sequencing of the regions flanking these loci revealed similarity to previously characterized conjugative plasmids of C. perfringens. These results provide significant insight into the pathogenetic basis of poultry NE and are the first to demonstrate that netB resides in a large, plasmid-encoded locus. Our findings strongly suggest that poultry NE is caused by several novel virulence factors, whose genes are clustered on discrete pathogenicity loci, some of which are plasmid-borne. PMID:20532244
Bourgis, F.; Guyot, R.; Gherbi, H.; Tailliez, E.; Amabile, I.; Salse, J.; Lorieux, M.; Delseny, M.
2008-01-01
In Asian cultivated rice (Oryza sativa L.), aroma is one of the most valuable traits in grain quality and 2-ACP is the main volatile compound contributing to the characteristic popcorn-like odour of aromatic rices. Although the major locus for grain fragrance (frg gene) has been described recently in Basmati rice, this gene has not been characterised in true japonica varieties and molecular information available on the genetic diversity and evolutionary origin of this gene among the different varieties is still limited. Here we report on characterisation of the frg gene in the Azucena variety, one of the few aromatic japonica cultivars. We used a RIL population from a cross between Azucena and IR64, a non-aromatic indica, the reference genomic sequence of Nipponbare (japonica) and 93–11 (indica) as well as an Azucena BAC library, to identify the major fragrance gene in Azucena. We thus identified a betaine aldehyde dehydrogenase gene, badh2, as the candidate locus responsible for aroma, which presented exactly the same mutation as that identified in Basmati and Jasmine-like rices. Comparative genomic analyses showed very high sequence conservation between Azucena and Nipponbare BADH2, and a MITE was identified in the promoter region of the BADH2 allele in 93–11. The badh2 mutation and MITE were surveyed in a representative rice collection, including traditional aromatic and non-aromatic rice varieties, and strongly suggested a monophylogenetic origin of this badh2 mutation in Asian cultivated rices. Altogether these new data are discussed here in the light of current hypotheses on the origin of rice genetic diversity. PMID:18491070
Durel, C-E; Denancé, C; Brisset, M-N
2009-02-01
Fire blight, caused by the bacterium Erwinia amylovora, is one of the most destructive diseases of apple (Malus × domestica) worldwide. No major, qualitative gene for resistance to this disease has been identified so far in apple. A quantitative trait locus (QTL) analysis was performed in two F1 progenies derived from two controlled crosses: one between the susceptible rootstock cultivar 'MM106' and the resistant ornamental cultivar 'Evereste' and the other between the moderately susceptible cultivar 'Golden Delicious' and the wild apple Malus floribunda clone 821, with unknown level of fire blight resistance. Both progenies were inoculated in the greenhouse with the same reference strain of E. amylovora. The length of stem necrosis was scored 7 and 14 days after inoculation. A strong QTL effect was identified in both 'Evereste' and M. floribunda 821 at a similar position on the distal region of linkage group 12 of the apple genome. From 50% to 70% of the phenotypic variation was explained by the QTL in the 'Evereste' progeny, depending on the scored trait. More than 40% of the phenotypic variation was explained by the M. floribunda QTL in the second progeny. It was shown that 'Evereste' and M. floribunda 821 carried distinct QTL alleles at that genomic position. A small additional QTL was identified in 'Evereste' on linkage group 15, which explained about 6% of the phenotypic variation. Although it was not possible to confirm whether the 'Evereste' and M. floribunda QTL belonged to the same locus or to two distinct, closely linked loci, these QTL can be valuable targets in marker-assisted selection to obtain fire blight resistant apple cultivars and form a starting point for discovering the function of the genes controlling apple fire blight resistance.
Kulski, Jerzy K; Shigenari, Atsuko; Inoko, Hidetoshi
2010-04-01
Polymorphic insertion frequencies of the retrotransposons known as the "SVA" elements were investigated at four loci in the MHC class I genomic region to determine their allele and haplotype frequencies and associations with the HLA-A, -B or -C genes for 100 Japanese, 100 African Americans, 174 Australian Caucasians and 66 reference cell lines obtained from different ethnic groups. The SVA insertions representing different subfamily members varied in frequency between none for SVA-HF in Japanese and 65% for SVA-HB in Caucasians or African Americans with significant differences in frequencies between the three populations at least at three loci. The SVA loci were in Hardy-Weinberg equilibrium except for the SVA-HA locus which deviated significantly in African Americans and Caucasians possibly because of a genomic deletion of this locus in individuals with the HLA-A*24 allele. Strong linkage disequilibria and high percentage associations between the human leucocyte antigen (HLA) class I gene alleles and some of the SVA insertions were detected in all three populations in spite of significant frequency differences for the SVA and HLA class I alleles between the three populations. The highest percentage associations (>86%) were between SVA-HB and HLA-B*08, -B*27, -B*37 to -B*41, -B*52 and -B*53; SVA-HC and HLA-B*07; SVA-HA and HLA-A*03, -A*11 and -A*30; and SVA-HF and HLA-A*03 and HLA-B*47. From pairwise associations in the three populations and the homozygous cell line results, it was possible to deduce the SVA and HLA class I allelic combinations (haplotypes), population differences and the identity by descent of several common HLA-A allelic lineages.
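The Hardy-Weinberg check mentioned in the abstract above can be illustrated with a minimal sketch. It assumes a biallelic locus (insertion present or absent) and uses a chi-square goodness-of-fit statistic; the function name `hwe_chi_square` and the genotype counts in the usage lines are hypothetical and are not taken from the study, whose exact test is not stated here.

```python
def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square statistic (1 degree of freedom) for deviation from
    Hardy-Weinberg proportions at a biallelic locus.

    n_aa, n_ab, n_bb are observed genotype counts (hypothetical inputs).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotype is required")
    # Allele frequency of A estimated from the genotype counts.
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1.0 - p
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expectation (monomorphic locus).
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


# Hypothetical counts: perfect agreement, then a complete lack of heterozygotes.
in_equilibrium = hwe_chi_square(25, 50, 25)
no_heterozygotes = hwe_chi_square(50, 0, 50)
```

A statistic above 3.84 corresponds to P < 0.05 at one degree of freedom; for small samples an exact test is usually preferred over this approximation.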
Lu, Yingchang; Justice, Anne E.; Mudgal, Poorva; Liu, Ching-Ti; Young, Kristin; Feitosa, Mary F.; Rand, Kristin; Dimitrov, Latchezar; Duan, Qing; Guo, Xiuqing; Lange, Leslie A.; Nalls, Michael A.; Okut, Hayrettin; Tayo, Bamidele O.; Vedantam, Sailaja; Bradfield, Jonathan P.; Chen, Guanjie; Chesi, Alessandra; Irvin, Marguerite R.; Padhukasahasram, Badri; Zheng, Wei; Allison, Matthew A.; Ambrosone, Christine B.; Bandera, Elisa V.; Berndt, Sonja I.; Blot, William J.; Bottinger, Erwin P.; Carpten, John; Chanock, Stephen J.; Chen, Yii-Der Ida; Conti, David V.; Cooper, Richard S.; Fornage, Myriam; Freedman, Barry I.; Garcia, Melissa; Goodman, Phyllis J.; Hsu, Yu-Han H.; Hu, Jennifer; Huff, Chad D.; Ingles, Sue A.; John, Esther M.; Kittles, Rick; Klein, Eric; Li, Jin; McKnight, Barbara; Nayak, Uma; Nemesure, Barbara; Olshan, Andrew; Salako, Babatunde; Sanderson, Maureen; Shao, Yaming; Siscovick, David S.; Stanford, Janet L.; Strom, Sara S.; Witte, John S.; Yao, Jie; Zhu, Xiaofeng; Ziegler, Regina G.; Zonderman, Alan B.; Ambs, Stefan; Cushman, Mary; Faul, Jessica D.; Hakonarson, Hakon; Levin, Albert M.; Nathanson, Katherine L.; Weir, David R.; Zhi, Degui; Arnett, Donna K.; Kardia, Sharon L. R.; Olopade, Olufunmilayo I.; Rao, D. C.; Williams, L. Keoki; Becker, Diane M.; Borecki, Ingrid B.; Evans, Michele K.; Harris, Tamara B.; Hirschhorn, Joel N.; Psaty, Bruce M.; Wilson, James G.; Bowden, Donald W.; Cupples, L. Adrienne; Haiman, Christopher A.; Loos, Ruth J. F.; North, Kari E.
2017-01-01
Genome-wide association studies (GWAS) have identified >300 loci associated with measures of adiposity including body mass index (BMI) and waist-to-hip ratio (adjusted for BMI, WHRadjBMI), but few have been identified through screening of the African ancestry genomes. We performed large scale meta-analyses and replications in up to 52,895 individuals for BMI and up to 23,095 individuals for WHRadjBMI from the African Ancestry Anthropometry Genetics Consortium (AAAGC) using 1000 Genomes phase 1 imputed GWAS to improve coverage of both common and low frequency variants in the low linkage disequilibrium African ancestry genomes. In the sex-combined analyses, we identified one novel locus (TCF7L2/HABP2) for WHRadjBMI and eight previously established loci at P < 5×10−8: seven for BMI, and one for WHRadjBMI in African ancestry individuals. An additional novel locus (SPRYD7/DLEU2) was identified for WHRadjBMI when combined with European GWAS. In the sex-stratified analyses, we identified three novel loci for BMI (INTS10/LPL and MLC1 in men, IRX4/IRX2 in women) and four for WHRadjBMI (SSX2IP, CASC8, PDE3B and ZDHHC1/HSD11B2 in women) in individuals of African ancestry or both African and European ancestry. For four of the novel variants, the minor allele frequency was low (<5%). In the trans-ethnic fine mapping of 47 BMI loci and 27 WHRadjBMI loci that were locus-wide significant (P < 0.05 adjusted for effective number of variants per locus) from the African ancestry sex-combined and sex-stratified analyses, 26 BMI loci and 17 WHRadjBMI loci contained ≤ 20 variants in the credible sets that jointly account for 99% posterior probability of driving the associations. The lead variants in 13 of these loci had a high probability of being causal. 
As compared to our previous HapMap imputed GWAS for BMI and WHRadjBMI including up to 71,412 and 27,350 African ancestry individuals, respectively, our results suggest that 1000 Genomes imputation showed modest improvement in identifying GWAS loci including low frequency variants. Trans-ethnic meta-analyses further improved fine mapping of putative causal variants in loci shared between the African and European ancestry populations. PMID:28430825
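The fine-mapping result above is stated in terms of credible sets that jointly account for 99% of the posterior probability. As a rough sketch of that idea only, assuming per-variant posterior probabilities are already available for a locus, the smallest such set can be built by ranking variants and accumulating probability. The function name `credible_set`, the variant labels and the probabilities are all hypothetical; the consortium's actual fine-mapping method is not described in this abstract.

```python
def credible_set(posteriors, coverage=0.99):
    """Return the smallest list of variants whose normalized posterior
    probabilities sum to at least `coverage`.

    posteriors: dict mapping variant id -> posterior probability (hypothetical).
    """
    total = sum(posteriors.values())
    if total <= 0:
        raise ValueError("posterior probabilities must sum to a positive value")
    ranked = sorted(posteriors.items(), key=lambda item: item[1], reverse=True)
    chosen, cumulative = [], 0.0
    for variant, prob in ranked:
        chosen.append(variant)
        cumulative += prob / total  # normalize in case inputs do not sum to 1
        if cumulative >= coverage:
            break
    return chosen


# Hypothetical locus with four variants.
example_locus = {"v1": 0.90, "v2": 0.095, "v3": 0.004, "v4": 0.001}
set_99 = credible_set(example_locus)
```

A locus whose credible set is small, or whose lead variant carries most of the posterior mass, is the kind of locus the abstract describes as well resolved.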
Chromosome 9p21 in Amyotrophic Lateral Sclerosis in Finland: A Genome-Wide Association Study
Laaksovirta, Hannu; Peuralinna, Terhi; Schymick, Jennifer C.; Scholz, Sonja W.; Lai, Shaoi-Lin; Myllykangas, Liisa; Sulkava, Raimo; Jansson, Lilja; Hernandez, Dena G.; Gibbs, J. Raphael; Nalls, Michael A.; Heckerman, David; Tienari, Pentti J.; Traynor, Bryan J.
2010-01-01
Introduction The genetic etiology of amyotrophic lateral sclerosis (ALS) is not well understood. Finland is a well-suited location for a genome-wide association study of ALS, as the incidence of the disease is one of the highest in the world, and because the genetic homogeneity of the Finnish population enhances the ability to detect risk loci. Methods We performed a genome-wide association study of 442 Finnish patients diagnosed with ALS, and 521 Finnish control subjects using Illumina genome-wide genotyping arrays. DNA was collected from patients attending an ALS specialty clinic that receives referrals from neurologists throughout Finland, whereas the control samples were obtained from a population-based study of elderly Finnish individuals. Individuals known to carry D90A alleles of the SOD1 gene (n = 40) were included in the final analysis as positive controls to determine if our GWAS was able to detect an association signal at this locus. Findings We identified two association peaks that exceeded genome-wide significance. One of these was located on chromosome 21q22 (rs13048019, p = 2·58×10−8) that corresponded to the known autosomal recessive D90A allele of the SOD1 gene. The other was detected in a 232kb block of linkage disequilibrium (rs3849942, p = 9·11×10−11) in a region of chromosome 9p that has been previously identified by linkage studies of ALS families. Within this region, we defined a 42-SNP haplotype that significantly increased risk of developing ALS (p = 4·2×10−33 among familial cases, odds ratio = 21·0, 95% CI = 11·2–39·1), and which overlapped with an association locus recently reported for fronto-temporal dementia (FTD). Based on the 93 familial ALS cases included in the analysis, population attributable risk percent for the chromosome 9p21 locus was 37.9% (95% CI, 27·7 – 48·1%), and for D90A homozygosity was 25·5% (95% CI, 16·9 – 34·1%). 
Interpretation In summary, we present evidence that the chromosome 9p21 ALS-FTD locus is a major cause of familial ALS in the Finnish population. PMID:20801718
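The haplotype result above is reported as an odds ratio with a 95% confidence interval. A minimal sketch of how such an interval can be obtained from a 2×2 table is given below, using the standard log-odds (Woolf) approximation. The function name `odds_ratio_ci` and the counts are hypothetical; they are not the study's data, and the study may have used a different estimator.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Woolf confidence interval from a 2x2 table.

    a: carriers among cases      b: non-carriers among cases
    c: carriers among controls   d: non-carriers among controls
    All four counts must be positive (hypothetical inputs).
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical table: 20 of 30 cases and 10 of 30 controls carry the haplotype.
or_est, ci_low, ci_high = odds_ratio_ci(20, 10, 10, 20)
```

Because the interval is symmetric on the log scale, the product of its bounds equals the squared point estimate, which is a quick sanity check on reported values.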
Behnke, Michael S; Khan, Asis; Sibley, L David
2015-02-01
Quantitative trait locus (QTL) mapping studies have been integral in identifying and understanding virulence mechanisms in the parasite Toxoplasma gondii. In this study, we interrogated a different phenotype by mapping sinefungin (SNF) resistance in the genetic cross between type 2 ME49-FUDR(r) and type 10 VAND-SNF(r). The genetic map of this cross was generated by whole-genome sequencing of the progeny and subsequent identification of single nucleotide polymorphisms (SNPs) inherited from the parents. Based on this high-density genetic map, we were able to pinpoint the sinefungin resistance phenotype to one significant locus on chromosome IX. Within this locus, a single nonsynonymous SNP (nsSNP) resulting in an early stop codon in the TGVAND_290860 gene was identified, occurring only in the sinefungin-resistant progeny. Using CRISPR/CAS9, we were able to confirm that targeted disruption of TGVAND_290860 renders parasites sinefungin resistant. Because disruption of the SNR1 gene confers resistance, we also show that it can be used as a negative selectable marker to insert either a positive drug selection cassette or a heterologous reporter. These data demonstrate the power of combining classical genetic mapping, whole-genome sequencing, and CRISPR-mediated gene disruption for combined forward and reverse genetic strategies in T. gondii. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
A Novel Locus For Dilated Cardiomyopathy Maps to Canine Chromosome 8
Werner, Petra; Raducha, Michael G.; Prociuk, Ulana; Sleeper, Meg M.; Henthorn, Paula S.
2008-01-01
Dilated cardiomyopathy (DCM), the most common form of cardiomyopathy, often leads to heart failure and sudden death. While a substantial proportion of DCMs are inherited, mutations responsible for the majority of DCMs remain unidentified. A genome-wide linkage study was performed to identify the locus responsible for an autosomal recessive inherited form of juvenile DCM (JDCM) in Portuguese water dogs using 16 families segregating the disease. Results link the JDCM locus to canine chromosome 8 with two-point and multipoint LOD scores of 10.8 and 14, respectively. The locus maps to a 3.9 Mb region, with complete syntenic homology to human chromosome 14, that contains no genes or loci known to be involved in the development of any type of cardiomyopathy. This discovery of a DCM locus with a previously unknown etiology will provide a new gene to examine in human DCM patients and a model for testing therapeutic approaches for heart failure. PMID:18442891
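The linkage evidence above is summarized as LOD scores. For readers unfamiliar with the statistic, the sketch below computes a two-point LOD score in the simplest textbook setting, assuming phase-known, fully informative meioses. The function name `lod_score` and the counts are hypothetical, and the pedigree-based likelihoods used in real linkage software are considerably more involved.

```python
import math


def lod_score(recombinants, meioses, theta):
    """Two-point LOD score: log10 of the likelihood at recombination
    fraction `theta` relative to free recombination (theta = 0.5).

    Assumes phase-known, fully informative meioses (a simplification).
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must be between 0 and meioses")
    if theta == 0.0 and recombinants > 0:
        return float("-inf")  # any recombinant excludes complete linkage
    non_recombinants = meioses - recombinants
    log_likelihood = non_recombinants * math.log10(1.0 - theta)
    if recombinants > 0:
        log_likelihood += recombinants * math.log10(theta)
    # Subtracting log10(0.5 ** meioses) is the same as adding meioses * log10(2).
    return log_likelihood + meioses * math.log10(2.0)


# Hypothetical example: 10 informative meioses with no recombinants.
max_lod = lod_score(0, 10, 0.0)
```

A LOD score of 3 or more is the conventional threshold for declaring linkage, so ten recombination-free informative meioses are just enough in this idealized setting.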
Towards Breaking the Histone Code – Bayesian Graphical Models for Histone Modifications
Mitra, Riten; Müller, Peter; Liang, Shoudan; Xu, Yanxun; Ji, Yuan
2013-01-01
Background Histones are proteins around which DNA wraps to form small spherical structures called nucleosomes. Histone modifications (HMs) refer to the post-translational modifications to the histone tails. At a particular genomic locus, each of these HMs can either be present or absent, and the combinatory patterns of the presence or absence of multiple HMs, or the ‘histone codes,’ are believed to co-regulate important biological processes. We aim to use raw data on HM markers at different genomic loci to (1) decode the complex biological network of HMs in a single region and (2) demonstrate how the HM networks differ in different regulatory regions. We suggest that these differences in network attributes form a significant link between histones and genomic functions. Methods and Results We develop a powerful graphical model under a Bayesian paradigm. Posterior inference is fully probabilistic, allowing us to compute the probabilities of distinct dependence patterns of the HMs using graphs. Furthermore, our model-based framework allows for easy but important extensions for inference on differential networks under various conditions, such as the different annotations of the genomic locations (e.g., promoters versus insulators). We applied these models to ChIP-Seq data based on CD4+ T lymphocytes. The results confirmed many existing findings and provided a unified tool to generate various promising hypotheses. Differential network analyses revealed new insights on co-regulation of HMs of transcriptional activities in different genomic regions. Conclusions The use of Bayesian graphical models and borrowing strength across different conditions provide high power to infer histone networks and their differences. PMID:23748248
Tools for Genetic Studies in Experimental Populations of Polyploids
Bourke, Peter M.; Voorrips, Roeland E.; Visser, Richard G. F.; Maliepaard, Chris
2018-01-01
Polyploid organisms carry more than two copies of each chromosome, a condition rarely tolerated in animals but which occurs relatively frequently in the plant kingdom. One of the principal challenges faced by polyploid organisms is to evolve stable meiotic mechanisms to faithfully transmit genetic information to the next generation upon which the study of inheritance is based. In this review we look at the tools available to the research community to better understand polyploid inheritance, many of which have only recently been developed. Most of these tools are intended for experimental populations (rather than natural populations), facilitating genomics-assisted crop improvement and plant breeding. This is hardly surprising given that a large proportion of domesticated plant species are polyploid. We focus on three main areas: (1) polyploid genotyping; (2) genetic and physical mapping; and (3) quantitative trait analysis and genomic selection. We also briefly review some miscellaneous topics such as the mode of inheritance and the availability of polyploid simulation software. The current polyploid analytic toolbox includes software for assigning marker genotypes (and in particular, estimating the dosage of marker alleles in the heterozygous condition), establishing chromosome-scale linkage phase among marker alleles, constructing (short-range) haplotypes, generating linkage maps, performing genome-wide association studies (GWAS) and quantitative trait locus (QTL) analyses, and simulating polyploid populations. These tools can also help elucidate the mode of inheritance (disomic, polysomic or a mixture of both as in segmental allopolyploids) or reveal whether double reduction and multivalent chromosomal pairing occur. An increasing number of polyploids (or associated diploids) are being sequenced, leading to publicly available reference genome assemblies. Much work remains in order to keep pace with developments in genomic technologies. 
However, such technologies also offer the promise of understanding polyploid genomes at a level which hitherto has remained elusive. PMID:29720992
Cho, Yun Sung; Kim, Hyunho; Kim, Hak-Min; Jho, Sungwoong; Jun, JeHoon; Lee, Yong Joo; Chae, Kyun Shik; Kim, Chang Geun; Kim, Sangsoo; Eriksson, Anders; Edwards, Jeremy S.; Lee, Semin; Kim, Byung Chul; Manica, Andrea; Oh, Tae-Kwang; Church, George M.; Bhak, Jong
2016-01-01
Human genomes are routinely compared against a universal reference. However, this strategy could miss population-specific and personal genomic variations, which may be detected more efficiently using an ethnically relevant or personal reference. Here we report a hybrid assembly of a Korean reference genome (KOREF) for constructing personal and ethnic references by combining sequencing and mapping methods. We also build its consensus variome reference, providing information on millions of variants from 40 additional ethnically homogeneous genomes from the Korean Personal Genome Project. We find that the ethnically relevant consensus reference can be beneficial for efficient variant detection. Systematic comparison of human assemblies shows the importance of assembly quality, suggesting the necessity of new technologies to comprehensively map ethnic and personal genomic structure variations. In the era of large-scale population genome projects, the leveraging of ethnicity-specific genome assemblies as well as the human reference genome will accelerate mapping all human genome diversity. PMID:27882922
Bao, Yun-Juan; Li, Yang; Liang, Zhong; Agrahari, Garima; Lee, Shaun W; Ploplis, Victoria A; Castellino, Francis J
2017-07-31
The strains serotyped as M71 from group A Streptococcus are common causes of pharyngeal and skin diseases worldwide. Here we characterize the genome of a unique non-invasive M71 human isolate, NS53. The genome does not contain structural rearrangements or large-scale gene gains/losses, but encodes a full set of non-truncated known virulence factors, thus providing an ideal reference for comparative studies. However, the NS53 genome showed incongruent phenotypic implications from distinct genotypic markers. NS53 is characterized as an emm pattern D and FCT (fibronectin-collagen-T antigen) type-3 strain, typical of skin tropic strains, but is phylogenetically close to emm pattern E strains with preference for both skin and pharyngeal infections. We propose that this incongruence could result from recombination within the emm gene locus, or, alternatively, selection has been against those genetic alterations. Combined with the inability to select for CovS switching, a process is indicated whereby NS53 has been pre-adapted to specific host niches selecting against variations in CovS and many other genes. This may allow the strain to attain successful colonization and long-term survival. A balance between genetic variations and fitness may exist for this bacterium to form a stabilized genome optimized for survival in specific host environments. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Sadsad, Rosemarie; Martinez, Elena; Jelfs, Peter; Hill-Cawthorne, Grant A.; Gilbert, Gwendolyn L.; Marais, Ben J.; Sintchenko, Vitali
2016-01-01
Background Improved tuberculosis control and the need to contain the spread of drug-resistant strains provide a strong rationale for exploring tuberculosis transmission dynamics at the population level. Whole-genome sequencing provides optimal strain resolution, facilitating detailed mapping of potential transmission pathways. Methods We sequenced 22 isolates from a Mycobacterium tuberculosis cluster in New South Wales, Australia, identified during routine 24-locus mycobacterial interspersed repetitive unit typing. Following high-depth paired-end sequencing using the Illumina HiSeq 2000 platform, two independent pipelines were employed for analysis, both employing read mapping onto reference genomes as well as de novo assembly, to control biases in variant detection. In addition to single-nucleotide polymorphisms, the analyses also sought to identify insertions, deletions and structural variants. Results Isolates were highly similar, with a distance of 13 variants between the most distant members of the cluster. The most sensitive analysis classified the 22 isolates into 18 groups. Four of the isolates did not appear to share a recent common ancestor with the largest clade; another four isolates had an uncertain ancestral relationship with the largest clade. Conclusion Whole genome sequencing, with analysis of single-nucleotide polymorphisms, insertions, deletions, structural variants and subpopulations, enabled the highest possible level of discrimination between cluster members, clarifying likely transmission pathways and exposing the complexity of strain origin. The analysis provides a basis for targeted public health intervention and enhanced classification of future isolates linked to the cluster. PMID:26938641
A presentation of the differences between the sheep and goat genetic maps
2005-01-01
The current autosomal version (4.2) of the sheep genetic map comprises 1175 loci and spans ~3540 cM. This corresponds to almost complete coverage of the sheep genome. Each chromosome is represented by a single linkage group, with the largest gap between adjacent loci being 19.8 cM. In contrast, the 1998 goat genetic map (the most recently published) is much less well developed, spanning 2737 cM and comprising only 307 loci. Only one of the goat chromosomes appears to have complete coverage (chromosome 27), and 16 of the chromosomes comprise two or more linkage groups, or a linkage group and one or more unlinked markers. The two maps share 218 loci, and the maps have been aligned using the shared loci as reference points. Overall there is good agreement between the maps in terms of homologous loci mapping to equivalent chromosomes in the two species, with only four markers mapping to non-equivalent chromosomes. However, there are many inversions in locus order between the sheep and goat chromosomes. Whilst some of these differences in locus order may be genuine, the majority are likely to be a consequence of the paucity of genetic information for the goat map. PMID:15601590
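The comparison above aligns two maps on their shared loci and looks for differences in locus order. One simple way to flag such differences is sketched below, assuming each map is given as an ordered list of locus names for one chromosome. The function name `order_discordances` and the marker names are hypothetical, and this is not claimed to be the procedure used for the sheep and goat maps.

```python
def order_discordances(map_a, map_b):
    """List adjacent pairs of shared loci (in map_a order) whose relative
    order is reversed in map_b.

    map_a, map_b: ordered lists of locus names for one chromosome
    (hypothetical inputs); loci absent from either map are ignored.
    """
    rank_b = {locus: index for index, locus in enumerate(map_b)}
    shared = [locus for locus in map_a if locus in rank_b]
    return [
        (first, second)
        for first, second in zip(shared, shared[1:])
        if rank_b[first] > rank_b[second]
    ]


# Hypothetical chromosome: m4 is missing from the sparser map, m2 and m3 are swapped.
dense_map = ["m1", "m2", "m3", "m4", "m5"]
sparse_map = ["m1", "m3", "m2", "m5"]
swapped = order_discordances(dense_map, sparse_map)
```

On a sparse map, closely spaced markers are ordered with little statistical support, so a flagged pair is a candidate inversion rather than proof of one, which matches the caution expressed in the abstract.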
Comparative genomic analysis of the false killer whale (Pseudorca crassidens) LMBR1 locus.
Kim, Dae-Won; Choi, Sang-Haeng; Kim, Ryong Nam; Kim, Sun-Hong; Paik, Sang-Gi; Nam, Seong-Hyeuk; Kim, Dong-Wook; Kim, Aeri; Kang, Aram; Park, Hong-Seog
2010-09-01
The sequencing and comparative genomic analysis of LMBR1 loci in mammals or other species, including human, would be very important in understanding evolutionary genetic changes underlying the evolution of limb development. In this regard, comparative genomic annotation of the false killer whale LMBR1 locus could shed new light on the evolution of limb development. We sequenced two false killer whale BAC clones, corresponding to 156 kb and 144 kb, respectively, harboring the tightly linked RNF32, LMBR1, and NOM1 genes. Our annotation of the false killer whale LMBR1 gene showed that it consists of 17 exons (1473 bp), in contrast to 18 exons (1596 bp) in human, and it displays 93.1% and 95.6% nucleotide and amino acid sequence similarity, respectively, compared with the human gene. In particular, we discovered that exon 10, deleted in the false killer whale LMBR1 gene, is present only in primates, and this fact strongly implies that exon 10 might be crucial in determining primate-specific limb development. ZRS and TFBS sequences have been well conserved across 11 species, suggesting that these regions could be involved in an important function of limb development and limb patterning. The neighboring gene RNF32 showed several lineage-conserved exons, such as exons 2 through 9 conserved in eutherian mammals, exons 3 through 9 conserved in mammals, and exons 5 through 9 conserved in vertebrates. The other neighboring gene, NOM1, had undergone a substitution (ATG→GTA) at the start codon, giving rise to a 36 bp shorter N-terminal sequence compared with the human sequence. Our comparative analysis of the false killer whale LMBR1 genomic locus provides important clues regarding the genetic regions that may play crucial roles in limb development and patterning.
Comparative Analysis of the Orphan CRISPR2 Locus in 242 Enterococcus faecalis Strains
Hullahalli, Karthik; Rodrigues, Marinelle; Schmidt, Brendan D.; Li, Xiang; Bhardwaj, Pooja; Palmer, Kelli L.
2015-01-01
Clustered, Regularly Interspaced Short Palindromic Repeats and their associated Cas proteins (CRISPR-Cas) provide prokaryotes with a mechanism for defense against mobile genetic elements (MGEs). A CRISPR locus is a molecular memory of MGE encounters. It contains an array of short sequences, called spacers, that generally have sequence identity to MGEs. Three different CRISPR loci have been identified among strains of the opportunistic pathogen Enterococcus faecalis. CRISPR1 and CRISPR3 are associated with the cas genes necessary for blocking MGEs, but these loci are present in only a subset of E. faecalis strains. The orphan CRISPR2 lacks cas genes and is ubiquitous in E. faecalis, although its spacer content varies from strain to strain. Because CRISPR2 is a variable locus occurring in all E. faecalis, comparative analysis of CRISPR2 sequences may provide information about the clonality of E. faecalis strains. We examined CRISPR2 sequences from 228 E. faecalis genomes in relationship to subspecies phylogenetic lineages (sequence types; STs) determined by multilocus sequence typing (MLST), and to a genome phylogeny generated for a representative 71 genomes. We found that specific CRISPR2 sequences are associated with specific STs and with specific branches on the genome tree. To explore possible applications of CRISPR2 analysis, we evaluated 14 E. faecalis bloodstream isolates using CRISPR2 analysis and MLST. CRISPR2 analysis identified two groups of clonal strains among the 14 isolates, an assessment that was confirmed by MLST. CRISPR2 analysis was also used to accurately predict the ST of a subset of isolates. We conclude that CRISPR2 analysis, while not a replacement for MLST, is an inexpensive method to assess clonality among E. faecalis isolates, and can be used in conjunction with MLST to identify recombination events occurring between STs. PMID:26398194
Whistler, Cheryl A; Hall, Jeffrey A; Xu, Feng; Ilyas, Saba; Siwakoti, Puskar; Cooper, Vaughn S; Jones, Stephen H
2015-06-01
Vibrio parahaemolyticus sequence type 36 (ST36) strains that are native to the Pacific Ocean have recently caused multistate outbreaks of gastroenteritis linked to shellfish harvested from the Atlantic Ocean. Whole-genome comparisons of 295 genomes of V. parahaemolyticus, including several traced to northeastern U.S. sources, were used to identify diagnostic loci, one putatively encoding an endonuclease (prp), and two others potentially conferring O-antigenic properties (cps and flp). The combination of all three loci was present in only one clade of closely related strains of ST36, ST59, and one additional unknown sequence type. However, each locus was also identified outside this clade, with prp and flp occurring in only two nonclade isolates and cps in four. Based on the distribution of these loci in sequenced genomes, prp identified clade strains with >99% accuracy, but the addition of one more locus increased accuracy to 100%. Oligonucleotide primers targeting prp and cps were combined in a multiplex PCR method that defines species using the tlh locus and determines the presence of both the tdh and trh hemolysin-encoding genes, which are also present in ST36. Application of the method in vitro to a collection of 94 clinical isolates collected over a 4-year period in three northeastern U.S. states and 87 environmental isolates revealed that the prp and cps amplicons were detected only in clinical isolates identified as belonging to the ST36 clade and in no environmental isolates from the region. The assay should improve detection and surveillance, thereby reducing infections. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
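The ">99% accuracy" figure for prp is consistent with simple counting over the 295 compared genomes. The following is a minimal sketch, assuming that the two nonclade isolates carrying prp are the only misclassifications (no clade strain lacks prp) and that requiring a second locus removes both of them; the abstract does not spell out either point. The function name `locus_accuracy` is hypothetical.

```python
def locus_accuracy(total_genomes: int, false_positives: int, false_negatives: int = 0) -> float:
    """Fraction of genomes whose clade membership is called correctly by a marker."""
    correct = total_genomes - false_positives - false_negatives
    return correct / total_genomes


# prp alone: present in two isolates outside the clade (from the abstract).
prp_accuracy = locus_accuracy(total_genomes=295, false_positives=2)

# prp plus a second locus: assumed to leave no nonclade isolate carrying both.
combined_accuracy = locus_accuracy(total_genomes=295, false_positives=0)

print(f"{prp_accuracy:.4f}")       # 0.9932
print(f"{combined_accuracy:.4f}")  # 1.0000
```

Requiring both markers acts as a logical AND, which is why a multiplex PCR that amplifies prp and cps together can be more specific than either amplicon alone.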
Zhang, Han; Rokas, Antonis; Slot, Jason C
2012-01-01
Dermatophyte fungi of the family Arthrodermataceae (Eurotiomycetes) colonize keratinized tissue, such as skin, frequently causing superficial mycoses in humans and other mammals, reptiles, and birds. Competition with native microflora likely underlies the propensity of these dermatophytes to produce a diversity of antibiotics and compounds for scavenging iron, which is extremely scarce, as well as the presence of an unusually large number of putative secondary metabolism gene clusters, most of which contain non-ribosomal peptide synthetases (NRPS), in their genomes. To better understand the historical origins and diversification of NRPS-containing gene clusters we examined the evolution of a variable locus (VL) that exists in one of three alternative conformations among the genomes of seven dermatophyte species. The first conformation of the VL (termed VLA) contains only 539 base pairs of sequence and lacks protein-coding genes, whereas the other two conformations (termed VLB and VLC) span 36 Kb and 27 Kb and contain 12 and 10 genes, respectively. Interestingly, both VLB and VLC appear to contain distinct secondary metabolism gene clusters; VLB contains a NRPS gene as well as four porphyrin metabolism genes never found to be physically linked in the genomes of 128 other fungal species, whereas VLC also contains a NRPS gene as well as several others typically found associated with secondary metabolism gene clusters. Phylogenetic evidence suggests that the VL locus was present in the ancestor of all seven species achieving its present distribution through subsequent differential losses or retentions of specific conformations. We propose that the existence of variable loci, similar to the one we studied, in fungal genomes could potentially explain the dramatic differences in secondary metabolic diversity between closely related species of filamentous fungi, and contribute to host adaptation and the generation of metabolic diversity.
Haselden, Karen; Powell, Theresa; Drinnan, Mike; Carding, Paul
2009-11-01
Locus of Control (LoC) refers to an individual's perception of whether they are in control of life events. Health Locus of Control refers to whether someone feels they have influence over their health. Health Locus of Control has not been studied in any depth in voice-disordered patients. The objective of this study was to examine Health Locus of Control in three patient groups: (1) Spasmodic Dysphonia, (2) Functional Dysphonia and (3) a nondysphonic group with Nonlaryngeal Dystonia. LoC was measured and compared in a total of 57 patients using the Multidimensional Health Locus of Control Scales (diagnostic specific) Form C. Internal, Chance, and Powerful others LoC were measured and comparisons were made using one-way analysis of variance. Contrary to expectations, Internal LoC was found to be significantly higher in the Functional Dysphonia group when compared to the other two groups. There was no significant difference between the groups in Chance or Powerful others LoC. The two organic groups, Spasmodic Dysphonia and Nonlaryngeal Dystonia, were more alike in Internal Health Locus of Control than the Functional Dysphonia group. The diagnostic nature of the groups was reflected in their LoC scores rather than their voice loss. These results contribute to the debate about the etiology of Spasmodic Dysphonia and will be of interest to those involved in the psychology of voice and those managing voice-disordered patients.
Identification and Characterization of Genomic Amplifications in Ovarian Serous Carcinoma
2008-01-01
lower cost. As a result, we have analyzed more than 40 affinity purified ovarian serous tumors and our results demonstrated that CCNE1, Notch3, Rsf-1...serous tumors. In addition, we have further characterized the biological functions of the two of the commonly amplified genes, Notch3 and Rsf-1, in...analyses, we have focused on two of the most frequently amplified regions, 11q13.2 (Rsf-1 locus) and 19p13 (Notch3 locus), for detailed mapping and
Carter, Tamar E.; Boulter, Alexis; Existe, Alexandre; Romain, Jean R.; St. Victor, Jean Yves; Mulligan, Connie J.; Okech, Bernard A.
2015-01-01
Antimalarial drugs are a key tool in malaria elimination programs. With the emergence of artemisinin resistance in southeast Asia, an effort to identify molecular markers for surveillance of resistant malaria parasites is underway. Non-synonymous mutations in the kelch propeller domain (K13-propeller) in Plasmodium falciparum have been associated with artemisinin resistance in samples from southeast Asia, but additional studies are needed to characterize this locus in other P. falciparum populations with different levels of artemisinin use. Here, we sequenced the K13-propeller locus in 82 samples from Haiti, where limited government oversight of non-governmental organizations may have resulted in low-level use of artemisinin-based combination therapies. We detected a single-nucleotide polymorphism (SNP) at nucleotide 1,359 in a single isolate. Our results contribute to our understanding of the global genomic diversity of the K13-propeller locus in P. falciparum populations. PMID:25646258
USH1H, a novel locus for type I Usher syndrome, maps to chromosome 15q22-23.
Ahmed, Z M; Riazuddin, S; Khan, S N; Friedman, P L; Riazuddin, S; Friedman, T B
2009-01-01
Usher syndrome (USH) is a hereditary disorder associated with sensorineural hearing impairment, progressive loss of vision attributable to retinitis pigmentosa (RP) and variable vestibular function. Three clinical types have been described with type I (USH1) being the most severe. To date, six USH1 loci have been reported. We ascertained two large Pakistani consanguineous families segregating profound hearing loss, vestibular dysfunction, and RP, the defining features of USH1. In these families, we excluded linkage of USH to the 11 known USH loci and subsequently performed a genome-wide linkage screen. We found a novel USH1 locus designated USH1H that mapped to chromosome 15q22-23 in a 4.92-cM interval. This locus overlaps the non-syndromic deafness locus DFNB48 raising the possibility that the two disorders may be caused by allelic mutations.
Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders.
Hamosh, Ada; Scott, Alan F; Amberger, Joanna; Bocchini, Carol; Valle, David; McKusick, Victor A
2002-01-01
Online Mendelian Inheritance in Man (OMIM) is a comprehensive, authoritative and timely knowledgebase of human genes and genetic disorders compiled to support research and education in human genomics and the practice of clinical genetics. Started by Dr Victor A. McKusick as the definitive reference Mendelian Inheritance in Man, OMIM (www.ncbi.nlm.nih.gov/omim) is now distributed electronically by the National Center for Biotechnology Information (NCBI), where it is integrated with the Entrez suite of databases. Derived from the biomedical literature, OMIM is written and edited at Johns Hopkins University with input from scientists and physicians around the world. Each OMIM entry has a full-text summary of a genetically determined phenotype and/or gene and has numerous links to other genetic databases such as DNA and protein sequence, PubMed references, general and locus-specific mutation databases, approved gene nomenclature, and the highly detailed mapviewer, as well as patient support groups and many others. OMIM is an easy and straightforward portal to the burgeoning information in human genetics.
Sequencing the Unrearranged Human Immunoglobin
DOE Office of Scientific and Technical Information (OSTI.GOV)
Warren, Rene
2010-06-03
Rene Warren from Canada's Michael Smith Genome Sciences Centre discusses sequencing and finishing the IgH heavy chain locus on June 3, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.
Zhang, Chunxiao; Sheng, Chaolan; Wang, Wei; Hu, Hongbo; Peng, Huasong; Zhang, Xuehong
2015-01-01
Streptomyces lomondensis S015 synthesizes the broad-spectrum phenazine antibiotic lomofungin. Whole genome sequencing of this strain revealed a genomic locus consisting of 23 open reading frames that includes the core phenazine biosynthesis gene cluster lphzGFEDCB. lomo10, encoding a putative flavin-dependent monooxygenase, was also identified in this locus. Inactivation of lomo10 by in-frame partial deletion resulted in the biosynthesis of a new phenazine metabolite, 1-carbomethoxy-6-formyl-4,9-dihydroxy-phenazine, along with the absence of lomofungin. This result suggests that lomo10 is responsible for the hydroxylation of lomofungin at its C-7 position. This is the first description of a phenazine hydroxylation gene in Streptomyces, and the results of this study lay the foundation for further investigation of phenazine metabolite biosynthesis in Streptomyces. PMID:26305803
Coates, B S; Alves, A P; Wang, H; Zhou, X; Nowatzki, T; Chen, H; Rangasamy, M; Robertson, H M; Whitfield, C W; Walden, K K; Kachman, S D; French, B W; Meinke, L J; Hawthorne, D; Abel, C A; Sappington, T W; Siegfried, B D; Miller, N J
2016-02-01
The western corn rootworm, Diabrotica virgifera virgifera, is an insect pest of corn and population suppression with chemical insecticides is an important management tool. Traits conferring organophosphate insecticide resistance have increased in frequency amongst D. v. virgifera populations, resulting in reduced insecticide efficacy in many corn-growing regions of the USA. We used comparative functional genomic and quantitative trait locus (QTL) mapping approaches to investigate the genetic basis of D. v. virgifera resistance to the organophosphate methyl-parathion. RNA from methyl-parathion resistant and susceptible adults was hybridized to 8331 microarray probes. The results predicted that 11 transcripts were significantly up-regulated in resistant phenotypes, with the most significant (fold increases ≥ 2.43) being an α-esterase-like transcript. Differential expression was validated only for the α-esterase (ST020027A20C03), with 11- to 13-fold greater expression in methyl-parathion resistant adults (P < 0.05). Progeny with a segregating methyl-parathion resistance trait were obtained from a reciprocal backcross design. QTL analyses of high-throughput single nucleotide polymorphism genotype data predicted involvement of a single genome interval. These data suggest that a specific carboxylesterase may function in field-evolved corn rootworm resistance to organophosphates, even though direct linkage between the QTL and this locus could not be established. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.
Huynh, Bao-Lam; Matthews, William C; Ehlers, Jeffrey D; Lucas, Mitchell R; Santos, Jansen R P; Ndeve, Arsenio; Close, Timothy J; Roberts, Philip A
2016-01-01
Genome resolution of a major QTL associated with the Rk locus in cowpea for resistance to root-knot nematodes has significance for plant breeding programs and R gene characterization. Cowpea (Vigna unguiculata L. Walp.) is a susceptible host of root-knot nematodes (Meloidogyne spp.) (RKN), major plant-parasitic pests in global agriculture. To date, breeding for host resistance in cowpea has relied on phenotypic selection which requires time-consuming and expensive controlled infection assays. To facilitate marker-based selection, we aimed to identify and map quantitative trait loci (QTL) conferring the resistance trait. One recombinant inbred line (RIL) and two F2:3 populations, each derived from a cross between a susceptible and a resistant parent, were genotyped with genome-wide single nucleotide polymorphism (SNP) markers. The populations were screened in the field for root-galling symptoms and/or under growth-chamber conditions for nematode reproduction levels using M. incognita and M. javanica biotypes. One major QTL was mapped consistently on linkage group VuLG11 of each population. By genotyping additional cowpea lines and near-isogenic lines derived from conventional backcrossing, we confirmed that the detected QTL co-localized with the genome region associated with the Rk locus for RKN resistance that has been used in conventional breeding for many decades. This chromosomal location defined with flanking markers will be a valuable target in marker-assisted breeding and for positional cloning of genes controlling RKN resistance.
League, Garrett P; Slot, Jason C; Rokas, Antonis
2012-11-01
The asparagine degradation pathway in the S288c laboratory strain of Saccharomyces cerevisiae is comprised of genes located at two separate loci. ASP1 is located on chromosome IV and encodes cytosolic l-asparaginase I, whereas ASP3 contains a gene cluster located on chromosome XII comprised of four identical genes, ASP3-1, ASP3-2, ASP3-3, and ASP3-4, which encode cell wall-associated l-asparaginase II. Interestingly, the ASP3 locus appears to be present, in variable copy number, only in S. cerevisiae strains isolated from laboratory or industrial environments and is completely absent from the genomes of 128 diverse fungal species. Investigation of the evolutionary history of ASP3 across these 128 genomes as well as across the genomes of 43 S. cerevisiae strains shows that ASP3 likely arose in a S. cerevisiae strain via horizontal gene transfer (HGT) from the wine yeast Wickerhamomyces anomalus, or a close relative, which co-occurs with S. cerevisiae in several biotechnological processes. Thus, because the ASP3 present in the S288c laboratory strain of S. cerevisiae is induced in response to nitrogen starvation, its acquisition may have aided yeast adaptation to artificial environments. Our finding that the ASP3 locus in S. cerevisiae originated via HGT further highlights the importance of gene sharing between yeasts in the evolution of their remarkable metabolic diversity. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Littlejohn, Mathew D; Turner, Sally-Anne; Walker, Caroline G; Berry, Sarah D; Tiplady, Kathryn; Sherlock, Ric G; Sutherland, Greg; Swift, Simon; Garrick, Dorian; Lacy-Hulbert, S Jane; McDougall, Scott; Spelman, Richard J; Snell, Russell G; Hillerton, J Eric
2018-05-01
Inflammation of the mammary gland following bacterial infection, commonly known as mastitis, affects all mammalian species. Although the aetiology and epidemiology of mastitis in the dairy cow are well described, the genetic factors mediating resistance to mammary gland infection are not well known, due in part to the difficulty in obtaining robust phenotypic information from sufficiently large numbers of individuals. To address this problem, an experimental mammary gland infection study was undertaken using a Friesian-Jersey crossbred F2 herd. A total of 604 animals received an intramammary infusion of Streptococcus uberis in one gland, and the clinical response over 13 milkings was used for linkage mapping and genome-wide association analysis. A quantitative trait locus (QTL) was detected on bovine chromosome 11 for clinical mastitis status using micro-satellite and Affymetrix 10 K SNP markers, and exome and genome sequence data from the six F1 sires of the experimental animals were then used to examine this region in more detail. A total of 485 sequence variants were typed in the QTL interval, and association mapping using these and an additional 37 986 genome-wide markers from the Illumina SNP50 bovine SNP panel revealed association with markers encompassing the interleukin-1 gene cluster locus. This study highlights a region on bovine chromosome 11, consistent with earlier studies, as conferring resistance to experimentally induced mammary gland infection, and newly prioritises the IL1 gene cluster for further analysis in genetic resistance to mastitis.
Pooled genome wide association detects association upstream of FCRL3 with Graves' disease.
Khong, Jwu Jin; Burdon, Kathryn P; Lu, Yi; Laurie, Kate; Leonardos, Lefta; Baird, Paul N; Sahebjada, Srujana; Walsh, John P; Gajdatsy, Adam; Ebeling, Peter R; Hamblin, Peter Shane; Wong, Rosemary; Forehan, Simon P; Fourlanos, Spiros; Roberts, Anthony P; Doogue, Matthew; Selva, Dinesh; Montgomery, Grant W; Macgregor, Stuart; Craig, Jamie E
2016-11-18
Graves' disease is an autoimmune thyroid disease of complex inheritance. Multiple genetic susceptibility loci are thought to be involved in Graves' disease and it is therefore likely that these can be identified by genome wide association studies. This study aimed to determine if a genome wide association study, using a pooling methodology, could detect genomic loci associated with Graves' disease. Nineteen of the top ranking single nucleotide polymorphisms, including HLA-DQA1 and C6orf10, were clustered within the Major Histocompatibility Complex region on chromosome 6p21, with rs1613056 reaching genome wide significance (p = 5 × 10⁻⁸). Technical validation of top ranking non-Major Histocompatibility Complex single nucleotide polymorphisms with individual genotyping in the discovery cohort revealed four single nucleotide polymorphisms with p ≤ 10⁻⁴. Rs17676303 on chromosome 1q23.1, located upstream of FCRL3, showed evidence of association with Graves' disease across the discovery, replication and combined cohorts. A second single nucleotide polymorphism, rs9644119, downstream of DPYSL2 showed some evidence of association, supported by findings in the replication cohort, that warrants further study. The pooled genome wide association study identified a genetic variant upstream of FCRL3 as a susceptibility locus for Graves' disease in addition to those identified in the Major Histocompatibility Complex. A second locus downstream of DPYSL2 is potentially a novel genetic variant in Graves' disease that requires further confirmation.
Song, B K; Pan, M Z; Lau, Y L; Wan, K L
2014-07-29
Commercial flocks infected by Eimeria species parasites, including Eimeria maxima, have an increased risk of developing clinical or subclinical coccidiosis, an intestinal enteritis associated with increased mortality rates in poultry. Currently, infection control is largely based on chemotherapy or live vaccines; however, drug resistance is common and vaccines are relatively expensive. The development of new cost-effective intervention measures will benefit from unraveling the complex genetic mechanisms that underlie host-parasite interactions, including the identification and characterization of genes encoding proteins such as phosphatidylinositol 4-phosphate 5-kinase (PIP5K). We previously identified a PIP5K coding sequence within the E. maxima genome. In this study, we analyzed two bacterial artificial chromosome clones presenting a ~145-kb E. maxima (Weybridge strain) genomic region spanning the PIP5K gene locus. Sequence analysis revealed that ~95% of the simple sequence repeats detected were located within regions comparable to the previously described feature-rich segments of the Eimeria tenella genome. Comparative sequence analysis with the orthologous E. maxima (Houghton strain) region revealed a moderate level of conserved synteny. Unique segmental organizations and telomere-like repeats were also observed in both genomes. A number of incomplete transposable elements were detected and further scrutiny of these elements in both orthologous segments revealed interesting nesting events, which may play a role in facilitating genome plasticity in E. maxima. The current analysis provides more detailed information about the genome organization of E. maxima and may help to reveal genotypic differences that are important for expression of traits related to pathogenicity and virulence.
The dynamic proliferation of CanSINEs mirrors the complex evolution of Feliforms
2014-01-01
Background: Repetitive short interspersed elements (SINEs) are retrotransposons ubiquitous in mammalian genomes and are highly informative markers to identify species and phylogenetic associations. Of these, SINEs unique to the order Carnivora (CanSINEs) yield novel insights on genome evolution in domestic dogs and cats, but less is known about their role in related carnivores. In particular, genome-wide assessment of CanSINE evolution has yet to be completed across the Feliformia (cat-like) suborder of Carnivora. Within Feliformia, the cat family Felidae is composed of 37 species and numerous subspecies organized into eight monophyletic lineages that likely arose 10 million years ago. Using the Felidae family as a reference phylogeny, along with representative taxa from other families of Feliformia, the origin, proliferation and evolution of CanSINEs within the suborder were assessed. Results: We identified 93 novel intergenic CanSINE loci in Feliformia. Sequence analyses separated Feliform CanSINEs into two subfamilies, each characterized by distinct RNA polymerase binding motifs and phylogenetic associations. Subfamily I CanSINEs arose early within Feliformia but are no longer under active proliferation. Subfamily II loci are more recent, exclusive to Felidae and show evidence for adaptation to extant RNA polymerase activity. Further, presence/absence distributions of CanSINE loci are largely congruent with taxonomic expectations within Feliformia and the less resolved nodes in the Felidae reference phylogeny present equally ambiguous CanSINE data. SINEs are thought to be nearly impervious to excision from the genome. However, we observed a nearly complete excision of a CanSINE locus in puma (Puma concolor). In addition, we found that CanSINE proliferation in Felidae frequently targeted existing CanSINE loci for insertion sites, resulting in tandem arrays.
Conclusions: We demonstrate the existence of at least two SINE families within the Feliformia suborder, one of which is actively involved in insertional mutagenesis. We find SINEs are powerful markers of speciation and conclude that the few inconsistencies with expected patterns of speciation likely represent incomplete lineage sorting, species hybridization and SINE-mediated genome rearrangement. PMID:24947429
The Chlamydomonas genome project: a decade on
Blaby, Ian K.; Blaby-Haas, Crysten; Tourasse, Nicolas; Hom, Erik F. Y.; Lopez, David; Aksoy, Munevver; Grossman, Arthur; Umen, James; Dutcher, Susan; Porter, Mary; King, Stephen; Witman, George; Stanke, Mario; Harris, Elizabeth H.; Goodstein, David; Grimwood, Jane; Schmutz, Jeremy; Vallon, Olivier; Merchant, Sabeeha S.; Prochnik, Simon
2014-01-01
The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis and micronutrient homeostasis. Ten years since its genome project was initiated, an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the “omics” era. Housed at Phytozome, the Joint Genome Institute’s (JGI) plant genomics portal, the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of RNA-Seq data. Here, we present the past, present and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes. PMID:24950814
[Use of multiple locus variable number tandem repeats analysis for the Brucella systematization].
Kulakov, Iu K; Kovalev, D A; Misetova, E N; Golovneva, S I; Liapustina, L V; Zheludkov, M M
2012-01-01
Methods of molecular-genetic differentiation to the strain level are becoming increasingly important in current brucellosis control. MLVA (multiple locus variable number tandem repeats analysis) was selected for molecular-genetic differentiation to the strain level and simultaneous determination of the genetic relationships among the investigated Brucella strains. The goal of this work was MLVA typing of strains of three pathogenic Brucella species, with analysis of the stability of the chosen loci, discriminatory power, and concordance with conventional phenotypic methods of Brucella differentiation, for use in the systematization of the causative agents of brucellosis. Twenty-six Brucella strains were tested, representing reference (n = 15), vaccine (n = 2) and field strains of three pathogenic Brucella species: B. melitensis (n = 3), B. abortus (n = 2), B. suis (n = 2), and isolates (n = 2) of unidentified taxonomic position, using MLVA with 9 primer pairs targeting known variable loci of the Brucella genome. The stability of the chosen loci, the discriminatory power according to the Hunter-Gaston discrimination index (HGDI), and the consistency with phenotypic identification methods were analyzed. MLVA confirmed the results of the phenotypic identification methods and the stability of the chosen loci in the majority of reference and vaccine strains, with a high HGDI of 0.9969 for all loci. A dendrogram plotted from the MLVA data distributed the Brucella strains into related clusters according to their species and biovar and resolved 25 genotypes. B. melitensis strains formed a cluster related to the reference strain B. melitensis 63/9 biovar 2. The Australian isolates Brucella 83-4 and Brucella 83-6, isolated from rodents, formed a cluster distant from the other Brucella strains.
MLVA is a promising method for differentiating Brucella strains of known and unresolved taxonomic status, for their systematization, and for creating an MLVA genotype catalogue that will improve the quality of the brucellosis surveillance system in Russia.
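The Hunter-Gaston discrimination index mentioned in this abstract is one minus the probability that two strains drawn at random without replacement share a genotype: D = 1 − Σ nⱼ(nⱼ − 1) / (N(N − 1)), where nⱼ is the number of strains of genotype j and N is the total. The sketch below computes it for a toy labeling. The function name `hunter_gaston_index` and the genotype labels are hypothetical, and the split of 26 strains into 24 unique genotypes plus one shared by two strains is an assumption chosen to match the reported counts (26 strains, 25 genotypes), not data from the study.

```python
from collections import Counter


def hunter_gaston_index(type_labels):
    """Hunter-Gaston discrimination index for a list of genotype labels."""
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two strains")
    counts = Counter(type_labels)
    # Ordered pairs of distinct strains that share a genotype.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical labeling: 24 singleton genotypes and one genotype seen twice.
labels = [f"GT{i}" for i in range(1, 25)] + ["GT25", "GT25"]
hgdi = hunter_gaston_index(labels)

print(len(labels), len(set(labels)))  # 26 25
print(round(hgdi, 4))                 # 0.9969
```

With this assumed distribution the index is 1 − 2/650, which rounds to the 0.9969 quoted above; other ways of assigning 26 strains to 25 genotypes are not possible, but the abstract does not say which two strains share a genotype.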
Fehringer, Gordon; Kraft, Peter; Pharoah, Paul D; Eeles, Rosalind A; Chatterjee, Nilanjan; Schumacher, Fredrick R; Schildkraut, Joellen M; Lindström, Sara; Brennan, Paul; Bickeböller, Heike; Houlston, Richard S; Landi, Maria Teresa; Caporaso, Neil; Risch, Angela; Amin Al Olama, Ali; Berndt, Sonja I; Giovannucci, Edward L; Grönberg, Henrik; Kote-Jarai, Zsofia; Ma, Jing; Muir, Kenneth; Stampfer, Meir J; Stevens, Victoria L; Wiklund, Fredrik; Willett, Walter C; Goode, Ellen L; Permuth, Jennifer B; Risch, Harvey A; Reid, Brett M; Bezieau, Stephane; Brenner, Hermann; Chan, Andrew T; Chang-Claude, Jenny; Hudson, Thomas J; Kocarnik, Jonathan K; Newcomb, Polly A; Schoen, Robert E; Slattery, Martha L; White, Emily; Adank, Muriel A; Ahsan, Habibul; Aittomäki, Kristiina; Baglietto, Laura; Blomquist, Carl; Canzian, Federico; Czene, Kamila; Dos-Santos-Silva, Isabel; Eliassen, A Heather; Figueroa, Jonine D; Flesch-Janys, Dieter; Fletcher, Olivia; Garcia-Closas, Montserrat; Gaudet, Mia M; Johnson, Nichola; Hall, Per; Hazra, Aditi; Hein, Rebecca; Hofman, Albert; Hopper, John L; Irwanto, Astrid; Johansson, Mattias; Kaaks, Rudolf; Kibriya, Muhammad G; Lichtner, Peter; Liu, Jianjun; Lund, Eiliv; Makalic, Enes; Meindl, Alfons; Müller-Myhsok, Bertram; Muranen, Taru A; Nevanlinna, Heli; Peeters, Petra H; Peto, Julian; Prentice, Ross L; Rahman, Nazneen; Sanchez, Maria Jose; Schmidt, Daniel F; Schmutzler, Rita K; Southey, Melissa C; Tamimi, Rulla; Travis, Ruth C; Turnbull, Clare; Uitterlinden, Andre G; Wang, Zhaoming; Whittemore, Alice S; Yang, Xiaohong R; Zheng, Wei; Buchanan, Daniel D; Casey, Graham; Conti, David V; Edlund, Christopher K; Gallinger, Steven; Haile, Robert W; Jenkins, Mark; Le Marchand, Loïc; Li, Li; Lindor, Noralene M; Schmit, Stephanie L; Thibodeau, Stephen N; Woods, Michael O; Rafnar, Thorunn; Gudmundsson, Julius; Stacey, Simon N; Stefansson, Kari; Sulem, Patrick; Chen, Y Ann; Tyrer, Jonathan P; Christiani, David C; Wei, Yongyue; Shen, Hongbing; Hu, Zhibin; Shu, 
Xiao-Ou; Shiraishi, Kouya; Takahashi, Atsushi; Bossé, Yohan; Obeidat, Ma'en; Nickle, David; Timens, Wim; Freedman, Matthew L; Li, Qiyuan; Seminara, Daniela; Chanock, Stephen J; Gong, Jian; Peters, Ulrike; Gruber, Stephen B; Amos, Christopher I; Sellers, Thomas A; Easton, Douglas F; Hunter, David J; Haiman, Christopher A; Henderson, Brian E; Hung, Rayjean J
2016-09-01
Identifying genetic variants with pleiotropic associations can uncover common pathways influencing multiple cancers. We took a two-stage approach to conduct genome-wide association studies for lung, ovary, breast, prostate, and colorectal cancer from the GAME-ON/GECCO Network (61,851 cases, 61,820 controls) to identify pleiotropic loci. Findings were replicated in independent association studies (55,789 cases, 330,490 controls). We identified a novel pleiotropic association at 1q22 involving breast and lung squamous cell carcinoma, with eQTL analysis showing an association with ADAM15/THBS3 gene expression in lung. We also identified a known breast cancer locus CASP8/ALS2CR12 associated with prostate cancer, a known cancer locus at CDKN2B-AS1 with different variants associated with lung adenocarcinoma and prostate cancer, and confirmed the associations of a breast BRCA2 locus with lung and serous ovarian cancer. This is the largest study to date examining pleiotropy across multiple cancer-associated loci, identifying common mechanisms of cancer development and progression. Cancer Res; 76(17); 5103-14. ©2016 AACR.
Ribosomal DNA Integrating rAAV-rDNA Vectors Allow for Stable Transgene Expression
Lisowski, Leszek; Lau, Ashley; Wang, Zhongya; Zhang, Yue; Zhang, Feijie; Grompe, Markus; Kay, Mark A
2012-01-01
Although recombinant adeno-associated virus (rAAV) vectors are proving to be efficacious in clinical trials, the episomal character of the delivered transgene restricts their effectiveness to use in quiescent tissues, and may not provide lifelong expression. In contrast, integrating vectors enhance the risk of insertional mutagenesis. In an attempt to overcome both of these limitations, we created new rAAV-rDNA vectors, with an expression cassette flanked by ribosomal DNA (rDNA) sequences capable of homologous recombination into genomic rDNA. We show that after in vivo delivery the rAAV-rDNA vectors integrated into the genomic rDNA locus 8–13 times more frequently than control vectors, providing an estimate that 23–39% of the integrations were specific to the rDNA locus. Moreover, a rAAV-rDNA vector containing a human factor IX (hFIX) expression cassette resulted in sustained therapeutic levels of serum hFIX even after repeated manipulations to induce liver regeneration. Because of the relative safety of integration in the rDNA locus, these vectors expand the usage of rAAV for therapeutics requiring long-term gene transfer into dividing cells. PMID:22990671
Atopic dermatitis in West Highland white terriers is associated with a 1.3-Mb region on CFA 17.
Roque, Joana B; O'Leary, Caroline A; Duffy, David L; Kyaw-Tanner, Myat; Gharahkhani, Puya; Vogelnest, Linda; Mason, Kenneth; Shipstone, Michael; Latter, Melanie
2012-03-01
Canine atopic dermatitis (AD) is an allergic inflammatory skin disease that shares similarities with AD in humans. Canine AD is likely to be an inherited disease in dogs and is common in West Highland white terriers (WHWTs). We performed a genome-wide association study using the Affymetrix Canine SNP V2 array consisting of over 42,800 single nucleotide polymorphisms, on 35 atopic and 25 non-atopic WHWTs. A gene-dropping simulation method, using SIB-PAIR, identified a projected 1.3 Mb area of association (genome-wide P = 6 × 10⁻⁵ to P = 7 × 10⁻⁴) on CFA 17. Nineteen genes on CFA 17, including 1 potential candidate gene (PTPN22), were located less than 0.5 Mb from the interval of association identified on the genome-wide association analysis. Four haplotypes within this locus were differently distributed between cases and controls in this population of dogs. These findings suggest that a major locus for canine AD in WHWTs may be located on, or in close proximity to, an area on CFA 17.
Spontaneous CRISPR loci generation in vivo by non-canonical spacer integration
Nivala, Jeff; Shipman, Seth L.; Church, George M.
2018-01-01
The adaptation phase of CRISPR-Cas immunity depends on the precise integration of short segments of foreign DNA (spacers) into a specific genomic location within the CRISPR locus by the Cas1-Cas2 integration complex. Although off-target spacer integration outside of canonical CRISPR arrays has been described in vitro, no evidence of non-specific integration activity has been found in vivo. Here, we show that non-canonical off-target integrations can occur within bacterial chromosomes at locations that resemble the native CRISPR locus by characterizing hundreds of off-target integration locations within Escherichia coli. Considering whether such promiscuous Cas1-Cas2 activity could have an evolutionary role through the genesis of neo-CRISPR loci, we combed existing CRISPR databases and available genomes for evidence of off-target integration activity. This search uncovered several putative instances of naturally occurring off-target spacer integration events within the genomes of Yersinia pestis and Sulfolobus islandicus. These results are important in understanding alternative routes to CRISPR array genesis and evolution, as well as in the use of spacer acquisition in technological applications. PMID:29379209
Peng, Jin; Wang, Yong; Jiang, Junyi; Zhou, Xiaoyang; Song, Lei; Wang, Lulu; Ding, Chen; Qin, Jun; Liu, Liping; Wang, Weihua; Liu, Jianqiao; Huang, Xingxu; Wei, Hong; Zhang, Pumin
2015-11-12
Precise genome modification in large domesticated animals is desirable under many circumstances. In the past, it was possible only through lengthy and burdensome cloning procedures. Here we attempted to achieve that goal through the use of the newest genome-modifying tool, CRISPR/Cas9. We set out to knock in human albumin cDNA into the pig Alb locus for the production of recombinant human serum albumin (rHSA). HSA is a widely used human blood product and is in high demand. We show that homologous recombination can occur highly efficiently in swine zygotes. All 16 piglets born from the manipulated zygotes carry the expected knockin allele, and we demonstrated the presence of human albumin in the blood of these piglets. Furthermore, the knockin allele was successfully transmitted through the germline. This success in precision genomic engineering is expected to spur exploration of pigs and other large domesticated animals as bioreactors for the production of biomedical products or for the creation of livestock strains with more desirable traits.
[Efficient genome editing in human pluripotent stem cells through CRISPR/Cas9].
Liu, Gai-gai; Li, Shuang; Wei, Yu-da; Zhang, Yong-xian; Ding, Qiu-rong
2015-11-01
The RNA-guided CRISPR (clustered regularly interspaced short palindromic repeat)-associated Cas9 nuclease has offered a new platform for genome editing with high efficiency. Here, we report the use of CRISPR/Cas9 technology to target a specific genomic region in human pluripotent stem cells. We show that CRISPR/Cas9 can be used to disrupt a gene by introducing frameshift mutations into the gene coding region; to knock in specific sequences (e.g. a FLAG tag DNA sequence) at a targeted genomic locus via homology-directed repair; and to induce large genomic deletions through dual-guide multiplexing. Our results demonstrate the versatile application of CRISPR/Cas9 in stem cell genome editing, which can be widely utilized for functional studies of genes or genome loci in human pluripotent stem cells.
COBRA-Seq: Sensitive and Quantitative Methylome Profiling
Varinli, Hilal; Statham, Aaron L.; Clark, Susan J.; Molloy, Peter L.; Ross, Jason P.
2015-01-01
Combined Bisulfite Restriction Analysis (COBRA) quantifies DNA methylation at a specific locus. It does so via digestion of PCR amplicons produced from bisulfite-treated DNA, using a restriction enzyme that contains a cytosine within its recognition sequence, such as TaqI. Here, we introduce COBRA-seq, a genome wide reduced methylome method that requires minimal DNA input (0.1–1.0 μg) and can either use PCR or linear amplification to amplify the sequencing library. Variants of COBRA-seq can be used to explore CpG-depleted as well as CpG-rich regions in vertebrate DNA. The choice of enzyme influences enrichment for specific genomic features, such as CpG-rich promoters and CpG islands, or enrichment for less CpG dense regions such as enhancers. COBRA-seq coupled with linear amplification has the additional advantage of reduced PCR bias by producing full length fragments at high abundance. Unlike other reduced representative methylome methods, COBRA-seq has great flexibility in the choice of enzyme and can be multiplexed and tuned, to reduce sequencing costs and to interrogate different numbers of sites. Moreover, COBRA-seq is applicable to non-model organisms without the reference genome and compatible with the investigation of non-CpG methylation by using restriction enzymes containing CpA, CpT, and CpC in their recognition site. PMID:26512698
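The COBRA principle described in the abstract above can be illustrated with a small in-silico sketch. It assumes the standard behaviour of bisulfite treatment (unmethylated C reads as T after PCR, methylated C is retained) and the TaqI recognition sequence TCGA; the amplicon and the function names are hypothetical and only serve the illustration, they are not part of COBRA-seq.

```python
def bisulfite_convert(seq, methylated_positions):
    """Top-strand sequence after bisulfite treatment and PCR.

    Unmethylated C is read as T; C at a methylated position stays C.
    `methylated_positions` holds 0-based indices of methylated cytosines.
    """
    converted = []
    for i, base in enumerate(seq.upper()):
        if base == "C" and i not in methylated_positions:
            converted.append("T")
        else:
            converted.append(base)
    return "".join(converted)


def taqi_sites(seq):
    """0-based start positions of TaqI recognition sites (TCGA)."""
    return [i for i in range(len(seq) - 3) if seq[i:i + 4] == "TCGA"]


# Hypothetical toy amplicon with a single CpG whose C sits at index 5.
amplicon = "AAGTCCGATTGG"

methylated = bisulfite_convert(amplicon, {5})       # CpG cytosine protected
unmethylated = bisulfite_convert(amplicon, set())   # every C converted

print(methylated, taqi_sites(methylated))
print(unmethylated, taqi_sites(unmethylated))
```

In this toy case the restriction site survives only when the CpG cytosine is methylated, which is why the fraction of digested amplicon can serve as a readout of methylation at that site.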
Burall, Laurel S; Grim, Christopher J; Datta, Atin R
2017-01-01
Four listeriosis incidences/outbreaks, spanning 19 months, have been linked to Listeria monocytogenes serotype 4b variant (4bV) strains. Three of these incidents can be linked to a defined geographical region, while the fourth is likely to be linked. In this study, whole genome sequencing (WGS) of strains from these incidents was used for genomic comparisons using two approaches. The first was JSpecies tetramer, which analyzed tetranucleotide frequency to assess relatedness. The second, the CFSAN SNP Pipeline, was used to perform WGS SNP analyses against three different reference genomes to evaluate relatedness by SNP distances. In each case, unrelated strains were included as controls. The analyses showed that strains from these incidents form a highly related clade with SNP differences of ≤101 within the clade and >9000 against other strains. Multi-Virulence-Locus Sequence Typing, a third standardized approach for evaluating relatedness, was used to assess the genetic drift in six conserved, known virulence loci and showed a different clustering pattern, indicating possible differences in selection pressure experienced by these genes. These data suggest a high degree of relatedness among these 4bV strains linked to a defined geographic region and also highlight the possibility of alterations related to adaptation and virulence.
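The two relatedness measures named in the abstract above, tetranucleotide frequency and pairwise SNP distance, can be sketched in a few lines. This is a simplified illustration only: it does not reproduce the JSpecies tetramer statistics or the CFSAN SNP Pipeline, and the function names and the toy sequences are assumptions made for the example.

```python
from collections import Counter
from itertools import product

# All 256 possible tetranucleotides in a fixed order.
TETRAMERS = ["".join(p) for p in product("ACGT", repeat=4)]


def tetramer_frequencies(seq):
    """Relative frequency of each tetranucleotide (sliding window of 4)."""
    seq = seq.upper()
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    total = sum(counts[k] for k in TETRAMERS)  # ignores windows with N etc.
    return [counts[k] / total if total else 0.0 for k in TETRAMERS]


def snp_distance(aligned_a, aligned_b):
    """Number of differing positions between two aligned sequences.

    Positions where either sequence has a non-ACGT character are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != y and x in "ACGT" and y in "ACGT"
    )


# Hypothetical toy sequences, far shorter than a real genome.
strain_1 = "ACGTACGTTTGACCA"
strain_2 = "ACGAACGTTTGACCA"
print(snp_distance(strain_1, strain_2))
print(sum(tetramer_frequencies(strain_1)))
```

With real data, a threshold such as the ≤101 versus >9000 SNP gap reported above is what separates the related clade from the unrelated controls.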
Liu, Shi; Gao, Peng; Zhu, Qianglong; Luan, Feishi; Davis, Angela R.; Wang, Xiaolu
2016-01-01
Cleaved amplified polymorphic sequence (CAPS) markers are useful tools for detecting single nucleotide polymorphisms (SNPs). This study detected and converted SNP sites into CAPS markers based on high-throughput re-sequencing data in watermelon, for linkage map construction and quantitative trait locus (QTL) analysis. Two inbred lines, Cream of Saskatchewan (COS) and LSW-177, were re-sequenced and analyzed with a self-written Perl script for CAPS marker development. 88.7% and 78.5% of the assembled sequences of the two parental materials could map to the reference watermelon genome, respectively. Comparative analysis of the assembled genome data provided 225,693 and 19,268 SNPs and indels between the two materials. 532 pairs of CAPS markers were designed with 16 restriction enzymes, among which 271 pairs of primers gave distinct bands of the expected length and polymorphic bands, via PCR and enzyme digestion, with a polymorphic rate of 50.94%. Using the new CAPS markers, an initial CAPS-based genetic linkage map was constructed with the F2 population, spanning 1836.51 cM with 11 linkage groups and 301 markers. 12 QTLs were detected related to fruit flesh color, length, width, shape index, and brix content. These newly developed CAPS markers will be a valuable resource for breeding programs and genetic studies of watermelon. PMID:27162496
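A CAPS marker works because a SNP creates or destroys a restriction site, so the two alleles give different digestion patterns. The sketch below shows that check and reproduces the polymorphic rate quoted above (271 of 532 primer pairs). The allele sequences and the choice of EcoRI (GAATTC) are hypothetical; the abstract does not name the 16 enzymes used.

```python
def cut_sites(seq, recognition_site):
    """0-based start positions of a recognition site in a sequence."""
    n = len(recognition_site)
    return [i for i in range(len(seq) - n + 1) if seq[i:i + n] == recognition_site]


def is_caps_candidate(allele_a, allele_b, recognition_site):
    """True if the two alleles differ in where the enzyme would cut."""
    return cut_sites(allele_a, recognition_site) != cut_sites(allele_b, recognition_site)


# Hypothetical alleles differing by one SNP (A/G) inside an EcoRI site.
parent_1 = "TTGAATTCAA"
parent_2 = "TTGAGTTCAA"
print(is_caps_candidate(parent_1, parent_2, "GAATTC"))

# Polymorphic rate reported in the abstract: 271 of 532 primer pairs.
polymorphic_rate = 100 * 271 / 532
print(round(polymorphic_rate, 2))
```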
Lu, Jiangjie; Liu, Yuyang; Xu, Jing; Mei, Ziwei; Shi, Yujun; Liu, Pengli; He, Jianbo; Wang, Xiaotong; Meng, Yijun; Feng, Shangguo; Shen, Chenjia; Wang, Huizhong
2018-01-01
Plants of the Dendrobium genus are orchids with not only ornamental value but also high medicinal value. To understand the genetic basis of variations in active ingredients of the stem total polysaccharide contents (STPCs) among different Dendrobium species, it is of paramount importance to understand the mechanism of STPC formation and identify genes affecting its process at the whole genome level. Here, we report the first high-density single-nucleotide polymorphism (SNP) integrated genetic map with good genome coverage of Dendrobium. The specific-locus amplified fragment sequencing (SLAF-seq) technology led to identification of 7,013,400 SNPs from 1,503,626 high-quality SLAF markers from two parents (Dendrobium moniliforme ♀ × Dendrobium officinale ♂) and their interspecific F1 hybrid population. The final genetic map contained 8,573 SLAF markers, covering 19 linkage groups (LGs). This genetic map spanned a length of 2,737.49 cM, where the average distance between markers is 0.32 cM. In total, 5 quantitative trait loci (QTL) related to STPC were identified, 3 of which have candidate genes within the confidence intervals of these stable QTLs based on the D. officinale genome sequence. This study will build a foundation for the mapping of other medicinal-related traits and provide an important reference for the molecular breeding of these Chinese herbs. PMID:29636767
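The average marker spacing quoted above follows directly from the map length and the marker count. The short check below uses only the figures in the abstract; whether the authors divided by the number of markers or by the number of intervals (markers minus linkage groups) is not stated, so both are computed as an assumption-labelled comparison.

```python
map_length_cm = 2737.49   # total map length from the abstract
n_markers = 8573          # SLAF markers on the final map
n_linkage_groups = 19     # linkage groups

# Convention 1 (assumed): total length divided by number of markers.
per_marker = map_length_cm / n_markers

# Convention 2 (assumed): total length divided by number of intervals,
# where each linkage group with k markers has k - 1 intervals.
per_interval = map_length_cm / (n_markers - n_linkage_groups)

print(round(per_marker, 2), round(per_interval, 2))
```

Both conventions round to the 0.32 cM reported in the abstract.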
Joint genotype- and ancestry-based genome-wide association studies in admixed populations.
Szulc, Piotr; Bogdan, Malgorzata; Frommlet, Florian; Tang, Hua
2017-09-01
In genome-wide association studies (GWAS) genetic loci that influence complex traits are localized by inspecting associations between genotypes of genetic markers and the values of the trait of interest. On the other hand, admixture mapping, which is performed in case of populations consisting of a recent mix of two ancestral groups, relies on the ancestry information at each locus (locus-specific ancestry). Recently it has been proposed to jointly model genotype and locus-specific ancestry within the framework of single marker tests. Here, we extend this approach for population-based GWAS in the direction of multimarker models. A modified version of the Bayesian information criterion is developed for building a multilocus model that accounts for the differential correlation structure due to linkage disequilibrium (LD) and admixture LD. Simulation studies and a real data example illustrate the advantages of this new approach compared to single-marker analysis or modern model selection strategies based on separately analyzing genotype and ancestry data, as well as to single-marker analysis combining genotypic and ancestry information. Depending on the signal strength, our procedure automatically chooses whether genotypic or locus-specific ancestry markers are added to the model. This results in a good compromise between the power to detect causal mutations and the precision of their localization. The proposed method has been implemented in R and is available at http://www.math.uni.wroc.pl/~mbogdan/admixtures/. © 2017 WILEY PERIODICALS, INC.
Eizirik, Eduardo; David, Victor A.; Buckley-Beason, Valerie; Roelke, Melody E.; Schäffer, Alejandro A.; Hannah, Steven S.; Narfström, Kristina; O'Brien, Stephen J.; Menotti-Raymond, Marilyn
2010-01-01
Mammalian coat patterns (e.g., spots, stripes) are hypothesized to play important roles in camouflage and other relevant processes, yet the genetic and developmental bases for these phenotypes are completely unknown. The domestic cat, with its diversity of coat patterns, is an excellent model organism to investigate these phenomena. We have established three independent pedigrees to map the four recognized pattern variants classically considered to be specified by a single locus, Tabby; in order of dominance, these are the unpatterned agouti form called “Abyssinian” or “ticked” (Ta), followed by Spotted (Ts), Mackerel (TM), and Blotched (tb). We demonstrate that at least three different loci control the coat markings of the domestic cat. One locus, responsible for the Abyssinian form (herein termed the Ticked locus), maps to an ∼3.8-Mb region on cat chromosome B1. A second locus controls the Tabby alleles TM and tb, and maps to an ∼5-Mb genomic region on cat chromosome A1. One or more additional loci act as modifiers and create a spotted coat by altering mackerel stripes. On the basis of our results and associated observations, we hypothesize that mammalian patterned coats are formed by two distinct processes: a spatially oriented developmental mechanism that lays down a species-specific pattern of skin cell differentiation and a pigmentation-oriented mechanism that uses information from the preestablished pattern to regulate the synthesis of melanin profiles. PMID:19858284
FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm.
Tuo, Shouheng; Zhang, Junying; Yuan, Xiguo; Zhang, Yuanyuan; Liu, Zhaowen
2016-01-01
The two-locus model is a typical, significant disease model to be identified in genome-wide association studies (GWAS). Owing to the intensive computational burden and the diversity of disease models, existing methods suffer from low detection power, high computation cost, and a preference for some types of disease models. In this study, two scoring functions (Bayesian network based K2-score and Gini-score) are used for characterizing two SNP loci as a candidate model; the two criteria are adopted simultaneously to improve identification power and to tackle the preference for particular disease models. The harmony search algorithm (HSA) is improved for quickly finding the most likely candidate models among all two-locus models, in which a local search algorithm with a two-dimensional tabu table is presented to avoid repeatedly evaluating some disease models that have a strong marginal effect. Finally, the G-test statistic is used to further test the candidate models. We investigate our method, named FHSA-SED, on 82 simulated datasets and a real AMD dataset, and compare it with two typical methods (MACOED and CSE) which have been developed recently based on swarm intelligent search algorithms. The results of simulation experiments indicate that our method outperforms the two compared algorithms in terms of detection power, computation time, evaluation times, sensitivity (TPR), specificity (SPC), positive predictive value (PPV) and accuracy (ACC). Our method has identified two SNPs (rs3775652 and rs10511467) that may also be associated with disease in the AMD dataset.
FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm
Tuo, Shouheng; Zhang, Junying; Yuan, Xiguo; Zhang, Yuanyuan; Liu, Zhaowen
2016-01-01
Motivation: The two-locus model is a typical, significant disease model to be identified in genome-wide association studies (GWAS). Owing to the intensive computational burden and the diversity of disease models, existing methods suffer from low detection power, high computation cost, and a preference for some types of disease models. Method: In this study, two scoring functions (Bayesian network based K2-score and Gini-score) are used for characterizing two SNP loci as a candidate model; the two criteria are adopted simultaneously to improve identification power and to tackle the preference for particular disease models. The harmony search algorithm (HSA) is improved for quickly finding the most likely candidate models among all two-locus models, in which a local search algorithm with a two-dimensional tabu table is presented to avoid repeatedly evaluating some disease models that have a strong marginal effect. Finally, the G-test statistic is used to further test the candidate models. Results: We investigate our method, named FHSA-SED, on 82 simulated datasets and a real AMD dataset, and compare it with two typical methods (MACOED and CSE) which have been developed recently based on swarm intelligent search algorithms. The results of simulation experiments indicate that our method outperforms the two compared algorithms in terms of detection power, computation time, evaluation times, sensitivity (TPR), specificity (SPC), positive predictive value (PPV) and accuracy (ACC). Our method has identified two SNPs (rs3775652 and rs10511467) that may also be associated with disease in the AMD dataset. PMID:27014873
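The final filtering step mentioned above, a G-test on a candidate two-locus model, can be sketched as follows. This is the textbook G statistic, G = 2 Σ O ln(O/E), applied to a table with one row per two-locus genotype combination (up to 9 rows) and two columns (cases, controls); it is not taken from the FHSA-SED code, and the function name and counts are hypothetical.

```python
import math


def g_statistic(table):
    """G statistic for a contingency table given as rows of [cases, controls].

    Each row is one two-locus genotype combination. Cells with an observed
    count of zero contribute nothing to the sum.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    g = 0.0
    for r, row in enumerate(table):
        for c, observed in enumerate(row):
            if observed == 0:
                continue
            expected = row_totals[r] * col_totals[c] / n
            g += observed * math.log(observed / expected)
    return 2.0 * g


# Hypothetical counts: no association (same case/control split in every row).
null_table = [[10, 10] for _ in range(9)]

# Hypothetical counts: two genotype combinations enriched in opposite groups.
signal_table = [[30, 10], [10, 30]] + [[10, 10] for _ in range(7)]

print(g_statistic(null_table))
print(g_statistic(signal_table))
```

Under the null hypothesis the statistic is compared with a chi-squared distribution with (rows − 1) × (columns − 1) degrees of freedom, which is 8 for a full 9 × 2 table.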
Ma, Li; Runesha, H Birali; Dvorkin, Daniel; Garbe, John R; Da, Yang
2008-01-01
Background: Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers provide opportunities to detect epistatic SNPs associated with quantitative traits and to detect the exact mode of an epistasis effect. Computational difficulty is the main bottleneck for epistasis testing in large scale GWAS. Results: The EPISNPmpi and EPISNP computer programs were developed for testing single-locus and epistatic SNP effects on quantitative traits in GWAS, including tests of three single-locus effects for each SNP (SNP genotypic effect, additive and dominance effects) and five epistasis effects for each pair of SNPs (two-locus interaction, additive × additive, additive × dominance, dominance × additive, and dominance × dominance) based on the extended Kempthorne model. EPISNPmpi is the parallel computing program for epistasis testing in large scale GWAS and achieved excellent scalability for large scale analysis and portability for various parallel computing platforms. EPISNP is the serial computing program based on the EPISNPmpi code for epistasis testing in small scale GWAS using commonly available operating systems and computer hardware. Three serial computing utility programs were developed for graphical viewing of test results and epistasis networks, and for estimating CPU time and disk space requirements. Conclusion: The EPISNPmpi parallel computing program provides an effective computing tool for epistasis testing in large scale GWAS, and the epiSNP serial computing programs are convenient tools for epistasis analysis in small scale GWAS using commonly available computer hardware. PMID:18644146
Seeger, Kerstin; Flinspach, Katrin; Haug‐Schifferdecker, Elisa; Kulik, Andreas; Gust, Bertolt; Fiedler, Hans‐Peter; Heide, Lutz
2011-01-01
Summary: Streptomyces cinnamonensis DSM 1042 produces two types of isoprenoid secondary metabolites: the prenylated naphthalene derivative furanonaphthoquinone I (FNQ I), and isoprenylated phenazines which are termed endophenazines. Previously, a 55 kb gene cluster was identified which contained genes for both FNQ I and endophenazine biosynthesis. However, several genes required for the biosynthesis of these metabolites were not present in this cluster. We have now re‐screened the cosmid library for genes of the mevalonate pathway and identified a separate genomic locus which contains the previously missing genes. This locus (15 kb) comprised orthologues of four phenazine biosynthesis genes known from Pseudomonas strains. Furthermore, the locus contained a putative operon of six genes of the mevalonate pathway, as well as the gene epzP, which showed sequence similarity to a recently discovered class of prenyltransferases. Inactivation and complementation experiments proved the involvement of epzP in the prenylation reaction in endophenazine biosynthesis. This newly identified genomic locus is more than 40 kb distant from the previously identified cluster. The protein EpzP was expressed in Escherichia coli in the form of a His‐tag fusion protein and purified. The enzyme catalysed the prenylation of 5,10‐dihydrophenazine‐1‐carboxylic acid (dihydro‐PCA) using dimethylallyl diphosphate (DMAPP) as isoprenoid substrate. Km values were determined as 108 µM for dihydro‐PCA and 25 µM for DMAPP. PMID:21342470
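To give the reported Km values some intuition, the sketch below evaluates the Michaelis-Menten saturation term [S] / (Km + [S]) for each substrate. It assumes simple hyperbolic kinetics with the other substrate saturating, which is a simplification for a two-substrate enzyme; the substrate concentrations are hypothetical and no Vmax is given in the abstract, so only the fraction of maximal rate is shown.

```python
def fraction_of_vmax(substrate_um, km_um):
    """Michaelis-Menten saturation v / Vmax = [S] / (Km + [S]).

    Concentrations are in micromolar; assumes the co-substrate is saturating.
    """
    if substrate_um < 0 or km_um <= 0:
        raise ValueError("concentration must be >= 0 and Km must be > 0")
    return substrate_um / (km_um + substrate_um)


KM_DIHYDRO_PCA_UM = 108.0  # Km for dihydro-PCA reported in the abstract
KM_DMAPP_UM = 25.0         # Km for DMAPP reported in the abstract

# Hypothetical substrate concentrations, chosen only for illustration.
for conc in (25.0, 108.0, 500.0):
    print(
        conc,
        round(fraction_of_vmax(conc, KM_DIHYDRO_PCA_UM), 3),
        round(fraction_of_vmax(conc, KM_DMAPP_UM), 3),
    )
```

By definition the rate is half-maximal when the substrate concentration equals Km, so the lower Km for DMAPP means the enzyme approaches saturation at a lower DMAPP concentration.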
Eizirik, Eduardo; David, Victor A; Buckley-Beason, Valerie; Roelke, Melody E; Schäffer, Alejandro A; Hannah, Steven S; Narfström, Kristina; O'Brien, Stephen J; Menotti-Raymond, Marilyn
2010-01-01
Mammalian coat patterns (e.g., spots, stripes) are hypothesized to play important roles in camouflage and other relevant processes, yet the genetic and developmental bases for these phenotypes are completely unknown. The domestic cat, with its diversity of coat patterns, is an excellent model organism to investigate these phenomena. We have established three independent pedigrees to map the four recognized pattern variants classically considered to be specified by a single locus, Tabby; in order of dominance, these are the unpatterned agouti form called "Abyssinian" or "ticked" (T(a)), followed by Spotted (T(s)), Mackerel (T(M)), and Blotched (t(b)). We demonstrate that at least three different loci control the coat markings of the domestic cat. One locus, responsible for the Abyssinian form (herein termed the Ticked locus), maps to an approximately 3.8-Mb region on cat chromosome B1. A second locus controls the Tabby alleles T(M) and t(b), and maps to an approximately 5-Mb genomic region on cat chromosome A1. One or more additional loci act as modifiers and create a spotted coat by altering mackerel stripes. On the basis of our results and associated observations, we hypothesize that mammalian patterned coats are formed by two distinct processes: a spatially oriented developmental mechanism that lays down a species-specific pattern of skin cell differentiation and a pigmentation-oriented mechanism that uses information from the preestablished pattern to regulate the synthesis of melanin profiles.
Tabassum, Rubina; Chauhan, Ganesh; Dwivedi, Om Prakash; Mahajan, Anubha; Jaiswal, Alok; Kaur, Ismeet; Bandesh, Khushdeep; Singh, Tejbir; Mathai, Benan John; Pandey, Yogesh; Chidambaram, Manickam; Sharma, Amitabh; Chavali, Sreenivas; Sengupta, Shantanu; Ramakrishnan, Lakshmi; Venkatesh, Pradeep; Aggarwal, Sanjay K; Ghosh, Saurabh; Prabhakaran, Dorairaj; Srinath, Reddy K; Saxena, Madhukar; Banerjee, Monisha; Mathur, Sandeep; Bhansali, Anil; Shah, Viral N; Madhu, Sri Venkata; Marwaha, Raman K; Basu, Analabha; Scaria, Vinod; McCarthy, Mark I; Venkatesan, Radha; Mohan, Viswanathan; Tandon, Nikhil; Bharadwaj, Dwaipayan
2013-03-01
Indians undergoing socioeconomic and lifestyle transitions will be maximally affected by the epidemic of type 2 diabetes (T2D). We conducted a two-stage genome-wide association study of T2D in 12,535 Indians, a less explored but high-risk group. We identified a new type 2 diabetes-associated locus at 2q21, with the lead signal being rs6723108 (odds ratio 1.31; P = 3.32 × 10⁻⁹). Imputation analysis refined the signal to rs998451 (odds ratio 1.56; P = 6.3 × 10⁻¹²) within TMEM163, which encodes a probable vesicular transporter in nerve terminals. TMEM163 variants also showed association with decreased fasting plasma insulin and homeostatic model assessment of insulin resistance, indicating a plausible effect through impaired insulin secretion. The 2q21 region also harbors RAB3GAP1 and ACMSD, which are involved in neurologic disorders. Forty-nine of 56 previously reported signals showed consistency in direction with similar effect sizes in Indians and previous studies, and 25 of them were also associated (P < 0.05). Known loci and the newly identified 2q21 locus together explained 7.65% of the variance in the risk of T2D in Indians. Our study suggests that common susceptibility variants for T2D are largely the same across populations, but also reveals a population-specific locus and provides further insights into the genetic architecture and etiology of T2D.
Heritability and GWAS Analyses of Acne in Australian Adolescent Twins.
Mina-Vargas, Angela; Colodro-Conde, Lucía; Grasby, Katrina; Zhu, Gu; Gordon, Scott; Medland, Sarah E; Martin, Nicholas G
2017-12-01
Acne vulgaris is a skin disease with a multifactorial and complex pathology. While several twin studies have estimated that acne has a heritability of up to 80%, the genomic elements responsible for the origin and pathology of acne are still undiscovered. Here we performed a twin-based structural equation model, using available data on acne severity for an Australian sample of 4,491 twins and their siblings aged from 10 to 24. This study extends by a factor of 3 an earlier analysis of the genetic factors of acne. Acne severity was rated by nurses on a 4-point scale (1 = absent to 4 = severe) on up to three body sites (face, back, chest) and on up to three occasions (age 12, 14, and 16). The phenotype that we analyzed was the most severe rating at any site or age. The polychoric correlation for monozygotic twins was higher (r MZ = 0.86, 95% CI [0.81, 0.90]) than for dizygotic twins (r DZ = 0.42, 95% CI [0.35, 0.47]). A model that includes additive genetic effects and unique environmental effects was the most parsimonious model to explain the genetic variance of acne severity, and the estimated heritability was 0.85 (95% CI [0.82, 0.87]). We then conducted a genome-wide analysis including an additional 271 siblings - for a total of 4,762 individuals. A genome-wide association study (GWAS) scan did not detect loci associated with the severity of acne at the threshold of 5E-08 but suggestive association was found for three SNPs: rs10515088 locus 5q13.1 (p = 3.9E-07), rs12738078 locus 1p35.5 (p = 6.7E-07), and rs117943429 locus 18q21.2 (p = 9.1E-07). The 5q13.1 locus is close to PIK3R1, a gene that has a potential regulatory effect on sebocyte differentiation.
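The twin correlations reported above can be turned into a quick back-of-the-envelope heritability estimate with Falconer's formulas. This is not the structural equation model fitted in the study; it is the classical approximation, shown here only to indicate why an additive genetic plus unique environment (AE) model is plausible for these data.

```python
def falconer_estimates(r_mz, r_dz):
    """Classical twin-study approximations from MZ and DZ correlations.

    h2: additive genetic variance, 2 * (rMZ - rDZ)
    c2: shared environment,        2 * rDZ - rMZ
    e2: unique environment,        1 - rMZ
    """
    h2 = 2.0 * (r_mz - r_dz)
    c2 = 2.0 * r_dz - r_mz
    e2 = 1.0 - r_mz
    return h2, c2, e2


# Polychoric correlations reported in the abstract.
h2, c2, e2 = falconer_estimates(r_mz=0.86, r_dz=0.42)
print(round(h2, 2), round(c2, 2), round(e2, 2))
```

The rough estimate of about 0.88 is close to the model-based heritability of 0.85 reported in the abstract, and the shared-environment term comes out near zero, consistent with the AE model being the most parsimonious.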
Sukumaran, Sivakumar; Lopes, Marta; Dreisigacker, Susanne; Reynolds, Matthew
2018-04-01
GWAS on multi-environment data identified genomic regions associated with trade-offs for grain weight and grain number. Grain yield (GY) can be dissected into its components thousand grain weight (TGW) and grain number (GN), but little has been achieved in assessing the trade-off between them in spring wheat. In the present study, the Wheat Association Mapping Initiative (WAMI) panel of 287 elite spring bread wheat lines was phenotyped for GY, GN, and TGW in ten environments across different wheat growing regions in Mexico, South Asia, and North Africa. Genotyping the panel with the 90K Illumina Infinium SNP array yielded 26,814 SNPs for the genome-wide association study (GWAS). Statistical analysis of the multi-environmental data for GY, GN, and TGW gave repeatability estimates of 0.76, 0.62, and 0.95, respectively. GWAS on BLUPs of combined environment analysis identified 38 loci associated with the traits. Among them, four loci (6A at 85 cM, 5A at 98 cM, 3B at 99 cM, and 2B at 96 cM) were associated with multiple traits. The study identified two loci that showed positive association between GY and TGW, with allelic substitution effects of 4% (GY) and 1.7% (TGW) for the 6A locus and 0.2% (GY) and 7.2% (TGW) for the 2B locus. The locus on chromosome 6A (79-85 cM) harbored the gene TaGW2-6A. We also found that a combination of markers associated with GY, TGW, and GN together explained more of the variation in GY (32%) than the markers associated with GY alone (27%). The marker-trait associations from the present study can be used for marker-assisted selection (MAS) and to discover the underlying genes for these traits in spring wheat.
Shen, Qi; Zhang, Dong; Sun, Wei; Zhang, Yu-Jun; Shang, Zhi-Wei; Chen, Shi-Lin
2017-05-01
Perilla frutescens is one of 60 plants used as both food and medicine in the initial directory announced by the health ministry of China. With the development of the Perilla sector in recent years, the breeding and application of good varieties has become the main bottleneck of its development. This study reports the use of systematic selection, combined with a marker-assisted method, to breed perilla varieties. Through whole-genome sequencing and consistency matching, mutation loci were annotated according to genome data and compared against a database of common Perilla variants; 30 non-synonymous SNPs were finally selected as characteristic markers of Zhongyan Feishu No.1. These SNP markers were used as the selection standard for Perilla varieties. The result was the new perilla variety Zhongyan Feishu No.1, which offers dual use of leaf and seed, high yield, and high resistance, and can also be used as green manure. Zhongyan Feishu No.1 obtained new plant variety identification from Beijing city, with identification number 2016054. Marker-assisted identification can guide the breeding of new plant varieties and provides a new reference for the breeding of medicinal plants. Copyright© by the Chinese Pharmaceutical Association.
Chen, H; Zhao, Z; Liu, L; Kong, W; Lin, Y; You, S; Bai, W; Xiao, Y; Zheng, H; Jiang, L; Li, J; Zhou, J; Tao, D; Wan, J
2017-09-01
Oryza longistaminata originates from African wild rice and contains valuable traits conferring tolerance to biotic and abiotic stress. However, interspecific crosses between O. longistaminata and Oryza sativa cultivars are hindered by reproductive barriers. To dissect the mechanism of interspecific hybrid sterility, we developed a near-isogenic line (NIL) using indica variety RD23 as the recipient parent and O. longistaminata as the donor parent. Both pollen and embryo sac semi-sterility were observed in F1 hybrids between RD23 and NIL. Cytological analysis demonstrated that pollen abortion in F1 hybrids occurred at the early bi-nucleate stage due to a failure of the first mitosis in microspores. Partial embryo sacs in the F1 hybrids were defective during the functional megaspore formation stage. Most notably, nearly half of the male or female gametes were aborted in heterozygotes S40 i S40 l , regardless of their genotypes. Thus, S40 was indicated as a one-locus sporophytic sterility gene controlling both male and female fertility in hybrids between RD23 and O. longistaminata. A population of 16,802 plants derived from the hybrid RD23/NIL-S40 was developed to fine-map S40. Finally, the S40 locus was delimited to an 80-kb region on the short arm of chromosome 1 with reference to the sequence of cv. 93-11. Eight open reading frames (ORFs) were localized in this region. On the basis of gene expression and genomic sequence analysis, ORF5 and ORF8 were identified as candidate genes for the S40 locus. These results are helpful in cloning the S40 gene and marker-assisted transferring of the corresponding neutral allele in rice breeding programs.
Molecular mapping of chromosomes 17 and X. Progress report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barker, D.F.
1989-12-31
The basic aims of this project are the construction of high density genetic maps of chromosomes 17 and X and the utilization of these maps for the subsequent isolation of a set of physically overlapping DNA segment clones. The strategy depends on the utilization of chromosome specific libraries of small (1--15 kb) segments from each of the two chromosomes. Since the time of submission of our previous progress report, we have refined the genetic map of markers which we had previously isolated for chromosome 17. We have completed our genetic mapping in CEPH reference and NF1 families of 15 markers in the pericentric region of chromosome 17. Physical mapping results with three probes, were shown to be in very close genetic proximity to the NF1 gene, with respect to two translocation breakpoints which disrupt the activity of the gene. All three of the probes were found to lie between the centromere and the most proximal translocation breakpoint, providing important genetic markers proximal to the NF1 gene. Our primary focus has shifted to the X chromosome. We have isolated an additional 30 polymorphic markers, bringing the total number we have isolated to over 80. We have invested substantial effort in characterizing the polymorphisms at each of these loci and constructed plasmid subclones which reveal the polymorphisms for nearly all of the loci. These subclones are of practical value in that they produce simpler and stronger patterns on human genomic Southern blots, thus improving the efficiency of the genetic mapping experiments. These subclones may also be of value for deriving DNA sequence information at each locus, necessary for establishing polymerase chain reaction primers specific for each locus. Such information would allow the use of each locus as a sequence tagged site.
Molecular mapping of chromosomes 17 and X
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barker, D.F.
1989-01-01
The basic aims of this project are the construction of high density genetic maps of chromosomes 17 and X and the utilization of these maps for the subsequent isolation of a set of physically overlapping DNA segment clones. The strategy depends on the utilization of chromosome specific libraries of small (1--15 kb) segments from each of the two chromosomes. Since the time of submission of our previous progress report, we have refined the genetic map of markers which we had previously isolated for chromosome 17. We have completed our genetic mapping in CEPH reference and NF1 families of 15 markers in the pericentric region of chromosome 17. Physical mapping results with three probes, were shown to be in very close genetic proximity to the NF1 gene, with respect to two translocation breakpoints which disrupt the activity of the gene. All three of the probes were found to lie between the centromere and the most proximal translocation breakpoint, providing important genetic markers proximal to the NF1 gene. Our primary focus has shifted to the X chromosome. We have isolated an additional 30 polymorphic markers, bringing the total number we have isolated to over 80. We have invested substantial effort in characterizing the polymorphisms at each of these loci and constructed plasmid subclones which reveal the polymorphisms for nearly all of the loci. These subclones are of practical value in that they produce simpler and stronger patterns on human genomic Southern blots, thus improving the efficiency of the genetic mapping experiments. These subclones may also be of value for deriving DNA sequence information at each locus, necessary for establishing polymerase chain reaction primers specific for each locus. Such information would allow the use of each locus as a sequence tagged site.
Pourcel, Christine; Minandri, Fabrizia; Hauck, Yolande; D'Arezzo, Silvia; Imperi, Francesco; Vergnaud, Gilles; Visca, Paolo
2011-01-01
Acinetobacter baumannii is an important opportunistic pathogen responsible for nosocomial outbreaks, mostly occurring in intensive care units. Due to the multiplicity of infection sources, reliable molecular fingerprinting techniques are needed to establish epidemiological correlations among A. baumannii isolates. Multiple-locus variable-number tandem-repeat analysis (MLVA) has proven to be a fast, reliable, and cost-effective typing method for several bacterial species. In this study, an MLVA assay compatible with simple PCR- and agarose gel-based electrophoresis steps as well as with high-throughput automated methods was developed for A. baumannii typing. Preliminarily, 10 potential polymorphic variable-number tandem repeats (VNTRs) were identified upon bioinformatic screening of six annotated genome sequences of A. baumannii. A collection of 7 reference strains plus 18 well-characterized isolates, including unique types and representatives of the three international A. baumannii lineages, was then evaluated in a two-center study aimed at validating the MLVA assay and comparing it with other genotyping assays, namely, macrorestriction analysis with pulsed-field gel electrophoresis (PFGE) and PCR-based sequence group (SG) profiling. The results showed that MLVA can discriminate between isolates with identical PFGE types and SG profiles. A panel of eight VNTR markers was selected, all showing the ability to be amplified and good amounts of polymorphism in the majority of strains. Independently generated MLVA profiles, composed of an ordered string of allele numbers corresponding to the number of repeats at each VNTR locus, were concordant between centers. Typeability, reproducibility, stability, discriminatory power, and epidemiological concordance were excellent. A database containing information and MLVA profiles for several A. baumannii strains is available from http://mlva.u-psud.fr/. PMID:21147956
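The profile format described in the abstract above (an ordered string of allele numbers, one repeat count per VNTR locus) can be sketched as follows. This is only an illustration: the locus names, flanking lengths, repeat unit lengths and amplicon sizes are hypothetical and are not taken from the study or from the MLVA database.

```python
# Hypothetical VNTR panel: locus -> (flanking length in bp, repeat unit length in bp).
# None of these values come from the A. baumannii assay; they are placeholders.
PANEL = {
    "VNTR_A": (100, 9),
    "VNTR_B": (150, 30),
    "VNTR_C": (80, 6),
}


def mlva_profile(amplicon_sizes):
    """Return an ordered tuple of repeat counts, one per locus in PANEL order."""
    profile = []
    for locus, (flank, unit) in PANEL.items():
        size = amplicon_sizes[locus]
        # Repeat count = (amplicon size minus flanking sequence) / repeat unit length.
        profile.append(round((size - flank) / unit))
    return tuple(profile)


def loci_differing(profile_a, profile_b):
    """Count the loci at which two profiles carry different allele numbers."""
    return sum(a != b for a, b in zip(profile_a, profile_b))


isolate_1 = mlva_profile({"VNTR_A": 163, "VNTR_B": 270, "VNTR_C": 128})
isolate_2 = mlva_profile({"VNTR_A": 163, "VNTR_B": 300, "VNTR_C": 128})
print(isolate_1, isolate_2, loci_differing(isolate_1, isolate_2))
```

Because a profile is just an ordered tuple, profiles generated independently at two centers can be compared for concordance by simple equality.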
USDA-ARS's Scientific Manuscript database
Rye is a diploid crop species with many outstanding qualities, and is also important as a source of new traits for wheat and triticale improvement. Here we describe a BAC library of rye cv. Blanco, representing a valuable resource for rye molecular genetic studies. The library provides a 6 × genome ...
Methods and materials for the production of L-lactic acid in yeast
Hause, Ben [Jordan, MN; Rajgarhia, Vineet [Minnetonka, MN; Suominen, Pirkko [Maple Grove, MN
2009-05-19
Recombinant yeast are provided having, in one aspect, multiple exogenous LDH genes integrated into the genome, while leaving native PDC genes intact. In a second aspect, recombinant yeast are provided having an exogenous LDH gene integrated into its genome at the locus of a native PDC gene, with deletion of the native PDC gene. The recombinant yeast are useful in fermentation process for producing lactic acid.
Comparative genomics of the mimicry switch in Papilio dardanus.
Timmermans, Martijn J T N; Baxter, Simon W; Clark, Rebecca; Heckel, David G; Vogel, Heiko; Collins, Steve; Papanicolaou, Alexie; Fukova, Iva; Joron, Mathieu; Thompson, Martin J; Jiggins, Chris D; ffrench-Constant, Richard H; Vogler, Alfried P
2014-07-22
The African Mocker Swallowtail, Papilio dardanus, is a textbook example in evolutionary genetics. Classical breeding experiments have shown that wing pattern variation in this polymorphic Batesian mimic is determined by the polyallelic H locus that controls a set of distinct mimetic phenotypes. Using bacterial artificial chromosome (BAC) sequencing, recombination analyses and comparative genomics, we show that H co-segregates with an interval of less than 500 kb that is collinear with two other Lepidoptera genomes and contains 24 genes, including the transcription factor genes engrailed (en) and invected (inv). H is located in a region of conserved gene order, which argues against any role for genomic translocations in the evolution of a hypothesized multi-gene mimicry locus. Natural populations of P. dardanus show significant associations of specific morphs with single nucleotide polymorphisms (SNPs), centred on en. In addition, SNP variation in the H region reveals evidence of non-neutral molecular evolution in the en gene alone. We find evidence for a duplication potentially driving physical constraints on recombination in the lamborni morph. Absence of perfect linkage disequilibrium between different genes in the other morphs suggests that H is limited to nucleotide positions in the regulatory and coding regions of en. Our results therefore support the hypothesis that a single gene underlies wing pattern variation in P. dardanus.
Multidrug-resistant enterococci lack CRISPR-cas.
Palmer, Kelli L; Gilmore, Michael S
2010-10-12
Clustered, regularly interspaced short palindromic repeats (CRISPR) provide bacteria and archaea with sequence-specific, acquired defense against plasmids and phage. Because mobile elements constitute up to 25% of the genome of multidrug-resistant (MDR) enterococci, it was of interest to examine the codistribution of CRISPR and acquired antibiotic resistance in enterococcal lineages. A database was built from 16 Enterococcus faecalis draft genome sequences to identify commonalities and polymorphisms in the location and content of CRISPR loci. With this data set, we were able to detect identities between CRISPR spacers and sequences from mobile elements, including pheromone-responsive plasmids and phage, suggesting that CRISPR regulates the flux of these elements through the E. faecalis species. Based on conserved locations of CRISPR and CRISPR-cas loci and the discovery of a new CRISPR locus with associated functional genes, CRISPR3-cas, we screened additional E. faecalis strains for CRISPR content, including isolates predating the use of antibiotics. We found a highly significant inverse correlation between the presence of a CRISPR-cas locus and acquired antibiotic resistance in E. faecalis, and examination of an additional eight E. faecium genomes yielded similar results for that species. A mechanism for CRISPR-cas loss in E. faecalis was identified. The inverse relationship between CRISPR-cas and antibiotic resistance suggests that antibiotic use inadvertently selects for enterococcal strains with compromised genome defense.
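An inverse association of the kind reported above, between presence of a CRISPR-cas locus and acquired antibiotic resistance, is commonly assessed on a 2x2 contingency table. The abstract does not say which test the authors used, so the sketch below assumes a two-sided Fisher's exact test, and the strain counts are invented purely for illustration.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2

    def prob(x):
        # Probability of seeing x in the top-left cell given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Small relative tolerance guards against floating-point ties.
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))


# Hypothetical counts (NOT from the study):
# rows = CRISPR-cas present / absent, columns = acquired resistance yes / no.
p_value = fisher_exact_two_sided(1, 9, 9, 1)
print(f"two-sided p = {p_value:.4f}")
```

With these made-up counts the test gives a p-value of roughly 0.0011; real strain counts would of course be needed to reproduce the published result.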
Computer vision and machine learning for robust phenotyping in genome-wide studies
Zhang, Jiaoping; Naik, Hsiang Sing; Assefa, Teshale; Sarkar, Soumik; Reddy, R. V. Chowda; Singh, Arti; Ganapathysubramanian, Baskar; Singh, Asheesh K.
2017-01-01
Traditional evaluation of crop biotic and abiotic stresses is time-consuming and labor-intensive, limiting the ability to dissect the genetic basis of quantitative traits. A machine learning (ML)-enabled image-phenotyping pipeline for the genetic studies of abiotic stress iron deficiency chlorosis (IDC) of soybean is reported. IDC classification and severity for an association panel of 461 diverse plant-introduction accessions was evaluated using an end-to-end phenotyping workflow. The workflow consisted of a multi-stage procedure including: (1) optimized protocols for consistent image capture across plant canopies, (2) canopy identification and registration from cluttered backgrounds, (3) extraction of domain expert informed features from the processed images to accurately represent IDC expression, and (4) supervised ML-based classifiers that linked the automatically extracted features with expert-rating equivalent IDC scores. ML-generated phenotypic data were subsequently utilized for the genome-wide association study and genomic prediction. The results illustrate the reliability and advantage of the ML-enabled image-phenotyping pipeline by identifying a previously reported locus and a novel locus harboring a gene homolog involved in iron acquisition. This study demonstrates a promising path for integrating the phenotyping pipeline into genomic prediction, and provides a systematic framework enabling robust and quicker phenotyping through ground-based systems. PMID:28272456
Large scale genomic reorganization of topological domains at the HoxD locus.
Fabre, Pierre J; Leleu, Marion; Mormann, Benjamin H; Lopez-Delisle, Lucille; Noordermeer, Daan; Beccari, Leonardo; Duboule, Denis
2017-08-07
The transcriptional activation of HoxD genes during mammalian limb development involves dynamic interactions with two topologically associating domains (TADs) flanking the HoxD cluster. In particular, the activation of the most posterior HoxD genes in developing digits is controlled by regulatory elements located in the centromeric TAD (C-DOM) through long-range contacts. To assess the structure-function relationships underlying such interactions, we measured compaction levels and TAD discreteness using a combination of chromosome conformation capture (4C-seq) and DNA FISH. We assessed the robustness of the TAD architecture by using a series of genomic deletions and inversions that impact the integrity of this chromatin domain and that remodel long-range contacts. We report multi-partite associations between HoxD genes and up to three enhancers. We find that the loss of native chromatin topology leads to the remodeling of TAD structure following distinct parameters. Our results reveal that the recomposition of TAD architectures after large genomic re-arrangements is dependent on a boundary-selection mechanism in which CTCF mediates the gating of long-range contacts in combination with genomic distance and sequence specificity. Accordingly, the building of a recomposed TAD at this locus depends on distinct functional and constitutive parameters.
Zhang, Wenchao; Dai, Xinbin; Wang, Qishan; Xu, Shizhong; Zhao, Patrick X
2016-05-01
The term epistasis refers to interactions between multiple genetic loci. Genetic epistasis is important in regulating biological function and is considered to explain part of the 'missing heritability,' which involves marginal genetic effects that cannot be accounted for in genome-wide association studies. Thus, the study of epistasis is of great interest to geneticists. However, estimating epistatic effects for quantitative traits is challenging due to the large number of interaction effects that must be estimated, thus significantly increasing computing demands. Here, we present a new web server-based tool, the Pipeline for estimating EPIStatic genetic effects (PEPIS), for analyzing polygenic epistatic effects. The PEPIS software package is based on a new linear mixed model that has been used to predict the performance of hybrid rice. The PEPIS includes two main sub-pipelines: the first for kinship matrix calculation, and the second for polygenic component analyses and genome scanning for main and epistatic effects. To accommodate the demand for high-performance computation, the PEPIS utilizes C/C++ for mathematical matrix computing. In addition, the modules for kinship matrix calculations and main and epistatic-effect genome scanning employ parallel computing technology that effectively utilizes multiple computer nodes across our networked cluster, thus significantly improving the computational speed. For example, when analyzing the same immortalized F2 rice population genotypic data examined in a previous study, the PEPIS returned identical results at each analysis step with the original prototype R code, but the computational time was reduced from more than one month to about five minutes. These advances will help overcome the bottleneck frequently encountered in genome-wide epistatic genetic effect analysis and enable accommodation of the high computational demand. The PEPIS is publicly available at http://bioinfo.noble.org/PolyGenic_QTL/.
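The first PEPIS sub-pipeline computes kinship matrices. The abstract does not give the formula, so the sketch below is not the PEPIS implementation: it assumes a common marker-based additive kinship, the cross-product of genotype codes scaled so that the mean diagonal equals 1, with genotypes coded -1, 0, 1. The toy genotype matrix is invented.

```python
def additive_kinship(genotypes):
    """Marker-based additive kinship (an assumed, generic definition).

    genotypes: list of individuals, each a list of marker codes (-1, 0, 1).
    Returns an n x n matrix: cross-products of marker codes, divided by
    the mean of the diagonal so that average self-kinship is 1.
    Assumes at least one individual has a non-zero marker code.
    """
    n = len(genotypes)
    raw = [
        [sum(x * y for x, y in zip(genotypes[i], genotypes[j])) for j in range(n)]
        for i in range(n)
    ]
    mean_diag = sum(raw[i][i] for i in range(n)) / n
    return [[raw[i][j] / mean_diag for j in range(n)] for i in range(n)]


# Toy data: 3 individuals x 4 markers (hypothetical).
G = [
    [1, 0, -1, 1],
    [1, 1, -1, 0],
    [-1, 0, 1, -1],
]
K = additive_kinship(G)
for row in K:
    print([round(v, 3) for v in row])
```

Each entry of the matrix depends only on one pair of individuals, which is one reason this step lends itself to the kind of parallel computation the abstract describes.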
Ruth, Katherine S; Campbell, Purdey J; Chew, Shelby; Lim, Ee Mun; Hadlow, Narelle; Stuckey, Bronwyn G A; Brown, Suzanne J; Feenstra, Bjarke; Joseph, John; Surdulescu, Gabriela L; Zheng, Hou Feng; Richards, J Brent; Murray, Anna; Spector, Tim D; Wilson, Scott G; Perry, John R B
2016-02-01
Genetic factors contribute strongly to sex hormone levels, yet knowledge of the regulatory mechanisms remains incomplete. Genome-wide association studies (GWAS) have identified only a small number of loci associated with sex hormone levels, with several reproductive hormones yet to be assessed. The aim of the study was to identify novel genetic variants contributing to the regulation of sex hormones. We performed GWAS using genotypes imputed from the 1000 Genomes reference panel. The study used genotype and phenotype data from a UK twin register. We included 2913 individuals (up to 294 males) from the Twins UK study, excluding individuals receiving hormone treatment. Phenotypes were standardised for age, sex, BMI, stage of menstrual cycle and menopausal status. We tested 7,879,351 autosomal SNPs for association with levels of dehydroepiandrosterone sulphate (DHEAS), oestradiol, free androgen index (FAI), follicle-stimulating hormone (FSH), luteinizing hormone (LH), prolactin, progesterone, sex hormone-binding globulin and testosterone. Eight independent genetic variants reached genome-wide significance (P<5 × 10^-8), with minor allele frequencies of 1.3-23.9%. Novel signals included variants for progesterone (P=7.68 × 10^-12), oestradiol (P=1.63 × 10^-8) and FAI (P=1.50 × 10^-8). A genetic variant near the FSHB gene was identified which influenced both FSH (P=1.74 × 10^-8) and LH (P=3.94 × 10^-9) levels. A separate locus on chromosome 7 was associated with both DHEAS (P=1.82 × 10^-14) and progesterone (P=6.09 × 10^-14). This study highlights loci that are relevant to reproductive function and suggests overlap in the genetic basis of hormone regulation.
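As a small worked example, the p-values quoted in the abstract above can be checked against the stated genome-wide threshold of 5 × 10^-8. For comparison only, the sketch also computes a plain Bonferroni threshold over the 7,879,351 SNPs tested; that stricter cut-off is an illustrative assumption and is not the criterion used by the study. The dictionary labels are shorthand introduced here.

```python
GENOME_WIDE_ALPHA = 5e-8      # threshold stated in the abstract
N_SNPS_TESTED = 7_879_351     # autosomal SNPs tested, from the abstract

# P-values as quoted in the abstract; the keys are informal labels.
reported = {
    "progesterone (novel signal)": 7.68e-12,
    "oestradiol": 1.63e-8,
    "FAI": 1.50e-8,
    "FSH (near FSHB)": 1.74e-8,
    "LH (near FSHB)": 3.94e-9,
    "DHEAS (chr 7)": 1.82e-14,
    "progesterone (chr 7)": 6.09e-14,
}

# Naive Bonferroni correction over every tested SNP (ignores linkage
# disequilibrium, so it is more conservative than the usual 5e-8).
bonferroni = 0.05 / N_SNPS_TESTED

passes_conventional = [k for k, p in reported.items() if p < GENOME_WIDE_ALPHA]
passes_bonferroni = [k for k, p in reported.items() if p < bonferroni]

print(f"Bonferroni threshold: {bonferroni:.3e}")
print(len(passes_conventional), "signals pass 5e-8")
print(len(passes_bonferroni), "signals pass the naive Bonferroni threshold")
```

All seven quoted p-values fall below 5 × 10^-8, while four of them would also survive the naive per-SNP Bonferroni threshold of about 6.3 × 10^-9.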
Horikoshi, Momoko; Mӓgi, Reedik; van de Bunt, Martijn; Surakka, Ida; Sarin, Antti-Pekka; Mahajan, Anubha; Marullo, Letizia; Thorleifsson, Gudmar; Hӓgg, Sara; Hottenga, Jouke-Jan; Ladenvall, Claes; Ried, Janina S; Winkler, Thomas W; Willems, Sara M; Pervjakova, Natalia; Esko, Tõnu; Beekman, Marian; Nelson, Christopher P; Willenborg, Christina; Wiltshire, Steven; Ferreira, Teresa; Fernandez, Juan; Gaulton, Kyle J; Steinthorsdottir, Valgerdur; Hamsten, Anders; Magnusson, Patrik K E; Willemsen, Gonneke; Milaneschi, Yuri; Robertson, Neil R; Groves, Christopher J; Bennett, Amanda J; Lehtimӓki, Terho; Viikari, Jorma S; Rung, Johan; Lyssenko, Valeriya; Perola, Markus; Heid, Iris M; Herder, Christian; Grallert, Harald; Müller-Nurasyid, Martina; Roden, Michael; Hypponen, Elina; Isaacs, Aaron; van Leeuwen, Elisabeth M; Karssen, Lennart C; Mihailov, Evelin; Houwing-Duistermaat, Jeanine J; de Craen, Anton J M; Deelen, Joris; Havulinna, Aki S; Blades, Matthew; Hengstenberg, Christian; Erdmann, Jeanette; Schunkert, Heribert; Kaprio, Jaakko; Tobin, Martin D; Samani, Nilesh J; Lind, Lars; Salomaa, Veikko; Lindgren, Cecilia M; Slagboom, P Eline; Metspalu, Andres; van Duijn, Cornelia M; Eriksson, Johan G; Peters, Annette; Gieger, Christian; Jula, Antti; Groop, Leif; Raitakari, Olli T; Power, Chris; Penninx, Brenda W J H; de Geus, Eco; Smit, Johannes H; Boomsma, Dorret I; Pedersen, Nancy L; Ingelsson, Erik; Thorsteinsdottir, Unnur; Stefansson, Kari; Ripatti, Samuli; Prokopenko, Inga; McCarthy, Mark I; Morris, Andrew P
2015-07-01
Reference panels from the 1000 Genomes (1000G) Project Consortium provide near complete coverage of common and low-frequency genetic variation with minor allele frequency ≥0.5% across European ancestry populations. Within the European Network for Genetic and Genomic Epidemiology (ENGAGE) Consortium, we have undertaken the first large-scale meta-analysis of genome-wide association studies (GWAS), supplemented by 1000G imputation, for four quantitative glycaemic and obesity-related traits, in up to 87,048 individuals of European ancestry. We identified two loci for body mass index (BMI) at genome-wide significance, and two for fasting glucose (FG), none of which has been previously reported in larger meta-analysis efforts to combine GWAS of European ancestry. Through conditional analysis, we also detected multiple distinct signals of association mapping to established loci for waist-hip ratio adjusted for BMI (RSPO3) and FG (GCK and G6PC2). The index variant for one association signal at the G6PC2 locus is a low-frequency coding allele, H177Y, which has recently been demonstrated to have a functional role in glucose regulation. Fine-mapping analyses revealed that the non-coding variants most likely to drive association signals at established and novel loci were enriched for overlap with enhancer elements, which for FG mapped to promoter and transcription factor binding sites in pancreatic islets, in particular. Our study demonstrates that 1000G imputation and genetic fine-mapping of common and low-frequency variant association signals at GWAS loci, integrated with genomic annotation in relevant tissues, can provide insight into the functional and regulatory mechanisms through which their effects on glycaemic and obesity-related traits are mediated.
Discovery and Fine-Mapping of Glycaemic and Obesity-Related Trait Loci Using High-Density Imputation
van de Bunt, Martijn; Surakka, Ida; Sarin, Antti-Pekka; Mahajan, Anubha; Marullo, Letizia; Thorleifsson, Gudmar; Hӓgg, Sara; Hottenga, Jouke-Jan; Ladenvall, Claes; Ried, Janina S.; Winkler, Thomas W.; Willems, Sara M.; Pervjakova, Natalia; Esko, Tõnu; Beekman, Marian; Nelson, Christopher P.; Willenborg, Christina; Ferreira, Teresa; Fernandez, Juan; Gaulton, Kyle J.; Steinthorsdottir, Valgerdur; Hamsten, Anders; Magnusson, Patrik K. E.; Willemsen, Gonneke; Milaneschi, Yuri; Robertson, Neil R.; Groves, Christopher J.; Bennett, Amanda J.; Lehtimӓki, Terho; Viikari, Jorma S.; Rung, Johan; Lyssenko, Valeriya; Perola, Markus; Heid, Iris M.; Herder, Christian; Grallert, Harald; Müller-Nurasyid, Martina; Roden, Michael; Hypponen, Elina; Isaacs, Aaron; van Leeuwen, Elisabeth M.; Karssen, Lennart C.; Mihailov, Evelin; Houwing-Duistermaat, Jeanine J.; de Craen, Anton J. M.; Deelen, Joris; Havulinna, Aki S.; Blades, Matthew; Hengstenberg, Christian; Erdmann, Jeanette; Schunkert, Heribert; Kaprio, Jaakko; Tobin, Martin D.; Samani, Nilesh J.; Lind, Lars; Salomaa, Veikko; Lindgren, Cecilia M.; Slagboom, P. Eline; Metspalu, Andres; van Duijn, Cornelia M.; Eriksson, Johan G.; Peters, Annette; Gieger, Christian; Jula, Antti; Groop, Leif; Raitakari, Olli T.; Power, Chris; Penninx, Brenda W. J. H.; de Geus, Eco; Smit, Johannes H.; Boomsma, Dorret I.; Pedersen, Nancy L.; Ingelsson, Erik; Thorsteinsdottir, Unnur; Stefansson, Kari; Ripatti, Samuli; Prokopenko, Inga; McCarthy, Mark I.; Morris, Andrew P.
2015-01-01
Reference panels from the 1000 Genomes (1000G) Project Consortium provide near complete coverage of common and low-frequency genetic variation with minor allele frequency ≥0.5% across European ancestry populations. Within the European Network for Genetic and Genomic Epidemiology (ENGAGE) Consortium, we have undertaken the first large-scale meta-analysis of genome-wide association studies (GWAS), supplemented by 1000G imputation, for four quantitative glycaemic and obesity-related traits, in up to 87,048 individuals of European ancestry. We identified two loci for body mass index (BMI) at genome-wide significance, and two for fasting glucose (FG), none of which has been previously reported in larger meta-analysis efforts to combine GWAS of European ancestry. Through conditional analysis, we also detected multiple distinct signals of association mapping to established loci for waist-hip ratio adjusted for BMI (RSPO3) and FG (GCK and G6PC2). The index variant for one association signal at the G6PC2 locus is a low-frequency coding allele, H177Y, which has recently been demonstrated to have a functional role in glucose regulation. Fine-mapping analyses revealed that the non-coding variants most likely to drive association signals at established and novel loci were enriched for overlap with enhancer elements, which for FG mapped to promoter and transcription factor binding sites in pancreatic islets, in particular. Our study demonstrates that 1000G imputation and genetic fine-mapping of common and low-frequency variant association signals at GWAS loci, integrated with genomic annotation in relevant tissues, can provide insight into the functional and regulatory mechanisms through which their effects on glycaemic and obesity-related traits are mediated. PMID:26132169
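The abstract above describes a meta-analysis of GWAS but does not state the statistical model. A widely used approach, assumed here for illustration only, is fixed-effect inverse-variance weighting of per-study effect estimates for a single variant. The effect sizes and standard errors below are hypothetical.

```python
from math import erfc, sqrt


def fixed_effect_meta(betas, ses):
    """Inverse-variance weighted fixed-effect meta-analysis for one variant.

    betas: per-study effect estimates; ses: their standard errors.
    Returns the pooled effect, its standard error, the z-score and a
    two-sided p-value under a normal approximation.
    """
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = sqrt(1.0 / total)
    z = beta / se
    p = erfc(abs(z) / sqrt(2))  # equals 2 * (1 - Phi(|z|))
    return beta, se, z, p


# Hypothetical per-study estimates for one SNP (not from ENGAGE).
beta, se, z, p = fixed_effect_meta([0.10, 0.20], [0.10, 0.10])
print(f"beta={beta:.3f} se={se:.4f} z={z:.2f} p={p:.4f}")
```

With equal standard errors the pooled effect is simply the mean of the two estimates, and the pooled standard error shrinks by a factor of the square root of two.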
Książkiewicz, Michał; Rychel, Sandra; Nelson, Matthew N; Wyrwa, Katarzyna; Naganowska, Barbara; Wolko, Bogdan
2016-10-21
The Arabidopsis FLOWERING LOCUS T (FT) gene, a member of the phosphatidylethanolamine binding protein (PEBP) family, is a major controller of flowering in response to photoperiod, vernalization and light quality. In legumes, FT evolved into three, functionally diversified clades, FTa, FTb and FTc. A milestone achievement in narrow-leafed lupin (Lupinus angustifolius L.) domestication was the loss of vernalization responsiveness at the Ku locus. Recently, one of two existing L. angustifolius homologs of FTc, LanFTc1, was revealed to be the gene underlying Ku. It is the first recorded involvement of an FTc homologue in vernalization. The evolutionary basis of this phenomenon in lupin has not yet been deciphered. Bacterial artificial chromosome (BAC) clones carrying LanFTc1 and LanFTc2 genes were localized in different mitotic chromosomes and constituted sequence-specific landmarks for linkage groups NLL-10 and NLL-17. BAC-derived superscaffolds containing LanFTc genes revealed clear microsyntenic patterns to genome sequences of nine legume species. Superscaffold-1 carrying LanFTc1 aligned to regions encoding one or more FT-like genes whereas superscaffold-2 mapped to a region lacking such a homolog. Comparative mapping of the L. angustifolius genome assembly anchored to linkage map localized superscaffold-1 in the middle of a 15 cM conserved, collinear region. In contrast, superscaffold-2 was found at the edge of a 20 cM syntenic block containing highly disrupted collinearity at the LanFTc2 locus. 118 PEBP-family full-length homologs were identified in 10 legume genomes. Bayesian phylogenetic inference provided novel evidence supporting the hypothesis that whole-genome and tandem duplications contributed to expansion of PEBP-family genes in legumes. Duplicated genes were subjected to strong purifying selection. 
Promoter analysis of FT genes revealed no statistically significant sequence similarity between duplicated copies; only RE-alpha and CCAAT-box motifs were found at conserved positions and orientations. Numerous lineage-specific duplications occurred during the evolution of legume PEBP-family genes. Whole-genome duplications resulted in the origin of subclades FTa, FTb and FTc and in the multiplication of FTa and FTb copy number. LanFTc1 is located in the region conserved among all main lineages of Papilionoideae. LanFTc1 is a direct descendant of ancestral FTc, whereas LanFTc2 appeared by subsequent duplication.
Predictors of Parental Locus of Control in Mothers of Pre- and Early-Adolescents
Freed, Rachel D.; Tompson, Martha C.
2016-01-01
Parental locus of control refers to parents’ perceived power and efficacy in child-rearing situations. This study explored parental locus of control and its correlates in 160 mothers of children ages 8–14 cross-sectionally and 1 year later. Maternal depression, maternal expressed emotion, and child internalizing and externalizing behavior were examined, along with a number of sociodemographic factors. Cross-sectional analyses indicated that external parental locus of control was associated with child externalizing behavior, maternal depression, less maternal education, lower income, and older maternal age. Longitudinal analyses showed that child age and externalizing behavior also predicted increases in external parental locus of control 1 year later. Finally, lower income and less parental perceived control predicted increases in child externalizing behavior over time. PMID:21229447
The Chlamydomonas genome project: a decade on.
Blaby, Ian K; Blaby-Haas, Crysten E; Tourasse, Nicolas; Hom, Erik F Y; Lopez, David; Aksoy, Munevver; Grossman, Arthur; Umen, James; Dutcher, Susan; Porter, Mary; King, Stephen; Witman, George B; Stanke, Mario; Harris, Elizabeth H; Goodstein, David; Grimwood, Jane; Schmutz, Jeremy; Vallon, Olivier; Merchant, Sabeeha S; Prochnik, Simon
2014-10-01
The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis, and micronutrient homeostasis. Ten years since its genome project was initiated an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the omics era. Housed at Phytozome, the plant genomics portal of the Joint Genome Institute (JGI), the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of whole transcriptome sequencing (RNA-Seq) data. We present here the past, present, and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions, and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes. Copyright © 2014 Elsevier Ltd. All rights reserved.
Non-viral delivery of genome-editing nucleases for gene therapy.
Wang, M; Glass, Z A; Xu, Q
2017-03-01
Manipulating the genetic makeup of mammalian cells using programmable nuclease-based genome-editing technology has recently evolved into a powerful avenue that holds great potential for treating genetic disorders. There are four types of genome-editing nucleases, including meganucleases, zinc finger nucleases, transcription activator-like effector nucleases and clustered, regularly interspaced, short palindromic repeat-associated nucleases such as Cas9. These nucleases have been harnessed to introduce precise and specific changes of the genome sequence at virtually any genome locus of interest. The therapeutic relevance of these genome-editing technologies, however, is challenged by the safe and efficient delivery of nuclease into targeted cells. Herein, we summarize recent advances that have been made on non-viral delivery of genome-editing nucleases. In particular, we focus on non-viral delivery of Cas9/sgRNA ribonucleoproteins for genome editing. In addition, the future direction for developing non-viral delivery of programmable nucleases for genome editing is discussed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Siddaramappa, Shivakumara; Delano, Susana; Green, Lance D.
2012-01-01
Dehalogenimonas lykanthroporepellens is the type species of the genus Dehalogenimonas, which belongs to a deeply branching lineage within the phylum Chloroflexi. This strictly anaerobic, mesophilic, non-spore-forming, Gram-negative-staining bacterium was first isolated from chlorinated solvent-contaminated groundwater at a Superfund site located near Baton Rouge, Louisiana, USA. D. lykanthroporepellens was of interest for genome sequencing for two reasons: (a) its unusual ability to couple growth with reductive dechlorination of environmentally important polychlorinated aliphatic alkanes and (b) its phylogenetic position distant from previously sequenced bacteria. The 1,686,510 bp circular chromosome of strain BL-DC-9ᵀ contains 1,720 predicted protein-coding genes, 47 tRNA genes, a single large subunit rRNA (23S-5S) locus, and a single, orphan, small subunit rRNA (16S) locus.
Billoud, Bernard; Jouanno, Émilie; Nehr, Zofia; Carton, Baptiste; Rolland, Élodie; Chenivesse, Sabine; Charrier, Bénédicte
2015-01-01
Mutagenesis is the only process by which unpredicted biological gene function can be identified. Although several macroalgal developmental mutants have been generated, their causal mutations were never identified, because the necessary experimental conditions were not in place at the time. Today, progress in macroalgal genomics and judicious choices of suitable genetic models make mutated gene identification possible. This article presents a comparative study of two methods for identifying a genetic locus in the brown alga Ectocarpus siliculosus: positional cloning and Next-Generation Sequencing (NGS)-based mapping. Once the necessary preliminary experimental tools were gathered, we tested both analyses on an Ectocarpus morphogenetic mutant. We show how a narrower localization results from the combination of the two methods. Advantages and drawbacks of these two approaches as well as potential transfer to other macroalgae are discussed. PMID:25745426
Bowman, Shaun M; Piwowar, Amy; Ciocca, Maria; Free, Stephen J
2005-01-01
Two Neurospora mutants with a phenotype that includes a tight colonial growth pattern, an inability to form conidia and an inability to form protoperithecia have been isolated and characterized. The relevant mutations were mapped to the same locus on the sequenced Neurospora genome. The mutations responsible for the mutant phenotype then were identified by examining likely candidate genes from the mutant genomes at the mapped locus with PCR amplification and a sequencing assay. The results demonstrate that a map and sequence strategy is a feasible way to identify mutant genes in Neurospora. The gene responsible for the phenotype is a putative alpha-1,2-mannosyltransferase gene. The mutant cell wall has an altered composition demonstrating that the gene functions in cell wall biosynthesis. The results demonstrate that the mnt-1 gene is required for normal cell wall biosynthesis, morphology and for the regulation of asexual development.
aTRAM 2.0: An Improved, Flexible Locus Assembler for NGS Data
Allen, Julie M; LaFrance, Raphael; Folk, Ryan A; Johnson, Kevin P; Guralnick, Robert P
2018-01-01
Massive strides have been made in technologies for collecting genome-scale data. However, tools for efficiently and flexibly assembling raw outputs into downstream analytical workflows are still nascent. aTRAM 1.0 was designed to assemble any locus from genome sequencing data but was neither optimized for efficiency nor able to serve as a single toolkit for all assembly needs. We have completely re-implemented aTRAM and redesigned its structure for faster read retrieval while adding a number of key features to improve flexibility and functionality. The software can now (1) assemble single- or paired-end data, (2) utilize both read directions in the database, (3) use an additional de novo assembly module, and (4) leverage new built-in pipelines to automate common workflows in phylogenomics. Owing to reimplementation of databasing strategies, we demonstrate that aTRAM 2.0 is much faster across all applications compared to the previous version. PMID:29881251
Genome-wide diversity and selective pressure in the human rhinovirus
Kistler, Amy L; Webster, Dale R; Rouskin, Silvi; Magrini, Vince; Credle, Joel J; Schnurr, David P; Boushey, Homer A; Mardis, Elaine R; Li, Hao; DeRisi, Joseph L
2007-01-01
Background: The human rhinoviruses (HRV) are one of the most common and diverse respiratory pathogens of humans. Over 100 distinct HRV serotypes are known, yet only 6 genomes are available. Due to the paucity of HRV genome sequence, little is known about the genetic diversity within HRV or the forces driving this diversity. Previous comparative genome sequence analyses indicate that recombination drives diversification in multiple genera of the picornavirus family, yet it remains unclear if this holds for HRV. Results: To resolve this and gain insight into the forces driving diversification in HRV, we generated a representative set of 34 fully sequenced HRVs. Analysis of these genomes shows consistent phylogenies across the genome, conserved non-coding elements, and only limited recombination. However, spikes of genetic diversity at both the nucleotide and amino acid level are detectable within every locus of the genome. Despite this, the HRV genome as a whole is under purifying selective pressure, with islands of diversifying pressure in the VP1, VP2, and VP3 structural genes and two non-structural genes, the 3C protease and 3D polymerase. Mapping diversifying residues in these factors onto available 3-dimensional structures revealed the diversifying capsid residues partition to the external surface of the viral particle in statistically significant proximity to antigenic sites. Diversifying pressure in the pleconaril binding site is confined to a single residue known to confer drug resistance (VP1 191). In contrast, diversifying pressure in the non-structural genes is less clear, mapping both nearby and beyond characterized functional domains of these factors. Conclusion: This work provides a foundation for understanding HRV genetic diversity and insight into the underlying biology driving evolution in HRV. It expands our knowledge of the genome sequence space that HRV reference serotypes occupy and how the pattern of genetic diversity across HRV genomes differs from other picornaviruses. It also reveals evidence of diversifying selective pressure in both structural genes known to interact with the host immune system and in domains of unassigned function in the non-structural 3C and 3D genes, raising the possibility that diversification of undiscovered functions in these essential factors may influence HRV fitness and evolution. PMID:17477878
Generation of Stable Knockout Mammalian Cells by TALEN-Mediated Locus-Specific Gene Editing.
Mahata, Barun; Biswas, Kaushik
2017-01-01
Precise and targeted genome editing using Transcription Activator-Like Effector Endonucleases (TALENs) has been widely used and proven to be an extremely effective and specific knockout strategy in both cultured cells and animal models. The current chapter describes a protocol for the construction and generation of TALENs using serial and hierarchical digestion and ligation steps, and using the synthesized TALEN pairs to achieve locus-specific targeted gene editing in mammalian cell lines using a modified clonal selection strategy in an easy and cost-efficient manner.
Insights into DDT Resistance from the Drosophila melanogaster Genetic Reference Panel
Schmidt, Joshua M.; Battlay, Paul; Gledhill-Smith, Rebecca S.; Good, Robert T.; Lumb, Chris; Fournier-Level, Alexandre; Robin, Charles
2017-01-01
Insecticide resistance is considered a classic model of microevolution, where a strong selective agent is applied to a large natural population, resulting in a change in frequency of alleles that confer resistance. While many insecticide resistance variants have been characterized at the gene level, they are typically single genes of large effect identified in highly resistant pest species. In contrast, multiple variants have been implicated in DDT resistance in Drosophila melanogaster; however, only the Cyp6g1 locus has previously been shown to be relevant to field populations. Here we use genome-wide association studies (GWAS) to identify DDT-associated polygenes and use selective sweep analyses to assess their adaptive significance. We identify and verify two candidate DDT resistance loci. A largely uncharacterized gene, CG10737, has a function in muscles that ameliorates the effects of DDT, while a putative detoxifying P450, Cyp6w1, shows compelling evidence of positive selection. PMID:28935691
C57BL/6N mutation in Cytoplasmic FMR interacting protein 2 regulates cocaine response
Kumar, Vivek; Kim, Kyungin; Joseph, Chryshanthi; Kourrich, Saïd; Yoo, Seung Hee; Huang, Hung Chung; Vitaterna, Martha H.; de Villena, Fernando Pardo-Manuel; Churchill, Gary; Bonci, Antonello; Takahashi, Joseph S.
2015-01-01
The inbred mouse C57BL/6J is the reference strain for genome sequence and for most behavioral and physiological phenotypes. However, the International Knockout Mouse Consortium uses an embryonic stem cell line derived from a related C57BL/6N substrain. We found that C57BL/6N has lower acute and sensitized response to cocaine and methamphetamine. We mapped a single causative locus and identified a non-synonymous mutation of serine to phenylalanine (S968F) in Cytoplasmic FMR interacting protein 2 (Cyfip2) as the causative variant. The S968F mutation destabilizes CYFIP2, and deletion of the C57BL/6N mutant allele leads to acute and sensitized cocaine response phenotypes. We propose that CYFIP2 is a key regulator of cocaine response in mammals and present a framework to utilize mouse substrains to discover novel genes and alleles regulating behavior. PMID:24357318
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes
Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich
2012-01-01
The study of mutants to elucidate gene functions has a long and successful history; however, discovering causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild-type copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information. PMID:22384404
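The pooled-sequencing idea in the abstract above can be illustrated with a small filter. This is a minimal sketch and not the authors' pipeline: it assumes one pool of mutant-phenotype progeny and one wild-type pool, and it assumes that a causative mutation is (nearly) fixed in the mutant pool and (nearly) absent from the wild-type pool. The `PooledVariant` record, the function name, and all thresholds are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class PooledVariant:
    """Read counts for one variant site in two sequenced pools (hypothetical record)."""
    position: int
    mutant_alt_reads: int
    mutant_total_reads: int
    wildtype_alt_reads: int
    wildtype_total_reads: int


def candidate_causative(variants, min_depth=10,
                        min_mutant_freq=0.9, max_wildtype_freq=0.1):
    """Return positions whose alternate allele is near-fixed in the mutant pool
    and near-absent in the wild-type pool. Thresholds are illustrative only."""
    candidates = []
    for v in variants:
        # Skip sites with too little coverage to estimate a frequency.
        if v.mutant_total_reads < min_depth or v.wildtype_total_reads < min_depth:
            continue
        mutant_freq = v.mutant_alt_reads / v.mutant_total_reads
        wildtype_freq = v.wildtype_alt_reads / v.wildtype_total_reads
        if mutant_freq >= min_mutant_freq and wildtype_freq <= max_wildtype_freq:
            candidates.append(v.position)
    return candidates


# Made-up example data: only the site at position 1200 behaves like a
# causative mutation under the assumptions stated above.
example = [
    PooledVariant(300, 22, 40, 19, 38),   # segregating in both pools
    PooledVariant(1200, 39, 40, 0, 41),   # fixed in mutant pool, absent in wild type
    PooledVariant(5000, 4, 5, 0, 30),     # too shallow in the mutant pool
]
candidates = candidate_causative(example)
```

In practice the surviving candidates would still need to be confirmed, as the abstract describes, by transformation with a wild-type copy of the affected gene.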
Galanter, Joshua Mark; Fernandez-Lopez, Juan Carlos; Gignoux, Christopher R.; Barnholtz-Sloan, Jill; Fernandez-Rozadilla, Ceres; Via, Marc; Hidalgo-Miranda, Alfredo; Contreras, Alejandra V.; Figueroa, Laura Uribe; Raska, Paola; Jimenez-Sanchez, Gerardo; Silva Zolezzi, Irma; Torres, Maria; Ponte, Clara Ruiz; Ruiz, Yarimar; Salas, Antonio; Nguyen, Elizabeth; Eng, Celeste; Borjas, Lisbeth; Zabala, William; Barreto, Guillermo; Rondón González, Fernando; Ibarra, Adriana; Taboada, Patricia; Porras, Liliana; Moreno, Fabián; Bigham, Abigail; Gutierrez, Gerardo; Brutsaert, Tom; León-Velarde, Fabiola; Moore, Lorna G.; Vargas, Enrique; Cruz, Miguel; Escobedo, Jorge; Rodriguez-Santana, José; Rodriguez-Cintrón, William; Chapela, Rocio; Ford, Jean G.; Bustamante, Carlos; Seminara, Daniela; Shriver, Mark; Ziv, Elad; Gonzalez Burchard, Esteban; Haile, Robert
2012-01-01
Most individuals throughout the Americas are admixed descendants of Native American, European, and African ancestors. Complex historical factors have resulted in varying proportions of ancestral contributions between individuals within and among ethnic groups. We developed a panel of 446 ancestry informative markers (AIMs) optimized to estimate ancestral proportions in individuals and populations throughout Latin America. We used genome-wide data from 953 individuals from diverse African, European, and Native American populations to select AIMs optimized for each of the three main continental populations that form the basis of modern Latin American populations. We selected markers on the basis of locus-specific branch length to be informative, well distributed throughout the genome, capable of being genotyped on widely available commercial platforms, and applicable throughout the Americas by minimizing within-continent heterogeneity. We then validated the panel in samples from four admixed populations by comparing ancestry estimates based on the AIMs panel to estimates based on genome-wide association study (GWAS) data. The panel provided balanced discriminatory power among the three ancestral populations and accurate estimates of individual ancestry proportions (R² > 0.9 for ancestral components with significant between-subject variance). Finally, we genotyped samples from 18 populations from Latin America using the AIMs panel and estimated variability in ancestry within and between these populations. This panel and its reference genotype information will be useful resources to explore population history of admixture in Latin America and to correct for the potential effects of population stratification in admixed samples in the region. PMID:22412386
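The marker-selection criterion named in the abstract above, locus-specific branch length (LSBL), can be sketched for a single biallelic marker. This is an illustration under stated assumptions rather than the authors' implementation: the pairwise distance used here is a simple heterozygosity-based FST for two equally weighted populations, and the function names and example frequencies are hypothetical.

```python
def pairwise_fst(p1, p2):
    """Simple FST for one biallelic locus from two allele frequencies,
    computed as (H_T - H_S) / H_T with equal population weights (an assumption)."""
    p_bar = (p1 + p2) / 2
    h_total = 2 * p_bar * (1 - p_bar)
    if h_total == 0:
        return 0.0  # locus is monomorphic in both populations
    h_within = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
    return (h_total - h_within) / h_total


def locus_specific_branch_lengths(p_a, p_b, p_c):
    """LSBL for three populations A, B, C at one locus.
    LSBL_A = (d_AB + d_AC - d_BC) / 2, and analogously for B and C."""
    d_ab = pairwise_fst(p_a, p_b)
    d_ac = pairwise_fst(p_a, p_c)
    d_bc = pairwise_fst(p_b, p_c)
    return (
        (d_ab + d_ac - d_bc) / 2,
        (d_ab + d_bc - d_ac) / 2,
        (d_ac + d_bc - d_ab) / 2,
    )


# Made-up frequencies: the allele is common in population A only, so the
# whole divergence at this marker is assigned to A's branch.
lsbl_a, lsbl_b, lsbl_c = locus_specific_branch_lengths(0.9, 0.1, 0.1)
```

A marker with a long branch for one population and short branches for the other two is informative for that population's ancestry, which is why ranking by LSBL can yield a panel balanced across three ancestral groups.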
A Locus Encoding Variable Defense Systems against Invading DNA Identified in Streptococcus suis
Okura, Masatoshi; Nozawa, Takashi; Watanabe, Takayasu; Murase, Kazunori; Nakagawa, Ichiro; Takamatsu, Daisuke; Osaki, Makoto; Sekizaki, Tsutomu; Gottschalk, Marcelo; Hamada, Shigeyuki
2017-01-01
Streptococcus suis, an important zoonotic pathogen, is known to have an open pan-genome and to develop a competent state. In S. suis, limited genetic lineages are suggested to be associated with zoonosis. However, little is known about the evolution of diversified lineages and their respective phenotypic or ecological characteristics. In this study, we performed comparative genome analyses of S. suis, with a focus on the competence genes, mobile genetic elements, and genetic elements related to various defense systems against exogenous DNAs (defense elements) that are associated with gene gain/loss/exchange mediated by horizontal DNA movements and their restrictions. Our genome analyses revealed a conserved competence-inducing peptide type (pherotype) of the competence system and large-scale genome rearrangements in certain clusters based on the genome phylogeny of 58 S. suis strains. Moreover, the profiles of the defense elements were similar or identical to each other among the strains belonging to the same genomic clusters. Our findings suggest that these genetic characteristics of each cluster might exert specific effects on the phenotypic or ecological differences between the clusters. We also found certain loci that shift several types of defense elements in S. suis. Of note, one of these loci is a previously unrecognized variable region in bacteria, at which strains of distinct clusters code for different and various defense elements. This locus might represent a novel defense mechanism that has evolved through an arms race between bacteria and invading DNAs, mediated by mobile genetic elements and genetic competence. PMID:28379509
Liu, Xin; Wang, Li Gang; Luo, Wei Zhen; Li, Yong; Liang, Jing; Yan, Hua; Zhao, Ke Bin; Wang, Li Xian; Zhang, Long Chao
2014-12-01
A high-density single nucleotide polymorphism (SNP) array containing 62 163 markers was employed for a genome-wide association study (GWAS) to identify variants associated with lean meat in ham (LMH, %) and lean meat percentage (LMP, %) within a porcine Large White×Minzhu intercross population. For each individual, LMH and LMP were measured after slaughter at the age of 240±7 days. A total of 557 F2 animals were genotyped. The GWAS revealed that 21 SNPs showed significant genome-wide or chromosome-wide associations with LMH and LMP by the Genome-wide Rapid Association using Mixed Model and Regression-Genomic Control approach. Nineteen significant genome-wide SNPs were mapped to the distal end of Sus scrofa chromosome (SSC) 2, where IGF2, a major known gene responsible for muscle mass, is located. A conditioned analysis, in which the genotype of the strongest associated SNP is included as a fixed effect in the model, showed that those significant SNPs on SSC2 were derived from a single quantitative trait locus. The two chromosome-wide significant SNPs on SSC1 disappeared after the conditioned analysis, suggesting that their association signal was a false association derived from using an F2 population. The present result is expected to lead to novel insights into muscle mass in different pig breeds and lays a preliminary foundation for follow-up studies for identification of causal mutations for subsequent application in marker-assisted selection programs for improving muscle mass in pigs. © 2014 Japanese Society of Animal Science.
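The conditioned analysis described in the abstract above can be illustrated with ordinary least squares. This is a simplified sketch, not the study's mixed-model procedure: it ignores relatedness among animals, and it obtains the conditional effect by residualizing both the phenotype and the candidate SNP on the top SNP, which gives the same coefficient as fitting the top SNP as a fixed covariate. All names and genotype values are hypothetical.

```python
def slope_intercept(x, y):
    """Least-squares slope and intercept of y regressed on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def residuals(x, y):
    """Residuals of y after removing the linear effect of x."""
    slope, intercept = slope_intercept(x, y)
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


def marginal_effect(snp, phenotype):
    """Effect of a SNP when tested on its own."""
    return slope_intercept(snp, phenotype)[0]


def conditional_effect(snp, top_snp, phenotype):
    """Effect of a SNP after conditioning on the top SNP as a fixed effect."""
    snp_resid = residuals(top_snp, snp)
    pheno_resid = residuals(top_snp, phenotype)
    return slope_intercept(snp_resid, pheno_resid)[0]


# Made-up genotypes coded 0/1/2. The phenotype depends only on the top SNP;
# the linked SNP is merely correlated with it.
top = [0, 0, 1, 1, 2, 2, 0, 1, 2, 1]
linked = [0, 0, 1, 1, 2, 2, 1, 1, 1, 2]
phenotype = [50 + 2 * g for g in top]

beta_marginal = marginal_effect(linked, phenotype)
beta_conditional = conditional_effect(linked, top, phenotype)
```

In this toy data the linked SNP shows a clear effect on its own, but the effect vanishes once the top SNP is in the model. That is the pattern the abstract interprets as several significant SNPs tagging a single quantitative trait locus.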
Modulation of Genetic Associations with Serum Urate Levels by Body-Mass-Index in Humans
Huffman, Jennifer E.; Albrecht, Eva; Teumer, Alexander; Mangino, Massimo; Kapur, Karen; Johnson, Toby; Kutalik, Zoltán; Pirastu, Nicola; Pistis, Giorgio; Lopez, Lorna M.; Haller, Toomas; Salo, Perttu; Goel, Anuj; Li, Man; Tanaka, Toshiko; Dehghan, Abbas; Ruggiero, Daniela; Malerba, Giovanni; Smith, Albert V.; Nolte, Ilja M.; Portas, Laura; Phipps-Green, Amanda; Boteva, Lora; Navarro, Pau; Johansson, Asa; Hicks, Andrew A.; Polasek, Ozren; Esko, Tõnu; Peden, John F.; Harris, Sarah E.; Murgia, Federico; Wild, Sarah H.; Tenesa, Albert; Tin, Adrienne; Mihailov, Evelin; Grotevendt, Anne; Gislason, Gauti K.; Coresh, Josef; D'Adamo, Pio; Ulivi, Sheila; Vollenweider, Peter; Waeber, Gerard; Campbell, Susan; Kolcic, Ivana; Fisher, Krista; Viigimaa, Margus; Metter, Jeffrey E.; Masciullo, Corrado; Trabetti, Elisabetta; Bombieri, Cristina; Sorice, Rossella; Döring, Angela; Reischl, Eva; Strauch, Konstantin; Hofman, Albert; Uitterlinden, Andre G.; Waldenberger, Melanie; Wichmann, H-Erich; Davies, Gail; Gow, Alan J.; Dalbeth, Nicola; Stamp, Lisa; Smit, Johannes H.; Kirin, Mirna; Nagaraja, Ramaiah; Nauck, Matthias; Schurmann, Claudia; Budde, Kathrin; Farrington, Susan M.; Theodoratou, Evropi; Jula, Antti; Salomaa, Veikko; Sala, Cinzia; Hengstenberg, Christian; Burnier, Michel; Mägi, Reedik; Klopp, Norman; Kloiber, Stefan; Schipf, Sabine; Ripatti, Samuli; Cabras, Stefano; Soranzo, Nicole; Homuth, Georg; Nutile, Teresa; Munroe, Patricia B.; Hastie, Nicholas; Campbell, Harry; Rudan, Igor; Cabrera, Claudia; Haley, Chris; Franco, Oscar H.; Merriman, Tony R.; Gudnason, Vilmundur; Pirastu, Mario; Penninx, Brenda W.; Snieder, Harold; Metspalu, Andres; Ciullo, Marina; Pramstaller, Peter P.; van Duijn, Cornelia M.; Ferrucci, Luigi; Gambaro, Giovanni; Deary, Ian J.; Dunlop, Malcolm G.; Wilson, James F.; Gasparini, Paolo; Gyllensten, Ulf; Spector, Tim D.; Wright, Alan F.; Hayward, Caroline; Watkins, Hugh; Perola, Markus; Bochud, Murielle; Kao, W. H. Linda; Caulfield, Mark; Toniolo, Daniela; Völzke, Henry; Gieger, Christian; Köttgen, Anna; Vitart, Veronique
2015-01-01
We tested for interactions between body mass index (BMI) and common genetic variants affecting serum urate levels, genome-wide, in up to 42569 participants. Both stratified genome-wide association (GWAS) analyses, in lean, overweight and obese individuals, and regression-type analyses in a non-BMI-stratified overall sample were performed. The former did not uncover any novel locus with a major main effect, but supported modulation of effects for some known and potentially new urate loci. The latter highlighted a SNP at RBFOX3 reaching genome-wide significance (effect size 0.014, 95% CI 0.008-0.02, P_inter = 2.6 × 10⁻⁸). Two top loci in interaction term analyses, RBFOX3 and ERO1LB-EDARADD, also displayed suggestive differences in main effect size between the lean and obese strata. All top-ranking loci for urate effect differences between BMI categories were novel, and most had small-magnitude but opposite-direction effects between strata. They include the locus RBMS1-TANK (men, P_diff lean-overweight = 4.7 × 10⁻⁸), a region that has been associated with several obesity-related traits, and TSPYL5 (men, P_diff lean-overweight = 9.1 × 10⁻⁸), regulating adipocyte-produced estradiol. The top-ranking known urate locus was ABCG2, the strongest known gout risk locus, with an effect halved in obese compared to lean men (P_diff lean-obese = 2 × 10⁻⁴). Finally, pathway analysis suggested a role for N-glycan biosynthesis as a prominent urate-associated pathway in the lean stratum. These results illustrate a potentially powerful way to monitor changes occurring in an obesogenic environment. PMID:25811787
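The comparison of effect sizes between BMI strata reported in the abstract above can be illustrated with a standard two-sample z-test on regression coefficients. The abstract does not state which test statistic the authors used, so this is a sketch under assumptions: the strata are treated as independent samples, and the function name and input values are made up.

```python
import math


def effect_difference_test(beta_a, se_a, beta_b, se_b):
    """Two-sided z-test for a difference between two effect estimates
    from independent strata (independence is an assumption)."""
    z = (beta_a - beta_b) / math.sqrt(se_a ** 2 + se_b ** 2)
    # Two-sided p-value under the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Made-up per-stratum estimates for one SNP: an effect in the lean stratum
# that is halved in the obese stratum.
z_stat, p_diff = effect_difference_test(beta_a=0.10, se_a=0.02,
                                        beta_b=0.05, se_b=0.02)
```

Because the standard errors of both strata enter the denominator, a halved effect only yields a small p-value when each stratum is estimated precisely. This is one reason such comparisons need large samples.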
Immunoglobulin Genomics in the Guinea Pig (Cavia porcellus)
Guo, Yongchen; Bao, Yonghua; Meng, Qingwen; Hu, Xiaoxiang; Meng, Qingyong; Ren, Liming; Li, Ning; Zhao, Yaofeng
2012-01-01
In science, the guinea pig is known as one of the gold standards for modeling human disease. It is especially important as a molecular and cellular biology model for studying the human immune system, as its immunological genes are more similar to human genes than are those of mice. The utility of the guinea pig as a model organism can be enhanced by further characterization of the genes encoding components of the immune system. Here, we report the genomic organization of the guinea pig immunoglobulin (Ig) heavy and light chain genes. The guinea pig IgH locus is located in genomic scaffolds 54 and 75, and spans approximately 6,480 kb. 507 VH segments (94 potentially functional genes and 413 pseudogenes), 41 DH segments, six JH segments, four constant region genes (μ, γ, ε, and α), and one reverse δ remnant fragment were identified within the two scaffolds. Many VH pseudogenes were found within the guinea pig, and likely constituted a potential donor pool for gene conversion during evolution. The Igκ locus mapped to a 4,029 kb region of scaffolds 37 and 24 and is composed of 349 Vκ (111 potentially functional genes and 238 pseudogenes), three Jκ and one Cκ genes. The Igλ locus spans 1,642 kb in scaffold 4 and consists of 142 Vλ (58 potentially functional genes and 84 pseudogenes) and 11 Jλ-Cλ clusters. Phylogenetic analysis suggested that the guinea pig's numerous germline VH gene segments form a limited number of gene families. Therefore, this species may generate antibody diversity via a gene conversion-like mechanism associated with its pseudogene reserves. PMID:22761756
Genomic organization of the 260 kb surrounding the waxy locus in a Japonica rice
Nagano; Wu; Kawasaki; Kishima; Sano
1999-12-01
The present study was carried out to characterize the molecular organization in the vicinity of the waxy locus in rice. To determine the structural organization of the region surrounding waxy, contiguous clones covering a total of 260 kb were constructed using a bacterial artificial chromosome (BAC) library from the Shimokita variety of Japonica rice. This map also contains 200 overlapping subclones, which allowed construction of a fine physical map with a total of 64 HindIII sites. During the course of constructing the map, we noticed the presence of some repeated regions which might be related to transposable elements. We divided the 260-kb region into 60 segments (average size of 5.7 kb) to use as probes to determine their genomic organization. Hybridization patterns obtained by probing with these segments were classified into four types: class 1, a single or a few bands without a smeared background; class 2, a single or a few bands with a smeared background; class 3, multiple discrete bands without a smeared background; and class 4, only a smeared background. These classes constituted 6.5%, 20.9%, 3.7%, and 68.9% of the 260-kb region, respectively. The distribution of each class revealed that repetitive sequences are a major component in this region, as expected, and that unique sequence regions were mostly no longer than 6 kb due to interruption by repetitive sequences. We discuss how the map constructed here might be a powerful tool for characterization and comparison of the genome structures and the genes around the waxy locus in the Oryza species.
ERIC Educational Resources Information Center
Robertson, Carol
2016-01-01
Learning about chromosomes is standard fare in biology classrooms today. However, students may find it difficult to understand the relationships among the "genome", "chromosomes", "genes", a "gene locus", and "alleles". In the simple activity described in this article, which follows the 5E approach…
Nagel, Inga; Szczepanowski, Monika; Martín-Subero, José I; Harder, Lana; Akasaka, Takashi; Ammerpohl, Ole; Callet-Bauchu, Evelyne; Gascoyne, Randy D; Gesk, Stefan; Horsman, Doug; Klapper, Wolfram; Majid, Aneela; Martinez-Climent, José A; Stilgenbauer, Stephan; Tönnies, Holger; Dyer, Martin J S; Siebert, Reiner
2010-08-26
Sequence variants at the TERT-CLPTM1L locus in chromosome 5p have been recently associated with disposition for various cancers. Here we show that this locus including the gene encoding the telomerase reverse-transcriptase TERT at 5p15.33 is rarely but recurrently targeted by somatic chromosomal translocations to IGH and non-IG loci in B-cell neoplasms, including acute lymphoblastic leukemia, chronic lymphocytic leukemia, mantle cell lymphoma and splenic marginal zone lymphoma. In addition, cases with genomic amplification of TERT locus were identified. Tumors bearing chromosomal aberrations involving TERT showed higher TERT transcriptional expression and increased telomerase activity. These data suggest that deregulation of TERT gene by chromosomal abnormalities leading to increased telomerase activity might contribute to B-cell lymphomagenesis.
Using the Saccharomyces Genome Database (SGD) for analysis of genomic information
Skrzypek, Marek S.; Hirschman, Jodi
2011-01-01
Analysis of genomic data requires access to software tools that place the sequence-derived information in the context of biology. The Saccharomyces Genome Database (SGD) integrates functional information about budding yeast genes and their products with a set of analysis tools that facilitate exploring their biological details. This unit describes how the various types of functional data available at SGD can be searched, retrieved, and analyzed. Starting with the guided tour of the SGD Home page and Locus Summary page, this unit highlights how to retrieve data using YeastMine, how to visualize genomic information with GBrowse, how to explore gene expression patterns with SPELL, and how to use Gene Ontology tools to characterize large-scale datasets. PMID:21901739
Ragupathy, Raja; Naeem, Hamid A; Reimer, Elsa; Lukow, Odean M; Sapirstein, Harry D; Cloutier, Sylvie
2008-01-01
Sequencing of a BAC clone encompassing the Glu-B1 locus in Glenlea revealed a 10.3 Kb segmental duplication including the Bx7 gene and flanking an LTR retroelement. To better understand the evolution of this locus, two collections of wheat were surveyed. The first consisted of 96 diploid and tetraploid species accessions while the second consisted of 316 Triticum aestivum cultivars and landraces from 41 countries. The genotypes were first characterized by SDS-PAGE and a total of 40 of the 316 T. aestivum accessions were found to display the overexpressed Bx7 phenotype (Bx7OE). Three lines from the 96 diploid/tetraploid collection also displayed the stronger intensity staining characteristic of the Bx7OE subunit. The relative amounts of the Bx7 subunit to total HMW-GS were quantified by RP-HPLC for all Bx7OE accessions and a number of checks. The entire collection was assessed for the presence of four DNA markers namely an 18 bp indel of the coding region of Bx7 variant alleles, a 43 bp indel of the 5'-region and the left and right junctions of the LTR retrotransposon borders and the duplicated segment. All 43 accessions found to have the Bx7OE subunit by SDS-PAGE and RP-HPLC produced the four diagnostic PCR amplicons. None of the lines without the Bx7OE had the LTR retroelement/duplication genomic structure. However, the 18 and 43 bp indel were found in accessions other than Bx7OE. These results indicate that the overexpression of the Bx7 HMW-GS is likely the result of a single event, i.e., a gene duplication at the Glu-B1 locus mediated by the insertion of a retroelement. Also, the 18 and 43 bp indels pre-date the duplication event. Allelic variants Bx7*, Bx7 with and without 43 bp insert and Bx7OE were found in both tetraploid and hexaploid collections and shared the same genomic organization. Though the possibility of introgression from T. aestivum to T. turgidum cannot be ruled out, the three structural genomic changes of the B-genome taken together support the hypothesis of multiple polyploidization events involving different tetraploid progenitors.
USDA-ARS's Scientific Manuscript database
Bottle gourd (Lagenaria siceraria) is an important vegetable crop as well as a rootstock for other cucurbit crops. In this study, we report a high-quality 313.4-Mb genome sequence of a bottle gourd inbred line, USVL1VR-Ls, with a scaffold N50 of 8.7 Mb and the longest of 19.0 Mb. About 98.3% of the ...
Schyth, Brian Dall; Bela-ong, Dennis Berbulla; Jalali, Seyed Amir Hossein; Kristensen, Lasse Bøgelund Juel; Einer-Jensen, Katja; Pedersen, Finn Skou; Lorenzen, Niels
2015-01-01
MicroRNAs (miRNAs) are ~22 nucleotide-long non-coding RNAs which regulate gene expression in the cytoplasm of eukaryotic cells by binding to specific target regions in mRNAs to mediate translational blocking or mRNA cleavage. Through their fundamental roles in cellular pathways, gene regulation mediated by miRNAs has been shown to be involved in almost all biological phenomena, including development, metabolism, cell cycle, tumor formation, and host-pathogen interactions. To address the latter in a primitive vertebrate host, we here used an array platform to analyze the miRNA response in rainbow trout (Oncorhynchus mykiss) following inoculation with the virulent fish rhabdovirus Viral hemorrhagic septicaemia virus. Two clustered miRNAs, miR-462 and miR-731 (herein referred to as miR-462 cluster), described only in teleost fishes, were found to be strongly upregulated, indicating their involvement in fish-virus interactions. We searched for homologues of the two teleost miRNAs in other vertebrate species and investigated whether findings related to ours have been reported for these homologues. Gene synteny analysis along with gene sequence conservation suggested that the teleost fish miR-462 and miR-731 had evolved from the ancestral miR-191 and miR-425 (herein called miR-191 cluster), respectively. Whereas the miR-462 cluster locus is found between two protein-coding genes (intergenic) in teleost fish genomes, the miR-191 cluster locus is found within an intron of a protein-coding gene (intragenic) in the human genome. Interferon (IFN)-inducible and immune-related promoter elements found upstream of the teleost miR-462 cluster locus suggested roles in immune responses to viral pathogens in fish, while in humans, the miR-191 cluster is functionally associated with cell cycle regulation. 
Stimulation of fish cell cultures with the IFN inducer poly I:C accordingly upregulated the expression of miR-462 and miR-731, while no stimulatory effect on miR-191 and miR-425 expression was observed in human cell lines. Despite high sequence conservation, evolution has thus resulted in different regulation and presumably also different functional roles of these orthologous miRNA clusters in different vertebrate lineages. PMID:26207374
Gilbert, Maarten J.; Miller, William G.; Yee, Emma; Kik, Marja; Zomer, Aldert L.; Wagenaar, Jaap A.; Duim, Birgitta
2016-01-01
Abstract Campylobacter iguaniorum is most closely related to the species C. fetus, C. hyointestinalis, and C. lanienae. Reptiles, chelonians and lizards in particular, appear to be a primary reservoir of this Campylobacter species. Here we report the genome comparison of C. iguaniorum strain 1485E, isolated from a bearded dragon (Pogona vitticeps), and strain 2463D, isolated from a green iguana (Iguana iguana), with the genomes of closely related taxa, in particular with reptile-associated C. fetus subsp. testudinum. In contrast to C. fetus, C. iguaniorum is lacking an S-layer encoding region. Furthermore, a defined lipooligosaccharide biosynthesis locus, encoding multiple glycosyltransferases and bounded by waa genes, is absent from C. iguaniorum. Instead, multiple predicted glycosylation regions were identified in C. iguaniorum. One of these regions is > 50 kb with deviant G + C content, suggesting acquisition via lateral transfer. These similar, but non-homologous glycosylation regions were located at the same position on the genome in both strains. Multiple genes encoding respiratory enzymes not identified to date within the C. fetus clade were present. C. iguaniorum shared highest homology with C. hyointestinalis and C. fetus. As in reptile-associated C. fetus subsp. testudinum, a putative tricarballylate catabolism locus was identified. However, despite colonizing a shared host, no recent recombination between both taxa was detected. This genomic study provides a better understanding of host adaptation, virulence, phylogeny, and evolution of C. iguaniorum and related Campylobacter taxa. PMID:27604878
Liu, Yong; Wei, Wen-Ping; Ye, Bang-Ce
2018-05-18
The overexpression of bacterial secondary metabolite biosynthetic enzymes is the basis for industrial overproducing strains. Genome editing tools can be used to further improve gene expression and yield. Saccharopolyspora erythraea produces erythromycin, which has extensive clinical applications. In this study, the CRISPR-Cas9 system was used to edit genes in the S. erythraea genome. A temperature-sensitive plasmid containing the PermE promoter, to drive Cas9 expression, and the Pj23119 and PkasO promoters, to drive sgRNAs, was designed. Erythromycin esterase, encoded by S. erythraea SACE_1765, inactivates erythromycin by hydrolyzing the macrolactone ring. Sequencing and qRT-PCR confirmed that reporter genes were successfully inserted into the SACE_1765 gene. Deletion of SACE_1765 in a high-producing strain resulted in a 12.7% increase in erythromycin levels. Subsequent PermE- egfp knock-in at the SACE_0712 locus resulted in an 80.3% increase in erythromycin production compared with that of wild type. Further investigation showed that PermE promoter knock-in activated the erythromycin biosynthetic gene clusters at the SACE_0712 locus. Additionally, deletion of indA (SACE_1229) using dual sgRNA targeting without markers increased the editing efficiency to 65%. In summary, we have successfully applied Cas9-based genome editing to a bacterial strain, S. erythraea, with a high GC content. This system has potential application for both genome-editing and biosynthetic gene cluster activation in Actinobacteria.
Bonnafous, Fanny; Fievet, Ghislain; Blanchet, Nicolas; Boniface, Marie-Claude; Carrère, Sébastien; Gouzy, Jérôme; Legrand, Ludovic; Marage, Gwenola; Bret-Mestries, Emmanuelle; Munos, Stéphane; Pouilly, Nicolas; Vincourt, Patrick; Langlade, Nicolas; Mangin, Brigitte
2018-02-01
This study compares five models of GWAS, to show the added value of non-additive modeling of allelic effects to identify genomic regions controlling flowering time of sunflower hybrids. Genome-wide association studies are a powerful and widely used tool to decipher the genetic control of complex traits. One of the main challenges for hybrid crops, such as maize or sunflower, is to model the hybrid vigor in the linear mixed models, considering the relatedness between individuals. Here, we compared two additive and three non-additive association models for their ability to identify genomic regions associated with flowering time in sunflower hybrids. A panel of 452 sunflower hybrids, corresponding to incomplete crossing between 36 male lines and 36 female lines, was phenotyped in five environments and genotyped for 2,204,423 SNPs. Intra-locus effects were estimated in multi-locus models to detect genomic regions associated with flowering time using the different models. Thirteen quantitative trait loci were identified in total, two with both model categories and one with only non-additive models. A quantitative trait loci on LG09, detected by both the additive and non-additive models, is located near a GAI homolog and is presented in detail. Overall, this study shows the added value of non-additive modeling of allelic effects for identifying genomic regions that control traits of interest and that could participate in the heterosis observed in hybrids.
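To illustrate what separates additive from non-additive modeling of allelic effects, the sketch below encodes biallelic genotypes in two ways. The abstract does not specify the five models compared, so this is a common textbook encoding used purely as an assumption: the additive code counts copies of one allele, and the dominance code flags heterozygotes. The allele labels "A" and "B" and both function names are hypothetical. The last lines use the panel figures from the abstract (452 hybrids from 36 male and 36 female lines) to show how incomplete the crossing design was.

```python
def additive_code(genotype):
    """Additive covariate: number of copies of allele 'B' (0, 1 or 2)."""
    return genotype.count("B")


def dominance_code(genotype):
    """Non-additive (dominance) covariate: 1 for heterozygotes, else 0."""
    return 1 if len(set(genotype)) == 2 else 0


if __name__ == "__main__":
    for genotype in ("AA", "AB", "BB"):
        print(genotype, additive_code(genotype), dominance_code(genotype))

    # Panel size taken from the abstract: 452 hybrids out of 36 x 36 possible crosses.
    possible_crosses = 36 * 36
    print(f"crossing completeness: {452 / possible_crosses:.1%}")
```

In a hybrid panel most loci of interest are heterozygous in at least some individuals, which is why a covariate that isolates heterozygotes can capture effects that a purely additive allele count would miss.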
Carter, Tamar E; Boulter, Alexis; Existe, Alexandre; Romain, Jean R; St Victor, Jean Yves; Mulligan, Connie J; Okech, Bernard A
2015-03-01
Antimalarial drugs are a key tool in malaria elimination programs. With the emergence of artemisinin resistance in southeast Asia, an effort to identify molecular markers for surveillance of resistant malaria parasites is underway. Non-synonymous mutations in the kelch propeller domain (K13-propeller) in Plasmodium falciparum have been associated with artemisinin resistance in samples from southeast Asia, but additional studies are needed to characterize this locus in other P. falciparum populations with different levels of artemisinin use. Here, we sequenced the K13-propeller locus in 82 samples from Haiti, where limited government oversight of non-governmental organizations may have resulted in low-level use of artemisinin-based combination therapies. We detected a single-nucleotide polymorphism (SNP) at nucleotide 1,359 in a single isolate. Our results contribute to our understanding of the global genomic diversity of the K13-propeller locus in P. falciparum populations. © The American Society of Tropical Medicine and Hygiene.
Chromosomal arrangement of leghemoglobin genes in soybean.
Lee, J S; Brown, G G; Verma, D P
1983-01-01
A cluster of four different leghemoglobin (Lb) genes was isolated from AluI-HaeIII and EcoRI genomic libraries of soybean in a set of overlapping clones which together include 45 kilobases (kb) of contiguous DNA. These four genes, including a pseudogene, are present in the same orientation and are arranged in the order: 5'-Lba-Lbc1-Lb psi-Lbc3-3'. The intergenic regions average 2.5 kb. In addition to this main Lb locus, there are other Lb genes which do not appear to be contiguous to this locus. A sequence probably common to the 3' region of Lb loci was found flanking the Lbc3 gene. The 3' flanking region of the main Lb locus also contains a sequence that appears to be expressed more abundantly in root tissue. Another sequence which is primarily expressed in root and leaf is found 5' to two Lb loci. Overall, the main leghemoglobin locus is similar in structure to the mammalian globin gene loci. PMID:6310504
Structural polymorphism at LCR and its role in beta-globin gene regulation.
Kukreti, Shrikant; Kaur, Harpreet; Kaushik, Mahima; Bansal, Aparna; Saxena, Sarika; Kaushik, Shikha; Kukreti, Ritushree
2010-09-01
Information on the secondary structures and conformational manifestations of eukaryotic DNA and their biological significance with reference to gene regulation and expression is limited. The human beta-globin gene Locus Control Region (LCR), a dominant regulator of globin gene expression, is a contiguous piece of DNA with five tissue-specific DNase I-hypersensitive sites (HSs). Since these HSs have a high density of transcription factor binding sites, structural interdependencies between HSs and different promoters may directly or indirectly regulate LCR functions. Mutations and SNPs may stabilize or destabilize the local secondary structures, affecting the gene expression by changes in the protein-DNA recognition patterns. Various palindromic or quasi-palindromic segments within LCR, could cause structural polymorphism and geometrical switching of DNA. This emphasizes the importance of understanding of the sequence-dependent variations of the DNA structure. Such structural motifs might act as regulatory elements. The local conformational variability of a DNA segment or action of a DNA specific protein is key to create and maintain active chromatin domains and affect transcription of various tissue specific beta-globin genes. We, summarize here the current status of beta-globin LCR structure and function. Further structural studies at molecular level and functional genomics might solve the regulatory puzzles that control the beta-globin gene locus. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.
Locus of Control and Sex Differences in Performance on an Instructional Task.
ERIC Educational Resources Information Center
Holloway, Richard L.; Robinson, Beatrice
1979-01-01
Used locus of control, ability, sex, task selection, task structure, and recall in a regression model to predict affective response to type of instruction of 104 high school seniors. Results showed a main effect for recall, and interaction effects for recall x sex and recall x ability. References are listed. (Author/JEG)
Nicolas, Laura; Cols, Montserrat; Choi, Jee Eun; Chaudhuri, Jayanta; Vuong, Bao
2018-01-01
Adaptive immune responses require the generation of a diverse repertoire of immunoglobulins (Igs) that can recognize and neutralize a seemingly infinite number of antigens. V(D)J recombination creates the primary Ig repertoire, which subsequently is modified by somatic hypermutation (SHM) and class switch recombination (CSR). SHM promotes Ig affinity maturation whereas CSR alters the effector function of the Ig. Both SHM and CSR require activation-induced cytidine deaminase (AID) to produce dU:dG mismatches in the Ig locus that are transformed into untemplated mutations in variable coding segments during SHM or DNA double-strand breaks (DSBs) in switch regions during CSR. Within the Ig locus, DNA repair pathways are diverted from their canonical role in maintaining genomic integrity to permit AID-directed mutation and deletion of gene coding segments. Recently identified proteins, genes, and regulatory networks have provided new insights into the temporally and spatially coordinated molecular interactions that control the formation and repair of DSBs within the Ig locus. Unravelling the genetic program that allows B cells to selectively alter the Ig coding regions while protecting non-Ig genes from DNA damage advances our understanding of the molecular processes that maintain genomic integrity as well as humoral immunity. PMID:29744038
Xie, Zicong; Pang, Daxin; Wang, Kankan; Li, Mengjing; Guo, Nannan; Yuan, Hongming; Li, Jianing; Zou, Xiaodong; Jiao, Huping; Ouyang, Hongsheng; Li, Zhanjun; Tang, Xiaochun
2017-06-08
Genetically modified pigs have important roles in agriculture and biomedicine. However, genome-specific knock-in techniques in pigs are still in their infancy and optimal strategies have not been extensively investigated. In this study, we performed electroporation to introduce a targeting donor vector (a non-linearized vector that did not contain a promoter or selectable marker) into Porcine Foetal Fibroblasts (PFFs) along with a CRISPR/Cas9 vector. After optimization, the efficiency of the EGFP site-specific knock-in reached up to 29.6% at the pRosa26 locus in PFFs. Next, we used the EGFP reporter PFFs to address two key conditions in the process of achieving transgenic pigs, the limiting dilution method and the strategy to evaluate the safety and feasibility of the knock-in locus. This study establishes an efficient procedure for exogenous gene knock-in and creates a platform to efficiently generate promoter-less and selectable marker-free transgenic PFFs through the CRISPR/Cas9 system. This study should contribute to the generation of promoter-less and selectable marker-free transgenic pigs and it may provide insights into sophisticated site-specific genome engineering techniques for additional species.
Convergent evolution in the genetic basis of Müllerian mimicry in heliconius butterflies.
Baxter, Simon W; Papa, Riccardo; Chamberlain, Nicola; Humphray, Sean J; Joron, Mathieu; Morrison, Clay; ffrench-Constant, Richard H; McMillan, W Owen; Jiggins, Chris D
2008-11-01
The neotropical butterflies Heliconius melpomene and H. erato are Müllerian mimics that display the same warningly colored wing patterns in local populations, yet pattern diversity between geographic regions. Linkage mapping has previously shown convergent red wing phenotypes in these species are controlled by loci on homologous chromosomes. Here, AFLP bulk segregant analysis using H. melpomene crosses identified genetic markers tightly linked to two red wing-patterning loci. These markers were used to screen a H. melpomene BAC library and a tile path was assembled spanning one locus completely and part of the second. Concurrently, a similar strategy was used to identify a BAC clone tightly linked to the locus controlling the mimetic red wing phenotypes in H. erato. A methionine rich storage protein (MRSP) gene was identified within this BAC clone, and comparative genetic mapping shows red wing color loci are in homologous regions of the genome of H. erato and H. melpomene. Subtle differences in these convergent phenotypes imply they evolved independently using somewhat different developmental routes, but are nonetheless regulated by the same switch locus. Genetic mapping of MRSP in a third related species, the "tiger" patterned H. numata, has no association with wing patterning and shows no evidence for genomic translocation of wing-patterning loci.
Downing, Chris; Johnson, Thomas E; Larson, Colin; Leakey, Tatiana I; Siegfried, Rachel N; Rafferty, Tonya M; Cooney, Craig A
2010-01-01
C57BL/6J (B6) mice are susceptible to in utero growth retardation and a number of morphological malformations following prenatal alcohol exposure, while DBA/2J (D2) mice are relatively resistant. We have previously shown that genomic imprinting may play a role in differential sensitivity between B6 and D2 (Downing and Gilliam 1999). The best characterized mechanism mediating genomic imprinting is differential DNA methylation. In the present study we examined DNA methylation and gene expression, in both embryonic and placental tissue, at the mouse Igf2 locus following in utero ethanol exposure. We also examined the effects of a methyl-supplemented diet on methylation and ethanol teratogenesis. In embryos from susceptible B6 mice, we found small decreases in DNA methylation at four CpG sites in one of the differentially methylated regions of the Igf2 locus; only one of the four sites showed a statistically significant decrease. We observed no significant decreases in methylation in placentae. All Igf2 transcripts showed approximately 1.5 fold decreases following intrauterine alcohol exposure. Placing dams on a methyl-supplemented diet before pregnancy and throughout gestation brought methylation back up to control levels. Methyl-supplementation also resulted in lower prenatal mortality, greater prenatal growth, and decreased digit malformations; it dramatically reduced vertebral malformations. Thus, while prenatal alcohol had only small effects on DNA methylation at the Igf2 locus, placing dams on a methyl-supplemented diet partially ameliorated ethanol teratogenesis. PMID:20705422
Huson, Heather J.; Kim, Eui-Soo; Godfrey, Robert W.; Olson, Timothy A.; McClure, Matthew C.; Chase, Chad C.; Rizzi, Rita; O'Brien, Ana M. P.; Van Tassell, Curt P.; Garcia, José F.; Sonstegard, Tad S.
2014-01-01
The slick hair coat (SLICK) is a dominantly inherited trait typically associated with tropically adapted cattle that are from Criollo descent through Spanish colonization of cattle into the New World. The trait is of interest relative to climate change, due to its association with improved thermo-tolerance and subsequent increased productivity. Previous studies localized the SLICK locus to a 4 cM region on chromosome (BTA) 20 and identified signatures of selection in this region derived from Senepol cattle. The current study compares three slick-haired Criollo-derived breeds including Senepol, Carora, and Romosinuano and three additional slick-haired cross-bred lineages to non-slick ancestral breeds. Genome-wide association (GWA), haplotype analysis, signatures of selection, runs of homozygosity (ROH), and identity by state (IBS) calculations were used to identify a 0.8 Mb (37.7–38.5 Mb) consensus region for the SLICK locus on BTA20 that contains SKP2 and SPEF2 as possible candidate genes. Three specific haplotype patterns are identified in slick individuals, all with zero frequency in non-slick individuals. Admixture analysis identified common genetic patterns between the three slick breeds at the SLICK locus. Principal component analysis (PCA) and admixture results show Senepol and Romosinuano sharing a higher degree of genetic similarity to one another with a much lesser degree of similarity to Carora. Variation in GWA, haplotype analysis, and IBS calculations with accompanying population structure information supports potentially two mutations, one common to Senepol and Romosinuano and another in Carora, affecting genes contained within our refined location for the SLICK locus. PMID:24808908
Literature-Based Gene Curation and Proposed Genetic Nomenclature for Cryptococcus
Inglis, Diane O.; Skrzypek, Marek S.; Liaw, Edward; Moktali, Venkatesh; Sherlock, Gavin
2014-01-01
Cryptococcus, a major cause of disseminated infections in immunocompromised patients, kills over 600,000 people per year worldwide. Genes involved in the virulence of the meningitis-causing fungus are being characterized at an increasing rate, and to date, at least 648 Cryptococcus gene names have been published. However, these data are scattered throughout the literature and are challenging to find. Furthermore, conflicts in locus identification exist, so that named genes have been subsequently published under new names or names associated with one locus have been used for another locus. To avoid these conflicts and to provide a central source of Cryptococcus gene information, we have collected all published Cryptococcus gene names from the scientific literature and associated them with standard Cryptococcus locus identifiers and have incorporated them into FungiDB (www.fungidb.org). FungiDB is a panfungal genome database that collects gene information and functional data and provides search tools for 61 species of fungi and oomycetes. We applied these published names to a manually curated ortholog set of all Cryptococcus species currently in FungiDB, including Cryptococcus neoformans var. neoformans strains JEC21 and B-3501A, C. neoformans var. grubii strain H99, and Cryptococcus gattii strains R265 and WM276, and have written brief descriptions of their functions. We also compiled a protocol for gene naming that summarizes guidelines proposed by members of the Cryptococcus research community. The centralization of genomic and literature-based information for Cryptococcus at FungiDB will help researchers communicate about genes of interest, such as those related to virulence, and will further facilitate research on the pathogen. PMID:24813190
Genomic Rearrangements in Arabidopsis Considered as Quantitative Traits.
Imprialou, Martha; Kahles, André; Steffen, Joshua G; Osborne, Edward J; Gan, Xiangchao; Lempe, Janne; Bhomra, Amarjit; Belfield, Eric; Visscher, Anne; Greenhalgh, Robert; Harberd, Nicholas P; Goram, Richard; Hein, Jotun; Robert-Seilaniantz, Alexandre; Jones, Jonathan; Stegle, Oliver; Kover, Paula; Tsiantis, Miltos; Nordborg, Magnus; Rätsch, Gunnar; Clark, Richard M; Mott, Richard
2017-04-01
To understand the population genetics of structural variants and their effects on phenotypes, we developed an approach to mapping structural variants that segregate in a population sequenced at low coverage. We avoid calling structural variants directly. Instead, the evidence for a potential structural variant at a locus is indicated by variation in the counts of short-reads that map anomalously to that locus. These structural variant traits are treated as quantitative traits and mapped genetically, analogously to a gene expression study. Association between a structural variant trait at one locus, and genotypes at a distant locus indicate the origin and target of a transposition. Using ultra-low-coverage (0.3×) population sequence data from 488 recombinant inbred Arabidopsis thaliana genomes, we identified 6502 segregating structural variants. Remarkably, 25% of these were transpositions. While many structural variants cannot be delineated precisely, we validated 83% of 44 predicted transposition breakpoints by polymerase chain reaction. We show that specific structural variants may be causative for quantitative trait loci for germination and resistance to infection by the fungus Albugo laibachii , isolate Nc14. Further we show that the phenotypic heritability attributable to read-mapping anomalies differs from, and, in the case of time to germination and bolting, exceeds that due to standard genetic variation. Genes within structural variants are also more likely to be silenced or dysregulated. This approach complements the prevalent strategy of structural variant discovery in fewer individuals sequenced at high coverage. It is generally applicable to large populations sequenced at low-coverage, and is particularly suited to mapping transpositions. Copyright © 2017 by the Genetics Society of America.
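The core idea described above, treating anomalous read counts at a locus as a quantitative trait and testing it against genotypes elsewhere in the genome, can be sketched in a few lines. This is a deliberately simplified illustration and not the authors' pipeline: it uses a plain Pearson correlation in place of a proper mapping model, and the marker names, read counts and genotypes are all made-up toy data.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant vector carries no association signal
    return sxy / sqrt(sxx * syy)


def best_marker(trait, genotypes):
    """Return (marker, r) for the marker most strongly correlated with the trait."""
    scores = {marker: pearson(calls, trait) for marker, calls in genotypes.items()}
    marker = max(scores, key=lambda m: abs(scores[m]))
    return marker, scores[marker]


# Hypothetical toy data: counts of anomalously mapped reads at one locus
# in eight inbred lines, and 0/1 parental genotypes at two markers.
anomalous_reads = [0, 5, 4, 0, 6, 1, 0, 5]
genotypes = {
    "chr1_marker": [0, 1, 1, 0, 1, 0, 0, 1],
    "chr4_marker": [0, 0, 1, 1, 0, 1, 0, 1],
}

if __name__ == "__main__":
    marker, r = best_marker(anomalous_reads, genotypes)
    print(f"strongest association: {marker} (r = {r:.2f})")
```

If the best-associated marker lies far from the locus where the anomalous reads map, that pattern is the kind of signal the abstract interprets as the origin and target of a transposition.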
Eggermann, Thomas; Heilsberg, Ann-Kathrin; Bens, Susanne; Siebert, Reiner; Beygo, Jasmin; Buiting, Karin; Begemann, Matthias; Soellner, Lukas
2014-07-01
The chromosomal region 11p15 contains two imprinting control regions (ICRs) and is a key player in molecular processes regulated by genomic imprinting. Genomic as well as epigenetic changes affecting 11p15 are associated either with Silver-Russell syndrome (SRS) or Beckwith-Wiedemann syndrome (BWS). In recent years, a growing number of patients affected by imprinting disorders (IDs) have been reported to carry the disease-specific 11p15 hypomethylation patterns as well as methylation changes at imprinted loci at other chromosomal sites (multi-locus methylation defects, MLMD). Furthermore, in several patients, molecular alterations (e.g., uniparental disomies, UPDs) additional to the primary epimutations have been reported. To determine the frequency and distribution of mutations and epimutations in patients referred as SRS or BWS for genetic testing, we retrospectively ascertained our routine patient cohort consisting of 711 patients (SRS, n = 571; BWS, n = 140). As this cohort represents the typical cohort in a routine diagnostic lab without clinical preselection, the detection rates were much lower than those reported from clinically characterized cohorts in the literature (SRS, 19.9%; BWS, 28.6%). Among the molecular subgroups known to be predisposed to MLMD, the frequencies corresponded to those in the literature (SRS, 7.1% in ICR1 hypomethylation carriers; BWS, 20.8% in ICR2 hypomethylation patients). In several patients, more than one epigenetic or genetic disturbance could be identified. Our study illustrates that the complex molecular alterations as well as the overlapping and sometimes unusual clinical findings in patients with IDs often make the decision for a specific imprinting disorder test difficult.
We therefore suggest implementing molecular assays in routine ID diagnostics which allow the detection of a broad range of (epi)mutation types (epimutations, UPDs, chromosomal imbalances) and cover the clinically most relevant known ID loci because of the following: (a) Multi-locus tests increase the detection rates as they cover numerous loci. (b) Patients with unexpected molecular alterations are detected. (c) The testing of rare imprinting disorders becomes more efficient and quality of molecular diagnosis increases. (d) The tests identify MLMDs. In the future, the detailed characterization of clinical and molecular findings in ID patients will help us to decipher the complex regulation of imprinting and thereby provide the basis for more directed genetic counseling and therapeutic management in IDs. Molecular disturbances in patients with imprinting disorders are often not restricted to the disease-specific locus but also affect other chromosomal regions. These additional disturbances include methylation defects, uniparental disomies as well as chromosomal imbalances. The identification of these additional alterations is mandatory for well-directed genetic counseling. Furthermore, these findings help to decipher the complex regulation of imprinting.
Tonomura, Noriko; Elvers, Ingegerd; Thomas, Rachael; Megquier, Kate; Turner-Maier, Jason; Howald, Cedric; Sarver, Aaron L.; Swofford, Ross; Frantz, Aric M.; Ito, Daisuke; Mauceli, Evan; Arendt, Maja; Noh, Hyun Ji; Koltookian, Michele; Biagi, Tara; Fryc, Sarah; Williams, Christina; Avery, Anne C.; Kim, Jong-Hyuk; Barber, Lisa; Burgess, Kristine; Lander, Eric S.; Karlsson, Elinor K.; Azuma, Chieko
2015-01-01
Dogs, with their breed-determined limited genetic background, are great models of human disease including cancer. Canine B-cell lymphoma and hemangiosarcoma are both malignancies of the hematologic system that are clinically and histologically similar to human B-cell non-Hodgkin lymphoma and angiosarcoma, respectively. Golden retrievers in the US show significantly elevated lifetime risk for both B-cell lymphoma (6%) and hemangiosarcoma (20%). We conducted genome-wide association studies for hemangiosarcoma and B-cell lymphoma, identifying two shared predisposing loci. The two associated loci are located on chromosome 5, and together contribute ~20% of the risk of developing these cancers. Genome-wide p-values for the top SNP of each locus are 4.6×10^-7 and 2.7×10^-6, respectively. Whole genome resequencing of nine cases and controls followed by genotyping and detailed analysis identified three shared and one B-cell lymphoma specific risk haplotypes within the two loci, but no coding changes were associated with the risk haplotypes. Gene expression analysis of B-cell lymphoma tumors revealed that carrying the risk haplotypes at the first locus is associated with down-regulation of several nearby genes including the proximal gene TRPC6, a transient receptor Ca2+-channel involved in T-cell activation, among other functions. The shared risk haplotype in the second locus overlaps the vesicle transport and release gene STX8. Carrying the shared risk haplotype is associated with gene expression changes of 100 genes enriched for pathways involved in immune cell activation. Thus, the predisposing germ-line mutations in B-cell lymphoma and hemangiosarcoma appear to be regulatory, and affect pathways involved in T-cell mediated immune response in the tumor. This suggests that the interaction between the immune system and malignant cells plays a common role in the tumorigenesis of these relatively different cancers. PMID:25642983
Rosato, Marcela; Kovařík, Aleš; Garilleti, Ricardo; Rosselló, Josep A.
2016-01-01
Genes encoding ribosomal RNA (rDNA) are universal key constituents of eukaryotic genomes, and the nuclear genome harbours hundreds to several thousand copies in each species. Knowledge about the number of rDNA loci and gene copy number provides information for comparative studies of organismal and molecular evolution at various phylogenetic levels. With the exception of seed plants, the range of 45S rDNA locus (encoding 18S, 5.8S and 26S rRNA) and gene copy number variation within key evolutionary plant groups is largely unknown. This is especially true for the three earliest land plant lineages Marchantiophyta (liverworts), Bryophyta (mosses), and Anthocerotophyta (hornworts). In this work, we report the extent of rDNA variation in early land plants, assessing the number of 45S rDNA loci and gene copy number in 106 species and 25 species, respectively, of mosses, liverworts and hornworts. Unexpectedly, the results show a narrow range of ribosomal locus variation (one or two 45S rDNA loci) and gene copies not present in vascular plant lineages, where a wide spectrum is recorded. Mutation analysis of whole genomic reads showed higher (3-fold) intragenomic heterogeneity of Marchantia polymorpha (Marchantiophyta) rDNA compared to Physcomitrella patens (Bryophyta) and two angiosperms (Arabidopsis thaliana and Nicotiana tomentosiformis), suggesting the presence of rDNA pseudogenes in its genome. No association between phylogenetic position, taxonomic adscription and the number of rDNA loci and gene copy number was found. Our results suggest a likely evolutionary rDNA stasis during land colonisation and diversification across 480 myr of bryophyte evolution. We hypothesise that strong selection forces may be acting against ribosomal gene locus amplification. Despite showing a predominant haploid phase and infrequent meiosis, overall rDNA homogeneity is not severely compromised in bryophytes. PMID:27622766
Zhang, Han; Rokas, Antonis; Slot, Jason C.
2012-01-01
Background Dermatophyte fungi of the family Arthrodermataceae (Eurotiomycetes) colonize keratinized tissue, such as skin, frequently causing superficial mycoses in humans and other mammals, reptiles, and birds. Competition with native microflora likely underlies the propensity of these dermatophytes to produce a diversity of antibiotics and compounds for scavenging iron, which is extremely scarce, as well as the presence of an unusually large number of putative secondary metabolism gene clusters, most of which contain non-ribosomal peptide synthetases (NRPS), in their genomes. To better understand the historical origins and diversification of NRPS-containing gene clusters we examined the evolution of a variable locus (VL) that exists in one of three alternative conformations among the genomes of seven dermatophyte species. Results The first conformation of the VL (termed VLA) contains only 539 base pairs of sequence and lacks protein-coding genes, whereas the other two conformations (termed VLB and VLC) span 36 Kb and 27 Kb and contain 12 and 10 genes, respectively. Interestingly, both VLB and VLC appear to contain distinct secondary metabolism gene clusters; VLB contains a NRPS gene as well as four porphyrin metabolism genes never found to be physically linked in the genomes of 128 other fungal species, whereas VLC also contains a NRPS gene as well as several others typically found associated with secondary metabolism gene clusters. Phylogenetic evidence suggests that the VL locus was present in the ancestor of all seven species achieving its present distribution through subsequent differential losses or retentions of specific conformations. Conclusions We propose that the existence of variable loci, similar to the one we studied, in fungal genomes could potentially explain the dramatic differences in secondary metabolic diversity between closely related species of filamentous fungi, and contribute to host adaptation and the generation of metabolic diversity. 
PMID:22860027
Chesi, Alessandra; Mitchell, Jonathan A; Kalkwarf, Heidi J; Bradfield, Jonathan P; Lappe, Joan M; McCormack, Shana E; Gilsanz, Vicente; Oberfield, Sharon E; Hakonarson, Hakon; Shepherd, John A; Kelly, Andrea; Zemel, Babette S; Grant, Struan F A
2015-09-01
Childhood fractures are common, with the forearm being the most common site. Genome-wide association studies (GWAS) have identified more than 60 loci associated with bone mineral density (BMD) in adults but less is known about genetic influences specific to bone in childhood. To identify novel genetic factors that influence pediatric bone strength at a common site for childhood fractures, we performed a sex-stratified trans-ethnic genome-wide association study of areal BMD (aBMD) and bone mineral content (BMC) Z-scores measured by dual energy X-ray absorptiometry at the one-third distal radius, in a cohort of 1399 children without clinical abnormalities in bone health. We tested signals with P < 5 × 10^-6 for replication in an independent, same-age cohort of 486 Caucasian children. Two loci yielded a genome-wide significant combined P-value: rs7797976 within CPED1 in females [P = 2.4 × 10^-11, β = -0.30 standard deviations (SD) per T allele; aBMD-Z] and rs7035284 at 9p21.3 in males (P = 1.2 × 10^-8, β = 0.28 SD per G allele; BMC-Z). Signals at the CPED1-WNT16-FAM3C locus have been previously associated with BMD at other skeletal sites in adults and children. Our result at the distal radius underscores the importance of this locus at multiple skeletal sites. The 9p21.3 locus is within a gene desert, with the nearest gene flanking each side being MIR31HG and MTAP, neither of which has been implicated in BMD or BMC previously. These findings suggest that genetic determinants of childhood bone accretion at the radius, a skeletal site that is primarily cortical bone, exist and also differ by sex. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Lieb, Wolfgang; Chen, Ming-Huei; Teumer, Alexander; de Boer, Rudolf A.; Lin, Honghuang; Fox, Ervin R.; Musani, Solomon K.; Wilson, James G.; Wang, Thomas J.; Völzke, Henry; Petersen, Ann-Kristin; Meisinger, Christine; Nauck, Matthias; Schlesinger, Sabrina; Li, Yong; Menard, Jöel; Hercberg, Serge; Wichmann, H.-Erich; Völker, Uwe; Rawal, Rajesh; Bidlingmaier, Martin; Hannemann, Anke; Dörr, Marcus; Rettig, Rainer; van Gilst, Wiek H.; van Veldhuisen, Dirk J.; Bakker, Stephan J.L.; Navis, Gerjan; Wallaschofski, Henri; Meneton, Pierre; van der Harst, Pim; Reincke, Martin; Vasan, Ramachandran S.; Consortium, CKDGen
2015-01-01
Background The renin-angiotensin-aldosterone-system (RAAS) is critical for regulation of blood pressure and fluid balance and influences cardiovascular remodeling. Dysregulation of the RAAS contributes to cardiovascular and renal morbidity. The genetic architecture of circulating RAAS components is incompletely understood. Methods and Results We meta-analyzed genome-wide association data for plasma renin activity (n=5,275), plasma renin concentrations (n=8,014) and circulating aldosterone (n=13,289) from up to four population-based cohorts of European and European-American ancestry, and assessed replication of the top results in an independent sample (n=6,487). Single nucleotide polymorphisms (SNPs) in two independent loci displayed associations with plasma renin activity at genome-wide significance (p<5×10^-8). A third locus was close to this threshold (rs4253311 in kallikrein B [KLKB1], p=5.5×10^-8). Two of these loci replicated in an independent sample for both plasma renin and aldosterone concentrations (SNP rs5030062 in kininogen 1 [KNG1]: p=0.001 for plasma renin, p=0.024 for plasma aldosterone concentration; rs4253311 with p<0.001 for both plasma renin and aldosterone concentration). SNPs in the NEBL gene reached genome-wide significance for plasma renin concentration in the discovery sample (top SNP rs3915911, p=8.81×10^-9), but did not replicate (p=0.81). No locus reached genome-wide significance for aldosterone. SNPs rs5030062 and rs4253311 were not related to blood pressure or renal traits; in a companion study, variants in the kallikrein B locus were associated with B-type natriuretic peptide concentrations in African-Americans. Conclusions We identified two genetic loci (kininogen 1 and kallikrein B) influencing key components of the RAAS, consistent with the close interrelation between the kallikrein-kinin system and the RAAS. PMID:25477429
Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies.
Card, Daren C; Schield, Drew R; Reyes-Velasco, Jacobo; Fujita, Matthew K; Andrew, Audra L; Oyler-McCance, Sara J; Fike, Jennifer A; Tomback, Diana F; Ruggiero, Robert P; Castoe, Todd A
2014-01-01
As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5-5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.
Production of genome-edited pluripotent stem cells and mice by CRISPR/Cas.
Horii, Takuro; Hatada, Izuho
2016-01-01
Clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) nucleases, together known as CRISPR/Cas, were recently developed as an epoch-making genome engineering technology. This system requires only Cas9 nuclease and a single-guide RNA complementary to a target locus. CRISPR/Cas enables the generation of knockout cells and animals in a single step. This system can also be used to generate multiple mutations and knockins in a single step, which is not possible using other methods. In this review, we provide an overview of genome editing by CRISPR/Cas in pluripotent stem cells and mice.
Bolland, Daniel J; Wood, Andrew L; Corcoran, Anne E
2009-01-01
V(D)J recombination in lymphocytes is the cutting and pasting together of antigen receptor genes in cis to generate the enormous variety of coding sequences required to produce diverse antigen receptor proteins. It is central to the adaptive immune response, which must potentially combat millions of different foreign antigens. Most antigen receptor loci have evolved to be extremely large and contain multiple individual V, D and J genes. The immunoglobulin heavy chain (Igh) and immunoglobulin kappa light chain (Igk) loci are the largest multigene loci in the mammalian genome and V(D)J recombination is one of the most complicated genetic processes in the nucleus. The challenge for the appropriate lymphocyte is one of macro-management: to make all of the antigen receptor genes in a particular locus available for recombination at the appropriate developmental time-point. Conversely, these large loci must be kept closed in lymphocytes in which they do not normally recombine, to guard against genomic instability generated by the DNA double-strand breaks inherent to the V(D)J recombination process. To meet all of these demanding criteria, V(D)J recombination is regulated at numerous levels. It is restricted to lymphocytes since the Rag genes, which control the DNA double-strand break step of recombination, are only expressed in these cells. Within the lymphocyte lineage, immunoglobulin recombination is restricted to B-lymphocytes and TCR recombination to T-lymphocytes by regulation of locus accessibility, which occurs at multiple levels. Accessibility of recombination signal sequences (RSSs) flanking individual V, D and J genes at the nucleosomal level is the key micro-management mechanism, which is discussed in greater detail in other chapters. This chapter will explore how the antigen receptor loci are regulated as a whole, focussing on the Igh locus as a paradigm for the mechanisms involved.
Numerous recent studies have begun to unravel the complex and complementary processes involved in this large-scale locus organisation. We will examine the structure of the Igh locus and the large-scale and higher-order chromatin remodelling processes associated with V(D)J recombination, at the level of the locus itself, its conformational changes and its dynamic localisation within the nucleus.
Revealing the missing expressed genes beyond the human reference genome by RNA-Seq.
Chen, Geng; Li, Ruiyuan; Shi, Leming; Qi, Junyi; Hu, Pengzhan; Luo, Jian; Liu, Mingyao; Shi, Tieliu
2011-12-02
A complete and accurate human reference genome is important for functional genomics research. Therefore, the incomplete reference genome and individual-specific sequences have significant effects on various studies. We used two RNA-Seq datasets from human brain tissues and 10 mixed cell lines to investigate the completeness of the human reference genome. First, we demonstrated that in previously identified ~5 Mb Asian and ~5 Mb African novel sequences that are absent from the human reference genome of NCBI build 36, ~211 kb and ~201 kb of them could be transcribed, respectively. Our results suggest that many of those transcribed regions are not specific to Asian and African individuals, but are also present in Caucasian individuals. Then, we found that the expression levels of 104 RefSeq genes that are unalignable to NCBI build 37 are higher than 0.1 RPKM in brain and cell lines. 55 of them are conserved across human, chimpanzee and macaque, suggesting that there are still a significant number of functional human genes absent from the human reference genome. Moreover, we identified hundreds of novel transcript contigs that cannot be aligned to NCBI build 37, RefSeq genes and EST sequences. Some of those novel transcript contigs are also conserved among human, chimpanzee and macaque. By positioning those contigs onto the human genome, we identified several large deletions in the reference genome. Several conserved novel transcript contigs were further validated by RT-PCR. Our findings demonstrate that a significant number of genes are still absent from the incomplete human reference genome, highlighting the importance of further refining the human reference genome and curating those missing genes. Our study also shows the importance of de novo transcriptome assembly. The comparative approach between reference genome and other related human genomes based on the transcriptome provides an alternative way to refine the human reference genome.
Min, Xiang Jia
2013-01-01
Expressed Sequence Tags (ESTs) are a rich resource for identifying Alternatively Spliced (AS) genes. The ASFinder webserver is designed to identify AS isoforms from EST-derived sequences. Two approaches are implemented in ASFinder. If no genomic sequences are provided, the server performs a local BLASTN to identify AS isoforms from ESTs having both ends aligned but an internal segment unaligned. Otherwise, ASFinder uses SIM4 to map ESTs to the genome; overlapping ESTs that map to the same genomic locus and have variable internal exon/intron boundaries are then identified as AS isoforms. The tool is available at http://proteomics.ysu.edu/tools/ASFinder.html.
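The genome-based criterion described above (overlapping ESTs at one locus whose internal exon/intron boundaries differ) can be sketched as follows. This is not ASFinder's code: it assumes each EST alignment has already been reduced to a sorted list of exon (start, end) blocks, and the function names and coordinates are invented for illustration. Parsing of SIM4 output is not shown.

```python
def introns(exons):
    """Introns implied by consecutive exon blocks, as (donor, acceptor) pairs."""
    return {(a_end, b_start) for (_, a_end), (b_start, _) in zip(exons, exons[1:])}

def is_as_pair(exons_a, exons_b):
    """True if two mapped ESTs overlap and disagree on an intron inside the shared span."""
    lo = max(exons_a[0][0], exons_b[0][0])
    hi = min(exons_a[-1][1], exons_b[-1][1])
    if lo >= hi:
        return False  # no genomic overlap, so the pair cannot be compared

    def inside(intron_set):
        # Keep only introns that lie entirely within the region both ESTs cover.
        return {(d, a) for d, a in intron_set if d >= lo and a <= hi}

    return inside(introns(exons_a)) != inside(introns(exons_b))

# Hypothetical alignments on one chromosome (coordinates are made up).
est_full = [(100, 200), (300, 400), (500, 600)]   # three exons
est_skip = [(100, 200), (500, 600)]               # middle exon skipped
est_part = [(150, 200), (300, 400)]               # partial read, same splicing
est_far = [(700, 800)]                            # different locus

skip_is_as = is_as_pair(est_full, est_skip)
part_is_as = is_as_pair(est_full, est_part)
far_is_as = is_as_pair(est_full, est_far)
```

Restricting the comparison to the shared span keeps a shorter EST that simply ends early from being flagged as a distinct isoform; only the exon-skipping pair is reported here.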
Chromatin immunoprecipitation of mouse embryos.
Voss, Anne K; Dixon, Mathew P; McLennan, Tamara; Kueh, Andrew J; Thomas, Tim
2012-01-01
During prenatal development, a large number of different cell types are formed, the vast majority of which contain identical genetic material. The basis of the great variety in cell phenotype and function is the differential expression of the approximately 25,000 genes in the mammalian genome. Transcriptional activity is regulated at many levels by proteins, including members of the basal transcriptional apparatus, DNA-binding transcription factors, and chromatin-binding proteins. Importantly, chromatin structure dictates the availability of a specific genomic locus for transcriptional activation as well as the efficiency with which transcription can occur. Chromatin immunoprecipitation (ChIP) is a method to assess whether chromatin modifications or proteins are present at a specific locus. ChIP involves the cross-linking of DNA and associated proteins and immunoprecipitation using specific antibodies to DNA-associated proteins, followed by examination of the co-precipitated DNA sequences or proteins. In the last few years, ChIP has become an essential technique for scientists studying transcriptional regulation and chromatin structure. Using ChIP on mouse embryos, we can document the presence or absence of specific proteins and chromatin modifications at genomic loci in vivo during mammalian development. Here, we describe a ChIP technique adapted for mouse embryos.
2013-01-01
Background Molecular diagnostics can resolve locus heterogeneity underlying clinical phenotypes that may otherwise be co-assigned as a specific syndrome based on shared clinical features, and can associate phenotypically diverse diseases to a single locus through allelic affinity. Here we describe an apparently novel syndrome, likely caused by de novo truncating mutations in ASXL3, which shares characteristics with Bohring-Opitz syndrome, a disease associated with de novo truncating mutations in ASXL1. Methods We used whole-genome and whole-exome sequencing to interrogate the genomes of four subjects with an undiagnosed syndrome. Results Using genome-wide sequencing, we identified heterozygous, de novo truncating mutations in ASXL3, a transcriptional repressor related to ASXL1, in four unrelated probands. We found that these probands shared similar phenotypes, including severe feeding difficulties, failure to thrive, and neurologic abnormalities with significant developmental delay. Further, they showed less phenotypic overlap with patients who had de novo truncating mutations in ASXL1. Conclusion We have identified truncating mutations in ASXL3 as the likely cause of a novel syndrome with phenotypic overlap with Bohring-Opitz syndrome. PMID:23383720
He, Xiangjun; Tan, Chunlai; Wang, Feng; Wang, Yaofeng; Zhou, Rui; Cui, Dexuan; You, Wenxing; Zhao, Hui; Ren, Jianwei; Feng, Bo
2016-01-01
CRISPR/Cas9-induced site-specific DNA double-strand breaks (DSBs) can be repaired by homology-directed repair (HDR) or non-homologous end joining (NHEJ) pathways. Extensive efforts have been made to knock in exogenous DNA at a selected genomic locus in human cells; these efforts, however, have focused on HDR-based strategies and have proven inefficient. Here, we report that the NHEJ pathway mediates efficient rejoining of genome and plasmids following CRISPR/Cas9-induced DNA DSBs, and promotes high-efficiency DNA integration in various human cell types. With this homology-independent knock-in strategy, integration of a 4.6 kb promoterless ires-eGFP fragment into the GAPDH locus yielded up to 20% GFP+ cells in somatic LO2 cells, and 1.70% GFP+ cells in human embryonic stem cells (ESCs). Quantitative comparison further demonstrated that the NHEJ-based knock-in is more efficient than HDR-mediated gene targeting in all human cell types examined. These data indicate that CRISPR/Cas9-induced NHEJ provides a valuable new path for efficient genome editing in human ESCs and somatic cells. PMID:26850641
He, Xiangjun; Tan, Chunlai; Wang, Feng; Wang, Yaofeng; Zhou, Rui; Cui, Dexuan; You, Wenxing; Zhao, Hui; Ren, Jianwei; Feng, Bo
2016-05-19
CRISPR/Cas9-induced site-specific DNA double-strand breaks (DSBs) can be repaired by homology-directed repair (HDR) or non-homologous end joining (NHEJ) pathways. Extensive efforts have been made to knock in exogenous DNA at a selected genomic locus in human cells; these efforts, however, have focused on HDR-based strategies and have proven inefficient. Here, we report that the NHEJ pathway mediates efficient rejoining of genome and plasmids following CRISPR/Cas9-induced DNA DSBs, and promotes high-efficiency DNA integration in various human cell types. With this homology-independent knock-in strategy, integration of a 4.6 kb promoterless ires-eGFP fragment into the GAPDH locus yielded up to 20% GFP+ cells in somatic LO2 cells, and 1.70% GFP+ cells in human embryonic stem cells (ESCs). Quantitative comparison further demonstrated that the NHEJ-based knock-in is more efficient than HDR-mediated gene targeting in all human cell types examined. These data indicate that CRISPR/Cas9-induced NHEJ provides a valuable new path for efficient genome editing in human ESCs and somatic cells. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
In vivo genome editing of the albumin locus as a platform for protein replacement therapy
Sharma, Rajiv; Anguela, Xavier M.; Doyon, Yannick; Wechsler, Thomas; DeKelver, Russell C.; Sproul, Scott; Paschon, David E.; Miller, Jeffrey C.; Davidson, Robert J.; Shivak, David; Zhou, Shangzhen; Rieders, Julianne; Gregory, Philip D.; Holmes, Michael C.; Rebar, Edward J.
2015-01-01
Site-specific genome editing provides a promising approach for achieving long-term, stable therapeutic gene expression. Genome editing has been successfully applied in a variety of preclinical models, generally focused on targeting the diseased locus itself; however, limited targeting efficiency or insufficient expression from the endogenous promoter may impede the translation of these approaches, particularly if the desired editing event does not confer a selective growth advantage. Here we report a general strategy for liver-directed protein replacement therapies that addresses these issues: zinc finger nuclease (ZFN)-mediated site-specific integration of therapeutic transgenes within the albumin gene. By using adeno-associated viral (AAV) vector delivery in vivo, we achieved long-term expression of human factors VIII and IX (hFVIII and hFIX) in mouse models of hemophilia A and B at therapeutic levels. By using the same targeting reagents in wild-type mice, lysosomal enzymes were expressed that are deficient in Fabry and Gaucher diseases and in Hurler and Hunter syndromes. The establishment of a universal nuclease-based platform for secreted protein production would represent a critical advance in the development of safe, permanent, and functional cures for diverse genetic and nongenetic diseases. PMID:26297739
Genome-wide association study of handedness excludes simple genetic models
Armour, J AL; Davison, A; McManus, I C
2014-01-01
Handedness is a human behavioural phenotype that appears to be congenital, and is often assumed to be inherited, but for which the developmental origin and underlying causation(s) have been elusive. Models of the genetic basis of variation in handedness have been proposed that fit different features of the observed resemblance between relatives, but none has been decisively tested or a corresponding causative locus identified. In this study, we applied data from well-characterised individuals studied at the London Twin Research Unit. Analysis of genome-wide SNP data from 3940 twins failed to identify any locus associated with handedness at a genome-wide level of significance. The most straightforward interpretation of our analyses is that they exclude the simplest formulations of the 'right-shift' model of Annett and the 'dextral/chance' model of McManus, although more complex modifications of those models are still compatible with our observations. For polygenic effects, our study is inadequately powered to reliably detect alleles with effect sizes corresponding to an odds ratio of 1.2, but should have good power to detect effects at an odds ratio of 2 or more. PMID:24065183
Osipova, Svetlana; Permyakov, Alexey; Permyakova, Marina; Pshenichnikova, Tatyana; Verkhoturov, Vasiliy; Rudikovsky, Alexandr; Rudikovskaya, Elena; Shishparenok, Alexandr; Doroshkov, Alexey; Börner, Andreas
2016-05-01
A quantitative trait locus (QTL) approach was taken to reveal the genetic basis in wheat of traits associated with photosynthesis during a period of exposure to water deficit stress. The performance, with respect to shoot biomass, gas exchange and chlorophyll fluorescence, leaf pigment content and the activity of various ascorbate-glutathione cycle enzymes and catalase, of a set of 80 wheat lines, each containing a single chromosomal segment introgressed from the bread wheat D genome progenitor Aegilops tauschii, was monitored in plants exposed to various water regimes. Four of the seven D genome chromosomes (1D, 2D, 5D, and 7D) carried clusters of both major (LOD >3.0) and minor (LOD between 2.0 and 3.0) QTL. A major QTL underlying the activity of glutathione reductase was located on chromosome 2D, and another, controlling the activity of ascorbate peroxidase, on chromosome 7D. A region of chromosome 2D defined by the microsatellite locus Xgwm539 and a second on chromosome 7D flanked by the marker loci Xgwm1242 and Xgwm44 harbored a number of QTL associated with the water deficit stress response.
Repurposing CRISPR/Cas9 for in situ functional assays.
Malina, Abba; Mills, John R; Cencic, Regina; Yan, Yifei; Fraser, James; Schippers, Laura M; Paquet, Marilène; Dostie, Josée; Pelletier, Jerry
2013-12-01
RNAi combined with next-generation sequencing has proven to be a powerful and cost-effective genetic screening platform in mammalian cells. Still, this technology has its limitations and is incompatible with in situ mutagenesis screens on a genome-wide scale. Using p53 as a proof-of-principle target, we readapted the CRISPR (clustered regularly interspaced short palindromic repeats)/Cas9 (CRISPR associated 9) genome-editing system to demonstrate the feasibility of this methodology for targeted gene disruption positive selection assays. By using novel "all-in-one" lentiviral and retroviral delivery vectors heterologously expressing both a codon-optimized Cas9 and its synthetic guide RNA (sgRNA), we show robust selection for the CRISPR-modified Trp53 locus following drug treatment. Furthermore, by linking Cas9 expression to GFP fluorescence, we use an "all-in-one" system to track disrupted Trp53 in chemoresistant lymphomas in the Eμ-myc mouse model. Deep sequencing analysis of the tumor-derived endogenous Cas9-modified Trp53 locus revealed a wide spectrum of mutants that were enriched with seemingly limited off-target effects. Taken together, these results establish Cas9 genome editing as a powerful and practical approach for positive in situ genetic screens.
Repurposing CRISPR/Cas9 for in situ functional assays
Malina, Abba; Mills, John R.; Cencic, Regina; Yan, Yifei; Fraser, James; Schippers, Laura M.; Paquet, Marilène; Dostie, Josée; Pelletier, Jerry
2013-01-01
RNAi combined with next-generation sequencing has proven to be a powerful and cost-effective genetic screening platform in mammalian cells. Still, this technology has its limitations and is incompatible with in situ mutagenesis screens on a genome-wide scale. Using p53 as a proof-of-principle target, we readapted the CRISPR (clustered regularly interspaced short palindromic repeats)/Cas9 (CRISPR associated 9) genome-editing system to demonstrate the feasibility of this methodology for targeted gene disruption positive selection assays. By using novel “all-in-one” lentiviral and retroviral delivery vectors heterologously expressing both a codon-optimized Cas9 and its synthetic guide RNA (sgRNA), we show robust selection for the CRISPR-modified Trp53 locus following drug treatment. Furthermore, by linking Cas9 expression to GFP fluorescence, we use an “all-in-one” system to track disrupted Trp53 in chemoresistant lymphomas in the Eμ-myc mouse model. Deep sequencing analysis of the tumor-derived endogenous Cas9-modified Trp53 locus revealed a wide spectrum of mutants that were enriched with seemingly limited off-target effects. Taken together, these results establish Cas9 genome editing as a powerful and practical approach for positive in situ genetic screens. PMID:24298059
Jalali, Ali; Aldinger, Kimberly A.; Chary, Ajit; Mclone, David G.; Bowman, Robin M.; Le, Luan Cong; Jardine, Phillip; Newbury-Ecob, Ruth; Mallick, Andrew; Jafari, Nadereh; Russell, Eric J.; Curran, John; Nguyen, Pam; Ouahchi, Karim; Lee, Charles; Dobyns, William B.; Millen, Kathleen J.; Pina-Neto, Joao M.; Kessler, John A.; Bassuk, Alexander G.
2010-01-01
We previously reported a Vietnamese-American family with isolated autosomal dominant occipital cephalocele. Upon further neuroimaging studies, we have recharacterized this condition as autosomal dominant Dandy-Walker with occipital cephalocele (ADDWOC). A similar ADDWOC family from Brazil was also recently described. To determine the genetic etiology of ADDWOC, we performed genome-wide linkage analysis on members of the Vietnamese-American and Brazilian pedigrees. Linkage analysis of the Vietnamese-American family identified the ADDWOC causative locus on chromosome 2q36.1 with a multipoint parametric LOD score of 3.3, while haplotype analysis refined the locus to 1.1 Mb. Sequencing of the five known genes in this locus did not identify any protein-altering mutations. However, a terminal deletion of chromosome 2 in a patient with an isolated case of Dandy-Walker malformation also encompassed the 2q36.1 chromosomal region. The Brazilian pedigree did not show linkage to this 2q36.1 region. Taken together, these results demonstrate a locus for ADDWOC on 2q36.1 and also suggest locus heterogeneity for ADDWOC. PMID:18204864
Histone Code Modulation by Oncogenic PWWP-domain Protein in Breast Cancers
2012-06-01
imaginal discs, the Drosophila melanogaster homologue of human retinoblastoma binding protein 2. Genetics 2000; 156: 645-663. [10] Zeng J, Ge Z, Wang...in breast cancer patients. Earlier, we used genomic analysis of copy number and gene expression to perform a detailed analysis of the 8p11-12...1 Figure 1. Representative view of ChIP-seq peak of a histone modifying factor at the UBR2V2 genomic locus in the
Identification of four novel susceptibility loci for oestrogen receptor negative breast cancer.
Couch, Fergus J; Kuchenbaecker, Karoline B; Michailidou, Kyriaki; Mendoza-Fandino, Gustavo A; Nord, Silje; Lilyquist, Janna; Olswold, Curtis; Hallberg, Emily; Agata, Simona; Ahsan, Habibul; Aittomäki, Kristiina; Ambrosone, Christine; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Arun, Banu K; Arver, Brita; Barile, Monica; Barkardottir, Rosa B; Barrowdale, Daniel; Beckmann, Lars; Beckmann, Matthias W; Benitez, Javier; Blank, Stephanie V; Blomqvist, Carl; Bogdanova, Natalia V; Bojesen, Stig E; Bolla, Manjeet K; Bonanni, Bernardo; Brauch, Hiltrud; Brenner, Hermann; Burwinkel, Barbara; Buys, Saundra S; Caldes, Trinidad; Caligo, Maria A; Canzian, Federico; Carpenter, Jane; Chang-Claude, Jenny; Chanock, Stephen J; Chung, Wendy K; Claes, Kathleen B M; Cox, Angela; Cross, Simon S; Cunningham, Julie M; Czene, Kamila; Daly, Mary B; Damiola, Francesca; Darabi, Hatef; de la Hoya, Miguel; Devilee, Peter; Diez, Orland; Ding, Yuan C; Dolcetti, Riccardo; Domchek, Susan M; Dorfling, Cecilia M; Dos-Santos-Silva, Isabel; Dumont, Martine; Dunning, Alison M; Eccles, Diana M; Ehrencrona, Hans; Ekici, Arif B; Eliassen, Heather; Ellis, Steve; Fasching, Peter A; Figueroa, Jonine; Flesch-Janys, Dieter; Försti, Asta; Fostira, Florentia; Foulkes, William D; Friebel, Tara; Friedman, Eitan; Frost, Debra; Gabrielson, Marike; Gammon, Marilie D; Ganz, Patricia A; Gapstur, Susan M; Garber, Judy; Gaudet, Mia M; Gayther, Simon A; Gerdes, Anne-Marie; Ghoussaini, Maya; Giles, Graham G; Glendon, Gord; Godwin, Andrew K; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Greene, Mark H; Gronwald, Jacek; Guénel, Pascal; Gunter, Marc; Haeberle, Lothar; Haiman, Christopher A; Hamann, Ute; Hansen, Thomas V O; Hart, Steven; Healey, Sue; Heikkinen, Tuomas; Henderson, Brian E; Herzog, Josef; Hogervorst, Frans B L; Hollestelle, Antoinette; Hooning, Maartje J; Hoover, Robert N; Hopper, John L; Humphreys, Keith; Hunter, David J; Huzarski, Tomasz; Imyanitov, Evgeny N; Isaacs, Claudine; Jakubowska, 
Anna; James, Paul; Janavicius, Ramunas; Jensen, Uffe Birk; John, Esther M; Jones, Michael; Kabisch, Maria; Kar, Siddhartha; Karlan, Beth Y; Khan, Sofia; Khaw, Kay-Tee; Kibriya, Muhammad G; Knight, Julia A; Ko, Yon-Dschun; Konstantopoulou, Irene; Kosma, Veli-Matti; Kristensen, Vessela; Kwong, Ava; Laitman, Yael; Lambrechts, Diether; Lazaro, Conxi; Lee, Eunjung; Le Marchand, Loic; Lester, Jenny; Lindblom, Annika; Lindor, Noralane; Lindstrom, Sara; Liu, Jianjun; Long, Jirong; Lubinski, Jan; Mai, Phuong L; Makalic, Enes; Malone, Kathleen E; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Marme, Frederik; Martens, John W M; McGuffog, Lesley; Meindl, Alfons; Miller, Austin; Milne, Roger L; Miron, Penelope; Montagna, Marco; Mazoyer, Sylvie; Mulligan, Anna M; Muranen, Taru A; Nathanson, Katherine L; Neuhausen, Susan L; Nevanlinna, Heli; Nordestgaard, Børge G; Nussbaum, Robert L; Offit, Kenneth; Olah, Edith; Olopade, Olufunmilayo I; Olson, Janet E; Osorio, Ana; Park, Sue K; Peeters, Petra H; Peissel, Bernard; Peterlongo, Paolo; Peto, Julian; Phelan, Catherine M; Pilarski, Robert; Poppe, Bruce; Pylkäs, Katri; Radice, Paolo; Rahman, Nazneen; Rantala, Johanna; Rappaport, Christine; Rennert, Gad; Richardson, Andrea; Robson, Mark; Romieu, Isabelle; Rudolph, Anja; Rutgers, Emiel J; Sanchez, Maria-Jose; Santella, Regina M; Sawyer, Elinor J; Schmidt, Daniel F; Schmidt, Marjanka K; Schmutzler, Rita K; Schumacher, Fredrick; Scott, Rodney; Senter, Leigha; Sharma, Priyanka; Simard, Jacques; Singer, Christian F; Sinilnikova, Olga M; Soucy, Penny; Southey, Melissa; Steinemann, Doris; Stenmark-Askmalm, Marie; Stoppa-Lyonnet, Dominique; Swerdlow, Anthony; Szabo, Csilla I; Tamimi, Rulla; Tapper, William; Teixeira, Manuel R; Teo, Soo-Hwang; Terry, Mary B; Thomassen, Mads; Thompson, Deborah; Tihomirova, Laima; Toland, Amanda E; Tollenaar, Robert A E M; Tomlinson, Ian; Truong, Thérèse; Tsimiklis, Helen; Teulé, Alex; Tumino, Rosario; Tung, Nadine; Turnbull, Clare; Ursin, Giski; van 
Deurzen, Carolien H M; van Rensburg, Elizabeth J; Varon-Mateeva, Raymonda; Wang, Zhaoming; Wang-Gohrke, Shan; Weiderpass, Elisabete; Weitzel, Jeffrey N; Whittemore, Alice; Wildiers, Hans; Winqvist, Robert; Yang, Xiaohong R; Yannoukakos, Drakoulis; Yao, Song; Zamora, M Pilar; Zheng, Wei; Hall, Per; Kraft, Peter; Vachon, Celine; Slager, Susan; Chenevix-Trench, Georgia; Pharoah, Paul D P; Monteiro, Alvaro A N; García-Closas, Montserrat; Easton, Douglas F; Antoniou, Antonis C
2016-04-27
Common variants in 94 loci have been associated with breast cancer including 15 loci with genome-wide significant associations (P < 5 × 10⁻⁸) with oestrogen receptor (ER)-negative breast cancer and BRCA1-associated breast cancer risk. In this study, to identify new ER-negative susceptibility loci, we performed a meta-analysis of 11 genome-wide association studies (GWAS) consisting of 4,939 ER-negative cases and 14,352 controls, combined with 7,333 ER-negative cases and 42,468 controls and 15,252 BRCA1 mutation carriers genotyped on the iCOGS array. We identify four previously unidentified loci including two loci at 13q22 near KLF5, a 2p23.2 locus near WDR43 and a 2q33 locus near PPIL3 that display genome-wide significant associations with ER-negative breast cancer. In addition, 19 known breast cancer risk loci have genome-wide significant associations and 40 had moderate associations (P<0.05) with ER-negative disease. Using functional and eQTL studies we implicate TRMT61B and WDR43 at 2p23.2 and PPIL3 at 2q33 in ER-negative breast cancer aetiology. All ER-negative loci combined account for ∼11% of familial relative risk for ER-negative disease and may contribute to improved ER-negative and BRCA1 breast cancer risk prediction.
Identification of four novel susceptibility loci for oestrogen receptor negative breast cancer
Couch, Fergus J.; Kuchenbaecker, Karoline B.; Michailidou, Kyriaki; Mendoza-Fandino, Gustavo A.; Nord, Silje; Lilyquist, Janna; Olswold, Curtis; Hallberg, Emily; Agata, Simona; Ahsan, Habibul; Aittomäki, Kristiina; Ambrosone, Christine; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Arun, Banu K.; Arver, Brita; Barile, Monica; Barkardottir, Rosa B.; Barrowdale, Daniel; Beckmann, Lars; Beckmann, Matthias W.; Benitez, Javier; Blank, Stephanie V.; Blomqvist, Carl; Bogdanova, Natalia V.; Bojesen, Stig E.; Bolla, Manjeet K.; Bonanni, Bernardo; Brauch, Hiltrud; Brenner, Hermann; Burwinkel, Barbara; Buys, Saundra S.; Caldes, Trinidad; Caligo, Maria A.; Canzian, Federico; Carpenter, Jane; Chang-Claude, Jenny; Chanock, Stephen J.; Chung, Wendy K.; Claes, Kathleen B. M.; Cox, Angela; Cross, Simon S.; Cunningham, Julie M.; Czene, Kamila; Daly, Mary B.; Damiola, Francesca; Darabi, Hatef; de la Hoya, Miguel; Devilee, Peter; Diez, Orland; Ding, Yuan C.; Dolcetti, Riccardo; Domchek, Susan M.; Dorfling, Cecilia M.; dos-Santos-Silva, Isabel; Dumont, Martine; Dunning, Alison M.; Eccles, Diana M.; Ehrencrona, Hans; Ekici, Arif B.; Eliassen, Heather; Ellis, Steve; Fasching, Peter A.; Figueroa, Jonine; Flesch-Janys, Dieter; Försti, Asta; Fostira, Florentia; Foulkes, William D.; Friebel, Tara; Friedman, Eitan; Frost, Debra; Gabrielson, Marike; Gammon, Marilie D.; Ganz, Patricia A.; Gapstur, Susan M.; Garber, Judy; Gaudet, Mia M.; Gayther, Simon A.; Gerdes, Anne-Marie; Ghoussaini, Maya; Giles, Graham G.; Glendon, Gord; Godwin, Andrew K.; Goldberg, Mark S.; Goldgar, David E.; González-Neira, Anna; Greene, Mark H.; Gronwald, Jacek; Guénel, Pascal; Gunter, Marc; Haeberle, Lothar; Haiman, Christopher A.; Hamann, Ute; Hansen, Thomas V. O.; Hart, Steven; Healey, Sue; Heikkinen, Tuomas; Henderson, Brian E.; Herzog, Josef; Hogervorst, Frans B. 
L.; Hollestelle, Antoinette; Hooning, Maartje J.; Hoover, Robert N.; Hopper, John L.; Humphreys, Keith; Hunter, David J.; Huzarski, Tomasz; Imyanitov, Evgeny N.; Isaacs, Claudine; Jakubowska, Anna; James, Paul; Janavicius, Ramunas; Jensen, Uffe Birk; John, Esther M.; Jones, Michael; Kabisch, Maria; Kar, Siddhartha; Karlan, Beth Y.; Khan, Sofia; Khaw, Kay-Tee; Kibriya, Muhammad G.; Knight, Julia A.; Ko, Yon-Dschun; Konstantopoulou, Irene; Kosma, Veli-Matti; Kristensen, Vessela; Kwong, Ava; Laitman, Yael; Lambrechts, Diether; Lazaro, Conxi; Lee, Eunjung; Le Marchand, Loic; Lester, Jenny; Lindblom, Annika; Lindor, Noralane; Lindstrom, Sara; Liu, Jianjun; Long, Jirong; Lubinski, Jan; Mai, Phuong L.; Makalic, Enes; Malone, Kathleen E.; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Marme, Frederik; Martens, John W. M.; McGuffog, Lesley; Meindl, Alfons; Miller, Austin; Milne, Roger L.; Miron, Penelope; Montagna, Marco; Mazoyer, Sylvie; Mulligan, Anna M.; Muranen, Taru A.; Nathanson, Katherine L.; Neuhausen, Susan L.; Nevanlinna, Heli; Nordestgaard, Børge G.; Nussbaum, Robert L.; Offit, Kenneth; Olah, Edith; Olopade, Olufunmilayo I.; Olson, Janet E.; Osorio, Ana; Park, Sue K.; Peeters, Petra H.; Peissel, Bernard; Peterlongo, Paolo; Peto, Julian; Phelan, Catherine M.; Pilarski, Robert; Poppe, Bruce; Pylkäs, Katri; Radice, Paolo; Rahman, Nazneen; Rantala, Johanna; Rappaport, Christine; Rennert, Gad; Richardson, Andrea; Robson, Mark; Romieu, Isabelle; Rudolph, Anja; Rutgers, Emiel J.; Sanchez, Maria-Jose; Santella, Regina M.; Sawyer, Elinor J.; Schmidt, Daniel F.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Schumacher, Fredrick; Scott, Rodney; Senter, Leigha; Sharma, Priyanka; Simard, Jacques; Singer, Christian F.; Sinilnikova, Olga M.; Soucy, Penny; Southey, Melissa; Steinemann, Doris; Stenmark-Askmalm, Marie; Stoppa-Lyonnet, Dominique; Swerdlow, Anthony; Szabo, Csilla I.; Tamimi, Rulla; Tapper, William; Teixeira, Manuel R.; Teo, Soo-Hwang; Terry, Mary B.; Thomassen, 
Mads; Thompson, Deborah; Tihomirova, Laima; Toland, Amanda E.; Tollenaar, Robert A. E. M.; Tomlinson, Ian; Truong, Thérèse; Tsimiklis, Helen; Teulé, Alex; Tumino, Rosario; Tung, Nadine; Turnbull, Clare; Ursin, Giski; van Deurzen, Carolien H. M.; van Rensburg, Elizabeth J.; Varon-Mateeva, Raymonda; Wang, Zhaoming; Wang-Gohrke, Shan; Weiderpass, Elisabete; Weitzel, Jeffrey N.; Whittemore, Alice; Wildiers, Hans; Winqvist, Robert; Yang, Xiaohong R.; Yannoukakos, Drakoulis; Yao, Song; Zamora, M Pilar; Zheng, Wei; Hall, Per; Kraft, Peter; Vachon, Celine; Slager, Susan; Chenevix-Trench, Georgia; Pharoah, Paul D. P.; Monteiro, Alvaro A. N.; García-Closas, Montserrat; Easton, Douglas F.; Antoniou, Antonis C.
2016-01-01
Common variants in 94 loci have been associated with breast cancer including 15 loci with genome-wide significant associations (P < 5 × 10⁻⁸) with oestrogen receptor (ER)-negative breast cancer and BRCA1-associated breast cancer risk. In this study, to identify new ER-negative susceptibility loci, we performed a meta-analysis of 11 genome-wide association studies (GWAS) consisting of 4,939 ER-negative cases and 14,352 controls, combined with 7,333 ER-negative cases and 42,468 controls and 15,252 BRCA1 mutation carriers genotyped on the iCOGS array. We identify four previously unidentified loci including two loci at 13q22 near KLF5, a 2p23.2 locus near WDR43 and a 2q33 locus near PPIL3 that display genome-wide significant associations with ER-negative breast cancer. In addition, 19 known breast cancer risk loci have genome-wide significant associations and 40 had moderate associations (P<0.05) with ER-negative disease. Using functional and eQTL studies we implicate TRMT61B and WDR43 at 2p23.2 and PPIL3 at 2q33 in ER-negative breast cancer aetiology. All ER-negative loci combined account for ∼11% of familial relative risk for ER-negative disease and may contribute to improved ER-negative and BRCA1 breast cancer risk prediction. PMID:27117709
2012-01-01
Background Staphylococcus aureus Repeat (STAR) elements are a type of interspersed intergenic direct repeat. In this study the conservation and variation in these elements was explored by bioinformatic analyses of published staphylococcal genome sequences and through sequencing of specific STAR element loci from a large set of S. aureus isolates. Results Using bioinformatic analyses, we found that the STAR elements were located in different genomic loci within each staphylococcal species. There was no correlation between the number of STAR elements in each genome and the evolutionary relatedness of staphylococcal species, however higher levels of repeats were observed in both S. aureus and S. lugdunensis compared to other staphylococcal species. Unexpectedly, sequencing of the internal spacer sequences of individual repeat elements from multiple isolates showed conservation at the sequence level within deep evolutionary lineages of S. aureus. Whilst individual STAR element loci were demonstrated to expand and contract, the sequences associated with each locus were stable and distinct from one another. Conclusions The high degree of lineage and locus-specific conservation of these intergenic repeat regions suggests that STAR elements are maintained due to selective or molecular forces with some of these elements having an important role in cell physiology. The high prevalence in two of the more virulent staphylococcal species is indicative of a potential role for STAR elements in pathogenesis. PMID:23020678
Wu, Chen; Wang, Zhaoming; Song, Xin; Feng, Xiao-Shan; Abnet, Christian C; He, Jie; Hu, Nan; Zuo, Xian-Bo; Tan, Wen; Zhan, Qimin; Hu, Zhibin; He, Zhonghu; Jia, Weihua; Zhou, Yifeng; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Zhao, Xue-Ke; Gao, She-Gan; Yuan, Zhi-Qing; Zhou, Fu-You; Fan, Zong-Min; Cui, Ji-Li; Lin, Hong-Li; Han, Xue-Na; Li, Bei; Chen, Xi; Dawsey, Sanford M; Liao, Linda; Lee, Maxwell P; Ding, Ti; Qiao, You-Lin; Liu, Zhihua; Liu, Yu; Yu, Dianke; Chang, Jiang; Wei, Lixuan; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Han, Jing-Jing; Zhou, Sheng-Li; Zhang, Peng; Zhang, Dong-Yun; Yuan, Yuan; Huang, Ying; Liu, Chunling; Zhai, Kan; Qiao, Yan; Jin, Guangfu; Guo, Chuanhai; Fu, Jianhua; Miao, Xiaoping; Lu, Changdong; Yang, Haijun; Wang, Chaoyu; Wheeler, William A; Gail, Mitchell; Yeager, Meredith; Yuenger, Jeff; Guo, Er-Tao; Li, Ai-Li; Zhang, Wei; Li, Xue-Min; Sun, Liang-Dan; Ma, Bao-Gen; Li, Yan; Tang, Sa; Peng, Xiu-Qing; Liu, Jing; Hutchinson, Amy; Jacobs, Kevin; Giffen, Carol; Burdette, Laurie; Fraumeni, Joseph F; Shen, Hongbing; Ke, Yang; Zeng, Yixin; Wu, Tangchun; Kraft, Peter; Chung, Charles C; Tucker, Margaret A; Hou, Zhi-Chao; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Wang, Li; Yuan, Guo; Chen, Li-Sha; Liu, Xiao; Ma, Teng; Meng, Hui; Sun, Li; Li, Xin-Min; Li, Xiu-Min; Ku, Jian-Wei; Zhou, Ying-Fa; Yang, Liu-Qin; Wang, Zhou; Li, Yin; Qige, Qirenwang; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Yuan, Ling; Yue, Wen-Bin; Wang, Ran; Wang, Lu-Wen; Fan, Xue-Ping; Zhu, Fang-Heng; Zhao, Wei-Xing; Mao, Yi-Min; Zhang, Mei; Xing, Guo-Lan; Li, Ji-Lin; Han, Min; Ren, Jing-Li; Liu, Bin; Ren, Shu-Wei; Kong, Qing-Peng; Li, Feng; Sheyhidin, Ilyar; Wei, Wu; Zhang, Yan-Rui; Feng, Chang-Wei; Wang, Jin; Yang, Yu-Hua; Hao, Hong-Zhang; Bao, Qi-De; Liu, Bao-Chi; Wu, Ai-Qun; Xie, Dong; Yang, Wan-Cai; Wang, Liang; Zhao, Xiao-Hang; Chen, Shu-Qing; Hong, Jun-Yan; Zhang, Xue-Jun; Freedman, Neal D; Goldstein, Alisa M; Lin, Dongxin; 
Taylor, Philip R; Wang, Li-Dong; Chanock, Stephen J
2014-09-01
We conducted a joint (pooled) analysis of three genome-wide association studies (GWAS) of esophageal squamous cell carcinoma (ESCC) in individuals of Chinese ancestry (5,337 ESCC cases and 5,787 controls) with 9,654 ESCC cases and 10,058 controls for follow-up. In a logistic regression model adjusted for age, sex, study and two eigenvectors, two new loci achieved genome-wide significance, marked by rs7447927 at 5q31.2 (per-allele odds ratio (OR) = 0.85, 95% confidence interval (CI) = 0.82-0.88; P = 7.72 × 10⁻²⁰) and rs1642764 at 17p13.1 (per-allele OR = 0.88, 95% CI = 0.85-0.91; P = 3.10 × 10⁻¹³). rs7447927 is a synonymous SNP in TMEM173, and rs1642764 is an intronic SNP in ATP1B2, near TP53. Furthermore, a locus in the HLA class II region at 6p21.32 (rs35597309) achieved genome-wide significance in the two populations at highest risk for ESCC (OR = 1.33, 95% CI = 1.22-1.46; P = 1.99 × 10⁻¹⁰). Our joint analysis identifies new ESCC susceptibility loci overall as well as a new locus unique to the population in the Taihang Mountain region at high risk of ESCC.
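As a rough consistency check on the odds ratios, confidence intervals and P values reported above, the following sketch recovers an approximate standard error and Wald z statistic from an OR and its 95% CI. It assumes the interval is a symmetric Wald interval on the log-odds scale and that the P value is two-sided under a normal approximation; neither is stated in the abstract. Because the published OR and CI are rounded to two decimals, the recovered P values only approximate the reported ones. The function name is hypothetical.

```python
import math


def wald_from_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover (log OR, SE, z, two-sided p) from an OR and its 95% CI.

    Assumes a symmetric Wald interval on the log-odds scale.
    """
    log_or = math.log(odds_ratio)
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = log_or / se
    # Two-sided normal tail probability: 2 * (1 - Phi(|z|)) == erfc(|z| / sqrt(2))
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return log_or, se, z, p


if __name__ == "__main__":
    reported = {
        "rs7447927": (0.85, 0.82, 0.88),
        "rs1642764": (0.88, 0.85, 0.91),
        "rs35597309": (1.33, 1.22, 1.46),
    }
    for snp, (or_, lo, hi) in reported.items():
        log_or, se, z, p = wald_from_ci(or_, lo, hi)
        print(f"{snp}: z = {z:.2f}, approx. P = {p:.2e}")
```

All three recovered P values fall below the conventional genome-wide threshold of 5 × 10⁻⁸, in line with the abstract's claims.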
Zhang, Lujun; Li, Zhixin; Fan, Renchun; Wei, Bo; Zhang, Xiangqi
2016-07-19
The Roegneria of Triticeae is a large genus including about 130 allopolyploid species. Little is known about its high-molecular-weight glutenin subunits (HMW-GSs). Here, we reported six novel HMW-GS genes from R. nakaii and R. alashanica. Sequencing indicated that Rny1, Rny3, and Ray1 possessed intact open reading frames (ORFs), whereas Rny2, Rny4, and Ray2 harbored in-frame stop codons. All of the six genes possessed a similar primary structure to known HMW-GS, while showing some unique characteristics. Their coding regions were significantly shorter than Glu-1 genes in wheat. The amino acid sequences revealed that all of the six genes were intermediate towards the y-type. The phylogenetic analysis showed that the HMW-GSs from species with St, StY, or StH genome(s) clustered in an independent clade, varying from the typical x- and y-type clusters. Thus, the Glu-1 locus in R. nakaii and R. alashanica is a very primitive glutenin locus across evolution. The six genes were phylogenetically split into two groups clustered to different clades, respectively, each of the two clades included the HMW-GSs from species with St (diploid and tetraploid species), StY, and StH genomes. Hence, it is concluded that the six Roegneria HMW-GS genes are from two St genomes undergoing slight differentiation.
NetF-producing Clostridium perfringens: Clonality and plasmid pathogenicity loci analysis.
Mehdizadeh Gohari, Iman; Kropinski, Andrew M; Weese, Scott J; Whitehead, Ashley E; Parreira, Valeria R; Boerlin, Patrick; Prescott, John F
2017-04-01
Clostridium perfringens is an important cause of foal necrotizing enteritis and canine acute hemorrhagic diarrhea. A major virulence determinant of the strains associated with these diseases appears to be a beta-sheet pore-forming toxin, NetF, encoded within a pathogenicity locus (NetF locus) on a large tcp-conjugative plasmid. Strains producing NetF also produce the putative toxin NetE, encoded within the same pathogenicity locus, as well as CPE enterotoxin and CPB2 on a second plasmid, and sometimes the putative toxin NetG within a pathogenicity locus (NetG locus) on another separate large conjugative plasmid. Previous genome sequences of two netF-positive C. perfringens showed that they both shared three similar plasmids, including the NetF/NetE and CPE/CPB2 toxins-encoding plasmids mentioned above and a putative bacteriocin-encoding plasmid. The main purpose of this study was to determine whether all NetF-producing strains share this common plasmid profile and whether their distinct NetF and CPE pathogenicity loci are conserved. To answer this question, 15 equine and 15 canine netF-positive isolates of C. perfringens were sequenced using Illumina Hiseq2000 technology. In addition, the clonal relationships among the NetF-producing strains were evaluated by core genome multilocus sequence typing (cgMLST). The data obtained showed that all NetF-producing strains have a common plasmid profile and that the defined pathogenicity loci on the plasmids are conserved in all these strains. cgMLST analysis showed that the NetF-producing C. perfringens strains belong to two distinct clonal complexes. The pNetG plasmid was absent from isolates of one of the clonal complexes, and there were minor but consistent differences in the NetF/NetE and CPE/CPB2 plasmids between the two clonal complexes. Copyright © 2017 Elsevier B.V. All rights reserved.
Mehdizadeh Gohari, Iman; Kropinski, Andrew M; Weese, Scott J; Parreira, Valeria R; Whitehead, Ashley E; Boerlin, Patrick; Prescott, John F
2016-01-01
The recent discovery of a novel beta-pore-forming toxin, NetF, which is strongly associated with canine and foal necrotizing enteritis should improve our understanding of the role of type A Clostridium perfringens associated disease in these animals. The current study presents the complete genome sequence of two netF-positive strains, JFP55 and JFP838, which were recovered from cases of foal necrotizing enteritis and canine hemorrhagic gastroenteritis, respectively. Genome sequencing was done using Single Molecule, Real-Time (SMRT) technology-PacBio and Illumina Hiseq2000. The JFP55 and JFP838 genomes include a single 3.34 Mb and 3.53 Mb chromosome, respectively, and both genomes include five circular plasmids. Plasmid annotation revealed that three plasmids were shared by the two newly sequenced genomes, including a NetF/NetE toxins-encoding tcp-conjugative plasmid, a CPE/CPB2 toxins-encoding tcp-conjugative plasmid and a putative bacteriocin-encoding plasmid. The putative beta-pore-forming toxin genes, netF, netE and netG, were located in unique pathogenicity loci on tcp-conjugative plasmids. The C. perfringens JFP55 chromosome carries 2,825 protein-coding genes whereas the chromosome of JFP838 contains 3,014 protein-encoding genes. Comparison of these two chromosomes with three available reference C. perfringens chromosome sequences identified 48 (~247 kb) and 81 (~430 kb) regions unique to JFP55 and JFP838, respectively. Some of these divergent genomic regions in both chromosomes are phage- and plasmid-related segments. Sixteen of these unique chromosomal regions (~69 kb) were shared between the two isolates. Five of these shared regions formed a mosaic of plasmid-integrated segments, suggesting that these elements were acquired early in a clonal lineage of netF-positive C. perfringens strains. 
These results provide significant insight into the basis of canine and foal necrotizing enteritis and are the first to demonstrate that netF resides on a large and unique plasmid-encoded locus.
Capel, K C C; Migotto, A E; Zilberberg, C; Lin, M F; Forsman, Z; Miller, D J; Kitahara, M V
2016-09-30
Members of the azooxanthellate coral genus Tubastraea are invasive species of particular concern because they have become established and are fierce competitors in invaded areas in many parts of the world. Pacific Tubastraea species are spreading fast throughout the Atlantic Ocean, occupying over 95% of the available substrate in some areas and out-competing native endemic species. Approximately half of all known coral species are azooxanthellate but these are seriously under-represented compared to zooxanthellate corals in terms of the availability of mitochondrial (mt) genome data. In the present study, the complete mt DNA sequences of Atlantic individuals of the invasive scleractinian species Tubastraea coccinea and Tubastraea tagusensis were determined and compared to the GenBank reference sequence available for a Pacific "T. coccinea" individual. At 19,094 bp (compared to 19,070 bp for the GenBank specimen), the mt genomes assembled for the Atlantic T. coccinea and T. tagusensis were among the longest sequences determined to date for "Complex" scleractinians. Comparison of the genome data showed that the "T. coccinea" sequence deposited in GenBank was more closely related to that from Dendrophyllia arbuscula than to the Atlantic Tubastraea spp., in terms of genome length and base pair similarities. This was confirmed by phylogenetic analysis, suggesting that the former was misidentified and might actually be a member of the genus Dendrophyllia. In addition, although in general the COX1 locus has a slow evolutionary rate in Scleractinia, it was the most variable region of the Tubastraea mt genome and can be used as a marker for genus or species identification. Given the limited data available for azooxanthellate corals, the results presented here represent an important contribution to our understanding of phylogenetic relationships and the evolutionary history of the Scleractinia. Copyright © 2016 Elsevier B.V. All rights reserved.
Coordinates and intervals in graph-based reference genomes.
Rand, Knut D; Grytten, Ivar; Nederbragt, Alexander J; Storvik, Geir O; Glad, Ingrid K; Sandve, Geir K
2017-05-18
It has been proposed that future reference genomes should be graph structures in order to better represent the sequence diversity present in a species. However, there is currently no standard method to represent genomic intervals, such as the positions of genes or transcription factor binding sites, on graph-based reference genomes. We formalize offset-based coordinate systems on graph-based reference genomes and introduce methods for representing intervals on these reference structures. We show the advantage of our methods by representing genes on a graph-based representation of the newest assembly of the human genome (GRCh38) and its alternative loci for regions that are highly variable. More complex reference genomes, containing alternative loci, require methods to represent genomic data on these structures. Our proposed notation for genomic intervals makes it possible to fully utilize the alternative loci of the GRCh38 assembly and potential future graph-based reference genomes. We have made a Python package for representing such intervals on offset-based coordinate systems, available at https://github.com/uio-cels/offsetbasedgraph . An interactive web-tool using this Python package to visualize genes on a graph created from GRCh38 is available at https://github.com/uio-cels/genomicgraphcoords .
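The idea of an offset-based coordinate on a graph-based reference can be sketched as follows: a position is a pair (block, offset within the block), and an interval additionally records the ordered blocks it passes through, since a start and end alone are ambiguous when alternative loci branch off the main path. This is an illustrative sketch only. The class names, fields and block identifiers are hypothetical and do not reproduce the API of the offsetbasedgraph package mentioned above.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class Position:
    block_id: str   # identifier of a sequence block (graph node)
    offset: int     # 0-based offset within that block


@dataclass(frozen=True)
class Interval:
    start: Position
    end: Position             # end offset is exclusive
    blocks: Tuple[str, ...]   # ordered blocks traversed, first to last

    def length(self, block_lengths: Dict[str, int]) -> int:
        """Number of bases covered along the recorded path."""
        if len(self.blocks) == 1:
            return self.end.offset - self.start.offset
        first = block_lengths[self.blocks[0]] - self.start.offset
        middle = sum(block_lengths[b] for b in self.blocks[1:-1])
        return first + middle + self.end.offset


if __name__ == "__main__":
    # Hypothetical graph: a main-path block, an alternative locus, then main path.
    lengths = {"chr1_a": 100, "alt1": 50, "chr1_b": 200}
    gene = Interval(
        start=Position("chr1_a", 90),
        end=Position("chr1_b", 10),
        blocks=("chr1_a", "alt1", "chr1_b"),
    )
    print(gene.length(lengths))
```

Storing the traversed blocks explicitly is one simple design choice; it lets the same start and end describe different genes depending on whether the path follows the main assembly or an alternative locus.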
Liao, Can; Fu, Fang; Li, Ru; Yang, Xin; Xu, Qing; Li, Dong-Zhi
2012-01-01
We present three foetuses with Dandy-Walker malformation, intra-uterine growth restriction and multiple congenital abnormalities, who were studied by array-based comparative genomic hybridization and revealed a novel locus on chromosome 7p21.3. The association of pure chromosome 7p aberrations with Dandy-Walker malformation has rarely been reported. The present study suggests that the critical region associated with Dandy-Walker malformation is restricted to 7p21.3, including the cerebellar disease associated genes NDUFA4 and PHF14. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Methylation at CPT1A locus is associated with lipoprotein subfraction profiles
USDA-ARS's Scientific Manuscript database
Lipoprotein subfractions help discriminate cardiometabolic disease risk. Genetic loci validated as associating with lipoprotein measures do not account for a large proportion of the individual variation in lipoprotein measures. We hypothesized that DNA methylation levels across the genome contribute...
van Eck, Herman J; Vos, Peter G; Valkonen, Jari P T; Uitdewilligen, Jan G A M L; Lensing, Hellen; de Vetten, Nick; Visser, Richard G F
2017-03-01
The method of graphical genotyping is applied to a panel of tetraploid potato cultivars to visualize haplotype sharing. The method allowed genes involved in virus and nematode resistance to be mapped. The physical coordinates of the amount of linkage drag surrounding these genes are easily interpretable. Graphical genotyping is a visually attractive and easily interpretable method to represent genetic marker data. In this paper, the method is extended from diploids to a panel of tetraploid potato cultivars. Application of filters to select a subset of SNPs allows one to visualize haplotype sharing between individuals that also share a specific locus. The method is illustrated with cultivars resistant to Potato virus Y (PVY), while simultaneously selecting for the absence of the SNPs in susceptible clones. SNP data will then merge into an image which displays the coordinates of a distal genomic region on the northern arm of chromosome 11 where a specific haplotype is introgressed from the wild potato species S. stoloniferum (CPC 2093) carrying a gene (Ny(o,n)sto) conferring resistance to two PVY strains, PVY^O and PVY^NTN. Graphical genotyping was also successful in showing the haplotypes on chromosome 12 carrying Ry-fsto, another resistance gene derived from S. stoloniferum conferring broad-spectrum resistance to PVY, as well as chromosome 5 haplotypes from S. vernei, with the Gpa5 locus involved in resistance against Globodera pallida cyst nematodes. The image also shows shortening of linkage drag by meiotic recombination of the introgression segment in more recent breeding material. Identity-by-descent was found to be a requirement for using graphical genotyping, which is proposed as a non-statistical alternative method for gene discovery, as compared with genome-wide association studies. The potential and limitations of the method are discussed.
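The SNP filtering step described in this abstract (keep SNPs present in resistant cultivars while selecting for their absence in susceptible clones) can be sketched as below. This is a minimal illustration, not the authors' pipeline. It assumes genotypes are stored as alternative-allele dosages from 0 to 4 for a tetraploid, and the cultivar and SNP names are hypothetical placeholders.

```python
from typing import Dict, Iterable, List


def select_haplotype_snps(
    dosages: Dict[str, Dict[str, int]],
    resistant: Iterable[str],
    susceptible: Iterable[str],
) -> List[str]:
    """Return SNPs carried by every resistant cultivar and by no susceptible one.

    dosages maps snp_id -> {cultivar: alt-allele dosage in 0..4}.
    """
    resistant = list(resistant)
    susceptible = list(susceptible)
    selected = []
    for snp, calls in dosages.items():
        in_all_resistant = all(calls[c] > 0 for c in resistant)
        absent_in_susceptible = all(calls[c] == 0 for c in susceptible)
        if in_all_resistant and absent_in_susceptible:
            selected.append(snp)
    return selected


if __name__ == "__main__":
    # Toy data with hypothetical cultivars R1, R2 (resistant) and S1 (susceptible).
    toy = {
        "snp_a": {"R1": 1, "R2": 1, "S1": 0},  # candidate haplotype-specific SNP
        "snp_b": {"R1": 2, "R2": 0, "S1": 0},  # missing from one resistant cultivar
        "snp_c": {"R1": 1, "R2": 3, "S1": 1},  # also present in a susceptible clone
    }
    print(select_haplotype_snps(toy, ["R1", "R2"], ["S1"]))
```

Plotting the physical positions of the SNPs that pass such a filter is what would reveal a shared introgression segment as a contiguous block.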
Whole-genome sequencing identifies EN1 as a determinant of bone density and fracture
Zheng, Hou-Feng; Forgetta, Vincenzo; Hsu, Yi-Hsiang; Estrada, Karol; Rosello-Diez, Alberto; Leo, Paul J; Dahia, Chitra L; Park-Min, Kyung Hyun; Tobias, Jonathan H; Kooperberg, Charles; Kleinman, Aaron; Styrkarsdottir, Unnur; Liu, Ching-Ti; Uggla, Charlotta; Evans, Daniel S; Nielson, Carrie M; Walter, Klaudia; Pettersson-Kymmer, Ulrika; McCarthy, Shane; Eriksson, Joel; Kwan, Tony; Jhamai, Mila; Trajanoska, Katerina; Memari, Yasin; Min, Josine; Huang, Jie; Danecek, Petr; Wilmot, Beth; Li, Rui; Chou, Wen-Chi; Mokry, Lauren E; Moayyeri, Alireza; Claussnitzer, Melina; Cheng, Chia-Ho; Cheung, Warren; Medina-Gómez, Carolina; Ge, Bing; Chen, Shu-Huang; Choi, Kwangbom; Oei, Ling; Fraser, James; Kraaij, Robert; Hibbs, Matthew A; Gregson, Celia L; Paquette, Denis; Hofman, Albert; Wibom, Carl; Tranah, Gregory J; Marshall, Mhairi; Gardiner, Brooke B; Cremin, Katie; Auer, Paul; Hsu, Li; Ring, Sue; Tung, Joyce Y; Thorleifsson, Gudmar; Enneman, Anke W; van Schoor, Natasja M; de Groot, Lisette C.P.G.M.; van der Velde, Nathalie; Melin, Beatrice; Kemp, John P; Christiansen, Claus; Sayers, Adrian; Zhou, Yanhua; Calderari, Sophie; van Rooij, Jeroen; Carlson, Chris; Peters, Ulrike; Berlivet, Soizik; Dostie, Josée; Uitterlinden, Andre G; Williams, Stephen R.; Farber, Charles; Grinberg, Daniel; LaCroix, Andrea Z; Haessler, Jeff; Chasman, Daniel I; Giulianini, Franco; Rose, Lynda M; Ridker, Paul M; Eisman, John A; Nguyen, Tuan V; Center, Jacqueline R; Nogues, Xavier; Garcia-Giralt, Natalia; Launer, Lenore L; Gudnason, Vilmunder; Mellström, Dan; Vandenput, Liesbeth; Karlsson, Magnus K; Ljunggren, Östen; Svensson, Olle; Hallmans, Göran; Rousseau, François; Giroux, Sylvie; Bussière, Johanne; Arp, Pascal P; Koromani, Fjorda; Prince, Richard L; Lewis, Joshua R; Langdahl, Bente L; Hermann, A Pernille; Jensen, Jens-Erik B; Kaptoge, Stephen; Khaw, Kay-Tee; Reeve, Jonathan; Formosa, Melissa M; Xuereb-Anastasi, Angela; Åkesson, Kristina; McGuigan, Fiona E; Garg, Gaurav; Olmos, Jose M; Zarrabeitia, 
Maria T; Riancho, Jose A; Ralston, Stuart H; Alonso, Nerea; Jiang, Xi; Goltzman, David; Pastinen, Tomi; Grundberg, Elin; Gauguier, Dominique; Orwoll, Eric S; Karasik, David; Davey-Smith, George; Smith, Albert V; Siggeirsdottir, Kristin; Harris, Tamara B; Zillikens, M Carola; van Meurs, Joyce BJ; Thorsteinsdottir, Unnur; Maurano, Matthew T; Timpson, Nicholas J; Soranzo, Nicole; Durbin, Richard; Wilson, Scott G; Ntzani, Evangelia E; Brown, Matthew A; Stefansson, Kari; Hinds, David A; Spector, Tim; Cupples, L Adrienne; Ohlsson, Claes; Greenwood, Celia MT; Jackson, Rebecca D; Rowe, David W; Loomis, Cynthia A; Evans, David M; Ackert-Bicknell, Cheryl L; Joyner, Alexandra L; Duncan, Emma L; Kiel, Douglas P; Rivadeneira, Fernando; Richards, J Brent
2016-01-01
SUMMARY The extent to which low-frequency (minor allele frequency [MAF] between 1–5%) and rare (MAF ≤ 1%) variants contribute to complex traits and disease in the general population is largely unknown. Bone mineral density (BMD) is highly heritable, is a major predictor of osteoporotic fractures and has been previously associated with common genetic variants1–8, and rare, population-specific, coding variants9. Here we identify novel non-coding genetic variants with large effects on BMD (ntotal = 53,236) and fracture (ntotal = 508,253) in individuals of European ancestry from the general population. Associations for BMD were derived from whole-genome sequencing (n=2,882 from UK10K), whole-exome sequencing (n= 3,549), deep imputation of genotyped samples using a combined UK10K/1000Genomes reference panel (n=26,534), and de-novo replication genotyping (n= 20,271). We identified a low-frequency non-coding variant near a novel locus, EN1, with an effect size 4-fold larger than the mean of previously reported common variants for lumbar spine BMD8 (rs11692564[T], MAF = 1.7%, replication effect size = +0.20 standard deviations [SD], Pmeta = 2×10−14), which was also associated with a decreased risk of fracture (OR = 0.85; P = 2×10−11; ncases = 98,742 and ncontrols = 409,511). Using an En1Cre/flox mouse model, we observed that conditional loss of En1 results in low bone mass, likely as a consequence of high bone turn-over. We also identified a novel low-frequency non-coding variant with large effects on BMD near WNT16 (rs148771817[T], MAF = 1.1%, replication effect size = +0.39 SD, Pmeta = 1×10−11). In general, there was an excess of association signals arising from deleterious coding and conserved non-coding variants. 
These findings provide evidence that low-frequency non-coding variants have large effects on BMD and fracture, thereby providing rationale for whole-genome sequencing and improved imputation reference panels to study the genetic architecture of complex traits and disease in the general population. PMID:26367794
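The frequency classes used in this abstract (rare: MAF ≤ 1%; low-frequency: MAF between 1% and 5%) can be written as a small helper. This is only an illustrative sketch: the function name `classify_variant` and the treatment of everything above 5% as "common" are assumptions made here, not part of the study's pipeline.

```python
def classify_variant(maf: float) -> str:
    """Classify a variant by minor allele frequency (MAF).

    Thresholds follow the abstract: rare is MAF <= 1%, low-frequency is
    MAF above 1% and up to 5%. Labelling anything above 5% as "common"
    is an assumption of this sketch.
    """
    if not 0.0 <= maf <= 0.5:
        # A minor allele frequency cannot exceed 0.5 by definition.
        raise ValueError("MAF must lie between 0 and 0.5")
    if maf <= 0.01:
        return "rare"
    if maf <= 0.05:
        return "low-frequency"
    return "common"


# The two variants reported in the abstract both fall in the
# low-frequency class.
reported = {"rs11692564": 0.017, "rs148771817": 0.011}
classes = {rsid: classify_variant(maf) for rsid, maf in reported.items()}
```

Under these thresholds, both rs11692564 (MAF = 1.7%) and rs148771817 (MAF = 1.1%) are low-frequency rather than rare.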
Perry, John R B; McMahon, George; Day, Felix R; Ring, Susan M; Nelson, Scott M; Lawlor, Debbie A
2016-01-15
Anti-Müllerian hormone (AMH) is an essential messenger of sexual differentiation in the foetus and is an emerging biomarker of postnatal reproductive function in females. Due to a paucity of adequately sized studies, the genetic determinants of circulating AMH levels are poorly characterized. In samples from 2815 adolescents aged 15 from the ALSPAC study, we performed the first genome-wide association study of serum AMH levels across a set of ∼9 million '1000 Genomes Reference Panel' imputed genetic variants. Genetic variants at the AMH protein-coding gene showed considerable allelic heterogeneity, with both common variants [rs4807216 (P(Male) = 2 × 10(-49), Beta: ∼0.9 SDs per allele), rs8112524 (P(Male) = 3 × 10(-8), Beta: ∼0.25)] and low-frequency variants [rs2385821 (P(Male) = 6 × 10(-31), Beta: ∼1.2, frequency 3.6%)] independently associated with apparently large effect sizes in males, but not females. For all three SNPs, we highlight mechanistic links to AMH gene function and demonstrate highly significant sex interactions (P(Het) 0.0003-6.3 × 10(-12)), culminating in contrasting estimates of trait variance explained (24.5% in males versus 0.8% in females). Using these SNPs as a genetic proxy for AMH levels, we found no evidence in additional datasets to support a biological role for AMH in complex traits and diseases in men. © The Author 2015. Published by Oxford University Press.
Li, Xianran; Tian, Feng; Huang, Haiyan; Tan, Lubin; Zhu, Zuofeng; Hu, Songnian; Sun, Chuanqing
2008-06-01
To facilitate cloning of the gene(s) underlying gpa7, a deep-coverage BAC library was constructed for an isolate of common wild rice (Oryza rufipogon Griff.) collected from Dongxiang, Jiangxi Province, China (DXCWR). gpa7, a quantitative trait locus corresponding to grain number per panicle, is positioned in the short arm of chromosome 7. The BAC library, containing 96,768 clones, represents approximately 18 haploid genome equivalents. The contig spanning DXCWR gpa7 was constructed with a series of ordered markers. The putative physical map near the gpa7 locus of another accession of O. rufipogon (Accession: IRGC 105491) was also isolated in silico. Analysis of the physical maps of gpa7 indicated that a segment of about 150 kb was deleted during domestication of common wild rice.
Smith, Nicholas L; Felix, Janine F; Morrison, Alanna C; Demissie, Serkalem; Glazer, Nicole L; Loehr, Laura R; Cupples, L Adrienne; Dehghan, Abbas; Lumley, Thomas; Rosamond, Wayne D; Lieb, Wolfgang; Rivadeneira, Fernando; Bis, Joshua C; Folsom, Aaron R; Benjamin, Emelia; Aulchenko, Yurii S; Haritunians, Talin; Couper, David; Murabito, Joanne; Wang, Ying A; Stricker, Bruno H; Gottdiener, John S; Chang, Patricia P; Wang, Thomas J; Rice, Kenneth M; Hofman, Albert; Heckbert, Susan R; Fox, Ervin R; O'Donnell, Christopher J; Uitterlinden, Andre G; Rotter, Jerome I; Willerson, James T; Levy, Daniel; van Duijn, Cornelia M; Psaty, Bruce M; Witteman, Jacqueline C M; Boerwinkle, Eric; Vasan, Ramachandran S
2010-06-01
Although genetic factors contribute to the onset of heart failure (HF), no large-scale genome-wide investigation of HF risk has been published to date. We have investigated the association of 2,478,304 single-nucleotide polymorphisms with incident HF by meta-analyzing data from 4 community-based prospective cohorts: the Atherosclerosis Risk in Communities Study, the Cardiovascular Health Study, the Framingham Heart Study, and the Rotterdam Study. Eligible participants for these analyses were of European or African ancestry and free of clinical HF at baseline. Each study independently conducted genome-wide scans and imputed data to the approximately 2.5 million single-nucleotide polymorphisms in HapMap. Within each study, Cox proportional hazards regression models provided age- and sex-adjusted estimates of the association between each variant and time to incident HF. Fixed-effect meta-analyses combined results for each single-nucleotide polymorphism from the 4 cohorts to produce an overall association estimate and P value. A genome-wide significance P value threshold was set a priori at 5.0x10(-7). During a mean follow-up of 11.5 years, 2526 incident HF events (12%) occurred in 20 926 European-ancestry participants. The meta-analysis identified a genome-wide significant locus at chromosomal position 15q22 (1.4x10(-8)), which was 58.8 kb from USP3. Among 2895 African-ancestry participants, 466 incident HF events (16%) occurred during a mean follow-up of 13.7 years. One genome-wide significant locus was identified at 12q14 (6.7x10(-8)), which was 6.3 kb from LRIG3. We identified 2 loci that were associated with incident HF and exceeded genome-wide significance. The findings merit replication in other community-based settings of incident HF.
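The fixed-effect meta-analysis step described above is commonly implemented with inverse-variance weighting. The sketch below assumes that approach (the abstract does not state the exact weighting scheme), and the function name `fixed_effect_meta` and the per-cohort numbers are hypothetical illustrations rather than data from the study.

```python
import math


def fixed_effect_meta(estimates, std_errors):
    """Inverse-variance fixed-effect meta-analysis for one SNP.

    estimates:  per-cohort effect estimates (e.g. log hazard ratios)
    std_errors: matching per-cohort standard errors
    Returns (pooled estimate, pooled standard error, z score, two-sided P).
    """
    if len(estimates) != len(std_errors) or not estimates:
        raise ValueError("need one standard error per estimate")
    # Each cohort is weighted by the inverse of its sampling variance.
    weights = [1.0 / se ** 2 for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled / pooled_se
    # Two-sided P value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled, pooled_se, z, p_value


# Hypothetical log hazard ratios and standard errors from four cohorts.
log_hr = [0.30, 0.25, 0.35, 0.28]
se = [0.10, 0.08, 0.12, 0.09]
pooled, pooled_se, z, p_value = fixed_effect_meta(log_hr, se)
significant = p_value < 5.0e-7  # a priori threshold quoted in the abstract
```

Because the weights are inverse variances, cohorts with smaller standard errors (typically the larger ones) dominate the pooled estimate.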
Two Low Coverage Bird Genomes and a Comparison of Reference-Guided versus De Novo Genome Assemblies
Card, Daren C.; Schield, Drew R.; Reyes-Velasco, Jacobo; Fujita, Matthew K.; Andrew, Audra L.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Tomback, Diana F.; Ruggiero, Robert P.; Castoe, Todd A.
2014-01-01
As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5–5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies. PMID:25192061
Genomic Locus Modulating IOP in the BXD RI Mouse Strains
King, Rebecca; Li, Ying; Wang, Jiaxing; Struebing, Felix L.; Geisert, Eldon E.
2018-01-01
Intraocular pressure (IOP) is the primary risk factor for developing glaucoma, yet little is known about the contribution of genomic background to IOP regulation. The present study leverages an array of systems genetics tools to study genomic factors modulating normal IOP in the mouse. The BXD recombinant inbred (RI) strain set was used to identify genomic loci modulating IOP. We measured the IOP in a total of 506 eyes from 38 different strains. Strain averages were subjected to conventional quantitative trait analysis by means of composite interval mapping. Candidate genes were defined, and immunohistochemistry and quantitative PCR (qPCR) were used for validation. Of the 38 BXD strains examined, the mean IOP ranged from a low of 13.2 mmHg to a high of 17.1 mmHg. The means for each strain were used to calculate a genome-wide interval map. One significant quantitative trait locus (QTL) was found on Chr. 8 (96 to 103 Mb). Within this 7 Mb region only 4 annotated genes were found: Gm15679, Cdh8, Cdh11 and Gm8730. Only two genes (Cdh8 and Cdh11) were candidates for modulating IOP based on the presence of non-synonymous SNPs. Further examination using SIFT (Sorting Intolerant From Tolerant) analysis revealed that the SNPs in Cdh8 (Cadherin 8) were predicted to not change protein function, while the SNPs in Cdh11 (Cadherin 11) would not be tolerated, affecting protein function. Furthermore, immunohistochemistry demonstrated that CDH11 is expressed in the trabecular meshwork of the mouse. We have examined the genomic regulation of IOP in the BXD RI strain set and found one significant QTL on Chr. 8. Within this QTL, there is one good candidate gene, Cdh11. PMID:29496776
Hart, James C; Miller, Craig T
2017-09-07
Here, we present and characterize the spontaneous X-linked recessive mutation casper, which causes oculocutaneous albinism in threespine sticklebacks (Gasterosteus aculeatus). In humans, Hermansky-Pudlak syndrome results in pigmentation defects due to disrupted formation of the melanin-containing lysosomal-related organelle (LRO), the melanosome. casper mutants display not only reduced pigmentation of melanosomes in melanophores, but also reductions in the iridescent silver color from iridophores, while the yellow pigmentation from xanthophores appears unaffected. We mapped casper using high-throughput sequencing of genomic DNA from bulked casper mutants to a region of the stickleback X chromosome (chromosome 19) near the stickleback ortholog of Hermansky-Pudlak syndrome 5 (Hps5). casper mutants have an insertion of a single nucleotide in the sixth exon of Hps5, predicted to generate an early frameshift. Genome editing using CRISPR/Cas9 induced lesions in Hps5 and phenocopied the casper mutation. Injecting single or paired Hps5 guide RNAs revealed higher incidences of genomic deletions from paired guide RNAs compared to single gRNAs. Stickleback Hps5 provides a genetic system where a hemizygous locus in XY males and a diploid locus in XX females can be used to generate an easily scored visible phenotype, facilitating quantitative studies of different genome editing approaches. Lastly, we show the ability to better visualize patterns of fluorescent transgenic reporters in Hps5 mutant fish. Thus, Hps5 mutations present an opportunity to study pigmented LROs in the emerging stickleback model system, as well as a tool to aid in assaying genome editing and visualizing enhancer activity in transgenic fish. Copyright © 2017 Hart and Miller.
GWAS and admixture mapping identify different asthma-associated loci in Latinos: The GALA II Study
Galanter, Joshua M; Gignoux, Christopher R; Torgerson, Dara G; Roth, Lindsey A; Eng, Celeste; Oh, Sam S; Nguyen, Elizabeth A; Drake, Katherine A; Huntsman, Scott; Hu, Donglei; Sen, Saunak; Davis, Adam; Farber, Harold J.; Avila, Pedro C.; Brigino-Buenaventura, Emerita; LeNoir, Michael A.; Meade, Kelley; Serebrisky, Denise; Borrell, Luisa N; Rodríguez-Cintrón, William; Estrada, Andres Moreno; Mendoza, Karla Sandoval; Winkler, Cheryl A.; Klitz, William; Romieu, Isabelle; London, Stephanie J.; Gilliland, Frank; Martinez, Fernando; Bustamante, Carlos; Williams, L Keoki; Kumar, Rajesh; Rodríguez-Santana, José R.; Burchard, Esteban G.
2013-01-01
Background: Asthma is a complex disease with both genetic and environmental causes. Genome-wide association studies of asthma have mostly involved European populations and replication of positive associations has been inconsistent. Objective: To identify asthma-associated genes in a large Latino population with genome-wide association analysis and admixture mapping. Methods: Latino children with asthma (n = 1,893) and healthy controls (n = 1,881) were recruited from five sites in the United States: Puerto Rico, New York, Chicago, Houston, and the San Francisco Bay Area. Subjects were genotyped on an Affymetrix World Array IV chip. We performed genome-wide association and admixture mapping to identify asthma-associated loci. Results: We identified a significant association between ancestry and asthma at 6p21 (lowest p-value: rs2523924, p < 5 × 10−6). This association replicates in a meta-analysis of the EVE Asthma Consortium (p = 0.01). Fine mapping of the region in this study and the EVE Asthma Consortium suggests an association between PSORS1C1 and asthma. We confirmed the strong allelic association between the 17q21 locus and asthma in Latinos (IKZF3, lowest p-value: rs90792, OR: 0.67, 95% CI 0.61 – 0.75, p = 6 × 10−13) and replicated associations in several genes that had previously been associated with asthma in genome-wide association studies. Conclusions: Admixture mapping and genome-wide association are complementary techniques that provide evidence for multiple asthma-associated loci in Latinos. Admixture mapping identifies a novel locus on 6p21 that replicates in a meta-analysis of several Latino populations, while genome-wide association confirms the previously identified locus on 17q21. PMID:24406073
A modifier of Huntington's disease onset at the MLH1 locus.
Lee, Jong-Min; Chao, Michael J; Harold, Denise; Abu Elneel, Kawther; Gillis, Tammy; Holmans, Peter; Jones, Lesley; Orth, Michael; Myers, Richard H; Kwak, Seung; Wheeler, Vanessa C; MacDonald, Marcy E; Gusella, James F
2017-10-01
Huntington's disease (HD) is a dominantly inherited neurodegenerative disease caused by an expanded CAG repeat in HTT. Many clinical characteristics of HD such as age at motor onset are determined largely by the size of HTT CAG repeat. However, emerging evidence strongly supports a role for other genetic factors in modifying the disease pathogenesis driven by mutant huntingtin. A recent genome-wide association analysis to discover genetic modifiers of HD onset age provided initial evidence for modifier loci on chromosomes 8 and 15 and suggestive evidence for a locus on chromosome 3. Here, genotyping of candidate single nucleotide polymorphisms in a cohort of 3,314 additional HD subjects yields independent confirmation of the former two loci and moves the third to genome-wide significance at MLH1, a locus whose mouse orthologue modifies CAG length-dependent phenotypes in a Htt-knock-in mouse model of HD. Both quantitative and dichotomous association analyses implicate a functional variant on ∼32% of chromosomes with the beneficial modifier effect that delays HD motor onset by 0.7 years/allele. Genomic DNA capture and sequencing of a modifier haplotype localize the functional variation to a 78 kb region spanning the 3'end of MLH1 and the 5'end of the neighboring LRRFIP2, and marked by an isoleucine-valine missense variant in MLH1. Analysis of expression Quantitative Trait Loci (eQTLs) provides modest support for altered regulation of MLH1 and LRRFIP2, raising the possibility that the modifier affects regulation of both genes. Finally, polygenic modification score and heritability analyses suggest the existence of additional genetic modifiers, supporting expanded, comprehensive genetic analysis of larger HD datasets. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Diversity Outbred Mice at 21: Maintaining Allelic Variation in the Face of Selection
Chesler, Elissa J.; Gatti, Daniel M.; Morgan, Andrew P.; Strobel, Marge; Trepanier, Laura; Oberbeck, Denesa; McWeeney, Shannon; Hitzemann, Robert; Ferris, Martin; McMullan, Rachel; Clayshultle, Amelia; Bell, Timothy A.; de Villena, Fernando Pardo-Manuel; Churchill, Gary A.
2016-01-01
Multi-parent populations (MPPs) capture and maintain the genetic diversity from multiple inbred founder strains to provide a resource for high-resolution genetic mapping through the accumulation of recombination events over many generations. Breeding designs that maintain a large effective population size with randomized assignment of breeders at each generation can minimize the impact of selection, inbreeding, and genetic drift on allele frequencies. Small deviations from expected allele frequencies will have little effect on the power and precision of genetic analysis, but a major distortion could result in reduced power and loss of important functional alleles. We detected strong transmission ratio distortion in the Diversity Outbred (DO) mouse population on chromosome 2, caused by meiotic drive favoring transmission of the WSB/EiJ allele at the R2d2 locus. The distorted region harbors thousands of polymorphisms derived from the seven non-WSB founder strains and many of these would be lost if the sweep was allowed to continue. To ensure the utility of the DO population to study genetic variation on chromosome 2, we performed an artificial selection against WSB/EiJ alleles at the R2d2 locus. Here, we report that we have purged the WSB/EiJ allele from the drive locus while preserving WSB/EiJ alleles in the flanking regions. We observed minimal disruption to allele frequencies across the rest of the autosomal genome. However, there was a shift in haplotype frequencies of the mitochondrial genome and an increase in the rate of an unusual sex chromosome aneuploidy. The DO population has been restored to genome-wide utility for genetic analysis, but our experience underscores that vigilant monitoring of similar genetic resource populations is needed to ensure their long-term utility. PMID:27694113
Immunoglobulin genomics in the guinea pig (Cavia porcellus).
Guo, Yongchen; Bao, Yonghua; Meng, Qingwen; Hu, Xiaoxiang; Meng, Qingyong; Ren, Liming; Li, Ning; Zhao, Yaofeng
2012-01-01
In science, the guinea pig is known as one of the gold standards for modeling human disease. It is especially important as a molecular and cellular biology model for studying the human immune system, as its immunological genes are more similar to human genes than are those of mice. The utility of the guinea pig as a model organism can be enhanced by further characterization of the genes encoding components of the immune system. Here, we report the genomic organization of the guinea pig immunoglobulin (Ig) heavy and light chain genes. The guinea pig IgH locus is located in genomic scaffolds 54 and 75, and spans approximately 6,480 kb. 507 V(H) segments (94 potentially functional genes and 413 pseudogenes), 41 D(H) segments, six J(H) segments, four constant region genes (μ, γ, ε, and α), and one reverse δ remnant fragment were identified within the two scaffolds. Many V(H) pseudogenes were found within the guinea pig, and likely constituted a potential donor pool for gene conversion during evolution. The Igκ locus mapped to a 4,029 kb region of scaffolds 37 and 24 and is composed of 349 V(κ) (111 potentially functional genes and 238 pseudogenes), three J(κ) and one C(κ) genes. The Igλ locus spans 1,642 kb in scaffold 4 and consists of 142 V(λ) (58 potentially functional genes and 84 pseudogenes) and 11 J(λ)-C(λ) clusters. Phylogenetic analysis suggested that the guinea pig's large set of germline V(H) gene segments appears to form limited gene families. Therefore, this species may generate antibody diversity via a gene conversion-like mechanism associated with its pseudogene reserves.
Fogh, Isabella; Ratti, Antonia; Gellera, Cinzia; Lin, Kuang; Tiloca, Cinzia; Moskvina, Valentina; Corrado, Lucia; Sorarù, Gianni; Cereda, Cristina; Corti, Stefania; Gentilini, Davide; Calini, Daniela; Castellotti, Barbara; Mazzini, Letizia; Querin, Giorgia; Gagliardi, Stella; Del Bo, Roberto; Conforti, Francesca L.; Siciliano, Gabriele; Inghilleri, Maurizio; Saccà, Francesco; Bongioanni, Paolo; Penco, Silvana; Corbo, Massimo; Sorbi, Sandro; Filosto, Massimiliano; Ferlini, Alessandra; Di Blasio, Anna M.; Signorini, Stefano; Shatunov, Aleksey; Jones, Ashley; Shaw, Pamela J.; Morrison, Karen E.; Farmer, Anne E.; Van Damme, Philip; Robberecht, Wim; Chiò, Adriano; Traynor, Bryan J.; Sendtner, Michael; Melki, Judith; Meininger, Vincent; Hardiman, Orla; Andersen, Peter M.; Leigh, Nigel P.; Glass, Jonathan D.; Overste, Daniel; Diekstra, Frank P.; Veldink, Jan H.; van Es, Michael A.; Shaw, Christopher E.; Weale, Michael E.; Lewis, Cathryn M.; Williams, Julie; Brown, Robert H.; Landers, John E.; Ticozzi, Nicola; Ceroni, Mauro; Pegoraro, Elena; Comi, Giacomo P.; D'Alfonso, Sandra; van den Berg, Leonard H.; Taroni, Franco; Al-Chalabi, Ammar; Powell, John; Silani, Vincenzo; Brescia Morra, Vincenzo; Filla, Alessandro; Massimo, Filosto; Marsili, Angela; Viviana, Pensato; Puorro, Giorgia; La Bella, Vincenzo; Logroscino, Giancarlo; Monsurrò, Maria Rosaria; Quattrone, Aldo; Simone, Isabella Laura; Ahmeti, Kreshnik B.; Ajroud-Driss, Senda; Armstrong, Jennifer; Birve, Anne; Blauw, Hylke M.; Bruijn, Lucie; Chen, Wenjie; Comeau, Mary C.; Cronin, Simon; Soraya, Gkazi Athina; Grab, Josh D.; Groen, Ewout J.; Haines, Jonathan L.; Heller, Scott; Huang, Jie; Hung, Wu-Yen; Jaworski, James M.; Khan, Humaira; Langefeld, Carl D.; Marion, Miranda C.; McLaughlin, Russell L.; Miller, Jack W.; Mora, Gabriele; Pericak-Vance, Margaret A.; Rampersaud, Evadnie; Siddique, Nailah; Siddique, Teepu; Smith, Bradley N.; Sufit, Robert; Topp, Simon; Vance, Caroline; van Vught, Paul; Yang, Yi; Zheng, J.G.
2014-01-01
Identification of mutations at familial loci for amyotrophic lateral sclerosis (ALS) has provided novel insights into the aetiology of this rapidly progressing fatal neurodegenerative disease. However, genome-wide association studies (GWAS) of the more common (∼90%) sporadic form have been less successful with the exception of the replicated locus at 9p21.2. To identify new loci associated with disease susceptibility, we have established the largest association study in ALS to date and undertaken a GWAS meta-analytical study combining 3959 newly genotyped Italian individuals (1982 cases and 1977 controls) collected by SLAGEN (Italian Consortium for the Genetics of ALS) together with samples from Netherlands, USA, UK, Sweden, Belgium, France, Ireland and Italy collected by ALSGEN (the International Consortium on Amyotrophic Lateral Sclerosis Genetics). We analysed a total of 13 225 individuals, 6100 cases and 7125 controls for almost 7 million single-nucleotide polymorphisms (SNPs). We identified a novel locus with genome-wide significance at 17q11.2 (rs34517613 with P = 1.11 × 10−8; OR 0.82) that was validated when combined with genotype data from a replication cohort (P = 8.62 × 10−9; OR 0.833) of 4656 individuals. Furthermore, we confirmed the previously reported association at 9p21.2 (rs3849943 with P = 7.69 × 10−9; OR 1.16). Finally, we estimated the contribution of common variation to heritability of sporadic ALS as ∼12% using a linear mixed model accounting for all SNPs. Our results provide an insight into the genetic structure of sporadic ALS, confirming that common variation contributes to risk and that sufficiently powered studies can identify novel susceptibility loci. PMID:24256812
Han, Jun; Zhao, Xiaojie; Cui, Yu; Song, Wei; Huo, Naxin; Liang, Yong; Xie, Jingzhong; Wang, Zhenzhong; Wu, Qiuhong; Chen, Yong-Xing; Lu, Ping; Zhang, De-Yun; Wang, Lili; Sun, Hua; Yang, Tsomin; Keeble-Gagnere, Gabriel; Appels, Rudi; Doležel, Jaroslav; Ling, Hong-Qing; Luo, Mingcheng; Gu, Yongqiang; Sun, Qixin; Liu, Zhiyong
2014-01-01
Powdery mildew, caused by Blumeria graminis f. sp. tritici, is one of the most important wheat diseases in the world. In this study, a single dominant powdery mildew resistance gene MlIW172 was identified in the IW172 wild emmer accession and mapped to the distal region of chromosome arm 7AL (bin7AL-16-0.86-0.90) via molecular marker analysis. MlIW172 was closely linked with the RFLP probe Xpsr680-derived STS marker Xmag2185 and the EST markers BE405531 and BE637476. This suggested that MlIW172 might be allelic to the Pm1 locus or a new locus closely linked to Pm1. By screening a genomic BAC library of durum wheat cv. Langdon and a 7AL-specific BAC library of hexaploid wheat cv. Chinese Spring, and after analyzing genome scaffolds of Triticum urartu containing the marker sequences, additional markers were developed to construct a fine genetic linkage map of the MlIW172 locus region and to delineate the resistance gene within a 0.48 cM interval. Comparative genetics analyses using ESTs and RFLP probe sequences flanking the MlIW172 region against other grass species revealed a general co-linearity in this region with the orthologous genomic regions of rice chromosome 6, Brachypodium chromosome 1, and sorghum chromosome 10. However, orthologous resistance gene-like RGA sequences were only present in wheat and Brachypodium. The BAC contigs and sequence scaffolds that we have developed provide a framework for the physical mapping and map-based cloning of MlIW172. PMID:24955773
Bolton, Jennifer L; Hayward, Caroline; Direk, Nese; Lewis, John G; Hammond, Geoffrey L; Hill, Lesley A; Anderson, Anna; Huffman, Jennifer; Wilson, James F; Campbell, Harry; Rudan, Igor; Wright, Alan; Hastie, Nicholas; Wild, Sarah H; Velders, Fleur P; Hofman, Albert; Uitterlinden, Andre G; Lahti, Jari; Räikkönen, Katri; Kajantie, Eero; Widen, Elisabeth; Palotie, Aarno; Eriksson, Johan G; Kaakinen, Marika; Järvelin, Marjo-Riitta; Timpson, Nicholas J; Davey Smith, George; Ring, Susan M; Evans, David M; St Pourcain, Beate; Tanaka, Toshiko; Milaneschi, Yuri; Bandinelli, Stefania; Ferrucci, Luigi; van der Harst, Pim; Rosmalen, Judith G M; Bakker, Stephen J L; Verweij, Niek; Dullaart, Robin P F; Mahajan, Anubha; Lindgren, Cecilia M; Morris, Andrew; Lind, Lars; Ingelsson, Erik; Anderson, Laura N; Pennell, Craig E; Lye, Stephen J; Matthews, Stephen G; Eriksson, Joel; Mellstrom, Dan; Ohlsson, Claes; Price, Jackie F; Strachan, Mark W J; Reynolds, Rebecca M; Tiemeier, Henning; Walker, Brian R
2014-07-01
Variation in plasma levels of cortisol, an essential hormone in the stress response, is associated in population-based studies with cardio-metabolic, inflammatory and neuro-cognitive traits and diseases. Heritability of plasma cortisol is estimated at 30-60% but no common genetic contribution has been identified. The CORtisol NETwork (CORNET) consortium undertook genome wide association meta-analysis for plasma cortisol in 12,597 Caucasian participants, replicated in 2,795 participants. The results indicate that <1% of variance in plasma cortisol is accounted for by genetic variation in a single region of chromosome 14. This locus spans SERPINA6, encoding corticosteroid binding globulin (CBG, the major cortisol-binding protein in plasma), and SERPINA1, encoding α1-antitrypsin (which inhibits cleavage of the reactive centre loop that releases cortisol from CBG). Three partially independent signals were identified within the region, represented by common SNPs; detailed biochemical investigation in a nested sub-cohort showed all these SNPs were associated with variation in total cortisol binding activity in plasma, but some variants influenced total CBG concentrations while the top hit (rs12589136) influenced the immunoreactivity of the reactive centre loop of CBG. Exome chip and 1000 Genomes imputation analysis of this locus in the CROATIA-Korcula cohort identified missense mutations in SERPINA6 and SERPINA1 that did not account for the effects of common variants. These findings reveal a novel common genetic source of variation in binding of cortisol by CBG, and reinforce the key role of CBG in determining plasma cortisol levels. In turn this genetic variation may contribute to cortisol-associated degenerative diseases.
Direk, Nese; Lewis, John G.; Hammond, Geoffrey L.; Hill, Lesley A.; Anderson, Anna; Huffman, Jennifer; Wilson, James F.; Campbell, Harry; Rudan, Igor; Wright, Alan; Hastie, Nicholas; Wild, Sarah H.; Velders, Fleur P.; Hofman, Albert; Uitterlinden, Andre G.; Lahti, Jari; Räikkönen, Katri; Kajantie, Eero; Widen, Elisabeth; Palotie, Aarno; Eriksson, Johan G.; Kaakinen, Marika; Järvelin, Marjo-Riitta; Timpson, Nicholas J.; Davey Smith, George; Ring, Susan M.; Evans, David M.; St Pourcain, Beate; Tanaka, Toshiko; Milaneschi, Yuri; Bandinelli, Stefania; Ferrucci, Luigi; van der Harst, Pim; Rosmalen, Judith G. M.; Bakker, Stephen J. L.; Verweij, Niek; Dullaart, Robin P. F.; Mahajan, Anubha; Lindgren, Cecilia M.; Morris, Andrew; Lind, Lars; Ingelsson, Erik; Anderson, Laura N.; Pennell, Craig E.; Lye, Stephen J.; Matthews, Stephen G.; Eriksson, Joel; Mellstrom, Dan; Ohlsson, Claes; Price, Jackie F.; Strachan, Mark W. J.; Reynolds, Rebecca M.; Tiemeier, Henning; Walker, Brian R.
2014-01-01
Variation in plasma levels of cortisol, an essential hormone in the stress response, is associated in population-based studies with cardio-metabolic, inflammatory and neuro-cognitive traits and diseases. Heritability of plasma cortisol is estimated at 30–60% but no common genetic contribution has been identified. The CORtisol NETwork (CORNET) consortium undertook genome wide association meta-analysis for plasma cortisol in 12,597 Caucasian participants, replicated in 2,795 participants. The results indicate that <1% of variance in plasma cortisol is accounted for by genetic variation in a single region of chromosome 14. This locus spans SERPINA6, encoding corticosteroid binding globulin (CBG, the major cortisol-binding protein in plasma), and SERPINA1, encoding α1-antitrypsin (which inhibits cleavage of the reactive centre loop that releases cortisol from CBG). Three partially independent signals were identified within the region, represented by common SNPs; detailed biochemical investigation in a nested sub-cohort showed all these SNPs were associated with variation in total cortisol binding activity in plasma, but some variants influenced total CBG concentrations while the top hit (rs12589136) influenced the immunoreactivity of the reactive centre loop of CBG. Exome chip and 1000 Genomes imputation analysis of this locus in the CROATIA-Korcula cohort identified missense mutations in SERPINA6 and SERPINA1 that did not account for the effects of common variants. These findings reveal a novel common genetic source of variation in binding of cortisol by CBG, and reinforce the key role of CBG in determining plasma cortisol levels. In turn this genetic variation may contribute to cortisol-associated degenerative diseases. PMID:25010111
Julià, Antonio; Blanco, Francisco; Fernández-Gutierrez, Benjamín; González, Antonio; Cañete, Juan D; Maymó, Joan; Alperi-López, Mercedes; Olivè, Alex; Corominas, Héctor; Martínez-Taboada, Víctor; González-Álvaro, Isidoro; Fernandez-Nebro, Antonio; Erra, Alba; Sánchez-Fernández, Simón; Alonso, Arnald; López-Lasanta, María; Tortosa, Raül; Codó, Laia; Lluis Gelpi, Josep; García-Montero, Andrés C; Bertranpetit, Jaume; Absher, Devin; Myers, Richard M; Tornero, Jesús; Marsal, Sara
2016-06-01
Rheumatoid factor (RF) is a well-established diagnostic and prognostic biomarker in rheumatoid arthritis (RA). However, ∼20% of RA patients are negative for this anti-IgG antibody. To date, only variation at the HLA-DRB1 gene has been associated with the presence of RF. This study was undertaken to identify additional genetic variants associated with RF positivity. A genome-wide association study (GWAS) for RF positivity was performed using an Illumina Quad610 genotyping platform. A total of 937 RF-positive and 323 RF-negative RA patients were genotyped for >550,000 single-nucleotide polymorphisms (SNPs). Association testing was performed using an allelic chi-square test implemented in Plink software. An independent cohort of 472 RF-positive and 190 RF-negative RA patients was used to validate the most significant findings. In the discovery stage, a SNP in the IRX1 locus on chromosome 5p15.3 (SNP rs1502644) showed a genome-wide significant association with RF positivity (P = 4.13 × 10^-8, odds ratio [OR] 0.37 [95% confidence interval (95% CI) 0.26-0.53]). In the validation stage, the association of IRX1 with RF was replicated in an independent group of RA patients (P = 0.034, OR 0.58 [95% CI 0.35-0.97] and combined P = 1.14 × 10^-8, OR 0.43 [95% CI 0.32-0.58]). To our knowledge, this is the first GWAS of RF positivity in RA. Variation at the IRX1 locus on chromosome 5p15.3 is associated with the presence of RF. Our findings indicate that IRX1 and HLA-DRB1 are the strongest genetic factors for RF production in RA. © 2016, American College of Rheumatology.
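The allelic chi-square test named in this abstract can be illustrated with a minimal sketch. It assumes a plain 2x2 table of allele counts (two alleles per patient), a Pearson chi-square with one degree of freedom and no continuity correction, and a Woolf log-based confidence interval. The function name and the allele counts are hypothetical; the abstract does not report per-allele counts, and this is not the Plink implementation.

```python
import math

def allelic_test(case_minor, case_major, ctrl_minor, ctrl_major):
    """Allelic 1-df chi-square test on a 2x2 table of allele counts.

    Returns (chi2, p_value, odds_ratio, ci_low, ci_high).
    """
    a, b, c, d = case_minor, case_major, ctrl_minor, ctrl_major
    n = a + b + c + d
    # Pearson chi-square for a 2x2 table, without continuity correction.
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # Upper-tail probability of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    odds_ratio = (a * d) / (b * c)
    # Woolf (log-based) 95% confidence interval for the odds ratio.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    ci_low = math.exp(math.log(odds_ratio) - 1.96 * se)
    ci_high = math.exp(math.log(odds_ratio) + 1.96 * se)
    return chi2, p_value, odds_ratio, ci_low, ci_high

# Hypothetical allele counts (NOT taken from the study). With two alleles per
# patient, 937 RF-positive patients contribute 1874 alleles and 323
# RF-negative patients contribute 646.
chi2, p, odds, lo, hi = allelic_test(90, 1784, 75, 571)
print(f"chi2={chi2:.2f} p={p:.2e} OR={odds:.2f} 95% CI {lo:.2f}-{hi:.2f}")
```

An odds ratio below 1, as reported for rs1502644, means the tested allele is less frequent among RF-positive than RF-negative patients.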
Mohorianu, Irina; Stocks, Matthew Benedict; Wood, John; Dalmay, Tamas; Moulton, Vincent
2013-01-01
Small RNAs (sRNAs) are 20–25 nt non-coding RNAs that act as guides for the highly sequence-specific regulatory mechanism known as RNA silencing. Due to the recent increase in sequencing depth, a highly complex and diverse population of sRNAs in both plants and animals has been revealed. However, the exponential increase in sequencing data has also made the identification of individual sRNA transcripts corresponding to biological units (sRNA loci) more challenging when based exclusively on the genomic location of the constituent sRNAs, hindering existing approaches to identify sRNA loci. To infer the location of significant biological units, we propose an approach for sRNA loci detection called CoLIde (Co-expression based sRNA Loci Identification) that combines genomic location with the analysis of other information such as variation in expression levels (expression pattern) and size class distribution. For CoLIde, we define a locus as a union of regions sharing the same pattern and located in close proximity on the genome. Biological relevance, detected through the analysis of size class distribution, is also calculated for each locus. CoLIde can be applied on ordered (e.g., time-dependent) or un-ordered (e.g., organ, mutant) series of samples both with or without biological/technical replicates. The method reliably identifies known types of loci and shows improved performance on sequencing data from both plants (e.g., A. thaliana, S. lycopersicum) and animals (e.g., D. melanogaster) when compared with existing locus detection techniques. CoLIde is available for use within the UEA Small RNA Workbench which can be downloaded from: http://srna-workbench.cmp.uea.ac.uk. PMID:23851377
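The locus definition given here (a union of regions that share an expression pattern and lie close together on the genome) can be sketched as a simple interval merge. This is an illustration under stated assumptions, not the CoLIde algorithm: the `Region` class, the string encoding of patterns, and the `max_gap` threshold are hypothetical, and the significance and size-class steps described in the abstract are omitted.

```python
from dataclasses import dataclass

@dataclass
class Region:
    start: int
    end: int
    pattern: str  # hypothetical encoding, e.g. "UD" = up then down across samples

def merge_into_loci(regions, max_gap=50):
    """Join neighbouring regions that share a pattern and lie within max_gap nt.

    Regions are scanned in genomic order; a region is merged only into the
    locus immediately before it, so an intervening region with a different
    pattern starts a new locus.
    """
    loci = []
    for region in sorted(regions, key=lambda r: r.start):
        last = loci[-1] if loci else None
        if (last is not None
                and last.pattern == region.pattern
                and region.start - last.end <= max_gap):
            last.end = max(last.end, region.end)
        else:
            loci.append(Region(region.start, region.end, region.pattern))
    return loci

# Made-up coordinates for illustration only.
regions = [
    Region(100, 130, "UD"),
    Region(150, 175, "UD"),   # same pattern, 20 nt away: merged
    Region(400, 430, "UD"),   # same pattern but 225 nt away: new locus
    Region(440, 470, "DU"),   # different pattern: new locus
]
for locus in merge_into_loci(regions):
    print(locus)
```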
Mohorianu, Irina; Stocks, Matthew Benedict; Wood, John; Dalmay, Tamas; Moulton, Vincent
2013-07-01
Small RNAs (sRNAs) are 20-25 nt non-coding RNAs that act as guides for the highly sequence-specific regulatory mechanism known as RNA silencing. Due to the recent increase in sequencing depth, a highly complex and diverse population of sRNAs in both plants and animals has been revealed. However, the exponential increase in sequencing data has also made the identification of individual sRNA transcripts corresponding to biological units (sRNA loci) more challenging when based exclusively on the genomic location of the constituent sRNAs, hindering existing approaches to identify sRNA loci. To infer the location of significant biological units, we propose an approach for sRNA loci detection called CoLIde (Co-expression based sRNA Loci Identification) that combines genomic location with the analysis of other information such as variation in expression levels (expression pattern) and size class distribution. For CoLIde, we define a locus as a union of regions sharing the same pattern and located in close proximity on the genome. Biological relevance, detected through the analysis of size class distribution, is also calculated for each locus. CoLIde can be applied on ordered (e.g., time-dependent) or un-ordered (e.g., organ, mutant) series of samples both with or without biological/technical replicates. The method reliably identifies known types of loci and shows improved performance on sequencing data from both plants (e.g., A. thaliana, S. lycopersicum) and animals (e.g., D. melanogaster) when compared with existing locus detection techniques. CoLIde is available for use within the UEA Small RNA Workbench which can be downloaded from: http://srna-workbench.cmp.uea.ac.uk.
Ma, Jiale; Sun, Min; Bao, Yinli; Pan, Zihao; Zhang, Wei; Lu, Chengping; Yao, Huochun
2013-12-01
Avian pathogenic Escherichia coli (APEC) strains frequently cause extra-intestinal infections and significant economic losses. Recent studies revealed that the type VI secretion system (T6SS) is involved in APEC pathogenesis. Here we provide the first evidence of three distinguishable and conserved T6SS loci in APEC genomes. In addition, we present the prevalence and comparative genomic analysis of these three T6SS loci in 472 APEC isolates. The T6SS1, T6SS2 and T6SS3 loci were present in 14.62% (69/472), 2.33% (11/472) and 0.85% (4/472) of the APEC isolates, respectively, and >85% of the strains containing T6SS loci belonged to the virulent phylogenetic groups D and B2. Comprehensive analysis showed prominent characteristics of the T6SS1 locus, including wide prevalence, rich sequence diversity, versatile VgrG islands and excellent expression competence in various E. coli pathotypes. In contrast, the T6SS2 locus was associated with ECOR group B2, showed sequence conservation, and was expressed only in meningitis E. coli. The T6SS3 locus was encoded in a negligible number of APEC isolates and lacked several key genes. An in-depth analysis of VgrG proteins indicated that their COG4253 and gp27 domains were involved in the transport of putative effector islands and recognition of host cells, respectively, indicating that VgrG proteins play an important role in T6SS function. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.
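The reported prevalences follow directly from the stated counts out of 472 isolates. The short check below recomputes them; the variable and function names are illustrative only.

```python
TOTAL_ISOLATES = 472
# Positive isolates per locus, as stated in the abstract.
positives = {"T6SS1": 69, "T6SS2": 11, "T6SS3": 4}

def prevalence_percent(count, total):
    """Prevalence as a percentage, rounded to two decimals."""
    return round(100.0 * count / total, 2)

prevalence = {
    locus: prevalence_percent(n, TOTAL_ISOLATES) for locus, n in positives.items()
}
print(prevalence)
```

The rounded values reproduce the 14.62%, 2.33% and 0.85% given in the abstract.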
2012-01-01
Background Cotton is the world’s most important natural textile fiber and a significant oilseed crop. Decoding cotton genomes will provide the ultimate reference and resource for research and utilization of the species. Integration of high-density genetic maps with genomic sequence information will largely accelerate the process of whole-genome assembly in cotton. Results In this paper, we update a high-density interspecific genetic linkage map of allotetraploid cultivated cotton. An additional 1,167 marker loci have been added to our previously published map of 2,247 loci. Three new marker types, InDel (insertion-deletion) and SNP (single nucleotide polymorphism) developed from gene information, and REMAP (retrotransposon-microsatellite amplified polymorphism), were used to increase map density. The updated map consists of 3,414 loci in 26 linkage groups covering 3,667.62 cM with an average inter-locus distance of 1.08 cM. Furthermore, genome-wide sequence analysis was finished using 3,324 informative sequence-based markers and publicly-available Gossypium DNA sequence information. A total of 413,113 EST and 195 BAC sequences were physically anchored and clustered by 3,324 sequence-based markers. Of these, 14,243 ESTs and 188 BACs from different species of Gossypium were clustered and specifically anchored to the high-density genetic map. A total of 2,748 candidate unigenes from 2,111 ESTs clusters and 63 BACs were mined for functional annotation and classification. The 337 ESTs/genes related to fiber quality traits were integrated with 132 previously reported cotton fiber quality quantitative trait loci, which demonstrated the important roles in fiber quality of these genes. Higher-level sequence conservation between different cotton species and between the A- and D-subgenomes in tetraploid cotton was found, indicating a common evolutionary origin for orthologous and paralogous loci in Gossypium. 
Conclusion This study will serve as a valuable genomic resource for tetraploid cotton genome assembly, for cloning genes related to superior agronomic traits, and for further comparative genomic analyses in Gossypium. PMID:23046547
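The map statistics in the Results above can be cross-checked with simple arithmetic. The sketch assumes that the average inter-locus distance is the total map length divided by the number of intervals between adjacent loci (loci minus linkage groups); the abstract does not state how the average was computed, so this is one reading that reproduces the reported figure.

```python
previous_loci = 2247
added_loci = 1167
linkage_groups = 26
map_length_cm = 3667.62

total_loci = previous_loci + added_loci
# A linkage group with n loci has n - 1 intervals between adjacent loci,
# so the whole map has (total loci - number of linkage groups) intervals.
intervals = total_loci - linkage_groups
mean_interval_cm = map_length_cm / intervals

print(total_loci)                   # total loci on the updated map
print(round(mean_interval_cm, 2))   # average inter-locus distance in cM
```

Dividing by the number of loci instead of the number of intervals gives about 1.07 cM, so the interval-based reading matches the reported 1.08 cM more closely.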
Lu, Wei; Liu, Jun; Xin, Qiang; Wan, Lili; Hong, Dengfeng; Yang, Guangsheng
2013-01-01
Background and Aims Spontaneous male sterility is an advantageous trait for both constructing efficient pollination control systems and for understanding the developmental process of the male reproductive unit in many crops. A triallelic genetic male-sterile locus (BnMs5) has been identified in Brassica napus; however, its complicated genome structure has greatly hampered the isolation of this locus. The aim of this study was to physically map BnMs5 through an integrated map-based cloning strategy and analyse the local chromosomal evolution around BnMs5. Methods A large F2 population was used to integrate the existing genetic maps around BnMs5. A map-based cloning strategy in combination with comparative mapping among B. napus, Arabidopsis, Brassica rapa and Brassica oleracea was employed to facilitate the identification of a target bacterial artificial chromosome (BAC) clone covering the BnMs5 locus. The genomic sequences from the Brassica species were analysed to reveal the regional chromosomal evolution around BnMs5. Key Results BnMs5 was finally delimited to a 0·3-cM genetic fragment from an integrated local genetic map, and was anchored on the B. napus A8 chromosome. Screening of a B. napus BAC clone library and identification of the positive clones validated that JBnB034L06 was the target BAC clone. The closest flanking markers restrict BnMs5 to a 21-kb region on JBnB034L06 containing six predicted functional genes. Good collinearity relationship around BnMs5 between several Brassica species was observed, while violent chromosomal evolutionary events including insertions/deletions, duplications and single nucleotide mutations were also found to have extensively occurred during their divergence. Conclusions This work represents major progress towards the molecular cloning of BnMs5, as well as presenting a powerful, integrative method to mapping loci in plants with complex genomic architecture, such as the amphidiploid B. napus. PMID:23243189
Genome-wide meta-analyses of stratified depression in Generation Scotland and UK Biobank.
Hall, Lynsey S; Adams, Mark J; Arnau-Soler, Aleix; Clarke, Toni-Kim; Howard, David M; Zeng, Yanni; Davies, Gail; Hagenaars, Saskia P; Maria Fernandez-Pujals, Ana; Gibson, Jude; Wigmore, Eleanor M; Boutin, Thibaud S; Hayward, Caroline; Scotland, Generation; Porteous, David J; Deary, Ian J; Thomson, Pippa A; Haley, Chris S; McIntosh, Andrew M
2018-01-10
Few replicable genetic associations for Major Depressive Disorder (MDD) have been identified. Recent studies of MDD have identified common risk variants by using a broader phenotype definition in very large samples, or by reducing phenotypic and ancestral heterogeneity. We sought to ascertain whether it is more informative to maximize the sample size using data from all available cases and controls, or to use a sex or recurrent stratified subset of affected individuals. To test this, we compared heritability estimates, genetic correlation with other traits, variance explained by MDD polygenic score, and variants identified by genome-wide meta-analysis for broad and narrow MDD classifications in two large British cohorts - Generation Scotland and UK Biobank. Genome-wide meta-analysis of MDD in males yielded one genome-wide significant locus on 3p22.3, with three genes in this region (CRTAP, GLB1, and TMPPE) demonstrating a significant association in gene-based tests. Meta-analyzed MDD, recurrent MDD and female MDD yielded equivalent heritability estimates, showed no detectable difference in association with polygenic scores, and were each genetically correlated with six health-correlated traits (neuroticism, depressive symptoms, subjective well-being, MDD, a cross-disorder phenotype and Bipolar Disorder). Whilst stratified GWAS analysis revealed a genome-wide significant locus for male MDD, the lack of independent replication, and the consistent pattern of results in other MDD classifications suggests that phenotypic stratification using recurrence or sex in currently available sample sizes is currently weakly justified. Based upon existing studies and our findings, the strategy of maximizing sample sizes is likely to provide the greater gain.
Gilbert, Maarten J; Miller, William G; Yee, Emma; Kik, Marja; Zomer, Aldert L; Wagenaar, Jaap A; Duim, Birgitta
2016-10-05
Campylobacter iguaniorum is most closely related to the species C. fetus, C. hyointestinalis, and C. lanienae. Reptiles, chelonians and lizards in particular, appear to be a primary reservoir of this Campylobacter species. Here we report the genome comparison of C. iguaniorum strain 1485E, isolated from a bearded dragon (Pogona vitticeps), and strain 2463D, isolated from a green iguana (Iguana iguana), with the genomes of closely related taxa, in particular with reptile-associated C. fetus subsp. testudinum. In contrast to C. fetus, C. iguaniorum is lacking an S-layer encoding region. Furthermore, a defined lipooligosaccharide biosynthesis locus, encoding multiple glycosyltransferases and bounded by waa genes, is absent from C. iguaniorum. Instead, multiple predicted glycosylation regions were identified in C. iguaniorum. One of these regions is > 50 kb with deviant G + C content, suggesting acquisition via lateral transfer. These similar, but non-homologous glycosylation regions were located at the same position on the genome in both strains. Multiple genes encoding respiratory enzymes not identified to date within the C. fetus clade were present. C. iguaniorum shared highest homology with C. hyointestinalis and C. fetus. As in reptile-associated C. fetus subsp. testudinum, a putative tricarballylate catabolism locus was identified. However, despite colonizing a shared host, no recent recombination between both taxa was detected. This genomic study provides a better understanding of host adaptation, virulence, phylogeny, and evolution of C. iguaniorum and related Campylobacter taxa. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Romero Navarro, J. Alberto; Phillips-Mora, Wilbert; Arciniegas-Leal, Adriana; Mata-Quirós, Allan; Haiminen, Niina; Mustiga, Guiliana; Livingstone III, Donald; van Bakel, Harm; Kuhn, David N.; Parida, Laxmi; Kasarskis, Andrew; Motamayor, Juan C.
2017-01-01
Chocolate is a highly valued and palatable confectionery product. Chocolate is primarily made from the processed seeds of the tree species Theobroma cacao. Cacao cultivation is highly relevant for small-holder farmers throughout the tropics, yet its productivity remains limited by low yields and widespread pathogens. A panel of 148 improved cacao clones was assembled based on productivity and disease resistance, and phenotypic single-tree replicated clonal evaluation was performed for 8 years. Using high-density markers, the diversity of clones was expressed relative to 10 known ancestral cacao populations, and significant effects of ancestry were observed in productivity and disease resistance. Genome-wide association (GWA) was performed, and six markers were significantly associated with frosty pod disease resistance. In addition, genomic selection was performed, and consistent with the observed extensive linkage disequilibrium, high predictive ability was observed at low marker densities for all traits. Finally, quantitative trait locus mapping and differential expression analysis of two cultivars with contrasting disease phenotypes were performed to identify genes underlying frosty pod disease resistance, identifying a significant quantitative trait locus and 35 differentially expressed genes using two independent differential expression analyses. These results indicate that in breeding populations of heterozygous and recently admixed individuals, mapping approaches can be used for low-complexity traits such as pod color in cacao or, in other species, single-gene disease resistance; however, genomic selection for quantitative traits remains highly effective relative to mapping. Our results can help guide the breeding process for sustainable improved cacao productivity. PMID:29184558
Soler-Bistué, Alfonso; Mondotte, Juan A.; Bland, Michael Jason; Val, Marie-Eve; Saleh, María-Carla; Mazel, Didier
2015-01-01
The effects on cell physiology of gene order within the bacterial chromosome are poorly understood. In silico approaches have shown that genes involved in transcription and translation processes, in particular ribosomal protein (RP) genes, localize near the replication origin (oriC) in fast-growing bacteria suggesting that such a positional bias is an evolutionarily conserved growth-optimization strategy. Such genomic localization could either provide a higher dosage of these genes during fast growth or facilitate the assembly of ribosomes and transcription foci by keeping physically close the many components of these macromolecular machines. To explore this, we used novel recombineering tools to create a set of Vibrio cholerae strains in which S10-spec-α (S10), a locus bearing half of the ribosomal protein genes, was systematically relocated to alternative genomic positions. We show that the relative distance of S10 to the origin of replication tightly correlated with a reduction of S10 dosage, mRNA abundance and growth rate within these otherwise isogenic strains. Furthermore, this was accompanied by a significant reduction in the host-invasion capacity in Drosophila melanogaster. Both phenotypes were rescued in strains bearing two S10 copies highly distal to oriC, demonstrating that replication-dependent gene dosage reduction is the main mechanism behind these alterations. Hence, S10 positioning connects genome structure to cell physiology in Vibrio cholerae. Our results show experimentally for the first time that genomic positioning of genes involved in the flux of genetic information conditions global growth control and hence bacterial physiology and potentially its evolution. PMID:25875621
In-depth Investigation of Genetic Region Identifies Mechanism that Contributes to Cancer Risk
Investigators in the Laboratory of Translational Genomics have identified a genetic variant in a multi-cancer risk locus at chromosome 5p15.33 that explains, at least in part, the molecular mechanism through which this variant influences cancer risk.
Evolutionary biology: microsporidia sex--a missing link to fungi.
Dyer, Paul S
2008-11-11
The evolutionary origins of the microsporidia, a group of intracellular eukaryotic pathogens, have been unclear. Genome analysis of a sex locus and other gene clusters has now revealed conserved synteny with zygomycete fungi, indicating that microsporidia are true fungi descended from a zygomycete ancestor.
Mapping of the Gynoecy in Bitter Gourd (Momordica charantia) Using RAD-Seq Analysis
Matsumura, Hideo; Miyagi, Norimichi; Taniai, Naoki; Fukushima, Mai; Tarora, Kazuhiko; Shudo, Ayano; Urasaki, Naoya
2014-01-01
Momordica charantia is a monoecious plant of the Cucurbitaceae family that has both male and female unisexual flowers. Its unique gynoecious line, OHB61-5, is essential as a maternal parent in the production of F1 cultivars. To identify the DNA markers for this gynoecy, a RAD-seq (restriction-associated DNA tag sequencing) analysis was employed to reveal genome-wide DNA polymorphisms and to genotype the F2 progeny from a cross between OHB61-5 and a monoecious line. Based on a RAD-seq analysis of F2 individuals, a linkage map was constructed using 552 co-dominant markers. In addition, after analyzing the pooled genomic DNA from monoecious or gynoecious F2 plants, several SNP loci that are genetically linked to gynoecy were identified. GTFL-1, the closest SNP locus to the putative gynoecious locus, was converted to a conventional DNA marker using invader assay technology, which is applicable to the marker-assisted selection of gynoecy in M. charantia breeding. PMID:24498029
Inheritable Silencing of Endogenous Genes by Hit-and-Run Targeted Epigenetic Editing.
Amabile, Angelo; Migliara, Alessandro; Capasso, Paola; Biffi, Mauro; Cittaro, Davide; Naldini, Luigi; Lombardo, Angelo
2016-09-22
Gene silencing is instrumental to interrogate gene function and holds promise for therapeutic applications. Here, we repurpose the endogenous retroviruses' silencing machinery of embryonic stem cells to stably silence three highly expressed genes in somatic cells by epigenetics. This was achieved by transiently expressing combinations of engineered transcriptional repressors that bind to and synergize at the target locus to instruct repressive histone marks and de novo DNA methylation, thus ensuring long-term memory of the repressive epigenetic state. Silencing was highly specific, as shown by genome-wide analyses, sharply confined to the targeted locus without spreading to nearby genes, resistant to activation induced by cytokine stimulation, and relieved only by targeted DNA demethylation. We demonstrate the portability of this technology by multiplex gene silencing, adopting different DNA binding platforms and interrogating thousands of genomic loci in different cell types, including primary T lymphocytes. Targeted epigenome editing might have broad application in research and medicine. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Structure of the human MSH2 locus and analysis of two Muir-Torre kindreds for msh2 mutations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kolodner, R.D.; Lipford, J.; Kane, M.F.
1994-12-01
Hereditary nonpolyposis colorectal carcinoma (HNPCC) is a major cancer susceptibility syndrome known to be caused by inheritance of mutations in genes such as hMSH2 and hMLH1, which encode components of a DNA mismatch repair system. The MSH2 genomic locus has been cloned and shown to cover approximately 73 kb of genomic DNA and to contain 16 exons. The sequence of all of the intron-exon junctions has been determined and used to develop methods for analyzing each MSH2 exon for mutations. These methods have been used to analyze two large HNPCC kindreds exhibiting features of the Muir-Torre syndrome and demonstrate that cancer susceptibility is due to the inheritance of a frameshift mutation in the MSH2 gene in one family and a nonsense mutation in the MSH2 gene in the other family. 59 refs., 5 figs., 1 tab.
Polymer physics predicts the effects of structural variants on chromatin architecture.
Bianco, Simona; Lupiáñez, Darío G; Chiariello, Andrea M; Annunziatella, Carlo; Kraft, Katerina; Schöpflin, Robert; Wittler, Lars; Andrey, Guillaume; Vingron, Martin; Pombo, Ana; Mundlos, Stefan; Nicodemi, Mario
2018-05-01
Structural variants (SVs) can result in changes in gene expression due to abnormal chromatin folding and cause disease. However, the prediction of such effects remains a challenge. Here we present a polymer-physics-based approach (PRISMR) to model 3D chromatin folding and to predict enhancer-promoter contacts. PRISMR predicts higher-order chromatin structure from genome-wide chromosome conformation capture (Hi-C) data. Using the EPHA4 locus as a model, the effects of pathogenic SVs are predicted in silico and compared to Hi-C data generated from mouse limb buds and patient-derived fibroblasts. PRISMR deconvolves the folding complexity of the EPHA4 locus and identifies SV-induced ectopic contacts and alterations of 3D genome organization in homozygous or heterozygous states. We show that SVs can reconfigure topologically associating domains, thereby producing extensive rewiring of regulatory interactions and causing disease by gene misexpression. PRISMR can be used to predict interactions in silico, thereby providing a tool for analyzing the disease-causing potential of SVs.
Belbin, Gillian Morven; Odgis, Jacqueline; Sorokin, Elena P; Yee, Muh-Ching; Kohli, Sumita; Glicksberg, Benjamin S; Gignoux, Christopher R; Wojcik, Genevieve L; Van Vleck, Tielman; Jeff, Janina M; Linderman, Michael; Schurmann, Claudia; Ruderfer, Douglas; Cai, Xiaoqiang; Merkelson, Amanda; Justice, Anne E; Young, Kristin L; Graff, Misa; North, Kari E; Peters, Ulrike; James, Regina; Hindorff, Lucia; Kornreich, Ruth; Edelmann, Lisa; Gottesman, Omri; Stahl, Eli EA; Cho, Judy H; Loos, Ruth JF; Bottinger, Erwin P; Nadkarni, Girish N; Abul-Husn, Noura S
2017-01-01
Achieving confidence in the causality of a disease locus is a complex task that often requires supporting data from both statistical genetics and clinical genomics. Here we describe a combined approach to identify and characterize a genetic disorder that leverages distantly related patients in a health system and population-scale mapping. We utilize genomic data to uncover components of distant pedigrees, in the absence of recorded pedigree information, in the multi-ethnic BioMe biobank in New York City. By linking to medical records, we discover a locus associated with both elevated genetic relatedness and extreme short stature. We link the gene, COL27A1, with a little-known genetic disease, previously thought to be rare and recessive. We demonstrate that disease manifests in both heterozygotes and homozygotes, indicating a common collagen disorder impacting up to 2% of individuals of Puerto Rican ancestry, leading to a better understanding of the continuum of complex and Mendelian disease. PMID:28895531
Lenz, Tobias L.; Mueller, Birte; Trillmich, Fritz; Wolf, Jochen B. W.
2013-01-01
It is still debated whether main individual fitness differences in natural populations can be attributed to genome-wide effects or to particular loci of outstanding functional importance such as the major histocompatibility complex (MHC). In a long-term monitoring project on Galápagos sea lions (Zalophus wollebaeki), we collected comprehensive fitness and mating data for a total of 506 individuals. Controlling for genome-wide inbreeding, we find strong associations between the MHC locus and nearly all fitness traits. The effect was mainly attributable to MHC sequence divergence and could be decomposed into contributions of own and maternal genotypes. In consequence, the population seems to have evolved a pool of highly divergent alleles conveying near-optimal MHC divergence even by random mating. Our results demonstrate that a single locus can significantly contribute to fitness in the wild and provide conclusive evidence for the ‘divergent allele advantage’ hypothesis, a special form of balancing selection with interesting evolutionary implications. PMID:23677346
Neuroblastoma is a paediatric malignancy that typically arises in early childhood, and is derived from the developing sympathetic nervous system. Clinical phenotypes range from localized tumours with excellent outcomes to widely metastatic disease in which long-term survival is approximately 40% despite intensive therapy. A previous genome-wide association study identified common polymorphisms at the LMO1 gene locus that are highly associated with neuroblastoma susceptibility and oncogenic addiction to LMO1 in the tumour cells.
Ferret: a user-friendly Java tool to extract data from the 1000 Genomes Project.
Limou, Sophie; Taverner, Andrew M; Winkler, Cheryl A
2016-07-15
The 1000 Genomes (1KG) Project provides a near-comprehensive resource on human genetic variation in worldwide reference populations. 1KG variants can be accessed through a browser and through the raw and annotated data that are regularly released on an ftp server. We developed Ferret, a user-friendly Java tool, to easily extract genetic variation information from these large and complex data files. From a locus, gene(s) or SNP(s) of interest, Ferret retrieves genotype data for 1KG SNPs and indels, and computes allelic frequencies for 1KG populations and optionally, for the Exome Sequencing Project populations. By converting the 1KG data into files that can be imported into popular pre-existing tools (e.g. PLINK and HaploView), Ferret offers a straightforward way, even for non-bioinformatics specialists, to manipulate, explore and merge 1KG data with the user's dataset, as well as visualize linkage disequilibrium patterns, infer haplotypes and design tagSNPs. Ferret tool and source code are publicly available at http://limousophie35.github.io/Ferret/. Contact: ferret@nih.gov. Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.
Pincot, Dominique D A; Poorten, Thomas J; Hardigan, Michael A; Harshman, Julia M; Acharya, Charlotte B; Cole, Glenn S; Gordon, Thomas R; Stueven, Michelle; Edger, Patrick P; Knapp, Steven J
2018-05-04
Fusarium wilt, a soil-borne disease caused by the fungal pathogen Fusarium oxysporum f. sp. fragariae, threatens strawberry (Fragaria × ananassa) production worldwide. The spread of the pathogen, coupled with disruptive changes in soil fumigation practices, has greatly increased disease pressure and the importance of developing resistant cultivars. While resistant and susceptible cultivars have been reported, a limited number of germplasm accessions have been analyzed, and contradictory conclusions have been reached in earlier studies to elucidate the underlying genetic basis of resistance. Here, we report the discovery of Fw1, a dominant gene conferring resistance to Fusarium wilt in strawberry. The Fw1 locus was uncovered in a genome-wide association study of 565 historically and commercially important strawberry accessions genotyped with 14,408 SNP markers. Fourteen SNPs in linkage disequilibrium with Fw1 physically mapped to a 2.3 Mb segment on chromosome 2 in a diploid F. vesca reference genome. Fw1 and 11 tightly linked GWAS-significant SNPs mapped to linkage group 2C in octoploid segregating populations. The most significant SNP explained 85% of the phenotypic variability and predicted resistance in 97% of the accessions tested; broad-sense heritability was 0.96. Several disease resistance and defense-related gene homologs, including a small cluster of genes encoding nucleotide-binding leucine-rich-repeat proteins, were identified in the 0.7 Mb genomic segment predicted to harbor Fw1. DNA variants and candidate genes identified in the present study should facilitate the development of high-throughput genotyping assays for accurately predicting Fusarium wilt phenotypes and applying marker-assisted selection. Copyright © 2018 Pincot et al.
Harms, Klaus; Lunnan, Asbjørn; Hülter, Nils; Mourier, Tobias; Vinner, Lasse; Andam, Cheryl P.; Marttinen, Pekka; Fridholm, Helena; Hansen, Anders Johannes; Hanage, William P.; Nielsen, Kaare Magne; Willerslev, Eske; Johnsen, Pål Jarle
2016-01-01
In a screen for unexplained mutation events we identified a previously unrecognized mechanism generating clustered DNA polymorphisms such as microindels and cumulative SNPs. The mechanism, short-patch double illegitimate recombination (SPDIR), enables short single-stranded DNA molecules to invade and replace genomic DNA through two joint illegitimate recombination events. SPDIR is controlled by key components of the cellular genome maintenance machinery in the gram-negative bacterium Acinetobacter baylyi. The source DNA is primarily intragenomic but can also be acquired through horizontal gene transfer. The DNA replacements are nonreciprocal and locus independent. Bioinformatic approaches reveal occurrence of SPDIR events in the gram-positive human pathogen Streptococcus pneumoniae and in the human genome. PMID:27956618
Lee, Ciaran M; Cradick, Thomas J; Fine, Eli J; Bao, Gang
2016-01-01
The rapid advancement in targeted genome editing using engineered nucleases such as ZFNs, TALENs, and CRISPR/Cas9 systems has resulted in a suite of powerful methods that allows researchers to target any genomic locus of interest. A complementary set of design tools has been developed to aid researchers with nuclease design, target site selection, and experimental validation. Here, we review the various tools available for target selection in designing engineered nucleases, and for quantifying nuclease activity and specificity, including web-based search tools and experimental methods. We also elucidate challenges in target selection, especially in predicting off-target effects, and discuss future directions in precision genome editing and its applications. PMID:26750397
Gopalakrishnan, Shyam; Samaniego Castruita, Jose A; Sinding, Mikkel-Holger S; Kuderna, Lukas F K; Räikkönen, Jannikke; Petersen, Bent; Sicheritz-Ponten, Thomas; Larson, Greger; Orlando, Ludovic; Marques-Bonet, Tomas; Hansen, Anders J; Dalén, Love; Gilbert, M Thomas P
2017-06-29
An increasing number of studies are addressing the evolutionary genomics of dog domestication, principally through resequencing dog, wolf and related canid genomes. There is, however, only one de novo assembled canid genome currently available against which to map such data - that of a boxer dog (Canis lupus familiaris). We generated the first de novo wolf genome (Canis lupus lupus) as an additional choice of reference, and explored what implications may arise when previously published dog and wolf resequencing data are remapped to this reference. Reassuringly, we find that regardless of the reference genome choice, most evolutionary genomic analyses yield qualitatively similar results, including those exploring the structure between the wolves and dogs using admixture and principal component analysis. However, we do observe differences in the genomic coverage of re-mapped samples, the number of variants discovered, and heterozygosity estimates of the samples. In conclusion, the choice of reference is dictated by the aims of the study being undertaken; if the study focuses on the differences between the different dog breeds or the fine structure among dogs, then using the boxer reference genome is appropriate, but if the aim of the study is to look at the variation within wolves and their relationships to dogs, then there are clear benefits to using the de novo assembled wolf reference genome.
Locus category based analysis of a large genome-wide association study of rheumatoid arthritis
Freudenberg, Jan; Lee, Annette T.; Siminovitch, Katherine A.; Amos, Christopher I.; Ballard, David; Li, Wentian; Gregersen, Peter K.
2010-01-01
To pinpoint true positive single-nucleotide polymorphism (SNP) associations in a genome-wide association study (GWAS) of rheumatoid arthritis (RA), we categorize genetic loci by external knowledge. We test both the ‘enrichment of associated loci’ in a locus category and the ‘combined association’ of a locus category. The former is quantified by the odds ratio for the presence of SNP associations at the loci of a category, whereas the latter is quantified by the number of loci in a category that have SNP associations. These measures are compared with their expected values as obtained from the permutation of the affection status. To account for linkage disequilibrium (LD) among SNPs, we view each LD block as a genetic locus. Positional candidates were defined as loci implicated by earlier GWAS results, whereas functional candidates were defined by annotations regarding the molecular roles of genes, such as gene ontology categories. As expected, immune-related categories show the largest enrichment signal, although it is not very strong. The intersection of positional and functional candidate information predicts novel RA loci near the genes TEC/TXK, MBL2 and PIK3R1/CD180. Notably, a combined association signal is not only produced by immune-related categories, but also by most other categories and even randomly defined categories. The unspecific quality of these signals limits the possible conclusions from combined association tests. It also reduces the magnitude of enrichment test results. These unspecific signals might result from common variants of small effect and hardly concentrated in candidate categories, or an inflated size of associated regions from weak LD with infrequent mutations. PMID:20639398
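The enrichment measure described above (an odds ratio compared with its permutation expectation) can be sketched as follows. This is a minimal illustration, not the authors' implementation: it shuffles association flags across loci, a simpler stand-in for the permutation of affection status used in the study, and the function names, the 0.5 continuity correction, and the toy counts are all assumptions made for the example.

```python
import random

def odds_ratio(in_category, associated):
    """Odds ratio for a locus being associated inside vs. outside a category.

    Adds 0.5 to every cell (an assumption of this sketch) so that an empty
    cell cannot cause a division by zero.
    """
    a = b = c = d = 0
    for cat, assoc in zip(in_category, associated):
        if cat and assoc:
            a += 1          # in category, associated
        elif cat:
            b += 1          # in category, not associated
        elif assoc:
            c += 1          # outside category, associated
        else:
            d += 1          # outside category, not associated
    return ((a + 0.5) * (d + 0.5)) / ((b + 0.5) * (c + 0.5))

def permutation_p(in_category, associated, n_perm=2000, seed=0):
    """One-sided permutation p-value for enrichment of associated loci.

    NOTE: shuffles association flags across loci; the study itself permutes
    the affection status of individuals, which this sketch does not do.
    """
    rng = random.Random(seed)
    observed = odds_ratio(in_category, associated)
    shuffled = list(associated)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if odds_ratio(in_category, shuffled) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)

# Hypothetical toy data: 200 loci (e.g. LD blocks), 20 of them in the category.
# 8 of the 20 category loci and 12 of the 180 other loci carry an association.
in_category = [True] * 20 + [False] * 180
associated = [True] * 8 + [False] * 12 + [True] * 12 + [False] * 168
observed_or, p_value = permutation_p(in_category, associated)
```

Treating each LD block as one locus, as the abstract does, corresponds here to one list entry per block.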
Fine mapping of the celiac disease-associated LPP locus reveals a potential functional variant.
Almeida, Rodrigo; Ricaño-Ponce, Isis; Kumar, Vinod; Deelen, Patrick; Szperl, Agata; Trynka, Gosia; Gutierrez-Achury, Javier; Kanterakis, Alexandros; Westra, Harm-Jan; Franke, Lude; Swertz, Morris A; Platteel, Mathieu; Bilbao, Jose Ramon; Barisani, Donatella; Greco, Luigi; Mearin, Luisa; Wolters, Victorien M; Mulder, Chris; Mazzilli, Maria Cristina; Sood, Ajit; Cukrowska, Bozena; Núñez, Concepción; Pratesi, Riccardo; Withoff, Sebo; Wijmenga, Cisca
2014-05-01
Using the Immunochip for genotyping, we identified 39 non-human leukocyte antigen (non-HLA) loci associated with celiac disease (CeD), an immune-mediated disease with a worldwide frequency of ∼1%. The most significant non-HLA signal mapped to the intronic region of 70 kb in the LPP gene. Our aim was to fine map and identify possible functional variants in the LPP locus. We performed a meta-analysis in a cohort of 25 169 individuals from six different populations previously genotyped using Immunochip. Imputation using data from the Genome of the Netherlands and 1000 Genomes projects, followed by meta-analysis, confirmed the strong association signal on the LPP locus (rs2030519, P = 1.79 × 10^-49), without any novel associations. The conditional analysis on this top SNP indicated association with a single common haplotype. By performing haplotype analyses in each population separately, as well as in a combined group of the four populations that reach the significance threshold after correction (P < 0.008), we narrowed down the CeD-associated region from 70 to 2.8 kb (P = 1.35 × 10^-44). By intersecting regulatory data from the ENCODE project, we found a functional SNP, rs4686484 (P = 3.12 × 10^-49), that maps to several B-cell enhancer elements and a highly conserved region. This SNP was also predicted to change the binding motif of the transcription factors IRF4, IRF11, Nkx2.7 and Nkx2.9, suggesting its role in transcriptional regulation. We later found significantly lower levels of LPP mRNA in CeD biopsies compared with controls, thus our results suggest that rs4686484 is the functional variant in this locus, while LPP expression is decreased in CeD.
Genetic Locus for Streptolysin S Production by Group A Streptococcus
Nizet, Victor; Beall, Bernard; Bast, Darrin J.; Datta, Vivekananda; Kilburn, Laurie; Low, Donald E.; De Azavedo, Joyce C. S.
2000-01-01
Group A streptococcus (GAS) is an important human pathogen that causes pharyngitis and invasive infections, including necrotizing fasciitis. Streptolysin S (SLS) is the cytolytic factor that creates the zone of beta-hemolysis surrounding GAS colonies grown on blood agar. We recently reported the discovery of a potential genetic determinant involved in SLS production, sagA, encoding a small peptide of 53 amino acids (S. D. Betschel, S. M. Borgia, N. L. Barg, D. E. Low, and J. C. De Azavedo, Infect. Immun. 66:1671–1679, 1998). Using transposon mutagenesis, chromosomal walking steps, and data from the GAS genome sequencing project (www.genome.ou.edu/strep.html), we have now identified a contiguous nine-gene locus (sagA to sagI) involved in SLS production. The sag locus is conserved among GAS strains regardless of M protein type. Targeted plasmid integrational mutagenesis of each gene in the sag operon resulted in an SLS-negative phenotype. Targeted integrations (i) upstream of the sagA promoter and (ii) downstream of a terminator sequence after sagI did not affect SLS production, establishing the functional boundaries of the operon. A rho-independent terminator sequence between sagA and sagB appears to regulate the amount of sagA transcript produced versus transcript for the entire operon. Reintroduction of the nine-gene sag locus on a plasmid vector restored SLS activity to the nonhemolytic sagA knockout mutant. Finally, heterologous expression of the intact sag operon conferred the SLS beta-hemolytic phenotype to the nonhemolytic Lactococcus lactis. We conclude that gene products of the GAS sag operon are both necessary and sufficient for SLS production. Sequence homologies of sag operon gene products suggest that SLS is related to the bacteriocin family of microbial toxins. PMID:10858242
Covariate analysis of late-onset Alzheimer disease refines the chromosome 12 locus.
Liang, X; Schnetz-Boutaud, N; Kenealy, S J; Jiang, L; Bartlett, J; Lynch, B; Gaskell, P C; Gwirtsman, H; McFarland, L; Bembe, M L; Bronson, P; Gilbert, J R; Martin, E R; Pericak-Vance, M A; Haines, J L
2006-03-01
Alzheimer disease (AD) is a progressive neurodegenerative disorder of later life with a complex etiology and a strong genetic component. Several genomic screens have suggested that a region between chromosome 12p13 and 12q22 contains at least one additional locus underlying the susceptibility of AD. However, localization of this locus has been difficult. We performed a 5 cM microsatellite marker screen across 74 cM on chromosome 12 with 15 markers in 585 multiplex families consisting of 994 affected sibpairs and 213 other affected relative pairs. Analyses across the entire data set did not reveal significant evidence of linkage. However, suggestive linkage was observed in several subsets. In the 91 families where no affected individuals carry an ApoE ε4 allele, an HLOD score of 1.55 was generated at D12S1042. We further examined the linkage data considering the proposed linkages to chromosome 9 (D9S741) and chromosome 10 (alpha-catenin gene). There was a modest (P=0.20) increase in the LOD score for D12S368 (MLOD=1.70) when using the D9S741 LOD scores as a covariate and a highly significant (P<0.001) increase in the MLOD score (4.19) for D12S1701 in autopsy-confirmed families (n=228) when using alpha-catenin LOD scores as a covariate. In both cases, families with no evidence of linkage to D9S741 or alpha-catenin demonstrated most of the evidence of linkage to chromosome 12, suggesting locus heterogeneity. Taken together, our data suggest that the 16 cM region between D12S1042 and D12S368 should be the subject of further detailed genomic efforts for the disease.
Literature-based gene curation and proposed genetic nomenclature for cryptococcus.
Inglis, Diane O; Skrzypek, Marek S; Liaw, Edward; Moktali, Venkatesh; Sherlock, Gavin; Stajich, Jason E
2014-07-01
Cryptococcus, a major cause of disseminated infections in immunocompromised patients, kills over 600,000 people per year worldwide. Genes involved in the virulence of the meningitis-causing fungus are being characterized at an increasing rate, and to date, at least 648 Cryptococcus gene names have been published. However, these data are scattered throughout the literature and are challenging to find. Furthermore, conflicts in locus identification exist, so that named genes have been subsequently published under new names or names associated with one locus have been used for another locus. To avoid these conflicts and to provide a central source of Cryptococcus gene information, we have collected all published Cryptococcus gene names from the scientific literature and associated them with standard Cryptococcus locus identifiers and have incorporated them into FungiDB (www.fungidb.org). FungiDB is a panfungal genome database that collects gene information and functional data and provides search tools for 61 species of fungi and oomycetes. We applied these published names to a manually curated ortholog set of all Cryptococcus species currently in FungiDB, including Cryptococcus neoformans var. neoformans strains JEC21 and B-3501A, C. neoformans var. grubii strain H99, and Cryptococcus gattii strains R265 and WM276, and have written brief descriptions of their functions. We also compiled a protocol for gene naming that summarizes guidelines proposed by members of the Cryptococcus research community. The centralization of genomic and literature-based information for Cryptococcus at FungiDB will help researchers communicate about genes of interest, such as those related to virulence, and will further facilitate research on the pathogen. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Ram, R; Wakil, S M; Muiya, N P; Andres, E; Mazhar, N; Hagos, S; Alshahid, M; Meyer, B F; Morahan, G; Dzimiri, N
2017-03-01
Hypertriglyceridemia (hTG) is a lipid disorder, resulting from an elevation in triglyceride levels, with a strong genetic component. It constitutes a significant risk factor for coronary artery disease (CAD), a leading cause of death worldwide. In this study, we performed a common variant association study for hTG in ethnic Saudi Arabs. We genotyped 5501 individuals in a two-phase experiment using Affymetrix Axiom® Genome-Wide CEU 1 Array (Affymetrix, Santa Cruz, CA) that contains a total of 587,352 single nucleotide polymorphisms (SNPs). The lead variant was the rs1558861 [1.99 (1.73-2.30); p = 7.37 × 10^-22], residing on chromosome (chr) 11 at the apolipoprotein A-I/A-5 (APOA1/APOA5) locus. The rs780094 [1.34 (1.21-1.49); p = 8.57 × 10^-8] on chr 2 at the glucokinase regulatory protein (GCKR) locus was similarly significantly associated, while the rs10911205 [1.29 (1.16-1.44); p = 3.52 × 10^-6] on chr 1 at the laminin subunit gamma-1 (LAMC1) locus showed suggestive association with disease. Furthermore, the rs17145738 [0.68 (0.60-0.77); p = 6.69 × 10^-9] on chr 7 at the carbohydrate-responsive element-binding protein-encoding (MLXIPL) gene locus displayed significant protective characteristics, while another variant rs6982502 [0.76 (0.68-0.84); p = 5.31 × 10^-7] on chr 8 showed similar but weaker properties. These findings were replicated in 317 cases vs 1415 controls from the same ethnic Arab population. Our study identified several variants across the human genome that are associated with hTG in ethnic Arabs. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
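As a reading aid for the bracketed values above, the sketch below recovers an approximate standard error, Wald z statistic, and two-sided p-value from an effect estimate and its interval. It assumes the bracketed numbers are an odds ratio with a 95% confidence interval, which the abstract does not state explicitly, and the function name `wald_from_ci` is hypothetical. Because the inputs are rounded and the study's actual test may differ, the recomputed p-value is only expected to be of a similar order of magnitude to the reported one.

```python
import math

def wald_from_ci(or_est, lower, upper, z_crit=1.959964):
    """Approximate SE, Wald z and two-sided p-value from an OR and its CI.

    Assumes a symmetric interval on the log-odds scale at the confidence
    level implied by z_crit (95% by default); both are assumptions of this
    sketch, not statements from the abstract.
    """
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(or_est) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return se, z, p

# Lead variant rs1558861: 1.99 (1.73-2.30)
se_lead, z_lead, p_lead = wald_from_ci(1.99, 1.73, 2.30)

# Protective variant rs17145738: 0.68 (0.60-0.77); z is negative for OR < 1
se_prot, z_prot, p_prot = wald_from_ci(0.68, 0.60, 0.77)
```

An estimate below 1 yields a negative z, which matches the abstract's description of rs17145738 as protective.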
Li, Ying; Wang, Jiaxing; Allingham, R. Rand; Hauser, Michael A.; Wiggs, Janey L.; Geisert, Eldon E.
2018-01-01
Central corneal thickness (CCT) is one of the most heritable ocular traits and it is also a phenotypic risk factor for primary open angle glaucoma (POAG). The present study uses the BXD Recombinant Inbred (RI) strains to identify novel quantitative trait loci (QTLs) modulating CCT in the mouse with the potential of identifying a molecular link between CCT and risk of developing POAG. The BXD RI strain set was used to define mammalian genomic loci modulating CCT, with a total of 818 corneas measured from 61 BXD RI strains (between 60–100 days of age). The mice were anesthetized and the eyes were positioned in front of the lens of the Phoenix Micron IV Image-Guided OCT system or the Bioptigen OCT system. CCT data for each strain were averaged and used to identify QTLs modulating this phenotype using the bioinformatics tools on GeneNetwork (www.genenetwork.org). The candidate genes and genomic loci identified in the mouse were then directly compared with the summary data from a human POAG genome-wide association study (NEIGHBORHOOD) to determine if any genomic elements modulating mouse CCT are also risk factors for POAG. This analysis revealed one significant QTL on Chr 13 and a suggestive QTL on Chr 7. The significant locus on Chr 13 (13 to 19 Mb) was examined further to define candidate genes modulating this eye phenotype. For the Chr 13 QTL in the mouse, only one gene in the region (Pou6f2) contained nonsynonymous SNPs. Of these five nonsynonymous SNPs in Pou6f2, two resulted in changes in the amino acid proline which could result in altered secondary structure affecting protein function. The 7 Mb region under the mouse Chr 13 peak distributes over 2 chromosomes in the human: Chr 1 and Chr 7. These genomic loci were examined in the NEIGHBORHOOD database to determine if they are potential risk factors for human glaucoma identified using meta-data from human GWAS.
The top 50 hits all resided within one gene (POU6F2), with the highest significance level of p = 10^-6 for SNP rs76319873. POU6F2 is found in retinal ganglion cells and in corneal limbal stem cells. To test the effect of POU6F2 on CCT we examined the corneas of Pou6f2-null mice, and the corneas were thinner than those of wild-type littermates. In addition, these POU6F2 RGCs die earlier in the DBA/2J model of glaucoma than most RGCs. Using a mouse genetic reference panel, we identified a transcription factor, Pou6f2, that modulates CCT in the mouse. POU6F2 is also found in a subset of retinal ganglion cells and these RGCs are sensitive to injury. PMID:29370175
Quantifying Genome Editing Outcomes at Endogenous Loci using SMRT Sequencing
Clark, Joseph; Punjya, Niraj; Sebastiano, Vittorio; Bao, Gang; Porteus, Matthew H
2014-01-01
SUMMARY Targeted genome editing with engineered nucleases has transformed the ability to introduce precise sequence modifications at almost any site within the genome. A major obstacle to probing the efficiency and consequences of genome editing is that no existing method enables the frequency of different editing events to be simultaneously measured across a cell population at any endogenous genomic locus. We have developed a novel method for quantifying individual genome editing outcomes at any site of interest using single molecule real time (SMRT) DNA sequencing. We show that this approach can be applied at various loci, using multiple engineered nuclease platforms including TALENs, RNA guided endonucleases (CRISPR/Cas9), and ZFNs, and in different cell lines to identify conditions and strategies in which the desired engineering outcome has occurred. This approach facilitates the evaluation of new gene editing technologies and permits sensitive quantification of editing outcomes in almost every experimental system used. PMID:24685129
Genetic mapping of the female mimic morph locus in the ruff
2013-01-01
Background: Ruffs (Aves: Philomachus pugnax) possess a genetic polymorphism for male mating behaviour resulting in three permanent alternative male reproductive morphs: (i) territorial ‘Independents’, (ii) non-territorial ‘Satellites’, and (iii) female-mimicking ‘Faeders’. Development into independent or satellite morphs has previously been shown to be due to a single-locus, two-allele autosomal Mendelian mode of inheritance at the Satellite locus. Here, we use linkage analysis to map the chromosomal location of the Faeder locus, which controls development into the Faeder morph, and draw further conclusions about candidate genes, assuming shared synteny with other birds. Results: Segregation data on the Faeder locus were obtained from captive-bred pedigrees comprising 64 multi-generation families (N = 381). There was no evidence that the Faeder locus was linked to the Satellite locus, but it was linked with microsatellite marker Ppu020. Comparative mapping of ruff microsatellite markers against the chicken (Gallus gallus) and zebra finch (Taeniopygia guttata) genomes places the Ppu020 and Faeder loci on a region of chromosome 11 that includes the Melanocortin-1 receptor (MC1R) gene, which regulates colour polymorphisms in numerous birds and other vertebrates. Melanin-based colouration varies with life-history strategies in ruffs and other species, thus the MC1R gene is a strong candidate to play a role in alternative male morph determination. Conclusion: Two unlinked loci appear to control behavioural development in ruffs. The Faeder locus is linked to Ppu020, which, assuming synteny, is located on avian chromosome 11. MC1R is a candidate gene involved in alternative male morph determination in ruffs. PMID:24256185
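The linkage test mentioned above is commonly summarized by a LOD score. The sketch below shows the textbook two-point LOD for phase-known meioses, LOD(θ) = log10[θ^r (1 − θ)^(n − r) / 0.5^n], where r is the number of recombinants among n informative meioses. It is an illustration under stated assumptions only: the abstract does not report the method or the counts, so the function names and the numbers used here are hypothetical.

```python
import math

def two_point_lod(recombinants, meioses, theta):
    """Two-point LOD score for phase-known meioses at recombination fraction theta.

    Compares the likelihood of the data at theta against free recombination
    (theta = 0.5). Requires 0 < theta <= 0.5.
    """
    if not 0 < theta <= 0.5:
        raise ValueError("theta must be in (0, 0.5]")
    non_recombinants = meioses - recombinants
    return (recombinants * math.log10(theta)
            + non_recombinants * math.log10(1 - theta)
            + meioses * math.log10(2))

def max_lod(recombinants, meioses):
    """Grid search over theta = 0.01 ... 0.50 for the maximum LOD score."""
    grid = [i / 100 for i in range(1, 51)]
    best_theta = max(grid, key=lambda t: two_point_lod(recombinants, meioses, t))
    return best_theta, two_point_lod(recombinants, meioses, best_theta)

# Hypothetical counts (not from the study): 2 recombinants in 20 meioses.
best_theta, best_lod = max_lod(2, 20)
```

By convention a maximum LOD of 3 or more is usually taken as evidence of linkage, and the score is zero at θ = 0.5 by construction.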
Jones, Stephen L; Shah, Priti Pradhan
2016-03-01
Extant trust research champions 3 different centers of action that determine perceptions of trust: the trustor (the individual rendering trust judgments), the trustee (the party being trusted), and the trustor-trustee dyad. We refer to the centers of action as loci of trust. Thus far, researchers have investigated determinants residing within each locus independently but have not concurrently investigated all 3 loci. Thus, the relative influence of each locus on perceptions of trust is unknown. Nor is it known how the influence of each locus changes with time. Where is the dominant locus of trust? And how does it change over time? We address these questions by examining the influence of trustors, trustees, and dyads on perceived ability, benevolence, and integrity. We find that trustor influence decreases over time while trustee and dyadic influences increase. We also find that the trustor is the dominant locus for perceived ability, benevolence, and integrity initially, but over time the trustee becomes the dominant locus for perceived ability and integrity. For perceived benevolence, the trustor remains the dominant driver over time. (c) 2016 APA, all rights reserved.