Ojamies, P N; Kontro, M; Edgren, H; Ellonen, P; Lagström, S; Almusa, H; Miettinen, T; Eldfors, S; Tamborero, D; Wennerberg, K; Heckman, C; Porkka, K; Wolf, M; Kallioniemi, O
2017-05-01
In our individualized systems medicine program, personalized treatment options are identified and administered to chemorefractory acute myeloid leukemia (AML) patients based on exome sequencing and ex vivo drug sensitivity and resistance testing data. Here, we analyzed how clonal heterogeneity affects the responses of 13 AML patients to chemotherapy or targeted treatments using ultra-deep (average 68 000 × coverage) amplicon resequencing. Using amplicon resequencing, we identified 16 variants from 4 patients (frequency 0.54-2%) that were not detected previously by exome sequencing. A correlation-based method was developed to detect mutation-specific responses in serial samples across multiple time points. Significant subclone-specific responses were observed for both chemotherapy and targeted therapy. We detected subclonal responses in patients where clinical European LeukemiaNet (ELN) criteria showed no response. Subclonal responses also helped to identify putative mechanisms underlying drug sensitivities, such as sensitivity to azacitidine in DNMT3A mutated cell clones and resistance to cytarabine in a subclone with loss of NF1 gene. In summary, ultra-deep amplicon resequencing method enables sensitive quantification of subclonal variants and their responses to therapies. This approach provides new opportunities for designing combinatorial therapies blocking multiple subclones as well as for real-time assessment of such treatments.
pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data.
Beyens, Matthias; Boeckx, Nele; Van Camp, Guy; Op de Beeck, Ken; Vandeweyer, Geert
2017-12-14
Haloplex targeted resequencing is a popular method to analyze both germline and somatic variants in gene panels. However, involved wet-lab procedures may introduce false positives that need to be considered in subsequent data-analysis. No variant filtering rationale addressing amplicon enrichment related systematic errors, in the form of an all-in-one package, exists to our knowledge. We present pyAmpli, a platform independent parallelized Python package that implements an amplicon-based germline and somatic variant filtering strategy for Haloplex data. pyAmpli can filter variants for systematic errors by user pre-defined criteria. We show that pyAmpli significantly increases specificity, without reducing sensitivity, essential for reporting true positive clinical relevant mutations in gene panel data. pyAmpli is an easy-to-use software tool which increases the true positive variant call rate in targeted resequencing data. It specifically reduces errors related to PCR-based enrichment of targeted regions.
Sulaiman, Irshad M.; Tang, Kevin; Osborne, John; Sammons, Scott; Wohlhueter, Robert M.
2007-01-01
We developed a set of seven resequencing GeneChips, based on the complete genome sequences of 24 strains of smallpox virus (variola virus), for rapid characterization of this human-pathogenic virus. Each GeneChip was designed to analyze a divergent segment of approximately 30,000 bases of the smallpox virus genome. This study includes the hybridization results of 14 smallpox virus strains. Of the 14 smallpox virus strains hybridized, only 7 had sequence information included in the design of the smallpox virus resequencing GeneChips; similar information for the remaining strains was not tiled as a reference in these GeneChips. By use of variola virus-specific primers and long-range PCR, 22 overlapping amplicons were amplified to cover nearly the complete genome and hybridized with the smallpox virus resequencing GeneChip set. These GeneChips were successful in generating nucleotide sequences for all 14 of the smallpox virus strains hybridized. Analysis of the data indicated that the GeneChip resequencing by hybridization was fast and reproducible and that the smallpox virus resequencing GeneChips could differentiate the 14 smallpox virus strains characterized. This study also suggests that high-density resequencing GeneChips have potential biodefense applications and may be used as an alternate tool for rapid identification of smallpox virus in the future. PMID:17182757
Xu, Xiaojing; Yang, Xiaoxu; Wu, Qixi; Liu, Aijie; Yang, Xiaoling; Ye, Adam Yongxin; Huang, August Yue; Li, Jiarui; Wang, Meng; Yu, Zhe; Wang, Sheng; Zhang, Zhichao; Wu, Xiru
2015-01-01
ABSTRACT The majority of children with Dravet syndrome (DS) are caused by de novo SCN1A mutations. To investigate the origin of the mutations, we developed and applied a new method that combined deep amplicon resequencing with a Bayesian model to detect and quantify allelic fractions with improved sensitivity. Of 174 SCN1A mutations in DS probands which were considered “de novo” by Sanger sequencing, we identified 15 cases (8.6%) of parental mosaicism. We identified another five cases of parental mosaicism that were also detectable by Sanger sequencing. Fraction of mutant alleles in the 20 cases of parental mosaicism ranged from 1.1% to 32.6%. Thirteen (65% of 20) mutations originated paternally and seven (35% of 20) maternally. Twelve (60% of 20) mosaic parents did not have any epileptic symptoms. Their mutant allelic fractions were significantly lower than those in mosaic parents with epileptic symptoms (P = 0.016). We identified mosaicism with varied allelic fractions in blood, saliva, urine, hair follicle, oral epithelium, and semen, demonstrating that postzygotic mutations could affect multiple somatic cells as well as germ cells. Our results suggest that more sensitive tools for detecting low‐level mosaicism in parents of families with seemingly “de novo” mutations will allow for better informed genetic counseling. PMID:26096185
Ladas, Ioannis; Fitarelli-Kiehl, Mariana; Song, Chen; Adalsteinsson, Viktor A; Parsons, Heather A; Lin, Nancy U; Wagle, Nikhil; Makrigiorgos, G Mike
2017-10-01
The use of clinical samples and circulating cell-free DNA (cfDNA) collected from liquid biopsies for diagnostic and prognostic applications in cancer is burgeoning, and improved methods that reduce the influence of excess wild-type (WT) portion of the sample are desirable. Here we present enrichment of mutation-containing sequences using enzymatic degradation of WT DNA. Mutation enrichment is combined with high-resolution melting (HRM) performed in multiplexed closed-tube reactions as a rapid, cost-effective screening tool before targeted resequencing. We developed a homogeneous, closed-tube approach to use a double-stranded DNA-specific nuclease for degradation of WT DNA at multiple targets simultaneously. The No Denaturation Nuclease-assisted Minor Allele Enrichment with Probe Overlap (ND-NaME-PrO) uses WT oligonucleotides overlapping both strands on putative DNA targets. Under conditions of partial denaturation (DNA breathing), the oligonucleotide probes enhance double-stranded DNA-specific nuclease digestion at the selected targets, with high preference toward WT over mutant DNA. To validate ND-NaME-PrO, we used multiplexed HRM, digital PCR, and MiSeq targeted resequencing of mutated genomic DNA and cfDNA. Serial dilution of KRAS mutation-containing DNA shows mutation enrichment by 10- to 120-fold and detection of allelic fractions down to 0.01%. Multiplexed ND-NaME-PrO combined with multiplexed PCR-HRM showed mutation scanning of 10-20 DNA amplicons simultaneously. ND-NaME-PrO applied on cfDNA from clinical samples enables mutation enrichment and HRM scanning over 10 DNA targets. cfDNA mutations were enriched up to approximately 100-fold (average approximately 25-fold) and identified via targeted resequencing. Closed-tube homogeneous ND-NaME-PrO combined with multiplexed HRM is a convenient approach to efficiently enrich for mutations on multiple DNA targets and to enable prescreening before targeted resequencing. © 2017 American Association for Clinical Chemistry.
Goossens, Dirk; Moens, Lotte N; Nelis, Eva; Lenaerts, An-Sofie; Glassee, Wim; Kalbe, Andreas; Frey, Bruno; Kopal, Guido; De Jonghe, Peter; De Rijk, Peter; Del-Favero, Jurgen
2009-03-01
We evaluated multiplex PCR amplification as a front-end for high-throughput sequencing, to widen the applicability of massive parallel sequencers for the detailed analysis of complex genomes. Using multiplex PCR reactions, we sequenced the complete coding regions of seven genes implicated in peripheral neuropathies in 40 individuals on a GS-FLX genome sequencer (Roche). The resulting dataset showed highly specific and uniform amplification. Comparison of the GS-FLX sequencing data with the dataset generated by Sanger sequencing confirmed the detection of all variants present and proved the sensitivity of the method for mutation detection. In addition, we showed that we could exploit the multiplexed PCR amplicons to determine individual copy number variation (CNV), increasing the spectrum of detected variations to both genetic and genomic variants. We conclude that our straightforward procedure substantially expands the applicability of the massive parallel sequencers for sequencing projects of a moderate number of amplicons (50-500) with typical applications in resequencing exons in positional or functional candidate regions and molecular genetic diagnostics. 2008 Wiley-Liss, Inc.
Kotoula, Vassiliki; Lyberopoulou, Aggeliki; Papadopoulou, Kyriaki; Charalambous, Elpida; Alexopoulou, Zoi; Gakou, Chryssa; Lakis, Sotiris; Tsolaki, Eleftheria; Lilakos, Konstantinos; Fountzilas, George
2015-01-01
Background—Aim Massively parallel sequencing (MPS) holds promise for expanding cancer translational research and diagnostics. As yet, it has been applied on paraffin DNA (FFPE) with commercially available highly multiplexed gene panels (100s of DNA targets), while custom panels of low multiplexing are used for re-sequencing. Here, we evaluated the performance of two highly multiplexed custom panels on FFPE DNA. Methods Two custom multiplex amplification panels (B, 373 amplicons; T, 286 amplicons) were coupled with semiconductor sequencing on DNA samples from FFPE breast tumors and matched peripheral blood samples (n samples: 316; n libraries: 332). The two panels shared 37% DNA targets (common or shifted amplicons). Panel performance was evaluated in paired sample groups and quartets of libraries, where possible. Results Amplicon read ratios yielded similar patterns per gene with the same panel in FFPE and blood samples; however, performance of common amplicons differed between panels (p<0.001). FFPE genotypes were compared for 1267 coding and non-coding variant replicates, 999 out of which (78.8%) were concordant in different paired sample combinations. Variant frequency was highly reproducible (Spearman’s rho 0.959). Repeatedly discordant variants were of high coverage / low frequency (p<0.001). Genotype concordance was (a) high, for intra-run duplicates with the same panel (mean±SD: 97.2±4.7, 95%CI: 94.8–99.7, p<0.001); (b) modest, when the same DNA was analyzed with different panels (mean±SD: 81.1±20.3, 95%CI: 66.1–95.1, p = 0.004); and (c) low, when different DNA samples from the same tumor were compared with the same panel (mean±SD: 59.9±24.0; 95%CI: 43.3–76.5; p = 0.282). Low coverage / low frequency variants were validated with Sanger sequencing even in samples with unfavourable DNA quality. Conclusions Custom MPS may yield novel information on genomic alterations, provided that data evaluation is adjusted to tumor tissue FFPE DNA. To this scope, eligibility of all amplicons along with variant coverage and frequency need to be assessed. PMID:26039550
de Muinck, Eric J; Trosvik, Pål; Gilfillan, Gregor D; Hov, Johannes R; Sundaram, Arvind Y M
2017-07-06
Advances in sequencing technologies and bioinformatics have made the analysis of microbial communities almost routine. Nonetheless, the need remains to improve on the techniques used for gathering such data, including increasing throughput while lowering cost and benchmarking the techniques so that potential sources of bias can be better characterized. We present a triple-index amplicon sequencing strategy to sequence large numbers of samples at significantly lower c ost and in a shorter timeframe compared to existing methods. The design employs a two-stage PCR protocol, incorpo rating three barcodes to each sample, with the possibility to add a fourth-index. It also includes heterogeneity spacers to overcome low complexity issues faced when sequencing amplicons on Illumina platforms. The library preparation method was extensively benchmarked through analysis of a mock community in order to assess biases introduced by sample indexing, number of PCR cycles, and template concentration. We further evaluated the method through re-sequencing of a standardized environmental sample. Finally, we evaluated our protocol on a set of fecal samples from a small cohort of healthy adults, demonstrating good performance in a realistic experimental setting. Between-sample variation was mainly related to batch effects, such as DNA extraction, while sample indexing was also a significant source of bias. PCR cycle number strongly influenced chimera formation and affected relative abundance estimates of species with high GC content. Libraries were sequenced using the Illumina HiSeq and MiSeq platforms to demonstrate that this protocol is highly scalable to sequence thousands of samples at a very low cost. Here, we provide the most comprehensive study of performance and bias inherent to a 16S rRNA gene amplicon sequencing method to date. Triple-indexing greatly reduces the number of long custom DNA oligos required for library preparation, while the inclusion of variable length heterogeneity spacers minimizes the need for PhiX spike-in. This design results in a significant cost reduction of highly multiplexed amplicon sequencing. The biases we characterize highlight the need for highly standardized protocols. Reassuringly, we find that the biological signal is a far stronger structuring factor than the various sources of bias.
Ion Torrent sequencing as a tool for mutation discovery in the flax (Linum usitatissimum L.) genome.
Galindo-González, Leonardo; Pinzón-Latorre, David; Bergen, Erik A; Jensen, Dustin C; Deyholos, Michael K
2015-01-01
Detection of induced mutations is valuable for inferring gene function and for developing novel germplasm for crop improvement. Many reverse genetics approaches have been developed to identify mutations in genes of interest within a mutagenized population, including some approaches that rely on next-generation sequencing (e.g. exome capture, whole genome resequencing). As an alternative to these genome or exome-scale methods, we sought to develop a scalable and efficient method for detection of induced mutations that could be applied to a small number of target genes, using Ion Torrent technology. We developed this method in flax (Linum usitatissimum), to demonstrate its utility in a crop species. We used an amplicon-based approach in which DNA samples from an ethyl methanesulfonate (EMS)-mutagenized population were pooled and used as template in PCR reactions to amplify a region of each gene of interest. Barcodes were incorporated during PCR, and the pooled amplicons were sequenced using an Ion Torrent PGM. A pilot experiment with known SNPs showed that they could be detected at a frequency > 0.3% within the pools. We then selected eight genes for which we wanted to discover novel mutations, and applied our approach to screen 768 individuals from the EMS population, using either the Ion 314 or Ion 316 chips. Out of 29 potential mutations identified after processing the NGS reads, 16 mutations were confirmed using Sanger sequencing. The methodology presented here demonstrates the utility of Ion Torrent technology in detecting mutation variants in specific genome regions for large populations of a species such as flax. The methodology could be scaled-up to test >100 genes using the higher capacity chips now available from Ion Torrent.
Rao, Shitao; Leung, Cherry She Ting; Lam, Macro Hb; Wing, Yun Kwok; Waye, Mary Miu Yee; Tsui, Stephen Kwok Wing
2017-03-01
To date almost 200 genes were found to be associated with major depressive disorder (MDD) or suicide attempts (SA), but very few genes were reported for their molecular mechanisms. This study aimed to find out whether there were common or rare variants in three candidate genes altering the risk for MDD and SA in Chinese. Three candidate genes (HOMER1, SLC6A4 and TEF) were chosen for resequencing analysis and association studies as they were reported to be involved in the etiology of MDD and SA. Following that, bioinformatics analyses were applied on those variants of interest. After resequencing analysis and alignment for the amplicons, a total of 34 common or rare variants were found in the randomly selected 36 Hong Kong Chinese patients with both MDD and SA. Among those, seven variants show potentially deleterious features. Rs60029191 and a rare variant located in regulatory region of the HOMER1 gene may affect the promoter activities through interacting with predicted transcription factors. Two missense mutations existed in the SLC6A4 coding regions were firstly reported in Hong Kong Chinese MDD and SA patients, and both of them could affect the transport efficiency of SLC6A4 to serotonin. Moreover, a common variant rs6354 located in the untranslated region of this gene may affect the expression level or exonic splicing of serotonin transporter. In addition, both of a most studied polymorphism rs738499 and a low-frequency variant in the promoter region of the TEF gene were found to be located in potential transcription factor binding sites, which may let the two variants be able to influence the promoter activities of the gene. This study elucidated the potentially molecular mechanisms of the three candidate genes altering the risk for MDD and SA. These findings implied that not only common variants but rare variants could make contributions to the genetic susceptibility to MDD and SA in Chinese. Copyright © 2016 Elsevier B.V. All rights reserved.
Babben, Steve; Perovic, Dragan; Koch, Michael; Ordon, Frank
2015-01-01
Recent declines in costs accelerated sequencing of many species with large genomes, including hexaploid wheat (Triticum aestivum L.). Although the draft sequence of bread wheat is known, it is still one of the major challenges to developlocus specific primers suitable to be used in marker assisted selection procedures, due to the high homology of the three genomes. In this study we describe an efficient approach for the development of locus specific primers comprising four steps, i.e. (i) identification of genomic and coding sequences (CDS) of candidate genes, (ii) intron- and exon-structure reconstruction, (iii) identification of wheat A, B and D sub-genome sequences and primer development based on sequence differences between the three sub-genomes, and (iv); testing of primers for functionality, correct size and localisation. This approach was applied to single, low and high copy genes involved in frost tolerance in wheat. In summary for 27 of these genes for which sequences were derived from Triticum aestivum, Triticum monococcum and Hordeum vulgare, a set of 119 primer pairs was developed and after testing on Nulli-tetrasomic (NT) lines, a set of 65 primer pairs (54.6%), corresponding to 19 candidate genes, turned out to be specific. Out of these a set of 35 fragments was selected for validation via Sanger's amplicon re-sequencing. All fragments, with the exception of one, could be assigned to the original reference sequence. The approach presented here showed a much higher specificity in primer development in comparison to techniques used so far in bread wheat and can be applied to other polyploid species with a known draft sequence. PMID:26565976
He, W; Zhao, S; Liu, X; Dong, S; Lv, J; Liu, D; Wang, J; Meng, Z
2013-12-04
Large-scale next-generation sequencing (NGS)-based resequencing detects sequence variations, constructs evolutionary histories, and identifies phenotype-related genotypes. However, NGS-based resequencing studies generate extraordinarily large amounts of data, making computations difficult. Effective use and analysis of these data for NGS-based resequencing studies remains a difficult task for individual researchers. Here, we introduce ReSeqTools, a full-featured toolkit for NGS (Illumina sequencing)-based resequencing analysis, which processes raw data, interprets mapping results, and identifies and annotates sequence variations. ReSeqTools provides abundant scalable functions for routine resequencing analysis in different modules to facilitate customization of the analysis pipeline. ReSeqTools is designed to use compressed data files as input or output to save storage space and facilitates faster and more computationally efficient large-scale resequencing studies in a user-friendly manner. It offers abundant practical functions and generates useful statistics during the analysis pipeline, which significantly simplifies resequencing analysis. Its integrated algorithms and abundant sub-functions provide a solid foundation for special demands in resequencing projects. Users can combine these functions to construct their own pipelines for other purposes.
A fungal mock community control for amplicon sequencing experiments
USDA-ARS?s Scientific Manuscript database
The field of microbial ecology has been profoundly advanced by the ability to profile the composition of complex microbial communities by means of high throughput amplicon sequencing of marker genes amplified directly from environmental genomic DNA extracts. However, it has become increasingly clear...
Application of resequencing to rice genomics, functional genomics and evolutionary analysis
2014-01-01
Rice is a model system used for crop genomics studies. The completion of the rice genome draft sequences in 2002 not only accelerated functional genome studies, but also initiated a new era of resequencing rice genomes. Based on the reference genome in rice, next-generation sequencing (NGS) using the high-throughput sequencing system can efficiently accomplish whole genome resequencing of various genetic populations and diverse germplasm resources. Resequencing technology has been effectively utilized in evolutionary analysis, rice genomics and functional genomics studies. This technique is beneficial for both bridging the knowledge gap between genotype and phenotype and facilitating molecular breeding via gene design in rice. Here, we also discuss the limitation, application and future prospects of rice resequencing. PMID:25006357
de la Harpe, Marylaure; Paris, Margot; Karger, Dirk N; Rolland, Jonathan; Kessler, Michael; Salamin, Nicolas; Lexer, Christian
2017-05-01
Understanding the drivers and limits of species radiations is a crucial goal of evolutionary genetics and molecular ecology, yet research on this topic has been hampered by the notorious difficulty of connecting micro- and macroevolutionary approaches to studying the drivers of diversification. To chart the current research gaps, opportunities and challenges of molecular ecology approaches to studying radiations, we examine the literature in the journal Molecular Ecology and revisit recent high-profile examples of evolutionary genomic research on radiations. We find that available studies of radiations are highly unevenly distributed among taxa, with many ecologically important and species-rich organismal groups remaining severely understudied, including arthropods, plants and fungi. Most studies employed molecular methods suitable over either short or long evolutionary time scales, such as microsatellites or restriction site-associated DNA sequencing (RAD-seq) in the former case and conventional amplicon sequencing of organellar DNA in the latter. The potential of molecular ecology studies to address and resolve patterns and processes around the species level in radiating groups of taxa is currently limited primarily by sample size and a dearth of information on radiating nuclear genomes as opposed to organellar ones. Based on our literature survey and personal experience, we suggest possible ways forward in the coming years. We touch on the potential and current limitations of whole-genome sequencing (WGS) in studies of radiations. We suggest that WGS and targeted ('capture') resequencing emerge as the methods of choice for scaling up the sampling of populations, species and genomes, including currently understudied organismal groups and the genes or regulatory elements expected to matter most to species radiations. © 2017 John Wiley & Sons Ltd.
Tuononen, Katja; Sarhadi, Virinder Kaur; Wirtanen, Aino; Rönty, Mikko; Salmenkivi, Kaisa; Knuuttila, Aija; Remes, Satu; Telaranta-Keerie, Aino I; Bloor, Stuart; Ellonen, Pekka; Knuutila, Sakari
2013-01-01
Anaplastic lymphoma receptor tyrosine kinase (ALK) gene rearrangements occur in a subgroup of non-small cell lung carcinomas (NSCLCs). The identification of these rearrangements is important for guiding treatment decisions. The aim of our study was to screen ALK gene fusions in NSCLCs and to compare the results detected by targeted resequencing with results detected by commonly used methods, including fluorescence in situ hybridization (FISH), immunohistochemistry (IHC), and real-time reverse transcription-PCR (RT-PCR). Furthermore, we aimed to ascertain the potential of targeted resequencing in detection of ALK-rearranged lung carcinomas. We assessed ALK fusion status for 95 formalin-fixed paraffin-embedded tumor tissue specimens from 87 patients with NSCLC by FISH and real-time RT-PCR, for 57 specimens from 56 patients by targeted resequencing, and for 14 specimens from 14 patients by IHC. All methods were performed successfully on formalin-fixed paraffin-embedded tumor tissue material. We detected ALK fusion in 5.7% (5 out of 87) of patients examined. The results obtained from resequencing correlated significantly with those from FISH, real-time RT-PCR, and IHC. Targeted resequencing proved to be a promising method for ALK gene fusion detection in NSCLC. Means to reduce the material and turnaround time required for analysis are, however, needed.
Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David
2018-04-11
Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.
PrimerSuite: A High-Throughput Web-Based Primer Design Program for Multiplex Bisulfite PCR.
Lu, Jennifer; Johnston, Andrew; Berichon, Philippe; Ru, Ke-Lin; Korbie, Darren; Trau, Matt
2017-01-24
The analysis of DNA methylation at CpG dinucleotides has become a major research focus due to its regulatory role in numerous biological processes, but the requisite need for assays which amplify bisulfite-converted DNA represents a major bottleneck due to the unique design constraints imposed on bisulfite-PCR primers. Moreover, a review of the literature indicated no available software solutions which accommodated both high-throughput primer design, support for multiplex amplification assays, and primer-dimer prediction. In response, the tri-modular software package PrimerSuite was developed to support bisulfite multiplex PCR applications. This software was constructed to (i) design bisulfite primers against multiple regions simultaneously (PrimerSuite), (ii) screen for primer-primer dimerizing artefacts (PrimerDimer), and (iii) support multiplex PCR assays (PrimerPlex). Moreover, a major focus in the development of this software package was the emphasis on extensive empirical validation, and over 1300 unique primer pairs have been successfully designed and screened, with over 94% of them producing amplicons of the expected size, and an average mapping efficiency of 93% when screened using bisulfite multiplex resequencing. The potential use of the software in other bisulfite-based applications such as methylation-specific PCR is under consideration for future updates. This resource is freely available for use at PrimerSuite website (www.primer-suite.com).
Novák, Karel; Pikousová, Jitka; Czerneková, Vladimíra; Mátlová, Věra
2017-07-03
The allelic variants of immunity genes in historical breeds likely reflect local infection pressure and therefore represent a reservoir for breeding. Screening to determine the diversity of the Toll-like receptor gene TLR4 was conducted in two conserved cattle breeds: Czech Red and Czech Red Pied. High-throughput sequencing of pooled PCR amplicons using the PacBio platform revealed polymorphisms, which were subsequently confirmed via genotyping techniques. Eight SNPs found in coding and adjacent regions were grouped into 18 haplotypes, representing a significant portion of the known diversity in the global breed panel and presumably exceeding diversity in production populations. Notably, the ancient Czech Red breed appeared to possess greater haplotype diversity than the Czech Red Pied breed, a Simmental variant, although the haplotype frequencies might have been distorted by significant crossbreeding and bottlenecks in the history of Czech Red cattle. The differences in haplotype frequencies validated the phenotypic distinctness of the local breeds. Due to the availability of Czech Red Pied production herds, the effect of intensive breeding on TLR diversity can be evaluated in this model. The advantages of the Pacific Biosciences technology for the resequencing of long PCR fragments with subsequent direct phasing were independently validated.
Targeted Re-Sequencing Emulsion PCR Panel for Myopathies: Results in 94 Cases.
Punetha, Jaya; Kesari, Akanchha; Uapinyoying, Prech; Giri, Mamta; Clarke, Nigel F; Waddell, Leigh B; North, Kathryn N; Ghaoui, Roula; O'Grady, Gina L; Oates, Emily C; Sandaradura, Sarah A; Bönnemann, Carsten G; Donkervoort, Sandra; Plotz, Paul H; Smith, Edward C; Tesi-Rocha, Carolina; Bertorini, Tulio E; Tarnopolsky, Mark A; Reitter, Bernd; Hausmanowa-Petrusewicz, Irena; Hoffman, Eric P
2016-05-27
Molecular diagnostics in the genetic myopathies often requires testing of the largest and most complex transcript units in the human genome (DMD, TTN, NEB). Iteratively targeting single genes for sequencing has traditionally entailed high costs and long turnaround times. Exome sequencing has begun to supplant single targeted genes, but there are concerns regarding coverage and needed depth of the very large and complex genes that frequently cause myopathies. To evaluate efficiency of next-generation sequencing technologies to provide molecular diagnostics for patients with previously undiagnosed myopathies. We tested a targeted re-sequencing approach, using a 45 gene emulsion PCR myopathy panel, with subsequent sequencing on the Illumina platform in 94 undiagnosed patients. We compared the targeted re-sequencing approach to exome sequencing for 10 of these patients studied. We detected likely pathogenic mutations in 33 out of 94 patients with a molecular diagnostic rate of approximately 35%. The remaining patients showed variants of unknown significance (35/94 patients) or no mutations detected in the 45 genes tested (26/94 patients). Mutation detection rates for targeted re-sequencing vs. whole exome were similar in both methods; however exome sequencing showed better distribution of reads and fewer exon dropouts. Given that costs of highly parallel re-sequencing and whole exome sequencing are similar, and that exome sequencing now takes considerably less laboratory processing time than targeted re-sequencing, we recommend exome sequencing as the standard approach for molecular diagnostics of myopathies.
Targeted resequencing in peanuts using the fluidigm access array
USDA-ARS?s Scientific Manuscript database
The presence of homoeologous gene copies in allotetraploid peanut makes it challenging to select homologous SNPs differentiating two or more cultivars. An integrated approach of improved bioinformatics and targeted resequencing to select homologous SNPs in tetraploid peanut is needed. Raw transcrip...
Cheng, Feng; Wu, Jian; Cai, Chengcheng; Fu, Lixia; Liang, Jianli; Borm, Theo; Zhuang, Mu; Zhang, Yangyong; Zhang, Fenglan; Bonnema, Guusje; Wang, Xiaowu
2016-12-20
The closely related species Brassica rapa and B. oleracea encompass a wide range of vegetable, fodder and oil crops. The release of their reference genomes has facilitated resequencing collections of B. rapa and B. oleracea aiming to build their variome datasets. These data can be used to investigate the evolutionary relationships between and within the different species and the domestication of the crops, hereafter named morphotypes. These data can also be used in genetic studies aiming at the identification of genes that influence agronomic traits. We selected and resequenced 199 B. rapa and 119 B. oleracea accessions representing 12 and nine morphotypes, respectively. Based on these resequencing data, we obtained 2,249,473 and 3,852,169 high quality SNPs (single-nucleotide polymorphisms), as well as 303,617 and 417,004 InDels for the B. rapa and B. oleracea populations, respectively. The variome datasets of B. rapa and B. oleracea represent valuable resources to researchers working on evolution, domestication or breeding of Brassica vegetable crops.
Egawa, Jun; Watanabe, Yuichiro; Shibuya, Masako; Endo, Taro; Sugimoto, Atsunori; Igeta, Hirofumi; Nunokawa, Ayako; Inoue, Emiko; Someya, Toshiyuki
2015-03-01
The oxytocin receptor (OXTR) is implicated in the pathophysiology of autism spectrum disorder (ASD). A recent study found a rare non-synonymous OXTR gene variation, rs35062132 (R376G), associated with ASD in a Japanese population. In order to investigate the association between rare non-synonymous OXTR variations and ASD, we resequenced OXTR and performed association analysis with ASD in a Japanese population. We resequenced the OXTR coding region in 213 ASD patients. Rare non-synonymous OXTR variations detected by resequencing were genotyped in 213 patients and 667 controls. We detected three rare non-synonymous variations: rs35062132 (R376G/C), rs151257822 (G334D), and g.8809426G>T (R150S). However, there was no significant association between these rare non-synonymous variations and ASD. Our present study does not support the contribution of rare non-synonymous OXTR variations to ASD susceptibility in the Japanese population. © 2014 The Authors. Psychiatry and Clinical Neurosciences © 2014 Japanese Society of Psychiatry and Neurology.
Robertson, Laura S.; Cornman, Robert S.
2014-01-01
We developed genetic resources for two North American frogs, Lithobates clamitans and Pseudacris regilla, widespread native amphibians that are potential indicator species of environmental health. For both species, mRNA from multiple tissues was sequenced using 454 technology. De novo assemblies with Mira3 resulted in 50 238 contigs (N50 = 687 bp) and 48 213 contigs (N50 = 686 bp) for L. clamitans and P. regilla, respectively, after clustering with CD-Hit-EST and purging contigs below 200 bp. We performed BLASTX similarity searches against the Xenopus tropicalis proteome and, for predicted ORFs, HMMER similarity searches against the Pfam-A database. Because there is broad interest in amphibian immune factors, we manually annotated putative antimicrobial peptides. To identify conserved regions suitable for amplicon resequencing across a broad taxonomic range, we performed an additional assembly of public short-read transcriptome data derived from two species of the genus Rana and identified reciprocal best TBLASTX matches among all assemblies. Although P. regilla, a hylid frog, is substantially more diverged from the ranid species, we identified 56 genes that were sufficiently conserved to allow nondegenerate primer design with Primer3. In addition to providing a foundation for comparative genomics and quantitative gene expression analysis, our results enable quick development of nuclear sequence-based markers for phylogenetics or population genetics.
ERIC Educational Resources Information Center
Dwyer, Dave; Gruenwald, Mark; Stickles, Joe; Axtell, Mike
2018-01-01
Resequencing Calculus is a project that has reordered the typical delivery of Calculus material to better serve the needs of STEM majors. Funded twice by the National Science Foundation, this project has produced a three-semester textbook that has been piloted at numerous institutions, large and small, public and private. This paper describes the…
USDA-ARS?s Scientific Manuscript database
The next generation sequencing (NGS) technologies have opened a wealth of opportunities for plant breeding and genomics research, and changed the paradigms of marker detection, genotyping, and gene discovery. Abundant genomic resources have been generated using a whole genome resequencing (WGR) str...
De Franceschi, Paolo; Bianco, Luca; Cestaro, Alessandro; Dondini, Luca; Velasco, Riccardo
2018-06-01
Data obtained from Illumina resequencing of 63 apple cultivars were used to obtain full-length S-RNase sequences using a strategy based on both alignment and de novo assembly of reads. The reproductive biology of apple is regulated by the S-RNase-based gametophytic self-incompatibility system, that is genetically controlled by the single, multi-genic and multi-allelic S locus. Resequencing of apple cultivars provided a huge amount of genetic data, that can be aligned to the reference genome in order to characterize variation to a genome-wide level. However, this approach is not immediately adaptable to the S-locus, due to some peculiar features such as the high degree of polymorphism, lack of colinearity between haplotypes and extensive presence of repetitive elements. In this study we describe a dedicated procedure aimed at characterizing S-RNase alleles from resequenced cultivars. The S-genotype of 63 apple accessions is reported; the full length coding sequence was determined for the 25 S-RNase alleles present in the 63 resequenced cultivars; these included 10 previously incomplete sequences (S 5 , S 6a , S 6b , S 8 , S 11 , S 23 , S 39 , S 46 , S 50 and S 58 ). Moreover, sequence divergence clearly suggests that alleles S 6a and S 6b , proposed to be neutral variants of the same alleles, should be instead considered different specificities. The promoter sequences have also been analyzed, highlighting regions of homology conserved among all the alleles.
USDA-ARS?s Scientific Manuscript database
During ongoing proteomic analysis of the soybean (Glycine max (L.) Merr) germplasm collection, PI 603408 was identified as a landrace whose seeds lack accumulation of one of the major seed storage glycinin protein subunits. Whole genomic resequencing was used to identify a two-base deletion affectin...
USDA-ARS?s Scientific Manuscript database
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer ge...
USDA-ARS?s Scientific Manuscript database
A bacterial artificial chromosome (BAC) library and BAC-end sequences for Gossypium hirsutum L. have recently been developed. Here we report on genomic-based genome-wide SNP mining utilizing re-sequencing data with a BAC-end sequence reference for twelve G. hirsutum L. lines, one G. barbadense L. li...
Papadopoulou, Evanthia; Goodchild, Sarah A; Cleary, David W; Weller, Simon A; Gale, Nittaya; Stubberfield, Michael R; Brown, Tom; Bartlett, Philip N
2015-02-03
The development of sensors for the detection of pathogen-specific DNA, including relevant species/strain level discrimination, is critical in molecular diagnostics with major impacts in areas such as bioterrorism and food safety. Herein, we use electrochemically driven denaturation assays monitored by surface-enhanced Raman spectroscopy (SERS) to target single nucleotide polymorphisms (SNPs) that distinguish DNA amplicons generated from Yersinia pestis, the causative agent of plague, from the closely related species Y. pseudotuberculosis. Two assays targeting SNPs within the groEL and metH genes of these two species have been successfully designed. Polymerase chain reaction (PCR) was used to produce Texas Red labeled single-stranded DNA (ssDNA) amplicons of 262 and 251 bases for the groEL and metH targets, respectively. These amplicons were used in an unpurified form to hybridize to immobilized probes then subjected to electrochemically driven melting. In all cases electrochemically driven melting was able to discriminate between fully homologous DNA and that containing SNPs. The metH assay was particularly challenging due to the presence of only a single base mismatch in the middle of the 251 base long PCR amplicon. However, manipulation of assay conditions (conducting the electrochemical experiments at 10 °C) resulted in greater discrimination between the complementary and mismatched DNA. Replicate data were collected and analyzed for each duplex on different days, using different batches of PCR product and different sphere segment void (SSV) substrates. Despite the variability introduced by these differences, the assays are shown to be reliable and robust providing a new platform for strain discrimination using unpurified PCR samples.
A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic
Madsen, Bo Eskerod; Browning, Sharon R.
2009-01-01
Resequencing is an emerging tool for identification of rare disease-associated mutations. Rare mutations are difficult to tag with SNP genotyping, as genotyping studies are designed to detect common variants. However, studies have shown that genetic heterogeneity is a probable scenario for common diseases, in which multiple rare mutations together explain a large proportion of the genetic basis for the disease. Thus, we propose a weighted-sum method to jointly analyse a group of mutations in order to test for groupwise association with disease status. For example, such a group of mutations may result from resequencing a gene. We compare the proposed weighted-sum method to alternative methods and show that it is powerful for identifying disease-associated genes, both on simulated and Encode data. Using the weighted-sum method, a resequencing study can identify a disease-associated gene with an overall population attributable risk (PAR) of 2%, even when each individual mutation has much lower PAR, using 1,000 to 7,000 affected and unaffected individuals, depending on the underlying genetic model. This study thus demonstrates that resequencing studies can identify important genetic associations, provided that specialised analysis methods, such as the weighted-sum method, are used. PMID:19214210
Sulaiman, Irshad M; Sammons, Scott A; Wohlhueter, Robert M
2008-04-01
We recently developed a set of seven resequencing GeneChips for the rapid sequencing of Variola virus strains in the WHO Repository of the Centers for Disease Control and Prevention. In this study, we attempted to hybridize these GeneChips with some known non-Variola orthopoxvirus isolates, including monkeypox, cowpox, and vaccinia viruses, for rapid detection.
Yang, Huaan; Jian, Jianbo; Li, Xuan; Renshaw, Daniel; Clements, Jonathan; Sweetingham, Mark W; Tan, Cong; Li, Chengdao
2015-09-02
Molecular marker-assisted breeding provides an efficient tool to develop improved crop varieties. A major challenge for the broad application of markers in marker-assisted selection is that the marker phenotypes must match plant phenotypes in a wide range of breeding germplasm. In this study, we used the legume crop species Lupinus angustifolius (lupin) to demonstrate the utility of whole genome sequencing and re-sequencing on the development of diagnostic markers for molecular plant breeding. Nine lupin cultivars released in Australia from 1973 to 2007 were subjected to whole genome re-sequencing. The re-sequencing data together with the reference genome sequence data were used in marker development, which revealed 180,596 to 795,735 SNP markers from pairwise comparisons among the cultivars. A total of 207,887 markers were anchored on the lupin genetic linkage map. Marker mining obtained an average of 387 SNP markers and 87 InDel markers for each of the 24 genome sequence assembly scaffolds bearing markers linked to 11 genes of agronomic interest. Using the R gene PhtjR conferring resistance to phomopsis stem blight disease as a test case, we discovered 17 candidate diagnostic markers by genotyping and selecting markers on a genetic linkage map. A further 243 candidate diagnostic markers were discovered by marker mining on a scaffold bearing non-diagnostic markers linked to the PhtjR gene. Nine out from the ten tested candidate diagnostic markers were confirmed as truly diagnostic on a broad range of commercial cultivars. Markers developed using these strategies meet the requirements for broad application in molecular plant breeding. We demonstrated that low-cost genome sequencing and re-sequencing data were sufficient and very effective in the development of diagnostic markers for marker-assisted selection. The strategies used in this study may be applied to any trait or plant species. Whole genome sequencing and re-sequencing provides a powerful tool to overcome current limitations in molecular plant breeding, which will enable plant breeders to precisely pyramid favourable genes to develop super crop varieties to meet future food demands.
Wang, Yao; Cui, Yazhou; Zhou, Xiaoyan; Han, Jinxiang
2015-01-01
Objective Osteogenesis imperfecta (OI) is a rare inherited skeletal disease, characterized by bone fragility and low bone density. The mutations in this disorder have been widely reported to be on various exonal hotspots of the candidate genes, including COL1A1, COL1A2, CRTAP, LEPRE1, and FKBP10, thus creating a great demand for precise genetic tests. However, large genome sizes make the process daunting and the analyses, inefficient and expensive. Therefore, we aimed at developing a fast, accurate, efficient, and cheaper sequencing platform for OI diagnosis; and to this end, use of an advanced array-based technique was proposed. Method A CustomSeq Affymetrix Resequencing Array was established for high-throughput sequencing of five genes simultaneously. Genomic DNA extraction from 13 OI patients and 85 normal controls and amplification using long-range PCR (LR-PCR) were followed by DNA fragmentation and chip hybridization, according to standard Affymetrix protocols. Hybridization signals were determined using GeneChip Sequence Analysis Software (GSEQ). To examine the feasibility, the outcome from new resequencing approach was validated by conventional capillary sequencing method. Result Overall call rates using resequencing array was 96–98% and the agreement between microarray and capillary sequencing was 99.99%. 11 out of 13 OI patients with pathogenic mutations were successfully detected by the chip analysis without adjustment, and one mutation could also be identified using manual visual inspection. Conclusion A high-throughput resequencing array was developed that detects the disease-associated mutations in OI, providing a potential tool to facilitate large-scale genetic screening for OI patients. Through this method, a novel mutation was also found. PMID:25742658
Droege, Marcus; Hill, Brendon
2008-08-31
The Genome Sequencer FLX System (GS FLX), powered by 454 Sequencing, is a next-generation DNA sequencing technology featuring a unique mix of long reads, exceptional accuracy, and ultra-high throughput. It has been proven to be the most versatile of all currently available next-generation sequencing technologies, supporting many high-profile studies in over seven applications categories. GS FLX users have pursued innovative research in de novo sequencing, re-sequencing of whole genomes and target DNA regions, metagenomics, and RNA analysis. 454 Sequencing is a powerful tool for human genetics research, having recently re-sequenced the genome of an individual human, currently re-sequencing the complete human exome and targeted genomic regions using the NimbleGen sequence capture process, and detected low-frequency somatic mutations linked to cancer.
Abo, Ryan P; Ducar, Matthew; Garcia, Elizabeth P; Thorner, Aaron R; Rojas-Rudilla, Vanesa; Lin, Ling; Sholl, Lynette M; Hahn, William C; Meyerson, Matthew; Lindeman, Neal I; Van Hummelen, Paul; MacConaill, Laura E
2015-02-18
Genomic structural variation (SV), a common hallmark of cancer, has important predictive and therapeutic implications. However, accurately detecting SV using high-throughput sequencing data remains challenging, especially for 'targeted' resequencing efforts. This is critically important in the clinical setting where targeted resequencing is frequently being applied to rapidly assess clinically actionable mutations in tumor biopsies in a cost-effective manner. We present BreaKmer, a novel approach that uses a 'kmer' strategy to assemble misaligned sequence reads for predicting insertions, deletions, inversions, tandem duplications and translocations at base-pair resolution in targeted resequencing data. Variants are predicted by realigning an assembled consensus sequence created from sequence reads that were abnormally aligned to the reference genome. Using targeted resequencing data from tumor specimens with orthogonally validated SV, non-tumor samples and whole-genome sequencing data, BreaKmer had a 97.4% overall sensitivity for known events and predicted 17 positively validated, novel variants. Relative to four publically available algorithms, BreaKmer detected SV with increased sensitivity and limited calls in non-tumor samples, key features for variant analysis of tumor specimens in both the clinical and research settings. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
A fungal mock community control for amplicon sequencing experiments
USDA-ARS?s Scientific Manuscript database
Microbial ecology has been profoundly advanced by the ability to profile complex microbial communities by sequencing of marker genes amplified from environmental samples. However, inclusion of appropriate controls is vital to revealing the limitations and biases of this technique. “Mock community” s...
Cornelissen, Marion; Gall, Astrid; Vink, Monique; Zorgdrager, Fokla; Binter, Špela; Edwards, Stephanie; Jurriaans, Suzanne; Bakker, Margreet; Ong, Swee Hoe; Gras, Luuk; van Sighem, Ard; Bezemer, Daniela; de Wolf, Frank; Reiss, Peter; Kellam, Paul; Berkhout, Ben; Fraser, Christophe; van der Kuyl, Antoinette C
2017-07-15
The BEEHIVE (Bridging the Evolution and Epidemiology of HIV in Europe) project aims to analyse nearly-complete viral genomes from >3000 HIV-1 infected Europeans using high-throughput deep sequencing techniques to investigate the virus genetic contribution to virulence. Following the development of a computational pipeline, including a new de novo assembler for RNA virus genomes, to generate larger contiguous sequences (contigs) from the abundance of short sequence reads that characterise the data, another area that determines genome sequencing success is the quality and quantity of the input RNA. A pilot experiment with 125 patient plasma samples was performed to investigate the optimal method for isolation of HIV-1 viral RNA for long amplicon genome sequencing. Manual isolation with the QIAamp Viral RNA Mini Kit (Qiagen) was superior over robotically extracted RNA using either the QIAcube robotic system, the mSample Preparation Systems RNA kit with automated extraction by the m2000sp system (Abbott Molecular), or the MagNA Pure 96 System in combination with the MagNA Pure 96 Instrument (Roche Diagnostics). We scored amplification of a set of four HIV-1 amplicons of ∼1.9, 3.6, 3.0 and 3.5kb, and subsequent recovery of near-complete viral genomes. Subsequently, 616 BEEHIVE patient samples were analysed to determine factors that influence successful amplification of the genome in four overlapping amplicons using the QIAamp Viral RNA Kit for viral RNA isolation. Both low plasma viral load and high sample age (stored before 1999) negatively influenced the amplification of viral amplicons >3kb. A plasma viral load of >100,000 copies/ml resulted in successful amplification of all four amplicons for 86% of the samples, this value dropped to only 46% for samples with viral loads of <20,000 copies/ml. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Whole exome resequencing distinguishes cystic kidney diseases from phenocopies in renal ciliopathies
Gee, Heon Yung; Otto, Edgar A.; Hurd, Toby W.; Ashraf, Shazia; Chaki, Moumita; Cluckey, Andrew; Vega-Warner, Virginia; Saisawat, Pawaree; Diaz, Katrina A.; Fang, Humphrey; Kohl, Stefan; Allen, Susan J.; Airik, Rannar; Zhou, Weibin; Ramaswami, Gokul; Janssen, Sabine; Fu, Clementine; Innis, Jamie L.; Weber, Stefanie; Vester, Udo; Davis, Erica E.; Katsanis, Nicholas; Fathy, Hanan M.; Jeck, Nikola; Klaus, Gunther; Nayir, Ahmet; Rahim, Khawla A.; Attrach, Ibrahim Al; Hassoun, Ibrahim Al; Ozturk, Savas; Drozdz, Dorota; Helmchen, Udo; O’Toole, John F.; Attanasio, Massimo; Nürnberg, Gudrun; Nürnberg, Peter; Washburn, Joseph; MacDonald, James; James, Jeffrey W.; Levy, Shawn; Hildebrandt, Friedhelm
2013-01-01
Rare single-gene disorders cause chronic disease. However, half of the 6,000 recessive single gene causes of disease are still unknown. Because recessive disease genes can illuminate, at least in part, disease pathomechanism, their identification offers direct opportunities for improved clinical management and potentially treatment. Rare diseases comprise the majority of chronic kidney disease (CKD) in children but are notoriously difficult to diagnose. Whole exome resequencing facilitates identification of recessive disease genes. However, its utility is impeded by the large number of genetic variants detected. We here overcome this limitation by combining homozygosity mapping with whole exome resequencing in 10 sib pairs with a nephronophthisis-related ciliopathy, which represents the most frequent genetic cause of CKD in the first three decades of life. In 7 of 10 sib-ships with a histologic or ultrasonographic diagnosis of nephronophthisis-related ciliopathy we detect the causative gene. In six sib-ships we identify mutations of known nephronophthisis-related ciliopathy genes, while in two additional sib-ships we found mutations in the known CKD-causing genes SLC4A1 and AGXT as phenocopies of nephronophthisis-related ciliopathy. Thus whole exome resequencing establishes an efficient, non-invasive approach towards early detection and causation-based diagnosis of rare kidney diseases. This approach can be extended to other rare recessive disorders, thereby providing accurate diagnosis and facilitating the study of disease mechanisms. PMID:24257694
A targeted resequencing gene panel for focal epilepsy.
Hildebrand, Michael S; Myers, Candace T; Carvill, Gemma L; Regan, Brigid M; Damiano, John A; Mullen, Saul A; Newton, Mark R; Nair, Umesh; Gazina, Elena V; Milligan, Carol J; Reid, Christopher A; Petrou, Steven; Scheffer, Ingrid E; Berkovic, Samuel F; Mefford, Heather C
2016-04-26
We report development of a targeted resequencing gene panel for focal epilepsy, the most prevalent phenotypic group of the epilepsies. The targeted resequencing gene panel was designed using molecular inversion probe (MIP) capture technology and sequenced using massively parallel Illumina sequencing. We demonstrated proof of principle that mutations can be detected in 4 previously genotyped focal epilepsy cases. We searched for both germline and somatic mutations in 251 patients with unsolved sporadic or familial focal epilepsy and identified 11 novel or very rare missense variants in 5 different genes: CHRNA4, GRIN2B, KCNT1, PCDH19, and SCN1A. Of these, 2 were predicted to be pathogenic or likely pathogenic, explaining ∼0.8% of the cohort, and 8 were of uncertain significance based on available data. We have developed and validated a targeted resequencing panel for focal epilepsies, the most important clinical class of epilepsies, accounting for about 60% of all cases. Our application of MIP technology is an innovative approach that will be advantageous in the clinical setting because it is highly sensitive, efficient, and cost-effective for screening large patient cohorts. Our findings indicate that mutations in known genes likely explain only a small proportion of focal epilepsy cases. This is not surprising given the established clinical and genetic heterogeneity of these disorders and underscores the importance of further gene discovery studies in this complex syndrome. © 2016 American Academy of Neurology.
Gee, Heon Yung; Otto, Edgar A; Hurd, Toby W; Ashraf, Shazia; Chaki, Moumita; Cluckey, Andrew; Vega-Warner, Virginia; Saisawat, Pawaree; Diaz, Katrina A; Fang, Humphrey; Kohl, Stefan; Allen, Susan J; Airik, Rannar; Zhou, Weibin; Ramaswami, Gokul; Janssen, Sabine; Fu, Clementine; Innis, Jamie L; Weber, Stefanie; Vester, Udo; Davis, Erica E; Katsanis, Nicholas; Fathy, Hanan M; Jeck, Nikola; Klaus, Gunther; Nayir, Ahmet; Rahim, Khawla A; Al Attrach, Ibrahim; Al Hassoun, Ibrahim; Ozturk, Savas; Drozdz, Dorota; Helmchen, Udo; O'Toole, John F; Attanasio, Massimo; Lewis, Richard A; Nürnberg, Gudrun; Nürnberg, Peter; Washburn, Joseph; MacDonald, James; Innis, Jeffrey W; Levy, Shawn; Hildebrandt, Friedhelm
2014-04-01
Rare single-gene disorders cause chronic disease. However, half of the 6000 recessive single gene causes of disease are still unknown. Because recessive disease genes can illuminate, at least in part, disease pathomechanism, their identification offers direct opportunities for improved clinical management and potentially treatment. Rare diseases comprise the majority of chronic kidney disease (CKD) in children but are notoriously difficult to diagnose. Whole-exome resequencing facilitates identification of recessive disease genes. However, its utility is impeded by the large number of genetic variants detected. We here overcome this limitation by combining homozygosity mapping with whole-exome resequencing in 10 sib pairs with a nephronophthisis-related ciliopathy, which represents the most frequent genetic cause of CKD in the first three decades of life. In 7 of 10 sibships with a histologic or ultrasonographic diagnosis of nephronophthisis-related ciliopathy, we detect the causative gene. In six sibships, we identify mutations of known nephronophthisis-related ciliopathy genes, while in two additional sibships we found mutations in the known CKD-causing genes SLC4A1 and AGXT as phenocopies of nephronophthisis-related ciliopathy. Thus, whole-exome resequencing establishes an efficient, noninvasive approach towards early detection and causation-based diagnosis of rare kidney diseases. This approach can be extended to other rare recessive disorders, thereby providing accurate diagnosis and facilitating the study of disease mechanisms.
Swarm: robust and fast clustering method for amplicon-based studies.
Mahé, Frédéric; Rognes, Torbjørn; Quince, Christopher; de Vargas, Colomban; Dunthorn, Micah
2014-01-01
Popular de novo amplicon clustering methods suffer from two fundamental flaws: arbitrary global clustering thresholds, and input-order dependency induced by centroid selection. Swarm was developed to address these issues by first clustering nearly identical amplicons iteratively using a local threshold, and then by using clusters' internal structure and amplicon abundances to refine its results. This fast, scalable, and input-order independent approach reduces the influence of clustering parameters and produces robust operational taxonomic units.
Swarm: robust and fast clustering method for amplicon-based studies
Rognes, Torbjørn; Quince, Christopher; de Vargas, Colomban; Dunthorn, Micah
2014-01-01
Popular de novo amplicon clustering methods suffer from two fundamental flaws: arbitrary global clustering thresholds, and input-order dependency induced by centroid selection. Swarm was developed to address these issues by first clustering nearly identical amplicons iteratively using a local threshold, and then by using clusters’ internal structure and amplicon abundances to refine its results. This fast, scalable, and input-order independent approach reduces the influence of clustering parameters and produces robust operational taxonomic units. PMID:25276506
Razali, Haslina; O'Connor, Emily; Drews, Anna; Burke, Terry; Westerdahl, Helena
2017-07-28
High-throughput sequencing enables high-resolution genotyping of extremely duplicated genes. 454 amplicon sequencing (454) has become the standard technique for genotyping the major histocompatibility complex (MHC) genes in non-model organisms. However, illumina MiSeq amplicon sequencing (MiSeq), which offers a much higher read depth, is now superseding 454. The aim of this study was to quantitatively and qualitatively evaluate the performance of MiSeq in relation to 454 for genotyping MHC class I alleles using a house sparrow (Passer domesticus) dataset with pedigree information. House sparrows provide a good study system for this comparison as their MHC class I genes have been studied previously and, consequently, we had prior expectations concerning the number of alleles per individual. We found that 454 and MiSeq performed equally well in genotyping amplicons with low diversity, i.e. amplicons from individuals that had fewer than 6 alleles. Although there was a higher rate of failure in the 454 dataset in resolving amplicons with higher diversity (6-9 alleles), the same genotypes were identified by both 454 and MiSeq in 98% of cases. We conclude that low diversity amplicons are equally well genotyped using either 454 or MiSeq, but the higher coverage afforded by MiSeq can lead to this approach outperforming 454 in amplicons with higher diversity.
AMPLISAS: a web server for multilocus genotyping using next-generation amplicon sequencing data.
Sebastian, Alvaro; Herdegen, Magdalena; Migalska, Magdalena; Radwan, Jacek
2016-03-01
Next-generation sequencing (NGS) technologies are revolutionizing the fields of biology and medicine as powerful tools for amplicon sequencing (AS). Using combinations of primers and barcodes, it is possible to sequence targeted genomic regions with deep coverage for hundreds, even thousands, of individuals in a single experiment. This is extremely valuable for the genotyping of gene families in which locus-specific primers are often difficult to design, such as the major histocompatibility complex (MHC). The utility of AS is, however, limited by the high intrinsic sequencing error rates of NGS technologies and other sources of error such as polymerase amplification or chimera formation. Correcting these errors requires extensive bioinformatic post-processing of NGS data. Amplicon Sequence Assignment (AMPLISAS) is a tool that performs analysis of AS results in a simple and efficient way, while offering customization options for advanced users. AMPLISAS is designed as a three-step pipeline consisting of (i) read demultiplexing, (ii) unique sequence clustering and (iii) erroneous sequence filtering. Allele sequences and frequencies are retrieved in excel spreadsheet format, making them easy to interpret. AMPLISAS performance has been successfully benchmarked against previously published genotyped MHC data sets obtained with various NGS technologies. © 2015 John Wiley & Sons Ltd.
Unlabeled oligonucleotides as internal temperature controls for genotyping by amplicon melting.
Seipp, Michael T; Durtschi, Jacob D; Liew, Michael A; Williams, Jamie; Damjanovich, Kristy; Pont-Kingdon, Genevieve; Lyon, Elaine; Voelkerding, Karl V; Wittwer, Carl T
2007-07-01
Amplicon melting is a closed-tube method for genotyping that does not require probes, real-time analysis, or allele-specific polymerase chain reaction. However, correct differentiation of homozygous mutant and wild-type samples by melting temperature (Tm) requires high-resolution melting and closely controlled reaction conditions. When three different DNA extraction methods were used to isolate DNA from whole blood, amplicon Tm differences of 0.03 to 0.39 degrees C attributable to the extractions were observed. To correct for solution chemistry differences between samples, complementary unlabeled oligonucleotides were included as internal temperature controls to shift and scale the temperature axis of derivative melting plots. This adjustment was applied to a duplex amplicon melting assay for the methylenetetrahydrofolate reductase variants 1298A>C and 677C>T. High- and low-temperature controls bracketing the amplicon melting region decreased the Tm SD within homozygous genotypes by 47 to 82%. The amplicon melting assay was 100% concordant to an adjacent hybridization probe (HybProbe) melting assay when temperature controls were included, whereas a 3% error rate was observed without temperature correction. In conclusion, internal temperature controls increase the accuracy of genotyping by high-resolution amplicon melting and should also improve results on lower resolution instruments.
Cousins, Matthew M.; Donnell, Deborah; Eshleman, Susan H.
2013-01-01
We adapted high-resolution melting (HRM) technology to measure genetic diversity without sequencing. Diversity is measured as a single numeric HRM score. Herein, we determined the impact of mutation types and amplicon characteristics on HRM diversity scores. Plasmids were generated with single-base changes, insertions, and deletions. Different primer sets were used to vary the position of mutations within amplicons. Plasmids and plasmid mixtures were analyzed to determine the impact of mutation type, position, and concentration on HRM scores. The impact of amplicon length and G/C content on HRM scores was also evaluated. Different mutation types affected HRM scores to varying degrees (1-bp deletion < 1-bp change < 3-bp insertion < 9-bp insertion). The impact of mutations on HRM scores was influenced by amplicon length and the position of the mutation within the amplicon. Mutations were detected at concentrations of 5% to 95%, with the greatest impact at 50%. The G/C content altered melting temperature values of amplicons but had no impact on HRM scores. These data are relevant to the design of assays that measure genetic diversity using HRM technology. PMID:23178437
DNA analysis of molluscs from a museum wet collection: a comparison of different extraction methods.
Jaksch, Katharina; Eschner, Anita; Rintelen, Thomas V; Haring, Elisabeth
2016-07-18
DNA isolation and PCR amplification from molluscan taxa is considered as problematic because polysaccharides in tissue and mucus presumably co-precipitate with the DNA and inhibit the activity of DNA polymerase. In the present study we tested two common extraction methods on specimens from the mollusc collection of the Natural History Museum Vienna (NHMW). We analysed a broad variety of taxa covering a large temporal span (acquisition years 1877 to 1999), which distinguishes our study from previous ones where mostly fresh material was used. We also took other factors into account: effects of sample age, effects of formaldehyde treatment and taxon-specific problems. We used several primer combinations to amplify amplicons of different lengths of two mitochondrial genes: cytochrome c oxidase subunit 1 (COI) and 16S rRNA gene (16S). Overall PCR success was 43 % in the 576 extractions (including all primer combinations). The smallest amplicon (~240 bp) showed the best results (49 % positive reactions), followed by the 400 bp amplicon (40.5 %). Both short sections yielded significantly better results than the 700 bp long amplicon (27 %). Comparatively, the Gen-ial-First, All-tissue DNA-Kit-extraction method performed significantly better than Promega-Tissue and Hair Extraction Kit. Generally, PCR success is age-dependent. Nonetheless, we were able to obtain the longest amplicon even from 137-year-old material. Importantly, formaldehyde traces did not totally inhibit amplification success, although very high concentrations did. Museum material has gained importance for DNA analysis in recent years, especially for DNA barcoding projects. In some cases, however, the amplification of the standard barcoding region (partial sequence of the COI) is problematic with old material. Our study clearly shows that the COI barcoding region could be amplified in up to 49 % of PCRs (varying with amplicon length), which is, for museum samples, quite a high percentage. The difference between extraction methods was minimal and we recommend using an established kit for a first attempt because experience and routine in handling might be more important than slight performance differences of the various kits. Finally, we identify fixation, storage, sample conservation and documentation of the specimens' history rather than the DNA extraction method to be the most crucial factors for PCR success.
SEED 2: a user-friendly platform for amplicon high-throughput sequencing data analyses.
Vetrovský, Tomáš; Baldrian, Petr; Morais, Daniel; Berger, Bonnie
2018-02-14
Modern molecular methods have increased our ability to describe microbial communities. Along with the advances brought by new sequencing technologies, we now require intensive computational resources to make sense of the large numbers of sequences continuously produced. The software developed by the scientific community to address this demand, although very useful, require experience of the command-line environment, extensive training and have steep learning curves, limiting their use. We created SEED 2, a graphical user interface for handling high-throughput amplicon-sequencing data under Windows operating systems. SEED 2 is the only sequence visualizer that empowers users with tools to handle amplicon-sequencing data of microbial community markers. It is suitable for any marker genes sequences obtained through Illumina, IonTorrent or Sanger sequencing. SEED 2 allows the user to process raw sequencing data, identify specific taxa, produce of OTU-tables, create sequence alignments and construct phylogenetic trees. Standard dual core laptops with 8 GB of RAM can handle ca. 8 million of Illumina PE 300 bp sequences, ca. 4GB of data. SEED 2 was implemented in Object Pascal and uses internal functions and external software for amplicon data processing. SEED 2 is a freeware software, available at http://www.biomed.cas.cz/mbu/lbwrf/seed/ as a self-contained file, including all the dependencies, and does not require installation. Supplementary data contain a comprehensive list of supported functions. daniel.morais@biomed.cas.cz. Supplementary data are available at Bioinformatics online. © The Author(s) 2018. Published by Oxford University Press.
Dou, Yanmei; Yang, Xiaoxu; Li, Ziyi; Wang, Sheng; Zhang, Zheng; Ye, Adam Yongxin; Yan, Linlin; Yang, Changhong; Wu, Qixi; Li, Jiarui; Zhao, Boxun; Huang, August Yue; Wei, Liping
2017-08-01
The roles and characteristics of postzygotic single-nucleotide mosaicisms (pSNMs) in autism spectrum disorders (ASDs) remain unclear. In this study of the whole exomes of 2,361 families in the Simons Simplex Collection, we identified 1,248 putative pSNMs in children and 285 de novo SNPs in children with detectable parental mosaicism. Ultra-deep amplicon resequencing suggested a validation rate of 51%. Analyses of validated pSNMs revealed that missense/loss-of-function (LoF) pSNMs with a high mutant allele fraction (MAF≥ 0.2) contributed to ASD diagnoses (P = 0.022, odds ratio [OR] = 5.25), whereas missense/LoF pSNMs with a low MAF (MAF<0.2) contributed to autistic traits in male non-ASD siblings (P = 0.033). LoF pSNMs in parents were less likely to be transmitted to offspring than neutral pSNMs (P = 0.037), and missense/LoF pSNMs in parents with a low MAF were transmitted more to probands than to siblings (P = 0.016, OR = 1.45). We estimated that pSNMs in probands or de novo mutations inherited from parental pSNMs increased the risk of ASD by approximately 6%. Adding pSNMs into the transmission and de novo association test model revealed 13 new ASD risk genes. These results expand the existing repertoire of genes involved in ASD and shed new light on the contribution of genomic mosaicisms to ASD diagnoses and autistic traits. © 2017 The Authors. Human Mutation published by Wiley Periodicals, Inc.
Nanopore sequencing of drug-resistance-associated genes in malaria parasites, Plasmodium falciparum.
Runtuwene, Lucky R; Tuda, Josef S B; Mongan, Arthur E; Makalowski, Wojciech; Frith, Martin C; Imwong, Mallika; Srisutham, Suttipat; Nguyen Thi, Lan Anh; Tuan, Nghia Nguyen; Eshita, Yuki; Maeda, Ryuichiro; Yamagishi, Junya; Suzuki, Yutaka
2018-05-29
Here, we report the application of a portable sequencer, MinION, for genotyping the malaria parasite Plasmodium falciparum. In the present study, an amplicon mixture of nine representative genes causing resistance to anti-malaria drugs is diagnosed. First, we developed the procedure for four laboratory strains (3D7, Dd2, 7G8, and K1), and then applied the developed procedure to ten clinical samples. We sequenced and re-sequenced the samples using the obsolete flow cell R7.3 and the most recent flow cell R9.4. Although the average base-call accuracy of the MinION sequencer was 74.3%, performing >50 reads at a given position improves the accuracy of the SNP call, yielding a precision and recall rate of 0.92 and 0.8, respectively, with flow cell R7.3. These numbers increased significantly with flow cell R9.4, in which the precision and recall are 1 and 0.97, respectively. Based on the SNP information, the drug resistance status in ten clinical samples was inferred. We also analyzed K13 gene mutations from 54 additional clinical samples as a proof of concept. We found that a novel amino-acid changing variation is dominant in this area. In addition, we performed a small population-based analysis using 3 and 5 cases (K13) and 10 and 5 cases (PfCRT) from Thailand and Vietnam, respectively. We identified distinct genotypes from the respective regions. This approach will change the standard methodology for the sequencing diagnosis of malaria parasites, especially in developing countries.
Single-Molecule Electrical Random Resequencing of DNA and RNA
NASA Astrophysics Data System (ADS)
Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji
2012-07-01
Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.
PCR Amplicon Prediction from Multiplex Degenerate Primer and Probe Sets
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S. N.
2013-08-08
Assessing primer specificity and predicting both desired and off-target amplification products is an essential step for robust PCR assay design. Code is described to predict potential polymerase chain reaction (PCR) amplicons in a large sequence database such as NCBI nt from either singleplex or a large multiplexed set of primers, allowing degenerate primer and probe bases, with target mismatch annotates amplicons with gene information automatically downloaded from NCBI, and optionally it can predict whether there are also TaqMan/Luminex probe matches within predicted amplicons.
Unlabeled Oligonucleotides as Internal Temperature Controls for Genotyping by Amplicon Melting
Seipp, Michael T.; Durtschi, Jacob D.; Liew, Michael A.; Williams, Jamie; Damjanovich, Kristy; Pont-Kingdon, Genevieve; Lyon, Elaine; Voelkerding, Karl V.; Wittwer, Carl T.
2007-01-01
Amplicon melting is a closed-tube method for genotyping that does not require probes, real-time analysis, or allele-specific polymerase chain reaction. However, correct differentiation of homozygous mutant and wild-type samples by melting temperature (Tm) requires high-resolution melting and closely controlled reaction conditions. When three different DNA extraction methods were used to isolate DNA from whole blood, amplicon Tm differences of 0.03 to 0.39°C attributable to the extractions were observed. To correct for solution chemistry differences between samples, complementary unlabeled oligonucleotides were included as internal temperature controls to shift and scale the temperature axis of derivative melting plots. This adjustment was applied to a duplex amplicon melting assay for the methylenetetrahydrofolate reductase variants 1298A>C and 677C>T. High- and low-temperature controls bracketing the amplicon melting region decreased the Tm SD within homozygous genotypes by 47 to 82%. The amplicon melting assay was 100% concordant to an adjacent hybridization probe (HybProbe) melting assay when temperature controls were included, whereas a 3% error rate was observed without temperature correction. In conclusion, internal temperature controls increase the accuracy of genotyping by high-resolution amplicon melting and should also improve results on lower resolution instruments. PMID:17591926
Generic Amplicon Deep Sequencing to Determine Ilarvirus Species Diversity in Australian Prunus
Kinoti, Wycliff M.; Constable, Fiona E.; Nancarrow, Narelle; Plummer, Kim M.; Rodoni, Brendan
2017-01-01
The distribution of Ilarvirus species populations amongst 61 Australian Prunus trees was determined by next generation sequencing (NGS) of amplicons generated using a genus-based generic RT-PCR targeting a conserved region of the Ilarvirus RNA2 component that encodes the RNA dependent RNA polymerase (RdRp) gene. Presence of Ilarvirus sequences in each positive sample was further validated by Sanger sequencing of cloned amplicons of regions of each of RNA1, RNA2 and/or RNA3 that were generated by species specific PCRs and by metagenomic NGS. Prunus necrotic ringspot virus (PNRSV) was the most frequently detected Ilarvirus, occurring in 48 of the 61 Ilarvirus-positive trees and Prune dwarf virus (PDV) and Apple mosaic virus (ApMV) were detected in three trees and one tree, respectively. American plum line pattern virus (APLPV) was detected in three trees and represents the first report of APLPV detection in Australia. Two novel and distinct groups of Ilarvirus-like RNA2 amplicon sequences were also identified in several trees by the generic amplicon NGS approach. The high read depth from the amplicon NGS of the generic PCR products allowed the detection of distinct RNA2 RdRp sequence variant populations of PNRSV, PDV, ApMV, APLPV and the two novel Ilarvirus-like sequences. Mixed infections of ilarviruses were also detected in seven Prunus trees. Sanger sequencing of specific RNA1, RNA2, and/or RNA3 genome segments of each virus and total nucleic acid metagenomics NGS confirmed the presence of PNRSV, PDV, ApMV and APLPV detected by RNA2 generic amplicon NGS. However, the two novel groups of Ilarvirus-like RNA2 amplicon sequences detected by the generic amplicon NGS could not be associated to the presence of sequence from RNA1 or RNA3 genome segments or full Ilarvirus genomes, and their origin is unclear. This work highlights the sensitivity of genus-specific amplicon NGS in detection of virus sequences and their distinct populations in multiple samples, and the need for a standardized approach to accurately determine what constitutes an active, viable virus infection after detection by molecular based methods. PMID:28713347
Generic Amplicon Deep Sequencing to Determine Ilarvirus Species Diversity in Australian Prunus.
Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan
2017-01-01
The distribution of Ilarvirus species populations amongst 61 Australian Prunus trees was determined by next generation sequencing (NGS) of amplicons generated using a genus-based generic RT-PCR targeting a conserved region of the Ilarvirus RNA2 component that encodes the RNA dependent RNA polymerase (RdRp) gene. Presence of Ilarvirus sequences in each positive sample was further validated by Sanger sequencing of cloned amplicons of regions of each of RNA1, RNA2 and/or RNA3 that were generated by species specific PCRs and by metagenomic NGS. Prunus necrotic ringspot virus (PNRSV) was the most frequently detected Ilarvirus , occurring in 48 of the 61 Ilarvirus -positive trees and Prune dwarf virus (PDV) and Apple mosaic virus (ApMV) were detected in three trees and one tree, respectively. American plum line pattern virus (APLPV) was detected in three trees and represents the first report of APLPV detection in Australia. Two novel and distinct groups of Ilarvirus -like RNA2 amplicon sequences were also identified in several trees by the generic amplicon NGS approach. The high read depth from the amplicon NGS of the generic PCR products allowed the detection of distinct RNA2 RdRp sequence variant populations of PNRSV, PDV, ApMV, APLPV and the two novel Ilarvirus -like sequences. Mixed infections of ilarviruses were also detected in seven Prunus trees. Sanger sequencing of specific RNA1, RNA2, and/or RNA3 genome segments of each virus and total nucleic acid metagenomics NGS confirmed the presence of PNRSV, PDV, ApMV and APLPV detected by RNA2 generic amplicon NGS. However, the two novel groups of Ilarvirus -like RNA2 amplicon sequences detected by the generic amplicon NGS could not be associated to the presence of sequence from RNA1 or RNA3 genome segments or full Ilarvirus genomes, and their origin is unclear. This work highlights the sensitivity of genus-specific amplicon NGS in detection of virus sequences and their distinct populations in multiple samples, and the need for a standardized approach to accurately determine what constitutes an active, viable virus infection after detection by molecular based methods.
Efficient error correction for next-generation sequencing of viral amplicons
2012-01-01
Background Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. Results In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Conclusions Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses. The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm PMID:22759430
Efficient error correction for next-generation sequencing of viral amplicons.
Skums, Pavel; Dimitrova, Zoya; Campo, David S; Vaughan, Gilberto; Rossi, Livia; Forbi, Joseph C; Yokosawa, Jonny; Zelikovsky, Alex; Khudyakov, Yury
2012-06-25
Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses.The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm.
ERIC Educational Resources Information Center
Elkins, Kelly M.; Kadunc, Raelynn E.
2012-01-01
In this laboratory experiment, real-time polymerase chain reaction (real-time PCR) was conducted using published human TPOX single-locus DNA primers for validation and various student-designed short tandem repeat (STR) primers for Combined DNA Index System (CODIS) loci. SYBR Green was used to detect the amplification of the expected amplicons. The…
Electroanalytical study of proflavine intercalation in 5-methyl or inosine-containing amplicons.
Alexiadou, Despina K; Ioannou, Andrea K; Kouidou-Andreou, Sofia A; Voulgaropoulos, Anastasios N; Girousi, Stella Th
2008-10-01
Amplicons corresponding to the GC-rich p53 exon 5 and its analogues, synthesized by substituting 60% of cytosine by 5-methyl-cytosine, or 60% of guanosine by inosine and GC-poor p53 exon 6 were synthesized and investigated electrochemically, in the presence and absence of proflavine, by differential pulse voltammetry (DPV). Incorporation of base analogues and the thermal stability of the resulting amplicons were tested in the presence of a fluorescent probe (Sybr-Green). Peak current at 1.0 V was lower for methylated than for unmethylated PCR amplicons and was similarly affected by proflavine intercalation. In contrast, considerable peak current differences were observed in the presence of proflavine for unmodified exon 5 v.s. exon 6 or inosine-containing amplicons. Thermal analysis verified the expected shifts in melting temperature (T (m)) due to the base analogue incorporation and GC-content variations. In conclusion, methylated and unmethylated PCR amplicons could be distinguished in model DNA systems using differential pulse voltammetry (DPV) and use of proflavine could serve as an electrochemical probe for identifying different DNA conformations.
Henry, Kevin A
2018-01-01
Immunogenetic analyses of expressed antibody repertoires are becoming increasingly common experimental investigations and are critical to furthering our understanding of autoimmunity, infectious disease, and cancer. Next-generation DNA sequencing (NGS) technologies have now made it possible to interrogate antibody repertoires to unprecedented depths, typically by sequencing of cDNAs encoding immunoglobulin variable domains. In this chapter, we describe simple, fast, and reliable methods for producing and sequencing multiplex PCR amplicons derived from the variable regions (V H , V H H or V L ) of rearranged immunoglobulin heavy and light chain genes using the Illumina MiSeq platform. We include complete protocols and primer sets for amplicon sequencing of V H /V H H/V L repertoires directly from human, mouse, and llama lymphocytes as well as from phage-displayed V H /V H H/V L libraries; these can be easily be adapted to other types of amplicons with little modification. The resulting amplicons are diverse and representative, even using as few as 10 3 input B cells, and their generation is relatively inexpensive, requiring no special equipment and only a limited set of primers. In the absence of heavy-light chain pairing, single-domain antibodies are uniquely amenable to NGS analyses. We present a number of applications of NGS technology useful in discovery of single-domain antibodies from phage display libraries, including: (i) assessment of library functionality; (ii) confirmation of desired library randomization; (iii) estimation of library diversity; and (iv) monitoring the progress of panning experiments. While the case studies presented here are of phage-displayed single-domain antibody libraries, the principles extend to other types of in vitro display libraries.
Coon, Keith D; Valla, Jon; Szelinger, Szabolics; Schneider, Lonnie E; Niedzielko, Tracy L; Brown, Kevin M; Pearson, John V; Halperin, Rebecca; Dunckley, Travis; Papassotiropoulos, Andreas; Caselli, Richard J; Reiman, Eric M; Stephan, Dietrich A
2006-08-01
The role of mitochondrial dysfunction in the pathogenesis of Alzheimer's disease (AD) has been well documented. Though evidence for the role of mitochondria in AD seems incontrovertible, the impact of mitochondrial DNA (mtDNA) mutations in AD etiology remains controversial. Though mutations in mitochondrially encoded genes have repeatedly been implicated in the pathogenesis of AD, many of these studies have been plagued by lack of replication as well as potential contamination of nuclear-encoded mitochondrial pseudogenes. To assess the role of mtDNA mutations in the pathogenesis of AD, while avoiding the pitfalls of nuclear-encoded mitochondrial pseudogenes encountered in previous investigations and showcasing the benefits of a novel resequencing technology, we sequenced the entire coding region (15,452 bp) of mtDNA from 19 extremely well-characterized AD patients and 18 age-matched, unaffected controls utilizing a new, reliable, high-throughput array-based resequencing technique, the Human MitoChip. High-throughput, array-based DNA resequencing of the entire mtDNA coding region from platelets of 37 subjects revealed the presence of 208 loci displaying a total of 917 sequence variants. There were no statistically significant differences in overall mutational burden between cases and controls, however, 265 independent sites of statistically significant change between cases and controls were identified. Changed sites were found in genes associated with complexes I (30.2%), III (3.0%), IV (33.2%), and V (9.1%) as well as tRNA (10.6%) and rRNA (14.0%). Despite their statistical significance, the subtle nature of the observed changes makes it difficult to determine whether they represent true functional variants involved in AD etiology or merely naturally occurring dissimilarity. Regardless, this study demonstrates the tremendous value of this novel mtDNA resequencing platform, which avoids the pitfalls of erroneously amplifying nuclear-encoded mtDNA pseudogenes, and our proposed analysis paradigm, which utilizes the availability of raw signal intensity values for each of the four potential alleles to facilitate quantitative estimates of mtDNA heteroplasmy. This information provides a potential new target for burgeoning diagnostics and therapeutics that could truly assist those suffering from this devastating disorder.
JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms
Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim
2015-01-01
The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/ PMID:26424080
JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.
Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim
2015-01-01
The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/. © The Author(s) 2015. Published by Oxford University Press.
Evaluation of Quality Assessment Protocols for High Throughput Genome Resequencing Data
Chiara, Matteo; Pavesi, Giulio
2017-01-01
Large-scale initiatives aiming to recover the complete sequence of thousands of human genomes are currently being undertaken worldwide, concurring to the generation of a comprehensive catalog of human genetic variation. The ultimate and most ambitious goal of human population scale genomics is the characterization of the so-called human “variome,” through the identification of causal mutations or haplotypes. Several research institutions worldwide currently use genotyping assays based on Next-Generation Sequencing (NGS) for diagnostics and clinical screenings, and the widespread application of such technologies promises major revolutions in medical science. Bioinformatic analysis of human resequencing data is one of the main factors limiting the effectiveness and general applicability of NGS for clinical studies. The requirement for multiple tools, to be combined in dedicated protocols in order to accommodate different types of data (gene panels, exomes, or whole genomes) and the high variability of the data makes difficult the establishment of a ultimate strategy of general use. While there already exist several studies comparing sensitivity and accuracy of bioinformatic pipelines for the identification of single nucleotide variants from resequencing data, little is known about the impact of quality assessment and reads pre-processing strategies. In this work we discuss major strengths and limitations of the various genome resequencing protocols are currently used in molecular diagnostics and for the discovery of novel disease-causing mutations. By taking advantage of publicly available data we devise and suggest a series of best practices for the pre-processing of the data that consistently improve the outcome of genotyping with minimal impacts on computational costs. PMID:28736571
Quantification of differential gene expression by multiplexed targeted resequencing of cDNA
Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.
2017-01-01
Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677
Evaluating information content of SNPs for sample-tagging in re-sequencing projects.
Hu, Hao; Liu, Xiang; Jin, Wenfei; Hilger Ropers, H; Wienker, Thomas F
2015-05-15
Sample-tagging is designed for identification of accidental sample mix-up, which is a major issue in re-sequencing studies. In this work, we develop a model to measure the information content of SNPs, so that we can optimize a panel of SNPs that approach the maximal information for discrimination. The analysis shows that as low as 60 optimized SNPs can differentiate the individuals in a population as large as the present world, and only 30 optimized SNPs are in practice sufficient in labeling up to 100 thousand individuals. In the simulated populations of 100 thousand individuals, the average Hamming distances, generated by the optimized set of 30 SNPs are larger than 18, and the duality frequency, is lower than 1 in 10 thousand. This strategy of sample discrimination is proved robust in large sample size and different datasets. The optimized sets of SNPs are designed for Whole Exome Sequencing, and a program is provided for SNP selection, allowing for customized SNP numbers and interested genes. The sample-tagging plan based on this framework will improve re-sequencing projects in terms of reliability and cost-effectiveness.
Herbold, Craig W.; Pelikan, Claus; Kuzyk, Orest; Hausmann, Bela; Angel, Roey; Berry, David; Loy, Alexander
2015-01-01
High throughput sequencing of phylogenetic and functional gene amplicons provides tremendous insight into the structure and functional potential of complex microbial communities. Here, we introduce a highly adaptable and economical PCR approach to barcoding and pooling libraries of numerous target genes. In this approach, we replace gene- and sequencing platform-specific fusion primers with general, interchangeable barcoding primers, enabling nearly limitless customized barcode-primer combinations. Compared to barcoding with long fusion primers, our multiple-target gene approach is more economical because it overall requires lower number of primers and is based on short primers with generally lower synthesis and purification costs. To highlight our approach, we pooled over 900 different small-subunit rRNA and functional gene amplicon libraries obtained from various environmental or host-associated microbial community samples into a single, paired-end Illumina MiSeq run. Although the amplicon regions ranged in size from approximately 290 to 720 bp, we found no significant systematic sequencing bias related to amplicon length or gene target. Our results indicate that this flexible multiplexing approach produces large, diverse, and high quality sets of amplicon sequence data for modern studies in microbial ecology. PMID:26236305
Formation of Linear Amplicons with Inverted Duplications in Leishmania Requires the MRE11 Nuclease
Laffitte, Marie-Claude N.; Genois, Marie-Michelle; Mukherjee, Angana; Légaré, Danielle; Masson, Jean-Yves; Ouellette, Marc
2014-01-01
Extrachromosomal DNA amplification is frequent in the protozoan parasite Leishmania selected for drug resistance. The extrachromosomal amplified DNA is either circular or linear, and is formed at the level of direct or inverted homologous repeated sequences that abound in the Leishmania genome. The RAD51 recombinase plays an important role in circular amplicons formation, but the mechanism by which linear amplicons are formed is unknown. We hypothesized that the Leishmania infantum DNA repair protein MRE11 is required for linear amplicons following rearrangements at the level of inverted repeats. The purified LiMRE11 protein showed both DNA binding and exonuclease activities. Inactivation of the LiMRE11 gene led to parasites with enhanced sensitivity to DNA damaging agents. The MRE11−/− parasites had a reduced capacity to form linear amplicons after drug selection, and the reintroduction of an MRE11 allele led to parasites regaining their capacity to generate linear amplicons, but only when MRE11 had an active nuclease activity. These results highlight a novel MRE11-dependent pathway used by Leishmania to amplify portions of its genome to respond to a changing environment. PMID:25474106
Genetics and Pathogenesis of Diffuse Large B-Cell Lymphoma.
Schmitz, Roland; Wright, George W; Huang, Da Wei; Johnson, Calvin A; Phelan, James D; Wang, James Q; Roulland, Sandrine; Kasbekar, Monica; Young, Ryan M; Shaffer, Arthur L; Hodson, Daniel J; Xiao, Wenming; Yu, Xin; Yang, Yandan; Zhao, Hong; Xu, Weihong; Liu, Xuelu; Zhou, Bin; Du, Wei; Chan, Wing C; Jaffe, Elaine S; Gascoyne, Randy D; Connors, Joseph M; Campo, Elias; Lopez-Guillermo, Armando; Rosenwald, Andreas; Ott, German; Delabie, Jan; Rimsza, Lisa M; Tay Kuang Wei, Kevin; Zelenetz, Andrew D; Leonard, John P; Bartlett, Nancy L; Tran, Bao; Shetty, Jyoti; Zhao, Yongmei; Soppet, Dan R; Pittaluga, Stefania; Wilson, Wyndham H; Staudt, Louis M
2018-04-12
Diffuse large B-cell lymphomas (DLBCLs) are phenotypically and genetically heterogeneous. Gene-expression profiling has identified subgroups of DLBCL (activated B-cell-like [ABC], germinal-center B-cell-like [GCB], and unclassified) according to cell of origin that are associated with a differential response to chemotherapy and targeted agents. We sought to extend these findings by identifying genetic subtypes of DLBCL based on shared genomic abnormalities and to uncover therapeutic vulnerabilities based on tumor genetics. We studied 574 DLBCL biopsy samples using exome and transcriptome sequencing, array-based DNA copy-number analysis, and targeted amplicon resequencing of 372 genes to identify genes with recurrent aberrations. We developed and implemented an algorithm to discover genetic subtypes based on the co-occurrence of genetic alterations. We identified four prominent genetic subtypes in DLBCL, termed MCD (based on the co-occurrence of MYD88 L265P and CD79B mutations), BN2 (based on BCL6 fusions and NOTCH2 mutations), N1 (based on NOTCH1 mutations), and EZB (based on EZH2 mutations and BCL2 translocations). Genetic aberrations in multiple genes distinguished each genetic subtype from other DLBCLs. These subtypes differed phenotypically, as judged by differences in gene-expression signatures and responses to immunochemotherapy, with favorable survival in the BN2 and EZB subtypes and inferior outcomes in the MCD and N1 subtypes. Analysis of genetic pathways suggested that MCD and BN2 DLBCLs rely on "chronic active" B-cell receptor signaling that is amenable to therapeutic inhibition. We uncovered genetic subtypes of DLBCL with distinct genotypic, epigenetic, and clinical characteristics, providing a potential nosology for precision-medicine strategies in DLBCL. (Funded by the Intramural Research Program of the National Institutes of Health and others.).
pPCV, a versatile vector for cloning PCR products.
Janner, Christiane R; Brito, Ana Lívia P; Moraes, Lidia Maria P; Reis, Viviane Cb; Torres, Fernando Ag
2013-01-01
The efficiency of PCR product cloning depends on the nature of the DNA polymerase employed because amplicons may have blunt-ends or 3' adenosines overhangs. Therefore, for amplicon cloning, available commercial vectors are either blunt-ended or have a single 3' overhanging thymidine. The aim of this work was to offer in a single vector the ability to clone both types of PCR products. For that purpose, a minimal polylinker was designed to include restriction sites for EcoRV and XcmI which enable direct cloning of amplicons bearing blunt-ends or A-overhangs, respectively, still offering blue/white selection. When tested, the resulting vector, pPCV, presented high efficiency cloning of both types of amplicons.
Pirim, Dilek; Wang, Xingbin; Niemsiri, Vipavee; Radwan, Zaheda H.; Bunker, Clareann H.; Hokanson, John E.; Hamman, Richard F.; Barmada, M. Michael; Demirci, F. Yesim; Kamboh, M. Ilyas
2015-01-01
Background Cholesteryl ester transfer protein (CETP) plays a crucial role in lipid metabolism. Associations of common CETP variants with variation in plasma lipid levels, and/or CETP mass/activity have been extensively studied and well-documented; however, the effects of uncommon/rare CETP variants on plasma lipid profile remain undefined. Hence, resequencing of the gene in extreme phenotypes and follow-up rare-variant association analyses are essential to fill this gap. Objective To identify common and uncommon/rare variants in the CETP gene by resequencing the entire gene and test the effects of both common and uncommon/rare CETP variants on plasma lipid traits in two genetically distinct populations. Methods and Results The entire CETP gene plus flanking regions were resequenced in 190 individuals comprising 95 non-Hispanic Whites (NHWs) and 95 African blacks with extreme HDL-C levels. A total of 279 sequence variants were identified, of which 25 were novel. Selected variants were genotyped in the entire samples of 623 NHWs and 788 African blacks and 184 QC-passed variants were tested in relation to plasma lipid traits by using gene-based, single-site, haplotype and rare variant association analyses (SKAT-O). Two novel and independent associations of rs1968905 and rs289740 with HDL-C were identified in African blacks. Using SKAT-O analysis, we also identified rare variants with minor allele frequency <0.01 to be associated with HDL-C in both NHWs (P=0.024) and African blacks (P=0.009). Conclusions Our results point out that in addition to the common CETP variants, rare genetic variants in the CETP gene also contribute to the phenotypic variation of HDL-C in the general population. PMID:26683795
Sena-Esteves, Miguel; Saeki, Yoshinaga; Camp, Sara M.; Chiocca, E. Antonio; Breakefield, Xandra O.
1999-01-01
We report here on the development and characterization of a novel herpes simplex virus type 1 (HSV-1) amplicon-based vector system which takes advantage of the host range and retention properties of HSV–Epstein-Barr virus (EBV) hybrid amplicons to efficiently convert cells to retrovirus vector producer cells after single-step transduction. The retrovirus genes gag-pol and env (GPE) and retroviral vector sequences were modified to minimize sequence overlap and cloned into an HSV-EBV hybrid amplicon. Retrovirus expression cassettes were used to generate the HSV-EBV-retrovirus hybrid vectors, HERE and HERA, which code for the ecotropic and the amphotropic envelopes, respectively. Retrovirus vector sequences encoding lacZ were cloned downstream from the GPE expression unit. Transfection of 293T/17 cells with amplicon plasmids yielded retrovirus titers between 106 and 107 transducing units/ml, while infection of the same cells with amplicon vectors generated maximum titers 1 order of magnitude lower. Retrovirus titers were dependent on the extent of transduction by amplicon vectors for the same cell line, but different cell lines displayed varying capacities to produce retrovirus vectors even at the same transduction efficiencies. Infection of human and dog primary gliomas with this system resulted in the production of retrovirus vectors for more than 1 week and the long-term retention and increase in transgene activity over time in these cell populations. Although the efficiency of this system still has to be determined in vivo, many applications are foreseeable for this approach to gene delivery. PMID:10559361
Decelle, Johan; Romac, Sarah; Sasaki, Eriko; Not, Fabrice; Mahé, Frédéric
2014-01-01
Metabarcoding is a powerful tool for exploring microbial diversity in the environment, but its accurate interpretation is impeded by diverse technical (e.g. PCR and sequencing errors) and biological biases (e.g. intra-individual polymorphism) that remain poorly understood. To help interpret environmental metabarcoding datasets, we investigated the intracellular diversity of the V4 and V9 regions of the 18S rRNA gene from Acantharia and Nassellaria (radiolarians) using 454 pyrosequencing. Individual cells of radiolarians were isolated, and PCRs were performed with generalist primers to amplify the V4 and V9 regions. Different denoising procedures were employed to filter the pyrosequenced raw amplicons (Acacia, AmpliconNoise, Linkage method). For each of the six isolated cells, an average of 541 V4 and 562 V9 amplicons assigned to radiolarians were obtained, from which one numerically dominant sequence and several minor variants were found. At the 97% identity, a diversity metrics commonly used in environmental surveys, up to 5 distinct OTUs were detected in a single cell. However, most amplicons grouped within a single OTU whereas other OTUs contained very few amplicons. Different analytical methods provided evidence that most minor variants forming different OTUs correspond to PCR and sequencing artifacts. Duplicate PCR and sequencing from the same DNA extract of a single cell had only 9 to 16% of unique amplicons in common, and alignment visualization of V4 and V9 amplicons showed that most minor variants contained substitutions in highly-conserved regions. We conclude that intracellular variability of the 18S rRNA in radiolarians is very limited despite its multi-copy nature and the existence of multiple nuclei in these protists. Our study recommends some technical guidelines to conservatively discard artificial amplicons from metabarcoding datasets, and thus properly assess the diversity and richness of protists in the environment.
Patterns and drivers of fungal community depth stratification in Sphagnum peat
Louis J. Lamit; Karl J. Romanowicz; Lynette R. Potvin; Adam R. Rivers; Kanwar Singh; Jay T. Lennon; Susannah G. Tringe; Evan S. Kane; Erik A. Lilleskov
2017-01-01
Peatlands store an immense pool of soil carbon vulnerable to microbial oxidation due to drought and intentional draining. We used amplicon sequencing and quantitative PCR to (i) examine how fungi are influenced by depth in the peat profile, water table and plant functional group at the onset of a multiyear mesocosm experiment, and (ii) test if fungi are correlated with...
Portability of tag SNPs across isolated population groups: an example from India.
Sarkar Roy, N; Farheen, S; Roy, N; Sengupta, S; Majumder, P P
2008-01-01
Isolated population groups are useful in conducting association studies of complex diseases to avoid various pitfalls, including those arising from population stratification. Since DNA resequencing is expensive, it is recommended that genotyping be carried out at tagSNP (tSNP) loci. For this, tSNPs identified in one isolated population need to be used in another. Unless tSNPs are highly portable across populations this strategy may result in loss of information in association studies. We examined the issue of tSNP portability by sampling individuals from 10 isolated ethnic groups from India. We generated DNA resequencing data pertaining to 3 genomic regions and identified tSNPs in each population. We defined an index of tSNP portability and showed that portability is low across isolated Indian ethnic groups. The extent of portability did not significantly correlate with genetic similarity among the populations studied here. We also analyzed our data with sequence data from individuals of African and European descent. Our results indicated that it may be necessary to carry out resequencing in a small number of individuals to discover SNPs and identify tSNPs in the specific isolated population in which a disease association study is to be conducted.
Saeed, Isaam; Wong, Stephen Q.; Mar, Victoria; Goode, David L.; Caramia, Franco; Doig, Ken; Ryland, Georgina L.; Thompson, Ella R.; Hunter, Sally M.; Halgamuge, Saman K.; Ellul, Jason; Dobrovic, Alexander; Campbell, Ian G.; Papenfuss, Anthony T.; McArthur, Grant A.; Tothill, Richard W.
2014-01-01
Targeted resequencing by massively parallel sequencing has become an effective and affordable way to survey small to large portions of the genome for genetic variation. Despite the rapid development in open source software for analysis of such data, the practical implementation of these tools through construction of sequencing analysis pipelines still remains a challenging and laborious activity, and a major hurdle for many small research and clinical laboratories. We developed TREVA (Targeted REsequencing Virtual Appliance), making pre-built pipelines immediately available as a virtual appliance. Based on virtual machine technologies, TREVA is a solution for rapid and efficient deployment of complex bioinformatics pipelines to laboratories of all sizes, enabling reproducible results. The analyses that are supported in TREVA include: somatic and germline single-nucleotide and insertion/deletion variant calling, copy number analysis, and cohort-based analyses such as pathway and significantly mutated genes analyses. TREVA is flexible and easy to use, and can be customised by Linux-based extensions if required. TREVA can also be deployed on the cloud (cloud computing), enabling instant access without investment overheads for additional hardware. TREVA is available at http://bioinformatics.petermac.org/treva/. PMID:24752294
Warren, Liling L.; Li, Li; Nelson, Matthew R.; Ehm, Margaret G.; Shen, Judong; Fraser, Dana J.; Aponte, Jennifer L.; Nangle, Keith L.; Slater, Andrew J.; Woollard, Peter M.; Hall, Matt D.; Topp, Simon D.; Yuan, Xin; Cardon, Lon R.; Chissoe, Stephanie L.; Mooser, Vincent; Morris, Andrew D.; Palmer, Colin N.A.; Perry, John R.; Frayling, Timothy M.; Whittaker, John C.; Waterworth, Dawn M.
2012-01-01
Increased adiponectin levels have been shown to be associated with a lower risk of type 2 diabetes. To understand the relations between genetic variation at the adiponectin-encoding gene, ADIPOQ, and adiponectin levels, and subsequently its role in disease, we conducted a deep resequencing experiment of ADIPOQ in 14,002 subjects, including 12,514 Europeans, 594 African Americans, and 567 Indian Asians. We identified 296 single nucleotide polymorphisms (SNPs), including 30 amino acid changes, and carried out association analyses in a subset of 3,665 subjects from two independent studies. We confirmed multiple genome-wide association study findings and identified a novel association between a low-frequency SNP (rs17366653) and adiponectin levels (P = 2.2E–17). We show that seven SNPs exert independent effects on adiponectin levels. Together, they explained 6% of adiponectin variation in our samples. We subsequently assessed association between these SNPs and type 2 diabetes in the Genetics of Diabetes Audit and Research in Tayside Scotland (GO-DARTS) study, comprised of 5,145 case and 6,374 control subjects. No evidence of association with type 2 diabetes was found, but we were also unable to exclude the possibility of substantial effects (e.g., odds ratio 95% CI for rs7366653 [0.91–1.58]). Further investigation by large-scale and well-powered Mendelian randomization studies is warranted. PMID:22403302
Barrick, Jeffrey E; Colburn, Geoffrey; Deatherage, Daniel E; Traverse, Charles C; Strand, Matthew D; Borges, Jordan J; Knoester, David B; Reba, Aaron; Meyer, Austin G
2014-11-29
Mutations that alter chromosomal structure play critical roles in evolution and disease, including in the origin of new lifestyles and pathogenic traits in microbes. Large-scale rearrangements in genomes are often mediated by recombination events involving new or existing copies of mobile genetic elements, recently duplicated genes, or other repetitive sequences. Most current software programs for predicting structural variation from short-read DNA resequencing data are intended primarily for use on human genomes. They typically disregard information in reads mapping to repeat sequences, and significant post-processing and manual examination of their output is often required to rule out false-positive predictions and precisely describe mutational events. We have implemented an algorithm for identifying structural variation from DNA resequencing data as part of the breseq computational pipeline for predicting mutations in haploid microbial genomes. Our method evaluates the support for new sequence junctions present in a clonal sample from split-read alignments to a reference genome, including matches to repeat sequences. Then, it uses a statistical model of read coverage evenness to accept or reject these predictions. Finally, breseq combines predictions of new junctions and deleted chromosomal regions to output biologically relevant descriptions of mutations and their effects on genes. We demonstrate the performance of breseq on simulated Escherichia coli genomes with deletions generating unique breakpoint sequences, new insertions of mobile genetic elements, and deletions mediated by mobile elements. Then, we reanalyze data from an E. coli K-12 mutation accumulation evolution experiment in which structural variation was not previously identified. Transposon insertions and large-scale chromosomal changes detected by breseq account for ~25% of spontaneous mutations in this strain. In all cases, we find that breseq is able to reliably predict structural variation with modest read-depth coverage of the reference genome (>40-fold). Using breseq to predict structural variation should be useful for studies of microbial epidemiology, experimental evolution, synthetic biology, and genetics when a reference genome for a closely related strain is available. In these cases, breseq can discover mutations that may be responsible for important or unintended changes in genomes that might otherwise go undetected.
Verkoczy, L K; Berinstein, N L
1998-10-01
Differential display PCR (DD RT-PCR) has been extensively used for analysis of differential gene expression, but continues to be hampered by technical limitations that impair its effectiveness. In order to isolate novel genes co-expressing with human RAG1, we have developed an effective, multi-tiered screening/purification approach which effectively complements the standard DD RT-PCR methodology. In 'primary' screens, standard DD RT-PCR was used, detecting 22 reproducible differentially expressed amplicons between clonally related cell variants with differential constitutive expression of RAG mRNAs. 'Secondary' screens used differential display (DD) amplicons as probes in low and high stringency northern blotting. Eight of 22 independent DD amplicons detected nine independent differentially expressed transcripts. 'Tertiary' screens used reconfirmed amplicons as probes in northern analysis of multiple RAG-and RAG+sources. Reconfirmed DD amplicons detected six independent RAG co-expressing transcripts. All DD amplicons reconfirmed by northern blot were a heterogeneous mixture of cDNAs, necessitating further purification to isolate single cDNAs prior to subcloning and sequencing. To effectively select the appropriate cDNAs from DD amplicons, we excised and eluted the cDNA(s) directly from regions of prior northern blots in which differentially expressed transcripts were detected. Sequences of six purified cDNA clones specifically detecting RAG co-expressing transcripts included matches to portions of the human RAG2 and BSAP regions and to four novel partial cDNAs (three with homologies to human ESTs). Overall, our results also suggest that even when using clonally related variants from the same cell line in addition to all appropriate internal controls previously reported, further screening and purification steps are still required in order to efficiently and specifically isolate differentially expressed genes by DD RT-PCR.
Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc'h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine
2017-01-01
Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus's but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies.
Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc’h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine
2017-01-01
Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus’s but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies. PMID:28362878
BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS.
Fosso, Bruno; Santamaria, Monica; Marzano, Marinella; Alonso-Alemany, Daniel; Valiente, Gabriel; Donvito, Giacinto; Monaco, Alfonso; Notarangelo, Pasquale; Pesole, Graziano
2015-07-01
Substantial advances in microbiology, molecular evolution and biodiversity have been carried out in recent years thanks to Metagenomics, which allows to unveil the composition and functions of mixed microbial communities in any environmental niche. If the investigation is aimed only at the microbiome taxonomic structure, a target-based metagenomic approach, here also referred as Meta-barcoding, is generally applied. This approach commonly involves the selective amplification of a species-specific genetic marker (DNA meta-barcode) in the whole taxonomic range of interest and the exploration of its taxon-related variants through High-Throughput Sequencing (HTS) technologies. The accessibility to proper computational systems for the large-scale bioinformatic analysis of HTS data represents, currently, one of the major challenges in advanced Meta-barcoding projects. BioMaS (Bioinformatic analysis of Metagenomic AmpliconS) is a new bioinformatic pipeline designed to support biomolecular researchers involved in taxonomic studies of environmental microbial communities by a completely automated workflow, comprehensive of all the fundamental steps, from raw sequence data upload and cleaning to final taxonomic identification, that are absolutely required in an appropriately designed Meta-barcoding HTS-based experiment. In its current version, BioMaS allows the analysis of both bacterial and fungal environments starting directly from the raw sequencing data from either Roche 454 or Illumina HTS platforms, following two alternative paths, respectively. BioMaS is implemented into a public web service available at https://recasgateway.ba.infn.it/ and is also available in Galaxy at http://galaxy.cloud.ba.infn.it:8080 (only for Illumina data). BioMaS is a friendly pipeline for Meta-barcoding HTS data analysis specifically designed for users without particular computing skills. A comparative benchmark, carried out by using a simulated dataset suitably designed to broadly represent the currently known bacterial and fungal world, showed that BioMaS outperforms QIIME and MOTHUR in terms of extent and accuracy of deep taxonomic sequence assignments.
Shiba, Norio
2015-12-01
A new class of gene mutations, identified in the pathogenesis of adult acute myeloid leukemia (AML), includes DNMT3A, IDH1/2, TET2 and EZH2. However, these mutations are rare in pediatric AML cases, indicating that pathogeneses differ between adult and pediatric forms of AML. Meanwhile, the recent development of massively parallel sequencing technologies has provided a new opportunity to discover genetic changes across entire genomes or proteincoding sequences. In order to reveal a complete registry of gene mutations, we performed whole exome resequencing of paired tumor-normal specimens from 19 pediatric AML cases using Illumina HiSeq 2000. In total, 80 somatic mutations or 4.2 mutations per sample were identified. Many of the recurrent mutations identified in this study involved previously reported targets in AML, such as FLT3, CEBPA, KIT, CBL, NRAS, WT1 and EZH2. On the other hand, several genes were newly identified in the current study, including BCORL1 and major cohesin components such as SMC3 and RAD21. Whole exome resequencing revealed a complex array of gene mutations in pediatric AML genomes. Our results indicate that a subset of pediatric AML represents a discrete entity that could be discriminated from its adult counterpart, in terms of the spectrum of gene mutations.
Sato, Kengo; Kuroki, Yoko; Kumita, Wakako; Fujiyama, Asao; Toyoda, Atsushi; Kawai, Jun; Iriki, Atsushi; Sasaki, Erika; Okano, Hideyuki; Sakakibara, Yasubumi
2015-11-20
The first draft of the common marmoset (Callithrix jacchus) genome was published by the Marmoset Genome Sequencing and Analysis Consortium. The draft was based on whole-genome shotgun sequencing, and the current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions and supercontigs and relatively short contigs that are unmapped to chromosomes in the draft genome. We performed resequencing and assembly of the genome of common marmoset by deep sequencing with high-throughput sequencing technology. Several different sequence runs using Illumina sequencing platforms were executed, and 181 Gbp of high-quality bases including mate-pairs with long insert lengths of 3, 8, 20, and 40 Kbp were obtained, that is, approximately 60× coverage. The resequencing significantly improved the MGSAC draft genome sequence. The N50 of the contigs, which is a statistical measure used to evaluate assembly quality, doubled. As a result, 51% of the contigs (total length: 299 Mbp) that were unmapped to chromosomes in the MGSAC draft were merged with chromosomal contigs, and the improved genome sequence helped to detect 5,288 new genes that are homologous to human cDNAs and the gaps in 5,187 transcripts of the Ensembl gene annotations were completely filled.
Novel mutations in LRP6 highlight the role of WNT signaling in tooth agenesis
Ludwig, Kerstin U.; Sullivan, Robert; van Rooij, Iris A.L.M.; Thonissen, Michelle; Swinnen, Steven; Phan, Milien; Conte, Federica; Ishorst, Nina; Gilissen, Christian; RoaFuentes, Laury; van de Vorst, Maartje; Henkes, Arjen; Steehouwer, Marloes; van Beusekom, Ellen; Bloemen, Marjon; Vankeirsbilck, Bruno; Bergé, Stefaan; Hens, Greet; Schoenaers, Joseph; Poorten, Vincent Vander; Roosenboom, Jasmien; Verdonck, An; Devriendt, Koen; Roeleveldt, Nel; Jhangiani, Shalini N.; Vissers, Lisenka E.L.M.; Lupski, James R.; de Ligt, Joep; Von den Hoff, Johannes W.; Pfundt, Rolph; Brunner, Han G.; Zhou, Huiqing; Dixon, Jill; Mangold, Elisabeth; van Bokhoven, Hans; Dixon, Michael J.; Kleefstra, Tjitske
2016-01-01
Purpose Here we aimed to identify a novel genetic cause of tooth agenesis (TA) and/or orofacial clefting (OFC) by combining whole exome sequencing (WES) and targeted re-sequencing in a large cohort of TA and OFC patients. Methods WES was performed in two unrelated patients, one with severe TA and OFC and another with severe TA only. After identifying deleterious mutations in a gene encoding the low density lipoprotein receptor-related protein 6 (LRP6), all its exons were re-sequenced with molecular inversion probes, in 67 patients with TA, 1,072 patients with OFC and in 706 controls. Results We identified a frameshift (c.4594delG, p.Cys1532fs) and a canonical splice site mutation (c.3398-2A>C, p.?) in LRP6 respectively in the patient with TA and OFC, and in the patient with severe TA only. The targeted re-sequencing showed significant enrichment of unique LRP6 variants in TA patients, but not in nonsyndromic OFC. From the 5 variants in patients with TA, 2 affect the canonical splice site and 3 were missense variants; all variants segregated with the dominant phenotype and in 1 case the missense mutation occurred de novo. Conclusion Mutations in LRP6 cause tooth agenesis in man. PMID:26963285
Glioblastoma (GBM) is the most common primary brain tumor and has a dismal prognosis. Amplification of chromosome 12q13-q15 (Cyclin-dependent kinase 4 (CDK4) amplicon) is frequently observed in numerous human cancers including GBM. Phosphoinositide 3-kinase enhancer (PIKE) is a group of GTP-binding proteins that belong to the subgroup of centaurin GTPase family, encoded by CENTG1 located in CDK4 amplicon. However, the pathological significance of CDK4 amplicon in GBM formation remains incompletely understood.
Reproducibility and quantitation of amplicon sequencing-based detection
Zhou, Jizhong; Wu, Liyou; Deng, Ye; Zhi, Xiaoyang; Jiang, Yi-Huei; Tu, Qichao; Xie, Jianping; Van Nostrand, Joy D; He, Zhili; Yang, Yunfeng
2011-01-01
To determine the reproducibility and quantitation of the amplicon sequencing-based detection approach for analyzing microbial community structure, a total of 24 microbial communities from a long-term global change experimental site were examined. Genomic DNA obtained from each community was used to amplify 16S rRNA genes with two or three barcode tags as technical replicates in the presence of a small quantity (0.1% wt/wt) of genomic DNA from Shewanella oneidensis MR-1 as the control. The technical reproducibility of the amplicon sequencing-based detection approach is quite low, with an average operational taxonomic unit (OTU) overlap of 17.2%±2.3% between two technical replicates, and 8.2%±2.3% among three technical replicates, which is most likely due to problems associated with random sampling processes. Such variations in technical replicates could have substantial effects on estimating β-diversity but less on α-diversity. A high variation was also observed in the control across different samples (for example, 66.7-fold for the forward primer), suggesting that the amplicon sequencing-based detection approach could not be quantitative. In addition, various strategies were examined to improve the comparability of amplicon sequencing data, such as increasing biological replicates, and removing singleton sequences and less-representative OTUs across biological replicates. Finally, as expected, various statistical analyses with preprocessed experimental data revealed clear differences in the composition and structure of microbial communities between warming and non-warming, or between clipping and non-clipping. Taken together, these results suggest that amplicon sequencing-based detection is useful in analyzing microbial community structure even though it is not reproducible and quantitative. However, great caution should be taken in experimental design and data interpretation when the amplicon sequencing-based detection approach is used for quantitative analysis of the β-diversity of microbial communities. PMID:21346791
AmpliVar: mutation detection in high-throughput sequence from amplicon-based libraries.
Hsu, Arthur L; Kondrashova, Olga; Lunke, Sebastian; Love, Clare J; Meldrum, Cliff; Marquis-Nicholson, Renate; Corboy, Greg; Pham, Kym; Wakefield, Matthew; Waring, Paul M; Taylor, Graham R
2015-04-01
Conventional means of identifying variants in high-throughput sequencing align each read against a reference sequence, and then call variants at each position. Here, we demonstrate an orthogonal means of identifying sequence variation by grouping the reads as amplicons prior to any alignment. We used AmpliVar to make key-value hashes of sequence reads and group reads as individual amplicons using a table of flanking sequences. Low-abundance reads were removed according to a selectable threshold, and reads above this threshold were aligned as groups, rather than as individual reads, permitting the use of sensitive alignment tools. We show that this approach is more sensitive, more specific, and more computationally efficient than comparable methods for the analysis of amplicon-based high-throughput sequencing data. The method can be extended to enable alignment-free confirmation of variants seen in hybridization capture target-enrichment data. © 2015 WILEY PERIODICALS, INC.
Yuan, Yali; Wei, Shiqiang; Liu, Guangpeng; Xie, Shunbi; Chai, Yaqin; Yuan, Ruo
2014-02-06
In this study, we for the first time presented an efficient, accurate, rapid, simple and ultrasensitive detection system for small molecule ochratoxin A (OTA) by using the integration of loop-mediated isothermal amplification (LAMP) technique and subsequently direct readout of LAMP amplicons with a signal-on electrochemiluminescent (ECL) system. Firstly, the dsDNA composed by OTA aptamer and its capture DNA were immobilized on the electrode. After the target recognition, the OTA aptamer bond with target OTA and subsequently left off the electrode, which effectively decreased the immobilization amount of OTA aptamer on electrode. Then, the remaining OTA aptamers on the electrode served as inner primer to initiate the LAMP reaction. Interestingly, the LAMP amplification was detected by monitoring the intercalation of DNA-binding Ru(phen)3(2+) ECL indictors into newly formed amplicons with a set of integrated electrodes. The ECL indictor Ru(phen)3(2+) binding to amplicons caused the reduction of the ECL intensity due to the slow diffusion of Ru(phen)3(2+)-amplicons complex to the electrode surface. Therefore, the presence of more OTA was expected to lead to the release of more OTA aptamer, which meant less OTA aptamer remained on electrode for producing LAMP amplicons, resulting in less Ru(phen)3(2+) interlaced into the formed amplicons within a fixed Ru(phen)3(2+) amount with an obviously increased ECL signal input. As a result, a detection limit as low as 10 fM for OTA was achieved. The aptasensor also has good reproducibility and stability. Copyright © 2013 Elsevier B.V. All rights reserved.
Hor, Hyun; Francescatto, Ludmila; Bartesaghi, Luca; Ortega-Cubero, Sara; Kousi, Maria; Lorenzo-Betancor, Oswaldo; Jiménez-Jiménez, Felix J.; Gironell, Alexandre; Clarimón, Jordi; Drechsel, Oliver; Agúndez, José A. G.; Kenzelmann Broz, Daniela; Chiquet-Ehrismann, Ruth; Lleó, Alberto; Coria, Francisco; García-Martin, Elena; Alonso-Navarro, Hortensia; Martí, Maria J.; Kulisevsky, Jaume; Hor, Charlotte N.; Ossowski, Stephan; Chrast, Roman; Katsanis, Nicholas; Pastor, Pau; Estivill, Xavier
2015-01-01
Essential tremor (ET) is a common movement disorder with an estimated prevalence of 5% of the population aged over 65 years. In spite of intensive efforts, the genetic architecture of ET remains unknown. We used a combination of whole-exome sequencing and targeted resequencing in three ET families. In vitro and in vivo experiments in oligodendrocyte precursor cells and zebrafish were performed to test our findings. Whole-exome sequencing revealed a missense mutation in TENM4 segregating in an autosomal-dominant fashion in an ET family. Subsequent targeted resequencing of TENM4 led to the discovery of two novel missense mutations. Not only did these two mutations segregate with ET in two additional families, but we also observed significant over transmission of pathogenic TENM4 alleles across the three families. Consistent with a dominant mode of inheritance, in vitro analysis in oligodendrocyte precursor cells showed that mutant proteins mislocalize. Finally, expression of human mRNA harboring any of three patient mutations in zebrafish embryos induced defects in axon guidance, confirming a dominant-negative mode of action for these mutations. Our genetic and functional data, which is corroborated by the existence of a Tenm4 knockout mouse displaying an ET phenotype, implicates TENM4 in ET. Together with previous studies of TENM4 in model organisms, our studies intimate that processes regulating myelination in the central nervous system and axon guidance might be significant contributors to the genetic burden of this disorder. PMID:26188006
Tuskan, Gerry
2018-02-13
The U.S. Department of Energy Joint Genome Institute (JGI) invited scientists interested in the application of genomics to bioenergy and environmental issues, as well as all current and prospective users and collaborators, to attend the annual DOE JGI Genomics of Energy Environment Meeting held March 22-24, 2011 in Walnut Creek, Calif. The emphasis of this meeting was on the genomics of renewable energy strategies, carbon cycling, environmental gene discovery, and engineering of fuel-producing organisms. The meeting features presentations by leading scientists advancing these topics. Gerry Tuskan of Oak Ridge National Laboratory on Resequencing in Populus: Towards Genome Wide Association Genetics at the 6th annual Genomics of Energy Environment Meeting on March 23, 2011.
[Genotyping of the Chinese isolates of coltivirus].
Xu, Li-hong; Tao, San-ju; Cao, Yu-xi; Wang, Huan-qin; Yang, Dong-rong; He, Ying; Liu, Qin-zhi; Chen, Bo-quan
2003-12-01
To classify the Chinese isolates of Coltiviruses. Three sets of primers were selected among them two were specific to the 9th and 12th segments of subgroup B2, and one was for the 12th segment of subgroup B1-All the Chinese isolates of Coltivirus selected in the experiment were classified according to the lengths of different amplicons of the reverse transcriptase-polymerase Chain reaction (RT-PCR). The homogenicity of the nucleic acids of the isolates BJ95-75 and YN-6 was also compared with other Coltivirus strains belonging to subgroup B2. With the primers 12-854-S/12-B2-R, which were specific to the 12th segment of Coltivirus subgroup B2-850 bp amplicons were obtained from Beijing isolate BJ95-75 and all the Yunnan isolates such as YN-6, -67-1, -68-1, -69, -70-1, -70-2, -90, -92-2, -93 of Coltivirus 492 bp DNA fragments were also amplified from all of them with the segment 9th specific primers 9-JKT-S/9-JKT-R. However no positive results were obtained from Northeast isolates NE97-12, NE97-31 and control viruses YN-99(Orbivirus),YN-151-1(JEV) with the same two sets of primers. With 12-B1-S/12-B1R primers specific to the 12th segment of subgroup B1, no amplicons of right length were obtained from any of the Chinese isolates of Coltivirus and the control viruses. When compared the nucleic acid sequences of BJ95-75 and YN-6 with other Coltivirus strains such as Bannavirus, JKT6423, JKT6969, JKT7043, the amplicons from segment 12th of these two strains had more than 89.4% homology with the other strains, especially to the earlier Chinese isolate Bannavirus, the homolog was more then 98.9%. Nearly 96.5% and 99.2% of the nucleic acids of the amplicons from segment 9th of the two strains were being homologous to Bannavirus and about 84.0% to JKT6423, which had been classified into type B2a. But the maximal homogenicity was about 53% when compared with the other two coltivirus strains. JKT6969 and JKT7043 which had been classified into type B2b. Genotyping the recent Chinese isolates of coltivirus for the first time in our country. Most of the Chinese isolates belong to subgroup B2, more exactly type B2a. The Northeast isolates NE97-12 and NE97-31 were not correctly grouped with the available primers.
Biedrzycka, Aleksandra; Sebastian, Alvaro; Migalska, Magdalena; Westerdahl, Helena; Radwan, Jacek
2017-07-01
Characterization of highly duplicated genes, such as genes of the major histocompatibility complex (MHC), where multiple loci often co-amplify, has until recently been hindered by insufficient read depths per amplicon. Here, we used ultra-deep Illumina sequencing to resolve genotypes at exon 3 of MHC class I genes in the sedge warbler (Acrocephalus schoenobaenus). We sequenced 24 individuals in two replicates and used this data, as well as a simulated data set, to test the effect of amplicon coverage (range: 500-20 000 reads per amplicon) on the repeatability of genotyping using four different genotyping approaches. A third replicate employed unique barcoding to assess the extent of tag jumping, that is swapping of individual tag identifiers, which may confound genotyping. The reliability of MHC genotyping increased with coverage and approached or exceeded 90% within-method repeatability of allele calling at coverages of >5000 reads per amplicon. We found generally high agreement between genotyping methods, especially at high coverages. High reliability of the tested genotyping approaches was further supported by our analysis of the simulated data set, although the genotyping approach relying primarily on replication of variants in independent amplicons proved sensitive to repeatable errors. According to the most repeatable genotyping method, the number of co-amplifying variants per individual ranged from 19 to 42. Tag jumping was detectable, but at such low frequencies that it did not affect the reliability of genotyping. We thus demonstrate that gene families with many co-amplifying genes can be reliably genotyped using HTS, provided that there is sufficient per amplicon coverage. © 2016 John Wiley & Sons Ltd.
Huang, Xin; Gollin, Susanne M.; Raja, Siva; Godfrey, Tony E.
2002-01-01
Amplification of chromosomal band 11q13 is a common event in human cancer. It has been reported in about 45% of head and neck carcinomas and in other cancers including esophageal, breast, liver, lung, and bladder cancer. To understand the mechanism of 11q13 amplification and to identify the potential oncogene(s) driving it, we have fine-mapped the structure of the amplicon in oral squamous cell carcinoma cell lines and localized the proximal and distal breakpoints. A 5-Mb physical map of the region has been prepared from which sequence is available. We quantified copy number of sequence-tagged site markers at 42–550 kb intervals along the length of the amplicon and defined the amplicon core and breakpoints by using TaqMan-based quantitative microsatellite analysis. The core of the amplicon maps to a 1.5-Mb region. The proximal breakpoint localizes to two intervals between sequence-tagged site markers, 550 kb and 160 kb in size, and the distal breakpoint maps to a 250 kb interval. The cyclin D1 gene maps to the amplicon core, as do two new expressed sequence tag clusters. We have analyzed one of these expressed sequence tag clusters and now report that it contains a previously uncharacterized gene, TAOS1 (tumor amplified and overexpressed sequence 1), which is both amplified and overexpressed in oral cancer cells. The data suggest that TAOS1 may be an amplification-dependent candidate oncogene with a role in the development and/or progression of human tumors, including oral squamous cell carcinomas. The approach described here should be useful for characterizing amplified genomic regions in a wide variety of tumors. PMID:12172009
Sørensen, Maria Rathmann; Ilsøe, Mette; Strube, Mikael Lenz; Bishop, Richard; Erbs, Gitte; Hartmann, Sofie Bruun; Jungersen, Gregers
2017-01-01
The need for typing of the swine leukocyte antigen (SLA) is increasing with the expanded use of pigs as models for human diseases and organ-transplantation experiments, their use in infection studies, and for design of veterinary vaccines. Knowledge of SLA sequences is furthermore a prerequisite for the prediction of epitope binding in pigs. The low number of known SLA class I alleles and the limited knowledge of their prevalence in different pig breeds emphasizes the need for efficient SLA typing methods. This study utilizes an SLA class I-typing method based on next-generation sequencing of barcoded PCR amplicons. The amplicons were generated with universal primers and predicted to resolve 68-88% of all known SLA class I alleles dependent on amplicon size. We analyzed the SLA profiles of 72 pigs from four different pig populations; Göttingen minipigs and Belgian, Kenyan, and Danish fattening pigs. We identified 67 alleles, nine previously described haplotypes and 15 novel haplotypes. The highest variation in SLA class I profiles was observed in the Danish pigs and the lowest among the Göttingen minipig population, which also have the highest percentage of homozygote individuals. Highlighting the fact that there are still numerous unknown SLA class I alleles to be discovered, a total of 12 novel SLA class I alleles were identified. Overall, we present new information about known and novel alleles and haplotypes and their prevalence in the tested pig populations.
Hulse-Kemp, Amanda M.; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A.; Scheffler, Brian E.; Fang, David D.; Chen, Z. Jeffrey; Van Deynze, Allen; Stelly, David M.
2015-01-01
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. PMID:25858960
Liu, Shi; Gao, Peng; Zhu, Qianglong; Luan, Feishi; Davis, Angela R.; Wang, Xiaolu
2016-01-01
Cleaved amplified polymorphic sequence (CAPS) markers are useful tools for detecting single nucleotide polymorphisms (SNPs). This study detected and converted SNP sites into CAPS markers based on high-throughput re-sequencing data in watermelon, for linkage map construction and quantitative trait locus (QTL) analysis. Two inbred lines, Cream of Saskatchewan (COS) and LSW-177 had been re-sequenced and analyzed by Perl self-compiled script for CAPS marker development. 88.7% and 78.5% of the assembled sequences of the two parental materials could map to the reference watermelon genome, respectively. Comparative assembled genome data analysis provided 225,693 and 19,268 SNPs and indels between the two materials. 532 pairs of CAPS markers were designed with 16 restriction enzymes, among which 271 pairs of primers gave distinct bands of the expected length and polymorphic bands, via PCR and enzyme digestion, with a polymorphic rate of 50.94%. Using the new CAPS markers, an initial CAPS-based genetic linkage map was constructed with the F2 population, spanning 1836.51 cM with 11 linkage groups and 301 markers. 12 QTLs were detected related to fruit flesh color, length, width, shape index, and brix content. These newly CAPS markers will be a valuable resource for breeding programs and genetic studies of watermelon. PMID:27162496
Trujillano, D; Ramos, M D; González, J; Tornador, C; Sotillo, F; Escaramis, G; Ossowski, S; Armengol, L; Casals, T; Estivill, X
2013-07-01
Here we have developed a novel and much more efficient strategy for the complete molecular characterisation of the cystic fibrosis (CF) transmembrane regulator (CFTR) gene, based on multiplexed targeted resequencing. We have tested this approach in a cohort of 92 samples with previously characterised CFTR mutations and polymorphisms. After enrichment of the pooled barcoded DNA libraries with a custom NimbleGen SeqCap EZ Choice array (Roche) and sequencing with a HiSeq2000 (Illumina) sequencer, we applied several bioinformatics tools to call mutations and polymorphisms in CFTR. The combination of several bioinformatics tools allowed us to detect all known pathogenic variants (point mutations, short insertions/deletions, and large genomic rearrangements) and polymorphisms (including the poly-T and poly-thymidine-guanine polymorphic tracts) in the 92 samples. In addition, we report the precise characterisation of the breakpoints of seven genomic rearrangements in CFTR, including those of a novel deletion of exon 22 and a complex 85 kb inversion which includes two large deletions affecting exons 4-8 and 12-21, respectively. This work is a proof-of-principle that targeted resequencing is an accurate and cost-effective approach for the genetic testing of CF and CFTR-related disorders (ie, male infertility) amenable to the routine clinical practice, and ready to substitute classical molecular methods in medical genetics.
Nucleic acid sequence detection using multiplexed oligonucleotide PCR
Nolan, John P [Santa Fe, NM; White, P Scott [Los Alamos, NM
2006-12-26
Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.
Zepeda-Mendoza, Marie Lisandra; Bohmann, Kristine; Carmona Baez, Aldo; Gilbert, M Thomas P
2016-05-03
DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5'-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way. We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe. DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying unused tag combinations, (iii) evaluate the comparability of PCR replicates of the same sample, and (iv) filter tagged amplicons from a number of PCR replicates using parameters of minimum length, copy number, and reproducibility across the PCR replicates. This enables an efficient analysis of complex datasets, and ultimately increases the ease of handling datasets from large-scale studies.
Pyle, Angela; Hudson, Gavin; Wilson, Ian J; Coxhead, Jonathan; Smertenko, Tania; Herbert, Mary; Santibanez-Koref, Mauro; Chinnery, Patrick F
2015-05-01
Recent reports have questioned the accepted dogma that mammalian mitochondrial DNA (mtDNA) is strictly maternally inherited. In humans, the argument hinges on detecting a signature of inter-molecular recombination in mtDNA sequences sampled at the population level, inferring a paternal source for the mixed haplotypes. However, interpreting these data is fraught with difficulty, and direct experimental evidence is lacking. Using extreme-high depth mtDNA re-sequencing up to ~1.2 million-fold coverage, we find no evidence that paternal mtDNA haplotypes are transmitted to offspring in humans, thus excluding a simple dilution mechanism for uniparental transmission of mtDNA present in all healthy individuals. Our findings indicate that an active mechanism eliminates paternal mtDNA which likely acts at the molecular level.
Pyle, Angela; Hudson, Gavin; Wilson, Ian J.; Coxhead, Jonathan; Smertenko, Tania; Herbert, Mary; Santibanez-Koref, Mauro; Chinnery, Patrick F.
2015-01-01
Recent reports have questioned the accepted dogma that mammalian mitochondrial DNA (mtDNA) is strictly maternally inherited. In humans, the argument hinges on detecting a signature of inter-molecular recombination in mtDNA sequences sampled at the population level, inferring a paternal source for the mixed haplotypes. However, interpreting these data is fraught with difficulty, and direct experimental evidence is lacking. Using extreme-high depth mtDNA re-sequencing up to ~1.2 million-fold coverage, we find no evidence that paternal mtDNA haplotypes are transmitted to offspring in humans, thus excluding a simple dilution mechanism for uniparental transmission of mtDNA present in all healthy individuals. Our findings indicate that an active mechanism eliminates paternal mtDNA which likely acts at the molecular level. PMID:25973765
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tuskan, Gerry
The U.S. Department of Energy Joint Genome Institute (JGI) invited scientists interested in the application of genomics to bioenergy and environmental issues, as well as all current and prospective users and collaborators, to attend the annual DOE JGI Genomics of Energy Environment Meeting held March 22-24, 2011 in Walnut Creek, Calif. The emphasis of this meeting was on the genomics of renewable energy strategies, carbon cycling, environmental gene discovery, and engineering of fuel-producing organisms. The meeting features presentations by leading scientists advancing these topics. Gerry Tuskan of Oak Ridge National Laboratory on Resequencing in Populus: Towards Genome Wide Association Geneticsmore » at the 6th annual Genomics of Energy Environment Meeting on March 23, 2011.« less
Characterizing partial AZFc deletions of the Y chromosome with amplicon-specific sequence markers
Navarro-Costa, Paulo; Pereira, Luísa; Alves, Cíntia; Gusmão, Leonor; Proença, Carmen; Marques-Vidal, Pedro; Rocha, Tiago; Correia, Sónia C; Jorge, Sónia; Neves, António; Soares, Ana P; Nunes, Joaquim; Calhaz-Jorge, Carlos; Amorim, António; Plancha, Carlos E; Gonçalves, João
2007-01-01
Background The AZFc region of the human Y chromosome is a highly recombinogenic locus containing multi-copy male fertility genes located in repeated DNA blocks (amplicons). These AZFc gene families exhibit slight sequence variations between copies which are considered to have functional relevance. Yet, partial AZFc deletions yield phenotypes ranging from normospermia to azoospermia, thwarting definite conclusions on their real impact on fertility. Results The amplicon content of partial AZFc deletion products was characterized with novel amplicon-specific sequence markers. Data indicate that partial AZFc deletions are a male infertility risk [odds ratio: 5.6 (95% CI: 1.6–30.1)] and although high diversity of partial deletion products and sequence conversion profiles were recorded, the AZFc marker profiles detected in fertile men were also observed in infertile men. Additionally, the assessment of rearrangement recurrence by Y-lineage analysis indicated that while partial AZFc deletions occurred in highly diverse samples, haplotype diversity was minimal in fertile men sharing identical marker profiles. Conclusion Although partial AZFc deletion products are highly heterogeneous in terms of amplicon content, this plasticity is not sufficient to account for the observed phenotypical variance. The lack of causative association between the deletion of specific gene copies and infertility suggests that AZFc gene content might be part of a multifactorial network, with Y-lineage evolution emerging as a possible phenotype modulator. PMID:17903263
Wilhelm, Roland C; Cardenas, Erick; Leung, Hilary; Maas, Kendra; Hartmann, Martin; Hahn, Aria; Hallam, Steven; Mohn, William W
2017-01-01
The scarcity of long-term data on soil microbial communities in the decades following timber harvesting limits current understanding of the ecological problems associated with maintaining the productivity of managed forests. The high complexity of soil communities and the heterogeneity of forest and soil necessitates a comprehensive approach to understand the role of microbial processes in managed forest ecosystems. Here, we describe a curated collection of well replicated, multi-faceted data from eighteen reforested sites in six different North American ecozones within the Long-term Soil Productivity (LTSP) Study, without detailed analysis of results or discussion. The experiments were designed to contrast microbial community composition and function among forest soils from harvested treatment plots with varying intensities of organic matter removal. The collection includes 724 bacterial (16S) and 658 fungal (ITS2) amplicon libraries, 133 shotgun metagenomic libraries as well as stable isotope probing amplicon libraries capturing the effects of harvesting on hemicellulolytic and cellulolytic populations. This collection serves as a foundation for the LTSP Study and other studies of the ecology of forest soil and forest disturbance.
A metagenomic survey of forest soil microbial communities more than a decade after timber harvesting
Wilhelm, Roland C.; Cardenas, Erick; Leung, Hilary; Maas, Kendra; Hartmann, Martin; Hahn, Aria; Hallam, Steven; Mohn, William W.
2017-01-01
The scarcity of long-term data on soil microbial communities in the decades following timber harvesting limits current understanding of the ecological problems associated with maintaining the productivity of managed forests. The high complexity of soil communities and the heterogeneity of forest and soil necessitates a comprehensive approach to understand the role of microbial processes in managed forest ecosystems. Here, we describe a curated collection of well replicated, multi-faceted data from eighteen reforested sites in six different North American ecozones within the Long-term Soil Productivity (LTSP) Study, without detailed analysis of results or discussion. The experiments were designed to contrast microbial community composition and function among forest soils from harvested treatment plots with varying intensities of organic matter removal. The collection includes 724 bacterial (16S) and 658 fungal (ITS2) amplicon libraries, 133 shotgun metagenomic libraries as well as stable isotope probing amplicon libraries capturing the effects of harvesting on hemicellulolytic and cellulolytic populations. This collection serves as a foundation for the LTSP Study and other studies of the ecology of forest soil and forest disturbance. PMID:28765786
Novel gene C17orf37 in Prostate Cancer Progression and Metastasis: A Prospective Biomarker
2010-05-01
15. SUBJECT TERMS 16. SECURITY CLASSIFICATION OF: 17. LIMITATION OF ABSTRACT 18 . NUMBER OF PAGES 19a. NAME OF RESPONSIBLE PERSON a...UPA and VEGF. These results were reported in an original article. Ongoing experiments: Currently we are performing in vivo xenograft studies in...Activation of multiple cancer- associated genes at the ERBB2 amplicon in breast cancer. Endocr Relat Cancer 13: 39. Kelly P, Stemmle LN , Madden JF
PGen: large-scale genomic variations analysis workflow and browser in SoyKB.
Liu, Yang; Khan, Saad M; Wang, Juexin; Rynge, Mats; Zhang, Yuanxun; Zeng, Shuai; Chen, Shiyuan; Maldonado Dos Santos, Joao V; Valliyodan, Babu; Calyam, Prasad P; Merchant, Nirav; Nguyen, Henry T; Xu, Dong; Joshi, Trupti
2016-10-06
With the advances in next-generation sequencing (NGS) technology and significant reductions in sequencing costs, it is now possible to sequence large collections of germplasm in crops for detecting genome-scale genetic variations and to apply the knowledge towards improvements in traits. To efficiently facilitate large-scale NGS resequencing data analysis of genomic variations, we have developed "PGen", an integrated and optimized workflow using the Extreme Science and Engineering Discovery Environment (XSEDE) high-performance computing (HPC) virtual system, iPlant cloud data storage resources and Pegasus workflow management system (Pegasus-WMS). The workflow allows users to identify single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), perform SNP annotations and conduct copy number variation analyses on multiple resequencing datasets in a user-friendly and seamless way. We have developed both a Linux version in GitHub ( https://github.com/pegasus-isi/PGen-GenomicVariations-Workflow ) and a web-based implementation of the PGen workflow integrated within the Soybean Knowledge Base (SoyKB), ( http://soykb.org/Pegasus/index.php ). Using PGen, we identified 10,218,140 single-nucleotide polymorphisms (SNPs) and 1,398,982 indels from analysis of 106 soybean lines sequenced at 15X coverage. 297,245 non-synonymous SNPs and 3330 copy number variation (CNV) regions were identified from this analysis. SNPs identified using PGen from additional soybean resequencing projects adding to 500+ soybean germplasm lines in total have been integrated. These SNPs are being utilized for trait improvement using genotype to phenotype prediction approaches developed in-house. In order to browse and access NGS data easily, we have also developed an NGS resequencing data browser ( http://soykb.org/NGS_Resequence/NGS_index.php ) within SoyKB to provide easy access to SNP and downstream analysis results for soybean researchers. PGen workflow has been optimized for the most efficient analysis of soybean data using thorough testing and validation. This research serves as an example of best practices for development of genomics data analysis workflows by integrating remote HPC resources and efficient data management with ease of use for biological users. PGen workflow can also be easily customized for analysis of data in other species.
Valtcheva, Nadejda; Lang, Franziska M; Noske, Aurelia; Samartzis, Eleftherios P; Schmidt, Anna-Maria; Bellini, Elisa; Fink, Daniel; Moch, Holger; Rechsteiner, Markus; Dedes, Konstantin J; Wild, Peter J
2017-01-19
Endometrioid adenocarcinoma of the uterus and ovarian endometrioid carcinoma share many morphological and molecular features. Differentiation between simultaneous primary carcinomas and ovarian metastases of an endometrial cancer may be very challenging but is essential for prognostic and therapeutic considerations. In the present case study of a 33 year-old patient we used targeted amplicon next-generation re-sequencing for clarifying the origin of synchronous endometrioid cancer of the corpus uteri and the left ovary. The patient developed a metachronous lung metastasis of an endometrioid adenocarcinoma four years after hyster- and adnexectomy, vaginal brachytherapy and treatment with the synthetic steroid tibolone. Removal of the metastasis and megestrol treatment for seven years led to a complete remission. A total of 409 genes from the Ampliseq Comprehensive Cancer Panel (Ion Torrent, Thermo Fisher) were analysed by next generation sequencing and mutations in 10 genes, including ARID1A, CTNNB1, PIK3CA and PTEN were identified and confirmed by Sanger sequencing. Primary endometrial as well as ovarian cancer showed an identical mutational profile, suggesting the presence of an ovarian metastasis of the endometrial cancer, rather than a simultaneous endometrial and ovarian cancer. The metachronous lung metastasis showed a different mutational profile compared to the primary cancer. Immunohistochemical staining of the corresponding proteins suggested that the tumour development was driven by alterations in the protein function rather than by changes of the protein abundance in the cell. Our results have demonstrated next generation sequencing as a valuable tool in the differentiation of synchronous primary tumours and metastases, which has an important impact on the clinical decision making process. Similar to breast cancer, targeted therapies based on mutational tumour profiling will become increasingly important in endometrial and ovarian cancer. In summary, our results support the usage of next generation sequencing as a supplementary diagnostic tool, assisting in personalized precision medicine.
Péterfia, Bálint; Kalmár, Alexandra; Patai, Árpád V; Csabai, István; Bodor, András; Micsik, Tamás; Wichmann, Barnabás; Egedi, Krisztina; Hollósi, Péter; Kovalszky, Ilona; Tulassay, Zsolt; Molnár, Béla
2017-01-01
Background: To support cancer therapy, development of low cost library preparation techniques for targeted next generation sequencing (NGS) is needed. In this study we designed and tested a PCR-based library preparation panel with limited target area for sequencing the top 12 somatic mutation hot spots in colorectal cancer on the GS Junior instrument. Materials and Methods: A multiplex PCR panel was designed to amplify regions of mutation hot spots in 12 selected genes ( APC, BRAF, CTNNB1, EGFR, FBXW7, KRAS, NRAS, MSH6, PIK3CA, SMAD2, SMAD4, TP53 ). Amplicons were sequenced on a GS Junior instrument using ligated and barcoded adaptors. Eight samples were sequenced in a single run. Colonic DNA samples (8 normal mucosa; 33 adenomas; 17 adenocarcinomas) as well as HT-29 and Caco-2 cell lines with known mutation profiles were analyzed. Variants found by the panel on APC, BRAF, KRAS and NRAS genes were validated by conventional sequencing. Results: In total, 34 kinds of mutations were detected including two novel mutations ( FBXW7 c.1740:C>G and SMAD4 c.413C>G) that have not been recorded in mutation databases, and one potential germline mutation ( APC ). The most frequently mutated genes were APC, TP53 and KRAS with 30%, 15% and 21% frequencies in adenomas and 29%, 53% and 29% frequencies in carcinomas, respectively. In cell lines, all the expected mutations were detected except for one located in a homopolymer region. According to re-sequencing results sensitivity and specificity was 100% and 92% respectively. Conclusions: Our NGS-based screening panel denotes a promising step towards low cost colorectal cancer genotyping on the GS Junior instrument. Despite the relatively low coverage, we discovered two novel mutations and obtained mutation frequencies comparable to literature data. Additionally, as an advantage, this panel requires less template DNA than sequence capture colon cancer panels currently available for the GS Junior instrument.
Next-Generation Genomics Facility at C-CAMP: Accelerating Genomic Research in India
S, Chandana; Russiachand, Heikham; H, Pradeep; S, Shilpa; M, Ashwini; S, Sahana; B, Jayanth; Atla, Goutham; Jain, Smita; Arunkumar, Nandini; Gowda, Malali
2014-01-01
Next-Generation Sequencing (NGS; http://www.genome.gov/12513162) is a recent life-sciences technological revolution that allows scientists to decode genomes or transcriptomes at a much faster rate with a lower cost. Genomic-based studies are in a relatively slow pace in India due to the non-availability of genomics experts, trained personnel and dedicated service providers. Using NGS there is a lot of potential to study India's national diversity (of all kinds). We at the Centre for Cellular and Molecular Platforms (C-CAMP) have launched the Next Generation Genomics Facility (NGGF) to provide genomics service to scientists, to train researchers and also work on national and international genomic projects. We have HiSeq1000 from Illumina and GS-FLX Plus from Roche454. The long reads from GS FLX Plus, and high sequence depth from HiSeq1000, are the best and ideal hybrid approaches for de novo and re-sequencing of genomes and transcriptomes. At our facility, we have sequenced around 70 different organisms comprising of more than 388 genomes and 615 transcriptomes – prokaryotes and eukaryotes (fungi, plants and animals). In addition we have optimized other unique applications such as small RNA (miRNA, siRNA etc), long Mate-pair sequencing (2 to 20 Kb), Coding sequences (Exome), Methylome (ChIP-Seq), Restriction Mapping (RAD-Seq), Human Leukocyte Antigen (HLA) typing, mixed genomes (metagenomes) and target amplicons, etc. Translating DNA sequence data from NGS sequencer into meaningful information is an important exercise. Under NGGF, we have bioinformatics experts and high-end computing resources to dissect NGS data such as genome assembly and annotation, gene expression, target enrichment, variant calling (SSR or SNP), comparative analysis etc. Our services (sequencing and bioinformatics) have been utilized by more than 45 organizations (academia and industry) both within India and outside, resulting several publications in peer-reviewed journals and several genomic/transcriptomic data is available at NCBI.
2010-01-01
Background Mitochondria are a valuable resource for studying the evolutionary process and deducing phylogeny. A few mitochondria genomes have been sequenced, but a comprehensive picture of the domestication event for silkworm mitochondria remains to be established. In this study, we integrate the extant data, and perform a whole genome resequencing of Japanese wild silkworm to obtain breakthrough results in silkworm mitochondrial (mt) population, and finally use these to deduce a more comprehensive phylogeny of the Bombycidae. Results We identified 347 single nucleotide polymorphisms (SNPs) in the mt genome, but found no past recombination event to have occurred in the silkworm progenitor. A phylogeny inferred from these whole genome SNPs resulted in a well-classified tree, confirming that the domesticated silkworm, Bombyx mori, most recently diverged from the Chinese wild silkworm, rather than from the Japanese wild silkworm. We showed that the population sizes of the domesticated and Chinese wild silkworms both experience neither expansion nor contraction. We also discovered that one mt gene, named cytochrome b, shows a strong signal of positive selection in the domesticated clade. This gene is related to energy metabolism, and may have played an important role during silkworm domestication. Conclusions We present a comparative analysis on 41 mt genomes of B. mori and B. mandarina from China and Japan. With these, we obtain a much clearer picture of the evolution history of the silkworm. The data and analyses presented here aid our understanding of the silkworm in general, and provide a crucial insight into silkworm phylogeny. PMID:20334646
Hor, Hyun; Francescatto, Ludmila; Bartesaghi, Luca; Ortega-Cubero, Sara; Kousi, Maria; Lorenzo-Betancor, Oswaldo; Jiménez-Jiménez, Felix J; Gironell, Alexandre; Clarimón, Jordi; Drechsel, Oliver; Agúndez, José A G; Kenzelmann Broz, Daniela; Chiquet-Ehrismann, Ruth; Lleó, Alberto; Coria, Francisco; García-Martin, Elena; Alonso-Navarro, Hortensia; Martí, Maria J; Kulisevsky, Jaume; Hor, Charlotte N; Ossowski, Stephan; Chrast, Roman; Katsanis, Nicholas; Pastor, Pau; Estivill, Xavier
2015-10-15
Essential tremor (ET) is a common movement disorder with an estimated prevalence of 5% of the population aged over 65 years. In spite of intensive efforts, the genetic architecture of ET remains unknown. We used a combination of whole-exome sequencing and targeted resequencing in three ET families. In vitro and in vivo experiments in oligodendrocyte precursor cells and zebrafish were performed to test our findings. Whole-exome sequencing revealed a missense mutation in TENM4 segregating in an autosomal-dominant fashion in an ET family. Subsequent targeted resequencing of TENM4 led to the discovery of two novel missense mutations. Not only did these two mutations segregate with ET in two additional families, but we also observed significant over transmission of pathogenic TENM4 alleles across the three families. Consistent with a dominant mode of inheritance, in vitro analysis in oligodendrocyte precursor cells showed that mutant proteins mislocalize. Finally, expression of human mRNA harboring any of three patient mutations in zebrafish embryos induced defects in axon guidance, confirming a dominant-negative mode of action for these mutations. Our genetic and functional data, which is corroborated by the existence of a Tenm4 knockout mouse displaying an ET phenotype, implicates TENM4 in ET. Together with previous studies of TENM4 in model organisms, our studies intimate that processes regulating myelination in the central nervous system and axon guidance might be significant contributors to the genetic burden of this disorder. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Ancient homology underlies adaptive mimetic diversity across butterflies
Gallant, Jason R.; Imhoff, Vance E.; Martin, Arnaud; Savage, Wesley K.; Chamberlain, Nicola L.; Pote, Ben L.; Peterson, Chelsea; Smith, Gabriella E.; Evans, Benjamin; Reed, Robert D.; Kronforst, Marcus R.; Mullen, Sean P.
2014-01-01
Convergent evolution provides a rare, natural experiment with which to test the predictability of adaptation at the molecular level. Little is known about the molecular basis of convergence over macro-evolutionary timescales. Here we use a combination of positional cloning, population genomic resequencing, association mapping and developmental data to demonstrate that positionally orthologous nucleotide variants in the upstream region of the same gene, WntA, are responsible for parallel mimetic variation in two butterfly lineages that diverged >65 million years ago. Furthermore, characterization of spatial patterns of WntA expression during development suggests that alternative regulatory mechanisms underlie wing pattern variation in each system. Taken together, our results reveal a strikingly predictable molecular basis for phenotypic convergence over deep evolutionary time. PMID:25198507
USDA-ARS?s Scientific Manuscript database
The dynamics of microbial communities associated with dying cover crops are of interest because of potential impacts on disease in a subsequent crop, and because of the importance of microbial activity on plant residue to soil organic matter dynamics and nutrient cycling. High throughput amplicon se...
USDA-ARS?s Scientific Manuscript database
The current study evaluates the potential of using high resolution DNA melting assays to discriminate species in the genus, Isaria. The study utilizes a previously identified 103 base pair PCR amplicon, which was reported to be selective for Isaria fumosorosea. Our study finds the amplicon selective...
Induction of humoral responses to BHV-1 glycoprotein D expressed by HSV-1 amplicon vectors
Blanc, Andrea Maria; Berois, Mabel Beatriz; Tomé, Lorena Magalí; Epstein, Alberto L.
2012-01-01
Herpes simplex virus type-1 (HSV-1) amplicon vectors are versatile and useful tools for transferring genes into cells that are capable of stimulating a specific immune response to their expressed antigens. In this work, two HSV-1-derived amplicon vectors were generated. One of these expressed the full-length glycoprotein D (gD) of bovine herpesvirus 1 while the second expressed the truncated form of gD (gDtr) which lacked the trans-membrane region. After evaluating gD expression in the infected cells, the ability of both vectors to induce a specific gD immune response was tested in BALB/c mice that were intramuscularly immunized. Specific serum antibody responses were detected in mice inoculated with both vectors, and the response against truncated gD was higher than the response against full-length gD. These results reinforce previous findings that HSV-1 amplicon vectors can potentially deliver antigens to animals and highlight the prospective use of these vectors for treating infectious bovine rhinotracheitis disease. PMID:22437537
Microbes in deep marine sediments viewed through amplicon sequencing and metagenomics
NASA Astrophysics Data System (ADS)
Biddle, J.; Leon, Z. R.; Russell, J. A., III; Martino, A. J.
2016-12-01
Nearly twenty percent of microbial biomass on Earth can be found in the marine subsurface. The majority of this is concentrated on continental margins, which have been investigated by scientific drilling. On the Costa Rica Margin, Iberian Margin and Peru Margins, sediment samples have been investigated through DNA extraction followed by amplicon and metagenomic sequencing. Overall samples show a high degree of microbial diversity, including many lineages of newly defined groups. In this talk, metagenome assembled genomes of unusual lineages will be presented, including their relationships to shallower relatives. From Costa Rica, in particular, we have retrieved deep relatives of Lokiarchaeota and Thorarchaeota, as well as other deeply branching archaeal relatives. We discuss their genome similarities to both other archaea and eukaryotes. From the Iberian Margin, relatives of Atribacteria and Aerophobetes will be discussed. Finally, we will detail the knowledge lost or gained depending on whether samples are studied via amplicon sequencing or total metagenomics, as studies in other environments have shown that up to 15% of microbial diversity is ignored when samples are studied via amplicon sequencing alone.
Partial and Full PCR-Based Reverse Genetics Strategy for Influenza Viruses
Chen, Hongjun; Ye, Jianqiang; Xu, Kemin; Angel, Matthew; Shao, Hongxia; Ferrero, Andrea; Sutton, Troy; Perez, Daniel R.
2012-01-01
Since 1999, plasmid-based reverse genetics (RG) systems have revolutionized the way influenza viruses are studied. However, it is not unusual to encounter cloning difficulties for one or more influenza genes while attempting to recover virus de novo. To overcome some of these shortcomings we sought to develop partial or full plasmid-free RG systems. The influenza gene of choice is assembled into a RG competent unit by virtue of overlapping PCR reactions containing a cDNA copy of the viral gene segment under the control of RNA polymerase I promoter (pol1) and termination (t1) signals – herein referred to as Flu PCR amplicons. Transfection of tissue culture cells with either HA or NA Flu PCR amplicons and 7 plasmids encoding the remaining influenza RG units, resulted in efficient virus rescue. Likewise, transfections including both HA and NA Flu PCR amplicons and 6 RG plasmids also resulted in efficient virus rescue. In addition, influenza viruses were recovered from a full set of Flu PCR amplicons without the use of plasmids. PMID:23029501
Shi, Jian; Yuan, Meng; Wang, Zhan-Dong; Xu, Xiao-Li; Hong, Lei; Sun, Shenglin
2017-02-01
The carcinogenesis of non-small cell lung carcinoma has been found to associate with activating and resistant mutations in the tyrosine kinase domain of specific oncogenes. Here, we assessed the type, frequency, and abundance of epithelial growth factor receptor, KRAS, BRAF, and ALK mutations in 154 non-small cell lung carcinoma specimens using single-molecule amplification and re-sequencing technology. We found that epithelial growth factor receptor mutations were the most prevalent (44.2%), followed by KRAS (18.8%), ALK (7.8%), and BRAF (5.8%) mutations. The type and abundance of the mutations in tumor specimens appeared to be heterogeneous. Thus, we conclude that identification of clinically significant oncogenic mutations may improve the classification of patients and provide valuable information for determination of the therapeutic strategies.
Hulse-Kemp, Amanda M; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A; Scheffler, Brian E; Fang, David D; Chen, Z Jeffrey; Van Deynze, Allen; Stelly, David M
2015-04-09
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. Copyright © 2015 Hulse-Kemp et al.
Wang, Zheng; Malanoski, Anthony P; Lin, Baochuan; Kidd, Carolyn; Long, Nina C; Blaney, Kate M; Thach, Dzung C; Tibbetts, Clark; Stenger, David A
2008-01-01
Background Febrile respiratory illness (FRI) has a high impact on public health and global economics and poses a difficult challenge for differential diagnosis. A particular issue is the detection of genetically diverse pathogens, i.e. human rhinoviruses (HRV) and enteroviruses (HEV) which are frequent causes of FRI. Resequencing Pathogen Microarray technology has demonstrated potential for differential diagnosis of several respiratory pathogens simultaneously, but a high confidence design method to select probes for genetically diverse viruses is lacking. Results Using HRV and HEV as test cases, we assess a general design strategy for detecting and serotyping genetically diverse viruses. A minimal number of probe sequences (26 for HRV and 13 for HEV), which were potentially capable of detecting all serotypes of HRV and HEV, were determined and implemented on the Resequencing Pathogen Microarray RPM-Flu v.30/31 (Tessarae RPM-Flu). The specificities of designed probes were validated using 34 HRV and 28 HEV strains. All strains were successfully detected and identified at least to species level. 33 HRV strains and 16 HEV strains could be further differentiated to serotype level. Conclusion This study provides a fundamental evaluation of simultaneous detection and differential identification of genetically diverse RNA viruses with a minimal number of prototype sequences. The results demonstrated that the newly designed RPM-Flu v.30/31 can provide comprehensive and specific analysis of HRV and HEV samples which implicates that this design strategy will be applicable for other genetically diverse viruses. PMID:19046445
Schweizer, Rena M; Robinson, Jacqueline; Harrigan, Ryan; Silva, Pedro; Galverni, Marco; Musiani, Marco; Green, Richard E; Novembre, John; Wayne, Robert K
2016-01-01
In an era of ever-increasing amounts of whole-genome sequence data for individuals and populations, the utility of traditional single nucleotide polymorphisms (SNPs) array-based genome scans is uncertain. We previously performed a SNP array-based genome scan to identify candidate genes under selection in six distinct grey wolf (Canis lupus) ecotypes. Using this information, we designed a targeted capture array for 1040 genes, including all exons and flanking regions, as well as 5000 1-kb nongenic neutral regions, and resequenced these regions in 107 wolves. Selection tests revealed striking patterns of variation within candidate genes relative to noncandidate regions and identified potentially functional variants related to local adaptation. We found 27% and 47% of candidate genes from the previous SNP array study had functional changes that were outliers in sweed and bayenv analyses, respectively. This result verifies the use of genomewide SNP surveys to tag genes that contain functional variants between populations. We highlight nonsynonymous variants in APOB, LIPG and USH2A that occur in functional domains of these proteins, and that demonstrate high correlation with precipitation seasonality and vegetation. We find Arctic and High Arctic wolf ecotypes have higher numbers of genes under selection, which highlight their conservation value and heightened threat due to climate change. This study demonstrates that combining genomewide genotyping arrays with large-scale resequencing and environmental data provides a powerful approach to discern candidate functional variants in natural populations. © 2015 John Wiley & Sons Ltd.
Natarajan, Sathishkumar; Kim, Hoy-Taek; Thamilarasan, Senthil Kumar; Veerappan, Karpagam; Park, Jong-In; Nou, Ill-Sup
2016-01-01
Powdery mildew is one of the most common fungal diseases in the world. This disease frequently affects melon (Cucumis melo L.) and other Cucurbitaceous family crops in both open field and greenhouse cultivation. One of the goals of genomics is to identify the polymorphic loci responsible for variation in phenotypic traits. In this study, powdery mildew disease assessment scores were calculated for four melon accessions, 'SCNU1154', 'Edisto47', 'MR-1', and 'PMR5'. To investigate the genetic variation of these accessions, whole genome re-sequencing using the Illumina HiSeq 2000 platform was performed. A total of 754,759,704 quality-filtered reads were generated, with an average of 82.64% coverage relative to the reference genome. Comparisons of the sequences for the melon accessions revealed around 7.4 million single nucleotide polymorphisms (SNPs), 1.9 million InDels, and 182,398 putative structural variations (SVs). Functional enrichment analysis of detected variations classified them into biological process, cellular component and molecular function categories. Further, a disease-associated QTL map was constructed for 390 SNPs and 45 InDels identified as related to defense-response genes. Among them 112 SNPs and 12 InDels were observed in powdery mildew responsive chromosomes. Accordingly, this whole genome re-sequencing study identified SNPs and InDels associated with defense genes that will serve as candidate polymorphisms in the search for sources of resistance against powdery mildew disease and could accelerate marker-assisted breeding in melon.
Payen, Thibaut; Murat, Claude; Gigant, Anaïs; Morin, Emmanuelle; De Mita, Stéphane; Martin, Francis
2015-09-01
The Périgord black truffle (Tuber melanosporum Vittad.), considered a gastronomic delicacy worldwide, is an ectomycorrhizal filamentous fungus that is ecologically important in Mediterranean French, Italian and Spanish woodlands. In this study, we developed a novel resource of single nucleotide polymorphisms (SNPs) for T. melanosporum using Illumina high-throughput resequencing. The genome from six T. melanosporum geographical accessions was sequenced to a depth of approximately 20×. These geographical accessions were selected from different populations within the northern and southern regions of the geographical species distribution. Approximately 80% of the reads for each of the six resequenced geographical accessions mapped against the reference T. melanosporum genome assembly, estimating the core genome size of this organism to be approximately 110 Mbp. A total of 442 326 SNPs corresponding to 3540 SNPs/Mbps were identified as being included in all seven genomes. The SNPs occurred more frequently in repeated sequences (85%), although 4501 SNPs were also identified in the coding regions of 2587 genes. Using the ratio of nonsynonymous mutations per nonsynonymous site (pN) to synonymous mutations per synonymous site (pS) and Tajima's D index scanning the whole genome, we were able to identify genomic regions and genes potentially subjected to positive or purifying selection. The SNPs identified represent a valuable resource for future population genetics and genomics studies. © 2015 John Wiley & Sons Ltd.
Meadows, J R S; Kijas, J W
2009-02-01
The male-specific region of the ovine Y chromosome (MSY) remains poorly characterized, yet sequence variants from this region have the potential to reveal the wild progenitor of domestic sheep or examples of domestic and wild paternal introgression. The 5' promoter region of the sex-determining gene SRY was re-sequenced using a subset of wild sheep including bighorn (Ovis canadensis), thinhorn (Ovis dalli spp.), urial (Ovis vignei), argali (Ovis ammon), mouflon (Ovis musimon) and domestic sheep (Ovis aries). Seven novel SNPs (oY2-oY8) were revealed; these were polymorphic between but not within species. Re-sequencing and fragment analysis was applied to the MSY microsatellite SRYM18. It contains a complex compound repeat structure and sequencing of three novel size fragments revealed that a pentanucleotide element remained fixed, whilst a dinucleotide element displayed variability within species. Comparison of the sequence between species revealed that urial and argali sheep grouped more closely to the mouflon and domestic breeds than the pachyceriforms (bighorn and thinhorn). SNP and microsatellite data were combined to define six previously undetected haplotypes. Analysis revealed the mouflon as the only species to share a haplotype with domestic sheep, consistent with its status as a feral domesticate that has undergone male-mediated exchange with domestic animals. A comparison of the remaining wild species and domestic sheep revealed that O. aries is free from signatures of wild sheep introgression.
Visschedijk, Marijn C; Alberts, Rudi; Mucha, Soren; Deelen, Patrick; de Jong, Dirk J; Pierik, Marieke; Spekhorst, Lieke M; Imhann, Floris; van der Meulen-de Jong, Andrea E; van der Woude, C Janneke; van Bodegraven, Adriaan A; Oldenburg, Bas; Löwenberg, Mark; Dijkstra, Gerard; Ellinghaus, David; Schreiber, Stefan; Wijmenga, Cisca; Rivas, Manuel A; Franke, Andre; van Diemen, Cleo C; Weersma, Rinse K
2016-01-01
Genome-wide association studies have revealed several common genetic risk variants for ulcerative colitis (UC). However, little is known about the contribution of rare, large effect genetic variants to UC susceptibility. In this study, we performed a deep targeted re-sequencing of 122 genes in Dutch UC patients in order to investigate the contribution of rare variants to the genetic susceptibility to UC. The selection of genes consists of 111 established human UC susceptibility genes and 11 genes that lead to spontaneous colitis when knocked-out in mice. In addition, we sequenced the promoter regions of 45 genes where known variants exert cis-eQTL-effects. Targeted pooled re-sequencing was performed on DNA of 790 Dutch UC cases. The Genome of the Netherlands project provided sequence data of 500 healthy controls. After quality control and prioritization based on allele frequency and pathogenicity probability, follow-up genotyping of 171 rare variants was performed on 1021 Dutch UC cases and 1166 Dutch controls. Single-variant association and gene-based analyses identified an association of rare variants in the MUC2 gene with UC. The associated variants in the Dutch population could not be replicated in a German replication cohort (1026 UC cases, 3532 controls). In conclusion, this study has identified a putative role for MUC2 on UC susceptibility in the Dutch population and suggests a population-specific contribution of rare variants to UC.
Zhang, Quan; Zhu, Feng; Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua
2015-01-01
Eggshell damages lead to economic losses in the egg production industry and are a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways as revealed by gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate the transcriptome and genome re-sequencing to target the genetic variations which decreased the eggshell qualities. These findings further advance our understanding of eggshell calcification in the chicken uterus.
Detection of Naegleria Species in Environmental Samples from Peninsular Malaysia
Ithoi, Init; Ahmad, Arine Fadzlun; Nissapatorn, Veeranoot; Lau, Yee Ling; Mahmud, Rohela; Mak, Joon Wah
2011-01-01
Background In Malaysia, researchers and medical practitioners are unfamiliar with Naegleria infections. Thus little is known about the existence of pathogenic Naegleria fowleri, and the resultant primary amoebic meningoencephalitis (PAM) is seldom included in the differential diagnosis of central nervous system infections. This study was conducted to detect the presence of Naegleria species in various environmental samples. Methods/Findings A total of 41 Naegleria-like isolates were isolated from water and dust samples. All these isolates were subjected to PCR using two primer sets designed from the ITS1-ITS2 regions. The N. fowleri species-specific primer set failed to produce the expected amplicon. The Naegleria genus-specific primers produced amplicons of 408 bp (35), 450 bp (2), 457 bp (2) or 381 bp (2) from all 41 isolates isolated from aquatic (33) and dust (8) samples. Analysis of the sequences from 10 representative isolates revealed that amplicons with fragments 408, 450 and 457 bp showed homology with non-pathogenic Naegleria species, and 381 bp showed homology with Vahlkampfia species. These results concurred with the morphological observation that all 39 isolates which exhibited flagella were Naegleria, while 2 isolates (AC7, JN034055 and AC8, JN034056) that did not exhibit flagella were Vahlkampfia species. Conclusion To date, pathogenic species of N. fowleri have not been isolated from Malaysia. All 39 isolates that produced amplicons (408, 450 and 457 bp) from the genus-specific primers were identified as being similar to nonpathogenic Naegleria. Amplicon 408 bp from 5 representative isolates showed 100% and 99.7% identity to Naegleria philippinensis isolate RJTM (AM167890) and is thus believed to be the most common species in our environment. Amplicons 450 bp and 457 bp were respectively believed to be from 2 new species of Naegleria, since representative isolates showed lower homology and had a longer base pair length when compared to the reference species in the Genbank, Naegleria schusteri (AJ566626) and Naegleria laresi (AJ566630), respectively. PMID:21915311
Deciphering the Mechanism of Alternative Cleavage and Polyadenylation in Mantle Cell Lymphoma (MCL)
2014-10-01
Kubo , T., Wada, T., Yamaguchi, Y., Shimizu, A. & Handa, H. Knock-down of 25 kDa subunit of cleavage factor Im inHela cells alters alternative...usage was calculated as 62normalized DDDCT. Oligonucleotides used for qRT–PCR. Cyclin D1 common forward, 59-CTGC CAGGAGCAGATCGAAG; reverse, 59...CTdeviation of either amplicon at all of the dilutions was calculated as a correction factor. d, The experiment shown in c was repeated for DICER1 and
Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform
Van Nostrand, Joy D.; Ning, Daliang; Sun, Bo; Xue, Kai; Liu, Feifei; Deng, Ye; Liang, Yuting; Zhou, Jizhong
2017-01-01
Illumina’s MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered, the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1–3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility. PMID:28453559
Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wen, Chongqing; Wu, Liyou; Qin, Yujia
Illumina's MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered,more » the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1-3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility.« less
Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform
Wen, Chongqing; Wu, Liyou; Qin, Yujia; ...
2017-04-28
Illumina's MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered,more » the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1-3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility.« less
Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform.
Wen, Chongqing; Wu, Liyou; Qin, Yujia; Van Nostrand, Joy D; Ning, Daliang; Sun, Bo; Xue, Kai; Liu, Feifei; Deng, Ye; Liang, Yuting; Zhou, Jizhong
2017-01-01
Illumina's MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered, the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1-3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility.
Adachi, Noboru; Umetsu, Kazuo; Shojo, Hideki
2014-01-01
Mitochondrial DNA (mtDNA) is widely used for DNA analysis of highly degraded samples because of its polymorphic nature and high number of copies in a cell. However, as endogenous mtDNA in deteriorated samples is scarce and highly fragmented, it is not easy to obtain reliable data. In the current study, we report the risks of direct sequencing mtDNA in highly degraded material, and suggest a strategy to ensure the quality of sequencing data. It was observed that direct sequencing data of the hypervariable segment (HVS) 1 by using primer sets that generate an amplicon of 407 bp (long-primer sets) was different from results obtained by using newly designed primer sets that produce an amplicon of 120-139 bp (mini-primer sets). The data aligned with the results of mini-primer sets analysis in an amplicon length-dependent manner; the shorter the amplicon, the more evident the endogenous sequence became. Coding region analysis using multiplex amplified product-length polymorphisms revealed the incongruence of single nucleotide polymorphisms between the coding region and HVS 1 caused by contamination with exogenous mtDNA. Although the sequencing data obtained using long-primer sets turned out to be erroneous, it was unambiguous and reproducible. These findings suggest that PCR primers that produce amplicons shorter than those currently recognized should be used for mtDNA analysis in highly degraded samples. Haplogroup motif analysis of the coding region and HVS should also be performed to improve the reliability of forensic mtDNA data. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Mitsui, Jun; Fukuda, Yoko; Azuma, Kyo; Tozaki, Hirokazu; Ishiura, Hiroyuki; Takahashi, Yuji; Goto, Jun; Tsuji, Shoji
2010-07-01
We have recently found that multiple rare variants of the glucocerebrosidase gene (GBA) confer a robust risk for Parkinson disease, supporting the 'common disease-multiple rare variants' hypothesis. To develop an efficient method of identifying rare variants in a large number of samples, we applied multiplexed resequencing using a next-generation sequencer to identification of rare variants of GBA. Sixteen sets of pooled DNAs from six pooled DNA samples were prepared. Each set of pooled DNAs was subjected to polymerase chain reaction to amplify the target gene (GBA) covering 6.5 kb, pooled into one tube with barcode indexing, and then subjected to extensive sequence analysis using the SOLiD System. Individual samples were also subjected to direct nucleotide sequence analysis. With the optimization of data processing, we were able to extract all the variants from 96 samples with acceptable rates of false-positive single-nucleotide variants.
Multiplex amplification of large sets of human exons.
Porreca, Gregory J; Zhang, Kun; Li, Jin Billy; Xie, Bin; Austin, Derek; Vassallo, Sara L; LeProust, Emily M; Peck, Bill J; Emig, Christopher J; Dahl, Fredrik; Gao, Yuan; Church, George M; Shendure, Jay
2007-11-01
A new generation of technologies is poised to reduce DNA sequencing costs by several orders of magnitude. But our ability to fully leverage the power of these technologies is crippled by the absence of suitable 'front-end' methods for isolating complex subsets of a mammalian genome at a scale that matches the throughput at which these platforms will routinely operate. We show that targeting oligonucleotides released from programmable microarrays can be used to capture and amplify approximately 10,000 human exons in a single multiplex reaction. Additionally, we show integration of this protocol with ultra-high-throughput sequencing for targeted variation discovery. Although the multiplex capture reaction is highly specific, we found that nonuniform capture is a key issue that will need to be resolved by additional optimization. We anticipate that highly multiplexed methods for targeted amplification will enable the comprehensive resequencing of human exons at a fraction of the cost of whole-genome resequencing.
Varshney, Rajeev K; Saxena, Rachit K; Upadhyaya, Hari D; Khan, Aamir W; Yu, Yue; Kim, Changhoon; Rathore, Abhishek; Kim, Dongseon; Kim, Jihun; An, Shaun; Kumar, Vinay; Anuradha, Ghanta; Yamini, Kalinati Narasimhan; Zhang, Wei; Muniswamy, Sonnappa; Kim, Jong-So; Penmetsa, R Varma; von Wettberg, Eric; Datta, Swapan K
2017-07-01
Pigeonpea (Cajanus cajan), a tropical grain legume with low input requirements, is expected to continue to have an important role in supplying food and nutritional security in developing countries in Asia, Africa and the tropical Americas. From whole-genome resequencing of 292 Cajanus accessions encompassing breeding lines, landraces and wild species, we characterize genome-wide variation. On the basis of a scan for selective sweeps, we find several genomic regions that were likely targets of domestication and breeding. Using genome-wide association analysis, we identify associations between several candidate genes and agronomically important traits. Candidate genes for these traits in pigeonpea have sequence similarity to genes functionally characterized in other plants for flowering time control, seed development and pod dehiscence. Our findings will allow acceleration of genetic gains for key traits to improve yield and sustainability in pigeonpea.
Detecting Directional Selection in the Presence of Recent Admixture in African-Americans
Lohmueller, Kirk E.; Bustamante, Carlos D.; Clark, Andrew G.
2011-01-01
We investigate the performance of tests of neutrality in admixed populations using plausible demographic models for African-American history as well as resequencing data from African and African-American populations. The analysis of both simulated and human resequencing data suggests that recent admixture does not result in an excess of false-positive results for neutrality tests based on the frequency spectrum after accounting for the population growth in the parental African population. Furthermore, when simulating positive selection, Tajima's D, Fu and Li's D, and haplotype homozygosity have lower power to detect population-specific selection using individuals sampled from the admixed population than from the nonadmixed population. Fay and Wu's H test, however, has more power to detect selection using individuals from the admixed population than from the nonadmixed population, especially when the selective sweep ended long ago. Our results have implications for interpreting recent genome-wide scans for positive selection in human populations. PMID:21196524
Dennis, Paul G.; Keller, Jurg; Tyson, Gene W.
2012-01-01
Microbially induced concrete corrosion (MICC) is an important problem in sewers. Here, small-subunit (SSU) rRNA gene amplicon pyrosequencing was used to characterize MICC communities. Microbial community composition differed between wall- and ceiling-associated MICC layers. Acidithiobacillus spp. were present at low abundances, and the communities were dominated by other sulfur-oxidizing-associated lineages. PMID:22843532
Rubio, Marcela da Silva; Penha Filho, Rafael Antonio Casarin; Almeida, Adriana Maria de; Berchieri, Angelo
2017-12-01
Currently there are 2659 Salmonella serovars. The host-specific biovars Salmonella Pullorum and Salmonella Gallinarum cause systemic infections in food-producing and wild birds. Fast diagnosis is crucial to control the dissemination in avian environments. The present work describes the development of a multiplex qPCR in real time using a low-cost DNA dye (SYBr Green) to identify and quantify these biovars. Primers were chosen based on genomic regions of difference (RoD) and optimized to control dimers. Primers pSGP detect both host-specific biovars but not other serovars and pSG and pSP differentiate biovars. Three amplicons showed different melting temperatures (Tm), allowing differentiation. The pSGP amplicon (97 bp) showed Tm of 78°C for both biovars. The pSG amplicon (273 bp) showed a Tm of 86.2°C for S. Gallinarum and pSP amplicon (260 bp) dissociated at 84.8°C for S. Pullorum identification. The multiplex qPCR in real time showed high sensitivity and was capable of quantifying 10 8 -10 1 CFU of these biovars.
DNA melting analysis: application of the "open tube" format for detection of mutant KRAS.
Botezatu, Irina V; Kondratova, Valentina N; Shelepov, Valery P; Lichtenstein, Anatoly V
2011-12-15
High-resolution melting (HRM) analysis is a very effective method for genotyping and mutation scanning that is usually performed just after PCR amplification (the "closed tube" format). Though simple and convenient, the closed tube format makes the HRM dependent on the PCR mix, not generally optimal for DNA melting analysis. Here, the "open tube" format, namely the post-PCR optimization procedure (amplicon shortening and solution chemistry modification), is proposed. As a result, mutation scanning of short amplicons becomes feasible on a standard real-time PCR instrument (not primarily designed for HRM) using SYBR Green I. This approach has allowed us to considerably enhance the sensitivity of detecting mutant KRAS using both low- and high-resolution systems (the Bio-Rad iQ5-SYBR Green I and Bio-Rad CFX96-EvaGreen, respectively). The open tube format, though more laborious than the closed tube one, can be used in situations when maximal sensitivity of the method is needed. It also permits standardization of DNA melting experiments and the introduction of instruments of a "lower level" into the range of those suitable for mutation scanning. Copyright © 2011 Elsevier Inc. All rights reserved.
CoVaCS: a consensus variant calling system.
Chiara, Matteo; Gioiosa, Silvia; Chillemi, Giovanni; D'Antonio, Mattia; Flati, Tiziano; Picardi, Ernesto; Zambelli, Federico; Horner, David Stephen; Pesole, Graziano; Castrignanò, Tiziana
2018-02-05
The advent and ongoing development of next generation sequencing technologies (NGS) has led to a rapid increase in the rate of human genome re-sequencing data, paving the way for personalized genomics and precision medicine. The body of genome resequencing data is progressively increasing underlining the need for accurate and time-effective bioinformatics systems for genotyping - a crucial prerequisite for identification of candidate causal mutations in diagnostic screens. Here we present CoVaCS, a fully automated, highly accurate system with a web based graphical interface for genotyping and variant annotation. Extensive tests on a gold standard benchmark data-set -the NA12878 Illumina platinum genome- confirm that call-sets based on our consensus strategy are completely in line with those attained by similar command line based approaches, and far more accurate than call-sets from any individual tool. Importantly our system exhibits better sensitivity and higher specificity than equivalent commercial software. CoVaCS offers optimized pipelines integrating state of the art tools for variant calling and annotation for whole genome sequencing (WGS), whole-exome sequencing (WES) and target-gene sequencing (TGS) data. The system is currently hosted at Cineca, and offers the speed of a HPC computing facility, a crucial consideration when large numbers of samples must be analysed. Importantly, all the analyses are performed automatically allowing high reproducibility of the results. As such, we believe that CoVaCS can be a valuable tool for the analysis of human genome resequencing studies. CoVaCS is available at: https://bioinformatics.cineca.it/covacs .
Powell, John H; Amish, Stephen J; Haynes, Gwilym D; Luikart, Gordon; Latch, Emily K
2016-09-01
Mule deer (Odocoileus hemionus) are an excellent nonmodel species for empirically testing hypotheses in landscape and population genomics due to their large population sizes (low genetic drift), relatively continuous distribution, diversity of occupied habitats and phenotypic variation. Because few genomic resources are currently available for this species, we used exon data from a cattle (Bos taurus) reference genome to direct targeted resequencing of 5935 genes in mule deer. We sequenced approximately 3.75 Mbp at minimum 20X coverage in each of the seven mule deer, identifying 23 204 single nucleotide polymorphisms (SNPs) within, or adjacent to, 6886 exons in 3559 genes. We found 91 SNP loci (from 69 genes) with putatively fixed allele frequency differences between the two major lineages of mule deer (mule deer and black-tailed deer), and our estimate of mean genetic divergence (genome-wide FST = 0.123) between these lineages was consistent with previous findings using microsatellite loci. We detected an over-representation of gamete generation and amino acid transport genes among the genes with SNPs exhibiting potentially fixed allele frequency differences between lineages. This targeted resequencing approach using exon capture techniques has identified a suite of loci that can be used in future research to investigate the genomic basis of adaptation and differentiation between black-tailed deer and mule deer. This study also highlights techniques (and an exon capture array) that will facilitate population genomic research in other cervids and nonmodel organisms. © 2016 John Wiley & Sons Ltd.
Lee, Hayan; Schatz, Michael C
2012-08-15
Genome resequencing and short read mapping are two of the primary tools of genomics and are used for many important applications. The current state-of-the-art in mapping uses the quality values and mapping quality scores to evaluate the reliability of the mapping. These attributes, however, are assigned to individual reads and do not directly measure the problematic repeats across the genome. Here, we present the Genome Mappability Score (GMS) as a novel measure of the complexity of resequencing a genome. The GMS is a weighted probability that any read could be unambiguously mapped to a given position and thus measures the overall composition of the genome itself. We have developed the Genome Mappability Analyzer to compute the GMS of every position in a genome. It leverages the parallelism of cloud computing to analyze large genomes, and enabled us to identify the 5-14% of the human, mouse, fly and yeast genomes that are difficult to analyze with short reads. We examined the accuracy of the widely used BWA/SAMtools polymorphism discovery pipeline in the context of the GMS, and found discovery errors are dominated by false negatives, especially in regions with poor GMS. These errors are fundamental to the mapping process and cannot be overcome by increasing coverage. As such, the GMS should be considered in every resequencing project to pinpoint the 'dark matter' of the genome, including of known clinically relevant variations in these regions. The source code and profiles of several model organisms are available at http://gma-bio.sourceforge.net
Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua
2015-01-01
Eggshell damages lead to economic losses in the egg production industry and are a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways as reveled by gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate the transcriptome and genome re-sequencing to target the genetic variations which decreased the eggshell qualities. These findings further advance our understanding of eggshell calcification in the chicken uterus. PMID:25974068
Kim, Eun Hye; Lee, Hwan Young; Yang, In Seok; Jung, Sang-Eun; Yang, Woo Ick; Shin, Kyoung-Jin
2016-05-01
The next-generation sequencing (NGS) method has been utilized to analyze short tandem repeat (STR) markers, which are routinely used for human identification purposes in the forensic field. Some researchers have demonstrated the successful application of the NGS system to STR typing, suggesting that NGS technology may be an alternative or additional method to overcome limitations of capillary electrophoresis (CE)-based STR profiling. However, there has been no available multiplex PCR system that is optimized for NGS analysis of forensic STR markers. Thus, we constructed a multiplex PCR system for the NGS analysis of 18 markers (13CODIS STRs, D2S1338, D19S433, Penta D, Penta E and amelogenin) by designing amplicons in the size range of 77-210 base pairs. Then, PCR products were generated from two single-sources, mixed samples and artificially degraded DNA samples using a multiplex PCR system, and were prepared for sequencing on the MiSeq system through construction of a subsequent barcoded library. By performing NGS and analyzing the data, we confirmed that the resultant STR genotypes were consistent with those of CE-based typing. Moreover, sequence variations were detected in targeted STR regions. Through the use of small-sized amplicons, the developed multiplex PCR system enables researchers to obtain successful STR profiles even from artificially degraded DNA as well as STR loci which are analyzed with large-sized amplicons in the CE-based commercial kits. In addition, successful profiles can be obtained from mixtures up to a 1:19 ratio. Consequently, the developed multiplex PCR system, which produces small size amplicons, can be successfully applied to STR NGS analysis of forensic casework samples such as mixtures and degraded DNA samples. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Gang; Olson, J.C.; Pu, R.
1995-10-01
Serological assays are routinely used in the laboratory diagnosis of human immunodeficiency virus type-1 (HrV-1) infection, but the polymerase chain reaction (PCR) is ultimately the most sensitive and direct method for establishing definitive diagnosis. As an alternative to the conventional radioactive PCR procedure we have developed and evaluated a pair of rapid nonradioisotopic flow cytometric detection methods. Using heminested PCR we directly incorporated fluorescein-12-dUTP (fluo-dUTP) or digoxigenin-11-dUTP (dig-dUTP) into the PCR-amplicons. The labeled amplicons were hybridized with biotinylated antisense and sense probes, followed by capture of the hybrid DNA using streptavidin-coated beads which were finally analyzed in a flow cytometermore » by (1) direct detection of the fluorescence intensity of the amplicons incorporating fluo-dUTP and (2) immunodetection of the amplicons incorporating dig-dUTP by anti-digoxigenin IgG labeled with fluorescein isothiocyanate (FITC). Although both assays were functionally comparable with radiolabeled probe in reliably detecting as low as five copies of HIV-1 proviral DNA sequences, the immunodetection of dig-dUTP consistently yielded higher mean channel fluorescence and gave a stable signal over an extended period of 12-14 weeks. In testing a panel of 20 pedigreed PBMC specimens from blood donors with or without HIV-1 infection, the results of both flow cytometric assays were identical with those of the conventional radioactive procedure. Therefore, we conclude that the dig-dUTP incorporation in amplicons, hybridization with a pair of sense-antisense biotinylated probes and immunodetection of hybrids by flow cytometric analyses is the nonisotopic method of choice for PCR-diagnosis of HIV-1 infection. 21 refs., 2 figs., 4 tabs.« less
Fast, accurate and easy-to-pipeline methods for amplicon sequence processing
NASA Astrophysics Data System (ADS)
Antonielli, Livio; Sessitsch, Angela
2016-04-01
Next generation sequencing (NGS) technologies established since years as an essential resource in microbiology. While on the one hand metagenomic studies can benefit from the continuously increasing throughput of the Illumina (Solexa) technology, on the other hand the spreading of third generation sequencing technologies (PacBio, Oxford Nanopore) are getting whole genome sequencing beyond the assembly of fragmented draft genomes, making it now possible to finish bacterial genomes even without short read correction. Besides (meta)genomic analysis next-gen amplicon sequencing is still fundamental for microbial studies. Amplicon sequencing of the 16S rRNA gene and ITS (Internal Transcribed Spacer) remains a well-established widespread method for a multitude of different purposes concerning the identification and comparison of archaeal/bacterial (16S rRNA gene) and fungal (ITS) communities occurring in diverse environments. Numerous different pipelines have been developed in order to process NGS-derived amplicon sequences, among which Mothur, QIIME and USEARCH are the most well-known and cited ones. The entire process from initial raw sequence data through read error correction, paired-end read assembly, primer stripping, quality filtering, clustering, OTU taxonomic classification and BIOM table rarefaction as well as alternative "normalization" methods will be addressed. An effective and accurate strategy will be presented using the state-of-the-art bioinformatic tools and the example of a straightforward one-script pipeline for 16S rRNA gene or ITS MiSeq amplicon sequencing will be provided. Finally, instructions on how to automatically retrieve nucleotide sequences from NCBI and therefore apply the pipeline to targets other than 16S rRNA gene (Greengenes, SILVA) and ITS (UNITE) will be discussed.
Kresse, Stine H; Berner, Jeanne-Marie; Meza-Zepeda, Leonardo A; Gregory, Simon G; Kuo, Wen-Lin; Gray, Joe W; Forus, Anne; Myklebost, Ola
2005-11-07
Amplification of the q21-q23 region on chromosome 1 is frequently found in sarcomas and a variety of other solid tumours. Previous analyses of sarcomas have indicated the presence of at least two separate amplicons within this region, one located in 1q21 and one located near the apolipoprotein A-II (APOA2) gene in 1q23. In this study we have mapped and characterized the amplicon in 1q23 in more detail. We have used fluorescence in situ hybridisation (FISH) and microarray-based comparative genomic hybridisation (array CGH) to map and define the borders of the amplicon in 10 sarcomas. A subregion of approximately 800 kb was identified as the core of the amplicon. The amplification patterns of nine possible candidate target genes located to this subregion were determined by Southern blot analysis. The genes activating transcription factor 6 (ATF6) and dual specificity phosphatase 12 (DUSP12) showed the highest level of amplification, and they were also shown to be over-expressed by quantitative real-time reverse transcription PCR (RT-PCR). In general, the level of expression reflected the level of amplification in the different tumours. DUSP12 was expressed significantly higher than ATF6 in a subset of the tumours. In addition, two genes known to be transcriptionally activated by ATF6, glucose-regulated protein 78 kDa and -94 kDa (GRP78 and GRP94), were shown to be over-expressed in the tumours that showed over-expression of ATF6. ATF6 and DUSP12 seem to be the most likely candidate target genes for the 1q23 amplification in sarcomas. Both genes have possible roles in promoting cell growth, which makes them interesting candidate targets.
Agricultural biodiversity in the post-genomics era
USDA-ARS?s Scientific Manuscript database
The toolkit available for assessing and utilizing biological diversity within agricultural systems is rapidly expanding. In particular, genome and transcriptome re-sequencing as well as genome complexity reduction techniques are gaining popularity as the cost of generating short read sequence data d...
Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe
2010-01-01
Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome. PMID:20543950
Odell, L J; Baumgartner, J C; Xia, T; David, L L
1999-08-01
Collagenase is a potential virulence factor shown to be expressed by Porphyromonas gingivalis associated with periodontal disease. The purpose of this study was to use the polymerase chain reaction (PCR) to detect the presence of the collagenase gene (prtC) in 21 strains of Porphyromonas species isolated from endodontic infections. Type strains for P. gingivalis (ATCC 33277), P. endodontalis (ATCC 35406), Prevotella intermedia (ATCC 25611), and Prevotella nigrescens (ATCC 33563) were used as controls. When PCR primers specific for the 16S ribosomal RNA gene of P. gingivalis or P. endodontalis were used, 16 of the strains were identified as P. gingivalis, and five strains were identified as P. endodontalis. The presence of the prtC gene for collagenase was detected using PCR. Amplicons were analyzed by agarose gel electrophoresis, with an 815 bp amplicon representing the presence of the collagenase gene. Type strain ATCC 33277 and all 16 clinical isolates of P. gingivalis produced the collagenase gene amplicon. Neither type strain ATCC 35406 nor the five strains from clinical isolates of P. endodontalis produced the collagenase gene amplicon. These results indicate that P. gingivalis from endodontic infections possesses the prtC gene. P. endodontalis does not seem to exhibit prtC. The virulence of P. gingivalis may be related to its production of collagenase.
Toyota, M; Canzian, F; Ushijima, T; Hosoya, Y; Kuramoto, T; Serikawa, T; Imai, K; Sugimura, T; Nagao, M
1996-01-01
Representational difference analysis (RDA) was applied to isolate chromosomal markers in the rat. Four series of RDA [restriction enzymes, BamHI and HindIII; subtraction of ACI/N (ACI) amplicon from BUF/Nac (BUF) amplicon and vice versa] yielded 131 polymorphic markers; 125 of these markers were mapped to all chromosomes except for chromosome X. This was done by using a mapping panel of 105 ACI x BUF F2 rats. To complement the relative paucity of chromosomal markers in the rat, genetically directed RDA, which allows isolation of polymorphic markers in the specific chromosomal region, was performed. By changing the F2 driver-DNA allele frequency around the region, four markers were isolated from the D1Ncc1 locus. Twenty-five of 27 RDA markers were informative regarding the dot blot analysis of amplicons, hybridizing only with tester amplicons. Dot blot analysis at a high density per unit of area made it possible to process a large number of samples. Quantitative trait loci can now be mapped in the rat genome by processing a large number of samples with RDA markers and then by isolating markers close to the loci of interest by genetically directed RDA. Images Fig. 1 Fig. 3 Fig. 4 PMID:8632989
Fraley, Stephanie I.; Athamanolap, Pornpat; Masek, Billie J.; Hardick, Justin; Carroll, Karen C.; Hsieh, Yu-Hsiang; Rothman, Richard E.; Gaydos, Charlotte A.; Wang, Tza-Huei; Yang, Samuel
2016-01-01
High Resolution Melt (HRM) is a versatile and rapid post-PCR DNA analysis technique primarily used to differentiate sequence variants among only a few short amplicons. We recently developed a one-vs-one support vector machine algorithm (OVO SVM) that enables the use of HRM for identifying numerous short amplicon sequences automatically and reliably. Herein, we set out to maximize the discriminating power of HRM + SVM for a single genetic locus by testing longer amplicons harboring significantly more sequence information. Using universal primers that amplify the hypervariable bacterial 16 S rRNA gene as a model system, we found that long amplicons yield more complex HRM curve shapes. We developed a novel nested OVO SVM approach to take advantage of this feature and achieved 100% accuracy in the identification of 37 clinically relevant bacteria in Leave-One-Out-Cross-Validation. A subset of organisms were independently tested. Those from pure culture were identified with high accuracy, while those tested directly from clinical blood bottles displayed more technical variability and reduced accuracy. Our findings demonstrate that long sequences can be accurately and automatically profiled by HRM with a novel nested SVM approach and suggest that clinical sample testing is feasible with further optimization. PMID:26778280
Intrinsic challenges in ancient microbiome reconstruction using 16S rRNA gene amplification.
Ziesemer, Kirsten A; Mann, Allison E; Sankaranarayanan, Krithivasan; Schroeder, Hannes; Ozga, Andrew T; Brandt, Bernd W; Zaura, Egija; Waters-Rist, Andrea; Hoogland, Menno; Salazar-García, Domingo C; Aldenderfer, Mark; Speller, Camilla; Hendy, Jessica; Weston, Darlene A; MacDonald, Sandy J; Thomas, Gavin H; Collins, Matthew J; Lewis, Cecil M; Hofman, Corinne; Warinner, Christina
2015-11-13
To date, characterization of ancient oral (dental calculus) and gut (coprolite) microbiota has been primarily accomplished through a metataxonomic approach involving targeted amplification of one or more variable regions in the 16S rRNA gene. Specifically, the V3 region (E. coli 341-534) of this gene has been suggested as an excellent candidate for ancient DNA amplification and microbial community reconstruction. However, in practice this metataxonomic approach often produces highly skewed taxonomic frequency data. In this study, we use non-targeted (shotgun metagenomics) sequencing methods to better understand skewed microbial profiles observed in four ancient dental calculus specimens previously analyzed by amplicon sequencing. Through comparisons of microbial taxonomic counts from paired amplicon (V3 U341F/534R) and shotgun sequencing datasets, we demonstrate that extensive length polymorphisms in the V3 region are a consistent and major cause of differential amplification leading to taxonomic bias in ancient microbiome reconstructions based on amplicon sequencing. We conclude that systematic amplification bias confounds attempts to accurately reconstruct microbiome taxonomic profiles from 16S rRNA V3 amplicon data generated using universal primers. Because in silico analysis indicates that alternative 16S rRNA hypervariable regions will present similar challenges, we advocate for the use of a shotgun metagenomics approach in ancient microbiome reconstructions.
Intrinsic challenges in ancient microbiome reconstruction using 16S rRNA gene amplification
Ziesemer, Kirsten A.; Mann, Allison E.; Sankaranarayanan, Krithivasan; Schroeder, Hannes; Ozga, Andrew T.; Brandt, Bernd W.; Zaura, Egija; Waters-Rist, Andrea; Hoogland, Menno; Salazar-García, Domingo C.; Aldenderfer, Mark; Speller, Camilla; Hendy, Jessica; Weston, Darlene A.; MacDonald, Sandy J.; Thomas, Gavin H.; Collins, Matthew J.; Lewis, Cecil M.; Hofman, Corinne; Warinner, Christina
2015-01-01
To date, characterization of ancient oral (dental calculus) and gut (coprolite) microbiota has been primarily accomplished through a metataxonomic approach involving targeted amplification of one or more variable regions in the 16S rRNA gene. Specifically, the V3 region (E. coli 341–534) of this gene has been suggested as an excellent candidate for ancient DNA amplification and microbial community reconstruction. However, in practice this metataxonomic approach often produces highly skewed taxonomic frequency data. In this study, we use non-targeted (shotgun metagenomics) sequencing methods to better understand skewed microbial profiles observed in four ancient dental calculus specimens previously analyzed by amplicon sequencing. Through comparisons of microbial taxonomic counts from paired amplicon (V3 U341F/534R) and shotgun sequencing datasets, we demonstrate that extensive length polymorphisms in the V3 region are a consistent and major cause of differential amplification leading to taxonomic bias in ancient microbiome reconstructions based on amplicon sequencing. We conclude that systematic amplification bias confounds attempts to accurately reconstruct microbiome taxonomic profiles from 16S rRNA V3 amplicon data generated using universal primers. Because in silico analysis indicates that alternative 16S rRNA hypervariable regions will present similar challenges, we advocate for the use of a shotgun metagenomics approach in ancient microbiome reconstructions. PMID:26563586
Lermo, Anabel; Liébana, Susana; Campoy, Susana; Fabiano, Silvia; García, M Inés; Soutullo, Adriana; Zumárraga, Martín J; Alegret, Salvador; Pividori, M Isabel
2010-06-01
A highly sensitive assay for rapidly screening-out Mycobacterium bovis in contaminated samples was developed based on electrochemical genosensing. The assay consists of specific amplification and double-tagging of the IS6110 fragment, highly related to M. bovis, followed by electrochemical detection of the amplified product. PCR amplification was carried out using a labeled set of primers and resulted in a amplicon tagged at each terminus with both biotin and digoxigenin. Two different electrochemical platforms for the detection of the double-tagged amplicon were evaluated: (i) an avidin biocomposite (Av-GEB) and (ii) a magneto sensor (m-GEC) combined with streptavidin magnetic beads. In both cases, the double- tagged amplicon was immobilized through its biotinylated end and electrochemically detected, using an antiDig-HRP conjugate, through its digoxigenin end. The assay was determined to be highly sensitive, based on the detection of 620 and 10 fmol of PCR amplicon using the Av-GEB and m-GEC strategies, respectively. Moreover, the m-GEC assay showed promising features for the detection of M. bovis on dairy farms by screening for the presence of the bacterium's DNA in milk samples. The obtained results are discussed and compared with respect to those of inter-laboratory PCR assays and tuberculin skin testing.
Poritz, Mark A.; Blaschke, Anne J.; Byington, Carrie L.; Meyers, Lindsay; Nilsson, Kody; Jones, David E.; Thatcher, Stephanie A.; Robbins, Thomas; Lingenfelter, Beth; Amiott, Elizabeth; Herbener, Amy; Daly, Judy; Dobrowolski, Steven F.; Teng, David H. -F.; Ririe, Kirk M.
2011-01-01
The ideal clinical diagnostic system should deliver rapid, sensitive, specific and reproducible results while minimizing the requirements for specialized laboratory facilities and skilled technicians. We describe an integrated diagnostic platform, the “FilmArray”, which fully automates the detection and identification of multiple organisms from a single sample in about one hour. An unprocessed biologic/clinical sample is subjected to nucleic acid purification, reverse transcription, a high-order nested multiplex polymerase chain reaction and amplicon melt curve analysis. Biochemical reactions are enclosed in a disposable pouch, minimizing the PCR contamination risk. FilmArray has the potential to detect greater than 100 different nucleic acid targets at one time. These features make the system well-suited for molecular detection of infectious agents. Validation of the FilmArray technology was achieved through development of a panel of assays capable of identifying 21 common viral and bacterial respiratory pathogens. Initial testing of the system using both cultured organisms and clinical nasal aspirates obtained from children demonstrated an analytical and clinical sensitivity and specificity comparable to existing diagnostic platforms. We demonstrate that automated identification of pathogens from their corresponding target amplicon(s) can be accomplished by analysis of the DNA melting curve of the amplicon. PMID:22039434
Electrochemical DNA sensor for anthrax toxin activator gene atxA-detection of PCR amplicons.
Das, Ritu; Goel, Ajay K; Sharma, Mukesh K; Upadhyay, Sanjay
2015-12-15
We report the DNA probe functionalized electrochemical genosensor for the detection of Bacillus anthracis, specific towards the regulatory gene atxA. The DNA sensor is fabricated on electrochemically deposited gold nanoparticle on self assembled layer of (3-Mercaptopropyl) trimethoxysilane (MPTS) on GC electrode. DNA hybridization is monitored by differential pulse voltammogram (DPV). The modified GC electrode is characterized by atomic force microscopy (AFM), cyclic voltammetry (CV), and electrochemical impedance spectroscopy (EIS) method. We also quantified the DNA probe density on electrode surface by the chronocoulometric method. The detection is specific and selective for atxA gene by DNA probe on the electrode surface. No report is available for the detection of B. anthracis by using atxA an anthrax toxin activator gene. In the light of real and complex sample, we have studied the PCR amplicons of 303, 361 and 568 base pairs by using symmetric and asymmetric PCR approaches. The DNA probe of atxA gene efficiently hybridizes with different base pairs of PCR amplicons. The detection limit is found to be 1.0 pM (S/N ratio=3). The results indicate that the DNA sensor is able to detect synthetic target as well as PCR amplicons of different base pairs. Copyright © 2015 Elsevier B.V. All rights reserved.
Complete genome assemblies and methylome characterization in infectious diseases
USDA-ARS?s Scientific Manuscript database
Understanding the genetic basis of infectious diseases is a critical component to effective treatments. Because of the rapid evolution of bacterial strains and frequent horizontal transfer of DNA between them, resequencing of new isolates against known reference strains often provides an incomplete ...
Genetics Home Reference: anauxetic dysplasia
... one gene that provides instructions for making a protein component of the RNase MRP enzyme complex can also cause anauxetic ... A, Donskoi M, Kenna TJ, Thomas GP, Clark GR, Duncan EL, Brown MA. Whole-exome re-sequencing in a family quartet identifies POP1 mutations as ...
Stewart, C Neal
2017-04-26
A new resequencing analysis of weedy rice (Oryza sativa L.) biotypes illuminates distinct evolutionary paths and outcomes of de-domestication and ferality. This largest effort to date in weedy plant genomics gives a better understanding of weediness while also providing a promising source of alleles for rice breeding.
Vertical stratification of bacteria and archaea in sediments of a boreal stratified humic lake
NASA Astrophysics Data System (ADS)
Rissanen, Antti J.; Mpamah, Promise; Peura, Sari; Taipale, Sami; Biasi, Christina; Nykänen, Hannu
2015-04-01
Boreal stratified humic lakes, with steep redox gradients in the water column and in the sediment, are important sources of methane (CH4) to the atmosphere. CH4 flux from these lakes is largely controlled by the balance between CH4-production (methanogenesis), which takes place in the organic rich sediment and in the deepest water layers, and CH4-consumption (methanotrophy), which takes place mainly in the water column. While there is already some published information on the activity, diversity and community structure of bacteria in the water columns of these lakes, such information on sediment microbial communities is very scarce. This study aims to characterize the vertical variation patterns in the diversity and the structure of microbial communities in sediment of a boreal stratified lake. Particular focus is on microbes with the potential to contribute to methanogenesis (fermentative bacteria and methanogenic archaea) and to methanotrophy (methanotrophic bacteria and archaea). Two sediment cores (26 cm deep), collected from the deepest point (~6 m) of a small boreal stratified lake during winter-stratification, were divided into depth sections of 1 to 2 cm for analyses. Communities were studied from DNA extracted from sediment samples by next-generation sequencing (Ion Torrent) of polymerase chain reaction (PCR) - amplified bacterial and archaeal 16S rRNA gene amplicons. The abundance of methanogenic archaea was also specifically studied by quantitative-PCR of methyl coenzyme-M reductase gene (mcrA) amplicons. Furthermore, the community structure and the abundance of bacteria were studied by phospholipid fatty acid (PLFA) analysis. Dominant potential fermentative bacteria belonged to families Syntrophaceae, Clostridiaceae and Peptostreptococcaceae. There were considerable differences in the vertical distribution among these groups. The relative abundance of Syntrophaceae started to increase from the sediment surface, peaked at depth layer from 5 to 10 cm (up to 21 % of bacterial 16S rRNA gene amplicons) and decreased gradually towards deeper layers while the relative abundances of Clostridiaceae and Peptostreptococcaceae started to increase at deeper depths, at 5 cm and 10 cm, respectively, both peaking at depth layer from 20 to 26 cm (Clostridiaceae up to 13 % and Peptostreptococcaceae up to 11 % of bacterial 16S rRNA amplicons). Methanogenic community was dominated by acetoclastic methanogens (genus Methanosaeta), which were most abundant at depth layer from sediment surface to 10 cm (up to 87 % of archaeal 16S rRNA gene amplicons) and decreased drastically until the depth of 18 cm having quite stable relative abundance from 18 to 26 cm (5 to 11 % of archaeal 16S rRNA gene amplicons). Hydrogenotrophic methanogens (Methanoregula, Methanolinea, Methanospirillum, Methanocella) (3 to 11 % of archaeal 16S rRNA gene amplicons) did not show any specific depth patterns. The proportion of methanotrophic microbes was very low and they consisted almost completely of type II methanotrophic bacteria (family Methylocystaceae), which had highest relative abundance at depth layer from 5 to 10 cm (up to 3 % of bacterial 16S rRNA gene amplicons) and were almost absent below 15 cm. Anaerobic methanotrophic archaea were not detected. These findings will be discussed with results from PLFA and q-PCR analyses.
Qiao, Jiangwei; Cai, Mengxian; Yan, Guixin; Wang, Nian; Li, Feng; Chen, Binyun; Gao, Guizhen; Xu, Kun; Li, Jun; Wu, Xiaoming
2016-01-01
Brassica napus (rapeseed) is a recent allotetraploid plant and the second most important oilseed crop worldwide. The origin of B. napus and the genetic relationships with its diploid ancestor species remain largely unresolved. Here, chloroplast DNA (cpDNA) from 488 B. napus accessions of global origin, 139 B. rapa accessions and 49 B. oleracea accessions were populationally resequenced using Illumina Solexa sequencing technologies. The intraspecific cpDNA variants and their allelic frequencies were called genomewide and further validated via EcoTILLING analyses of the rpo region. The cpDNA of the current global B. napus population comprises more than 400 variants (SNPs and short InDels) and maintains one predominant haplotype (Bncp1). Whole-genome resequencing of the cpDNA of Bncp1 haplotype eliminated its direct inheritance from any accession of the B. rapa or B. oleracea species. The distribution of the polymorphism information content (PIC) values for each variant demonstrated that B. napus has much lower cpDNA diversity than B. rapa; however, a vast majority of the wild and cultivated B. oleracea specimens appeared to share one same distinct cpDNA haplotype, in contrast to its wild C-genome relatives. This finding suggests that the cpDNA of the three Brassica species is well differentiated. The predominant B. napus cpDNA haplotype may have originated from uninvestigated relatives or from interactions between cpDNA mutations and natural/artificial selection during speciation and evolution. These exhaustive data on variation in cpDNA would provide fundamental data for research on cpDNA and chloroplasts. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Cheng, Hong-Qiu; Huang, En-Min; Xu, Ming-Yan; Shu, Shen-You
2012-01-01
The poliovirus receptor related-1 (PVRL1) gene encodes nectin-1, a cell–cell adhesion molecule (OMIM #600644), and is mutated in the cleft lip with or without cleft palate/ectodermal dysplasia-1 syndrome (CLPED1, OMIM #225000). In addition, PVRL1 mutations have been associated with nonsyndromic cleft lip with or without a cleft palate (NSCL/P) in studies of multiethnic samples. To investigate the possible involvement of this gene in southern Han Chinese NSCL/P patients, we performed (i) a case–control association study, and (ii) a resequencing study. A set of 470 patients with NSCL/P and 693 controls were recruited, and a total of 45 tagging single-nucleotide polymorphisms (SNPs) were genotyped by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. In the resequencing study, the coding regions of the PVRL1 α isoform were direct sequenced in 45 trios from multiply affected families. One (rs7128327) of the 45 tested SNPs showed a trend toward statistical significance in the genotypic-level chi-square test (p=0.009567). However, this result did not withstand correction for multiple testing. Likewise, sliding window haplotype analyses consisting of two, three, or four SNPs failed to detect any positive association. Resequencing analysis also failed to identify any novel rare sequence variants. In conclusion, the present study provided no support for the hypothesis that common or rare variants in PVRL1 play a significant role in NSCL/P development in the southern Han Chinese population. This is the first study that has used tagging SNPs covering all the coding and noncoding regions to search for common NSCL/P-associated mutations of PVRL1. PMID:22455396
2013-01-01
Background Genetic linkage maps are important tools in breeding programmes and quantitative trait analyses. Traditional molecular markers used for genotyping are limited in throughput and efficiency. The advent of next-generation sequencing technologies has facilitated progeny genotyping and genetic linkage map construction in the major grains. However, the applicability of the approach remains untested in the fungal system. Findings Shiitake mushroom, Lentinula edodes, is a basidiomycetous fungus that represents one of the most popular cultivated edible mushrooms. Here, we developed a rapid genotyping method based on low-coverage (~0.5 to 1.5-fold) whole-genome resequencing. We used the approach to genotype 20 single-spore isolates derived from L. edodes strain L54 and constructed the first high-density sequence-based genetic linkage map of L. edodes. The accuracy of the proposed genotyping method was verified experimentally with results from mating compatibility tests and PCR-single-strand conformation polymorphism on a few known genes. The linkage map spanned a total genetic distance of 637.1 cM and contained 13 linkage groups. Two hundred sequence-based markers were placed on the map, with an average marker spacing of 3.4 cM. The accuracy of the map was confirmed by comparing with previous maps the locations of known genes such as matA and matB. Conclusions We used the shiitake mushroom as an example to provide a proof-of-principle that low-coverage resequencing could allow rapid genotyping of basidiospore-derived progenies, which could in turn facilitate the construction of high-density genetic linkage maps of basidiomycetous fungi for quantitative trait analyses and improvement of genome assembly. PMID:23915543
Lévêque, Marianne; Marlin, Sandrine; Jonard, Laurence; Procaccio, Vincent; Reynier, Pascal; Amati-Bonneau, Patrizia; Baulande, Sylvain; Pierron, Denis; Lacombe, Didier; Duriez, Françoise; Francannet, Christine; Mom, Thierry; Journel, Hubert; Catros, Hélène; Drouin-Garraud, Valérie; Obstoy, Marie-Françoise; Dollfus, Hélène; Eliot, Marie-Madeleine; Faivre, Laurence; Duvillard, Christian; Couderc, Remy; Garabedian, Eréa-Noël; Petit, Christine; Feldmann, Delphine; Denoyelle, Françoise
2007-11-01
Mitochondrial DNA (mtDNA) mutations have been implicated in non-syndromic hearing loss either as primary or as predisposing factors. As only a part of the mitochondrial genome is usually explored in deafness, its prevalence is probably under-estimated. Among 1350 families with non-syndromic sensorineural hearing loss collected through a French collaborative network, we selected 29 large families with a clear maternal lineage and screened them for known mtDNA mutations in 12S rRNA, tRNASer(UCN) and tRNALeu(UUR) genes. When no mutation could be identified, a whole mitochondrial genome screening was performed, using a microarray resequencing chip: the MitoChip version 2.0 developed by Affymetrix Inc. Known mtDNA mutations was found in nine of the 29 families, which are described in the article: five with A1555G, two with the T7511C, one with 7472insC and one with A3243G mutation. In the remaining 20 families, the resequencing Mitochip detected 258 mitochondrial homoplasmic variants and 107 potentially heteroplasmic variants. Controls were made by direct sequencing on selected fragments and showed a high sensibility of the MitoChip but a low specificity, especially for heteroplasmic variations. An original analysis on the basis of species conservation, frequency and phylogenetic investigation was performed to select the more probably pathogenic variants. The entire genome analysis allowed us to identify five additional families with a putatively pathogenic mitochondrial variant: T669C, C1537T, G8078A, G12236A and G15077A. These results indicate that the new MitoChip platform is a rapid and valuable tool for identification of new mtDNA mutations in deafness.
Leichty, Aaron R; Brisson, Dustin
2014-10-01
Population genomic analyses have demonstrated power to address major questions in evolutionary and molecular microbiology. Collecting populations of genomes is hindered in many microbial species by the absence of a cost effective and practical method to collect ample quantities of sufficiently pure genomic DNA for next-generation sequencing. Here we present a simple method to amplify genomes of a target microbial species present in a complex, natural sample. The selective whole genome amplification (SWGA) technique amplifies target genomes using nucleotide sequence motifs that are common in the target microbe genome, but rare in the background genomes, to prime the highly processive phi29 polymerase. SWGA thus selectively amplifies the target genome from samples in which it originally represented a minor fraction of the total DNA. The post-SWGA samples are enriched in target genomic DNA, which are ideal for population resequencing. We demonstrate the efficacy of SWGA using both laboratory-prepared mixtures of cultured microbes as well as a natural host-microbe association. Targeted amplification of Borrelia burgdorferi mixed with Escherichia coli at genome ratios of 1:2000 resulted in >10(5)-fold amplification of the target genomes with <6.7-fold amplification of the background. SWGA-treated genomic extracts from Wolbachia pipientis-infected Drosophila melanogaster resulted in up to 70% of high-throughput resequencing reads mapping to the W. pipientis genome. By contrast, 2-9% of sequencing reads were derived from W. pipientis without prior amplification. The SWGA technique results in high sequencing coverage at a fraction of the sequencing effort, thus allowing population genomic studies at affordable costs. Copyright © 2014 by the Genetics Society of America.
2011-01-01
Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants. PMID:22082336
Verde, Ignazio; Jenkins, Jerry; Dondini, Luca; Micali, Sabrina; Pagliarani, Giulia; Vendramin, Elisa; Paris, Roberta; Aramini, Valeria; Gazza, Laura; Rossini, Laura; Bassi, Daniele; Troggio, Michela; Shu, Shengqiang; Grimwood, Jane; Tartarini, Stefano; Dettori, Maria Teresa; Schmutz, Jeremy
2017-03-11
The availability of the peach genome sequence has fostered relevant research in peach and related Prunus species enabling the identification of genes underlying important horticultural traits as well as the development of advanced tools for genetic and genomic analyses. The first release of the peach genome (Peach v1.0) represented a high-quality WGS (Whole Genome Shotgun) chromosome-scale assembly with high contiguity (contig L50 214.2 kb), large portions of mapped sequences (96%) and high base accuracy (99.96%). The aim of this work was to improve the quality of the first assembly by increasing the portion of mapped and oriented sequences, correcting misassemblies and improving the contiguity and base accuracy using high-throughput linkage mapping and deep resequencing approaches. Four linkage maps with 3,576 molecular markers were used to improve the portion of mapped and oriented sequences (from 96.0% and 85.6% of Peach v1.0 to 99.2% and 98.2% of v2.0, respectively) and enabled a more detailed identification of discernible misassemblies (10.4 Mb in total). The deep resequencing approach fixed 859 homozygous SNPs (Single Nucleotide Polymorphisms) and 1347 homozygous indels. Moreover, the assembled NGS contigs enabled the closing of 212 gaps with an improvement in the contig L50 of 19.2%. The improved high quality peach genome assembly (Peach v2.0) represents a valuable tool for the analysis of the genetic diversity, domestication, and as a vehicle for genetic improvement of peach and related Prunus species. Moreover, the important phylogenetic position of peach and the absence of recent whole genome duplication (WGD) events make peach a pivotal species for comparative genomics studies aiming at elucidating plant speciation and diversification processes.
2010-01-01
Background Classical and quantitative linkage analyses of genetic crosses have traditionally been used to map genes of interest, such as those conferring chloroquine or quinine resistance in malaria parasites. Next-generation sequencing technologies now present the possibility of determining genome-wide genetic variation at single base-pair resolution. Here, we combine in vivo experimental evolution, a rapid genetic strategy and whole genome re-sequencing to identify the precise genetic basis of artemisinin resistance in a lineage of the rodent malaria parasite, Plasmodium chabaudi. Such genetic markers will further the investigation of resistance and its control in natural infections of the human malaria, P. falciparum. Results A lineage of isogenic in vivo drug-selected mutant P. chabaudi parasites was investigated. By measuring the artemisinin responses of these clones, the appearance of an in vivo artemisinin resistance phenotype within the lineage was defined. The underlying genetic locus was mapped to a region of chromosome 2 by Linkage Group Selection in two different genetic crosses. Whole-genome deep coverage short-read re-sequencing (Illumina® Solexa) defined the point mutations, insertions, deletions and copy-number variations arising in the lineage. Eight point mutations arise within the mutant lineage, only one of which appears on chromosome 2. This missense mutation arises contemporaneously with artemisinin resistance and maps to a gene encoding a de-ubiquitinating enzyme. Conclusions This integrated approach facilitates the rapid identification of mutations conferring selectable phenotypes, without prior knowledge of biological and molecular mechanisms. For malaria, this model can identify candidate genes before resistant parasites are commonly observed in natural human malaria populations. PMID:20846421
Hirano, Tomonari; Kazama, Yusuke; Ishii, Kotaro; Ohbu, Sumie; Shirakawa, Yuki; Abe, Tomoko
2015-04-01
Heavy-ion beams are widely used for mutation breeding and molecular biology. Although the mutagenic effects of heavy-ion beam irradiation have been characterized by sequence analysis of some restricted chromosomal regions or loci, there have been no evaluations at the whole-genome level or of the detailed genomic rearrangements in the mutant genomes. In this study, using array comparative genomic hybridization (array-CGH) and resequencing, we comprehensively characterized the mutations in Arabidopsis thaliana genomes irradiated with Ar or Fe ions. We subsequently used this information to investigate the mutagenic effects of the heavy-ion beams. Array-CGH demonstrated that the average number of deleted areas per genome were 1.9 and 3.7 following Ar-ion and Fe-ion irradiation, respectively, with deletion sizes ranging from 149 to 602,180 bp; 81% of the deletions were accompanied by genomic rearrangements. To provide a further detailed analysis, the genomes of the mutants induced by Ar-ion beam irradiation were resequenced, and total mutations, including base substitutions, duplications, in/dels, inversions, and translocations, were detected using three algorithms. All three resequenced mutants had genomic rearrangements. Of the 22 DNA fragments that contributed to the rearrangements, 19 fragments were responsible for the intrachromosomal rearrangements, and multiple rearrangements were formed in the localized regions of the chromosomes. The interchromosomal rearrangements were detected in the multiply rearranged regions. These results indicate that the heavy-ion beams led to clustered DNA damage in the chromosome, and that they have great potential to induce complicated intrachromosomal rearrangements. Heavy-ion beams will prove useful as unique mutagens for plant breeding and the establishment of mutant lines. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
Kim, Tae-Sung; He, Qiang; Kim, Kyu-Won; Yoon, Min-Young; Ra, Won-Hee; Li, Feng Peng; Tong, Wei; Yu, Jie; Oo, Win Htet; Choi, Buung; Heo, Eun-Beom; Yun, Byoung-Kook; Kwon, Soon-Jae; Kwon, Soon-Wook; Cho, Yoo-Hyun; Lee, Chang-Yong; Park, Beom-Seok; Park, Yong-Jin
2016-05-26
Rice germplasm collections continue to grow in number and size around the world. Since maintaining and screening such massive resources remains challenging, it is important to establish practical methods to manage them. A core collection, by definition, refers to a subset of the entire population that preserves the majority of genetic diversity, enhancing the efficiency of germplasm utilization. Here, we report whole-genome resequencing of the 137 rice mini core collection or Korean rice core set (KRICE_CORE) that represents 25,604 rice germplasms deposited in the Korean genebank of the Rural Development Administration (RDA). We implemented the Illumina HiSeq 2000 and 2500 platform to produce short reads and then assembled those with 9.8 depths using Nipponbare as a reference. Comparisons of the sequences with the reference genome yielded more than 15 million (M) single nucleotide polymorphisms (SNPs) and 1.3 M INDELs. Phylogenetic and population analyses using 2,046,529 high-quality SNPs successfully assigned rice accessions to the relevant rice subgroups, suggesting that these SNPs capture evolutionary signatures that have accumulated in rice subpopulations. Furthermore, genome-wide association studies (GWAS) for four exemplary agronomic traits in the KRIC_CORE manifest the utility of KRICE_CORE; that is, identifying previously defined genes or novel genetic factors that potentially regulate important phenotypes. This study provides strong evidence that the size of KRICE_CORE is small but contains high genetic and functional diversity across the genome. Thus, our resequencing results will be useful for future breeding, as well as functional and evolutionary studies, in the post-genomic era.
Cefalù, Angelo B; Spina, Rossella; Noto, Davide; Ingrassia, Valeria; Valenti, Vincenza; Giammanco, Antonina; Fayer, Francesca; Misiano, Gabriella; Cocorullo, Gianfranco; Scrimali, Chiara; Palesano, Ornella; Altieri, Grazia I; Ganci, Antonina; Barbagallo, Carlo M; Averna, Maurizio R
Severe hypertriglyceridemia (HTG) may result from mutations in genes affecting the intravascular lipolysis of triglyceride (TG)-rich lipoproteins. The aim of this study was to develop a targeted next-generation sequencing panel for the molecular diagnosis of disorders characterized by severe HTG. We developed a targeted customized panel for next-generation sequencing Ion Torrent Personal Genome Machine to capture the coding exons and intron/exon boundaries of 18 genes affecting the main pathways of TG synthesis and metabolism. We sequenced 11 samples of patients with severe HTG (TG>885 mg/dL-10 mmol/L): 4 positive controls in whom pathogenic mutations had previously been identified by Sanger sequencing and 7 patients in whom the molecular defect was still unknown. The customized panel was accurate, and it allowed to confirm genetic variants previously identified in all positive controls with primary severe HTG. Only 1 patient of 7 with HTG was found to be carrier of a homozygous pathogenic mutation of the third novel mutation of LMF1 gene (c.1380C>G-p.Y460X). The clinical and molecular familial cascade screening allowed the identification of 2 additional affected siblings and 7 heterozygous carriers of the mutation. We showed that our targeted resequencing approach for genetic diagnosis of severe HTG appears to be accurate, less time consuming, and more economical compared with traditional Sanger resequencing. The identification of pathogenic mutations in candidate genes remains challenging and clinical resequencing should mainly intended for patients with strong clinical criteria for monogenic severe HTG. Copyright © 2017 National Lipid Association. Published by Elsevier Inc. All rights reserved.
Raghavan, Avanthi; Neeli, Hemanth; Jin, Weijun; Badellino, Karen O.; Demissie, Serkalem; Manning, Alisa K.; DerOhannessian, Stephanie L.; Wolfe, Megan L.; Cupples, L. Adrienne; Li, Mingyao; Kathiresan, Sekar; Rader, Daniel J.
2011-01-01
Genome-wide association studies (GWAS) have successfully identified loci associated with quantitative traits, such as blood lipids. Deep resequencing studies are being utilized to catalogue the allelic spectrum at GWAS loci. The goal of these studies is to identify causative variants and missing heritability, including heritability due to low frequency and rare alleles with large phenotypic impact. Whereas rare variant efforts have primarily focused on nonsynonymous coding variants, we hypothesized that noncoding variants in these loci are also functionally important. Using the HDL-C gene LIPG as an example, we explored the effect of regulatory variants identified through resequencing of subjects at HDL-C extremes on gene expression, protein levels, and phenotype. Resequencing a portion of the LIPG promoter and 5′ UTR in human subjects with extreme HDL-C, we identified several rare variants in individuals from both extremes. Luciferase reporter assays were used to measure the effect of these rare variants on LIPG expression. Variants conferring opposing effects on gene expression were enriched in opposite extremes of the phenotypic distribution. Minor alleles of a common regulatory haplotype and noncoding GWAS SNPs were associated with reduced plasma levels of the LIPG gene product endothelial lipase (EL), consistent with its role in HDL-C catabolism. Additionally, we found that a common nonfunctional coding variant associated with HDL-C (rs2000813) is in linkage disequilibrium with a 5′ UTR variant (rs34474737) that decreases LIPG promoter activity. We attribute the gene regulatory role of rs34474737 to the observed association of the coding variant with plasma EL levels and HDL-C. Taken together, the findings show that both rare and common noncoding regulatory variants are important contributors to the allelic spectrum in complex trait loci. PMID:22174694
2014-10-01
amplicon of Corona Virus RdP gene. Finally, one PCR amplicon of a Chikungunya virus gene from the VHF group was sequenced. These sequence data are...suggestive of STIs ( discharge or genital ulcer) often go undiagnosed, and are treated empirically with broad spectrum antibiotics. The drug resistance... discharge are offered anonymous screening for gonorrhea and chlamydia (GC) and specimen taken for detection and isolation of Neisseria gonorrhoeae
Moreno Andrade, Vicente D.; Saldaña Gutiérrez, Carlos; Calvillo Medina, Rosa P.; Cruz Hérnandez, Andrés; Vázquez Cruz, Moisés A.; Torres Ruíz, Alfonso; Romero Gómez, Sergio; Ramos López, Miguel A.; Álvarez-Hidalgo, Erika; López-Gaytan, Silvia B.; Ramírez, Natanahel Salvador; Jones, George H.
2018-01-01
ABSTRACT Bee pollen is a highly nutritive natural foodstuff. Because of its use as a comestible, the association of bacteria with bee pollen is commercially and biologically important. We report here the bacterial diversity of seven bee pollen samples (five from Europe, one from Chile, and one from Mexico) based on 16S rRNA gene amplicon metagenome sequencing. PMID:29773615
DNA detection rates of host mtDNA in bloodmeals of human body lice (Pediculus humanus L., 1758).
Davey, J S; Casey, C S; Burgess, I F; Cable, J
2007-09-01
Using polymerase chain reaction, we investigated the extent to which digestion affects the potential to amplify 12S mitochondrial DNA sequences from bloodmeals of individual human body lice (Pediculus humanus L.) (Phthiraptera, Pediculidae) up to 72 h after feeding on a surrogate rabbit host (Oryctolagus cuniculus L.) (Lagomorpha, Leporidae). Two rabbit-specific primer pairs were developed to produce amplicons of 199 bp and 283 bp, the smaller of which was found to have a significantly slower decay rate. Median detection periods (T50) for the amplicons were 20 h and 12 h, with maximum detection periods of 24 h and 12 h, respectively, suggesting an inversely proportional linear relationship between amplicon size and digestion time. The data provide an indication of timeframes essential for the design of forensic sampling protocols and a basis for investigating the feeding frequency of human lice.
A tonoplast sugar transporter underlies a sugar accumulation QTL in watermelon
USDA-ARS?s Scientific Manuscript database
The molecular mechanism controlling accumulation of soluble sugars in watermelon (Citrullus lanatus) fruit, a trait associated with sweet-dessert watermelon domestication, is still unknown. We re-sequenced 96 recombinant inbred lines, derived from a cross between sweet and unsweet watermelon accessi...
Lin, Baochuan; Malanoski, Anthony P.; Wang, Zheng; Blaney, Kate M.; Long, Nina C.; Meador, Carolyn E.; Metzgar, David; Myers, Christopher A.; Yingst, Samuel L.; Monteville, Marshall R.; Saad, Magdi D.; Schnur, Joel M.; Tibbetts, Clark; Stenger, David A.
2009-01-01
Zoonotic microbes have historically been, and continue to emerge as, threats to human health. The recent outbreaks of highly pathogenic avian influenza virus in bird populations and the appearance of some human infections have increased the concern of a possible new influenza pandemic, which highlights the need for broad-spectrum detection methods for rapidly identifying the spread or outbreak of all variants of avian influenza virus. In this study, we demonstrate that high-density resequencing pathogen microarrays (RPM) can be such a tool. The results from 37 influenza virus isolates show that the RPM platform is an effective means for detecting and subtyping influenza virus, while simultaneously providing sequence information for strain resolution, pathogenicity, and drug resistance without additional analysis. This study establishes that the RPM platform is a broad-spectrum pathogen detection and surveillance tool for monitoring the circulation of prevalent influenza viruses in the poultry industry and in wild birds or incidental exposures and infections in humans. PMID:19279171
[Fine mapping of complex disease susceptibility loci].
Song, Qingfeng; Zhang, Hongxing; Ma, Yilong; Zhou, Gangqiao
2014-01-01
Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers have identified more than 3800 susceptibility loci for more than 660 diseases or traits. However, the most significantly associated variants or causative variants in these loci and their biological functions have remained to be clarified. These causative variants can help to elucidate the pathogenesis and discover new biomarkers of complex diseases. One of the main goals in the post-GWAS era is to identify the causative variants and susceptibility genes, and clarify their functional aspects by fine mapping. For common variants, imputation or re-sequencing based strategies were implemented to increase the number of analyzed variants and help to identify the most significantly associated variants. In addition, functional element, expression quantitative trait locus (eQTL) and haplotype analyses were performed to identify functional common variants and susceptibility genes. For rare variants, fine mapping was carried out by re-sequencing, rare haplotype analysis, family-based analysis, burden test, etc.This review summarizes the strategies and problems for fine mapping.
Kamada, Mayumi; Hase, Sumitaka; Sato, Kengo; Toyoda, Atsushi; Fujiyama, Asao; Sakakibara, Yasubumi
2014-01-01
De novo microbial genome sequencing reached a turning point with third-generation sequencing (TGS) platforms, and several microbial genomes have been improved by TGS long reads. Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and it has a function in the production of the traditional Japanese fermented food “natto.” The B. subtilis natto BEST195 genome was previously sequenced with short reads, but it included some incomplete regions. We resequenced the BEST195 genome using a PacBio RS sequencer, and we successfully obtained a complete genome sequence from one scaffold without any gaps, and we also applied Illumina MiSeq short reads to enhance quality. Compared with the previous BEST195 draft genome and Marburg 168 genome, we found that incomplete regions in the previous genome sequence were attributed to GC-bias and repetitive sequences, and we also identified some novel genes that are found only in the new genome. PMID:25329997
Detecting directional selection in the presence of recent admixture in African-Americans.
Lohmueller, Kirk E; Bustamante, Carlos D; Clark, Andrew G
2011-03-01
We investigate the performance of tests of neutrality in admixed populations using plausible demographic models for African-American history as well as resequencing data from African and African-American populations. The analysis of both simulated and human resequencing data suggests that recent admixture does not result in an excess of false-positive results for neutrality tests based on the frequency spectrum after accounting for the population growth in the parental African population. Furthermore, when simulating positive selection, Tajima's D, Fu and Li's D, and haplotype homozygosity have lower power to detect population-specific selection using individuals sampled from the admixed population than from the nonadmixed population. Fay and Wu's H test, however, has more power to detect selection using individuals from the admixed population than from the nonadmixed population, especially when the selective sweep ended long ago. Our results have implications for interpreting recent genome-wide scans for positive selection in human populations. © 2011 by the Genetics Society of America
Rapid genome resequencing of an atoxigenic strain of Aspergillus carbonarius
Cabañes, F. Javier; Sanseverino, Walter; Castellá, Gemma; ...
2015-03-13
In microorganisms, Ion Torrent sequencing technology has been proved to be useful in whole-genome sequencing of bacterial genomes (5 Mbp). In our study, for the first time we used this technology to perform a resequencing approach in a whole fungal genome (36 Mbp), a non-ochratoxin A producing strain of Aspergillus carbonarius. Ochratoxin A (OTA) is a potent nephrotoxin which is found mainly in cereals and their products, but it also occurs in a variety of common foods and beverages. Due to the fact that this strain does not produce OTA, we focused some of the bioinformatics analyses in genes involvedmore » in OTA biosynthesis, using a reference genome of an OTA producing strain of the same species. This study revealed that in the atoxigenic strain there is a high accumulation of nonsense and missense mutations in several genes. Importantly, a two fold increase in gene mutation ratio was observed in PKS and NRPS encoding genes which are suggested to be involved in OTA biosynthesis.« less
Identifying disease polymorphisms from case-control genetic association data.
Park, L
2010-12-01
In case-control association studies, it is typical to observe several associated polymorphisms in a gene region. Often the most significantly associated polymorphism is considered to be the disease polymorphism; however, it is not clear whether it is the disease polymorphism or there is more than one disease polymorphism in the gene region. Currently, there is no method that can handle these problems based on the linkage disequilibrium (LD) relationship between polymorphisms. To distinguish real disease polymorphisms from markers in LD, a method that can detect disease polymorphisms in a gene region has been developed. Relying on the LD between polymorphisms in controls, the proposed method utilizes model-based likelihood ratio tests to find disease polymorphisms. This method shows reliable Type I and Type II error rates when sample sizes are large enough, and works better with re-sequenced data. Applying this method to fine mapping using re-sequencing or dense genotyping data would provide important information regarding the genetic architecture of complex traits.
Ma, Zhiying; He, Shoupu; Wang, Xingfen; Sun, Junling; Zhang, Yan; Zhang, Guiyin; Wu, Liqiang; Li, Zhikun; Liu, Zhihao; Sun, Gaofei; Yan, Yuanyuan; Jia, Yinhua; Yang, Jun; Pan, Zhaoe; Gu, Qishen; Li, Xueyuan; Sun, Zhengwen; Dai, Panhong; Liu, Zhengwen; Gong, Wenfang; Wu, Jinhua; Wang, Mi; Liu, Hengwei; Feng, Keyun; Ke, Huifeng; Wang, Junduo; Lan, Hongyu; Wang, Guoning; Peng, Jun; Wang, Nan; Wang, Liru; Pang, Baoyin; Peng, Zhen; Li, Ruiqiang; Tian, Shilin; Du, Xiongming
2018-05-07
Upland cotton is the most important natural-fiber crop. The genomic variation of diverse germplasms and alleles underpinning fiber quality and yield should be extensively explored. Here, we resequenced a core collection comprising 419 accessions with 6.55-fold coverage depth and identified approximately 3.66 million SNPs for evaluating the genomic variation. We performed phenotyping across 12 environments and conducted genome-wide association study of 13 fiber-related traits. 7,383 unique SNPs were significantly associated with these traits and were located within or near 4,820 genes; more associated loci were detected for fiber quality than fiber yield, and more fiber genes were detected in the D than the A subgenome. Several previously undescribed causal genes for days to flowering, fiber length, and fiber strength were identified. Phenotypic selection for these traits increased the frequency of elite alleles during domestication and breeding. These results provide targets for molecular selection and genetic manipulation in cotton improvement.
Illuminator, a desktop program for mutation detection using short-read clonal sequencing.
Carr, Ian M; Morgan, Joanne E; Diggle, Christine P; Sheridan, Eamonn; Markham, Alexander F; Logan, Clare V; Inglehearn, Chris F; Taylor, Graham R; Bonthron, David T
2011-10-01
Current methods for sequencing clonal populations of DNA molecules yield several gigabases of data per day, typically comprising reads of < 100 nt. Such datasets permit widespread genome resequencing and transcriptome analysis or other quantitative tasks. However, this huge capacity can also be harnessed for the resequencing of smaller (gene-sized) target regions, through the simultaneous parallel analysis of multiple subjects, using sample "tagging" or "indexing". These methods promise to have a huge impact on diagnostic mutation analysis and candidate gene testing. Here we describe a software package developed for such studies, offering the ability to resolve pooled samples carrying barcode tags and to align reads to a reference sequence using a mutation-tolerant process. The program, Illuminator, can identify rare sequence variants, including insertions and deletions, and permits interactive data analysis on standard desktop computers. It facilitates the effective analysis of targeted clonal sequencer data without dedicated computational infrastructure or specialized training. Copyright © 2011 Elsevier Inc. All rights reserved.
Characterization of GM events by insert knowledge adapted re-sequencing approaches
Yang, Litao; Wang, Congmao; Holst-Jensen, Arne; Morisset, Dany; Lin, Yongjun; Zhang, Dabing
2013-01-01
Detection methods and data from molecular characterization of genetically modified (GM) events are needed by stakeholders of public risk assessors and regulators. Generally, the molecular characteristics of GM events are incomprehensively revealed by current approaches and biased towards detecting transformation vector derived sequences. GM events are classified based on available knowledge of the sequences of vectors and inserts (insert knowledge). Herein we present three insert knowledge-adapted approaches for characterization GM events (TT51-1 and T1c-19 rice as examples) based on paired-end re-sequencing with the advantages of comprehensiveness, accuracy, and automation. The comprehensive molecular characteristics of two rice events were revealed with additional unintended insertions comparing with the results from PCR and Southern blotting. Comprehensive transgene characterization of TT51-1 and T1c-19 is shown to be independent of a priori knowledge of the insert and vector sequences employing the developed approaches. This provides an opportunity to identify and characterize also unknown GM events. PMID:24088728
Crellen, Thomas; Allan, Fiona; David, Sophia; Durrant, Caroline; Huckvale, Thomas; Holroyd, Nancy; Emery, Aidan M; Rollinson, David; Aanensen, David M; Berriman, Matthew; Webster, Joanne P; Cotton, James A
2016-02-16
Schistosoma mansoni is a parasitic fluke that infects millions of people in the developing world. This study presents the first application of population genomics to S. mansoni based on high-coverage resequencing data from 10 global isolates and an isolate of the closely-related Schistosoma rodhaini, which infects rodents. Using population genetic tests, we document genes under directional and balancing selection in S. mansoni that may facilitate adaptation to the human host. Coalescence modeling reveals the speciation of S. mansoni and S. rodhaini as 107.5-147.6KYA, a period which overlaps with the earliest archaeological evidence for fishing in Africa. Our results indicate that S. mansoni originated in East Africa and experienced a decline in effective population size 20-90KYA, before dispersing across the continent during the Holocene. In addition, we find strong evidence that S. mansoni migrated to the New World with the 16-19th Century Atlantic Slave Trade.
Characterization of GM events by insert knowledge adapted re-sequencing approaches.
Yang, Litao; Wang, Congmao; Holst-Jensen, Arne; Morisset, Dany; Lin, Yongjun; Zhang, Dabing
2013-10-03
Detection methods and data from molecular characterization of genetically modified (GM) events are needed by stakeholders of public risk assessors and regulators. Generally, the molecular characteristics of GM events are incomprehensively revealed by current approaches and biased towards detecting transformation vector derived sequences. GM events are classified based on available knowledge of the sequences of vectors and inserts (insert knowledge). Herein we present three insert knowledge-adapted approaches for characterization GM events (TT51-1 and T1c-19 rice as examples) based on paired-end re-sequencing with the advantages of comprehensiveness, accuracy, and automation. The comprehensive molecular characteristics of two rice events were revealed with additional unintended insertions comparing with the results from PCR and Southern blotting. Comprehensive transgene characterization of TT51-1 and T1c-19 is shown to be independent of a priori knowledge of the insert and vector sequences employing the developed approaches. This provides an opportunity to identify and characterize also unknown GM events.
Rapid genome resequencing of an atoxigenic strain of Aspergillus carbonarius
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cabañes, F. Javier; Sanseverino, Walter; Castellá, Gemma
In microorganisms, Ion Torrent sequencing technology has been proved to be useful in whole-genome sequencing of bacterial genomes (5 Mbp). In our study, for the first time we used this technology to perform a resequencing approach in a whole fungal genome (36 Mbp), a non-ochratoxin A producing strain of Aspergillus carbonarius. Ochratoxin A (OTA) is a potent nephrotoxin which is found mainly in cereals and their products, but it also occurs in a variety of common foods and beverages. Due to the fact that this strain does not produce OTA, we focused some of the bioinformatics analyses in genes involvedmore » in OTA biosynthesis, using a reference genome of an OTA producing strain of the same species. This study revealed that in the atoxigenic strain there is a high accumulation of nonsense and missense mutations in several genes. Importantly, a two fold increase in gene mutation ratio was observed in PKS and NRPS encoding genes which are suggested to be involved in OTA biosynthesis.« less
Synthetic spike-in standards for high-throughput 16S rRNA gene amplicon sequencing
Tourlousse, Dieter M.; Yoshiike, Satowa; Ohashi, Akiko; Matsukura, Satoko; Noda, Naohiro
2017-01-01
Abstract High-throughput sequencing of 16S rRNA gene amplicons (16S-seq) has become a widely deployed method for profiling complex microbial communities but technical pitfalls related to data reliability and quantification remain to be fully addressed. In this work, we have developed and implemented a set of synthetic 16S rRNA genes to serve as universal spike-in standards for 16S-seq experiments. The spike-ins represent full-length 16S rRNA genes containing artificial variable regions with negligible identity to known nucleotide sequences, permitting unambiguous identification of spike-in sequences in 16S-seq read data from any microbiome sample. Using defined mock communities and environmental microbiota, we characterized the performance of the spike-in standards and demonstrated their utility for evaluating data quality on a per-sample basis. Further, we showed that staggered spike-in mixtures added at the point of DNA extraction enable concurrent estimation of absolute microbial abundances suitable for comparative analysis. Results also underscored that template-specific Illumina sequencing artifacts may lead to biases in the perceived abundance of certain taxa. Taken together, the spike-in standards represent a novel bioanalytical tool that can substantially improve 16S-seq-based microbiome studies by enabling comprehensive quality control along with absolute quantification. PMID:27980100
Mehetre, Gajanan T.; Paranjpe, Aditi; Dastager, Syed G.
2016-01-01
Microbial diversity in geothermal waters of the Unkeshwar hot springs in Maharashtra, India, was studied using 16S rRNA amplicon metagenomic sequencing. Taxonomic analysis revealed the presence of Bacteroidetes, Proteobacteria, Cyanobacteria, Actinobacteria, Archeae, and OD1 phyla. Metabolic function prediction analysis indicated a battery of biological information systems indicating rich and novel microbial diversity, with potential biotechnological applications in this niche. PMID:26950332
Moreno Andrade, Vicente D; Saldaña Gutiérrez, Carlos; Calvillo Medina, Rosa P; Cruz Hérnandez, Andrés; Vázquez Cruz, Moisés A; Torres Ruíz, Alfonso; Romero Gómez, Sergio; Ramos López, Miguel A; Álvarez-Hidalgo, Erika; López-Gaytan, Silvia B; Ramírez, Natanahel Salvador; Jones, George H; Hernandez-Flores, Jose Luis; Campos-Guillén, Juan
2018-05-17
Bee pollen is a highly nutritive natural foodstuff. Because of its use as a comestible, the association of bacteria with bee pollen is commercially and biologically important. We report here the bacterial diversity of seven bee pollen samples (five from Europe, one from Chile, and one from Mexico) based on 16S rRNA gene amplicon metagenome sequencing. Copyright © 2018 Moreno Andrade et al.
Structure of the Bacterial Community in Different Stages of Early Childhood Caries.
Ximenes, Marcos; Armas, Rafael Dutra de; Triches, Thaisa Cezária; Cardoso, Mariane; Vieira, Ricardo de Souza
2018-01-15
To characterise in vivo the structure of bacterial communities in decayed and sound primary teeth. Samples of biofilms were collected from three groups of patients with complete and exclusively primary dentition (n = 45): G1: sound teeth (n = 15); G2: enamel lesion (n = 15); G3: dentin lesion (n = 15). DNA was extracted (CTAB 2%) from the biofilm, the partial 16S rRNA gene was amplified with Bacteria Universal Primers (BA338fGC - UN518r) and subjected to DGGE (denaturing gradient gel electrophoresis). Multidimensional scaling and ANOSIM (analysis of similarity) were employed to determine the structure of the bacterial communities. The amplicon richness was determined by averaging amplicons, with the differences between treatments determined with ANOVA, while means were compared using Tukey's test (p < 0.05). Compared to sound teeth, a greater variety of bacterial communities was found in decayed teeth. Despite the differences between the bacterial communities of sound teeth and decayed teeth, the Venn diagram showed that the samples had 38 amplicons in common. Greater amplicon richness was observed in samples of decayed teeth (enamel: 20.5 ± 2.7; dentin: 20.1 ± 2.8) compared with the sound samples (12.0 ± 4.3) (p <0.05), indicating enhanced growth for specific groups of bacteria on decayed teeth. Although there is less bacterial diversity on sound than ECC-decayed teeth, the bacterial communities are very similar.
Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
McDonald, Daniel; Gonzalez, Antonio; Navas-Molina, Jose A.; Jiang, Lingjing; Xu, Zhenjiang Zech; Winker, Kevin; Kado, Deborah M.; Orwoll, Eric; Manary, Mark; Mirarab, Siavash
2018-01-01
ABSTRACT Recent algorithmic advances in amplicon-based microbiome studies enable the inference of exact amplicon sequence fragments. These new methods enable the investigation of sub-operational taxonomic units (sOTU) by removing erroneous sequences. However, short (e.g., 150-nucleotide [nt]) DNA sequence fragments do not contain sufficient phylogenetic signal to reproduce a reasonable tree, introducing a barrier in the utilization of critical phylogenetically aware metrics such as Faith’s PD or UniFrac. Although fragment insertion methods do exist, those methods have not been tested for sOTUs from high-throughput amplicon studies in insertions against a broad reference phylogeny. We benchmarked the SATé-enabled phylogenetic placement (SEPP) technique explicitly against 16S V4 sequence fragments and showed that it outperforms the conceptually problematic but often-used practice of reconstructing de novo phylogenies. In addition, we provide a BSD-licensed QIIME2 plugin (https://github.com/biocore/q2-fragment-insertion) for SEPP and integration into the microbial study management platform QIITA. IMPORTANCE The move from OTU-based to sOTU-based analysis, while providing additional resolution, also introduces computational challenges. We demonstrate that one popular method of dealing with sOTUs (building a de novo tree from the short sequences) can provide incorrect results in human gut metagenomic studies and show that phylogenetic placement of the new sequences with SEPP resolves this problem while also yielding other benefits over existing methods. PMID:29719869
Unlabeled probes for the detection and typing of herpes simplex virus.
Dames, Shale; Pattison, David C; Bromley, L Kathryn; Wittwer, Carl T; Voelkerding, Karl V
2007-10-01
Unlabeled probe detection with a double-stranded DNA (dsDNA) binding dye is one method to detect and confirm target amplification after PCR. Unlabeled probes and amplicon melting have been used to detect small deletions and single-nucleotide polymorphisms in assays where template is in abundance. Unlabeled probes have not been applied to low-level target detection, however. Herpes simplex virus (HSV) was chosen as a model to compare the unlabeled probe method to an in-house reference assay using dual-labeled, minor groove binding probes. A saturating dsDNA dye (LCGreen Plus) was used for real-time PCR. HSV-1, HSV-2, and an internal control were differentiated by PCR amplicon and unlabeled probe melting analysis after PCR. The unlabeled probe technique displayed 98% concordance with the reference assay for the detection of HSV from a variety of archived clinical samples (n = 182). HSV typing using unlabeled probes was 99% concordant (n = 104) to sequenced clinical samples and allowed for the detection of sequence polymorphisms in the amplicon and under the probe. Unlabeled probes and amplicon melting can be used to detect and genotype as few as 10 copies of target per reaction, restricted only by stochastic limitations. The use of unlabeled probes provides an attractive alternative to conventional fluorescence-labeled, probe-based assays for genotyping and detection of HSV and might be useful for other low-copy targets where typing is informative.
Kristensen, Lasse S; Andersen, Gitte B; Hager, Henrik; Hansen, Lise Lotte
2012-01-01
Sensitive and specific mutation detection is of particular importance in cancer diagnostics, prognostics, and individualized patient treatment. However, the majority of molecular methodologies that have been developed with the aim of increasing the sensitivity of mutation testing have drawbacks in terms of specificity, convenience, or costs. Here, we have established a new method, Competitive Amplification of Differentially Melting Amplicons (CADMA), which allows very sensitive and specific detection of all mutation types. The principle of the method is to amplify wild-type and mutated sequences simultaneously using a three-primer system. A mutation-specific primer is designed to introduce melting temperature decreasing mutations in the resulting mutated amplicon, while a second overlapping primer is designed to amplify both wild-type and mutated sequences. When combined with a third common primer very sensitive mutation detection becomes possible, when using high-resolution melting (HRM) as detection platform. The introduction of melting temperature decreasing mutations in the mutated amplicon also allows for further mutation enrichment by fast coamplification at lower denaturation temperature PCR (COLD-PCR). For proof-of-concept, we have designed CADMA assays for clinically relevant BRAF, EGFR, KRAS, and PIK3CA mutations, which are sensitive to, between 0.025% and 0.25%, mutated alleles in a wild-type background. In conclusion, CADMA enables highly sensitive and specific mutation detection by HRM analysis. © 2011 Wiley Periodicals, Inc.
Translational genomics for analysis of complex traits in peanut and sorghum
USDA-ARS?s Scientific Manuscript database
The integration of sequencing and genotype data from natural variation studies (by whole genome resequencing [wgs] or genotype by sequencing [gbs]), transcriptome (RNA-seq) and mutant analysis (also by wgs) facilitated the development of DNA markers in the form of single nucleotide polymorphic (SNP)...
Integrated translational genomics for analysis of complex traits in sorghum
USDA-ARS?s Scientific Manuscript database
We will report on the integration of sequencing and genotype data from natural variation (by whole genome resequencing [wgs] or genotype by sequencing [gbs]), transcriptome (RNA-seq) and mutant analysis (also by wgs) with the goal of identifying genes controlling important agronomic traits and tran...
SNPMeta: SNP annotation and SNP metadata collection without a reference genome
USDA-ARS?s Scientific Manuscript database
The increase in availability of resequencing data is greatly accelerating SNP discovery and has facilitated the development of SNP genotyping assays. This, in turn, is increasing interest in annotation of individual SNPs. Currently, these data are only available through curation, or comparison to a ...
USDA-ARS?s Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
Comparative population genomics of maize domestication and improvement
USDA-ARS?s Scientific Manuscript database
Domestication and modern breeding represent exemplary case studies of evolution in action. Maize is an outcrossing species with a complex genome, and an understanding of maize evolution is thus relevant for both plant and animal systems. This study is the largest plant resequencing effort to date, ...
Liu, Wen; Ghouri, Fozia; Yu, Hang; Li, Xiang; Yu, Shuhong; Shahid, Muhammad Qasim; Liu, Xiangdong
2017-01-01
Common wild rice (Oryza rufipogon Griff.) is an important germplasm for rice breeding, which contains many resistance genes. Re-sequencing provides an unprecedented opportunity to explore the abundant useful genes at whole genome level. Here, we identified the nucleotide-binding site leucine-rich repeat (NBS-LRR) encoding genes by re-sequencing of two wild rice lines (i.e. Huaye 1 and Huaye 2) that were developed from common wild rice. We obtained 128 to 147 million reads with approximately 32.5-fold coverage depth, and uniquely covered more than 89.6% (> = 1 fold) of reference genomes. Two wild rice lines showed high SNP (single-nucleotide polymorphisms) variation rate in 12 chromosomes against the reference genomes of Nipponbare (japonica cultivar) and 93-11 (indica cultivar). InDels (insertion/deletion polymorphisms) count-length distribution exhibited normal distribution in the two lines, and most of the InDels were ranged from -5 to 5 bp. With reference to the Nipponbare genome sequence, we detected a total of 1,209,308 SNPs, 161,117 InDels and 4,192 SVs (structural variations) in Huaye 1, and 1,387,959 SNPs, 180,226 InDels and 5,305 SVs in Huaye 2. A total of 44.9% and 46.9% genes exhibited sequence variations in two wild rice lines compared to the Nipponbare and 93-11 reference genomes, respectively. Analysis of NBS-LRR mutant candidate genes showed that they were mainly distributed on chromosome 11, and NBS domain was more conserved than LRR domain in both wild rice lines. NBS genes depicted higher levels of genetic diversity in Huaye 1 than that found in Huaye 2. Furthermore, protein-protein interaction analysis showed that NBS genes mostly interacted with the cytochrome C protein (Os05g0420600, Os01g0885000 and BGIOSGA038922), while some NBS genes interacted with heat shock protein, DNA-binding activity, Phosphoinositide 3-kinase and a coiled coil region. We explored abundant NBS-LRR encoding genes in two common wild rice lines through genome wide re-sequencing, which proved to be a useful tool to exploit elite NBS-LRR genes in wild rice. The data here provide a foundation for future work aimed at dissecting the genetic basis of disease resistance in rice, and the two wild rice lines will be useful germplasm for the molecular improvement of cultivated rice.
Yu, Hang; Li, Xiang; Yu, Shuhong; Shahid, Muhammad Qasim
2017-01-01
Common wild rice (Oryza rufipogon Griff.) is an important germplasm for rice breeding, which contains many resistance genes. Re-sequencing provides an unprecedented opportunity to explore the abundant useful genes at whole genome level. Here, we identified the nucleotide-binding site leucine-rich repeat (NBS-LRR) encoding genes by re-sequencing of two wild rice lines (i.e. Huaye 1 and Huaye 2) that were developed from common wild rice. We obtained 128 to 147 million reads with approximately 32.5-fold coverage depth, and uniquely covered more than 89.6% (> = 1 fold) of reference genomes. Two wild rice lines showed high SNP (single-nucleotide polymorphisms) variation rate in 12 chromosomes against the reference genomes of Nipponbare (japonica cultivar) and 93–11 (indica cultivar). InDels (insertion/deletion polymorphisms) count-length distribution exhibited normal distribution in the two lines, and most of the InDels were ranged from -5 to 5 bp. With reference to the Nipponbare genome sequence, we detected a total of 1,209,308 SNPs, 161,117 InDels and 4,192 SVs (structural variations) in Huaye 1, and 1,387,959 SNPs, 180,226 InDels and 5,305 SVs in Huaye 2. A total of 44.9% and 46.9% genes exhibited sequence variations in two wild rice lines compared to the Nipponbare and 93–11 reference genomes, respectively. Analysis of NBS-LRR mutant candidate genes showed that they were mainly distributed on chromosome 11, and NBS domain was more conserved than LRR domain in both wild rice lines. NBS genes depicted higher levels of genetic diversity in Huaye 1 than that found in Huaye 2. Furthermore, protein-protein interaction analysis showed that NBS genes mostly interacted with the cytochrome C protein (Os05g0420600, Os01g0885000 and BGIOSGA038922), while some NBS genes interacted with heat shock protein, DNA-binding activity, Phosphoinositide 3-kinase and a coiled coil region. We explored abundant NBS-LRR encoding genes in two common wild rice lines through genome wide re-sequencing, which proved to be a useful tool to exploit elite NBS-LRR genes in wild rice. The data here provide a foundation for future work aimed at dissecting the genetic basis of disease resistance in rice, and the two wild rice lines will be useful germplasm for the molecular improvement of cultivated rice. PMID:28700714
Hirano, Tetsuo; Ike, Fumio; Murata, Takehide; Obata, Yuichi; Utiyama, Hiroyasu; Yokoyama, Kazunari K
2008-04-02
Human acute myeloblastic leukemia HL-60 cells become resistant to differentiation during long-term cultivation. After 150 passages, double minute chromosomes (dmins) found in early-passaged cells are replaced by large extrachromosomal elements (LEEs). In a DNA library derived from a purified fraction of LEEs, 12.6% (23/183) of clones were assigned to 8q24 and 9.2% (17/183) were assigned to 14q11 in the human genome. Fluorescence in situ hybridization (FISH) revealed a small aberrant chromosome, which had not been found in early-passaged cells, in addition to the purified LEEs. We determined that each LEE consisted of six discontinuous segments in a region that extended for 4.4Mb over the 8q24 locus. Five genes, namely, Myc (a proto-oncogene), NSMCE2 (for a SUMO ligase), CCDC26 (for a retinoic acid-dependent modulator of myeloid differentiation), TRIB1 (for a regulator of MAPK kinase) and LOC389637 (for a protein of unknown function), were encoded by the amplicon. Breaks in the chromosomal DNA within the amplicon were found in the NSMCE2 and CCDC26 genes. The discontinuous structure of the amplicon unit of the LEEs was identical with that of dmins in HL-60 early-passaged cells. The difference between them seemed, predominantly, to be the number (10-15 copies per LEE versus 2 or 3 copies per dmin) of constituent units. Expression of the Myc, NSMCE2, CCDC26 and LOC389637 and TRIB1 genes was constitutive in all lines of HL-60 cells and that of the first four genes was repressed during the terminal differentiation of early-passaged HL-60 cells. We also detected abnormal transcripts of CCDC26. Our results suggest that these genes were selected during the development of amplicons. They might be amplified and, sometimes, truncated to contribute to the maintenance of HL-60 cells in an undifferentiated state.
Reinblatt, Maura; Pin, Richard H; Bowers, William J; Federoff, Howard J; Fong, Yuman
2005-12-01
Tumor hypoxia induces vascular endothelial growth factor (VEGF) expression, which stimulates angiogenesis and tumor proliferation. The VEGF signaling pathway is inhibited by soluble VEGF receptors (soluble fetal liver kinase 1; sFlk-1), which bind VEGF and block its interaction with endothelial cells. Herpes simplex virus (HSV) amplicons are replication-incompetent viruses used for gene delivery. We attempted to attenuate angiogenesis and inhibit pancreatic tumor growth through HSV amplicon-mediated expression of sFlk-1 under hypoxic control. A multimerized hypoxia-responsive enhancer (10 x HRE) was cloned upstream of the sFlk-1 gene (10 x HRE/sFlk-1). A novel HSV amplicon expressing 10 x HRE/sFlk-1 was genetically engineered (HSV10 x HRE/sFlk-1).Human pancreatic adenocarcinoma cells (AsPC1) were transduced with HSV10 x HRE/sFlk-1 and incubated in normoxia (21% oxygen) or hypoxia (1% oxygen). Capillary inhibition was evaluated by human umbilical vein endothelial cell assay. Western blot assessed sFlk-1 expression. AsPC1 flank tumor xenografts (n = 24) were transduced with HSV10 x HRE/sFlk-1. Media from normoxic AsPC1 transduced with HSV10 x HRE/sFlk-1 yielded a 36% reduction in capillary formation versus controls (P < .05), whereas hypoxic AsPC1 yielded a 76% reduction (P < .005). Western blot of AsPC1 transduced with HSV10 x HRE/sFlk-1 demonstrated greater sFlk-1 expression in hypoxia versus normoxia. AsPC1 flank tumors treated with HSV10 x HRE/sFlk-1 exhibited a 59% reduction in volume versus controls (P < .000001). HSV amplicon delivery of a hypoxia-inducible soluble VEGF receptor significantly reduces new vessel formation and tumor growth. Tumor hypoxia can thus be used to direct antiangiogenic therapy to pancreatic adenocarcinoma.
McLaren, Robert S; Ensenberger, Martin G; Budowle, Bruce; Rabbach, Dawn; Fulmer, Patricia M; Sprecher, Cindy J; Bessetti, Joseph; Sundquist, Terri M; Storts, Douglas R
2008-09-01
Several laboratories have reported the occurrence of a split or n-1 peak at the vWA locus in PowerPlex 16 and PowerPlex ES amplification products separated on 4- and 16-capillary electrophoresis instruments. The root cause of this artifact is post-PCR reannealing of the unlabeled, unincorporated vWA primer to the 3'-end of the tetramethylrhodamine (TMR)-labeled strand of the vWA amplicon. This reannealing occurs in the capillary post-electrokinetic injection. The split peak is eliminated by incorporation into the loading cocktail of a sacrificial hybridization sequence (SHS) oligonucleotide that is complementary to the vWA primer. The SHS preferentially anneals to the primer instead of the TMR-labeled strand of the vWA amplicon. In addition, the n-10/n-18 artifact that may be seen at the vWA locus was determined to be due to double-stranded amplicon formed post-electrokinetic injection into the capillary. This was also eliminated by adding in two Complementary Oligo Targets (COT1 and COT2) in addition to the SHS oligonucleotide into the loading cocktail. These three oligonucleotides are complementary to the 33 bases at the 5'-end of the unlabeled vWA amplicon strand and the 60 bases at its 3'-end and therefore compete for hybridization to the TMR-labeled amplicon strand. Incorporation of these three oligonucleotides in the Internal Lane Standard 600 (ILS600) eliminate both the split peak and n-10/n-18 artifact in PowerPlex 16 and PowerPlex ES amplification products without affecting sizing of alleles at the vWA locus or any locus in the PowerPlex 16, PowerPlex Y, PowerPlex ES, AmpFlSTR Profiler Plus ID, AmpFlSTR Cofiler, and AmpFlSTR SGM Plus kits.
Snelling, Timothy J; Genç, Buğra; McKain, Nest; Watson, Mick; Waters, Sinéad M; Creevey, Christopher J; Wallace, R John
2014-01-01
Ruminal archaeomes of two mature sheep grazing in the Scottish uplands were analysed by different sequencing and analysis methods in order to compare the apparent archaeal communities. All methods revealed that the majority of methanogens belonged to the Methanobacteriales order containing the Methanobrevibacter, Methanosphaera and Methanobacteria genera. Sanger sequenced 1.3 kb 16S rRNA gene amplicons identified the main species of Methanobrevibacter present to be a SGMT Clade member Mbb. millerae (≥ 91% of OTUs); Methanosphaera comprised the remainder of the OTUs. The primers did not amplify ruminal Thermoplasmatales-related 16S rRNA genes. Illumina sequenced V6-V8 16S rRNA gene amplicons identified similar Methanobrevibacter spp. and Methanosphaera clades and also identified the Thermoplasmatales-related order as 13% of total archaea. Unusually, both methods concluded that Mbb. ruminantium and relatives from the same clade (RO) were almost absent. Sequences mapping to rumen 16S rRNA and mcrA gene references were extracted from Illumina metagenome data. Mapping of the metagenome data to 16S rRNA gene references produced taxonomic identification to Order level including 2-3% Thermoplasmatales, but was unable to discriminate to species level. Mapping of the metagenome data to mcrA gene references resolved 69% to unclassified Methanobacteriales. Only 30% of sequences were assigned to species level clades: of the sequences assigned to Methanobrevibacter, most mapped to SGMT (16%) and RO (10%) clades. The Sanger 16S amplicon and Illumina metagenome mcrA analyses showed similar species richness (Chao1 Index 19-35), while Illumina metagenome and amplicon 16S rRNA analysis gave lower richness estimates (10-18). The values of the Shannon Index were low in all methods, indicating low richness and uneven species distribution. Thus, although much information may be extracted from the other methods, Illumina amplicon sequencing of the V6-V8 16S rRNA gene would be the method of choice for studying rumen archaeal communities.
Kresse, Stine H; Berner, Jeanne-Marie; Meza-Zepeda, Leonardo A; Gregory, Simon G; Kuo, Wen-Lin; Gray, Joe W; Forus, Anne; Myklebost, Ola
2005-01-01
Background Amplification of the q21-q23 region on chromosome 1 is frequently found in sarcomas and a variety of other solid tumours. Previous analyses of sarcomas have indicated the presence of at least two separate amplicons within this region, one located in 1q21 and one located near the apolipoprotein A-II (APOA2) gene in 1q23. In this study we have mapped and characterized the amplicon in 1q23 in more detail. Results We have used fluorescence in situ hybridisation (FISH) and microarray-based comparative genomic hybridisation (array CGH) to map and define the borders of the amplicon in 10 sarcomas. A subregion of approximately 800 kb was identified as the core of the amplicon. The amplification patterns of nine possible candidate target genes located to this subregion were determined by Southern blot analysis. The genes activating transcription factor 6 (ATF6) and dual specificity phosphatase 12 (DUSP12) showed the highest level of amplification, and they were also shown to be over-expressed by quantitative real-time reverse transcription PCR (RT-PCR). In general, the level of expression reflected the level of amplification in the different tumours. DUSP12 was expressed significantly higher than ATF6 in a subset of the tumours. In addition, two genes known to be transcriptionally activated by ATF6, glucose-regulated protein 78 kDa and -94 kDa (GRP78 and GRP94), were shown to be over-expressed in the tumours that showed over-expression of ATF6. Conclusion ATF6 and DUSP12 seem to be the most likely candidate target genes for the 1q23 amplification in sarcomas. Both genes have possible roles in promoting cell growth, which makes them interesting candidate targets. PMID:16274472
Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan
2017-01-01
PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored.
2009-11-01
that differentially expressed tumor suppressor miRNAs can be utilized to control the replication of an oncolytic DNA virus in a tumor-specific...demonstrated that the utilization of the tissue-specific promoter and the miRNA-mediated 3’UTRs in a targeted virotherapy is a viable approach with...elements into the whole HSV-1 viral genome should increase the safety margin substantially. The major advantage of the amplicon/helper system is its
Characterization of Novel Genes Within 8P11-12 Amplicon in Breast Cancer
2007-06-01
C-myc amplification in breast cancer: a meta - analysis of its occurrence and prognostic relevance. Br J Cancer, 83: 1688-1695, 2000. 2. Hui, R...Nass SJ, Dickson RB, Trock BJ. C-myc amplification in breast cancer: a meta - analysis of its occurrence and prognostic relevance. Br J Cancer 2000;83...a detailed genomic and expression analysis of the 8p11-p12 amplicon in breast cancer cell lines and identified several novel candidate genes
[Experimental and calculated spectra of the amplicons UBC-85 and UBC-126 (RAPD-PCR)].
Glazko, G V; Rogozin, I B; Glazko, V I; Zelenaia, L B; Sozinov, A A
1997-01-01
The comparative analysis of experimental amplification spectrum in 13 Ungulata species and counting ones in DNA sequences of different taxa in GenBank (mammalian, other vertebrate, invertebrate, viruses, prokaryote) with the uses of RAPD-PCR primers UBC-85 and UBC-126 was carried out. The particularities of the distribution of amplicons' frequencies in experimental and counting spectrums were revealed, for some of them the similar increased frequencies in mammalian and prokaryotic species were observed.
Identifying Molecular Regulators of Neuronal Functions Affected in the Movement Disorder Dystonia
2015-08-01
GC-3’ (forward), 5’-CGT GTG GCT GTT GGG GTT GTT GCT GAG GTA-3’ (reverse) for the 498-bp amplicon, 5’-CAC CCT ATC AGG GGA GGA CAA CTT TCG-3’ (forward...3’ (reverse) for the 983- bp amplicon, and 5’-CAC CCT ATC AGG GGA GGA CAA CTT TCG-3’ (forward), 5’-ACA GTG TAG TAA GGC AAA GCA AGG AG-3’ (reverse) for
Mehetre, Gajanan T; Paranjpe, Aditi; Dastager, Syed G; Dharne, Mahesh S
2016-02-25
Microbial diversity in geothermal waters of the Unkeshwar hot springs in Maharashtra, India, was studied using 16S rRNA amplicon metagenomic sequencing. Taxonomic analysis revealed the presence of Bacteroidetes, Proteobacteria, Cyanobacteria, Actinobacteria, Archeae, and OD1 phyla. Metabolic function prediction analysis indicated a battery of biological information systems indicating rich and novel microbial diversity, with potential biotechnological applications in this niche. Copyright © 2016 Mehetre et al.
Integrated and translational genomics for analysis of complex traits in crops
USDA-ARS?s Scientific Manuscript database
We report here on integration of sequencing and genotype data from natural variation (by whole genome resequencing [wgs] or genotype by sequencing [gbs]), transcriptome (RNA-seq) and mutant analysis (also by wgs) with the goal of translating gems from these resources into useable DNA markers in the ...
Genomic Analyses Yield Markers for Identifying Agronomically Important Genes in Potato
USDA-ARS?s Scientific Manuscript database
This study explores the genetic architecture underling the potato evolution through a comprehensive assessment of wild and cultivated potato species based on the re-sequencing of 201 accessions of Solanum section Petota with >12 × genome coverage. We identified 450 domesticated genes, which showed e...
High-Throughput resequencing of maize landraces at genomic regions associated with flowering time
USDA-ARS?s Scientific Manuscript database
Despite the reduction in the price of sequencing, it remains expensive to sequence and assemble whole, complex genomes of multiple samples for population studies, particularly for large genomes like those of many crop species. Enrichment of target genome regions coupled with next generation sequenci...
USDA-ARS?s Scientific Manuscript database
We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE (“Assessing Changes to Exons”) converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detect...
Niklaus J. Grünwald
2012-01-01
Whole and partial genome sequences are becoming available at an ever-increasing pace. For many plant pathogen systems, we are moving into the era of genome resequencing. The first Phytophthora genomes, P. ramorum and P. sojae, became available in 2004, followed shortly by P. infestans...
USDA-ARS?s Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
Kallifatidis, Beatrice; Borovička, Jan; Stránská, Jana; Drábek, Jiří; Mills, Deetta K
2014-03-01
The capability of Fluorescent Random Amplified Microsatellites (F-RAMS) to profile hallucinogenic mushrooms to species and sub-species level was assessed. Fifteen samples of Amanita rubescens and 22 samples of other hallucinogenic and non-hallucinogenic mushrooms of the genera Amanita and Psilocybe were profiled using two fluorescently-labeled, 5'degenerate primers, 5'-6FAM-SpC3-DD (CCA)5 and 5'-6FAM-SpC3-DHB (CGA)5, which target different microsatellite repeat regions. Among the two primers, 5'-6FAM-SpC3-DHB (CGA)5 provided more reliable data for identification purposes, by grouping samples of the same species and clustering closely related species together in a dendrogram based on amplicon similarities. A high degree of intra-specific variation between the 15 A. rubescens samples was shown with both primers and the amplicons generated for all A. rubescens samples were organized into three classes of amplicons (discriminant, private, and marker) based on their individualizing potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Eichmann, Cordula; Parson, Walther
2008-09-01
The traditional protocol for forensic mitochondrial DNA (mtDNA) analyses involves the amplification and sequencing of the two hypervariable segments HVS-I and HVS-II of the mtDNA control region. The primers usually span fragment sizes of 300-400 bp each region, which may result in weak or failed amplification in highly degraded samples. Here we introduce an improved and more stable approach using shortened amplicons in the fragment range between 144 and 237 bp. Ten such amplicons were required to produce overlapping fragments that cover the entire human mtDNA control region. These were co-amplified in two multiplex polymerase chain reactions and sequenced with the individual amplification primers. The primers were carefully selected to minimize binding on homoplasic and haplogroup-specific sites that would otherwise result in loss of amplification due to mis-priming. The multiplexes have successfully been applied to ancient and forensic samples such as bones and teeth that showed a high degree of degradation.
Kim, Na Young; Lee, Hwan Young; Park, Sun Joo; Yang, Woo Ick; Shin, Kyoung-Jin
2013-05-01
Two multiplex polymerase chain reaction (PCR) systems (Midiplex and Miniplex) were developed for the amplification of the mitochondrial DNA (mtDNA) control region, and the efficiencies of the multiplexes for amplifying degraded DNA were validated using old skeletal remains. The Midiplex system consisted of two multiplex PCRs to amplify six overlapping amplicons ranging in length from 227 to 267 bp. The Miniplex system consisted of three multiplex PCRs to amplify 10 overlapping short amplicons ranging in length from 142 to 185 bp. Most mtDNA control region sequences of several 60-year-old and 400-500-year-old skeletal remains were successfully obtained using both PCR systems and consistent with those previously obtained by monoplex amplification. The multiplex system consisting of smaller amplicons is effective for mtDNA sequence analyses of ancient and forensic degraded samples, saving time, cost, and the amount of DNA sample consumed during analysis. © 2013 American Academy of Forensic Sciences.
Billmyre, R Blake; Clancey, Shelly Applen; Heitman, Joseph
2017-09-26
Pathogenic microbes confront an evolutionary conflict between the pressure to maintain genome stability and the need to adapt to mounting external stresses. Bacteria often respond with elevated mutation rates, but little evidence exists of stable eukaryotic hypermutators in nature. Whole genome resequencing of the human fungal pathogen Cryptococcus deuterogattii identified an outbreak lineage characterized by a nonsense mutation in the mismatch repair component MSH2. This defect results in a moderate mutation rate increase in typical genes, and a larger increase in genes containing homopolymer runs. This allows facile inactivation of genes with coding homopolymer runs including FRR1 , which encodes the target of the immunosuppresive antifungal drugs FK506 and rapamycin. Our study identifies a eukaryotic hypermutator lineage spread over two continents and suggests that pathogenic eukaryotic microbes may experience similar selection pressures on mutation rate as bacterial pathogens, particularly during long periods of clonal growth or while expanding into new environments.
Stoeck, Thorsten; Breiner, Hans-Werner; Filker, Sabine; Ostermaier, Veronika; Kammerlander, Barbara; Sonntag, Bettina
2014-02-01
Analyses of high-throughput environmental sequencing data have become the 'gold-standard' to address fundamental questions of microbial diversity, ecology and biogeography. Findings that emerged from sequencing are, e.g. the discovery of the extensive 'rare microbial biosphere' and its potential function as a seed-bank. Even though applied since several years, results from high-throughput environmental sequencing have hardly been validated. We assessed how well pyrosequenced amplicons [the hypervariable eukaryotic V4 region of the small subunit ribosomal RNA (SSU rRNA) gene] reflected morphotype ciliate plankton. Moreover, we assessed if amplicon sequencing had the potential to detect the annual ciliate plankton stock. In both cases, we identified significant quantitative and qualitative differences. Our study makes evident that taxon abundance distributions inferred from amplicon data are highly biased and do not mirror actual morphotype abundances at all. Potential reasons included cell losses after fixation, cryptic morphotypes, resting stages, insufficient sequence data availability of morphologically described species and the unsatisfying resolution of the V4 SSU rRNA fragment for accurate taxonomic assignments. The latter two underline the necessity of barcoding initiatives for eukaryotic microbes to better and fully exploit environmental amplicon data sets, which then will also allow studying the potential of seed-bank taxa as a buffer for environmental changes. © 2013 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.
Meeting the challenges of non-referenced genome assembly from short-read sequence data
M. Parks; A. Liston; R. Cronn
2010-01-01
Massively parallel sequencing technologies (MPST) offer unprecedented opportunities for novel sequencing projects. MPST, while offering tremendous sequencing capacity, are typically most effective in resequencing projects (as opposed to the sequencing of novel genomes) due to the fact that sequence is returned in relatively short reads. Nonetheless, there is great...
A new rainbow trout (Oncorhynchus mykiss) reference genome assembly
USDA-ARS?s Scientific Manuscript database
In an effort to improve the rainbow trout reference genome assembly, we have re-sequenced the doubled-haploid Swanson line using the longest available reads from the Illumina technology. Overall we generated over 510 million 260nt paired-end shotgun reads, and 1 billion 160nt mate-pair reads from f...
USDA-ARS?s Scientific Manuscript database
Human selection has reshaped crop genomes. Here we report an apple genome variation map generated through genome sequencing of 117 diverse accessions. A comprehensive model of apple speciation and domestication along the Silk Road was proposed based on evidence from diverse genomic analyses. Cultiva...
USDA-ARS?s Scientific Manuscript database
Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5' and 3' flanking regions of IRS2 (approx. 14.5 kb), were bidirectionally sequenced for single nucleotide...
Reduced representation approaches to interrogate genome diversity in large repetitive plant genomes.
Hirsch, Cory D; Evans, Joseph; Buell, C Robin; Hirsch, Candice N
2014-07-01
Technology and software improvements in the last decade now provide methodologies to access the genome sequence of not only a single accession, but also multiple accessions of plant species. This provides a means to interrogate species diversity at the genome level. Ample diversity among accessions in a collection of species can be found, including single-nucleotide polymorphisms, insertions and deletions, copy number variation and presence/absence variation. For species with small, non-repetitive rich genomes, re-sequencing of query accessions is robust, highly informative, and economically feasible. However, for species with moderate to large sized repetitive-rich genomes, technical and economic barriers prevent en masse genome re-sequencing of accessions. Multiple approaches to access a focused subset of loci in species with larger genomes have been developed, including reduced representation sequencing, exome capture and transcriptome sequencing. Collectively, these approaches have enabled interrogation of diversity on a genome scale for large plant genomes, including crop species important to worldwide food security. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Reilly, Morgann C.; Kim, Joonhoon; Lynn, Jed; ...
2018-01-06
Plant biomass, once reduced to its composite sugars, can be converted to fuel substitutes. One means of overcoming the recalcitrance of lignocellulose is pretreatment followed by enzymatic hydrolysis. However, currently available commercial enzyme cocktails are inhibited in the presence of residual pretreatment chemicals. Recent studies have identified a number of cellulolytic enzymes from bacteria that are tolerant to pretreatment chemicals such as ionic liquids. The challenge now is generation of these enzymes in copious amounts, an arena where fungal organisms such as Aspergillus niger have proven efficient. Fungal host strains still need to be engineered to increase production titers ofmore » heterologous protein over native enzymes, which has been a difficult task. Here, we developed a forward genetics screen coupled with whole-genome resequencing to identify specific lesions responsible for a protein hyper-production phenotype in A. niger. As a result, this strategy successfully identified novel targets, including a low-affinity glucose transporter, MstC, whose deletion significantly improved secretion of recombinant proteins driven by a glucoamylase promoter.« less
Whole genome re-sequencing of date palms yields insights into diversification of a fruit tree crop.
Hazzouri, Khaled M; Flowers, Jonathan M; Visser, Hendrik J; Khierallah, Hussam S M; Rosas, Ulises; Pham, Gina M; Meyer, Rachel S; Johansen, Caryn K; Fresquez, Zoë A; Masmoudi, Khaled; Haider, Nadia; El Kadri, Nabila; Idaghdour, Youssef; Malek, Joel A; Thirkhill, Deborah; Markhand, Ghulam S; Krueger, Robert R; Zaid, Abdelouahhab; Purugganan, Michael D
2015-11-09
Date palms (Phoenix dactylifera) are the most significant perennial crop in arid regions of the Middle East and North Africa. Here, we present a comprehensive catalogue of approximately seven million single nucleotide polymorphisms in date palms based on whole genome re-sequencing of a collection of 62 cultivars. Population structure analysis indicates a major genetic divide between North Africa and the Middle East/South Asian date palms, with evidence of admixture in cultivars from Egypt and Sudan. Genome-wide scans for selection suggest at least 56 genomic regions associated with selective sweeps that may underlie geographic adaptation. We report candidate mutations for trait variation, including nonsense polymorphisms and presence/absence variation in gene content in pathways for key agronomic traits. We also identify a copia-like retrotransposon insertion polymorphism in the R2R3 myb-like orthologue of the oil palm virescens gene associated with fruit colour variation. This analysis documents patterns of post-domestication diversification and provides a genomic resource for this economically important perennial tree crop.
Natural and Unanticipated Modifiers of RNAi Activity in Caenorhabditis elegans
Asad, Nadeem; Aw, Wen Yih; Timmons, Lisa
2012-01-01
Organisms used as model genomics systems are maintained as isogenic strains, yet evidence of sequence differences between independently maintained wild-type stocks has been substantiated by whole-genome resequencing data and strain-specific phenotypes. Sequence differences may arise from replication errors, transposon mobilization, meiotic gene conversion, or environmental or chemical assault on the genome. Low frequency alleles or mutations with modest effects on phenotypes can contribute to natural variation, and it has proven possible for such sequences to become fixed by adapted evolutionary enrichment and identified by resequencing. Our objective was to identify and analyze single locus genetic defects leading to RNAi resistance in isogenic strains of Caenorhabditis elegans. In so doing, we uncovered a mutation that arose de novo in an existing strain, which initially frustrated our phenotypic analysis. We also report experimental, environmental, and genetic conditions that can complicate phenotypic analysis of RNAi pathway defects. These observations highlight the potential for unanticipated mutations, coupled with genetic and environmental phenomena, to enhance or suppress the effects of known mutations and cause variation between wild-type strains. PMID:23209671
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reilly, Morgann C.; Kim, Joonhoon; Lynn, Jed
Plant biomass, once reduced to its composite sugars, can be converted to fuel substitutes. One means of overcoming the recalcitrance of lignocellulose is pretreatment followed by enzymatic hydrolysis. However, currently available commercial enzyme cocktails are inhibited in the presence of residual pretreatment chemicals. Recent studies have identified a number of cellulolytic enzymes from bacteria that are tolerant to pretreatment chemicals such as ionic liquids. The challenge now is generation of these enzymes in copious amounts, an arena where fungal organisms such as Aspergillus niger have proven efficient. Fungal host strains still need to be engineered to increase production titers ofmore » heterologous protein over native enzymes, which has been a difficult task. Here, we developed a forward genetics screen coupled with whole-genome resequencing to identify specific lesions responsible for a protein hyper-production phenotype in A. niger. This strategy successfully identified novel targets, including a low-affinity glucose transporter, MstC, whose deletion significantly improved secretion of recombinant proteins driven by a glucoamylase promoter.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reilly, Morgann C.; Kim, Joonhoon; Lynn, Jed
Plant biomass, once reduced to its composite sugars, can be converted to fuel substitutes. One means of overcoming the recalcitrance of lignocellulose is pretreatment followed by enzymatic hydrolysis. However, currently available commercial enzyme cocktails are inhibited in the presence of residual pretreatment chemicals. Recent studies have identified a number of cellulolytic enzymes from bacteria that are tolerant to pretreatment chemicals such as ionic liquids. The challenge now is generation of these enzymes in copious amounts, an arena where fungal organisms such as Aspergillus niger have proven efficient. Fungal host strains still need to be engineered to increase production titers ofmore » heterologous protein over native enzymes, which has been a difficult task. Here, we developed a forward genetics screen coupled with whole-genome resequencing to identify specific lesions responsible for a protein hyper-production phenotype in A. niger. As a result, this strategy successfully identified novel targets, including a low-affinity glucose transporter, MstC, whose deletion significantly improved secretion of recombinant proteins driven by a glucoamylase promoter.« less
Whole genome re-sequencing of date palms yields insights into diversification of a fruit tree crop
Hazzouri, Khaled M.; Flowers, Jonathan M.; Visser, Hendrik J.; Khierallah, Hussam S. M.; Rosas, Ulises; Pham, Gina M.; Meyer, Rachel S.; Johansen, Caryn K.; Fresquez, Zoë A.; Masmoudi, Khaled; Haider, Nadia; El Kadri, Nabila; Idaghdour, Youssef; Malek, Joel A.; Thirkhill, Deborah; Markhand, Ghulam S.; Krueger, Robert R.; Zaid, Abdelouahhab; Purugganan, Michael D.
2015-01-01
Date palms (Phoenix dactylifera) are the most significant perennial crop in arid regions of the Middle East and North Africa. Here, we present a comprehensive catalogue of approximately seven million single nucleotide polymorphisms in date palms based on whole genome re-sequencing of a collection of 62 cultivars. Population structure analysis indicates a major genetic divide between North Africa and the Middle East/South Asian date palms, with evidence of admixture in cultivars from Egypt and Sudan. Genome-wide scans for selection suggest at least 56 genomic regions associated with selective sweeps that may underlie geographic adaptation. We report candidate mutations for trait variation, including nonsense polymorphisms and presence/absence variation in gene content in pathways for key agronomic traits. We also identify a copia-like retrotransposon insertion polymorphism in the R2R3 myb-like orthologue of the oil palm virescens gene associated with fruit colour variation. This analysis documents patterns of post-domestication diversification and provides a genomic resource for this economically important perennial tree crop. PMID:26549859
NASA Astrophysics Data System (ADS)
Tibbetts, Clark; Lichanska, Agnieszka M.; Borsuk, Lisa A.; Weslowski, Brian; Morris, Leah M.; Lorence, Matthew C.; Schafer, Klaus O.; Campos, Joseph; Sene, Mohamadou; Myers, Christopher A.; Faix, Dennis; Blair, Patrick J.; Brown, Jason; Metzgar, David
2010-04-01
High-density resequencing microarrays support simultaneous detection and identification of multiple viral and bacterial pathogens. Because detection and identification using RPM is based upon multiple specimen-specific target pathogen gene sequences generated in the individual test, the test results enable both a differential diagnostic analysis and epidemiological tracking of detected pathogen strains and variants from one specimen to the next. The RPM assay enables detection and identification of pathogen sequences that share as little as 80% sequence similarity to prototype target gene sequences represented as detector tiles on the array. This capability enables the RPM to detect and identify previously unknown strains and variants of a detected pathogen, as in sentinel cases associated with an infectious disease outbreak. We illustrate this capability using assay results from testing influenza A virus vaccines configured with strains that were first defined years after the design of the RPM microarray. Results are also presented from RPM-Flu testing of three specimens independently confirmed to the positive for the 2009 Novel H1N1 outbreak strain of influenza virus.
Martinelli, Axel; Henriques, Gisela; Cravo, Pedro; Hunt, Paul
2011-01-01
In malaria parasites, mutations in two genes of folate biosynthesis encoding dihydrofolate reductase (dhfr) and dihydropteroate synthase (dhps) modify responses to antifolate therapies which target these enzymes. However, the involvement of other genes which modify the availability of exogenous folate, for example, has been proposed. Here, we used short-read whole-genome re-sequencing to determine the mutations in a clone of the rodent malaria parasite, Plasmodium chabaudi, which has altered susceptibility to both sulphadoxine and pyrimethamine. This clone bears a previously identified S106N mutation in dhfr and no mutation in dhps. Instead, three additional point mutations in genes on chromosomes 2, 13 and 14 were identified. The mutated gene on chromosome 13 (mdr2 K392Q) encodes an ABC transporter. Because Quantitative Trait Locus analysis previously indicated an association of genetic markers on chromosome 13 with responses to individual and combined antifolates, MDR2 is proposed to modulate antifolate responses, possibly mediated by the transport of folate intermediates. PMID:20858498
Reilly, Morgann C.; Kim, Joonhoon; Lynn, Jed; ...
2018-01-06
Plant biomass, once reduced to its composite sugars, can be converted to fuel substitutes. One means of overcoming the recalcitrance of lignocellulose is pretreatment followed by enzymatic hydrolysis. However, currently available commercial enzyme cocktails are inhibited in the presence of residual pretreatment chemicals. Recent studies have identified a number of cellulolytic enzymes from bacteria that are tolerant to pretreatment chemicals such as ionic liquids. The challenge now is generation of these enzymes in copious amounts, an arena where fungal organisms such as Aspergillus niger have proven efficient. Fungal host strains still need to be engineered to increase production titers ofmore » heterologous protein over native enzymes, which has been a difficult task. Here, we developed a forward genetics screen coupled with whole-genome resequencing to identify specific lesions responsible for a protein hyper-production phenotype in A. niger. This strategy successfully identified novel targets, including a low-affinity glucose transporter, MstC, whose deletion significantly improved secretion of recombinant proteins driven by a glucoamylase promoter.« less
A universal procedure for primer labelling of amplicons.
Neilan, B A; Wilton, A N; Jacobs, D
1997-01-01
Detection and visualisation of nucleic acids is integral to genome analyses. Exponential amplification procedures have provided the means for the manipulation of nucleic acid sequences, which were otherwise inaccessible. We describe the development and application of a universal method for the labelling of any PCR product using a single end-labelled primer. Amplification was performed in a single reaction with the resulting amplicon labelled to a high specific activity. The method was adapted to a wide range of PCRs and significantly reduced the expense of such analyses. PMID:9207046
Synthetic spike-in standards for high-throughput 16S rRNA gene amplicon sequencing.
Tourlousse, Dieter M; Yoshiike, Satowa; Ohashi, Akiko; Matsukura, Satoko; Noda, Naohiro; Sekiguchi, Yuji
2017-02-28
High-throughput sequencing of 16S rRNA gene amplicons (16S-seq) has become a widely deployed method for profiling complex microbial communities but technical pitfalls related to data reliability and quantification remain to be fully addressed. In this work, we have developed and implemented a set of synthetic 16S rRNA genes to serve as universal spike-in standards for 16S-seq experiments. The spike-ins represent full-length 16S rRNA genes containing artificial variable regions with negligible identity to known nucleotide sequences, permitting unambiguous identification of spike-in sequences in 16S-seq read data from any microbiome sample. Using defined mock communities and environmental microbiota, we characterized the performance of the spike-in standards and demonstrated their utility for evaluating data quality on a per-sample basis. Further, we showed that staggered spike-in mixtures added at the point of DNA extraction enable concurrent estimation of absolute microbial abundances suitable for comparative analysis. Results also underscored that template-specific Illumina sequencing artifacts may lead to biases in the perceived abundance of certain taxa. Taken together, the spike-in standards represent a novel bioanalytical tool that can substantially improve 16S-seq-based microbiome studies by enabling comprehensive quality control along with absolute quantification. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Harnicarova, Andrea; Kozubek, Stanislav; Pachernik, Jiri
2006-12-10
Using sequential RNA-DNA fluorescence in situ hybridization, the nuclear arrangement of both the active and inactive c-myc gene as well as its transcription was investigated in colon cancer HT-29 cells induced to differentiate into enterocytes. Cytogenetic studies revealed the presence of two chromosomes 8 in HT-29 cells, of which the one containing c-myc gene amplicons was substantially larger and easily distinguished from the normal chromosome. This observation enabled detection of both activity and nuclear localization of c-myc genes in single cells and in individual chromosome territories. Similar transcriptional activity of the c-myc gene was observed in both the normal andmore » derivative chromosome 8 territories showing no influence of the amplification on the c-myc gene expression. Our experiments demonstrate strikingly specific nuclear and territorial arrangements of active genes as compared with inactive ones: on the periphery of their territories facing to the very central region of the cell nucleus. Nuclear arrangement of c-myc genes and transcripts was conserved during cell differentiation and, therefore, independent of the level of differentiation-specific c-myc gene expression. However, after the induction of differentiation, a more internal territorial location was found for the single copy c-myc gene of normal chromosome 8, while amplicons conserved their territorial topography.« less
Scala, Giovanni; Affinito, Ornella; Palumbo, Domenico; Florio, Ermanno; Monticelli, Antonella; Miele, Gennaro; Chiariotti, Lorenzo; Cocozza, Sergio
2016-11-25
CpG sites in an individual molecule may exist in a binary state (methylated or unmethylated) and each individual DNA molecule, containing a certain number of CpGs, is a combination of these states defining an epihaplotype. Classic quantification based approaches to study DNA methylation are intrinsically unable to fully represent the complexity of the underlying methylation substrate. Epihaplotype based approaches, on the other hand, allow methylation profiles of cell populations to be studied at the single molecule level. For such investigations, next-generation sequencing techniques can be used, both for quantitative and for epihaplotype analysis. Currently available tools for methylation analysis lack output formats that explicitly report CpG methylation profiles at the single molecule level and that have suited statistical tools for their interpretation. Here we present ampliMethProfiler, a python-based pipeline for the extraction and statistical epihaplotype analysis of amplicons from targeted deep bisulfite sequencing of multiple DNA regions. ampliMethProfiler tool provides an easy and user friendly way to extract and analyze the epihaplotype composition of reads from targeted bisulfite sequencing experiments. ampliMethProfiler is written in python language and requires a local installation of BLAST and (optionally) QIIME tools. It can be run on Linux and OS X platforms. The software is open source and freely available at http://amplimethprofiler.sourceforge.net .
Tout, Jessica; Siboni, Nachshon; Messer, Lauren F.; Garren, Melissa; Stocker, Roman; Webster, Nicole S.; Ralph, Peter J.; Seymour, Justin R.
2015-01-01
Rising seawater temperature associated with global climate change is a significant threat to coral health and is linked to increasing coral disease and pathogen-related bleaching events. We performed heat stress experiments with the coral Pocillopora damicornis, where temperature was increased to 31°C, consistent with the 2–3°C predicted increase in summer sea surface maxima. 16S rRNA amplicon sequencing revealed a large shift in the composition of the bacterial community at 31°C, with a notable increase in Vibrio, including known coral pathogens. To investigate the dynamics of the naturally occurring Vibrio community, we performed quantitative PCR targeting (i) the whole Vibrio community and (ii) the coral pathogen Vibrio coralliilyticus. At 31°C, Vibrio abundance increased by 2–3 orders of magnitude and V. coralliilyticus abundance increased by four orders of magnitude. Using a Vibrio-specific amplicon sequencing assay, we further demonstrated that the community composition shifted dramatically as a consequence of heat stress, with significant increases in the relative abundance of known coral pathogens. Our findings provide quantitative evidence that the abundance of potential coral pathogens increases within natural communities of coral-associated microbes as a consequence of rising seawater temperature and highlight the potential negative impacts of anthropogenic climate change on coral reef ecosystems. PMID:26042096
Liu, Yun-Xi; Zhao, Zhong-Tang; Cao, Wu-Chun; Xu, Xiao-Qun; Suo, Ji-Jiang; Xing, Yu-Bin; Jia, Ning; Du, Ming-Mei; Liu, Bo-Wei; Yao, Yuan
2013-01-01
The aim of the present study was to evaluate the clinical usefulness of applying RT-nested PCR along with RFLP as a method for diagnosis and genotypic differentiation of Hantavirus in the acute-stage sera of HFRS patients as compared to the ELISA technique. A prospective study of patients with suspected HFRS patients was carried out. Sera were collected for serological evaluation by ELISA and RT-nested PCR testing. Primers were selected from the published sequence of the S segment of HTNV strain 76-118 and SEOV strain SR-11, which made it possible to obtain an amplicon of 403 bp by RT-nested PCR. The genotypic differentiations of the RT-nested PCR amplicons were carried out by RFLP. Sequence analyses of the amplicons were used to confirm the accuracy of the results obtained by RFLP. Of the 48 acute-stage sera from suspected HFRS patients, 35 were ELISA-positive while 41 were positive by RT-nested PCR. With Hind III and Hinf I, RFLP profiles of the RT-nested PCR amplicons of the 41 positive sera exhibited two patterns. 33 had RFLP profiles similar to the reference strain R22, and thus belonged to the SEOV type. The other 8 samples which were collected during October-December had RFLP profiles similar to the reference strain 76-118, and thus belonged to the HTNV type. Sequence phylogenetic analysis of RT-nested PCR amplicons revealed sdp1, sdp2 YXL-2008, and sdp3 as close relatives of HTNV strain 76-118, while sdp22 and sdp37 as close relatives of SEOV strain Z37 and strain R22 located in two separate clusters in the phylogenetic tree. These results were identical to those acquired by RFLP. RT-nested PCR integrated with RFLP was a rapid, simple, accurate method for detecting and differentiating the genotypes of Hantavirus in the acute-stage sera of suspected HFRS patients. In Shandong province, the main genotypes of Hantavirus belonged to the SEOV types, while the HTNV types were observed during the autumn-winter season.
Gene amplification of the transcription factor DP1 and CTNND1 in human lung cancer.
Castillo, Sandra D; Angulo, Barbara; Suarez-Gauthier, Ana; Melchor, Lorenzo; Medina, Pedro P; Sanchez-Verde, Lydia; Torres-Lanzas, Juan; Pita, Guillermo; Benitez, Javier; Sanchez-Cespedes, Montse
2010-09-01
The search for novel oncogenes is important because they could be the target of future specific anticancer therapies. In the present paper we report the identification of novel amplified genes in lung cancer by means of global gene expression analysis. To screen for amplicons, we aligned the gene expression data according to the position of transcripts in the human genome and searched for clusters of over-expressed genes. We found several clusters with gene over-expression, suggesting an underlying genomic amplification. FISH and microarray analysis for DNA copy number in two clusters, at chromosomes 11q12 and 13q34, confirmed the presence of amplifications spanning about 0.4 and 1 Mb for 11q12 and 13q34, respectively. Amplification at these regions each occurred at a frequency of 3%. Moreover, quantitative RT-PCR of each individual transcript within the amplicons allowed us to verify the increased in gene expression of several genes. The p120ctn and DP1 proteins, encoded by two candidate oncogenes, CTNND1 and TFDP1, at 11q12 and 13q amplicons, respectively, showed very strong immunostaining in lung tumours with gene amplification. We then focused on the 13q34 amplicon and in the TFDP1 candidate oncogene. To further determine the oncogenic properties of DP1, we searched for lung cancer cell lines carrying TFDP1 amplification. Depletion of TFDP1 expression by small interference RNA in a lung cancer cell line (HCC33) with TFDP1 amplification and protein over-expression reduced cell viability by 50%. In conclusion, we report the identification of two novel amplicons, at 13q34 and 11q12, each occurring at a frequency of 3% of non-small cell lung cancers. TFDP1, which encodes the E2F-associated transcription factor DP1 is a candidate oncogene at 13q34. The data discussed in this publication have been deposited in NCBIs Gene Expression Omnibus (GEO; http://www.ncbi.nlm.nih.gov/geo/) and are accessible through GEO Series Accession No. GSE21168.
Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers
Pabinger, Stephan; Ernst, Karina; Pulverer, Walter; Kallmeyer, Rainer; Valdes, Ana M.; Metrustry, Sarah; Katic, Denis; Nuzzo, Angelo; Kriegner, Albert; Vierlinger, Klemens; Weinhaeusel, Andreas
2016-01-01
Targeted sequencing of PCR amplicons generated from bisulfite deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM). Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation-patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage. TABSAT is freely available under a GNU General Public License version 3.0 (GPLv3) at https://github.com/tadkeys/tabsat/ and http://demo.platomics.com/. PMID:27467908
Constable, Fiona E.; Nancarrow, Narelle; Plummer, Kim M.; Rodoni, Brendan
2017-01-01
PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored. PMID:28632759
Region of interest methylation analysis: a comparison of MSP with MS-HRM and direct BSP.
Akika, Reem; Awada, Zainab; Mogharbil, Nahed; Zgheib, Nathalie K
2017-07-01
The aim of this study was to compare and contrast three DNA methylation methods of a specific region of interest (ROI): methylation-specific PCR (MSP), methylation-sensitive high resolution melting (MS-HRM) and direct bisulfite sequencing (BSP). The methylation of a CpG area in the promoter region of Estrogen receptor alpha (ESR1) was evaluated by these three methods with samples and standards of different methylation percentages. MSP data were neither reproducible nor sensitive, and the assay was not specific due to non-specific binding of primers. MS-HRM was highly reproducible and a step forward into categorizing the methylation status of the samples as percent ranges. Direct BSP was the most informative method regarding methylation percentage of each CpG site. Though not perfect, it was reproducible and sensitive. We recommend the use of either method depending on the research question and target amplicon, and provided that the designed primers and expected amplicons are within recommendations. If the research question targets a limited number of CpG sites and simple yes/no results are enough, MSP may be attempted. For short amplicons that are crowded with CpG sites and of single melting domain, MS-HRM may be the method of choice though it only indicates the overall methylation percentage of the entire amplicon. Although the assay is highly reproducible, being semi-quantitative makes it of lesser interest to study ROI methylation of samples with little methylation differences. Direct BSP is a step forward as it gives information about the methylation percentage at each CpG site.
Shaw, Jennifer L. A.; Weyrich, Laura S.; Sawade, Emma; Drikas, Mary; Cooper, Alan J.
2015-01-01
Drinking water assessments use a variety of microbial, physical, and chemical indicators to evaluate water treatment efficiency and product water quality. However, these indicators do not allow the complex biological communities, which can adversely impact the performance of drinking water distribution systems (DWDSs), to be characterized. Entire bacterial communities can be studied quickly and inexpensively using targeted metagenomic amplicon sequencing. Here, amplicon sequencing of the 16S rRNA gene region was performed alongside traditional water quality measures to assess the health, quality, and efficiency of two distinct, full-scale DWDSs: (i) a linear DWDS supplied with unfiltered water subjected to basic disinfection before distribution and (ii) a complex, branching DWDS treated by a four-stage water treatment plant (WTP) prior to disinfection and distribution. In both DWDSs bacterial communities differed significantly after disinfection, demonstrating the effectiveness of both treatment regimes. However, bacterial repopulation occurred further along in the DWDSs, and some end-user samples were more similar to the source water than to the postdisinfection water. Three sample locations appeared to be nitrified, displaying elevated nitrate levels and decreased ammonia levels, and nitrifying bacterial species, such as Nitrospira, were detected. Burkholderiales were abundant in samples containing large amounts of monochloramine, indicating resistance to disinfection. Genera known to contain pathogenic and fecal-associated species were also identified in several locations. From this study, we conclude that metagenomic amplicon sequencing is an informative method to support current compliance-based methods and can be used to reveal bacterial community interactions with the chemical and physical properties of DWDSs. PMID:26162884
Repair of DNA damage caused by cytosine deamination in mitochondrial DNA of forensic case samples.
Gorden, Erin M; Sturk-Andreaggi, Kimberly; Marshall, Charla
2018-05-01
DNA sequence damage from cytosine deamination is well documented in degraded samples, such as those from ancient and forensic contexts. This study examined the effect of a DNA repair treatment on mitochondrial DNA (mtDNA) from aged and degraded skeletal samples. DNA extracts from 21 non-probative, degraded skeletal samples (aged 50-70 years) were utilized for the analysis. A portion of each sample extract was subjected to DNA repair using a commercial repair kit, the New England BioLabs' NEBNext FFPE DNA Repair Kit (Ipswich, MA). MtDNA was enriched using PCR and targeted capture in a side-by-side experiment of untreated and repaired DNA. Sequencing was performed using both traditional (Sanger-type; STS) and next-generation sequencing (NGS) methods Although cytosine deamination was evident in the mtDNA sequence data, the observed level of damaged bases varied by sequencing method as well as by enrichment type. The STS PCR amplicon data did not show evidence of cytosine deamination that could be distinguished from background signal in either the untreated or repaired sample set. However, the same PCR amplicons showed 850 C → T/G → A substitutions consistent with cytosine deamination with variant frequencies (VFs) of up to 25% when sequenced using NGS methods The occurrence of base misincorporation due to cytosine deamination was reduced by 98% (to 10) in the NGS amplicon data after repair. The NGS capture data indicated low levels (1-2%) of cytosine deamination in mtDNA fragments that was effectively mitigated by DNA repair. The observed difference in the level of cytosine deamination between the PCR and capture enrichment methods can be attributed to the greater propensity for stochastic effects from the PCR enrichment technique employed (e.g., low template input, increased PCR cycles). Altogether these results indicate that DNA repair may be required when sequencing PCR-amplified DNA from degraded forensic case samples with NGS methods. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Guo, Longhua; Yang, Huanghao; Qiu, Bin; Xiao, Xueyang; Xue, Linlin; Kim, Donghwan; Chen, Guonan
2009-12-01
A capillary electrophoresis coupled with electrochemiluminescent detection system (CE-ECL) was developed for the detection of polymerase chain reaction (PCR) amplicons. The ECL luminophore, tris(1,10-phenanthroline) ruthenium(II) (Ru(phen)(3)(2+)), was labeled to the PCR primers before amplification. Ru(phen)(3)(2+) was then introduced to PCR amplicons by PCR amplification. Eventually, the PCR amplicons were separated and detected by the homemade CE-ECL system. The detection of a typical genetically modified organism (GMO), Roundup Ready Soy (RRS), was shown as an example to demonstrate the reliability of the proposed approach. Four pairs of primers were amplified by multiple PCR (MPCR) simultaneously, three of which were targeted on the specific sequence of exogenous genes of RRS, and another was targeted on the endogenous reference gene of soybean. Both the conditions for PCR amplification and CE-ECL separation and detection were investigated in detail. Results showed that, under the optimal conditions, the proposed method can accurately identifying RRS. The corresponding limit of detection (LOD) was below 0.01% with 35 PCR cycles.
Prescreening of microbial populations for the assessment of sequencing potential.
Hanning, Irene B; Ricke, Steven C
2011-01-01
Next-generation sequencing (NGS) is a powerful tool that can be utilized to profile and compare microbial populations. By amplifying a target gene present in all bacteria and subsequently sequencing amplicons, the bacteria genera present in the populations can be identified and compared. In some scenarios, little to no difference may exist among microbial populations being compared in which case a prescreening method would be practical to determine which microbial populations would be suitable for further analysis by NGS. Denaturing density-gradient electrophoresis (DGGE) is relatively cheaper than NGS and the data comparing microbial populations are ready to be viewed immediately after electrophoresis. DGGE follows essentially the same initial methodology as NGS by targeting and amplifying the 16S rRNA gene. However, as opposed to sequencing amplicons, DGGE amplicons are analyzed by electrophoresis. By prescreening microbial populations with DGGE, more efficient use of NGS methods can be accomplished. In this chapter, we outline the protocol for DGGE targeting the same gene (16S rRNA) that would be targeted for NGS to compare and determine differences in microbial populations from a wide range of ecosystems.
Saeidabadi, Mohammad Sadegh; Nili, Hassan; Dadras, Habibollah; Sharifiyazdi, Hassan; Connolly, Joanne; Valcanis, Mary; Raidal, Shane; Ghorashi, Seyed Ali
2017-06-01
Consumption of poultry products contaminated with Salmonella is one of the major causes of foodborne diseases worldwide and therefore detection and differentiation of Salmonella spp. in poultry is important. In this study, oligonucleotide primers were designed from hemD gene and a PCR followed by high-resolution melt (HRM) curve analysis was developed for rapid differentiation of Salmonella isolates. Amplicons of 228 bp were generated from 16 different Salmonella reference strains and from 65 clinical field isolates mainly from poultry farms. HRM curve analysis of the amplicons differentiated Salmonella isolates and analysis of the nucleotide sequence of the amplicons from selected isolates revealed that each melting curve profile was related to a unique DNA sequence. The relationship between reference strains and tested specimens was also evaluated using a mathematical model without visual interpretation of HRM curves. In addition, the potential of the PCR-HRM curve analysis was evaluated for genotyping of additional Salmonella isolates from different avian species. The findings indicate that PCR followed by HRM curve analysis provides a rapid and robust technique for genotyping of Salmonella isolates to determine the serovar/serotype.
Lopez-Doriga, Adriana; Feliubadaló, Lídia; Menéndez, Mireia; Lopez-Doriga, Sergio; Morón-Duran, Francisco D; del Valle, Jesús; Tornero, Eva; Montes, Eva; Cuesta, Raquel; Campos, Olga; Gómez, Carolina; Pineda, Marta; González, Sara; Moreno, Victor; Capellá, Gabriel; Lázaro, Conxi
2014-03-01
Next-generation sequencing (NGS) has revolutionized genomic research and is set to have a major impact on genetic diagnostics thanks to the advent of benchtop sequencers and flexible kits for targeted libraries. Among the main hurdles in NGS are the difficulty of performing bioinformatic analysis of the huge volume of data generated and the high number of false positive calls that could be obtained, depending on the NGS technology and the analysis pipeline. Here, we present the development of a free and user-friendly Web data analysis tool that detects and filters sequence variants, provides coverage information, and allows the user to customize some basic parameters. The tool has been developed to provide accurate genetic analysis of targeted sequencing of common high-risk hereditary cancer genes using amplicon libraries run in a GS Junior System. The Web resource is linked to our own mutation database, to assist in the clinical classification of identified variants. We believe that this tool will greatly facilitate the use of the NGS approach in routine laboratories.
Fessehaie, Anania; De Boer, Solke H; Lévesque, C André
2003-03-01
ABSTRACT Oligonucleotides, 16 to 24 bases long, were selected from the 3' end of the 16S gene and the 16S-23S intergenic spacer regions of bacteria pathogenic on potato, including Clavibacter michiganensis subsp. sepedonicus, Ralstonia solanacearum, and the pectolytic erwinias, including Erwinia carotovora subsp. atroseptica and carotovora and E. chrysanthemi. Oligonucleotides were designed and formatted into an array by pin spotting on nylon membranes. Genomic DNA from bacterial cultures was amplified by polymerase chain reaction using conserved ribosomal primers and labeled simultaneously with digoxigenin-dUTP. Hybridization of amplicons to the array and subsequent serological detection of digoxigenin label revealed different hybridization patterns that were distinct for each species and subspecies tested. Hybridization of amplicons generally was restricted to appropriate homologous oligonucleotides and cross-hybridization with heterologous oligonucleotides was rare. Hybridization patterns were recorded as separate gray values for each hybridized spot and revealed a consistent pattern for multiple strains of each species or subspecies isolated from diverse geographical regions. In preliminary tests, bacteria could be correctly identified and detected by hybridizing to the array amplicons from mixed cultures and inoculated potato tissue.
Hélias-Rodzewicz, Zofia; Pérot, Gaëlle; Chibon, Frédéric; Ferreira, Céline; Lagarde, Pauline; Terrier, Philippe; Coindre, Jean-Michel; Aurias, Alain
2010-12-01
In a series of 404 adult soft tissue sarcomas, analyzed by array-CGH, we have observed in approximately 10% of them a genomic amplification of either chromosome bands 11q22 or 3p12. These two amplicons likely target the YAP1 and VGLL3 genes, respectively. Both genes encode proteins that are cofactors of the TEAD family of transcription factors. Very good correlations between amplification and expression levels were observed. Welch test analyses of transcriptome data demonstrate that tumors with amplicons share a large set of upregulated and downregulated genes. Inhibition of YAP1 and VGLL3 in cell lines with these amplifications/overexpressions leads to similar phenotypes: decrease of proliferation rate, and to a lesser extent decrease of migration properties. These data, and the fact that these amplicons are observed either in de-differentiated liposarcomas or in undifferentiated pleomorphic sarcomas, suggest that these genetics events could be involved in oncogenesis and progression of soft tissue sarcomas. © 2010 Wiley-Liss, Inc.
Rapid Detection of the Chlamydiaceae and Other Families in the Order Chlamydiales: Three PCR Tests
Everett, Karin D. E.; Hornung, Linda J.; Andersen, Arthur A.
1999-01-01
Few identification methods will rapidly or specifically detect all bacteria in the order Chlamydiales, family Chlamydiaceae. In this study, three PCR tests based on sequence data from over 48 chlamydial strains were developed for identification of these bacteria. Two tests exclusively recognized the Chlamydiaceae: a multiplex test targeting the ompA gene and the rRNA intergenic spacer and a TaqMan test targeting the 23S ribosomal DNA. The multiplex test was able to detect as few as 200 inclusion-forming units (IFU), while the TaqMan test could detect 2 IFU. The amplicons produced in these tests ranged from 132 to 320 bp in length. The third test, targeting the 23S rRNA gene, produced a 600-bp amplicon from strains belonging to several families in the order Chlamydiales. Direct sequence analysis of this amplicon has facilitated the identification of new chlamydial strains. These three tests permit ready identification of chlamydiae for diagnostic and epidemiologic study. The specificity of these tests indicates that they might also be used to identify chlamydiae without culture or isolation. PMID:9986815
ERIC Educational Resources Information Center
American Association of Physics Teachers (NJ1), 2009
2009-01-01
Physics First represents an organizational alternative to the traditional high school science sequence. It calls for a re-sequencing of high school courses so that students study physics before chemistry and biology. The purpose of this pamphlet is to provide: (1) Basic information and rationale for the Physics First curriculum; (2) Strategies for…
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Gretchen H. Roffler; Stephen J. Amish; Seth Smith; Ted Cosart; Marty Kardos; Michael K. Schwartz; Gordon Luikart
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding...
A New and Improved Rainbow Trout (Oncorhynchus mykiss) Reference Genome Assembly
USDA-ARS?s Scientific Manuscript database
In an effort to improve the rainbow trout reference genome assembly, we re-sequenced the doubled-haploid Swanson line using the longest available reads from the Illumina technology; generating over 510 million paired-end shotgun reads (2x260nt), and 1 billion mate-pair reads (2x160nt) from four sequ...
USDA-ARS?s Scientific Manuscript database
CBF/DREB related genes are considered important genes for regulation of abiotic stress in plants. In this study, CBF/DREB genes in perennial ryegrass (Lolium perenne L.), also known as LpCBF genes, were resequenced from several cultivated and landrace plants from a worldwide collection. The same pla...
Resequencing Skills and Concepts in Applied Calculus Using the Computer as a Tool.
ERIC Educational Resources Information Center
Heid, M. Kathleen
1988-01-01
During the first 12 weeks of an applied calculus course, two classes of college students studied calculus concepts using graphical and symbol-manipulation computer programs to perform routine manipulations. Three weeks were spent on skill development. Students showed better understanding of concepts and performed almost as well on routine skills.…
USDA-ARS?s Scientific Manuscript database
Trichinella spiralis is a parasitic roundworm that infects domestic swine, rats and humans. Ingestion of infected pork by humans can lead to the potentially fatal disease trichinellosis. The phylogeny and historical dispersal of Trichinella spp. have been studied, in part, by sequencing portions of...
Universal Detection and Identification of Avian Influenza Virus by Use of Resequencing Microarrays
2009-04-01
For the RT step, primer LN was replaced by primer NLN (a random 9-mer with a linker se- quence). One picogram each of two internal controls (NAC1...samples (data not shown). These data indicated that most of the avian H5N1 samples identified were presumably sensitive to neuraminidase inhibitors
USDA-ARS?s Scientific Manuscript database
A small fast neutron mutant population has been established from Phaseolus vulgaris cv. Red Hawk. We leveraged the available P. vulgaris genome sequence and high throughput next generation DNA sequencing to examine the genomic structure of five Phaseolus vulgaris cv. Red Hawk fast neutron mutants wi...
Using expected sequence features to improve basecalling accuracy of amplicon pyrosequencing data.
Rask, Thomas S; Petersen, Bent; Chen, Donald S; Day, Karen P; Pedersen, Anders Gorm
2016-04-22
Amplicon pyrosequencing targets a known genetic region and thus inherently produces reads highly anticipated to have certain features, such as conserved nucleotide sequence, and in the case of protein coding DNA, an open reading frame. Pyrosequencing errors, consisting mainly of nucleotide insertions and deletions, are on the other hand likely to disrupt open reading frames. Such an inverse relationship between errors and expectation based on prior knowledge can be used advantageously to guide the process known as basecalling, i.e. the inference of nucleotide sequence from raw sequencing data. The new basecalling method described here, named Multipass, implements a probabilistic framework for working with the raw flowgrams obtained by pyrosequencing. For each sequence variant Multipass calculates the likelihood and nucleotide sequence of several most likely sequences given the flowgram data. This probabilistic approach enables integration of basecalling into a larger model where other parameters can be incorporated, such as the likelihood for observing a full-length open reading frame at the targeted region. We apply the method to 454 amplicon pyrosequencing data obtained from a malaria virulence gene family, where Multipass generates 20 % more error-free sequences than current state of the art methods, and provides sequence characteristics that allow generation of a set of high confidence error-free sequences. This novel method can be used to increase accuracy of existing and future amplicon sequencing data, particularly where extensive prior knowledge is available about the obtained sequences, for example in analysis of the immunoglobulin VDJ region where Multipass can be combined with a model for the known recombining germline genes. Multipass is available for Roche 454 data at http://www.cbs.dtu.dk/services/MultiPass-1.0 , and the concept can potentially be implemented for other sequencing technologies as well.
Shaw, Jennifer L A; Monis, Paul; Weyrich, Laura S; Sawade, Emma; Drikas, Mary; Cooper, Alan J
2015-09-01
Drinking water assessments use a variety of microbial, physical, and chemical indicators to evaluate water treatment efficiency and product water quality. However, these indicators do not allow the complex biological communities, which can adversely impact the performance of drinking water distribution systems (DWDSs), to be characterized. Entire bacterial communities can be studied quickly and inexpensively using targeted metagenomic amplicon sequencing. Here, amplicon sequencing of the 16S rRNA gene region was performed alongside traditional water quality measures to assess the health, quality, and efficiency of two distinct, full-scale DWDSs: (i) a linear DWDS supplied with unfiltered water subjected to basic disinfection before distribution and (ii) a complex, branching DWDS treated by a four-stage water treatment plant (WTP) prior to disinfection and distribution. In both DWDSs bacterial communities differed significantly after disinfection, demonstrating the effectiveness of both treatment regimes. However, bacterial repopulation occurred further along in the DWDSs, and some end-user samples were more similar to the source water than to the postdisinfection water. Three sample locations appeared to be nitrified, displaying elevated nitrate levels and decreased ammonia levels, and nitrifying bacterial species, such as Nitrospira, were detected. Burkholderiales were abundant in samples containing large amounts of monochloramine, indicating resistance to disinfection. Genera known to contain pathogenic and fecal-associated species were also identified in several locations. From this study, we conclude that metagenomic amplicon sequencing is an informative method to support current compliance-based methods and can be used to reveal bacterial community interactions with the chemical and physical properties of DWDSs. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Maus, Irena; Kim, Yong Sung; Wibberg, Daniel; Stolze, Yvonne; Off, Sandra; Antonczyk, Sebastian; Pühler, Alfred; Scherer, Paul; Schlüter, Andreas
2017-02-28
Process surveillance within agricultural biogas plants (BGPs) was concurrently studied by high-throughput 16S rRNA gene amplicon sequencing and an optimized quantitative microscopic fingerprinting (QMF) technique. In contrast to 16S rRNA gene amplicons, digitalized microscopy is a rapid and cost-effective method that facilitates enumeration and morphological differentiation of the most significant groups of methanogens regarding their shape and characteristic autofluorescent factor 420. Moreover, the fluorescence signal mirrors cell vitality. In this study, four different BGPs were investigated. The results indicated stable process performance in the mesophilic BGPs and in the thermophilic reactor. Bacterial subcommunity characterization revealed significant differences between the four BGPs. Most remarkably, the genera Defluviitoga and Halocella dominated the thermophilic bacterial subcommunity, whereas members of another taxon, Syntrophaceticus , were found to be abundant in the mesophilic BGP. The domain Archaea was dominated by the genus Methanoculleus in all four BGPs, followed by Methanosaeta in BGP1 and BGP3. In contrast, Methanothermobacter members were highly abundant in the thermophilic BGP4. Furthermore, a high consistency between the sequencing approach and the QMF method was shown, especially for the thermophilic BGP. The differences elucidated that using this biphasic approach for mesophilic BGPs provided novel insights regarding disaggregated single cells of Methanosarcina and Methanosaeta species. Both dominated the archaeal subcommunity and replaced coccoid Methanoculleus members belonging to the same group of Methanomicrobiales that have been frequently observed in similar BGPs. This work demonstrates that combining QMF and 16S rRNA gene amplicon sequencing is a complementary strategy to describe archaeal community structures within biogas processes.
Wahyuningsih, Hesty; K Cayami, Ferdy; Bahrudin, Udin; A Sobirin, Mochamad; Ep Mundhofir, Farmaditya; Mh Faradz, Sultana; Hisatome, Ichiro
2017-03-01
High resolution melting (HRM) is a post-PCR technique for variant screening and genotyping based on the different melting points of DNA fragments. The advantages of this technique are that it is fast, simple, and efficient and has a high output, particularly for screening of a large number of samples. APOA1 encodes apolipoprotein A1 (apoA1) which is a major component of high density lipoprotein cholesterol (HDL-C). This study aimed to obtain an optimal quantitative polymerase chain reaction (qPCR)-HRM condition for screening of APOA1 variance. Genomic DNA was isolated from a peripheral blood sample using the salting out method. APOA1 was amplified using the RotorGeneQ 5Plex HRM. The PCR product was visualized with the HRM amplification curve and confirmed using gel electrophoresis. The melting profile was confirmed by looking at the melting curve. Five sets of primers covering the translated region of APOA1 exons were designed with expected PCR product size of 100-400 bps. The amplified segments of DNA were amplicons 2, 3, 4A, 4B, and 4C. Amplicons 2, 3 and 4B were optimized at an annealing temperature of 60 °C at 40 PCR cycles. Amplicon 4A was optimized at an annealing temperature of 62 °C at 45 PCR cycles. Amplicon 4C was optimized at an annealing temperature of 63 °C at 50 PCR cycles. In addition to the suitable procedures of DNA isolation and quantification, primer design and an estimated PCR product size, the data of this study showed that appropriate annealing temperature and PCR cycles were important factors in optimization of HRM technique for variant screening in APOA1 .
Wahyuningsih, Hesty; K Cayami, Ferdy; Bahrudin, Udin; A Sobirin, Mochamad; EP Mundhofir, Farmaditya; MH Faradz, Sultana; Hisatome, Ichiro
2017-01-01
Background High resolution melting (HRM) is a post-PCR technique for variant screening and genotyping based on the different melting points of DNA fragments. The advantages of this technique are that it is fast, simple, and efficient and has a high output, particularly for screening of a large number of samples. APOA1 encodes apolipoprotein A1 (apoA1) which is a major component of high density lipoprotein cholesterol (HDL-C). This study aimed to obtain an optimal quantitative polymerase chain reaction (qPCR)-HRM condition for screening of APOA1 variance. Methods Genomic DNA was isolated from a peripheral blood sample using the salting out method. APOA1 was amplified using the RotorGeneQ 5Plex HRM. The PCR product was visualized with the HRM amplification curve and confirmed using gel electrophoresis. The melting profile was confirmed by looking at the melting curve. Results Five sets of primers covering the translated region of APOA1 exons were designed with expected PCR product size of 100–400 bps. The amplified segments of DNA were amplicons 2, 3, 4A, 4B, and 4C. Amplicons 2, 3 and 4B were optimized at an annealing temperature of 60 °C at 40 PCR cycles. Amplicon 4A was optimized at an annealing temperature of 62 °C at 45 PCR cycles. Amplicon 4C was optimized at an annealing temperature of 63 °C at 50 PCR cycles. Conclusion In addition to the suitable procedures of DNA isolation and quantification, primer design and an estimated PCR product size, the data of this study showed that appropriate annealing temperature and PCR cycles were important factors in optimization of HRM technique for variant screening in APOA1. PMID:28331418
Moreno, Lilliana I; Mills, DeEtta; Fetscher, Jill; John-Williams, Krista; Meadows-Jantz, Lee; McCord, Bruce
2011-03-01
The placement of cadavers in shallow, clandestine graves may alter the microbial and geochemical composition of the underlying and adjacent soils. Using amplicon length heterogeneity-PCR (LH-PCR) the microbial community changes in these soils can be assessed. In this investigation, nine different grave sites were examined over a period of 16weeks. The results indicated that measurable changes occurred in the soil bacterial community during the decomposition process. In this study, amplicons corresponding to anaerobic bacteria, not indigenous to the soil, were shown to produce differences between grave sites and control soils. Among the bacteria linked to these amplicons are those that are most often part of the commensal flora of the intestines, mouth and skin. In addition, over the 16week sampling interval, the level of indicator organisms (i.e., nitrogen fixing bacteria) dropped as the body decomposed and after four weeks of environmental exposure they began to increase again; thus differences in the abundance of nitrogen fixing bacteria were also found to contribute to the variation between controls and grave soils. These results were verified using primers that specifically targeted the nifH gene coding for nitrogenase reductase. LH-PCR provides a fast, robust and reproducible method to measure microbial changes in soil and could be used to determine potential cadaveric contact in a given area. The results obtained with this method could ultimately provide leads to investigators in criminal or missing person scenarios and allow for further analysis using human specific DNA assays to establish the identity of the buried body. Copyright © 2010 Elsevier B.V. All rights reserved.
Mitchell, Andrew
2015-09-01
Natural history museums are vastly underutilized as a source of material for DNA analysis because of perceptions about the limitations of DNA degradation in older specimens. Despite very few exceptions, most DNA barcoding projects, which aim to obtain sequence data from all species, generally use specimens collected specifically for that purpose, instead of the wealth of identified material in museums, constrained by the lack of suitable PCR methods. Any techniques that extend the utility of museum specimens for DNA analysis therefore are highly valuable. This study first tested the effects of specimen age and PCR amplicon size on PCR success rates in pinned insect specimens, then developed a PCR primer set and amplification strategy allowing greatly increased utilization of older museum specimens for DNA barcoding. PCR success rates compare favourably with the few published studies utilizing similar aged specimens, and this new strategy has the advantage of being easily automated for high-throughput laboratory workflows. The strategy uses hemi-nested, degenerate, M13-tailed PCR primers to amplify two overlapping amplicons, using two PCRs per amplicon (i.e. four PCRs per DNA sample). Initial PCR products are reamplified using an internal primer and a M13 primer. Together the two PCR amplicons yield 559 bp of the COI gene from Coleoptera, Lepidoptera, Diptera, Hemiptera, Odonata and presumably also other insects. BARCODE standard-compliant data were recovered from 67% (56 of 84) of specimens up to 25 years old, and 51% (102 of 197) of specimens up to 55 years old. Given the time, cost and specialist expertise required for fieldwork and identification, 'collecting in collections' is a viable alternative allowing researchers to capitalize on the knowledge captured by curation work in decades past. © 2015 John Wiley & Sons Ltd.
Song, Zhewei; Du, Hai; Zhang, Yan; Xu, Yan
2017-01-01
Fermentation microbiota is specific microorganisms that generate different types of metabolites in many productions. In traditional solid-state fermentation, the structural composition and functional capacity of the core microbiota determine the quality and quantity of products. As a typical example of food fermentation, Chinese Maotai-flavor liquor production involves a complex of various microorganisms and a wide variety of metabolites. However, the microbial succession and functional shift of the core microbiota in this traditional food fermentation remain unclear. Here, high-throughput amplicons (16S rRNA gene amplicon sequencing and internal transcribed space amplicon sequencing) and metatranscriptomics sequencing technologies were combined to reveal the structure and function of the core microbiota in Chinese soy sauce aroma type liquor production. In addition, ultra-performance liquid chromatography and headspace-solid phase microextraction-gas chromatography-mass spectrometry were employed to provide qualitative and quantitative analysis of the major flavor metabolites. A total of 10 fungal and 11 bacterial genera were identified as the core microbiota. In addition, metatranscriptomic analysis revealed pyruvate metabolism in yeasts (genera Pichia, Schizosaccharomyces, Saccharomyces , and Zygosaccharomyces ) and lactic acid bacteria (genus Lactobacillus ) classified into two stages in the production of flavor components. Stage I involved high-level alcohol (ethanol) production, with the genus Schizosaccharomyces serving as the core functional microorganism. Stage II involved high-level acid (lactic acid and acetic acid) production, with the genus Lactobacillus serving as the core functional microorganism. The functional shift from the genus Schizosaccharomyces to the genus Lactobacillus drives flavor component conversion from alcohol (ethanol) to acid (lactic acid and acetic acid) in Chinese Maotai-flavor liquor production. Our findings provide insight into the effects of the core functional microbiota in soy sauce aroma type liquor production and the characteristics of the fermentation microbiota under different environmental conditions.
2013-01-01
Background Besides the development of comprehensive tools for high-throughput 16S ribosomal RNA amplicon sequence analysis, there exists a growing need for protocols emphasizing alternative phylogenetic markers such as those representing eukaryotic organisms. Results Here we introduce CloVR-ITS, an automated pipeline for comparative analysis of internal transcribed spacer (ITS) pyrosequences amplified from metagenomic DNA isolates and representing fungal species. This pipeline performs a variety of steps similar to those commonly used for 16S rRNA amplicon sequence analysis, including preprocessing for quality, chimera detection, clustering of sequences into operational taxonomic units (OTUs), taxonomic assignment (at class, order, family, genus, and species levels) and statistical analysis of sample groups of interest based on user-provided information. Using ITS amplicon pyrosequencing data from a previous human gastric fluid study, we demonstrate the utility of CloVR-ITS for fungal microbiota analysis and provide runtime and cost examples, including analysis of extremely large datasets on the cloud. We show that the largest fractions of reads from the stomach fluid samples were assigned to Dothideomycetes, Saccharomycetes, Agaricomycetes and Sordariomycetes but that all samples were dominated by sequences that could not be taxonomically classified. Representatives of the Candida genus were identified in all samples, most notably C. quercitrusa, while sequence reads assigned to the Aspergillus genus were only identified in a subset of samples. CloVR-ITS is made available as a pre-installed, automated, and portable software pipeline for cloud-friendly execution as part of the CloVR virtual machine package (http://clovr.org). Conclusion The CloVR-ITS pipeline provides fungal microbiota analysis that can be complementary to bacterial 16S rRNA and total metagenome sequence analysis allowing for more comprehensive studies of environmental and host-associated microbial communities. PMID:24451270
USDA-ARS?s Scientific Manuscript database
High-density genetic linkage maps are essential for fine mapping QTLs controlling disease resistance traits, such as early leaf spot (ELS), late leaf spot (LLS), and Tomato spotted wilt virus (TSWV). With completion of the genome sequences of two diploid ancestors of cultivated peanut, we could use ...
Jennifer Yuzon; David M. Rizzo; Mathu Malar C; Sucheta Tripathy; Takao Kasuga
2017-01-01
Phytophthora ramorum has spread and diversified throughout Californiaâs northwestern coast since its introduction in the 1990s. Tracking the spread of P. ramorum and the functional response of the pathogen to the environment is of particular interest to managing the epidemic. Using genetic tools such as microsatellite...
75 FR 28032 - National Heart, Lung, and Blood Institute; Notice of Closed Meetings
Federal Register 2010, 2011, 2012, 2013, 2014
2010-05-19
... Coordinating Center. Date: June 4, 2010. Time: 1 p.m. to 2 p.m. Agenda: To review and evaluate contract..., NHLBI DNA Resequencing and Genotyping (RS&G) Service: Laboratory Center(s). Date: June 4, 2010. Time: 2... Career Enhancement Awards. Date: June 8-9, 2010. Time: 8 a.m. to 5 p.m. Agenda: To review and evaluate...
USDA-ARS?s Scientific Manuscript database
The objective of this study was to develop a canonical SNP panel for subtyping of Shiga-toxin producing Escherichia coli (STEC). To this purpose, 906 putative SNPs were identified using resequencing tiling arrays. A subset of 391 SNPs was further screened using high-throughput TaqMan PCR against a d...
USDA-ARS?s Scientific Manuscript database
A haplotype on cattle chromosome 5 carrying a recessive lethal allele was found to originate in a Holstein-Friesian foundation sire. Resequencing led to the identification of a stop-gain mutation in exon 11 of APAF1, a gene known to cause embryonic lethality and neurodevelopmental abnormalities in ...
Applications of microarray technology in breast cancer research
Cooper, Colin S
2001-01-01
Microarrays provide a versatile platform for utilizing information from the Human Genome Project to benefit human health. This article reviews the ways in which microarray technology may be used in breast cancer research. Its diverse applications include monitoring chromosome gains and losses, tumour classification, drug discovery and development, DNA resequencing, mutation detection and investigating the mechanism of tumour development. PMID:11305951
USDA-ARS?s Scientific Manuscript database
The soybean Consensus Map 4.0 facilitated the anchoring of 95.6% of the soybean whole genome sequence developed by the Joint Genome Institute, Department of Energy but only properly oriented 66% of the sequence scaffolds. To find additional single nucleotide polymorphism (SNP) markers for additiona...
USDA-ARS?s Scientific Manuscript database
The objective of this study was to identify single nucleotide polymorphisms (SNP) associated to fertility in female cows raised under a subtropical environment. Re-sequencing of 9 genes associated to GH-IGF endocrine pathway located in bovine chromosome 5, identified 75 SNP useful for associative ge...
Nucleotide diversity and linkage disequilibrium in wild avocado (Persea americana Mill.).
Chen, Haofeng; Morrell, Peter L; de la Cruz, Marlene; Clegg, Michael T
2008-01-01
Resequencing studies provide the ultimate resolution of genetic diversity because they identify all mutations in a gene that are present within the sampled individuals. We report a resequencing study of Persea americana, a subtropical tree species native to Meso- and Central America and the progenitor of cultivated avocado. The sample includes 21 wild accessions from Mexico, Costa Rica, Ecuador, and the Dominican Republic. Estimated levels of nucleotide polymorphism and linkage disequilibrium (LD) are obtained from fully resolved haplotype data from 4 nuclear loci that span 5960 nucleotide sites. Results show that, although avocado is a subtropical tree crop and a predominantly outcrossing plant, the overall level of genetic variation is not exceptionally high (nucleotide diversity at silent sites, pi(sil) = 0.0102) compared with available estimates from temperate plant species. Intralocus LD decays rapidly to half the initial value within about 1 kb. Estimates of recombination rate (based on the sequence data) show that the rate is not exceptionally high when compared with annual plants such as wild barley or maize. Interlocus LD is significant owing to substantial population structure induced by mixing of the 3 botanical races of avocado.
LipidSeq: a next-generation clinical resequencing panel for monogenic dyslipidemias.
Johansen, Christopher T; Dubé, Joseph B; Loyzer, Melissa N; MacDonald, Austin; Carter, David E; McIntyre, Adam D; Cao, Henian; Wang, Jian; Robinson, John F; Hegele, Robert A
2014-04-01
We report the design of a targeted resequencing panel for monogenic dyslipidemias, LipidSeq, for the purpose of replacing Sanger sequencing in the clinical detection of dyslipidemia-causing variants. We also evaluate the performance of the LipidSeq approach versus Sanger sequencing in 84 patients with a range of phenotypes including extreme blood lipid concentrations as well as additional dyslipidemias and related metabolic disorders. The panel performs well, with high concordance (95.2%) in samples with known mutations based on Sanger sequencing and a high detection rate (57.9%) of mutations likely to be causative for disease in samples not previously sequenced. Clinical implementation of LipidSeq has the potential to aid in the molecular diagnosis of patients with monogenic dyslipidemias with a high degree of speed and accuracy and at lower cost than either Sanger sequencing or whole exome sequencing. Furthermore, LipidSeq will help to provide a more focused picture of monogenic and polygenic contributors that underlie dyslipidemia while excluding the discovery of incidental pathogenic clinically actionable variants in nonmetabolism-related genes, such as oncogenes, that would otherwise be identified by a whole exome approach, thus minimizing potential ethical issues.
LipidSeq: a next-generation clinical resequencing panel for monogenic dyslipidemias[S
Johansen, Christopher T.; Dubé, Joseph B.; Loyzer, Melissa N.; MacDonald, Austin; Carter, David E.; McIntyre, Adam D.; Cao, Henian; Wang, Jian; Robinson, John F.; Hegele, Robert A.
2014-01-01
We report the design of a targeted resequencing panel for monogenic dyslipidemias, LipidSeq, for the purpose of replacing Sanger sequencing in the clinical detection of dyslipidemia-causing variants. We also evaluate the performance of the LipidSeq approach versus Sanger sequencing in 84 patients with a range of phenotypes including extreme blood lipid concentrations as well as additional dyslipidemias and related metabolic disorders. The panel performs well, with high concordance (95.2%) in samples with known mutations based on Sanger sequencing and a high detection rate (57.9%) of mutations likely to be causative for disease in samples not previously sequenced. Clinical implementation of LipidSeq has the potential to aid in the molecular diagnosis of patients with monogenic dyslipidemias with a high degree of speed and accuracy and at lower cost than either Sanger sequencing or whole exome sequencing. Furthermore, LipidSeq will help to provide a more focused picture of monogenic and polygenic contributors that underlie dyslipidemia while excluding the discovery of incidental pathogenic clinically actionable variants in nonmetabolism-related genes, such as oncogenes, that would otherwise be identified by a whole exome approach, thus minimizing potential ethical issues. PMID:24503134
The genome of Eucalyptus grandis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Myburg, Alexander A.; Grattapaglia, Dario; Tuskan, Gerald A.
Eucalypts are the world s most widely planted hardwood trees. Their broad adaptability, rich species diversity, fast growth and superior multipurpose wood, have made them a global renewable resource of fiber and energy that mitigates human pressures on natural forests. We sequenced and assembled >94% of the 640 Mbp genome of Eucalyptus grandis into its 11 chromosomes. A set of 36,376 protein coding genes were predicted revealing that 34% occur in tandem duplications, the largest proportion found thus far in any plant genome. Eucalypts also show the highest diversity of genes for plant specialized metabolism that act as chemical defencemore » against biotic agents and provide unique pharmaceutical oils. Resequencing of a set of inbred tree genomes revealed regions of strongly conserved heterozygosity, likely hotspots of inbreeding depression. The resequenced genome of the sister species E. globulus underscored the high inter-specific genome colinearity despite substantial genome size variation in the genus. The genome of E. grandis is the first reference for the early diverging Rosid order Myrtales and is placed here basal to the Eurosids. This resource expands knowledge on the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.« less
Hilger, Alina C; Halbritter, Jan; Pennimpede, Tracie; van der Ven, Amelie; Sarma, Georgia; Braun, Daniela A; Porath, Jonathan D; Kohl, Stefan; Hwang, Daw-Yang; Dworschak, Gabriel C; Hermann, Bernhard G; Pavlova, Anna; El-Maarri, Osman; Nöthen, Markus M; Ludwig, Michael; Reutter, Heiko; Hildebrandt, Friedhelm
2015-12-01
The VATER/VACTERL association describes the combination of congenital anomalies including vertebral defects, anorectal malformations, cardiac defects, tracheoesophageal fistula with or without esophageal atresia, renal malformations, and limb defects. As mutations in ciliary genes were observed in diseases related to VATER/VACTERL, we performed targeted resequencing of 25 ciliary candidate genes as well as disease-associated genes (FOXF1, HOXD13, PTEN, ZIC3) in 123 patients with VATER/VACTERL or VATER/VACTERL-like phenotype. We detected no biallelic mutation in any of the 25 ciliary candidate genes; however, identified an identical, probably disease-causing ZIC3 missense mutation (p.Gly17Cys) in four patients and a FOXF1 de novo mutation (p.Gly220Cys) in a further patient. In situ hybridization analyses in mouse embryos between E9.5 and E14.5 revealed Zic3 expression in limb and prevertebral structures, and Foxf1 expression in esophageal, tracheal, vertebral, anal, and genital tubercle tissues, hence VATER/VACTERL organ systems. These data provide strong evidence that mutations in ZIC3 or FOXF1 contribute to VATER/VACTERL. © 2015 WILEY PERIODICALS, INC.
Non-coding variants contribute to the clinical heterogeneity of TTR amyloidosis.
Iorio, Andrea; De Lillo, Antonella; De Angelis, Flavio; Di Girolamo, Marco; Luigetti, Marco; Sabatelli, Mario; Pradotto, Luca; Mauro, Alessandro; Mazzeo, Anna; Stancanelli, Claudia; Perfetto, Federico; Frusconi, Sabrina; My, Filomena; Manfellotto, Dario; Fuciarelli, Maria; Polimanti, Renato
2017-09-01
Coding mutations in TTR gene cause a rare hereditary form of systemic amyloidosis, which has a complex genotype-phenotype correlation. We investigated the role of non-coding variants in regulating TTR gene expression and consequently amyloidosis symptoms. We evaluated the genotype-phenotype correlation considering the clinical information of 129 Italian patients with TTR amyloidosis. Then, we conducted a re-sequencing of TTR gene to investigate how non-coding variants affect TTR expression and, consequently, phenotypic presentation in carriers of amyloidogenic mutations. Polygenic scores for genetically determined TTR expression were constructed using data from our re-sequencing analysis and the GTEx (Genotype-Tissue Expression) project. We confirmed a strong phenotypic heterogeneity across coding mutations causing TTR amyloidosis. Considering the effects of non-coding variants on TTR expression, we identified three patient clusters with specific expression patterns associated with certain phenotypic presentations, including late onset, autonomic neurological involvement, and gastrointestinal symptoms. This study provides novel data regarding the role of non-coding variation and the gene expression profiles in patients affected by TTR amyloidosis, also putting forth an approach that could be used to investigate the mechanisms at the basis of the genotype-phenotype correlation of the disease.
Is the child 'father of the man'? evaluating the stability of genetic influences across development.
Ronald, Angelica
2011-11-01
This selective review considers findings in genetic research that have shed light on how genes operate across development. We will address the question of whether the child is 'father of the Man' from a genetic perspective. In other words, do the same genetic influences affect the same traits across development? Using a 'taster menu' approach and prioritizing newer findings on cognitive and behavioral traits, examples from the following genetic disciplines will be discussed: (a) developmental quantitative genetics (such as longitudinal twin studies), (b) neurodevelopmental genetic syndromes with known genetic causes (such as Williams syndrome), (c) developmental candidate gene studies (such as those that link infant and adult populations), (d) developmental genome-wide association studies (GWAS), and (e) DNA resequencing. Evidence presented here suggests that there is considerable genetic stability of cognitive and behavioral traits across development, but there is also evidence for genetic change. Quantitative genetic studies have a long history of assessing genetic continuity and change across development. It is now time for the newer, more technology-enabled fields such as GWAS and DNA resequencing also to take on board the dynamic nature of human behavior. 2011 Blackwell Publishing Ltd.
High-throughput discovery of rare human nucleotide polymorphisms by Ecotilling
Till, Bradley J.; Zerr, Troy; Bowers, Elisabeth; Greene, Elizabeth A.; Comai, Luca; Henikoff, Steven
2006-01-01
Human individuals differ from one another at only ∼0.1% of nucleotide positions, but these single nucleotide differences account for most heritable phenotypic variation. Large-scale efforts to discover and genotype human variation have been limited to common polymorphisms. However, these efforts overlook rare nucleotide changes that may contribute to phenotypic diversity and genetic disorders, including cancer. Thus, there is an increasing need for high-throughput methods to robustly detect rare nucleotide differences. Toward this end, we have adapted the mismatch discovery method known as Ecotilling for the discovery of human single nucleotide polymorphisms. To increase throughput and reduce costs, we developed a universal primer strategy and implemented algorithms for automated band detection. Ecotilling was validated by screening 90 human DNA samples for nucleotide changes in 5 gene targets and by comparing results to public resequencing data. To increase throughput for discovery of rare alleles, we pooled samples 8-fold and found Ecotilling to be efficient relative to resequencing, with a false negative rate of 5% and a false discovery rate of 4%. We identified 28 new rare alleles, including some that are predicted to damage protein function. The detection of rare damaging mutations has implications for models of human disease. PMID:16893952
Turner, Thomas L.; Stewart, Andrew D.; Fields, Andrew T.; Rice, William R.; Tarone, Aaron M.
2011-01-01
Body size is a classic quantitative trait with evolutionarily significant variation within many species. Locating the alleles responsible for this variation would help understand the maintenance of variation in body size in particular, as well as quantitative traits in general. However, successful genome-wide association of genotype and phenotype may require very large sample sizes if alleles have low population frequencies or modest effects. As a complementary approach, we propose that population-based resequencing of experimentally evolved populations allows for considerable power to map functional variation. Here, we use this technique to investigate the genetic basis of natural variation in body size in Drosophila melanogaster. Significant differentiation of hundreds of loci in replicate selection populations supports the hypothesis that the genetic basis of body size variation is very polygenic in D. melanogaster. Significantly differentiated variants are limited to single genes at some loci, allowing precise hypotheses to be formed regarding causal polymorphisms, while other significant regions are large and contain many genes. By using significantly associated polymorphisms as a priori candidates in follow-up studies, these data are expected to provide considerable power to determine the genetic basis of natural variation in body size. PMID:21437274
Droplet-based pyrosequencing using digital microfluidics.
Boles, Deborah J; Benton, Jonathan L; Siew, Germaine J; Levy, Miriam H; Thwar, Prasanna K; Sandahl, Melissa A; Rouse, Jeremy L; Perkins, Lisa C; Sudarsan, Arjun P; Jalili, Roxana; Pamula, Vamsee K; Srinivasan, Vijay; Fair, Richard B; Griffin, Peter B; Eckhardt, Allen E; Pollack, Michael G
2011-11-15
The feasibility of implementing pyrosequencing chemistry within droplets using electrowetting-based digital microfluidics is reported. An array of electrodes patterned on a printed-circuit board was used to control the formation, transportation, merging, mixing, and splitting of submicroliter-sized droplets contained within an oil-filled chamber. A three-enzyme pyrosequencing protocol was implemented in which individual droplets contained enzymes, deoxyribonucleotide triphosphates (dNTPs), and DNA templates. The DNA templates were anchored to magnetic beads which enabled them to be thoroughly washed between nucleotide additions. Reagents and protocols were optimized to maximize signal over background, linearity of response, cycle efficiency, and wash efficiency. As an initial demonstration of feasibility, a portion of a 229 bp Candida parapsilosis template was sequenced using both a de novo protocol and a resequencing protocol. The resequencing protocol generated over 60 bp of sequence with 100% sequence accuracy based on raw pyrogram levels. Excellent linearity was observed for all of the homopolymers (two, three, or four nucleotides) contained in the C. parapsilosis sequence. With improvements in microfluidic design it is expected that longer reads, higher throughput, and improved process integration (i.e., "sample-to-sequence" capability) could eventually be achieved using this low-cost platform.
Chung, Won-Hyong; Jeong, Namhee; Kim, Jiwoong; Lee, Woo Kyu; Lee, Yun-Gyeong; Lee, Sang-Heon; Yoon, Woongchang; Kim, Jin-Hyun; Choi, Ik-Young; Choi, Hong-Kyu; Moon, Jung-Kyung; Kim, Namshin; Jeong, Soon-Chun
2014-01-01
Despite the importance of soybean as a major crop, genome-wide variation and evolution of cultivated soybeans are largely unknown. Here, we catalogued genome variation in an annual soybean population by high-depth resequencing of 10 cultivated and 6 wild accessions and obtained 3.87 million high-quality single-nucleotide polymorphisms (SNPs) after excluding the sites with missing data in any accession. Nuclear genome phylogeny supported a single origin for the cultivated soybeans. We identified 10-fold longer linkage disequilibrium (LD) in the wild soybean relative to wild maize and rice. Despite the small population size, the long LD and large SNP data allowed us to identify 206 candidate domestication regions with significantly lower diversity in the cultivated, but not in the wild, soybeans. Some of the genes in these candidate regions were associated with soybean homologues of canonical domestication genes. However, several examples, which are likely specific to soybean or eudicot crop plants, were also observed. Consequently, the variation data identified in this study should be valuable for breeding and for identifying agronomically important genes in soybeans. However, the long LD of wild soybeans may hinder pinpointing causal gene(s) in the candidate regions. PMID:24271940
Miyatake, Satoko; Koshimizu, Eriko; Hayashi, Yukiko K; Miya, Kazushi; Shiina, Masaaki; Nakashima, Mitsuko; Tsurusaki, Yoshinori; Miyake, Noriko; Saitsu, Hirotomo; Ogata, Kazuhiro; Nishino, Ichizo; Matsumoto, Naomichi
2014-07-01
When an expected mutation in a particular disease-causing gene is not identified in a suspected carrier, it is usually assumed to be due to germline mosaicism. We report here very-low-grade somatic mosaicism in ACTA1 in an unaffected mother of two siblings affected with a neonatal form of nemaline myopathy. The mosaicism was detected by deep resequencing using a next-generation sequencer. We identified a novel heterozygous mutation in ACTA1, c.448A>G (p.Thr150Ala), in the affected siblings. Three-dimensional structural modeling suggested that this mutation may affect polymerization and/or actin's interactions with other proteins. In this family, we expected autosomal dominant inheritance with either parent demonstrating germline or somatic mosaicism. Sanger sequencing identified no mutation. However, further deep resequencing of this mutation on a next-generation sequencer identified very-low-grade somatic mosaicism in the mother: 0.4%, 1.1%, and 8.3% in the saliva, blood leukocytes, and nails, respectively. Our study demonstrates the possibility of very-low-grade somatic mosaicism in suspected carriers, rather than germline mosaicism. Copyright © 2014 Elsevier B.V. All rights reserved.
Droplet-Based Pyrosequencing Using Digital Microfluidics
Boles, Deborah J.; Benton, Jonathan L.; Siew, Germaine J.; Levy, Miriam H.; Thwar, Prasanna K.; Sandahl, Melissa A.; Rouse, Jeremy L.; Perkins, Lisa C.; Sudarsan, Arjun P.; Jalili, Roxana; Pamula, Vamsee K.; Srinivasan, Vijay; Fair, Richard B.; Griffin, Peter B.; Eckhardt, Allen E.; Pollack, Michael G.
2013-01-01
The feasibility of implementing pyrosequencing chemistry within droplets using electrowetting-based digital microfluidics is reported. An array of electrodes patterned on a printed-circuit board was used to control the formation, transportation, merging, mixing, and splitting of submicroliter-sized droplets contained within an oil-filled chamber. A three-enzyme pyrosequencing protocol was implemented in which individual droplets contained enzymes, deoxyribonucleotide triphosphates (dNTPs), and DNA templates. The DNA templates were anchored to magnetic beads which enabled them to be thoroughly washed between nucleotide additions. Reagents and protocols were optimized to maximize signal over background, linearity of response, cycle efficiency, and wash efficiency. As an initial demonstration of feasibility, a portion of a 229 bp Candida parapsilosis template was sequenced using both a de novo protocol and a resequencing protocol. The resequencing protocol generated over 60 bp of sequence with 100% sequence accuracy based on raw pyrogram levels. Excellent linearity was observed for all of the homopolymers (two, three, or four nucleotides) contained in the C. parapsilosis sequence. With improvements in microfluidic design it is expected that longer reads, higher throughput, and improved process integration (i.e., “sample-to-sequence” capability) could eventually be achieved using this low-cost platform. PMID:21932784
Duan, Naibin; Bai, Yang; Sun, Honghe; Wang, Nan; Ma, Yumin; Li, Mingjun; Wang, Xin; Jiao, Chen; Legall, Noah; Mao, Linyong; Wan, Sibao; Wang, Kun; He, Tianming; Feng, Shouqian; Zhang, Zongying; Mao, Zhiquan; Shen, Xiang; Chen, Xiaoliu; Jiang, Yuanmao; Wu, Shujing; Yin, Chengmiao; Ge, Shunfeng; Yang, Long; Jiang, Shenghui; Xu, Haifeng; Liu, Jingxuan; Wang, Deyun; Qu, Changzhi; Wang, Yicheng; Zuo, Weifang; Xiang, Li; Liu, Chang; Zhang, Daoyuan; Gao, Yuan; Xu, Yimin; Xu, Kenong; Chao, Thomas; Fazio, Gennaro; Shu, Huairui; Zhong, Gan-Yuan; Cheng, Lailiang; Fei, Zhangjun; Chen, Xuesen
2017-08-15
Human selection has reshaped crop genomes. Here we report an apple genome variation map generated through genome sequencing of 117 diverse accessions. A comprehensive model of apple speciation and domestication along the Silk Road is proposed based on evidence from diverse genomic analyses. Cultivated apples likely originate from Malus sieversii in Kazakhstan, followed by intensive introgressions from M. sylvestris. M. sieversii in Xinjiang of China turns out to be an "ancient" isolated ecotype not directly contributing to apple domestication. We have identified selective sweeps underlying quantitative trait loci/genes of important fruit quality traits including fruit texture and flavor, and provide evidences supporting a model of apple fruit size evolution comprising two major events with one occurring prior to domestication and the other during domestication. This study outlines the genetic basis of apple domestication and evolution, and provides valuable information for facilitating marker-assisted breeding and apple improvement.Apple is one of the most important fruit crops. Here, the authors perform deep genome resequencing of 117 diverse accessions and reveal comprehensive models of apple origin, speciation, domestication, and fruit size evolution as well as candidate genes associated with important agronomic traits.
Borges, Sofia; Cravo, Pedro; Creasey, Alison; Fawcett, Richard; Modrzynska, Katarzyna; Rodrigues, Louise; Martinelli, Axel; Hunt, Paul
2011-01-01
Multidrug-resistant Plasmodium falciparum malaria parasites pose a threat to effective drug control, even to artemisinin-based combination therapies (ACTs). Here we used linkage group selection and Solexa whole-genome resequencing to investigate the genetic basis of resistance to component drugs of ACTs. Using the rodent malaria parasite P. chabaudi, we analyzed the uncloned progeny of a genetic backcross between the mefloquine-, lumefantrine-, and artemisinin-resistant mutant AS-15MF and a genetically distinct sensitive clone, AJ, following drug treatment. Genomewide scans of selection showed that parasites surviving each drug treatment bore a duplication of a segment of chromosome 12 (translocated to chromosome 04) present in AS-15MF. Whole-genome resequencing identified the size of the duplicated segment and its position on chromosome 4. The duplicated fragment extends for ∼393 kbp and contains over 100 genes, including mdr1, encoding the multidrug resistance P-glycoprotein homologue 1. We therefore show that resistance to chemically distinct components of ACTs is mediated by the same genetic mutation, highlighting a possible limitation of these therapies. PMID:21709099
Fu, Yong-Bi
2012-01-01
Cultivated flax (Linum usitatissimum L.) is the earliest oil and fiber crop and its early domestication history may involve multiple events of domestication for oil, fiber, capsular indehiscence, and winter hardiness. Genetic studies have demonstrated that winter cultivated flax is closely related to oil and fiber cultivated flax and shows little relatedness to its progenitor, pale flax (L. bienne Mill.), but winter hardiness is one major characteristic of pale flax. Here, we assessed the genetic relationships of 48 Linum samples representing pale flax and four trait-specific groups of cultivated flax (dehiscent, fiber, oil, and winter) through population-based resequencing at 24 genomic regions, and revealed a winter group of cultivated flax that displayed close relatedness to the pale flax samples. Overall, the cultivated flax showed a 27% reduction of nucleotide diversity when compared with the pale flax. Recombination frequently occurred at these sampled genomic regions, but the signal of selection and bottleneck was relatively weak. These findings provide some insight into the impact and processes of flax domestication and are significant for expanding our knowledge about early flax domestication, particularly for winter hardiness. PMID:22822439
Podar, Mircea; Shakya, Migun; D'Amore, Rosalinda; ...
2016-01-14
In the last 5 years, the rapid pace of innovations and improvements in sequencing technologies has completely changed the landscape of metagenomic and metagenetic experiments. Therefore, it is critical to benchmark the various methodologies for interrogating the composition of microbial communities, so that we can assess their strengths and limitations. Here, the most common phylogenetic marker for microbial community diversity studies is the 16S ribosomal RNA gene and in the last 10 years the field has moved from sequencing a small number of amplicons and samples to more complex studies where thousands of samples and multiple different gene regions aremore » interrogated.« less
Sonkar, Subash C; Sachdev, Divya; Mishra, Prashant K; Kumar, Anita; Mittal, Pratima; Saluja, Daman
2016-12-15
The currently available nucleic acid amplification tests (NAATs) for trichomoniasis are accurate, quick and confirmative with superior sensitivity than traditional culture-based microbiology assays. However, these assays are associated with problems of carry over contamination, false positive results, requirement of technical expertise for performance and detection of end product. Hence, a diagnostic assay with easy visualization of the amplified product will be profitable. An in-house, rapid, sensitive, specific molecular-beacon-based PCR assay, using primers against pfoB gene of Trichomonas vaginalis, was developed and evaluated using dry ectocervical swabs (n=392) from symptomatic females with vaginal discharge. Total DNA was isolated and used as template for the PCR assays. The performance and reproducibility of PCR assay was evaluated by composite reference standard (CRS). For easy visualization of the amplified product, molecular-beacon was designed and amplicons were visualized directly using fluorescent handheld dark reader or by Micro-Plate Reader. Molecular-beacons are single-stranded hairpin shaped nucleic acid probes composed of a stem, with fluorophore/quencher pair and a loop region complementary to the desired DNA. The beacon-based PCR assay designed in the present study is highly specific as confirmed by competition experiments and extremely sensitive with detection limit of 20fg of genomic DNA (3-4 pathogens). The minimum infrastructure requirement and ease to perform the assay makes this method highly useful for resource poor countries for better disease management. Copyright © 2016 Elsevier B.V. All rights reserved.
Hill, Richard; Saetnan, Eli R; Scullion, John; Gwynn-Jones, Dylan; Ostle, Nick; Edwards, Arwyn
2016-06-01
Microbial responses to Arctic climate change could radically alter the stability of major stores of soil carbon. However, the sensitivity of plot-scale experiments simulating climate change effects on Arctic heathland soils to potential confounding effects of spatial and temporal changes in soil microbial communities is unknown. Here, the variation in heathland soil bacterial communities at two survey sites in Sweden between spring and summer 2013 and at scales between 0-1 m and, 1-100 m and between sites (> 100 m) were investigated in parallel using 16S rRNA gene T-RFLP and amplicon sequencing. T-RFLP did not reveal spatial structuring of communities at scales < 100 m in any site or season. However, temporal changes were striking. Amplicon sequencing corroborated shifts from r- to K-selected taxon-dominated communities, influencing in silico predictions of functional potential. Network analyses reveal temporal keystone taxa, with a spring betaproteobacterial sub-network centred upon a Burkholderia operational taxonomic unit (OTU) and a reconfiguration to a summer sub-network centred upon an alphaproteobacterial OTU. Although spatial structuring effects may not confound comparison between plot-scale treatments, temporal change is a significant influence. Moreover, the prominence of two temporally exclusive keystone taxa suggests that the stability of Arctic heathland soil bacterial communities could be disproportionally influenced by seasonal perturbations affecting individual taxa. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Molecular detection of two adenoviruses associated with disease in Australian lizards.
Hyndman, T; Shilton, C M
2011-06-01
We give the first published description of the pathology and molecular findings associated with adenovirus infection in lizards in Australia. A central netted dragon (Ctenophorus nuchalis) exhibited severe necrotising hepatitis with abundant intranuclear inclusion bodies within hepatocytes and rarely within intestinal epithelial cells. Polymerase chain reaction (PCR) using pooled tissues yielded an amplicon that shared strong nucleotide identity with an agamid adenovirus (EU914203). PCR on the liver of a bearded dragon (Pogona minor minor) with illthrift, coccidiosis, nematodiasis and hepatic lipidosis yielded an amplicon with strong nucleotide identity to a helodermatid adenovirus (EU914207). © 2011 The Authors. Australian Veterinary Journal © 2011 Australian Veterinary Association.
PCR-Based Method for the Detection of Toxic Mushrooms Causing Food-Poisoning Incidents.
Nomura, Chie; Masayama, Atsushi; Yamaguchi, Mizuka; Sakuma, Daisuke; Kajimura, Keiji
2017-01-01
In this study, species-specific identification of five toxic mushrooms, Chlorophyllum molybdites, Gymnopilus junonius, Hypholoma fasciculare, Pleurocybella porrigens, and Tricholoma ustale, which have been involved in food-poisoning incidents in Japan, was investigated. Specific primer pairs targeting internal transcribed spacer (ITS) regions were designed for PCR detection. The specific amplicons were obtained from fresh, cooked, and simulated gastric fluid (SGF)-treated samples. No amplicons were detected from other mushrooms with similar morphology. Our method using one-step extraction of mushrooms allows rapid detection within 2.5 hr. It could be utilized for rapid identification or screening of toxic mushrooms.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Il`inskaya, G.V.; Kopnin, B.P.; Demidova, N.S.
1995-10-01
Previously, we showed that development of multidrug resistance (MDR) in mouse P388 leukemia cells is often associated with the appearance of newly-formed chromosomelike structures that contain amplified copies of the mdr1 gene. In the present study, we compared amplicon content in P388 sublines showing different types of these structures. A strong correlation between the formation of specific acentric markers consisting of two identical arms and the absence of the sorcin gene coamplification was found. In all the sublines containing other types of chromosomelike structures, the sorcin gene is coamplified. 9 refs., 2 figs., 1 tab.
Weusten, Jos J A M; Carpay, Wim M; Oosterlaken, Tom A M; van Zuijlen, Martien C A; van de Wiel, Paul A
2002-03-15
For quantitative NASBA-based viral load assays using homogeneous detection with molecular beacons, such as the NucliSens EasyQ HIV-1 assay, a quantitation algorithm is required. During the amplification process there is a constant growth in the concentration of amplicons to which the beacon can bind while generating a fluorescence signal. The overall fluorescence curve contains kinetic information on both amplicon formation and beacon binding, but only the former is relevant for quantitation. In the current paper, mathematical modeling of the relevant processes is used to develop an equation describing the fluorescence curve as a function of the amplification time and the relevant kinetic parameters. This equation allows reconstruction of RNA formation, which is characterized by an exponential increase in concentrations as long as the primer concentrations are not rate limiting and by linear growth over time after the primer pool is depleted. During the linear growth phase, the actual quantitation is based on assessing the amplicon formation rate from the viral RNA relative to that from a fixed amount of calibrator RNA. The quantitation procedure has been successfully applied in the NucliSens EasyQ HIV-1 assay.
Rota, Rosana P; Palacios, Carlos A; Temprana, C Facundo; Argüelles, Marcelo H; Mandile, Marcelo G; Mattion, Nora; Laimbacher, Andrea S; Fraefel, Cornell; Castello, Alejandro A; Glikmann, Graciela
2018-06-01
Group C Rotavirus (RVC) has been associated globally with sporadic outbreaks of gastroenteritis in children and adults. RVC also infects animals, and interspecies transmission has been reported as well as its zoonotic potential. Considering its genetic diversity and the absence of effective vaccines, it is important and necessary to develop new generation vaccines against RVC for both humans and animals. The aim of the present study was to develop and characterize an HSV-1-based amplicon vector expressing a human RVC-VP6 protein and evaluate the humoral immune response induced after immunizing BALB/c mice. Local fecal samples positive for RVC were used for isolation and sequencing of the vp6 gene, which phylogenetically belongs to the I2 genotype. We show here that cells infected with the HSV[VP6C] amplicon vector efficiently express the VP6 protein, and induced specific anti-RVC antibodies in mice immunized with HSV[VP6C], in a prime-boost schedule. This work highlights that amplicon vectors are an attractive platform for the generation of safe genetic immunogens against RVC, without the addition of external adjuvants. Copyright © 2018 Elsevier B.V. All rights reserved.
Bintz, Brittania J; Dixon, Groves B; Wilson, Mark R
2014-07-01
Next-generation sequencing technologies enable the identification of minor mitochondrial DNA variants with higher sensitivity than Sanger methods, allowing for enhanced identification of minor variants. In this study, mixtures of human mtDNA control region amplicons were subjected to pyrosequencing to determine the detection threshold of the Roche GS Junior(®) instrument (Roche Applied Science, Indianapolis, IN). In addition to expected variants, a set of reproducible variants was consistently found in reads from one particular amplicon. A BLASTn search of the variant sequence revealed identity to a segment of a 611-bp nuclear insertion of the mitochondrial control region (NumtS) spanning the primer-binding sites of this amplicon (Nature 1995;378:489). Primers (Hum Genet 2012;131:757; Hum Biol 1996;68:847) flanking the insertion were used to confirm the presence or absence of the NumtS in buccal DNA extracts from twenty donors. These results further our understanding of human mtDNA variation and are expected to have a positive impact on the interpretation of mtDNA profiles using deep-sequencing methods in casework. © 2014 American Academy of Forensic Sciences.
Herpes simplex virus type 1-derived recombinant and amplicon vectors.
Fraefel, Cornel; Marconi, Peggy; Epstein, Alberto L
2011-01-01
Herpes simplex virus type 1 (HSV-1) is a human pathogen whose lifestyle is based on a long-term dual interaction with the infected host, being able to establish both lytic and latent infections. The virus genome is a 153 kbp double-stranded DNA molecule encoding more than 80 genes. The interest of HSV-1 as gene transfer vector stems from its ability to infect many different cell types, both quiescent and proliferating cells, the very high packaging capacity of the virus capsid, the outstanding neurotropic adaptations that this virus has evolved, and the fact that it never integrates into the cellular chromosomes, thus avoiding the risk of insertional mutagenesis. Two types of vectors can be derived from HSV-1, recombinant vectors and amplicon vectors, and different methodologies have been developed to prepare large stocks of each type of vector. This chapter summarizes (1) the two approaches most commonly used to prepare recombinant vectors through homologous recombination, either in eukaryotic cells or in bacteria, and (2) the two methodologies currently used to generate helper-free amplicon vectors, either using a bacterial artificial chromosome (BAC)-based approach or a Cre/loxP site-specific recombination strategy.
Moussavou-Boundzanga, Pamela; Koumakpayi, Ismaël Hervé; Labouba, Ingrid; Leroy, Eric M; Belembaogo, Ernest; Berthet, Nicolas
2017-12-21
Cervical cancer is the fourth most common malignancy in women worldwide. However, screening with human papillomavirus (HPV) molecular tests holds promise for reducing cervical cancer incidence and mortality in low- and middle-income countries. The performance of the Abbott RealTime High-Risk HPV test (AbRT) was evaluated in 83 cervical smear specimens and compared with a conventional nested PCR coupled to high-throughput sequencing (HTS) to identify the amplicons. The AbRT assay detected at least one HPV genotype in 44.57% of women regardless of the grade of cervical abnormalities. Except for one case, good concordance was observed for the genotypes detected with the AbRT assay in the high-risk HPV category determined with HTS of the amplicon generated by conventional nested PCR. The AbRT test is an easy and reliable molecular tool and was as sensitive as conventional nested PCR in cervical smear specimens for detection HPVs associated with high-grade lesions. Moreover, sequencing amplicons using an HTS approach effectively identified the genotype of the hrHPV identified with the AbRT test.
Edwardson, Christian F.; Hollibaugh, James T.
2018-01-01
We compared the composition of microbial communities obtained by sequencing 16S rRNA gene amplicons with taxonomy derived from metatranscriptomes from the same samples. Samples were collected from alkaline, hypersaline Mono Lake, California, USA at five depths that captured the major redox zones of the lake during the onset of meromixis. The prokaryotic community was dominated by bacteria from the phyla Proteobacteria, Firmicutes, and Bacteroidetes, while the picoeukaryotic chlorophyte Picocystis dominated the eukaryotes. Most (80%) of the abundant (>1% relative abundance) OTUs recovered as amplicons of 16S rRNA genes have been reported in previous surveys, indicating that Mono Lake's microbial community has remained stable over 12 years that have included periods of regular, annual overturn interspersed by episodes of prolonged meromixis that result in extremely reducing conditions in bottom water. Metatranscriptomic sequences binned predominately to the Gammaproteobacteria genera Thioalkalivibrio (4–13%) and Thioalkalimicrobium (0–14%); and to the Firmicutes genera Dethiobacter (0–5%) and Clostridium (1–4%), which were also abundant in the 16S rRNA gene amplicon libraries. This study provides insight into the taxonomic affiliations of transcriptionally active communities of the lake's water column under different redox conditions. PMID:29445359
Lagkouvardos, Ilias; Joseph, Divya; Kapfhammer, Martin; Giritli, Sabahattin; Horn, Matthias; Haller, Dirk; Clavel, Thomas
2016-09-23
The SRA (Sequence Read Archive) serves as primary depository for massive amounts of Next Generation Sequencing data, and currently host over 100,000 16S rRNA gene amplicon-based microbial profiles from various host habitats and environments. This number is increasing rapidly and there is a dire need for approaches to utilize this pool of knowledge. Here we created IMNGS (Integrated Microbial Next Generation Sequencing), an innovative platform that uniformly and systematically screens for and processes all prokaryotic 16S rRNA gene amplicon datasets available in SRA and uses them to build sample-specific sequence databases and OTU-based profiles. Via a web interface, this integrative sequence resource can easily be queried by users. We show examples of how the approach allows testing the ecological importance of specific microorganisms in different hosts or ecosystems, and performing targeted diversity studies for selected taxonomic groups. The platform also offers a complete workflow for de novo analysis of users' own raw 16S rRNA gene amplicon datasets for the sake of comparison with existing data. IMNGS can be accessed at www.imngs.org.
Mutation detection in the human HSP70B′ gene by denaturing high-performance liquid chromatography
Hecker, Karl H.; Asea, Alexzander; Kobayashi, Kaoru; Green, Stacy; Tang, Dan; Calderwood, Stuart K.
2000-01-01
Variances, particularly single nucleotide polymorphisms (SNP), in the genomic sequence of individuals are the primary key to understanding gene function as it relates to differences in the susceptibility to disease, environmental influences, and therapy. In this report, the HSP70B′ gene is the target sequence for mutation detection in biopsy samples from human prostate cancer patients undergoing combined hyperthermia and radiation therapy at the Dana-Farber Cancer Institute, using temperature-modulated heteroduplex analysis (TMHA). The underlying principles of TMHA for mutation detection using DHPLC technology are discussed. The procedures involved in amplicon design for mutation analysis by DHPLC are detailed. The melting behavior of the complete coding sequence of the target gene is characterized using WAVEMAKERTM software. Four overlapping amplicons, which span the complete coding region of the HSP70B′ gene, amenable to mutation detection by DHPLC were identified based on the software-predicted melting profile of the target sequence. TMHA was performed on PCR products of individual amplicons of the HSP70B′ gene on the WAVE® Nucleic Acid Fragment Analysis System. The criteria for mutation calling by comparing wild-type and mutant chromatographic patterns are discussed. PMID:11189446
Mutation detection in the human HSP7OB' gene by denaturing high-performance liquid chromatography.
Hecker, K H; Asea, A; Kobayashi, K; Green, S; Tang, D; Calderwood, S K
2000-11-01
Variances, particularly single nucleotide polymorphisms (SNP), in the genomic sequence of individuals are the primary key to understanding gene function as it relates to differences in the susceptibility to disease, environmental influences, and therapy. In this report, the HSP70B' gene is the target sequence for mutation detection in biopsy samples from human prostate cancer patients undergoing combined hyperthermia and radiation therapy at the Dana-Farber Cancer Institute, using temperature-modulated heteroduplex analysis (TMHA). The underlying principles of TMHA for mutation detection using DHPLC technology are discussed. The procedures involved in amplicon design for mutation analysis by DHPLC are detailed. The melting behavior of the complete coding sequence of the target gene is characterized using WAVEMAKER software. Four overlapping amplicons, which span the complete coding region of the HSP70B' gene, amenable to mutation detection by DHPLC were identified based on the software-predicted melting profile of the target sequence. TMHA was performed on PCR products of individual amplicons of the HSP70B' gene on the WAVE Nucleic Acid Fragment Analysis System. The criteria for mutation calling by comparing wild-type and mutant chromatographic patterns are discussed.
Zhong, Daibin; Lo, Eugenia; Wang, Xiaoming; Yewhalaw, Delenasaw; Zhou, Guofa; Atieli, Harrysone E; Githeko, Andrew; Hemming-Schroeder, Elizabeth; Lee, Ming-Chieh; Afrane, Yaw; Yan, Guiyun
2018-05-02
Parasite genetic diversity and multiplicity of infection (MOI) affect clinical outcomes, response to drug treatment and naturally-acquired or vaccine-induced immunity. Traditional methods often underestimate the frequency and diversity of multiclonal infections due to technical sensitivity and specificity. Next-generation sequencing techniques provide a novel opportunity to study complexity of parasite populations and molecular epidemiology. Symptomatic and asymptomatic Plasmodium vivax samples were collected from health centres/hospitals and schools, respectively, from 2011 to 2015 in Ethiopia. Similarly, both symptomatic and asymptomatic Plasmodium falciparum samples were collected, respectively, from hospitals and schools in 2005 and 2015 in Kenya. Finger-pricked blood samples were collected and dried on filter paper. Long amplicon (> 400 bp) deep sequencing of merozoite surface protein 1 (msp1) gene was conducted to determine multiplicity and molecular epidemiology of P. vivax and P. falciparum infections. The results were compared with those based on short amplicon (117 bp) deep sequencing. A total of 139 P. vivax and 222 P. falciparum samples were pyro-sequenced for pvmsp1 and pfmsp1, yielding a total of 21 P. vivax and 99 P. falciparum predominant haplotypes. The average MOI for P. vivax and P. falciparum were 2.16 and 2.68, respectively, which were significantly higher than that of microsatellite markers and short amplicon (117 bp) deep sequencing. Multiclonal infections were detected in 62.2% of the samples for P. vivax and 74.8% of the samples for P. falciparum. Four out of the five subjects with recurrent P. vivax malaria were found to be a relapse 44-65 days after clearance of parasites. No difference was observed in MOI among P. vivax patients of different symptoms, ages and genders. Similar patterns were also observed in P. falciparum except for one study site in Kenyan lowland areas with significantly higher MOI. The study used a novel method to evaluate Plasmodium MOI and molecular epidemiological patterns by long amplicon ultra-deep sequencing. The complexity of infections were similar among age groups, symptoms, genders, transmission settings (spatial heterogeneity), as well as over years (pre- vs. post-scale-up interventions). This study demonstrated that long amplicon deep sequencing is a useful tool to investigate multiplicity and molecular epidemiology of Plasmodium parasite infections.
Functional characterization of the 19q12 amplicon in grade III breast cancers
2012-01-01
Introduction The 19q12 locus is amplified in a subgroup of oestrogen receptor (ER)-negative grade III breast cancers. This amplicon comprises nine genes, including cyclin E1 (CCNE1), which has been proposed as its 'driver'. The aim of this study was to identify the genes within the 19q12 amplicon whose expression is required for the survival of cancer cells harbouring their amplification. Methods We investigated the presence of 19q12 amplification in a series of 313 frozen primary breast cancers and 56 breast cancer cell lines using microarray comparative genomic hybridisation (aCGH). The nine genes mapping to the smallest region of amplification on 19q12 were silenced using RNA interference in phenotypically matched breast cancer cell lines with (MDA-MB-157 and HCC1569) and without (Hs578T, MCF7, MDA-MB-231, ZR75.1, JIMT1 and BT474) amplification of this locus. Genes whose silencing was selectively lethal in amplified cells were taken forward for further validation. The effects of cyclin-dependent kinase 2 (CDK2) silencing and chemical inhibition were tested in cancer cells with and without CCNE1 amplification. Results 19q12 amplification was identified in 7.8% of ER-negative grade III breast cancer. Of the nine genes mapping to this amplicon, UQCRFS1, POP4, PLEKHF1, C19ORF12, CCNE1 and C19ORF2 were significantly over-expressed when amplified in primary breast cancers and/or breast cancer cell lines. Silencing of POP4, PLEKHF1, CCNE1 and TSZH3 selectively reduced cell viability in cancer cells harbouring their amplification. Cancer cells with CCNE1 amplification were shown to be dependent on CDK2 expression and kinase activity for their survival. Conclusions The 19q12 amplicon may harbour more than a single 'driver', given that expression of POP4, PLEKHF1, CCNE1 and TSZH3 is required for the survival of cancer cells displaying their amplification. The observation that cancer cells harbouring CCNE1 gene amplification are sensitive to CDK2 inhibitors provides a rationale for the testing of these chemical inhibitors in a subgroup of patients with ER-negative grade III breast cancers. PMID:22433433
The GENCODE exome: sequencing the complete human exome
Coffey, Alison J; Kokocinski, Felix; Calafato, Maria S; Scott, Carol E; Palta, Priit; Drury, Eleanor; Joyce, Christopher J; LeProust, Emily M; Harrow, Jen; Hunt, Sarah; Lehesjoki, Anna-Elina; Turner, Daniel J; Hubbard, Tim J; Palotie, Aarno
2011-01-01
Sequencing the coding regions, the exome, of the human genome is one of the major current strategies to identify low frequency and rare variants associated with human disease traits. So far, the most widely used commercial exome capture reagents have mainly targeted the consensus coding sequence (CCDS) database. We report the design of an extended set of targets for capturing the complete human exome, based on annotation from the GENCODE consortium. The extended set covers an additional 5594 genes and 10.3 Mb compared with the current CCDS-based sets. The additional regions include potential disease genes previously inaccessible to exome resequencing studies, such as 43 genes linked to ion channel activity and 70 genes linked to protein kinase activity. In total, the new GENCODE exome set developed here covers 47.9 Mb and performed well in sequence capture experiments. In the sample set used in this study, we identified over 5000 SNP variants more in the GENCODE exome target (24%) than in the CCDS-based exome sequencing. PMID:21364695
Zhu, Zhigang; Noel, Samantha Joan; Difford, Gareth Frank; Al-Soud, Waleed Abu; Brejnrod, Asker; Sørensen, Søren Johannes; Lassen, Jan; Løvendahl, Peter; Højberg, Ole
2017-01-01
Dairy cows experience dramatic changes in host physiology from gestation to lactation period and dietary switch from high-forage prepartum diet to high-concentrate postpartum diet over the transition period (parturition +/- three weeks). Understanding the community structure and activity of the rumen microbiota and its associative patterns over the transition period may provide insight for e.g. improving animal health and production. In the present study, rumen samples from ten primiparous Holstein dairy cows were collected over seven weeks spanning the transition period. Total RNA was extracted from the rumen samples and cDNA thereof was subsequently used for characterizing the metabolically active bacterial (16S rRNA transcript amplicon sequencing) and archaeal (qPCR, T-RFLP and mcrA and 16S rRNA transcript amplicon sequencing) communities. The metabolically active bacterial community was dominated by three phyla, showing significant changes in relative abundance range over the transition period: Firmicutes (from prepartum 57% to postpartum 35%), Bacteroidetes (from prepartum 22% to postpartum 18%) and Proteobacteria (from prepartum 7% to postpartum 32%). For the archaea, qPCR analysis of 16S rRNA transcript number, revealed a significant prepartum to postpartum increase in Methanobacteriales, in accordance with an observed increase (from prepartum 80% to postpartum 89%) in relative abundance of 16S rRNA transcript amplicons allocated to this order. On the other hand, a significant prepartum to postpartum decrease (from 15% to 2%) was observed in relative abundance of Methanomassiliicoccales 16S rRNA transcripts. In contrast to qPCR analysis of the 16S rRNA transcripts, quantification of mcrA transcripts revealed no change in total abundance of metabolically active methanogens over the transition period. According to T-RFLP analysis of the mcrA transcripts, two Methanobacteriales genera, Methanobrevibacter and Methanosphaera (represented by the T-RFs 39 and 267 bp), represented more than 70% of the metabolically active methanogens, showing no significant changes over the transition period; minor T-RFs, likely to represent members of the order Methanomassiliicoccales and with a relative abundance below 5% in total, decreased significantly over the transition period. In accordance with the T-RFLP analysis, the mcrA transcript amplicon sequencing revealed Methanobacteriales to cover 99% of the total reads, dominated by the genera Methanobrevibacter (75%) and Methanosphaera (24%), whereas the Methanomassiliicoccales order covered only 0.2% of the total reads. In conclusion, the present study showed that the structure of the metabolically active bacterial and archaeal rumen communities changed over the transition period, likely in response to the dramatic changes in physiology and nutritional factors like dry matter intake and feed composition. It should be noted however that for the methanogens, the observed community changes were influenced by the analyzed gene (mcrA or 16S rRNA). PMID:29117259
Hancock, Angela M.; Clark, Vanessa J.; Qian, Yudong; Di Rienzo, Anna
2011-01-01
Production of heat via nonshivering thermogenesis (NST) is critical for temperature homeostasis in mammals. Uncoupling protein UCP1 plays a central role in NST by uncoupling the proton gradients produced in the inner membranes of mitochondria to produce heat; however, the extent to which UCP1 homologues, UCP2 and UCP3, are involved in NST is the subject of an ongoing debate. We used an evolutionary approach to test the hypotheses that variants that are associated with increased expression of these genes (UCP1 −3826A, UCP2 −866A, and UCP3 −55T) show evidence of adaptation with winter climate. To that end, we calculated correlations between allele frequencies and winter climate variables for these single-nucleotide polymorphisms (SNPs), which we genotyped in a panel of 52 worldwide populations. We found significant correlations with winter climate for UCP1 −3826G/A and UCP3 −55C/T. Further, by analyzing previously published genotype data for these SNPs, we found that the peak of the correlation for the UCP1 region occurred at the disease-associated −3826A/G variant and that the UCP3 region has a striking signal overall, with several individual SNPs showing interesting patterns, including the −55C/T variant. Resequencing of the regions in a set of three diverse population samples helped to clarify the signals that we found with the genotype data. At UCP1, the resequencing data revealed modest evidence that the haplotype carrying the −3826A variant was driven to high frequency by selection. In the UCP3 region, combining results from the climate analysis and resequencing survey suggest a more complex model in which variants on multiple haplotypes may independently be correlated with temperature. This is further supported by an excess of intermediate frequency variants in the UCP3 region in the Han Chinese population. Taken together, our results suggest that adaptation to climate influenced the global distribution of allele frequencies in UCP1 and UCP3 and provide an independent source of evidence for a role in cold resistance for UCP3. PMID:20802238
Tanner, Callum; Boocock, James; Stahl, Eli A; Dobbyn, Amanda; Mandal, Asim K; Cadzow, Murray; Phipps-Green, Amanda J; Topless, Ruth K; Hindmarsh, Jennie Harré; Stamp, Lisa K; Dalbeth, Nicola; Choi, Hyon K; Mount, David B; Merriman, Tony R
2017-07-01
There is no evidence for a genetic association between organic anion transporters 1-3 (SLC22A6, SLC22A7, and SLC22A8) and multidrug resistance protein 4 (MRP4; encoded by ABCC4) with the levels of serum urate or gout. The Māori and Pacific (Polynesian) population of New Zealand has the highest prevalence of gout worldwide. The aim of this study was to determine whether any Polynesian population-specific genetic variants in SLC22A6-8 and ABCC4 are associated with gout. All participants had ≥3 self-reported Māori and/or Pacific grandparents. Among the total sample set of 1,808 participants, 191 hyperuricemic and 202 normouricemic individuals were resequenced over the 4 genes, and the remaining 1,415 individuals were used for replication. Regression analyses were performed, adjusting for age, sex, and Polynesian ancestry. To study the functional effect of nonsynonymous variants of ABCC4, transport assays were performed in Xenopus laevis oocytes. A total of 39 common variants were detected, with an ABCC4 variant (rs4148500) significantly associated with hyperuricemia and gout. This variant was monomorphic for the urate-lowering allele in Europeans. There was evidence for an association of rs4148500 with gout in the resequenced samples (odds ratio [OR] 1.62 [P = 0.012]) that was replicated (OR 1.25 [P = 0.033]) and restricted to men (OR 1.43 [P = 0.001] versus OR 0.98 [P = 0.89] in women). The gout risk allele was associated with fractional excretion of uric acid in male individuals (β = -0.570 [P = 0.01]). A rare population-specific allele (P1036L) with predicted strong functional consequence reduced the uric acid transport activity of ABCC4 by 30%. An association between ABCC4 and gout and fractional excretion of uric acid is consistent with the established role of MRP4 as a unidirectional renal uric acid efflux pump. © 2017, American College of Rheumatology.
2012-01-01
Background Drug resistance in the malaria parasite Plasmodium falciparum severely compromises the treatment and control of malaria. A knowledge of the critical mutations conferring resistance to particular drugs is important in understanding modes of drug action and mechanisms of resistances. They are required to design better therapies and limit drug resistance. A mutation in the gene (pfcrt) encoding a membrane transporter has been identified as a principal determinant of chloroquine resistance in P. falciparum, but we lack a full account of higher level chloroquine resistance. Furthermore, the determinants of resistance in the other major human malaria parasite, P. vivax, are not known. To address these questions, we investigated the genetic basis of chloroquine resistance in an isogenic lineage of rodent malaria parasite P. chabaudi in which high level resistance to chloroquine has been progressively selected under laboratory conditions. Results Loci containing the critical genes were mapped by Linkage Group Selection, using a genetic cross between the high-level chloroquine-resistant mutant and a genetically distinct sensitive strain. A novel high-resolution quantitative whole-genome re-sequencing approach was used to reveal three regions of selection on chr11, chr03 and chr02 that appear progressively at increasing drug doses on three chromosomes. Whole-genome sequencing of the chloroquine-resistant parent identified just four point mutations in different genes on these chromosomes. Three mutations are located at the foci of the selection valleys and are therefore predicted to confer different levels of chloroquine resistance. The critical mutation conferring the first level of chloroquine resistance is found in aat1, a putative aminoacid transporter. Conclusions Quantitative trait loci conferring selectable phenotypes, such as drug resistance, can be mapped directly using progressive genome-wide linkage group selection. Quantitative genome-wide short-read genome resequencing can be used to reveal these signatures of drug selection at high resolution. The identities of three genes (and mutations within them) conferring different levels of chloroquine resistance generate insights regarding the genetic architecture and mechanisms of resistance to chloroquine and other drugs. Importantly, their orthologues may now be evaluated for critical or accessory roles in chloroquine resistance in human malarias P. vivax and P. falciparum. PMID:22435897
Effect of long-term farming strategies on soil microbiota and soil health
NASA Astrophysics Data System (ADS)
Sommermann, Loreen; Babin, Doreen; Sandmann, Martin; Smalla, Kornelia; Schellenberg, Ingo; Grosch, Rita; Geistlinger, Joerg
2017-04-01
Increasing food and energy demands have resulted in considerable intensification of farming practices, which brought about severe consequences for agricultural soils, e.g. loss of fertility, erosion and enrichment of soil-borne plant diseases. In order to maintain soil quality and health for the future, the development of more extensive and sustainable farming strategies is urgently needed. The soil microbiome is regarded as a key player in soil ecosystem functions, particularly the natural ability of soils to suppress plant pathogens (suppressiveness). Recent studies showed that soil microbial communities are influenced by agricultural management. To further analyze the effects of farming strategies on soil suppressiveness and plant performance, agricultural soils from three long-term field trials in Thyrow, Bernburg (both in Germany) and Therwil (Switzerland) were sampled and subjected to molecular profiling of soil bacteria and fungi using marker genes and high-throughput amplicon sequencing. Significant effects on bacterial as well as fungal community composition, including plant pathogenic and beneficial taxa, were observed among variants of tillage and crop rotation. The least effect on both communities had fertilization, with no significance between variants. Subsequently, the same soils were subjected to growth chamber pot experiments with lettuce as a model (Lactuca sativa). After a growth period of six weeks significant differences in lettuce shoot and soil microbial biomass were observed among soil samples of the different long-term trials. Furthermore, the lettuce rhizosphere exhibited diverse bacterial community compositions as observed by DGGE (denaturing gradient gel electrophoresis). Using group-specific PCR-DGGE fingerprints, bacterial responders to fertilization, soil management and crop rotation were identified among different taxonomic groups. Currently, bacterial and fungal amplicon sequencing of rhizosphere and bulk soil from these pot experiments is ongoing in order to provide further insights into taxa potentially indicative for agricultural management and soil health. Presently, we are testing the potential of the different soil microbiomes to suppress the lettuce pathogen Rhizoctonia solani.
Dereeper, Alexis; Nicolas, Stéphane; Le Cunff, Loïc; Bacilieri, Roberto; Doligez, Agnès; Peros, Jean-Pierre; Ruiz, Manuel; This, Patrice
2011-05-05
High-throughput re-sequencing, new genotyping technologies and the availability of reference genomes allow the extensive characterization of Single Nucleotide Polymorphisms (SNPs) and insertion/deletion events (indels) in many plant species. The rapidly increasing amount of re-sequencing and genotyping data generated by large-scale genetic diversity projects requires the development of integrated bioinformatics tools able to efficiently manage, analyze, and combine these genetic data with genome structure and external data. In this context, we developed SNiPlay, a flexible, user-friendly and integrative web-based tool dedicated to polymorphism discovery and analysis. It integrates:1) a pipeline, freely accessible through the internet, combining existing softwares with new tools to detect SNPs and to compute different types of statistical indices and graphical layouts for SNP data. From standard sequence alignments, genotyping data or Sanger sequencing traces given as input, SNiPlay detects SNPs and indels events and outputs submission files for the design of Illumina's SNP chips. Subsequently, it sends sequences and genotyping data into a series of modules in charge of various processes: physical mapping to a reference genome, annotation (genomic position, intron/exon location, synonymous/non-synonymous substitutions), SNP frequency determination in user-defined groups, haplotype reconstruction and network, linkage disequilibrium evaluation, and diversity analysis (Pi, Watterson's Theta, Tajima's D).Furthermore, the pipeline allows the use of external data (such as phenotype, geographic origin, taxa, stratification) to define groups and compare statistical indices.2) a database storing polymorphisms, genotyping data and grapevine sequences released by public and private projects. It allows the user to retrieve SNPs using various filters (such as genomic position, missing data, polymorphism type, allele frequency), to compare SNP patterns between populations, and to export genotyping data or sequences in various formats. Our experiments on grapevine genetic projects showed that SNiPlay allows geneticists to rapidly obtain advanced results in several key research areas of plant genetic diversity. Both the management and treatment of large amounts of SNP data are rendered considerably easier for end-users through automation and integration. Current developments are taking into account new advances in high-throughput technologies.SNiPlay is available at: http://sniplay.cirad.fr/.
Agarose droplet microfluidics for highly parallel and efficient single molecule emulsion PCR.
Leng, Xuefei; Zhang, Wenhua; Wang, Chunming; Cui, Liang; Yang, Chaoyong James
2010-11-07
An agarose droplet method was developed for highly parallel and efficient single molecule emulsion PCR. The method capitalizes on the unique thermoresponsive sol-gel switching property of agarose for highly efficient DNA amplification and amplicon trapping. Uniform agarose solution droplets generated via a microfluidic chip serve as robust and inert nanolitre PCR reactors for single copy DNA molecule amplification. After PCR, agarose droplets are gelated to form agarose beads, trapping all amplicons in each reactor to maintain the monoclonality of each droplet. This method does not require cocapsulation of primer labeled microbeads, allows high throughput generation of uniform droplets and enables high PCR efficiency, making it a promising platform for many single copy genetic studies.
2005-01-01
Abstract A typing procedure based on polymorphism of the coagulase gene (coa) was used to discriminate Staphylococcus aureus isolated from Minas Gerais dairy cows with mastitis. Amplification of the gene from the 64 S. aureus isolates produced 27 different polymerase chain reaction (PCR) products; 60 isolates showed only 1 amplicon, and 4 showed 2 amplicons. The isolates were grouped into 49 types by analyzing the restriction fragment length polymorphism (RFLP) of the coa gene; the 10 most common types accounted for 39% of the isolates. The results demonstrate that many variants of the coa gene are present in the studied region, although only a few predominate. PMID:16479723
Bhat, Somanath; McLaughlin, Jacob L H; Emslie, Kerry R
2011-02-21
Digital polymerase chain reaction (dPCR) has the potential to enable accurate quantification of target DNA copy number provided that all target DNA molecules are successfully amplified. Following duplex dPCR analysis from a linear DNA target sequence that contains single copies of two independent template sequences, we have observed that amplification of both templates in a single partition does not always occur. To investigate this finding, we heated the target DNA solution to 95 °C for increasing time intervals and then immediately chilled on ice prior to preparing the dPCR mix. We observed an exponential decline in estimated copy number (R(2)≥ 0.98) of the two template sequences when amplified from either a linearized plasmid or a 388 base pair (bp) amplicon containing the same two template sequences. The distribution of amplifiable templates and the final concentration (copies per µL) were both affected by heat treatment of the samples at 95 °C from 0 s to 30 min. The proportion of target sequences from which only one of the two templates was amplified in a single partition (either 1507 or hmg only) increased over time, while the proportion of target sequences where both templates were amplified (1507 and hmg) in each individual partition declined rapidly from 94% to 52% (plasmid) and 88% to 31% (388 bp amplicon) suggesting an increase in number of targets from which both templates no longer amplify. A 10 min incubation at 95 °C reduced the initial amplifiable template concentration of the plasmid and the 388 bp amplicon by 59% and 91%, respectively. To determine if a similar decrease in amplifiable target occurs during the default pre-activation step of typical PCR amplification protocol, we used mastermixes with a 20 s or 10 min hot-start. The choice of mastermix and consequent pre-activation time did not affect the estimated plasmid concentration. Therefore, we conclude that prolonged exposure of this DNA template to elevated temperatures could lead to significant bias in dPCR measurements. However, care must be taken when designing PCR and non-PCR based experiments by reducing exposure of the DNA template to sustained elevated temperatures in order to improve accuracy in copy number estimation and concentration determination.
Jacob, Jacob H; Hussein, Emad I; Shakhatreh, Muhamad Ali K; Cornelison, Christopher T
2017-10-01
Amplicon sequencing using next-generation technology (bTEFAP ® ) has been utilized in describing the diversity of Dead Sea microbiota. The investigated area is a well-known salt lake in the western part of Jordan found in the lowest geographical location in the world (more than 420 m below sea level) and characterized by extreme salinity (approximately, 34%) in addition to other extreme conditions (low pH, unique ionic composition different from sea water). DNA was extracted from Dead Sea water. A total of 314,310 small subunit RNA (SSU rRNA) sequences were parsed, and 288,452 sequences were then clustered. For alpha diversity analysis, sample was rarefied to 3,000 sequences. The Shannon-Wiener index curve plot reached a plateau at approximately 3,000 sequences indicating that sequencing depth was sufficient to capture the full scope of microbial diversity. Archaea was found to be dominating the sequences (52%), whereas Bacteria constitute 45% of the sequences. Altogether, prokaryotic sequences (which constitute 97% of all sequences) were found to predominate. The findings expand on previous studies by using high-throughput amplicon sequencing to describe the microbial community in an environment which in recent years has been shown to hide some interesting diversity. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
Zaboikin, Michail; Freter, Carl
2018-01-01
We describe a method for measuring genome editing efficiency from in silico analysis of high-resolution melt curve data. The melt curve data derived from amplicons of genome-edited or unmodified target sites were processed to remove the background fluorescent signal emanating from free fluorophore and then corrected for temperature-dependent quenching of fluorescence of double-stranded DNA-bound fluorophore. Corrected data were normalized and numerically differentiated to obtain the first derivatives of the melt curves. These were then mathematically modeled as a sum or superposition of minimal number of Gaussian components. Using Gaussian parameters determined by modeling of melt curve derivatives of unedited samples, we were able to model melt curve derivatives from genetically altered target sites where the mutant population could be accommodated using an additional Gaussian component. From this, the proportion contributed by the mutant component in the target region amplicon could be accurately determined. Mutant component computations compared well with the mutant frequency determination from next generation sequencing data. The results were also consistent with our earlier studies that used difference curve areas from high-resolution melt curves for determining the efficiency of genome-editing reagents. The advantage of the described method is that it does not require calibration curves to estimate proportion of mutants in amplicons of genome-edited target sites. PMID:29300734
Sergeant, Martin J.; Constantinidou, Chrystala; Cogan, Tristan; Penn, Charles W.; Pallen, Mark J.
2012-01-01
The analysis of 16S-rDNA sequences to assess the bacterial community composition of a sample is a widely used technique that has increased with the advent of high throughput sequencing. Although considerable effort has been devoted to identifying the most informative region of the 16S gene and the optimal informatics procedures to process the data, little attention has been paid to the PCR step, in particular annealing temperature and primer length. To address this, amplicons derived from 16S-rDNA were generated from chicken caecal content DNA using different annealing temperatures, primers and different DNA extraction procedures. The amplicons were pyrosequenced to determine the optimal protocols for capture of maximum bacterial diversity from a chicken caecal sample. Even at very low annealing temperatures there was little effect on the community structure, although the abundance of some OTUs such as Bifidobacterium increased. Using shorter primers did not reveal any novel OTUs but did change the community profile obtained. Mechanical disruption of the sample by bead beating had a significant effect on the results obtained, as did repeated freezing and thawing. In conclusion, existing primers and standard annealing temperatures captured as much diversity as lower annealing temperatures and shorter primers. PMID:22666455
Sergeant, Martin J; Constantinidou, Chrystala; Cogan, Tristan; Penn, Charles W; Pallen, Mark J
2012-01-01
The analysis of 16S-rDNA sequences to assess the bacterial community composition of a sample is a widely used technique that has increased with the advent of high throughput sequencing. Although considerable effort has been devoted to identifying the most informative region of the 16S gene and the optimal informatics procedures to process the data, little attention has been paid to the PCR step, in particular annealing temperature and primer length. To address this, amplicons derived from 16S-rDNA were generated from chicken caecal content DNA using different annealing temperatures, primers and different DNA extraction procedures. The amplicons were pyrosequenced to determine the optimal protocols for capture of maximum bacterial diversity from a chicken caecal sample. Even at very low annealing temperatures there was little effect on the community structure, although the abundance of some OTUs such as Bifidobacterium increased. Using shorter primers did not reveal any novel OTUs but did change the community profile obtained. Mechanical disruption of the sample by bead beating had a significant effect on the results obtained, as did repeated freezing and thawing. In conclusion, existing primers and standard annealing temperatures captured as much diversity as lower annealing temperatures and shorter primers.
Bokulich, Nicholas A.
2013-01-01
Ultra-high-throughput sequencing (HTS) of fungal communities has been restricted by short read lengths and primer amplification bias, slowing the adoption of newer sequencing technologies to fungal community profiling. To address these issues, we evaluated the performance of several common internal transcribed spacer (ITS) primers and designed a novel primer set and work flow for simultaneous quantification and species-level interrogation of fungal consortia. Primer comparison and validation were predicted in silico and by sequencing a “mock community” of mixed yeast species to explore the challenges of amplicon length and amplification bias for reconstructing defined yeast community structures. The amplicon size and distribution of this primer set are smaller than for all preexisting ITS primer sets, maximizing sequencing coverage of hypervariable ITS domains by very-short-amplicon, high-throughput sequencing platforms. This feature also enables the optional integration of quantitative PCR (qPCR) directly into the HTS preparatory work flow by substituting qPCR with these primers for standard PCR, yielding quantification of individual community members. The complete work flow described here, utilizing any of the qualified primer sets evaluated, can rapidly profile mixed fungal communities and capably reconstructed well-characterized beer and wine fermentation fungal communities. PMID:23377949
Houzet, Laurent; Deleage, Claire; Satie, Anne-Pascale; Merlande, Laetitia; Mahe, Dominique; Dejucq-Rainsford, Nathalie
2015-01-01
PCR is the most widely applied technique for large scale screening of bacterial clones, mouse genotypes, virus genomes etc. A drawback of large PCR screening is that amplicon analysis is usually performed using gel electrophoresis, a step that is very labor intensive, tedious and chemical waste generating. Single genome amplification (SGA) is used to characterize the diversity and evolutionary dynamics of virus populations within infected hosts. SGA is based on the isolation of single template molecule using limiting dilution followed by nested PCR amplification and requires the analysis of hundreds of reactions per sample, making large scale SGA studies very challenging. Here we present a novel approach entitled Long Amplicon Melt Profiling (LAMP) based on the analysis of the melting profile of the PCR reactions using SYBR Green and/or EvaGreen fluorescent dyes. The LAMP method represents an attractive alternative to gel electrophoresis and enables the quick discrimination of positive reactions. We validate LAMP for SIV and HIV env-SGA, in 96- and 384-well plate formats. Because the melt profiling allows the screening of several thousands of PCR reactions in a cost-effective, rapid and robust way, we believe it will greatly facilitate any large scale PCR screening. PMID:26053379
DNA sequence templates adjacent nucleosome and ORC sites at gene amplification origins in Drosophila
Liu, Jun; Zimmer, Kurt; Rusch, Douglas B.; Paranjape, Neha; Podicheti, Ram; Tang, Haixu; Calvi, Brian R.
2015-01-01
Eukaryotic origins of DNA replication are bound by the origin recognition complex (ORC), which scaffolds assembly of a pre-replicative complex (pre-RC) that is then activated to initiate replication. Both pre-RC assembly and activation are strongly influenced by developmental changes to the epigenome, but molecular mechanisms remain incompletely defined. We have been examining the activation of origins responsible for developmental gene amplification in Drosophila. At a specific time in oogenesis, somatic follicle cells transition from genomic replication to a locus-specific replication from six amplicon origins. Previous evidence indicated that these amplicon origins are activated by nucleosome acetylation, but how this affects origin chromatin is unknown. Here, we examine nucleosome position in follicle cells using micrococcal nuclease digestion with Ilumina sequencing. The results indicate that ORC binding sites and other essential origin sequences are nucleosome-depleted regions (NDRs). Nucleosome position at the amplicons was highly similar among developmental stages during which ORC is or is not bound, indicating that being an NDR is not sufficient to specify ORC binding. Importantly, the data suggest that nucleosomes and ORC have opposite preferences for DNA sequence and structure. We propose that nucleosome hyperacetylation promotes pre-RC assembly onto adjacent DNA sequences that are disfavored by nucleosomes but favored by ORC. PMID:26227968
DOE Office of Scientific and Technical Information (OSTI.GOV)
Muchero, Wellington; Labbe, Jessy L; Priya, Ranjan
2014-01-01
To date, Populus ranks among a few plant species with a complete genome sequence and other highly developed genomic resources. With the first genome sequence among all tree species, Populus has been adopted as a suitable model organism for genomic studies in trees. However, far from being just a model species, Populus is a key renewable economic resource that plays a significant role in providing raw materials for the biofuel and pulp and paper industries. Therefore, aside from leading frontiers of basic tree molecular biology and ecological research, Populus leads frontiers in addressing global economic challenges related to fuel andmore » fiber production. The latter fact suggests that research aimed at improving quality and quantity of Populus as a raw material will likely drive the pursuit of more targeted and deeper research in order to unlock the economic potential tied in molecular biology processes that drive this tree species. Advances in genome sequence-driven technologies, such as resequencing individual genotypes, which in turn facilitates large scale SNP discovery and identification of large scale polymorphisms are key determinants of future success in these initiatives. In this treatise we discuss implications of genome sequence-enable technologies on Populus genomic and genetic studies of complex and specialized-traits.« less
Novel variants in human and monkey CETP.
Lloyd, David B; Reynolds, Jennifer M; Cronan, Melissa T; Williams, Suzanne P; Lira, Maruja E; Wood, Linda S; Knight, Delvin R; Thompson, John F
2005-10-15
Variation in CETP has been shown to play an important role in HDL-C levels and cardiovascular disease. To better characterize this variation, the promoter and exonic DNA for CETP was resequenced in 189 individuals with extreme HDL-C or age. Two novel amino acid variants were found in humans (V-12D and Y361C) and an additional variant (R137W) not previously studied in vitro were expressed. D-12 was not secreted and had no detectable activity in cells. C361 and W137 retained near normal amounts of cholesteryl ester transfer activity when purified but were less well secreted than wild type. Torcetrapib, a CETP inhibitor in clinical development with atorvastatin, was found to have a uniform effect on inhibition of wild type CETP versus W137 or C361. In addition, the level of variation in other species was assessed by resequencing DNA from nine cynomolgus monkeys. Numerous intronic and silent SNPs were found as well as two variable amino acids. The amino acid altering SNPs were genotyped in 29 monkeys and not found to be significantly associated with HDL-C levels. Three SNPs found in monkeys were identical to three found in humans with these SNPs all occurring at CpG sites.
A two-stage stochastic rule-based model to determine pre-assembly buffer content
NASA Astrophysics Data System (ADS)
Gunay, Elif Elcin; Kula, Ufuk
2018-01-01
This study considers instant decision-making needs of the automobile manufactures for resequencing vehicles before final assembly (FA). We propose a rule-based two-stage stochastic model to determine the number of spare vehicles that should be kept in the pre-assembly buffer to restore the altered sequence due to paint defects and upstream department constraints. First stage of the model decides the spare vehicle quantities, where the second stage model recovers the scrambled sequence respect to pre-defined rules. The problem is solved by sample average approximation (SAA) algorithm. We conduct a numerical study to compare the solutions of heuristic model with optimal ones and provide following insights: (i) as the mismatch between paint entrance and scheduled sequence decreases, the rule-based heuristic model recovers the scrambled sequence as good as the optimal resequencing model, (ii) the rule-based model is more sensitive to the mismatch between the paint entrance and scheduled sequences for recovering the scrambled sequence, (iii) as the defect rate increases, the difference in recovery effectiveness between rule-based heuristic and optimal solutions increases, (iv) as buffer capacity increases, the recovery effectiveness of the optimization model outperforms heuristic model, (v) as expected the rule-based model holds more inventory than the optimization model.
Rathinasabapathi, Pasupathi; Purushothaman, Natarajan; Parani, Madasamy
2016-05-01
Although rice genome was sequenced in the year 2002, efforts in resequencing the large number of available accessions, landraces, traditional cultivars, and improved varieties of this important food crop are limited. We have initiated resequencing of the traditional cultivars from India. Kavuni is an important traditional rice cultivar from South India that attracts premium price for its nutritional and therapeutic properties. Whole-genome sequencing of Kavuni using Illumina platform and SNPs analysis using Nipponbare reference genome identified 1 150 711 SNPs of which 377 381 SNPs were located in the genic regions. Non-synonymous SNPs (62 708) were distributed in 19 251 genes, and their number varied between 1 and 115 per gene. Large-effect DNA polymorphisms (7769) were present in 3475 genes. Pathway mapping of these polymorphisms revealed the involvement of genes related to carbohydrate metabolism, translation, protein-folding, and cell death. Analysis of the starch biosynthesis related genes revealed that the granule-bound starch synthase I gene had T/G SNPs at the first intron/exon junction and a two-nucleotide combination, which were reported to favour high amylose content and low glycemic index. The present study provided a valuable genomics resource to study the rice varieties with nutritional and medicinal properties.
Goold, Hugh Douglas; Nguyen, Hoa Mai; Kong, Fantao; Beyly-Adriano, Audrey; Légeret, Bertrand; Billon, Emmanuelle; Cuiné, Stéphan; Beisson, Fred; Peltier, Gilles; Li-Beisson, Yonghua
2016-01-01
Microalgae have emerged as a promising source for biofuel production. Massive oil and starch accumulation in microalgae is possible, but occurs mostly when biomass growth is impaired. The molecular networks underlying the negative correlation between growth and reserve formation are not known. Thus isolation of strains capable of accumulating carbon reserves during optimal growth would be highly desirable. To this end, we screened an insertional mutant library of Chlamydomonas reinhardtii for alterations in oil content. A mutant accumulating five times more oil and twice more starch than wild-type during optimal growth was isolated and named constitutive oil accumulator 1 (coa1). Growth in photobioreactors under highly controlled conditions revealed that the increase in oil and starch content in coa1 was dependent on light intensity. Genetic analysis and DNA hybridization pointed to a single insertional event responsible for the phenotype. Whole genome re-sequencing identified in coa1 a >200 kb deletion on chromosome 14 containing 41 genes. This study demonstrates that, 1), the generation of algal strains accumulating higher reserve amount without compromising biomass accumulation is feasible; 2), light is an important parameter in phenotypic analysis; and 3), a chromosomal region (Quantitative Trait Locus) acts as suppressor of carbon reserve accumulation during optimal growth. PMID:27141848
Aokic, Jun-ya; Kawase, Junya; Hamada, Kazuhisa; Fujimoto, Hiroshi; Yamamoto, Ikki; Usuki, Hironori
2018-01-01
Greater amberjack (Seriola dumerili) is distributed in tropical and temperate waters worldwide and is an important aquaculture fish. We carried out de novo sequencing of the greater amberjack genome to construct a reference genome sequence to identify single nucleotide polymorphisms (SNPs) for breeding amberjack by marker-assisted or gene-assisted selection as well as to identify functional genes for biological traits. We obtained 200 times coverage and constructed a high-quality genome assembly using next generation sequencing technology. The assembled sequences were aligned onto a yellowtail (Seriola quinqueradiata) radiation hybrid (RH) physical map by sequence homology. A total of 215 of the longest amberjack sequences, with a total length of 622.8 Mbp (92% of the total length of the genome scaffolds), were lined up on the yellowtail RH map. We resequenced the whole genomes of 20 greater amberjacks and mapped the resulting sequences onto the reference genome sequence. About 186,000 nonredundant SNPs were successfully ordered on the reference genome. Further, we found differences in the genome structural variations between two greater amberjack populations using BreakDancer. We also analyzed the greater amberjack transcriptome and mapped the annotated sequences onto the reference genome sequence. PMID:29785397
Liu, Xuxia; Jiang, Tengyong; Piao, Chunmei; Li, Xiaoyan; Guo, Jun; Zheng, Shuai; Zhang, Xiaoping; Cai, Tao; Du, Jie
2015-06-19
Hypertrophic cardiomyopathy (HCM) is a major cause of sudden cardiac death. Mutations in the MYBPC3 gene represent the cause of HCM in ~35% of patients with HCM. However, genetic testing in clinic setting has been limited due to the cost and relatively time-consuming by Sanger sequencing. Here, we developed a HCM Molecular Diagnostic Kit enabling ultra-low-cost targeted gene resequencing in a large cohort and investigated the mutation spectrum of MYBPC3. In a cohort of 114 patients with HCM, a total of 20 different mutations (8 novel and 12 known mutations) of MYBPC3 were identified from 25 patients (21.9%). We demonstrated that the power of targeted resequencing in a cohort of HCM patients, and found that MYBPC3 is a common HCM-causing gene in Chinese patients. Phenotype-genotype analyses showed that the patients with double mutations (n = 2) or premature termination codon mutations (n = 12) showed more severe manifestations, compared with patients with missense mutations (n = 11). Particularly, we identified a recurrent truncation mutation (p.Y842X) in four unrelated cases (4/25, 16%), who showed severe phenotypes, and suggest that the p.Y842X is a frequent mutation in Chinese HCM patients with severe phenotypes.
Information Commons for Rice (IC4R)
2016-01-01
Rice is the most important staple food for a large part of the world's human population and also a key model organism for plant research. Here, we present Information Commons for Rice (IC4R; http://ic4r.org), a rice knowledgebase featuring adoption of an extensible and sustainable architecture that integrates multiple omics data through community-contributed modules. Each module is developed and maintained by different committed groups, deals with data collection, processing and visualization, and delivers data on-demand via web services. In the current version, IC4R incorporates a variety of rice data through multiple committed modules, including genome-wide expression profiles derived entirely from RNA-Seq data, resequencing-based genomic variations obtained from re-sequencing data of thousands of rice varieties, plant homologous genes covering multiple diverse plant species, post-translational modifications, rice-related literatures and gene annotations contributed by the rice research community. Unlike extant related databases, IC4R is designed for scalability and sustainability and thus also features collaborative integration of rice data and low costs for database update and maintenance. Future directions of IC4R include incorporation of other omics data and association of multiple omics data with agronomically important traits, dedicating to build IC4R into a valuable knowledgebase for both basic and translational researches in rice. PMID:26519466
Xu, Zhenbo; Xie, Jinhong; Liu, Junyan; Ji, Lili; Soteyome, Thanapop; Peters, Brian M; Chen, Dingqiang; Li, Bing; Li, Lin; Shirtliff, Mark E
2017-03-01
Bacillus cereus is one of the most common opportunistic pathogens responsible for various foodborn diseases. To investigate the regulatory mechanism of B. cereus under high osmotic pressure, two B. cereus strains B25 and B26 were isolated from the industrial soy sauce residue containing high-salt concentration. Resequencing was performed by Illumina/Solexa platform and 13,646 SNPs and 434 InDels were identified as common variants between B25 and B26 against reference genome, followed by COG, GO, and KEGG enrichment analysis. Furthermore, 49 key genes involving in Na + /H + ,K + transporter, dipeptide or tripeptide transporter, stress response were selected and classified into 27 groups. Further validation was performed by qRT-PCR, and 4 candidate genes were found most associated with osmotic response. Gene expression of the 4 candidate genes was then analyzed accordingly, and down regulation was obtained for gene BC0669 and BC0754 associated with K + transport system. However, dramatic up regulation was detected for gene BC2114 involving in glutathione peroxidase, indicating the activation of antioxidant responses by osmotic stress via genetic regulation. As concluded, bioinformatic analysis and gene expression profile represented the basis of further investigation on the genetic and regulatory mechanism of bacterial salt tolerance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Ditommaso, Savina; Giacomuzzi, Monica; Ricciardi, Elisa; Zotti, Carla M
2015-08-01
Two different real-time quantitative PCR (PMA-qPCR) assays were applied for quantification of Legionella spp. by targeting a long amplicon (approx 400 bp) of 16S rRNA gene and a short amplicon (approx. 100 bp) of 5S rRNA gene. Purified DNA extracts from pure cultures of Legionella spp. and from environmental water samples were quantified. Application of the two assays to quantify Legionella in artificially contaminated water achieved that both assays were able to detect Legionella over a linear range of 10 to 10(5) cells ml(-1). A statistical analysis of the standard curves showed that both assays were linear with a good correlation coefficient (R(2) = 0.99) between the Ct and the copy number. Amplification with the reference assay was the most effective for detecting low copy numbers (1 bacterium per PCR mixture). Using selective quantification of viable Legionella by the PMA-qPCR method we obtained a greater inhibition of the amplification of the 400-bp 16S gene fragment (Δlog(10) = 3.74 ± 0.39 log(10) GU ml(-1)). A complete inhibition of the PCR signal was obtained when heat-killed cells in a concentration below 1 × 10(5) cells ml(-1) were pretreated with PMA. Analysing short amplicon sizes led to only 2.08 log reductions in the Legionella dead-cell signal. When we tested environmental water samples, the two qPCR assays were in good agreement according to the kappa index (0.741). Applying qPCR combined with PMA treatment, we also obtained a good agreement (kappa index 0.615). The comparison of quantitative results shows that both assays yielded the same quantification sensitivity (mean log = 4.59 vs mean log = 4.31). Copyright © 2015 Elsevier Ltd. All rights reserved.
Song, Zhewei; Du, Hai; Zhang, Yan; Xu, Yan
2017-01-01
Fermentation microbiota is specific microorganisms that generate different types of metabolites in many productions. In traditional solid-state fermentation, the structural composition and functional capacity of the core microbiota determine the quality and quantity of products. As a typical example of food fermentation, Chinese Maotai-flavor liquor production involves a complex of various microorganisms and a wide variety of metabolites. However, the microbial succession and functional shift of the core microbiota in this traditional food fermentation remain unclear. Here, high-throughput amplicons (16S rRNA gene amplicon sequencing and internal transcribed space amplicon sequencing) and metatranscriptomics sequencing technologies were combined to reveal the structure and function of the core microbiota in Chinese soy sauce aroma type liquor production. In addition, ultra-performance liquid chromatography and headspace-solid phase microextraction-gas chromatography-mass spectrometry were employed to provide qualitative and quantitative analysis of the major flavor metabolites. A total of 10 fungal and 11 bacterial genera were identified as the core microbiota. In addition, metatranscriptomic analysis revealed pyruvate metabolism in yeasts (genera Pichia, Schizosaccharomyces, Saccharomyces, and Zygosaccharomyces) and lactic acid bacteria (genus Lactobacillus) classified into two stages in the production of flavor components. Stage I involved high-level alcohol (ethanol) production, with the genus Schizosaccharomyces serving as the core functional microorganism. Stage II involved high-level acid (lactic acid and acetic acid) production, with the genus Lactobacillus serving as the core functional microorganism. The functional shift from the genus Schizosaccharomyces to the genus Lactobacillus drives flavor component conversion from alcohol (ethanol) to acid (lactic acid and acetic acid) in Chinese Maotai-flavor liquor production. Our findings provide insight into the effects of the core functional microbiota in soy sauce aroma type liquor production and the characteristics of the fermentation microbiota under different environmental conditions. PMID:28769888
Ceuppens, Siele; De Coninck, Dieter; Bottledoorn, Nadine; Van Nieuwerburgh, Filip; Uyttendaele, Mieke
2017-09-18
Application of 16S rRNA (gene) amplicon sequencing on food samples is increasingly applied for assessing microbial diversity but may as unintended advantage also enable simultaneous detection of any human pathogens without a priori definition. In the present study high-throughput next-generation sequencing (NGS) of the V1-V2-V3 regions of the 16S rRNA gene was applied to identify the bacteria present on fresh basil leaves. However, results were strongly impacted by variations in the bioinformatics analysis pipelines (MEGAN, SILVAngs, QIIME and MG-RAST), including the database choice (Greengenes, RDP and M5RNA) and the annotation algorithm (best hit, representative hit and lowest common ancestor). The use of pipelines with default parameters will lead to discrepancies. The estimate of microbial diversity of fresh basil using 16S rRNA (gene) amplicon sequencing is thus indicative but subject to biases. Salmonella enterica was detected at low frequencies, between 0.1% and 0.4% of bacterial sequences, corresponding with 37 to 166 reads. However, this result was dependent upon the pipeline used: Salmonella was detected by MEGAN, SILVAngs and MG-RAST, but not by QIIME. Confirmation of Salmonella sequences by real-time PCR was unsuccessful. It was shown that taxonomic resolution obtained from the short (500bp) sequence reads of the 16S rRNA gene containing the hypervariable regions V1-V3 cannot allow distinction of Salmonella with closely related enterobacterial species. In conclusion 16S amplicon sequencing, getting the status of standard method in microbial ecology studies of foods, needs expertise on both bioinformatics and microbiology for analysis of results. It is a powerful tool to estimate bacterial diversity but amenable to biases. Limitations concerning taxonomic resolution for some bacterial species or its inability to detect sub-dominant (pathogenic) species should be acknowledged in order to avoid overinterpretation of results. Copyright © 2017 Elsevier B.V. All rights reserved.
Sanz, Yolanda
2017-01-01
Abstract The miniaturized and portable DNA sequencer MinION™ has demonstrated great potential in different analyses such as genome-wide sequencing, pathogen outbreak detection and surveillance, human genome variability, and microbial diversity. In this study, we tested the ability of the MinION™ platform to perform long amplicon sequencing in order to design new approaches to study microbial diversity using a multi-locus approach. After compiling a robust database by parsing and extracting the rrn bacterial region from more than 67000 complete or draft bacterial genomes, we demonstrated that the data obtained during sequencing of the long amplicon in the MinION™ device using R9 and R9.4 chemistries were sufficient to study 2 mock microbial communities in a multiplex manner and to almost completely reconstruct the microbial diversity contained in the HM782D and D6305 mock communities. Although nanopore-based sequencing produces reads with lower per-base accuracy compared with other platforms, we presented a novel approach consisting of multi-locus and long amplicon sequencing using the MinION™ MkIb DNA sequencer and R9 and R9.4 chemistries that help to overcome the main disadvantage of this portable sequencing platform. Furthermore, the nanopore sequencing library, constructed with the last releases of pore chemistry (R9.4) and sequencing kit (SQK-LSK108), permitted the retrieval of the higher level of 1D read accuracy sufficient to characterize the microbial species present in each mock community analysed. Improvements in nanopore chemistry, such as minimizing base-calling errors and new library protocols able to produce rapid 1D libraries, will provide more reliable information in the near future. Such data will be useful for more comprehensive and faster specific detection of microbial species and strains in complex ecosystems. PMID:28605506
Usha, Lydia; Tabesh, Bita; Morrison, Larry E; Rao, Ruta D; Jacobson, Kris; Zhu, April; Basu, Sanjib; Coon, John S
2008-01-01
Background Amplification of the ERBB2 (Her-2/neu) oncogene, which occurs in approximately 25% of breast carcinomas, is a known negative prognostic factor. Available data indicate that a variable number of nearby genes on chromosome 17q may be co-amplified or deleted, forming a continuous amplicon of variable size. In approximately 25% of these patients, the amplicon extends to the gene for topoisomerase II alpha (TOP2A), a target for anthracyclines. We sought to understand the significance of these associated genomic changes for breast cancer prognosis and predicting response to therapy. Methods and patients Archival tissue samples from 63 breast cancer patients with ERBB2 amplification, stages 0–IV, were previously analyzed with FISH probes for genes located near ERBB2. In the present study, the clinical outcome data were determined for all patients presenting at stages I–III for whom adequate clinical follow up was available. Results Four amplicon patterns (Classes) were identified. These were significantly associated with the clinical outcome, specifically, recurrence of breast cancer. The Amplicon class IV with deleted TOP2A had 67% (6/9) cases with recurrence, whereas the other three classes combined had only 12% (3/25) cases (p-value = 0.004) at the time of last follow-up. TOP2A deletion was also significantly associated with time to recurrence (p-value = 0.0002). After adjusting for age in Cox regression analysis, the association between TOP2A deletion and time to recurrence remains strongly significant (p-value = 0.002) whereas the association with survival is marginally significant (p-value = 0.06). Conclusion TOP2A deletion is associated with poor prognosis in ERBB2-amplified breast carcinomas. Clarification of the mechanism of this association will require additional study. PMID:18702822
Ibarbalz, Federico M; Pérez, María Victoria; Figuerola, Eva L M; Erijman, Leonardo
2014-01-01
The performance of two sets of primers targeting variable regions of the 16S rRNA gene V1-V3 and V4 was compared in their ability to describe changes of bacterial diversity and temporal turnover in full-scale activated sludge. Duplicate sets of high-throughput amplicon sequencing data of the two 16S rRNA regions shared a collection of core taxa that were observed across a series of twelve monthly samples, although the relative abundance of each taxon was substantially different between regions. A case in point was the changes in the relative abundance of filamentous bacteria Thiothrix, which caused a large effect on diversity indices, but only in the V1-V3 data set. Yet the relative abundance of Thiothrix in the amplicon sequencing data from both regions correlated with the estimation of its abundance determined using fluorescence in situ hybridization. In nonmetric multidimensional analysis samples were distributed along the first ordination axis according to the sequenced region rather than according to sample identities. The dynamics of microbial communities indicated that V1-V3 and the V4 regions of the 16S rRNA gene yielded comparable patterns of: 1) the changes occurring within the communities along fixed time intervals, 2) the slow turnover of activated sludge communities and 3) the rate of species replacement calculated from the taxa-time relationships. The temperature was the only operational variable that showed significant correlation with the composition of bacterial communities over time for the sets of data obtained with both pairs of primers. In conclusion, we show that despite the bias introduced by amplicon sequencing, the variable regions V1-V3 and V4 can be confidently used for the quantitative assessment of bacterial community dynamics, and provide a proper qualitative account of general taxa in the community, especially when the data are obtained over a convenient time window rather than at a single time point.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jothikumar, N., E-mail: jin2@cdc.gov; Hill, Vincent R.
Highlights: •Uses a single-labeled fluorescent primer for real-time PCR. •The detection sensitivity of PET PCR was comparable to TaqMan PCR. •Melt curve analysis can be performed to confirm target amplicon production. •Conventional PCR primers can be converted to PET PCR primers. -- Abstract: We report the development of a fluorescently labeled oligonucleotide primer that can be used to monitor real-time PCR. The primer has two parts, the 3′-end of the primer is complimentary to the target and a universal 17-mer stem loop at the 5′-end forms a hairpin structure. A fluorescent dye is attached to 5′-end of either the forwardmore » or reverse primer. The presence of guanosine residues at the first and second position of the 3′ dangling end effectively quenches the fluorescence due to the photo electron transfer (PET) mechanism. During the synthesis of nucleic acid, the hairpin structure is linearized and the fluorescence of the incorporated primer increases several-fold due to release of the fluorescently labeled tail and the absence of guanosine quenching. As amplicons are synthesized during nucleic acid amplification, the fluorescence increase in the reaction mixture can be measured with commercially available real-time PCR instruments. In addition, a melting procedure can be performed to denature the double-stranded amplicons, thereby generating fluorescence peaks that can differentiate primer dimers and other non-specific amplicons if formed during the reaction. We demonstrated the application of PET-PCR for the rapid detection and quantification of Cryptosporidium parvum DNA. Comparison with a previously published TaqMan® assay demonstrated that the two real-time PCR assays exhibited similar sensitivity for a dynamic range of detection of 6000–0.6 oocysts per reaction. PET PCR primers are simple to design and less-expensive than dual-labeled probe PCR methods, and should be of interest for use by laboratories operating in resource-limited environments.« less
Ma, Y; Dai, X; Hong, T; Munk, G B; Libera, M
2016-12-19
Despite their many advantages and successes, molecular beacon (MB) hybridization probes have not been extensively used in microarray formats because of the complicating probe-substrate interactions that increase the background intensity. We have previously shown that tethering to surface-patterned microgels is an effective means for localizing MB probes to specific surface locations in a microarray format while simultaneously maintaining them in as water-like an environment as possible and minimizing probe-surface interactions. Here we extend this approach to include both real-time detection together with integrated NASBA amplification. We fabricate small (∼250 μm × 250 μm) simplex, duplex, and five-plex assays with microarray spots of controllable size (∼20 μm diameter), position, and shape to detect bacteria and fungi in a bloodstream-infection model. The targets, primers, and microgel-tethered probes can be combined in a single isothermal reaction chamber with no post-amplification labelling. We extract total RNA from clinical blood samples and differentiate between Gram-positive and Gram-negative bloodstream infection in a duplex assay to detect RNA- amplicons. The sensitivity based on our current protocols in a simplex assay to detect specific ribosomal RNA sequences within total RNA extracted from S. aureus and E. coli cultures corresponds to tens of bacteria per ml. We furthermore show that the platform can detect RNA- amplicons from synthetic target DNA with 1 fM sensitivity in sample volumes that contain about 12 000 DNA molecules. These experiments demonstrate an alternative approach that can enable rapid and real-time microarray-based molecular diagnostics.
Yanthan, Mhathung; Misra, Arvind K
2013-11-01
Trees of Myrica sp. grow abundantly in the forests of Meghalaya, India. These trees are actinorhizal and harbour nitrogen-fixing Frankia in their root nodules and contribute positively towards the enhancement of nitrogen status of forest areas. They can be used in rejuvenation of mine spoils and nitrogen-depleted fallow lands generated due to slash and burn agriculture practiced in the area. We have studied the association of amplicon restriction patterns (ARPs) of Myrica ribosomal RNA gene and internal transcribed spacer (ITS) region and nitrogenase activity of its root nodules. We found that ARPs thus obtained could be used as markers for early screening of seedlings that could support strains of Frankia that fix atmospheric nitrogen more efficiently.
Shewale, Jaiprakash G; Schneida, Elaine; Wilson, Jonathan; Walker, Jerilyn A; Batzer, Mark A; Sinha, Sudhir K
2007-03-01
The human DNA quantification (H-Quant) system, developed for use in human identification, enables quantitation of human genomic DNA in biological samples. The assay is based on real-time amplification of AluYb8 insertions in hominoid primates. The relatively high copy number of subfamily-specific Alu repeats in the human genome enables quantification of very small amounts of human DNA. The oligonucleotide primers present in H-Quant are specific for human DNA and closely related great apes. During the real-time PCR, the SYBR Green I dye binds to the DNA that is synthesized by the human-specific AluYb8 oligonucleotide primers. The fluorescence of the bound SYBR Green I dye is measured at the end of each PCR cycle. The cycle at which the fluorescence crosses the chosen threshold correlates to the quantity of amplifiable DNA in that sample. The minimal sensitivity of the H-Quant system is 7.6 pg/microL of human DNA. The amplicon generated in the H-Quant assay is 216 bp, which is within the same range of the common amplifiable short tandem repeat (STR) amplicons. This size amplicon enables quantitation of amplifiable DNA as opposed to a quantitation of degraded or nonamplifiable DNA of smaller sizes. Development and validation studies were performed on the 7500 real-time PCR system following the Quality Assurance Standards for Forensic DNA Testing Laboratories.
Gentilini, Fabio; Turba, Maria E
2014-01-01
A novel technique, called Divergent, for single-tube real-time PCR genotyping of point mutations without the use of fluorescently labeled probes has recently been reported. This novel PCR technique utilizes a set of four primers and a particular denaturation temperature for simultaneously amplifying two different amplicons which extend in opposite directions from the point mutation. The two amplicons can readily be detected using the melt curve analysis downstream to a closed-tube real-time PCR. In the present study, some critical aspects of the original method were specifically addressed to further implement the technique for genotyping the DNM1 c.G767T mutation responsible for exercise-induced collapse in Labrador retriever dogs. The improved Divergent assay was easily set up using a standard two-step real-time PCR protocol. The melting temperature difference between the mutated and the wild-type amplicons was approximately 5°C which could be promptly detected by all the thermal cyclers. The upgraded assay yielded accurate results with 157pg of genomic DNA per reaction. This optimized technique represents a flexible and inexpensive alternative to the minor grove binder fluorescently labeled method and to high resolution melt analysis for high-throughput, robust and cheap genotyping of single nucleotide variations. Copyright © 2014 Elsevier B.V. All rights reserved.
Aslan, O; Hamill, R M; Sweeney, T; Reardon, W; Mullen, A M
2009-01-01
It is essential to isolate high-quality DNA from muscle tissue for PCR-based applications in traceability of animal origin. We wished to examine the impact of cooking meat to a range of core temperatures on the quality and quantity of subsequently isolated genomic (specifically, nuclear) DNA. Triplicate steak samples were cooked in a water bath (100 degrees C) until their final internal temperature was 75, 80, 85, 90, 95, or 100 degrees C, and DNA was extracted. Deoxyribonucleic acid quantity was significantly reduced in cooked meat samples compared with raw (6.5 vs. 56.6 ng/microL; P < 0.001), but there was no relationship with cooking temperature. Quality (A(260)/A(280), i.e., absorbance at 260 and 280 nm) was also affected by cooking (P < 0.001). For all 3 genes, large PCR amplicons (product size >800 bp) were observed only when using DNA from raw meat and steak cooked to lower core temperatures. Small amplicons (<200 bp) were present for all core temperatures. Cooking meat to high temperatures thus resulted in a reduced overall yield and probable fragmentation of DNA to sizes less than 800 bp. Although nuclear DNA is preferable to mitochondrial DNA for food authentication, it is less abundant, and results suggest that analyses should be designed to use small amplicon sizes for meat cooked to high core temperatures.
Bell, Courtnee R; Wilkinson, Jeremy E; Robertson, Boakai K; Javan, Gulnaz T
2018-05-10
Recent studies have revealed distinct thanatomicrobiome (microbiome of death) signatures in human body sites after death. Thanatomicrobiome studies suggest that microbial succession after death may have the potential to reveal important postmortem biomarkers for the identification of time of death. We surveyed the postmortem microbiomes of cardiac tissues from ten corpses with varying times of death (6-58 h) using amplicon-based sequencing of the 16S rRNA gene' V1-2 and V4 hypervariable regions. The results demonstrated that amplicons had statistically significant (p <0.05) sex-dependent changes. Clostridium sp., Pseudomonas sp., Pantoea sp., and Streptococcus sp. had the highest enrichment for both V1-2 and V4 regions. Interestingly, the results also show that V4 amplicons had higher abundance of Clostridium sp. and Pseudomonas sp. in female hearts compared to males. Additionally, Streptococcus sp. was solely found in male heart samples. The distinction between sexes was further supported by Principle Coordinate Analysis, which revealed microbes in female hearts formed a distinctive cluster separate from male cadavers for both hypervariable regions. This study provides data that demonstrates that two hypervariable regions show discriminatory power for sex differences in postmortem heart samples. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Chomic, Anastasija; Winder, Louise; Armstrong, Karen F; Pearson, Michael N; Hampton, John G
2011-01-01
This study investigated the suitability of a two step real-time RT-PCR melting curve analysis as a tool for the detection and discrimination of nine species in the plant virus family Luteoviridae, being Soybean dwarf virus [SbDV], Bean leafroll virus [BLRV], Beet chlorosis virus [BChV], Beet mild yellowing virus [BMYV], Beet western yellows virus [BWYV], Cereal yellow dwarf virus-RPV [CYDV-RPV], Cucurbit aphid-borne yellows virus [CABYV], Potato leafroll virus [PLRV] and Turnip yellows virus [TuYV]. Melting temperature and shape of the melting peak were analysed for 68 bp and 148 bp coat protein gene amplicons using SYBR® GreenER™ fluorescent dye. Specific melting peaks with unique melting temperature were observed for the various species of the family Luteoviridae using the 68 bp amplicon, but not with the 148 bp amplicon. Due to the high variability of sequences for some members of this family, different melting temperatures were also observed between different isolates of the species CYDV-RPV and TuYV. Nevertheless, discrimination between species was achieved for SbDV, BLRV, BChV, BMYV, CABYV and either PLRV or BWYV. Melting curve analysis, in this study, is a faster and more discriminatory alternative to gel electrophoresis of end-point PCR products for the detection of Luteoviridae infection. Copyright © 2010 Elsevier B.V. All rights reserved.
Tang, Yidan; Lu, Baiyang; Zhu, Zhentong; Li, Bingling
2018-01-21
The polymerase chain reaction and many isothermal amplifications are able to achieve super gene amplification. Unfortunately, most commonly-used transduction methods, such as dye staining and Taqman-like probing, still suffer from shortcomings including false signals or difficult probe design, or are incompatible with multi-analysis. Here a universal and rational gene detection strategy has been established by translating isothermal amplicons to enzyme-free strand displacement circuits via three-way junction-based remote transduction. An assistant transduction probe was imported to form a partial hybrid with the target single-stranded nucleic acid. After systematic optimization the hybrid could serve as an associative trigger to activate a downstream circuit detector via a strand displacement reaction across the three-way junction. By doing so, the detection selectivity can be double-guaranteed through both amplicon-transducer recognition and the amplicon-circuit reaction. A well-optimized circuit can be immediately applied to a new target detection through simply displacing only 10-12 nt on only one component, according to the target. More importantly, this property for the first time enables multi-analysis and logic-analysis in a single reaction, sharing a single fluorescence reporter. In an applicable model, trace amounts of Cronobacter and Enterobacteria genes have been clearly distinguished from samples with no bacteria or one bacterium, with ultra-high sensitivity and selectivity.
Fuselli, S; Baptista, R P; Panziera, A; Magi, A; Guglielmi, S; Tonin, R; Benazzo, A; Bauzer, L G; Mazzoni, C J; Bertorelle, G
2018-03-24
The major histocompatibility complex (MHC) acts as an interface between the immune system and infectious diseases. Accurate characterization and genotyping of the extremely variable MHC loci are challenging especially without a reference sequence. We designed a combination of long-range PCR, Illumina short-reads, and Oxford Nanopore MinION long-reads approaches to capture the genetic variation of the MHC II DRB locus in an Italian population of the Alpine chamois (Rupicapra rupicapra). We utilized long-range PCR to generate a 9 Kb fragment of the DRB locus. Amplicons from six different individuals were fragmented, tagged, and simultaneously sequenced with Illumina MiSeq. One of these amplicons was sequenced with the MinION device, which produced long reads covering the entire amplified fragment. A pipeline that combines short and long reads resolved several short tandem repeats and homopolymers and produced a de novo reference, which was then used to map and genotype the short reads from all individuals. The assembled DRB locus showed a high level of polymorphism and the presence of a recombination breakpoint. Our results suggest that an amplicon-based NGS approach coupled with single-molecule MinION nanopore sequencing can efficiently achieve both the assembly and the genotyping of complex genomic regions in multiple individuals in the absence of a reference sequence.
TE-Tracker: systematic identification of transposition events through whole-genome resequencing.
Gilly, Arthur; Etcheverry, Mathilde; Madoui, Mohammed-Amin; Guy, Julie; Quadrana, Leandro; Alberti, Adriana; Martin, Antoine; Heitkam, Tony; Engelen, Stefan; Labadie, Karine; Le Pen, Jeremie; Wincker, Patrick; Colot, Vincent; Aury, Jean-Marc
2014-11-19
Transposable elements (TEs) are DNA sequences that are able to move from their location in the genome by cutting or copying themselves to another locus. As such, they are increasingly recognized as impacting all aspects of genome function. With the dramatic reduction in cost of DNA sequencing, it is now possible to resequence whole genomes in order to systematically characterize novel TE mobilization in a particular individual. However, this task is made difficult by the inherently repetitive nature of TE sequences, which in some eukaryotes compose over half of the genome sequence. Currently, only a few software tools dedicated to the detection of TE mobilization using next-generation-sequencing are described in the literature. They often target specific TEs for which annotation is available, and are only able to identify families of closely related TEs, rather than individual elements. We present TE-Tracker, a general and accurate computational method for the de-novo detection of germ line TE mobilization from re-sequenced genomes, as well as the identification of both their source and destination sequences. We compare our method with the two classes of existing software: specialized TE-detection tools and generic structural variant (SV) detection tools. We show that TE-Tracker, while working independently of any prior annotation, bridges the gap between these two approaches in terms of detection power. Indeed, its positive predictive value (PPV) is comparable to that of dedicated TE software while its sensitivity is typical of a generic SV detection tool. TE-Tracker demonstrates the benefit of adopting an annotation-independent, de novo approach for the detection of TE mobilization events. We use TE-Tracker to provide a comprehensive view of transposition events induced by loss of DNA methylation in Arabidopsis. TE-Tracker is freely available at http://www.genoscope.cns.fr/TE-Tracker . We show that TE-Tracker accurately detects both the source and destination of novel transposition events in re-sequenced genomes. Moreover, TE-Tracker is able to detect all potential donor sequences for a given insertion, and can identify the correct one among them. Furthermore, TE-Tracker produces significantly fewer false positives than common SV detection programs, thus greatly facilitating the detection and analysis of TE mobilization events.
Montesino, Marta; Prieto, Lourdes
2012-01-01
Cycle sequencing reaction with Big-Dye terminators provides the methodology to analyze mtDNA Control Region amplicons by means of capillary electrophoresis. DNA sequencing with ddNTPs or terminators was developed by (1). The progressive automation of the method by combining the use of fluorescent-dye terminators with cycle sequencing has made it possible to increase the sensibility and efficiency of the method and hence has allowed its introduction into the forensic field. PCR-generated mitochondrial DNA products are the templates for sequencing reactions. Different set of primers can be used to generate amplicons with different sizes according to the quality and quantity of the DNA extract providing sequence data for different ranges inside the Control Region.
Microarray-based Resequencing of Multiple Bacillus anthracis Isolates
2004-12-17
generated an Unweighted Pair Group Method Arithmetic Mean ( UPGMA ) tree (see methods [56]; Figure 3). The strains group together in a manner broadly similar...was created using DNADIST, plotted as a UPGMA tree using NEIGHBOR and the tree plotted using DRAWGRAM [56]. The B1 strain A0465 was used as an...distance matrix was created using DNADIST, plotted as a UPGMA tree using NEIGHBOR and the tree plotted using DRAWGRAM [57]. Additional data files The
Chen, Chao; Liu, Zhiguang; Pan, Qi; Chen, Xiao; Wang, Huihua; Guo, Haikun; Liu, Shidong; Lu, Hongfeng; Tian, Shilin; Li, Ruiqiang; Shi, Wei
2016-01-01
Studying the genetic signatures of climate-driven selection can produce insights into local adaptation and the potential impacts of climate change on populations. The honey bee (Apis mellifera) is an interesting species to study local adaptation because it originated in tropical/subtropical climatic regions and subsequently spread into temperate regions. However, little is known about the genetic basis of its adaptation to temperate climates. Here, we resequenced the whole genomes of ten individual bees from a newly discovered population in temperate China and downloaded resequenced data from 35 individuals from other populations. We found that the new population is an undescribed subspecies in the M-lineage of A. mellifera (Apis mellifera sinisxinyuan). Analyses of population history show that long-term global temperature has strongly influenced the demographic history of A. m. sinisxinyuan and its divergence from other subspecies. Further analyses comparing temperate and tropical populations identified several candidate genes related to fat body and the Hippo signaling pathway that are potentially involved in adaptation to temperate climates. Our results provide insights into the demographic history of the newly discovered A. m. sinisxinyuan, as well as the genetic basis of adaptation of A. mellifera to temperate climates at the genomic level. These findings will facilitate the selective breeding of A. mellifera to improve the survival of overwintering colonies. PMID:26823447
Ma, Xin; Fu, Yongcai; Zhao, Xinhui; Jiang, Liyun; Zhu, Zuofeng; Gu, Ping; Xu, Wenying; Su, Zhen; Sun, Chuanqing; Tan, Lubin
2016-01-01
Oryza nivara, an annual wild AA-genome species of rice, is an important gene pool for broadening the genetic diversity of cultivated rice (O. sativa L.). Towards identifying and utilizing favourable alleles from O. nivara, we developed a set of introgression lines (ILs) by introducing O. nivara segments into the elite indica rice variety 93-11 background through advanced backcrossing and repeated selfing. Using whole-genome resequencing, a high-density genetic map containing 1,070 bin-markers was constructed for the 131 ILs, with an average length of 349 kb per bin. The 131 ILs cover 95% of O. nivara genome, providing a relatively complete genomic library for introgressing O. nivara alleles for trait improvement. Using this high-density bin-map, QTL mapping for 13 yield-related traits was performed and a total of 65 QTLs were detected across two environments. At ~36.9% of detected QTLs, the alleles from O. nivara conferred improving effects on yield-associated traits. Six cloned genes, Sh4/SHA1, Bh4, Sd1, TE/TAD1, GS3 and FZP, colocalised in the peak intervals of 9 QTLs. In conclusion, we developed new genetic materials for exploration and use of beneficial alleles from wild rice and provided a basis for future fine mapping and cloning of the favourable O. nivara-derived QTLs. PMID:27251022
Jha, Aashish R.; Miles, Cecelia M.; Lippert, Nodia R.; Brown, Christopher D.; White, Kevin P.; Kreitman, Martin
2015-01-01
Complete genome resequencing of populations holds great promise in deconstructing complex polygenic traits to elucidate molecular and developmental mechanisms of adaptation. Egg size is a classic adaptive trait in insects, birds, and other taxa, but its highly polygenic architecture has prevented high-resolution genetic analysis. We used replicated experimental evolution in Drosophila melanogaster and whole-genome sequencing to identify consistent signatures of polygenic egg-size adaptation. A generalized linear-mixed model revealed reproducible allele frequency differences between replicated experimental populations selected for large and small egg volumes at approximately 4,000 single nucleotide polymorphisms (SNPs). Several hundred distinct genomic regions contain clusters of these SNPs and have lower heterozygosity than the genomic background, consistent with selection acting on polymorphisms in these regions. These SNPs are also enriched among genes expressed in Drosophila ovaries and many of these genes have well-defined functions in Drosophila oogenesis. Additional genes regulating egg development, growth, and cell size show evidence of directional selection as genes regulating these biological processes are enriched for highly differentiated SNPs. Genetic crosses performed with a subset of candidate genes demonstrated that these genes influence egg size, at least in the large genetic background. These findings confirm the highly polygenic architecture of this adaptive trait, and suggest the involvement of many novel candidate genes in regulating egg size. PMID:26044351
Application of Broad-Spectrum Resequencing Microarray for Genotyping Rhabdoviruses▿
Dacheux, Laurent; Berthet, Nicolas; Dissard, Gabriel; Holmes, Edward C.; Delmas, Olivier; Larrous, Florence; Guigon, Ghislaine; Dickinson, Philip; Faye, Ousmane; Sall, Amadou A.; Old, Iain G.; Kong, Katherine; Kennedy, Giulia C.; Manuguerra, Jean-Claude; Cole, Stewart T.; Caro, Valérie; Gessain, Antoine; Bourhy, Hervé
2010-01-01
The rapid and accurate identification of pathogens is critical in the control of infectious disease. To this end, we analyzed the capacity for viral detection and identification of a newly described high-density resequencing microarray (RMA), termed PathogenID, which was designed for multiple pathogen detection using database similarity searching. We focused on one of the largest and most diverse viral families described to date, the family Rhabdoviridae. We demonstrate that this approach has the potential to identify both known and related viruses for which precise sequence information is unavailable. In particular, we demonstrate that a strategy based on consensus sequence determination for analysis of RMA output data enabled successful detection of viruses exhibiting up to 26% nucleotide divergence with the closest sequence tiled on the array. Using clinical specimens obtained from rabid patients and animals, this method also shows a high species level concordance with standard reference assays, indicating that it is amenable for the development of diagnostic assays. Finally, 12 animal rhabdoviruses which were currently unclassified, unassigned, or assigned as tentative species within the family Rhabdoviridae were successfully detected. These new data allowed an unprecedented phylogenetic analysis of 106 rhabdoviruses and further suggest that the principles and methodology developed here may be used for the broad-spectrum surveillance and the broader-scale investigation of biodiversity in the viral world. PMID:20610710
Whole genome resequencing of a laboratory-adapted Drosophila melanogaster population sample
Gilks, William P.; Pennell, Tanya M.; Flis, Ilona; Webster, Matthew T.; Morrow, Edward H.
2016-01-01
As part of a study into the molecular genetics of sexually dimorphic complex traits, we used high-throughput sequencing to obtain data on genomic variation in an outbred laboratory-adapted fruit fly ( Drosophila melanogaster) population. We successfully resequenced the whole genome of 220 hemiclonal females that were heterozygous for the same Berkeley reference line genome (BDGP6/dm6), and a unique haplotype from the outbred base population (LH M). The use of a static and known genetic background enabled us to obtain sequences from whole-genome phased haplotypes. We used a BWA-Picard-GATK pipeline for mapping sequence reads to the dm6 reference genome assembly, at a median depth-of coverage of 31X, and have made the resulting data publicly-available in the NCBI Short Read Archive (Accession number SRP058502). We used Haplotype Caller to discover and genotype 1,726,931 small genomic variants (SNPs and indels, <200bp). Additionally we detected and genotyped 167 large structural variants (1-100Kb in size) using GenomeStrip/2.0. Sequence and genotype data are publicly-available at the corresponding NCBI databases: Short Read Archive, dbSNP and dbVar (BioProject PRJNA282591). We have also released the unfiltered genotype data, and the code and logs for data processing and summary statistics ( https://zenodo.org/communities/sussex_drosophila_sequencing/). PMID:27928499
Gardiner, Laura-Jayne; Gawroński, Piotr; Olohan, Lisa; Schnurbusch, Thorsten; Hall, Neil; Hall, Anthony
2014-12-01
Mapping-by-sequencing analyses have largely required a complete reference sequence and employed whole genome re-sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re-sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early-flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene-rich regions of hexaploid bread wheat to design a 110-Mbp NimbleGen SeqCap EZ in solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo-chromosomes derived from the capture probe target sequence, with a long-range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval. © 2014 The Authors.The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Gopalakrishnan, Shyam; Samaniego Castruita, Jose A; Sinding, Mikkel-Holger S; Kuderna, Lukas F K; Räikkönen, Jannikke; Petersen, Bent; Sicheritz-Ponten, Thomas; Larson, Greger; Orlando, Ludovic; Marques-Bonet, Tomas; Hansen, Anders J; Dalén, Love; Gilbert, M Thomas P
2017-06-29
An increasing number of studies are addressing the evolutionary genomics of dog domestication, principally through resequencing dog, wolf and related canid genomes. There is, however, only one de novo assembled canid genome currently available against which to map such data - that of a boxer dog (Canis lupus familiaris). We generated the first de novo wolf genome (Canis lupus lupus) as an additional choice of reference, and explored what implications may arise when previously published dog and wolf resequencing data are remapped to this reference. Reassuringly, we find that regardless of the reference genome choice, most evolutionary genomic analyses yield qualitatively similar results, including those exploring the structure between the wolves and dogs using admixture and principal component analysis. However, we do observe differences in the genomic coverage of re-mapped samples, the number of variants discovered, and heterozygosity estimates of the samples. In conclusion, the choice of reference is dictated by the aims of the study being undertaken; if the study focuses on the differences between the different dog breeds or the fine structure among dogs, then using the boxer reference genome is appropriate, but if the aim of the study is to look at the variation within wolves and their relationships to dogs, then there are clear benefits to using the de novo assembled wolf reference genome.
2014-01-01
Background With over 50 different disorders and a combined incidence of up to 1/3000 births, lysosomal storage diseases (LSDs) constitute a major public health problem and place an enormous burden on affected individuals and their families. Many factors make LSD diagnosis difficult, including phenotype and penetrance variability, shared signs and symptoms, and problems inherent to biochemical diagnosis. Developing a powerful diagnostic tool could mitigate the protracted diagnostic process for these families, lead to better outcomes for current and proposed therapies, and provide the basis for more appropriate genetic counseling. Methods We have designed a targeted resequencing assay for the simultaneous testing of 57 lysosomal genes, using in-solution capture as the enrichment method and two different sequencing platforms. A total of 84 patients with high to moderate-or low suspicion index for LSD were enrolled in different centers in Spain and Portugal, including 18 positive controls. Results We correctly diagnosed 18 positive blinded controls, provided genetic diagnosis to 25 potential LSD patients, and ended with 18 diagnostic odysseys. Conclusion We report the assessment of a next–generation-sequencing-based approach as an accessory tool in the diagnosis of LSDs, a group of disorders which have overlapping clinical profiles and genetic heterogeneity. We have also identified and quantified the strengths and limitations of next generation sequencing (NGS) technology applied to diagnosis. PMID:24767253
High performance interconnection between high data rate networks
NASA Technical Reports Server (NTRS)
Foudriat, E. C.; Maly, K.; Overstreet, C. M.; Zhang, L.; Sun, W.
1992-01-01
The bridge/gateway system needed to interconnect a wide range of computer networks to support a wide range of user quality-of-service requirements is discussed. The bridge/gateway must handle a wide range of message types including synchronous and asynchronous traffic, large, bursty messages, short, self-contained messages, time critical messages, etc. It is shown that messages can be classified into three basic classes, synchronous and large and small asynchronous messages. The first two require call setup so that packet identification, buffer handling, etc. can be supported in the bridge/gateway. Identification enables resequences in packet size. The third class is for messages which do not require call setup. Resequencing hardware based to handle two types of resequencing problems is presented. The first is for a virtual parallel circuit which can scramble channel bytes. The second system is effective in handling both synchronous and asynchronous traffic between networks with highly differing packet sizes and data rates. The two other major needs for the bridge/gateway are congestion and error control. A dynamic, lossless congestion control scheme which can easily support effective error correction is presented. Results indicate that the congestion control scheme provides close to optimal capacity under congested conditions. Under conditions where error may develop due to intervening networks which are not lossless, intermediate error recovery and correction takes 1/3 less time than equivalent end-to-end error correction under similar conditions.
Genetic evidence of multiple loci in dystocia - difficult labour
2010-01-01
Background Dystocia, difficult labour, is a common but also complex problem during childbirth. It can be attributed to either weak contractions of the uterus, a large infant, reduced capacity of the pelvis or combinations of these. Previous studies have indicated that there is a genetic component in the susceptibility of experiencing dystocia. The purpose of this study was to identify susceptibility genes in dystocia. Methods A total of 104 women in 47 families were included where at least two sisters had undergone caesarean section at a gestational length of 286 days or more at their first delivery. Study of medical records and a telephone interview was performed to identify subjects with dystocia. Whole-genome scanning using Affymetrix genotyping-arrays and non-parametric linkage (NPL) analysis was made in 39 women exhibiting the phenotype of dystocia from 19 families. In 68 women re-sequencing was performed of candidate genes showing suggestive linkage: oxytocin (OXT) on chromosome 20 and oxytocin-receptor (OXTR) on chromosome 3. Results We found a trend towards linkage with suggestive NPL-score (3.15) on chromosome 12p12. Suggestive linkage peaks were observed on chromosomes 3, 4, 6, 10, 20. Re-sequencing of OXT and OXTR did not reveal any causal variants. Conclusions Dystocia is likely to have a genetic component with variations in multiple genes affecting the patient outcome. We found 6 loci that could be re-evaluated in larger patient cohorts. PMID:20587075
Nunes, Márcio Roberto Teixeira; de Souza, William Marciel; Acrani, Gustavo Olszanski; Cardoso, Jedson Ferreira; da Silva, Sandro Patroca; Badra, Soraya Jabur; Figueiredo, Luiz Tadeu Moraes; Vasconcelos, Pedro Fernando da Costa
2018-01-01
Group C serogroup includes members of the Orthobunyavirus genus (family Peribunyaviridae) and comprises 15 arboviruses that can be associated with febrile illness in humans. Although previous studies described the genome characterization of Group C orthobunyavirus, there is a gap in genomic information about the other viruses in this group. Therefore, in this study, complete genomes of members of Group C serogroup were sequenced or re-sequenced and used for genetic characterization, as well as to understand their phylogenetic and evolutionary aspects. Thus, our study reported the genomes of three new members in Group C virus (Apeu strain BeAn848, Itaqui strain BeAn12797 and Nepuyo strain BeAn10709), as well as re-sequencing of original strains of five members: Caraparu (strain BeAn3994), Madrid (strain BT4075), Murucutu (strain BeAn974), Oriboca (strain BeAn17), and Marituba (strain BeAn15). These viruses presented a typical genomic organization related to members of the Orthobunyavirus genus. Interestingly, all viruses of this serogroup showed an open reading frame (ORF) that encodes the putative nonstructural NSs protein that precedes the nucleoprotein ORF, an unprecedented fact in Group C virus. Also, we confirmed the presence of natural reassortment events. This study expands the genomic information of Group C viruses, as well as revalidates the genomic organization of viruses that were previously reported.
Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
Zhu, Yuan; Bergland, Alan O.; González, Josefa; Petrov, Dmitri A.
2012-01-01
The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of largely isogenic, individually sequenced Drosophila melanogaster strains. We called SNPs in the pooled data and estimated false positive and false negative rates using the SNPs called in individual strain as a reference. We also estimated allele frequency of the SNPs using “pooled” data and compared them with “true” frequencies taken from the estimates in the individual strains. We demonstrate that pooled sequencing provides a faithful estimate of population allele frequency with the error well approximated by binomial sampling, and is a reliable means of novel SNP discovery with low false positive rates. However, a sufficient number of strains should be used in the pooling because variation in the amount of DNA derived from individual strains is a substantial source of noise when the number of pooled strains is low. Our results and analysis confirm that pooled sequencing is a very powerful and cost-effective technique for assessing of patterns of sequence variation in populations on genome-wide scales, and is applicable to any dataset where sequencing individuals or individual cells is impossible, difficult, time consuming, or expensive. PMID:22848651
Lowry, David B.; Logan, Tierney L.; Santuari, Luca; Hardtke, Christian S.; Richards, James H.; DeRose-Wilson, Leah J.; McKay, John K.; Sen, Saunak; Juenger, Thomas E.
2013-01-01
The regulation of gene expression is crucial for an organism’s development and response to stress, and an understanding of the evolution of gene expression is of fundamental importance to basic and applied biology. To improve this understanding, we conducted expression quantitative trait locus (eQTL) mapping in the Tsu-1 (Tsushima, Japan) × Kas-1 (Kashmir, India) recombinant inbred line population of Arabidopsis thaliana across soil drying treatments. We then used genome resequencing data to evaluate whether genomic features (promoter polymorphism, recombination rate, gene length, and gene density) are associated with genes responding to the environment (E) or with genes with genetic variation (G) in gene expression in the form of eQTLs. We identified thousands of genes that responded to soil drying and hundreds of main-effect eQTLs. However, we identified very few statistically significant eQTLs that interacted with the soil drying treatment (GxE eQTL). Analysis of genome resequencing data revealed associations of several genomic features with G and E genes. In general, E genes had lower promoter diversity and local recombination rates. By contrast, genes with eQTLs (G) had significantly greater promoter diversity and were located in genomic regions with higher recombination. These results suggest that genomic architecture may play an important a role in the evolution of gene expression. PMID:24045022
Howard, Thomas P; Hayward, Andrew P; Tordillos, Anthony; Fragoso, Christopher; Moreno, Maria A; Tohme, Joe; Kausch, Albert P; Mottinger, John P; Dellaporta, Stephen L
2014-01-01
Since their initial discovery, transposons have been widely used as mutagens for forward and reverse genetic screens in a range of organisms. The problems of high copy number and sequence divergence among related transposons have often limited the efficiency at which tagged genes can be identified. A method was developed to identity the locations of Mutator (Mu) transposons in the Zea mays genome using a simple enrichment method combined with genome resequencing to identify transposon junction fragments. The sequencing library was prepared from genomic DNA by digesting with a restriction enzyme that cuts within a perfectly conserved motif of the Mu terminal inverted repeats (TIR). Paired-end reads containing Mu TIR sequences were computationally identified and chromosomal sequences flanking the transposon were mapped to the maize reference genome. This method has been used to identify Mu insertions in a number of alleles and to isolate the previously unidentified lazy plant1 (la1) gene. The la1 gene is required for the negatively gravitropic response of shoots and mutant plants lack the ability to sense gravity. Using bioinformatic and fluorescence microscopy approaches, we show that the la1 gene encodes a cell membrane and nuclear localized protein. Our Mu-Taq method is readily adaptable to identify the genomic locations of any insertion of a known sequence in any organism using any sequencing platform.
Kunihisa, Miyuki; Moriya, Shigeki; Abe, Kazuyuki; Okada, Kazuma; Haji, Takashi; Hayashi, Takeshi; Kawahara, Yoshihiro; Itoh, Ryutaro; Itoh, Takeshi; Katayose, Yuichi; Kanamori, Hiroyuki; Matsumoto, Toshimi; Mori, Satomi; Sasaki, Harumi; Matsumoto, Takashi; Nishitani, Chikako; Terakami, Shingo; Yamamoto, Toshiya
2016-01-01
‘Fuji’ is one of the most popular and highly-produced apple cultivars worldwide, and has been frequently used in breeding programs. The development of genotypic markers for the preferable phenotypes of ‘Fuji’ is required. Here, we aimed to define the haplotypes of ‘Fuji’ and find associations between haplotypes and phenotypes of five traits (harvest day, fruit weight, acidity, degree of watercore, and flesh mealiness) by using 115 accessions related to ‘Fuji’. Through the re-sequencing of ‘Fuji’ genome, total of 2,820,759 variants, including single nucleotide polymorphisms (SNPs) and insertions or deletions (indels) were detected between ‘Fuji’ and ‘Golden Delicious’ reference genome. We selected mapping-validated 1,014 SNPs, most of which were heterozygous in ‘Fuji’ and capable of distinguishing alleles inherited from the parents of ‘Fuji’ (i.e., ‘Ralls Janet’ and ‘Delicious’). We used these SNPs to define the haplotypes of ‘Fuji’ and trace their inheritance in relatives, which were shown to have an average of 27% of ‘Fuji’ genome. Analysis of variance (ANOVA) based on ‘Fuji’ haplotypes identified one quantitative trait loci (QTL) each for harvest time, acidity, degree of watercore, and mealiness. A haplotype from ‘Delicious’ chr14 was considered to dominantly cause watercore, and one from ‘Ralls Janet’ chr1 was related to low-mealiness. PMID:27795675
Howard, Thomas P.; Hayward, Andrew P.; Tordillos, Anthony; Fragoso, Christopher; Moreno, Maria A.; Tohme, Joe; Kausch, Albert P.; Mottinger, John P.; Dellaporta, Stephen L.
2014-01-01
Since their initial discovery, transposons have been widely used as mutagens for forward and reverse genetic screens in a range of organisms. The problems of high copy number and sequence divergence among related transposons have often limited the efficiency at which tagged genes can be identified. A method was developed to identity the locations of Mutator (Mu) transposons in the Zea mays genome using a simple enrichment method combined with genome resequencing to identify transposon junction fragments. The sequencing library was prepared from genomic DNA by digesting with a restriction enzyme that cuts within a perfectly conserved motif of the Mu terminal inverted repeats (TIR). Paired-end reads containing Mu TIR sequences were computationally identified and chromosomal sequences flanking the transposon were mapped to the maize reference genome. This method has been used to identify Mu insertions in a number of alleles and to isolate the previously unidentified lazy plant1 (la1) gene. The la1 gene is required for the negatively gravitropic response of shoots and mutant plants lack the ability to sense gravity. Using bioinformatic and fluorescence microscopy approaches, we show that the la1 gene encodes a cell membrane and nuclear localized protein. Our Mu-Taq method is readily adaptable to identify the genomic locations of any insertion of a known sequence in any organism using any sequencing platform. PMID:24498020
2013-01-01
Background Artificial selection played an important role in the origin of modern Glycine max cultivars from the wild soybean Glycine soja. To elucidate the consequences of artificial selection accompanying the domestication and modern improvement of soybean, 25 new and 30 published whole-genome re-sequencing accessions, which represent wild, domesticated landrace, and Chinese elite soybean populations were analyzed. Results A total of 5,102,244 single nucleotide polymorphisms (SNPs) and 707,969 insertion/deletions were identified. Among the SNPs detected, 25.5% were not described previously. We found that artificial selection during domestication led to more pronounced reduction in the genetic diversity of soybean than the switch from landraces to elite cultivars. Only a small proportion (2.99%) of the whole genomic regions appear to be affected by artificial selection for preferred agricultural traits. The selection regions were not distributed randomly or uniformly throughout the genome. Instead, clusters of selection hotspots in certain genomic regions were observed. Moreover, a set of candidate genes (4.38% of the total annotated genes) significantly affected by selection underlying soybean domestication and genetic improvement were identified. Conclusions Given the uniqueness of the soybean germplasm sequenced, this study drew a clear picture of human-mediated evolution of the soybean genomes. The genomic resources and information provided by this study would also facilitate the discovery of genes/loci underlying agronomically important traits. PMID:23984715
Jiang, Hong; Wang, Limin; Xu, Rujun; Shi, Yanbin; Zhang, Jianguang; Xu, Mengnan; Cram, David S.; Ma, Shenglin
2016-01-01
Activating and resistance mutations in the tyrosine kinase domain of several oncogenes are frequently associated with non-small cell lung carcinoma (NSCLC). In this study we assessed the frequency, type and abundance of EGFR, KRAS, BRAF, TP53 and ALK mutations in tumour specimens from 184 patients with early and late stage disease using single molecule amplification and re-sequencing technology (SMART). Based on modelling of EGFR mutations, the detection sensitivity of the SMART assay was at least 0.1%. Benchmarking EGFR mutation detection against the gold standard ARMS-PCR assay, SMART assay had a sensitivity and specificity of 98.7% and 99.0%. Amongst the 184 samples, EGFR mutations were the most prevalent (59.9%), followed by KRAS (16.9%), TP53 (12.7%), EML4-ALK fusions (6.3%) and BRAF (4.2%) mutations. The abundance and types of mutations in tumour specimens were extremely heterogeneous, involving either monoclonal (51.6%) or polyclonal (12.6%) mutation events. At the clinical level, although the spectrum of tumour mutation(s) was unique to each patient, the overall patterns in early or advanced stage disease were relatively similar. Based on these findings, we propose that personalized profiling and quantitation of clinically significant oncogenic mutations will allow better classification of patients according to tumour characteristics and provide clinicians with important ancillary information for treatment decision-making. PMID:27409166
Zhang, Shirong; Xia, Bing; Jiang, Hong; Wang, Limin; Xu, Rujun; Shi, Yanbin; Zhang, Jianguang; Xu, Mengnan; Cram, David S; Ma, Shenglin
2016-08-02
Activating and resistance mutations in the tyrosine kinase domain of several oncogenes are frequently associated with non-small cell lung carcinoma (NSCLC). In this study we assessed the frequency, type and abundance of EGFR, KRAS, BRAF, TP53 and ALK mutations in tumour specimens from 184 patients with early and late stage disease using single molecule amplification and re-sequencing technology (SMART). Based on modelling of EGFR mutations, the detection sensitivity of the SMART assay was at least 0.1%. Benchmarking EGFR mutation detection against the gold standard ARMS-PCR assay, SMART assay had a sensitivity and specificity of 98.7% and 99.0%. Amongst the 184 samples, EGFR mutations were the most prevalent (59.9%), followed by KRAS (16.9%), TP53 (12.7%), EML4-ALK fusions (6.3%) and BRAF (4.2%) mutations. The abundance and types of mutations in tumour specimens were extremely heterogeneous, involving either monoclonal (51.6%) or polyclonal (12.6%) mutation events. At the clinical level, although the spectrum of tumour mutation(s) was unique to each patient, the overall patterns in early or advanced stage disease were relatively similar. Based on these findings, we propose that personalized profiling and quantitation of clinically significant oncogenic mutations will allow better classification of patients according to tumour characteristics and provide clinicians with important ancillary information for treatment decision-making.
Strategies to improve reference databases for soil microbiomes
Choi, Jinlyung; Yang, Fan; Stepanauskas, Ramunas; ...
2016-12-09
A database of curated genomes is needed to better assess soil microbial communities and their processes associated with differing land management and environmental impacts. Interpreting soil metagenomic datasets with existing sequence databases is challenging because these datasets are biased towards medical and biotechnology research and can result in misleading annotations. We have curated a database of 928 genomes of soil-associated organisms (888 bacteria, 34 archaea, and 6 fungi). Using this database as a representation of the current state of knowledge of soil microbes that are well-characterized, we evaluated its composition and compared it to broader microbial databases, specifically NCBI’s RefSeq,more » as well as 3,035 publicly available soil amplicon datasets. These comparisons identified phyla and functions that are enriched in soils as well as those that may be underrepresented in RefSoil. For example, RefSoil was observed to have increased representation of Firmicutes despite its low abundance in soil environments and also lacked representation of Acidobacteria and Verrucomicrobia, which are abundant in soils. Our comparison of RefSoil to soil amplicon datasets allowed us to identify targets that if cultured or sequenced would significantly increase the biodiversity represented within RefSoil. To demonstrate the opportunities to access these underrepresented targets, we employed single cell genomics in a pilot experiment to recover 14 genomes from the "most wanted" list, which improved RefSoil's representation of EMP sequences by 7% by abundance. This effort demonstrates the value of RefSoil in the guidance of future research efforts and the capability of single cell genomics as a practical means to fill the existing genomic data gaps.« less
Strategies to improve reference databases for soil microbiomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Choi, Jinlyung; Yang, Fan; Stepanauskas, Ramunas
A database of curated genomes is needed to better assess soil microbial communities and their processes associated with differing land management and environmental impacts. Interpreting soil metagenomic datasets with existing sequence databases is challenging because these datasets are biased towards medical and biotechnology research and can result in misleading annotations. We have curated a database of 928 genomes of soil-associated organisms (888 bacteria, 34 archaea, and 6 fungi). Using this database as a representation of the current state of knowledge of soil microbes that are well-characterized, we evaluated its composition and compared it to broader microbial databases, specifically NCBI’s RefSeq,more » as well as 3,035 publicly available soil amplicon datasets. These comparisons identified phyla and functions that are enriched in soils as well as those that may be underrepresented in RefSoil. For example, RefSoil was observed to have increased representation of Firmicutes despite its low abundance in soil environments and also lacked representation of Acidobacteria and Verrucomicrobia, which are abundant in soils. Our comparison of RefSoil to soil amplicon datasets allowed us to identify targets that if cultured or sequenced would significantly increase the biodiversity represented within RefSoil. To demonstrate the opportunities to access these underrepresented targets, we employed single cell genomics in a pilot experiment to recover 14 genomes from the "most wanted" list, which improved RefSoil's representation of EMP sequences by 7% by abundance. This effort demonstrates the value of RefSoil in the guidance of future research efforts and the capability of single cell genomics as a practical means to fill the existing genomic data gaps.« less
Pedersen, M S; Fahnøe, U; Hansen, T A; Pedersen, A G; Jenssen, H; Bukh, J; Schønning, K
2018-06-01
The current treatment options for hepatitis C virus (HCV), based on direct acting antivirals (DAA), are dependent on virus genotype and previous treatment experience. Treatment failures have been associated with detection of resistance-associated substitutions (RASs) in the DAA targets of HCV, the NS3, NS5A and NS5 B proteins. To develop a next generation sequencing based method that provides genotype and detection of HCV NS3, NS5A, and NS5 B RASs without prior knowledge of sample genotype. In total, 101 residual plasma samples from patients with HCV covering 10 different viral subtypes across 4 genotypes with viral loads of 3.84-7.61 Log IU/mL were included. All samples were de-identified and consequently prior treatment status for patients was unknown. Almost full open reading frame amplicons (∼ 9 kb) were generated using RT-PCR with a single primer set. The resulting amplicons were sequenced with high throughput sequencing and analysed using an in-house developed script for detecting RASs. The method successfully amplified and sequenced 94% (95/101) of samples with an average coverage of 14,035; four of six failed samples were genotype 4a. Samples analysed twice yielded reproducible nucleotide frequencies across all sites. RASs were detected in 21/95 (22%) samples at a 15% threshold. The method identified one patient infected with two genotype 2b variants, and the presence of subgenomic deletion variants in 8 (8.4%) of 95 successfully sequenced samples. The presented method may provide identification of HCV genotype, RASs detection, and detect multiple HCV infection without prior knowledge of sample genotype. Copyright © 2018 Elsevier B.V. All rights reserved.
McGee, C F; Storey, S; Clipson, N; Doyle, E
2017-04-01
Soil microorganisms are key contributors to nutrient cycling and are essential for the maintenance of healthy soils and sustainable agriculture. Although the antimicrobial effects of a broad range of nanoparticulate substances have been characterised in vitro, little is known about the impact of these compounds on microbial communities in environments such as soil. In this study, the effect of three widely used nanoparticulates (silver, silicon dioxide and aluminium oxide) on bacterial and fungal communities in an agricultural pastureland soil was examined in a microcosm-based experiment using a combination of enzyme analysis, molecular fingerprinting and amplicon sequencing. A relatively low concentration of silver nanoparticles (AgNPs) significantly reduced total soil dehydrogenase and urease activity, while Al 2 O 3 and SiO 2 nanoparticles had no effect. Amplicon sequencing revealed substantial shifts in bacterial community composition in soils amended with AgNPs, with significant decreases in the relative abundance of Acidobacteria and Verrucomicrobia and an increase in Proteobacteria. In particular, the relative abundance of the Proteobacterial genus Dyella significantly increased in AgNP amended soil. The effects of Al 2 O 3 and SiO 2 NPs on bacterial community composition were less pronounced. AgNPs significantly reduced bacterial and archaeal amoA gene abundance in soil, with the archaea more susceptible than bacteria. AgNPs also significantly impacted soil fungal community structure, while Al 2 O 3 and SiO 2 NPs had no effect. Several fungal ribotypes increased in soil amended with AgNPs, compared to control soil. This study highlights the need to consider the effects of individual nanoparticles on soil microbial communities when assessing their environmental impact.
Stokholm-Bjerregaard, Mikkel; McIlroy, Simon J; Nierychlo, Marta; Karst, Søren M; Albertsen, Mads; Nielsen, Per H
2017-01-01
Understanding the microbiology of phosphorus (P) removal is considered essential to knowledge-based optimization of enhanced biological P removal (EBPR) systems. Biological P removal is achieved in these systems by promoting the growth of organisms collectively known as the polyphosphate accumulating organisms (PAOs). Also considered important to EBPR are the glycogen accumulating organisms (GAOs), which are theorized to compete with the PAOs for resources at the expense of P removal efficiency. Numerous studies have sought to identify the PAOs and their GAOs competitors, with several candidates proposed for each over the last few decades. The current study collectively assessed the abundance and diversity of all proposed PAOs and GAOs in 18 Danish full-scale wastewater treatment plants with well-working biological nutrient removal over a period of 9 years using 16S rRNA gene amplicon sequencing. The microbial community structure in all plants was relatively stable over time. Evidence for the role of the proposed PAOs and GAOs in EBPR varies and is critically assessed, in light of their calculated amplicon abundances, to indicate which of these are important in full-scale systems. Bacteria from the genus Tetrasphaera were the most abundant of the PAOs. The " Candidatus Accumulibacter" PAOs were in much lower abundance and appear to be biased by the amplicon-based method applied. The genera Dechloromonas, Microlunatus , and Tessaracoccus were identified as abundant putative PAO that require further research attention. Interestingly, the actinobacterial Micropruina and sbr-gs28 phylotypes were among the most abundant of the putative GAOs. Members of the genera Defluviicoccus, Propionivibrio , the family Competibacteraceae, and the spb280 group were also relatively abundant in some plants. Despite observed high abundances of GAOs (periodically exceeding 20% of the amplicon reads), P removal performance was maintained, indicating that these organisms were not outcompeting the PAOs in these EBPR systems. Phylogenetic diversity within each of the PAOs and GAOs genera was observed, which is consistent with reported metabolic diversity for these. Whether or not key traits can be assigned to sub-genus level clades requires further investigation.
Guibert, N; Hu, Y; Feeney, N; Kuang, Y; Plagnol, V; Jones, G; Howarth, K; Beeler, J F; Paweletz, C P; Oxnard, G R
2018-04-01
Genomic analysis of plasma cell-free DNA is transforming lung cancer care; however, available assays are limited by cost, turnaround time, and imperfect accuracy. Here, we study amplicon-based plasma next-generation sequencing (NGS), rather than hybrid-capture-based plasma NGS, hypothesizing this would allow sensitive detection and monitoring of driver and resistance mutations in advanced non-small cell lung cancer (NSCLC). Plasma samples from patients with NSCLC and a known targetable genotype (EGFR, ALK/ROS1, and other rare genotypes) were collected while on therapy and analyzed blinded to tumor genotype. Plasma NGS was carried out using enhanced tagged amplicon sequencing of hotspots and coding regions from 36 genes, as well as intronic coverage for detection of ALK/ROS1 fusions. Diagnostic accuracy was compared with plasma droplet digital PCR (ddPCR) and tumor genotype. A total of 168 specimens from 46 patients were studied. Matched plasma NGS and ddPCR across 120 variants from 80 samples revealed high concordance of allelic fraction (R2 = 0.95). Pretreatment, sensitivity of plasma NGS for the detection of EGFR driver mutations was 100% (30/30), compared with 87% for ddPCR (26/30). A full spectrum of rare driver oncogenic mutations could be detected including sensitive detection of ALK/ROS1 fusions (8/9 detected, 89%). Studying 25 patients positive for EGFR T790M that developed resistance to osimertinib, 15 resistance mechanisms could be detected including tertiary EGFR mutations (C797S, Q791P) and mutations or amplifications of non-EGFR genes, some of which could be detected pretreatment or months before progression. This blinded analysis demonstrates the ability of amplicon-based plasma NGS to detect a full range of targetable genotypes in NSCLC, including fusion genes, with high accuracy. The ability of plasma NGS to detect a range of preexisting and acquired resistance mechanisms highlights its potential value as an alternative to single mutation digital PCR-based plasma assays for personalizing treatment of TKI resistance in lung cancer.
Targeted RNA-Sequencing with Competitive Multiplex-PCR Amplicon Libraries
Blomquist, Thomas M.; Crawford, Erin L.; Lovett, Jennie L.; Yeo, Jiyoun; Stanoszek, Lauren M.; Levin, Albert; Li, Jia; Lu, Mei; Shi, Leming; Muldrew, Kenneth; Willey, James C.
2013-01-01
Whole transcriptome RNA-sequencing is a powerful tool, but is costly and yields complex data sets that limit its utility in molecular diagnostic testing. A targeted quantitative RNA-sequencing method that is reproducible and reduces the number of sequencing reads required to measure transcripts over the full range of expression would be better suited to diagnostic testing. Toward this goal, we developed a competitive multiplex PCR-based amplicon sequencing library preparation method that a) targets only the sequences of interest and b) controls for inter-target variation in PCR amplification during library preparation by measuring each transcript native template relative to a known number of synthetic competitive template internal standard copies. To determine the utility of this method, we intentionally selected PCR conditions that would cause transcript amplification products (amplicons) to converge toward equimolar concentrations (normalization) during library preparation. We then tested whether this approach would enable accurate and reproducible quantification of each transcript across multiple library preparations, and at the same time reduce (through normalization) total sequencing reads required for quantification of transcript targets across a large range of expression. We demonstrate excellent reproducibility (R2 = 0.997) with 97% accuracy to detect 2-fold change using External RNA Controls Consortium (ERCC) reference materials; high inter-day, inter-site and inter-library concordance (R2 = 0.97–0.99) using FDA Sequencing Quality Control (SEQC) reference materials; and cross-platform concordance with both TaqMan qPCR (R2 = 0.96) and whole transcriptome RNA-sequencing following “traditional” library preparation using Illumina NGS kits (R2 = 0.94). Using this method, sequencing reads required to accurately quantify more than 100 targeted transcripts expressed over a 107-fold range was reduced more than 10,000-fold, from 2.3×109 to 1.4×105 sequencing reads. These studies demonstrate that the competitive multiplex-PCR amplicon library preparation method presented here provides the quality control, reproducibility, and reduced sequencing reads necessary for development and implementation of targeted quantitative RNA-sequencing biomarkers in molecular diagnostic testing. PMID:24236095
Kutyavin, Igor V.
2010-01-01
The article describes a new technology for real-time polymerase chain reaction (PCR) detection of nucleic acids. Similar to Taqman, this new method, named Snake, utilizes the 5′-nuclease activity of Thermus aquaticus (Taq) DNA polymerase that cleaves dual-labeled Förster resonance energy transfer (FRET) probes and generates a fluorescent signal during PCR. However, the mechanism of the probe cleavage in Snake is different. In this assay, PCR amplicons fold into stem–loop secondary structures. Hybridization of FRET probes to one of these structures leads to the formation of optimal substrates for the 5′-nuclease activity of Taq. The stem–loop structures in the Snake amplicons are introduced by the unique design of one of the PCR primers, which carries a special 5′-flap sequence. It was found that at a certain length of these 5′-flap sequences the folded Snake amplicons have very little, if any, effect on PCR yield but benefit many aspects of the detection process, particularly the signal productivity. Unlike Taqman, the Snake system favors the use of short FRET probes with improved fluorescence background. The head-to-head comparison study of Snake and Taqman revealed that these two technologies have more differences than similarities with respect to their responses to changes in PCR protocol, e.g. the variations in primer concentration, annealing time, PCR asymmetry. The optimal PCR protocol for Snake has been identified. The technology’s real-time performance was compared to a number of conventional assays including Taqman, 3′-MGB-Taqman, Molecular Beacon and Scorpion primers. The test trial showed that Snake supersedes the conventional assays in the signal productivity and detection of sequence variations as small as single nucleotide polymorphisms. Due to the assay’s cost-effectiveness and simplicity of design, the technology is anticipated to quickly replace all known conventional methods currently used for real-time nucleic acid detection. PMID:19969535
The development of miniplex primer sets for the analysis of degraded DNA
NASA Astrophysics Data System (ADS)
McCord, Bruce; Opel, Kerry; Chung, Denise; Drabek, Jiri; Tatarek, Nancy; Meadows Jantz, Lee; Butler, John
2005-05-01
In this project, a new set of multiplexed PCR reactions has been developed for the analysis of degraded DNA. These DNA markers, known as Miniplexes, utilize primers that have shorter amplicons for use in short tandem repeat (STR) analysis of degraded DNA. In our work we have defined six of these new STR multiplexes, each of which consists of 3 to 4 reduced size STR loci, and each labeled with a different fluorescent dye. When compared to commercially available STR systems, reductions in size of up to 300 basepairs are possible. In addition, these newly designed amplicons consist of loci that are fully compatible with the the national computer DNA database known as CODIS. To demonstrate compatibility with commercial STR kits, a concordance study of 532 DNA samples of Caucasian, African American, and Hispanic origin was undertaken There was 99.77% concordance between allele calls with the two methods. Of these 532 samples, only 15 samples showed discrepancies at one of 12 loci. These occurred predominantly at 2 loci, vWA and D13S317. DNA sequencing revealed that these locations had deletions between the two primer binding sites. Uncommon deletions like these can be expected in certain samples and will not affect the utility of the Miniplexes as tools for degraded DNA analysis. The Miniplexes were also applied to enzymatically digested DNA to assess their potential in degraded DNA analysis. The results demonstrated a greatly improved efficiency in the analysis of degraded DNA when compared to commercial STR genotyping kits. A series of human skeletal remains that had been exposed to a variety of environmental conditions were also examined. Sixty-four percent of the samples generated full profiles when amplified with the Miniplexes, while only sixteen percent of the samples tested generated full profiles with a commercial kit. In addition, complete profiles were obtained for eleven of the twelve Miniplex loci which had amplicon size ranges less than 200 base pairs. These data clearly demonstrate that smaller PCR amplicons provide an attractive alternative to mitochondrial DNA for forensic analysis of degraded DNA.
Figuerola, Eva L. M.; Erijman, Leonardo
2014-01-01
The performance of two sets of primers targeting variable regions of the 16S rRNA gene V1–V3 and V4 was compared in their ability to describe changes of bacterial diversity and temporal turnover in full-scale activated sludge. Duplicate sets of high-throughput amplicon sequencing data of the two 16S rRNA regions shared a collection of core taxa that were observed across a series of twelve monthly samples, although the relative abundance of each taxon was substantially different between regions. A case in point was the changes in the relative abundance of filamentous bacteria Thiothrix, which caused a large effect on diversity indices, but only in the V1–V3 data set. Yet the relative abundance of Thiothrix in the amplicon sequencing data from both regions correlated with the estimation of its abundance determined using fluorescence in situ hybridization. In nonmetric multidimensional analysis samples were distributed along the first ordination axis according to the sequenced region rather than according to sample identities. The dynamics of microbial communities indicated that V1–V3 and the V4 regions of the 16S rRNA gene yielded comparable patterns of: 1) the changes occurring within the communities along fixed time intervals, 2) the slow turnover of activated sludge communities and 3) the rate of species replacement calculated from the taxa–time relationships. The temperature was the only operational variable that showed significant correlation with the composition of bacterial communities over time for the sets of data obtained with both pairs of primers. In conclusion, we show that despite the bias introduced by amplicon sequencing, the variable regions V1–V3 and V4 can be confidently used for the quantitative assessment of bacterial community dynamics, and provide a proper qualitative account of general taxa in the community, especially when the data are obtained over a convenient time window rather than at a single time point. PMID:24923665
Yokoyama, Eiji; Hashimoto, Ruiko; Etoh, Yoshiki; Ichihara, Sachiko; Horikawa, Kazumi; Uchimura, Masako
2011-01-01
The distribution of insertion sequence (IS) 629 among strains of enterohemorrhagic Escherichia coli serovar O157 (O157) was investigated and compared with the strain lineages defined by lineage specific polymorphism assay-6 (LSPA-6) to demonstrate the effectiveness of IS629 analysis for population genetics analysis. Using pulsed-field gel electrophoresis and variable-number tandem repeat typing, 140 strains producing both VT1 and VT2 and 98 strains producing only VT2 were selected from a total of 592 strains isolated from patients and asymptomatic carriers in Chiba Prefecture, Japan, during 2003-2008. By LSPA-6 analysis, six strains had atypical amplicon sizes in their Z5935 loci and five strains had atypical amplicon sizes in their arp-iclR intergenic regions. Sequence analyses of PCR amplified DNAs showed that five of the six loci used for LSPA-6 analysis had tandem repeats and the allele changes were due to changes in the number of tandem repeats. Subculturing and long-term incubation was found to have no detectable effect on the lineages defined by LSPA-6 analysis, demonstrating the robustness of LSPA-6 analysis. Minimum spanning tree analysis reconstruction revealed that strains in lineage I, I/II, and II clustered on separate branches, indicating that the distribution of IS629 was biased among O157 strains in different lineages. Strains with LSPA-6 codes 231111, 211113, and 211114 had atypical amplicon sizes and were clustered in lineage I/II branch, and strains with LSPA-6 codes 212114, 221123, 221223, 222123, 222224, 242123, 252123, and 242222 had atypical amplicon sizes and clustered in lineage II branches. Linkage disequilibrium was observed in strains in every lineage when the standardized index of association was calculated using IS629 distribution data. Therefore, the distribution analysis of IS629 may be effective for population genetics analysis of O157 due to the biased IS629 distribution among strains in the three O157 lineages. Copyright © 2010 Elsevier B.V. All rights reserved.
Owa, Chie; Poulin, Matthew; Yan, Liying; Shioda, Toshi
2018-01-01
The existence of cytosine methylation in mammalian mitochondrial DNA (mtDNA) is a controversial subject. Because detection of DNA methylation depends on resistance of 5'-modified cytosines to bisulfite-catalyzed conversion to uracil, examined parameters that affect technical adequacy of mtDNA methylation analysis. Negative control amplicons (NCAs) devoid of cytosine methylation were amplified to cover the entire human or mouse mtDNA by long-range PCR. When the pyrosequencing template amplicons were gel-purified after bisulfite conversion, bisulfite pyrosequencing of NCAs did not detect significant levels of bisulfite-resistant cytosines (brCs) at ND1 (7 CpG sites) or CYTB (8 CpG sites) genes (CI95 = 0%-0.94%); without gel-purification, significant false-positive brCs were detected from NCAs (CI95 = 4.2%-6.8%). Bisulfite pyrosequencing of highly purified, linearized mtDNA isolated from human iPS cells or mouse liver detected significant brCs (~30%) in human ND1 gene when the sequencing primer was not selective in bisulfite-converted and unconverted templates. However, repeated experiments using a sequencing primer selective in bisulfite-converted templates almost completely (< 0.8%) suppressed brC detection, supporting the false-positive nature of brCs detected using the non-selective primer. Bisulfite-seq deep sequencing of linearized, gel-purified human mtDNA detected 9.4%-14.8% brCs for 9 CpG sites in ND1 gene. However, because all these brCs were associated with adjacent non-CpG brCs showing the same degrees of bisulfite resistance, DNA methylation in this mtDNA-encoded gene was not confirmed. Without linearization, data generated by bisulfite pyrosequencing or deep sequencing of purified mtDNA templates did not pass the quality control criteria. Shotgun bisulfite sequencing of human mtDNA detected extremely low levels of CpG methylation (<0.65%) over non-CpG methylation (<0.55%). Taken together, our study demonstrates that adequacy of mtDNA methylation analysis using methods dependent on bisulfite conversion needs to be established for each experiment, taking effects of incomplete bisulfite conversion and template impurity or topology into consideration.
Kimura, Yasumasa; Soma, Takahiro; Kasahara, Naoko; Delobel, Diane; Hanami, Takeshi; Tanaka, Yuki; de Hoon, Michiel J L; Hayashizaki, Yoshihide; Usui, Kengo; Harbers, Matthias
2016-01-01
Analytical PCR experiments preferably use internal probes for monitoring the amplification reaction and specific detection of the amplicon. Such internal probes have to be designed in close context with the amplification primers, and may require additional considerations for the detection of genetic variations. Here we describe Edesign, a new online and stand-alone tool for designing sets of PCR primers together with an internal probe for conducting quantitative real-time PCR (qPCR) and genotypic experiments. Edesign can be used for selecting standard DNA oligonucleotides like for instance TaqMan probes, but has been further extended with new functions and enhanced design features for Eprobes. Eprobes, with their single thiazole orange-labelled nucleotide, allow for highly sensitive genotypic assays because of their higher DNA binding affinity as compared to standard DNA oligonucleotides. Using new thermodynamic parameters, Edesign considers unique features of Eprobes during primer and probe design for establishing qPCR experiments and genotyping by melting curve analysis. Additional functions in Edesign allow probe design for effective discrimination between wild-type sequences and genetic variations either using standard DNA oligonucleotides or Eprobes. Edesign can be freely accessed online at http://www.dnaform.com/edesign2/, and the source code is available for download.
Kasahara, Naoko; Delobel, Diane; Hanami, Takeshi; Tanaka, Yuki; de Hoon, Michiel J. L.; Hayashizaki, Yoshihide; Usui, Kengo; Harbers, Matthias
2016-01-01
Analytical PCR experiments preferably use internal probes for monitoring the amplification reaction and specific detection of the amplicon. Such internal probes have to be designed in close context with the amplification primers, and may require additional considerations for the detection of genetic variations. Here we describe Edesign, a new online and stand-alone tool for designing sets of PCR primers together with an internal probe for conducting quantitative real-time PCR (qPCR) and genotypic experiments. Edesign can be used for selecting standard DNA oligonucleotides like for instance TaqMan probes, but has been further extended with new functions and enhanced design features for Eprobes. Eprobes, with their single thiazole orange-labelled nucleotide, allow for highly sensitive genotypic assays because of their higher DNA binding affinity as compared to standard DNA oligonucleotides. Using new thermodynamic parameters, Edesign considers unique features of Eprobes during primer and probe design for establishing qPCR experiments and genotyping by melting curve analysis. Additional functions in Edesign allow probe design for effective discrimination between wild-type sequences and genetic variations either using standard DNA oligonucleotides or Eprobes. Edesign can be freely accessed online at http://www.dnaform.com/edesign2/, and the source code is available for download. PMID:26863543
Prebiotic Wheat Bran Fractions Induce Specific Microbiota Changes
D’hoe, Kevin; Conterno, Lorenza; Fava, Francesca; Falony, Gwen; Vieira-Silva, Sara; Vermeiren, Joan; Tuohy, Kieran; Raes, Jeroen
2018-01-01
Wheat bran fibers are considered beneficial to human health through their impact on gut microbiota composition and activity. Here, we assessed the prebiotic potential of selected bran fractions by performing a series of fecal slurry anaerobic fermentation experiments using aleurone as well as total, ultrafine, and soluble wheat bran (swb) as carbon sources. By combining amplicon-based community profiling with a fluorescent in situ hybridization (FISH) approach, we found that incubation conditions favor the growth of Proteobacteria such as Escherichia and Bilophila. These effects were countered in all but one [total wheat bran (twb)] fermentation experiments. Growth of Bifidobacterium species was stimulated after fermentation using ultrafine, soluble, and twb, in the latter two as part of a general increase in bacterial load. Both ultrafine and swb fermentation resulted in a trade-off between Bifidobacterium and Bilophila, as previously observed in human dietary supplementation studies looking at the effect of inulin-type fructans on the human gut microbiota. Aleurone selectively stimulated growth of Dorea and butyrate-producing Roseburia. All fermentation experiments induced enhanced gas production; increased butyrate concentrations were only observed following soluble bran incubation. Our results open perspectives for the development of aleurone as a complementary prebiotic selectively targeting colon butyrate producers. PMID:29416529
2013-01-01
Analyzing and storing data and results from next-generation sequencing (NGS) experiments is a challenging task, hampered by ever-increasing data volumes and frequent updates of analysis methods and tools. Storage and computation have grown beyond the capacity of personal computers and there is a need for suitable e-infrastructures for processing. Here we describe UPPNEX, an implementation of such an infrastructure, tailored to the needs of data storage and analysis of NGS data in Sweden serving various labs and multiple instruments from the major sequencing technology platforms. UPPNEX comprises resources for high-performance computing, large-scale and high-availability storage, an extensive bioinformatics software suite, up-to-date reference genomes and annotations, a support function with system and application experts as well as a web portal and support ticket system. UPPNEX applications are numerous and diverse, and include whole genome-, de novo- and exome sequencing, targeted resequencing, SNP discovery, RNASeq, and methylation analysis. There are over 300 projects that utilize UPPNEX and include large undertakings such as the sequencing of the flycatcher and Norwegian spruce. We describe the strategic decisions made when investing in hardware, setting up maintenance and support, allocating resources, and illustrate major challenges such as managing data growth. We conclude with summarizing our experiences and observations with UPPNEX to date, providing insights into the successful and less successful decisions made. PMID:23800020
MMP21 is mutated in human heterotaxy and is required for normal left-right asymmetry in vertebrates.
Guimier, Anne; Gabriel, George C; Bajolle, Fanny; Tsang, Michael; Liu, Hui; Noll, Aaron; Schwartz, Molly; El Malti, Rajae; Smith, Laurie D; Klena, Nikolai T; Jimenez, Gina; Miller, Neil A; Oufadem, Myriam; Moreau de Bellaing, Anne; Yagi, Hisato; Saunders, Carol J; Baker, Candice N; Di Filippo, Sylvie; Peterson, Kevin A; Thiffault, Isabelle; Bole-Feysot, Christine; Cooley, Linda D; Farrow, Emily G; Masson, Cécile; Schoen, Patric; Deleuze, Jean-François; Nitschké, Patrick; Lyonnet, Stanislas; de Pontual, Loic; Murray, Stephen A; Bonnet, Damien; Kingsmore, Stephen F; Amiel, Jeanne; Bouvagnet, Patrice; Lo, Cecilia W; Gordon, Christopher T
2015-11-01
Heterotaxy results from a failure to establish normal left-right asymmetry early in embryonic development. By whole-exome sequencing, whole-genome sequencing and high-throughput cohort resequencing, we identified recessive mutations in MMP21 (encoding matrix metallopeptidase 21) in nine index cases with heterotaxy. In addition, Mmp21-mutant mice and mmp21-morphant zebrafish displayed heterotaxy and abnormal cardiac looping, respectively, suggesting a new role for extracellular matrix remodeling in the establishment of laterality in vertebrates.
MMP21 is mutated in human heterotaxy and is required for normal left-right asymmetry in vertebrates
Guimier, Anne; Gabriel, George C.; Bajolle, Fanny; Tsang, Michael; Liu, Hui; Noll, Aaron; Schwartz, Molly; El Malti, Rajae; Smith, Laurie D.; Klena, Nikolai T.; Jimenez, Gina; Miller, Neil A.; Oufadem, Myriam; Moreau de Bellaing, Anne; Yagi, Hisato; Saunders, Carol J.; Baker, Candice N.; Di Filippo, Sylvie; Peterson, Kevin A.; Thiffault, Isabelle; Bole-Feysot, Christine; Cooley, Linda D.; Farrow, Emily G.; Masson, Cécile; Schoen, Patric; Deleuze, Jean-François; Nitschké, Patrick; Lyonnet, Stanislas; de Pontual, Loic; Murray, Stephen A.; Bonnet, Damien; Kingsmore, Stephen F.; Amiel, Jeanne; Bouvagnet, Patrice; Lo, Cecilia W.; Gordon, Christopher T.
2017-01-01
Heterotaxy results from a failure to establish normal left-right asymmetry early in embryonic development. By whole exome sequencing, whole genome sequencing and high-throughput cohort resequencing we identified recessive mutations in matrix metallopeptidase 21 (MMP21), in nine index cases with heterotaxy. In addition, Mmp21 mutant mice and morphant zebrafish display heterotaxy and abnormal cardiac looping, respectively, suggesting a novel role for extra-cellular remodeling in the establishment of laterality in vertebrates. PMID:26437028
NASA Astrophysics Data System (ADS)
2011-12-01
Research on Global Carbon Emission and Sequestration NSFC Funded Project Made Significant Progress in Quantum Dynamics Functional Human Blood Protein Obtained from Rice How Giant Pandas Thrive on a Bamboo Diet New Evidence of Interpersonal Violence from 129,000 Years Ago Found in China Aptamer-Mediated Efficient Capture and Release of T Lymphocytes on Nanostructured Surfaces BGI Study Results on Resequencing 50 Accessions of Rice Cast New Light on Molecular Breeding BGI Reports Study Results on Frequent Mutation of Genes Encoding UMPP Components in Kidney Cancer Research on Habitat Shift Promoting Species Diversification
JVM: Java Visual Mapping tool for next generation sequencing read.
Yang, Ye; Liu, Juan
2015-01-01
We developed a program JVM (Java Visual Mapping) for mapping next generation sequencing read to reference sequence. The program is implemented in Java and is designed to deal with millions of short read generated by sequence alignment using the Illumina sequencing technology. It employs seed index strategy and octal encoding operations for sequence alignments. JVM is useful for DNA-Seq, RNA-Seq when dealing with single-end resequencing. JVM is a desktop application, which supports reads capacity from 1 MB to 10 GB.
A novel PTCH1 mutation in a patient with Gorlin syndrome
Okamoto, Nana; Naruto, Takuya; Kohmoto, Tomohiro; Komori, Takahide; Imoto, Issei
2014-01-01
Gorlin syndrome is an autosomal dominant disorder characterized by a wide range of developmental abnormalities and a predisposition to various tumors, and it is linked to the alteration of several causative genes, including PTCH1. We performed targeted resequencing using a next-generation sequencer to analyze genes associated with known clinical phenotypes in an 11-year-old male with sporadic jaw keratocysts. A novel duplication mutation (c.426dup) in PTCH1, resulting in a truncated protein, was identified. PMID:27081512
A novel PTCH1 mutation in a patient with Gorlin syndrome.
Okamoto, Nana; Naruto, Takuya; Kohmoto, Tomohiro; Komori, Takahide; Imoto, Issei
2014-01-01
Gorlin syndrome is an autosomal dominant disorder characterized by a wide range of developmental abnormalities and a predisposition to various tumors, and it is linked to the alteration of several causative genes, including PTCH1. We performed targeted resequencing using a next-generation sequencer to analyze genes associated with known clinical phenotypes in an 11-year-old male with sporadic jaw keratocysts. A novel duplication mutation (c.426dup) in PTCH1, resulting in a truncated protein, was identified.
2009-08-11
Competing Interests: One of the contributing authors : Clark Tibbetts, is the Executive Vice President and Chief Technology Officer of Tessarae, LLC...Detection 5a. CONTRACT NUMBER 5b. GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR (S) 5d. PROJECT NUMBER 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7...N/A 1021 ng No detection Sin nombre Bunyaviridae III 1021 ng Pulmonary syndrome hantavirus strain Convict Creek 107 1CCHFV = Crimean-Congo hemorrhagic
Modeling the evolution space of breakage fusion bridge cycles with a stochastic folding process.
Greenman, C D; Cooke, S L; Marshall, J; Stratton, M R; Campbell, P J
2016-01-01
Breakage-fusion-bridge cycles in cancer arise when a broken segment of DNA is duplicated and an end from each copy joined together. This structure then 'unfolds' into a new piece of palindromic DNA. This is one mechanism responsible for the localised amplicons observed in cancer genome data. Here we study the evolution space of breakage-fusion-bridge structures in detail. We firstly consider discrete representations of this space with 2-d trees to demonstrate that there are [Formula: see text] qualitatively distinct evolutions involving [Formula: see text] breakage-fusion-bridge cycles. Secondly we consider the stochastic nature of the process to show these evolutions are not equally likely, and also describe how amplicons become localized. Finally we highlight these methods by inferring the evolution of breakage-fusion-bridge cycles with data from primary tissue cancer samples.
McConnell, Kristopher H.; Dixon, Michael; Calvi, Brian R.
2012-01-01
DNA replication origin activity changes during development. Chromatin modifications are known to influence the genomic location of origins and the time during S phase that they initiate replication in different cells. However, how chromatin regulates origins in concert with cell differentiation remains poorly understood. Here, we use developmental gene amplification in Drosophila ovarian follicle cells as a model to investigate how chromatin modifiers regulate origins in a developmental context. We find that the histone acetyltransferase (HAT) Chameau (Chm) binds to amplicon origins and is partially required for their function. Depletion of Chm had relatively mild effects on origins during gene amplification and genomic replication compared with previous knockdown of its ortholog HBO1 in human cells, which has severe effects on origin function. We show that another HAT, CBP (Nejire), also binds amplicon origins and is partially required for amplification. Knockdown of Chm and CBP together had a more severe effect on nucleosome acetylation and amplicon origin activity than knockdown of either HAT alone, suggesting that these HATs collaborate in origin regulation. In addition to their local function at the origin, we show that Chm and CBP also globally regulate the developmental transition of follicle cells into the amplification stages of oogenesis. Our results reveal a complexity of origin epigenetic regulation by multiple HATs during development and suggest that chromatin modifiers are a nexus that integrates differentiation and DNA replication programs. PMID:22951641
Error minimization algorithm for comparative quantitative PCR analysis: Q-Anal.
OConnor, William; Runquist, Elizabeth A
2008-07-01
Current methods for comparative quantitative polymerase chain reaction (qPCR) analysis, the threshold and extrapolation methods, either make assumptions about PCR efficiency that require an arbitrary threshold selection process or extrapolate to estimate relative levels of messenger RNA (mRNA) transcripts. Here we describe an algorithm, Q-Anal, that blends elements from current methods to by-pass assumptions regarding PCR efficiency and improve the threshold selection process to minimize error in comparative qPCR analysis. This algorithm uses iterative linear regression to identify the exponential phase for both target and reference amplicons and then selects, by minimizing linear regression error, a fluorescence threshold where efficiencies for both amplicons have been defined. From this defined fluorescence threshold, cycle time (Ct) and the error for both amplicons are calculated and used to determine the expression ratio. Ratios in complementary DNA (cDNA) dilution assays from qPCR data were analyzed by the Q-Anal method and compared with the threshold method and an extrapolation method. Dilution ratios determined by the Q-Anal and threshold methods were 86 to 118% of the expected cDNA ratios, but relative errors for the Q-Anal method were 4 to 10% in comparison with 4 to 34% for the threshold method. In contrast, ratios determined by an extrapolation method were 32 to 242% of the expected cDNA ratios, with relative errors of 67 to 193%. Q-Anal will be a valuable and quick method for minimizing error in comparative qPCR analysis.
Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform.
Schirmer, Melanie; Ijaz, Umer Z; D'Amore, Rosalinda; Hall, Neil; Sloan, William T; Quince, Christopher
2015-03-31
With read lengths of currently up to 2 × 300 bp, high throughput and low sequencing costs Illumina's MiSeq is becoming one of the most utilized sequencing platforms worldwide. The platform is manageable and affordable even for smaller labs. This enables quick turnaround on a broad range of applications such as targeted gene sequencing, metagenomics, small genome sequencing and clinical molecular diagnostics. However, Illumina error profiles are still poorly understood and programs are therefore not designed for the idiosyncrasies of Illumina data. A better knowledge of the error patterns is essential for sequence analysis and vital if we are to draw valid conclusions. Studying true genetic variation in a population sample is fundamental for understanding diseases, evolution and origin. We conducted a large study on the error patterns for the MiSeq based on 16S rRNA amplicon sequencing data. We tested state-of-the-art library preparation methods for amplicon sequencing and showed that the library preparation method and the choice of primers are the most significant sources of bias and cause distinct error patterns. Furthermore we tested the efficiency of various error correction strategies and identified quality trimming (Sickle) combined with error correction (BayesHammer) followed by read overlapping (PANDAseq) as the most successful approach, reducing substitution error rates on average by 93%. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Guillaud-Bataille, M; Brison, O; Danglot, G; Lavialle, C; Raynal, B; Lazar, V; Dessen, P; Bernheim, A
2009-01-01
High-level amplifications observed in tumor cells are usually indicative of genes involved in oncogenesis. We report here a high resolution characterization of a new amplified region in the SW613-S carcinoma cell line. This cell line contains tumorigenic cells displaying high-level MYC amplification in the form of double minutes (DM(+) cells) and non tumorigenic cells exhibiting low-level MYC amplification in the form of homogeneously staining regions (DM(-) cells). Both cell types were studied at genomic and functional levels. The DM(+) cells display a second amplification, corresponding to the 14q24.1 region, in a distinct population of DMs. The 0.43-Mb amplified and overexpressed region contains the PLEK2, PIGH, ARG2, VTI1B, RDH11, and ZFYVE26 genes. Both amplicons were stably maintained upon in vitro and in vivo propagation. However, the 14q24.1 amplicon was not found in cells with high-level MYC amplification in the form of HSRs, either obtained after spontaneous integration of endogenous DM MYC copies or after transfection of DM(-) cells with a MYC gene expression vector. These HSR-bearing cells are highly tumorigenic. The 14q24.1 amplification may not play a role in malignancy per se but might contribute to maintaining the amplification in the form of DMs. Copyright 2009 S. Karger AG, Basel.
Diversity and Activity of Alternative Nitrogenases in Sequenced Genomes and Coastal Environments
McRose, Darcy L.; Zhang, Xinning; Kraepiel, Anne M. L.; Morel, François M. M.
2017-01-01
The nitrogenase enzyme, which catalyzes the reduction of N2 gas to NH4+, occurs as three separate isozyme that use Mo, Fe-only, or V. The majority of global nitrogen fixation is attributed to the more efficient ‘canonical’ Mo-nitrogenase, whereas Fe-only and V-(‘alternative’) nitrogenases are often considered ‘backup’ enzymes, used when Mo is limiting. Yet, the environmental distribution and diversity of alternative nitrogenases remains largely unknown. We searched for alternative nitrogenase genes in sequenced genomes and used PacBio sequencing to explore the diversity of canonical (nifD) and alternative (anfD and vnfD) nitrogenase amplicons in two coastal environments: the Florida Everglades and Sippewissett Marsh (MA). Genome-based searches identified an additional 25 species and 10 genera not previously known to encode alternative nitrogenases. Alternative nitrogenase amplicons were found in both Sippewissett Marsh and the Florida Everglades and their activity was further confirmed using newly developed isotopic techniques. Conserved amino acid sequences corresponding to cofactor ligands were also analyzed in anfD and vnfD amplicons, offering insight into environmental variants of these motifs. This study increases the number of available anfD and vnfD sequences ∼20-fold and allows for the first comparisons of environmental Mo-, Fe-only, and V-nitrogenase diversity. Our results suggest that alternative nitrogenases are maintained across a range of organisms and environments and that they can make important contributions to nitrogenase diversity and nitrogen fixation. PMID:28293220
Diversity and Activity of Alternative Nitrogenases in Sequenced Genomes and Coastal Environments.
McRose, Darcy L; Zhang, Xinning; Kraepiel, Anne M L; Morel, François M M
2017-01-01
The nitrogenase enzyme, which catalyzes the reduction of N 2 gas to NH 4 + , occurs as three separate isozyme that use Mo, Fe-only, or V. The majority of global nitrogen fixation is attributed to the more efficient 'canonical' Mo-nitrogenase, whereas Fe-only and V-('alternative') nitrogenases are often considered 'backup' enzymes, used when Mo is limiting. Yet, the environmental distribution and diversity of alternative nitrogenases remains largely unknown. We searched for alternative nitrogenase genes in sequenced genomes and used PacBio sequencing to explore the diversity of canonical ( nifD ) and alternative ( anfD and vnfD ) nitrogenase amplicons in two coastal environments: the Florida Everglades and Sippewissett Marsh (MA). Genome-based searches identified an additional 25 species and 10 genera not previously known to encode alternative nitrogenases. Alternative nitrogenase amplicons were found in both Sippewissett Marsh and the Florida Everglades and their activity was further confirmed using newly developed isotopic techniques. Conserved amino acid sequences corresponding to cofactor ligands were also analyzed in anfD and vnfD amplicons, offering insight into environmental variants of these motifs. This study increases the number of available anfD and vnfD sequences ∼20-fold and allows for the first comparisons of environmental Mo-, Fe-only, and V-nitrogenase diversity. Our results suggest that alternative nitrogenases are maintained across a range of organisms and environments and that they can make important contributions to nitrogenase diversity and nitrogen fixation.
A genetic investigation of Korean mummies from the Joseon Dynasty.
Kim, Na Young; Lee, Hwan Young; Park, Myung Jin; Yang, Woo Ick; Shin, Kyoung-Jin
2011-01-01
Two Korean mummies (Danwoong-mirra and Yoon-mirra) found in medieval tombs in the central region of the Korean peninsula were genetically investigated by analysis of mitochondrial DNA (mtDNA), Y-chromosomal short tandem repeat (Y-STR) and the ABO gene. Danwoong-mirra is a male child mummy and Yoon-mirra is a pregnant female mummy, dating back about 550 and 450 years, respectively. DNA was extracted from soft tissues or bones. mtDNA, Y-STR and the ABO gene were amplified using a small size amplicon strategy and were analyzed according to the criteria of ancient DNA analysis to ensure that authentic DNA typing results were obtained from these ancient samples. Analysis of mtDNA hypervariable region sequence and coding region single nucleotide polymorphism (SNP) information revealed that Danwoong-mirra and Yoon-mirra belong to the East Asian mtDNA haplogroups D4 and M7c, respectively. The Y-STRs were analyzed in the male child mummy (Danwoong-mirra) using the AmpFlSTR® Yfiler PCR Amplification Kit and an in-house Y-miniplex plus system, and could be characterized in 4 loci with small amplicon size. The analysis of ABO gene SNPs using multiplex single base extension methods revealed that the ABO blood types of Danwoong-mirra and Yoon-mirra are AO01 and AB, respectively. The small size amplicon strategy and the authentication process in the present study will be effectively applicable to future genetic analyses of various forensic and ancient samples.
Xu, Xiangming; Passey, Thomas; Wei, Feng; Saville, Robert; Harrison, Richard J.
2015-01-01
A phenomenon of yield decline due to weak plant growth in strawberry was recently observed in non-chemo-fumigated soils, which was not associated with the soil fungal pathogen Verticillium dahliae, the main target of fumigation. Amplicon-based metagenomics was used to profile soil microbiota in order to identify microbial organisms that may have caused the yield decline. A total of 36 soil samples were obtained in 2013 and 2014 from four sites for metagenomic studies; two of the four sites had a yield-decline problem, the other two did not. More than 2000 fungal or bacterial operational taxonomy units (OTUs) were found in these samples. Relative abundance of individual OTUs was statistically compared for differences between samples from sites with or without yield decline. A total of 721 individual comparisons were statistically significant – involving 366 unique bacterial and 44 unique fungal OTUs. Based on further selection criteria, we focused on 34 bacterial and 17 fungal OTUs and found that yield decline resulted probably from one or more of the following four factors: (1) low abundance of Bacillus and Pseudomonas populations, which are well known for their ability of supressing pathogen development and/or promoting plant growth; (2) lack of the nematophagous fungus (Paecilomyces species); (3) a high level of two non-specific fungal root rot pathogens; and (4) wet soil conditions. This study demonstrated the usefulness of an amplicon-based metagenomics approach to profile soil microbiota and to detect differential abundance in microbes. PMID:26504572
Banowary, Banya; Dang, Van Tuan; Sarker, Subir; Connolly, Joanne H.; Chenu, Jeremy; Groves, Peter; Ayton, Michelle; Raidal, Shane; Devi, Aruna; Vanniasinkam, Thiru; Ghorashi, Seyed A.
2015-01-01
Campylobacter spp. are important causes of bacterial gastroenteritis in humans in developed countries. Among Campylobacter spp. Campylobacter jejuni (C. jejuni) and C. coli are the most common causes of human infection. In this study, a multiplex PCR (mPCR) and high resolution melt (HRM) curve analysis were optimized for simultaneous detection and differentiation of C. jejuni and C. coli isolates. A segment of the hippuricase gene (hipO) of C. jejuni and putative aspartokinase (asp) gene of C. coli were amplified from 26 Campylobacter isolates and amplicons were subjected to HRM curve analysis. The mPCR-HRM was able to differentiate between C. jejuni and C. coli species. All DNA amplicons generated by mPCR were sequenced. Analysis of the nucleotide sequences from each isolate revealed that the HRM curves were correlated with the nucleotide sequences of the amplicons. Minor variation in melting point temperatures of C. coli or C. jejuni isolates was also observed and enabled some intraspecies differentiation between C. coli and/or C. jejuni isolates. The potential of PCR-HRM curve analysis for the detection and speciation of Campylobacter in additional human clinical specimens and chicken swab samples was also confirmed. The sensitivity and specificity of the test were found to be 100% and 92%, respectively. The results indicated that mPCR followed by HRM curve analysis provides a rapid (8 hours) technique for differentiation between C. jejuni and C. coli isolates. PMID:26394042
Dinesh, Krishanender; Verma, Archana; Das Gupta, Ishwar; Thakur, Yash Pal; Verma, Nishant; Arya, Ashwani
2015-04-01
Lactoferrin gene is one of the important candidate genes for mastitis resistance. The gene is located on chromosome BTA 22 and consists of 17 exons spanning over 34.5 kb of genomic DNA. The present study was undertaken with the objectives to identify allelic variants in exons 7 and 12 of lactoferrin gene and to analyze association between its genetic variants and incidence of clinical mastitis in Murrah buffalo. The amplification of exons 7 and 12 of lactoferrin gene yielded amplicons of 232- and 461-bp sizes. PCR-restriction fragment length polymorphism (RFLP) analysis of 232-bp amplicon using BccI restriction enzyme revealed three genotypes (AA, AB, and BB) with frequencies of 0.62, 0.22, and 0.16, respectively. The frequencies of two alleles, A and B, were estimated as 0.73 and 0.27. Hpy188I-RFLP for 461-bp amplicon revealed polymorphism with three genotypes, CC, CD, and DD, with respective frequencies of 0.06, 0.39, and 0.56, whereas frequencies for C and D alleles were 0.25 and 0.75. The chi-square (χ(2)) analysis revealed a significant association between incidence of clinical mastitis and genetic variants of exon 7, and animals of AA genotype of exon 7 were found to be least susceptible to mastitis. The findings indicate potential scope for incorporation of lactoferrin gene in selection and breeding of Murrah buffaloes for improved genetic resistance to mastitis.
Hernández, Marta; Rodríguez-Lázaro, David; Zhang, David; Esteve, Teresa; Pla, Maria; Prat, Salomé
2005-05-04
The number of cultured hectares and commercialized genetically modified organisms (GMOs) has increased exponentially in the past 9 years. Governments in many countries have established a policy of labeling all food and feed containing or produced by GMOs. Consequently, versatile, laboratory-transferable GMO detection methods are in increasing demand. Here, we describe a qualitative PCR-based multiplex method for simultaneous detection and identification of four genetically modified maize lines: Bt11, MON810, T25, and GA21. The described system is based on the use of five primers directed to specific sequences in these insertion events. Primers were used in a single optimized multiplex PCR reaction, and sequences of the amplified fragments are reported. The assay allows amplification of the MON810 event from the 35S promoter to the hsp intron yielding a 468 bp amplicon. Amplification of the Bt11 and T25 events from the 35S promoter to the PAT gene yielded two different amplicons of 280 and 177 bp, respectively, whereas amplification of the 5' flanking region of the GA21 gave rise to an amplicon of 72 bp. These fragments are clearly distinguishable in agarose gels and have been reproduced successfully in a different laboratory. Hence, the proposed method comprises a rapid, simple, reliable, and sensitive (down to 0.05%) PCR-based assay, suitable for detection of these four GM maize lines in a single reaction.
Hayashi, Masahiro; Natori, Tatsuya; Kubota-Hayashi, Sayoko; Miyata, Machiko; Ohkusu, Kiyofumi; Kawamoto, Keiko; Kurazono, Hisao; Makino, Souichi; Ezaki, Takayuki
2013-01-01
A quick foodborne pathogen screening method after six-hour enrichment culture with a broad-range food pathogen enrichment broth is described. Pathogenic factors of Salmonella enterica, Shigella spp., enteroinvasive Escherichia coli, and enterohemorrhagic E. coli are amplified with a cocktail primer and rapid polymerase chain reaction (PCR), which finishes amplification in 30 min. The PCR amplicon was differentiated with a dipstick DNA chromatography assay in 5-10 min. Starting from a four- to six-hour enrichment culture, this assay was finished within 45 min. Detection sensitivity of this protocol was less than 2.5 CFU/25 g for S. enterica and 3.3 CFU/25 g for enterohemorrhagic E. coli in spiked ground meat experiments.
Chen, Chao; Liu, Zhiguang; Pan, Qi; Chen, Xiao; Wang, Huihua; Guo, Haikun; Liu, Shidong; Lu, Hongfeng; Tian, Shilin; Li, Ruiqiang; Shi, Wei
2016-05-01
Studying the genetic signatures of climate-driven selection can produce insights into local adaptation and the potential impacts of climate change on populations. The honey bee (Apis mellifera) is an interesting species to study local adaptation because it originated in tropical/subtropical climatic regions and subsequently spread into temperate regions. However, little is known about the genetic basis of its adaptation to temperate climates. Here, we resequenced the whole genomes of ten individual bees from a newly discovered population in temperate China and downloaded resequenced data from 35 individuals from other populations. We found that the new population is an undescribed subspecies in the M-lineage of A. mellifera (Apis mellifera sinisxinyuan). Analyses of population history show that long-term global temperature has strongly influenced the demographic history of A. m. sinisxinyuan and its divergence from other subspecies. Further analyses comparing temperate and tropical populations identified several candidate genes related to fat body and the Hippo signaling pathway that are potentially involved in adaptation to temperate climates. Our results provide insights into the demographic history of the newly discovered A. m. sinisxinyuan, as well as the genetic basis of adaptation of A. mellifera to temperate climates at the genomic level. These findings will facilitate the selective breeding of A. mellifera to improve the survival of overwintering colonies. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Song, Yijun; Zhou, Xiya; Huang, Saiqiong; Li, Xiaohong; Qi, Qingwei; Jiang, Yulin; Liu, Yiqian; Ma, Chengcheng; Li, Zhifeng; Xu, Mengnan; Cram, David S; Liu, Juntao
2016-05-01
Calculation of the fetal DNA fraction (FF) is important for reliable and accurate noninvasive prenatal testing (NIPT) for fetal genetic abnormalities. The aim of the study was to develop and validate a novel method for FF determination. FF was calculated using the chromosome Y (ChrY) sequence read assay and by circulating single molecule amplification and re-sequencing technology of 76 autosomal SNPs. By Pearson correlation for FF (4.73-22.11%) in 33 male pregnancy samples, the R(2) co-efficient for the 76-SNP versus the ChrY assay was 0.9572 (p<0.001). In addition, the co-efficient of variation (CV) of FF measurement by the 76-SNP assay was low (0.15-0.35). As a control, the FF measurement for four non-pregnant plasma samples was virtually zero. In prospective longitudinal studies of 14 women with normal pregnancies, FF generally increased with gestational age. However, in eight women (71%) there was a significant decrease in FF between the first trimester (11-13 weeks) and the second trimester (15-19 weeks), and this was attributable to significant maternal weight gain. The novel 76-SNP cSMART assay has the precision to accurately measure FF in all pregnancies at a detection threshold of 5%. Based on FF trends in individual pregnancies, our results suggest that the end of the first trimester may be a more optimal window for performing NIPT. Copyright © 2016 Elsevier B.V. All rights reserved.
How important are rare variants in common disease?
Saint Pierre, Aude; Génin, Emmanuelle
2014-09-01
Genome-wide association studies have uncovered hundreds of common genetic variants involved in complex diseases. However, for most complex diseases, these common genetic variants only marginally contribute to disease susceptibility. It is now argued that rare variants located in different genes could in fact play a more important role in disease susceptibility than common variants. These rare genetic variants were not captured by genome-wide association studies using single nucleotide polymorphism-chips but with the advent of next-generation sequencing technologies, they have become detectable. It is now possible to study their contribution to common disease by resequencing samples of cases and controls or by using new genotyping exome arrays that cover rare alleles. In this review, we address the question of the contribution of rare variants in common disease by taking the examples of different diseases for which some resequencing studies have already been performed, and by summarizing the results of simulation studies conducted so far to investigate the genetic architecture of complex traits in human. So far, empirical data have not allowed the exclusion of many models except the most extreme ones involving only a small number of rare variants with large effects contributing to complex disease. To unravel the genetic architecture of complex disease, case-control data will not be sufficient, and alternative study designs need to be proposed together with methodological developments. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Jha, Aashish R; Miles, Cecelia M; Lippert, Nodia R; Brown, Christopher D; White, Kevin P; Kreitman, Martin
2015-10-01
Complete genome resequencing of populations holds great promise in deconstructing complex polygenic traits to elucidate molecular and developmental mechanisms of adaptation. Egg size is a classic adaptive trait in insects, birds, and other taxa, but its highly polygenic architecture has prevented high-resolution genetic analysis. We used replicated experimental evolution in Drosophila melanogaster and whole-genome sequencing to identify consistent signatures of polygenic egg-size adaptation. A generalized linear-mixed model revealed reproducible allele frequency differences between replicated experimental populations selected for large and small egg volumes at approximately 4,000 single nucleotide polymorphisms (SNPs). Several hundred distinct genomic regions contain clusters of these SNPs and have lower heterozygosity than the genomic background, consistent with selection acting on polymorphisms in these regions. These SNPs are also enriched among genes expressed in Drosophila ovaries and many of these genes have well-defined functions in Drosophila oogenesis. Additional genes regulating egg development, growth, and cell size show evidence of directional selection as genes regulating these biological processes are enriched for highly differentiated SNPs. Genetic crosses performed with a subset of candidate genes demonstrated that these genes influence egg size, at least in the large genetic background. These findings confirm the highly polygenic architecture of this adaptive trait, and suggest the involvement of many novel candidate genes in regulating egg size. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Dickinson, Peter; Xiong, Anqi; York, Daniel; Jayashankar, Kartika; Pielberg, Gerli; Koltookian, Michele; Murén, Eva; Fuxelius, Hans-Henrik; Weishaupt, Holger; Andersson, Göran; Hedhammar, Åke; Bongcam-Rudloff, Erik; Forsberg-Nilsson, Karin
2016-01-01
Gliomas are the most common form of malignant primary brain tumors in humans and second most common in dogs, occurring with similar frequencies in both species. Dogs are valuable spontaneous models of human complex diseases including cancers and may provide insight into disease susceptibility and oncogenesis. Several brachycephalic breeds such as Boxer, Bulldog and Boston Terrier have an elevated risk of developing glioma, but others, including Pug and Pekingese, are not at higher risk. To identify glioma-associated genetic susceptibility factors, an across-breed genome-wide association study (GWAS) was performed on 39 dog glioma cases and 141 controls from 25 dog breeds, identifying a genome-wide significant locus on canine chromosome (CFA) 26 (p = 2.8 x 10−8). Targeted re-sequencing of the 3.4 Mb candidate region was performed, followed by genotyping of the 56 SNVs that best fit the association pattern between the re-sequenced cases and controls. We identified three candidate genes that were highly associated with glioma susceptibility: CAMKK2, P2RX7 and DENR. CAMKK2 showed reduced expression in both canine and human brain tumors, and a non-synonymous variant in P2RX7, previously demonstrated to have a 50% decrease in receptor function, was also associated with disease. Thus, one or more of these genes appear to affect glioma susceptibility. PMID:27171399
Tranchida-Lombardo, Valentina; Aiese Cigliano, Riccardo; Anzar, Irantzu; Landi, Simone; Palombieri, Samuela; Colantuono, Chiara; Bostan, Hamed; Termolino, Pasquale; Aversano, Riccardo; Batelli, Giorgia; Cammareri, Maria; Carputo, Domenico; Chiusano, Maria Luisa; Conicella, Clara; Consiglio, Federica; D'Agostino, Nunzio; De Palma, Monica; Di Matteo, Antonio; Grandillo, Silvana; Sanseverino, Walter; Tucci, Marina; Grillo, Stefania
2017-11-14
Tomato is a high value crop and the primary model for fleshy fruit development and ripening. Breeding priorities include increased fruit quality, shelf life and tolerance to stresses. To contribute towards this goal, we re-sequenced the genomes of Corbarino (COR) and Lucariello (LUC) landraces, which both possess the traits of plant adaptation to water deficit, prolonged fruit shelf-life and good fruit quality. Through the newly developed pipeline Reconstructor, we generated the genome sequences of COR and LUC using datasets of 65.8 M and 56.4 M of 30-150 bp paired-end reads, respectively. New contigs including reads that could not be mapped to the tomato reference genome were assembled, and a total of 43, 054 and 44, 579 gene loci were annotated in COR and LUC. Both genomes showed novel regions with similarity to Solanum pimpinellifolium and Solanum pennellii. In addition to small deletions and insertions, 2, 000 and 1, 700 single nucleotide polymorphisms (SNPs) could exert potentially disruptive effects on 1, 371 and 1, 201 genes in COR and LUC, respectively. A detailed survey of the SNPs occurring in fruit quality, shelf life and stress tolerance related-genes identified several candidates of potential relevance. Variations in ethylene response components may concur in determining peculiar phenotypes of COR and LUC. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Manz, Judith; Rodríguez, Elke; ElSharawy, Abdou; Oesau, Eva-Maria; Petersen, Britt-Sabina; Baurecht, Hansjörg; Mayr, Gabriele; Weber, Susanne; Harder, Jürgen; Reischl, Eva; Schwarz, Agatha; Novak, Natalija; Franke, Andre; Weidinger, Stephan
2016-12-01
Gene-mapping studies have consistently identified a susceptibility locus for atopic dermatitis and other inflammatory diseases on chromosome band 11q13.5, with the strongest association observed for a common variant located in an intergenic region between the two annotated genes C11orf30 and LRRC32. Using a targeted resequencing approach we identified low-frequency and rare missense mutations within the LRRC32 gene encoding the protein GARP, a receptor on activated regulatory T cells that binds latent transforming growth factor-β. Subsequent association testing in more than 2,000 atopic dermatitis patients and 2,000 control subjects showed a significant excess of these LRRC32 variants in individuals with atopic dermatitis. Structural protein modeling and bioinformatic analysis predicted a disruption of protein transport upon these variants, and overexpression assays in CD4 + CD25 - T cells showed a significant reduction in surface expression of the mutated protein. Consistently, flow cytometric (FACS) analyses of different T-cell subtypes obtained from atopic dermatitis patients showed a significantly reduced surface expression of GARP and a reduced conversion of CD4 + CD25 - T cells into regulatory T cells, along with lower expression of latency-associated protein upon stimulation in carriers of the LRRC32 A407T variant. These results link inherited disturbances of transforming growth factor-β signaling with atopic dermatitis risk. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
A new DPYD genotyping assay for improving the safety of 5-fluorouracil therapy.
Sistonen, Johanna; Smith, Chingying; Fu, Yung-Kang; Largiadèr, Carlo R
2012-12-24
Chemotherapeutic use of 5-fluorouracil (5FU) is compromised by 10-20% of patients developing severe toxicity. Recently described genetic variation in dihydropyrimidine dehydrogenase (DPYD) has been shown to be a major predictor of 5FU toxicity. Here, we describe a new genotyping assay for routine clinical use that covers all the major DPYD risk variants. Genomic regions targeting DPYD risk variants (c.1129-5923C>G, c.1679T>G/A, c.1905+1G>A, c.2846A>T) and additional markers (c.234-123G>C, c.496A>G, c.775A>G) were amplified in a multiplex PCR reaction. The subsequent steps including allele-specific primer extension, hybridization of the primers to a microarray, scanning of the array, and data analysis were automated within the INFINITI® Analyzer (AutoGenomics). The assay was validated by analyzing 107 blood samples obtained from patients previously re-sequenced for the DPYD. The genotypes obtained with the developed assay were 100% concordant with the re-sequencing. The procedure is suitable for routine clinical use since the results are obtained within one day. For heterozygous risk variant carriers (~7% of Europeans), the treatment can be adjusted by 5FU dose reduction, whereas carriers of two risk alleles should be treated with an alternative therapy. The developed assay provides a novel tool to improve the safety of commonly used 5FU-based chemotherapies. Copyright © 2012 Elsevier B.V. All rights reserved.
Natural Selection and Genetic Diversity in the Butterfly Heliconius melpomene.
Martin, Simon H; Möst, Markus; Palmer, William J; Salazar, Camilo; McMillan, W Owen; Jiggins, Francis M; Jiggins, Chris D
2016-05-01
A combination of selective and neutral evolutionary forces shape patterns of genetic diversity in nature. Among the insects, most previous analyses of the roles of drift and selection in shaping variation across the genome have focused on the genus Drosophila A more complete understanding of these forces will come from analyzing other taxa that differ in population demography and other aspects of biology. We have analyzed diversity and signatures of selection in the neotropical Heliconius butterflies using resequenced genomes from 58 wild-caught individuals of Heliconius melpomene and another 21 resequenced genomes representing 11 related species. By comparing intraspecific diversity and interspecific divergence, we estimate that 31% of amino acid substitutions between Heliconius species are adaptive. Diversity at putatively neutral sites is negatively correlated with the local density of coding sites as well as nonsynonymous substitutions and positively correlated with recombination rate, indicating widespread linked selection. This process also manifests in significantly reduced diversity on longer chromosomes, consistent with lower recombination rates. Although hitchhiking around beneficial nonsynonymous mutations has significantly shaped genetic variation in H. melpomene, evidence for strong selective sweeps is limited overall. We did however identify two regions where distinct haplotypes have swept in different populations, leading to increased population differentiation. On the whole, our study suggests that positive selection is less pervasive in these butterflies as compared to fruit flies, a fact that curiously results in very similar levels of neutral diversity in these very different insects. Copyright © 2016 by the Genetics Society of America.
Validation of Pooled Whole-Genome Re-Sequencing in Arabidopsis lyrata.
Fracassetti, Marco; Griffin, Philippa C; Willi, Yvonne
2015-01-01
Sequencing pooled DNA of multiple individuals from a population instead of sequencing individuals separately has become popular due to its cost-effectiveness and simple wet-lab protocol, although some criticism of this approach remains. Here we validated a protocol for pooled whole-genome re-sequencing (Pool-seq) of Arabidopsis lyrata libraries prepared with low amounts of DNA (1.6 ng per individual). The validation was based on comparing single nucleotide polymorphism (SNP) frequencies obtained by pooling with those obtained by individual-based Genotyping By Sequencing (GBS). Furthermore, we investigated the effect of sample number, sequencing depth per individual and variant caller on population SNP frequency estimates. For Pool-seq data, we compared frequency estimates from two SNP callers, VarScan and Snape; the former employs a frequentist SNP calling approach while the latter uses a Bayesian approach. Results revealed concordance correlation coefficients well above 0.8, confirming that Pool-seq is a valid method for acquiring population-level SNP frequency data. Higher accuracy was achieved by pooling more samples (25 compared to 14) and working with higher sequencing depth (4.1× per individual compared to 1.4× per individual), which increased the concordance correlation coefficient to 0.955. The Bayesian-based SNP caller produced somewhat higher concordance correlation coefficients, particularly at low sequencing depth. We recommend pooling at least 25 individuals combined with sequencing at a depth of 100× to produce satisfactory frequency estimates for common SNPs (minor allele frequency above 0.05).
Genome-wide SNP identification and QTL mapping for black rot resistance in cabbage.
Lee, Jonghoon; Izzah, Nur Kholilatul; Jayakodi, Murukarthick; Perumal, Sampath; Joh, Ho Jun; Lee, Hyeon Ju; Lee, Sang-Choon; Park, Jee Young; Yang, Ki-Woung; Nou, Il-Sup; Seo, Joodeok; Yoo, Jaeheung; Suh, Youngdeok; Ahn, Kyounggu; Lee, Ji Hyun; Choi, Gyung Ja; Yu, Yeisoo; Kim, Heebal; Yang, Tae-Jin
2015-02-03
Black rot is a destructive bacterial disease causing large yield and quality losses in Brassica oleracea. To detect quantitative trait loci (QTL) for black rot resistance, we performed whole-genome resequencing of two cabbage parental lines and genome-wide SNP identification using the recently published B. oleracea genome sequences as reference. Approximately 11.5 Gb of sequencing data was produced from each parental line. Reference genome-guided mapping and SNP calling revealed 674,521 SNPs between the two cabbage lines, with an average of one SNP per 662.5 bp. Among 167 dCAPS markers derived from candidate SNPs, 117 (70.1%) were validated as bona fide SNPs showing polymorphism between the parental lines. We then improved the resolution of a previous genetic map by adding 103 markers including 87 SNP-based dCAPS markers. The new map composed of 368 markers and covers 1467.3 cM with an average interval of 3.88 cM between adjacent markers. We evaluated black rot resistance in the mapping population in three independent inoculation tests using F2:3 progenies and identified one major QTL and three minor QTLs. We report successful utilization of whole-genome resequencing for large-scale SNP identification and development of molecular markers for genetic map construction. In addition, we identified novel QTLs for black rot resistance. The high-density genetic map will promote QTL analysis for other important agricultural traits and marker-assisted breeding of B. oleracea.
Castañeda, María; Odriozola, Adrián; Gómez, Javier; Zarrabeitia, María T
2013-07-01
We report the development of an effective system for analyzing X chromosome-linked mini short tandem repeat loci with reduced-size amplicons (less than 220 bp), useful for analyzing highly degraded DNA samples. To generate smaller amplicons, we redesigned primers for eight X-linked microsatellites (DXS7132, DXS10079, DXS10074, DXS10075, DXS6801, DXS6809, DXS6789, and DXS6799) and established efficient conditions for a multiplex PCR system (miniX). The validation tests confirmed that it has good sensitivity, requiring as little as 20 pg of DNA, and performs well with DNA from paraffin-embedded tissues, thus showing potential for improved analysis and identification of highly degraded and/or very limited DNA samples. Consequently, this system may help to solve complex forensic cases, particularly when autosomal markers convey insufficient information.
Matsuda, M; Tai, K; Moore, J E; Millar, B C; Murayama, O
2004-01-01
Nucleotide sequencing after TA cloning of the amplicon of the almost-full length recA gene from three strains of UPTC (A1, A2, and A3) isolated from seagulls in Northern Ireland, the phenotypical and genotypical characteristics of which have been demonstrated to be indistinguishable, clarified nucleotide differences at three nucleotide positions among the three strains. In conclusion, the nucleotide sequences of the recA gene were found to discriminate among the three strains of UPTC, A1, A2, and A3, which are indistinguishable phenotypically and genotypically. Thus, the present study strongly suggests that nucleotide sequence data of the amplicon of a suitable gene or region could aid in discriminating among isolates of the UPTC group, which are indistinguishable phenotypically and genotypically. Copyright 2004 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Swarm v2: highly-scalable and high-resolution amplicon clustering.
Mahé, Frédéric; Rognes, Torbjørn; Quince, Christopher; de Vargas, Colomban; Dunthorn, Micah
2015-01-01
Previously we presented Swarm v1, a novel and open source amplicon clustering program that produced fine-scale molecular operational taxonomic units (OTUs), free of arbitrary global clustering thresholds and input-order dependency. Swarm v1 worked with an initial phase that used iterative single-linkage with a local clustering threshold (d), followed by a phase that used the internal abundance structures of clusters to break chained OTUs. Here we present Swarm v2, which has two important novel features: (1) a new algorithm for d = 1 that allows the computation time of the program to scale linearly with increasing amounts of data; and (2) the new fastidious option that reduces under-grouping by grafting low abundant OTUs (e.g., singletons and doubletons) onto larger ones. Swarm v2 also directly integrates the clustering and breaking phases, dereplicates sequencing reads with d = 0, outputs OTU representatives in fasta format, and plots individual OTUs as two-dimensional networks.
A seminested PCR assay for detection and typing of human papillomavirus based on E1 gene sequences.
Cavalcante, Gustavo Henrique O; de Araújo, Josélio M G; Fernandes, José Veríssimo; Lanza, Daniel C F
2018-05-01
HPV infection is considered one of the leading causes of cervical cancer in the world. To date, more than 180 types of HPV have been described and viral typing is critical for defining the prognosis of cancer. In this work, a seminested PCR which allow fast and inexpensively detection and typing of HPV is presented. The system is based on the amplification of a variable length region within the viral gene E1, using three primers that potentially anneal in all HPV genomes. The amplicons produced in the first step can be identified by high resolution electrophoresis or direct sequencing. The seminested step includes nine specific primers which can be used in multiplex or individual reactions to discriminate the main types of HPV by amplicon size differentiation using agarose electrophoresis, reducing the time spent and cost per analysis. Copyright © 2017 Elsevier Inc. All rights reserved.
Meisal, Roger; Rounge, Trine Ballestad; Christiansen, Irene Kraus; Eieland, Alexander Kirkeby; Worren, Merete Molton; Molden, Tor Faksvaag; Kommedal, Øyvind; Hovig, Eivind; Leegaard, Truls Michael
2017-01-01
Sensitive and specific genotyping of human papillomaviruses (HPVs) is important for population-based surveillance of carcinogenic HPV types and for monitoring vaccine effectiveness. Here we compare HPV genotyping by Next Generation Sequencing (NGS) to an established DNA hybridization method. In DNA isolated from urine, the overall analytical sensitivity of NGS was found to be 22% higher than that of hybridization. NGS was also found to be the most specific method and expanded the detection repertoire beyond the 37 types of the DNA hybridization assay. Furthermore, NGS provided an increased resolution by identifying genetic variants of individual HPV types. The same Modified General Primers (MGP)-amplicon was used in both methods. The NGS method is described in detail to facilitate implementation in the clinical microbiology laboratory and includes suggestions for new standards for detection and calling of types and variants with improved resolution. PMID:28045981
Meisal, Roger; Rounge, Trine Ballestad; Christiansen, Irene Kraus; Eieland, Alexander Kirkeby; Worren, Merete Molton; Molden, Tor Faksvaag; Kommedal, Øyvind; Hovig, Eivind; Leegaard, Truls Michael; Ambur, Ole Herman
2017-01-01
Sensitive and specific genotyping of human papillomaviruses (HPVs) is important for population-based surveillance of carcinogenic HPV types and for monitoring vaccine effectiveness. Here we compare HPV genotyping by Next Generation Sequencing (NGS) to an established DNA hybridization method. In DNA isolated from urine, the overall analytical sensitivity of NGS was found to be 22% higher than that of hybridization. NGS was also found to be the most specific method and expanded the detection repertoire beyond the 37 types of the DNA hybridization assay. Furthermore, NGS provided an increased resolution by identifying genetic variants of individual HPV types. The same Modified General Primers (MGP)-amplicon was used in both methods. The NGS method is described in detail to facilitate implementation in the clinical microbiology laboratory and includes suggestions for new standards for detection and calling of types and variants with improved resolution.
Detection of Theileria luwenshuni in sheep from Great Britain.
Phipps, L Paul; Hernández-Triana, Luis M; Goharriz, Hooman; Welchman, David; Johnson, Nicholas
2016-04-13
Theileria spp. are tick-borne protozoan parasites of the Phylum Apicomplexa, Order Piroplasmida that infect a wide range of wild and domestic animals. In Great Britain, Theileria spp. have been reported from livestock associated with transmission by the tick Haemaphysalis punctata. However, these reports have not been associated with disease. This study has investigated the cause of a disease outbreak accompanied by mortality in a flock of sheep grazing reclaimed marshland in north Kent. A polymerase chain reaction-reverse line blot assay indicated the presence of Theileria spp. in blood samples from five animals. Subsequent testing with a pan-piroplasm PCR of a larger panel of blood samples detected a piroplasm amplicon in 19 of 21 sheep submitted from the affected flock. Automated sequencing confirmed that these amplicons shared 99-100% identity with T. luwenshuni. The clinical and PCR data suggest infection with T. luwenshuni was associated with disease and mortality in this flock.
Isolation and characterization of a herpesvirus from feral pigeons in China.
Zhao, Panpan; Ma, Jian; Guo, Ying; Tian, Li; Guo, Guangyang; Zhang, Kexin; Xing, Mingwei
2015-12-01
A herpesvirus was isolated during a diagnostic investigation of severe cases of conjunctivitis in feral pigeons (Columba livia f. domestica). Isolates of the virus were recovered from throat swabs of the pigeons followed by inoculation of the swab samples in chicken embryo fibroblasts. Pigeons inoculated with the isolated virus had similar clinical signs to those observed in naturally infected birds. Transmission electron microscopy revealed viral structures with typical herpesvirus morphology. Polymerase chain reaction amplification, using herpesvirus-identifying primers resulted in an amplicon of the expected size for herpesvirus. Sequencing of these amplicons and database comparisons identified the herpesvirus UL30 homologue. Phylogenetic reconstructions suggested that the isolated herpesvirus belongs to the Mardivirus genus of Alphaherpesvirinae. Using the current herpesvirus nomenclature conventions, the authors propose that the herpesvirus be named Columbid herpesvirus-1 Heilongjiang. Copyright © 2015 Elsevier Ltd. All rights reserved.
Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M
2007-01-01
Background Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, what represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provide a powerful tool for grapevine genetic analysis. PMID:18021442
Label-free electrical quantification of amplified nucleic acids through nanofluidic diodes.
Liu, Yifan; Yobas, Levent
2013-12-15
A label-free method of quantifying nucleic acids in polymerase chain reaction (PCR) is described and could be the basis for miniaturized devices that can amplify and detect target nucleic acids in real time. The method takes advantage of ionic current rectification effect discovered in nanofluidic channels exhibiting a broken symmetry in electrochemical potential - nanofluidic diodes. Nanofluidic diodes are prototyped here on nanopipettes readily pulled from individual thin-walled glass capillaries for a proof of concept demonstration yet the basic concept would be applicable to ionic rectifiers constructed through other means. When a nanopipette modified in the tip region with cationic polyelectrolytes is presented with an unpurified PCR product, the tip surface electrostatically interacts with the amplicons and modulates its ionic rectification direction in response to the intrinsic charge of those adsorbed. Modulations are gradual and correlate well with the mass concentration of the amplicons above 2.5 ng/μL, rather than their sizes, with adequate discrimination against the background. Moreover, the tip surface, following a measurement, is regenerated through a layer-by-layer assembly of cationic polyelectrolytes and amplicons. The regenerated tips are capable of measuring distinct mass concentrations without signs of noticeable degradation in sensitivity. Further, the tips are shown capable of reproducing the amplification curve of real-time PCR through sequential steps of surface regeneration and simple electrical readout during the intermediate reaction stages. This suggests that nanopipettes as nanofluidic diodes are at a capacity to be employed for monitoring the PCR progress. Copyright © 2013 Elsevier B.V. All rights reserved.
Barret, Maialen; Gagnon, Nathalie; Kalmokoff, Martin L; Topp, Edward; Verastegui, Yris; Brooks, Stephen P J; Matias, Fernando; Neufeld, Josh D; Talbot, Guylaine
2013-01-01
Methane emissions represent a major environmental concern associated with manure management in the livestock industry. A more thorough understanding of how microbial communities function in manure storage tanks is a prerequisite for mitigating methane emissions. Identifying the microorganisms that are metabolically active is an important first step. Methanogenic archaea are major contributors to methanogenesis in stored swine manure, and we investigated active methanogenic populations by DNA stable isotope probing (DNA-SIP). Following a preincubation of manure samples under anoxic conditions to induce substrate starvation, [U-(13)C]acetate was added as a labeled substrate. Fingerprint analysis of density-fractionated DNA, using length-heterogeneity analysis of PCR-amplified mcrA genes (encoding the alpha subunit of methyl coenzyme M reductase), showed that the incorporation of (13)C into DNA was detectable at in situ acetate concentrations (~7 g/liter). Fingerprints of DNA retrieved from heavy fractions of the (13)C treatment were primarily enriched in a 483-bp amplicon and, to a lesser extent, in a 481-bp amplicon. Analyses based on clone libraries of the mcrA and 16S rRNA genes revealed that both of these heavy DNA amplicons corresponded to Methanoculleus spp. Our results demonstrate that uncultivated methanogenic archaea related to Methanoculleus spp. were major contributors to acetate-C assimilation during the anoxic incubation of swine manure storage tank samples. Carbon assimilation and dissimilation rate estimations suggested that Methanoculleus spp. were also major contributors to methane emissions and that the hydrogenotrophic pathway predominated during methanogenesis.
König, Katharina; Peifer, Martin; Fassunke, Jana; Ihle, Michaela A; Künstlinger, Helen; Heydt, Carina; Stamm, Katrin; Ueckeroth, Frank; Vollbrecht, Claudia; Bos, Marc; Gardizi, Masyar; Scheffler, Matthias; Nogova, Lucia; Leenders, Frauke; Albus, Kerstin; Meder, Lydia; Becker, Kerstin; Florin, Alexandra; Rommerscheidt-Fuss, Ursula; Altmüller, Janine; Kloth, Michael; Nürnberg, Peter; Henkel, Thomas; Bikár, Sven-Ernö; Sos, Martin L; Geese, William J; Strauss, Lewis; Ko, Yon-Dschun; Gerigk, Ulrich; Odenthal, Margarete; Zander, Thomas; Wolf, Jürgen; Merkelbach-Bruse, Sabine; Buettner, Reinhard; Heukamp, Lukas C
2015-07-01
The Network Genomic Medicine Lung Cancer was set up to rapidly translate scientific advances into early clinical trials of targeted therapies in lung cancer performing molecular analyses of more than 3500 patients annually. Because sequential analysis of the relevant driver mutations on fixated samples is challenging in terms of workload, tissue availability, and cost, we established multiplex parallel sequencing in routine diagnostics. The aim was to analyze all therapeutically relevant mutations in lung cancer samples in a high-throughput fashion while significantly reducing turnaround time and amount of input DNA compared with conventional dideoxy sequencing of single polymerase chain reaction amplicons. In this study, we demonstrate the feasibility of a 102 amplicon multiplex polymerase chain reaction followed by sequencing on an Illumina sequencer on formalin-fixed paraffin-embedded tissue in routine diagnostics. Analysis of a validation cohort of 180 samples showed this approach to require significantly less input material and to be more reliable, robust, and cost-effective than conventional dideoxy sequencing. Subsequently, 2657 lung cancer patients were analyzed. We observed that comprehensive biomarker testing provided novel information in addition to histological diagnosis and clinical staging. In 2657 consecutively analyzed lung cancer samples, we identified driver mutations at the expected prevalence. Furthermore we found potentially targetable DDR2 mutations at a frequency of 3% in both adenocarcinomas and squamous cell carcinomas. Overall, our data demonstrate the utility of systematic sequencing analysis in a clinical routine setting and highlight the dramatic impact of such an approach on the availability of therapeutic strategies for the targeted treatment of individual cancer patients.
Lagkouvardos, Ilias; Weinmaier, Thomas; Lauro, Federico M; Cavicchioli, Ricardo; Rattei, Thomas; Horn, Matthias
2014-01-01
In the era of metagenomics and amplicon sequencing, comprehensive analyses of available sequence data remain a challenge. Here we describe an approach exploiting metagenomic and amplicon data sets from public databases to elucidate phylogenetic diversity of defined microbial taxa. We investigated the phylum Chlamydiae whose known members are obligate intracellular bacteria that represent important pathogens of humans and animals, as well as symbionts of protists. Despite their medical relevance, our knowledge about chlamydial diversity is still scarce. Most of the nine known families are represented by only a few isolates, while previous clone library-based surveys suggested the existence of yet uncharacterized members of this phylum. Here we identified more than 22 000 high quality, non-redundant chlamydial 16S rRNA gene sequences in diverse databases, as well as 1900 putative chlamydial protein-encoding genes. Even when applying the most conservative approach, clustering of chlamydial 16S rRNA gene sequences into operational taxonomic units revealed an unexpectedly high species, genus and family-level diversity within the Chlamydiae, including 181 putative families. These in silico findings were verified experimentally in one Antarctic sample, which contained a high diversity of novel Chlamydiae. In our analysis, the Rhabdochlamydiaceae, whose known members infect arthropods, represents the most diverse and species-rich chlamydial family, followed by the protist-associated Parachlamydiaceae, and a putative new family (PCF8) with unknown host specificity. Available information on the origin of metagenomic samples indicated that marine environments contain the majority of the newly discovered chlamydial lineages, highlighting this environment as an important chlamydial reservoir. PMID:23949660
Analysis of bacterial xylose isomerase gene diversity using gene-targeted metagenomics.
Nurdiani, Dini; Ito, Michihiro; Maruyama, Toru; Terahara, Takeshi; Mori, Tetsushi; Ugawa, Shin; Takeyama, Haruko
2015-08-01
Bacterial xylose isomerases (XI) are promising resources for efficient biofuel production from xylose in lignocellulosic biomass. Here, we investigated xylose isomerase gene (xylA) diversity in three soil metagenomes differing in plant vegetation and geographical location, using an amplicon pyrosequencing approach and two newly-designed primer sets. A total of 158,555 reads from three metagenomic DNA replicates for each soil sample were classified into 1127 phylotypes, detected in triplicate and defined by 90% amino acid identity. The phylotype coverage was estimated to be within the range of 84.0-92.7%. The xylA gene phylotypes obtained were phylogenetically distributed across the two known xylA groups. They shared 49-100% identities with their closest-related XI sequences in GenBank. Phylotypes demonstrating <90% identity with known XIs in the database accounted for 89% of the total xylA phylotypes. The differences among xylA members and compositions within each soil sample were significantly smaller than they were between different soils based on a UniFrac distance analysis, suggesting soil-specific xylA genotypes and taxonomic compositions. The differences among xylA members and their compositions in the soil were strongly correlated with 16S rRNA variation between soil samples, also assessed by amplicon pyrosequencing. This is the first report of xylA diversity in environmental samples assessed by amplicon pyrosequencing. Our data provide information regarding xylA diversity in nature, and can be a basis for the screening of novel xylA genotypes for practical applications. Copyright © 2015. Published by Elsevier B.V.
Barret, Maialen; Gagnon, Nathalie; Kalmokoff, Martin L.; Topp, Edward; Verastegui, Yris; Brooks, Stephen P. J.; Matias, Fernando; Neufeld, Josh D.
2013-01-01
Methane emissions represent a major environmental concern associated with manure management in the livestock industry. A more thorough understanding of how microbial communities function in manure storage tanks is a prerequisite for mitigating methane emissions. Identifying the microorganisms that are metabolically active is an important first step. Methanogenic archaea are major contributors to methanogenesis in stored swine manure, and we investigated active methanogenic populations by DNA stable isotope probing (DNA-SIP). Following a preincubation of manure samples under anoxic conditions to induce substrate starvation, [U-13C]acetate was added as a labeled substrate. Fingerprint analysis of density-fractionated DNA, using length-heterogeneity analysis of PCR-amplified mcrA genes (encoding the alpha subunit of methyl coenzyme M reductase), showed that the incorporation of 13C into DNA was detectable at in situ acetate concentrations (∼7 g/liter). Fingerprints of DNA retrieved from heavy fractions of the 13C treatment were primarily enriched in a 483-bp amplicon and, to a lesser extent, in a 481-bp amplicon. Analyses based on clone libraries of the mcrA and 16S rRNA genes revealed that both of these heavy DNA amplicons corresponded to Methanoculleus spp. Our results demonstrate that uncultivated methanogenic archaea related to Methanoculleus spp. were major contributors to acetate-C assimilation during the anoxic incubation of swine manure storage tank samples. Carbon assimilation and dissimilation rate estimations suggested that Methanoculleus spp. were also major contributors to methane emissions and that the hydrogenotrophic pathway predominated during methanogenesis. PMID:23104405
Molecular Characterization of Brevibacillus laterosporus and Its Potential Use in Biological Control
de Oliveira, Edmar Justo; Rabinovitch, Leon; Monnerat, Rose Gomes; Passos, Liana Konovaloff Jannotti; Zahner, Viviane
2004-01-01
Thirty-three strains of Brevibacillus laterosporus, including three novel strains isolated from Brazilian soil samples, were examined for genetic variability by the use of different PCR-based methods. Molecular markers that could characterize bacterial strains with regards to their pathogenic potential were investigated. In addition, toxicity was assessed by the use of insects belonging to the orders Lepidoptera and Coleoptera and the mollusk Biomphalaria glabrata. Among the targets tested, Biomphalaria glabrata demonstrated the highest degree of sensitivity to B. laterosporus, with some strains inducing 90 to 100% mortality in snails aged 3 and 12 days posteclosion. Larvae of the coleopteron Anthonomus grandis were also susceptible, presenting mortality levels of between 33 and 63%. Toxicity was also noted towards the lepidopteron Anticarsia gemmatalis. In contrast, no mortality was recorded among test populations of Tenebrio molitor or Spodoptera frugiperda. The application of intergenic transcribed spacer PCR and BOX-PCR generated 15 and 17 different genotypes, respectively. None of the molecular techniques allowed the identification of a convenient marker that was associated with any entomopathogenic phenotype. However, a 1,078-bp amplicon was detected for all strains of B. laterosporus when a primer for amplification of the BOXA1R region was used. Similarly, a 900-bp amplicon was generated from all isolates by use of the primer OPA-11 for randomly amplified polymorphic DNA analysis. These amplicons were not detected for other phenotypically related Brevibacillus species, indicating that they represent markers that are specific for B. laterosporus, which may prove useful for the isolation and identification of new strains of this species. PMID:15528531
Mundo, Silvia Leonor; Gilardoni, Liliana Rosa; Hoffman, Federico José; Lopez, Osvaldo Jorge
2013-03-01
Paratuberculosis is an infectious, chronic, and incurable disease that affects ruminants, caused by Mycobacterium avium subsp. paratuberculosis. This bacterium is shed primarily through feces of infected cows but can be also excreted in colostrum and milk and might survive pasteurization. Since an association of genomic sequences of M. avium subsp. paratuberculosis in patients with Crohn's disease has been described; it is of interest to rapidly detect M. avium subsp. paratuberculosis in milk for human consumption. IS900 insertion is used as a target for PCR amplification to identify the presence of M. avium subsp. paratuberculosis in biological samples. Two target sequences were selected: IS1 (155 bp) and IS2 (94 bp). These fragments have a 100% identity among all M. avium subsp. paratuberculosis strains sequenced. M. avium subsp. paratuberculosis was specifically concentrated from milk samples by immunomagnetic separation prior to performing PCR. The amplicons were characterized using DNA methylase Genotyping, i.e., the amplicons were methylated with 6-methyl-adenine and digested with restriction enzymes to confirm their identity. The methylated amplicons from 100 CFU of M. avium subsp. paratuberculosis can be visualized in a Western blot format using an anti-6-methyl-adenine monoclonal antibody. The use of DNA methyltransferase genotyping coupled to a scintillation proximity assay allows for the detection of up to 10 CFU of M. avium subsp. paratuberculosis per ml of milk. This test is rapid and sensitive and allows for automation and thus multiple samples can be tested at the same time.
Mundo, Silvia Leonor; Gilardoni, Liliana Rosa; Hoffman, Federico José
2013-01-01
Paratuberculosis is an infectious, chronic, and incurable disease that affects ruminants, caused by Mycobacterium avium subsp. paratuberculosis. This bacterium is shed primarily through feces of infected cows but can be also excreted in colostrum and milk and might survive pasteurization. Since an association of genomic sequences of M. avium subsp. paratuberculosis in patients with Crohn's disease has been described; it is of interest to rapidly detect M. avium subsp. paratuberculosis in milk for human consumption. IS900 insertion is used as a target for PCR amplification to identify the presence of M. avium subsp. paratuberculosis in biological samples. Two target sequences were selected: IS1 (155 bp) and IS2 (94 bp). These fragments have a 100% identity among all M. avium subsp. paratuberculosis strains sequenced. M. avium subsp. paratuberculosis was specifically concentrated from milk samples by immunomagnetic separation prior to performing PCR. The amplicons were characterized using DNA methylase Genotyping, i.e., the amplicons were methylated with 6-methyl-adenine and digested with restriction enzymes to confirm their identity. The methylated amplicons from 100 CFU of M. avium subsp. paratuberculosis can be visualized in a Western blot format using an anti-6-methyl-adenine monoclonal antibody. The use of DNA methyltransferase genotyping coupled to a scintillation proximity assay allows for the detection of up to 10 CFU of M. avium subsp. paratuberculosis per ml of milk. This test is rapid and sensitive and allows for automation and thus multiple samples can be tested at the same time. PMID:23275511
Holmes, Scott; Pena Diaz, Ana M; Athwal, George S; Faber, Kenneth J; O'Gorman, David B
2017-02-01
Propionibacterium (P) acnes infection of the shoulder after arthroplasty is a common and serious complication. Current detection methods for P acnes involve anaerobic cultures that require prolonged incubation periods (typically 7-14 days). We have developed a polymerase chain reaction (PCR)-restriction fragment length polymorphism (RFLP) approach that sensitively and specifically identifies P acnes in tissue specimens within a 24-hour period. Primers were designed to amplify a unique region of the 16S rRNA gene in P acnes that contained a unique HaeIII restriction enzyme site. PCR and RFLP analyses were optimized to detect P acnes DNA in in vitro cultures and in arthroscopic surgical biopsy specimens from patients with P acnes infections. A 564 base-pair PCR amplicon was derived from all of the known P acnes strains. HaeIII digests of the amplicon yielded a restriction fragment pattern that was unique to P acnes. P acnes-specific amplicons were detected in as few as 10 bacterial cells and in clinical biopsy specimens of infected shoulder tissues. This PCR-RFLP assay combines the sensitivity of PCR with the specificity of RFLP mapping to identify P acnes in surgical isolates. The assay is robust and rapid, and a P acnes-positive tissue specimen can be confirmed within 24 hours of sampling, facilitating treatment decision making, targeted antibiotic therapy, and monitoring to minimize implant failure and revision surgery. Copyright © 2017 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
Pester, Michael; Rattei, Thomas; Flechl, Stefan; Gröngröft, Alexander; Richter, Andreas; Overmann, Jörg; Reinhold-Hurek, Barbara; Loy, Alexander; Wagner, Michael
2012-01-01
Ammonia-oxidizing archaea (AOA) play an important role in nitrification and many studies exploit their amoA genes as marker for their diversity and abundance. We present an archaeal amoA consensus phylogeny based on all publicly available sequences (status June 2010) and provide evidence for the diversification of AOA into four previously recognized clusters and one newly identified major cluster. These clusters, for which we suggest a new nomenclature, harboured 83 AOA species-level OTU (using an inferred species threshold of 85% amoA identity). 454 pyrosequencing of amoA amplicons from 16 soils sampled in Austria, Costa Rica, Greenland and Namibia revealed that only 2% of retrieved sequences had no database representative on the species-level and represented 30–37 additional species-level OTUs. With the exception of an acidic soil from which mostly amoA amplicons of the Nitrosotalea cluster were retrieved, all soils were dominated by amoA amplicons from the Nitrososphaera cluster (also called group I.1b), indicating that the previously reported AOA from the Nitrosopumilus cluster (also called group I.1a) are absent or represent minor populations in soils. AOA richness estimates on the species level ranged from 8–83 co-existing AOAs per soil. Presence/absence of amoA OTUs (97% identity level) correlated with geographic location, indicating that besides contemporary environmental conditions also dispersal limitation across different continents and/or historical environmental conditions might influence AOA biogeography in soils. PMID:22141924
Constable, Fiona E.; Nancarrow, Narelle; Rodoni, Brendan
2018-01-01
Apple mosaic virus (ApMV) and prune dwarf virus (PDV) are amongst the most common viruses infecting Prunus species worldwide but their incidence and genetic diversity in Australia is not known. In a survey of 127 Prunus tree samples collected from five states in Australia, ApMV and PDV occurred in 4 (3%) and 13 (10%) of the trees respectively. High-throughput sequencing (HTS) of amplicons from partial conserved regions of RNA1, RNA2, and RNA3, encoding the methyltransferase (MT), RNA-dependent RNA polymerase (RdRp), and the coat protein (CP) genes respectively, of ApMV and PDV was used to determine the genetic diversity of the Australian isolates of each virus. Phylogenetic comparison of Australian ApMV and PDV amplicon HTS variants and full length genomes of both viruses with isolates occurring in other countries identified genetic strains of each virus occurring in Australia. A single Australian Prunus infecting ApMV genetic strain was identified as all ApMV isolates sequence variants formed a single phylogenetic group in each of RNA1, RNA2, and RNA3. Two Australian PDV genetic strains were identified based on the combination of observed phylogenetic groups in each of RNA1, RNA2, and RNA3 and one Prunus tree had both strains. The accuracy of amplicon sequence variants phylogenetic analysis based on segments of each virus RNA were confirmed by phylogenetic analysis of full length genome sequences of Australian ApMV and PDV isolates and all published ApMV and PDV genomes from other countries. PMID:29562672
Oshiki, Mamoru; Segawa, Takahiro; Ishii, Satoshi
2018-02-02
Various microorganisms play key roles in the Nitrogen (N) cycle. Quantitative PCR (qPCR) and PCR-amplicon sequencing of the N cycle functional genes allow us to analyze the abundance and diversity of microbes responsible in the N transforming reactions in various environmental samples. However, analysis of multiple target genes can be cumbersome and expensive. PCR-independent analysis, such as metagenomics and metatranscriptomics, is useful but expensive especially when we analyze multiple samples and try to detect N cycle functional genes present at relatively low abundance. Here, we present the application of microfluidic qPCR chip technology to simultaneously quantify and prepare amplicon sequence libraries for multiple N cycle functional genes as well as taxon-specific 16S rRNA gene markers for many samples. This approach, named as N cycle evaluation (NiCE) chip, was evaluated by using DNA from pure and artificially mixed bacterial cultures and by comparing the results with those obtained by conventional qPCR and amplicon sequencing methods. Quantitative results obtained by the NiCE chip were comparable to those obtained by conventional qPCR. In addition, the NiCE chip was successfully applied to examine abundance and diversity of N cycle functional genes in wastewater samples. Although non-specific amplification was detected on the NiCE chip, this could be overcome by optimizing the primer sequences in the future. As the NiCE chip can provide high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes, this tool should advance our ability to explore N cycling in various samples. Importance. We report a novel approach, namely Nitrogen Cycle Evaluation (NiCE) chip by using microfluidic qPCR chip technology. By sequencing the amplicons recovered from the NiCE chip, we can assess diversities of the N cycle functional genes. The NiCE chip technology is applicable to analyze the temporal dynamics of the N cycle gene transcriptions in wastewater treatment bioreactors. The NiCE chip can provide high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes. While there is a room for future improvement, this tool should significantly advance our ability to explore the N cycle in various environmental samples. Copyright © 2018 American Society for Microbiology.
Quantitative trait nucleotide analysis using Bayesian model selection.
Blangero, John; Goring, Harald H H; Kent, Jack W; Williams, Jeff T; Peterson, Charles P; Almasy, Laura; Dyer, Thomas D
2005-10-01
Although much attention has been given to statistical genetic methods for the initial localization and fine mapping of quantitative trait loci (QTLs), little methodological work has been done to date on the problem of statistically identifying the most likely functional polymorphisms using sequence data. In this paper we provide a general statistical genetic framework, called Bayesian quantitative trait nucleotide (BQTN) analysis, for assessing the likely functional status of genetic variants. The approach requires the initial enumeration of all genetic variants in a set of resequenced individuals. These polymorphisms are then typed in a large number of individuals (potentially in families), and marker variation is related to quantitative phenotypic variation using Bayesian model selection and averaging. For each sequence variant a posterior probability of effect is obtained and can be used to prioritize additional molecular functional experiments. An example of this quantitative nucleotide analysis is provided using the GAW12 simulated data. The results show that the BQTN method may be useful for choosing the most likely functional variants within a gene (or set of genes). We also include instructions on how to use our computer program, SOLAR, for association analysis and BQTN analysis.
A novel COL11A1 mutation affecting splicing in a patient with Stickler syndrome.
Kohmoto, Tomohiro; Naruto, Takuya; Kobayashi, Haruka; Watanabe, Miki; Okamoto, Nana; Masuda, Kiyoshi; Imoto, Issei; Okamoto, Nobuhiko
2015-01-01
Stickler syndrome is a clinically and genetically heterogeneous collagenopathy characterized by ocular, auditory, skeletal and orofacial abnormalities, commonly occurring as an autosomal dominant trait. We conducted target resequencing to analyze candidate genes associated with known clinical phenotypes from a 4-year-old girl with Stickler syndrome. We detected a novel heterozygous intronic mutation (NM_001854.3:c.3168+5G>A) in COL11A1 that may impair splicing, which was suggested by in silico prediction and a minigene assay.
A novel COL11A1 mutation affecting splicing in a patient with Stickler syndrome
Kohmoto, Tomohiro; Naruto, Takuya; Kobayashi, Haruka; Watanabe, Miki; Okamoto, Nana; Masuda, Kiyoshi; Imoto, Issei; Okamoto, Nobuhiko
2015-01-01
Stickler syndrome is a clinically and genetically heterogeneous collagenopathy characterized by ocular, auditory, skeletal and orofacial abnormalities, commonly occurring as an autosomal dominant trait. We conducted target resequencing to analyze candidate genes associated with known clinical phenotypes from a 4-year-old girl with Stickler syndrome. We detected a novel heterozygous intronic mutation (NM_001854.3:c.3168+5G>A) in COL11A1 that may impair splicing, which was suggested by in silico prediction and a minigene assay. PMID:27081549
Hayashi, Masahiro; Natori, Tatsuya; Kubota-Hayashi, Sayoko; Miyata, Machiko; Ohkusu, Kiyofumi; Kurazono, Hisao; Makino, Souichi; Ezaki, Takayuki
2013-01-01
A quick foodborne pathogen screening method after six-hour enrichment culture with a broad-range food pathogen enrichment broth is described. Pathogenic factors of Salmonella enterica, Shigella spp., enteroinvasive Escherichia coli, and enterohemorrhagic E. coli are amplified with a cocktail primer and rapid polymerase chain reaction (PCR), which finishes amplification in 30 min. The PCR amplicon was differentiated with a dipstick DNA chromatography assay in 5–10 min. Starting from a four- to six-hour enrichment culture, this assay was finished within 45 min. Detection sensitivity of this protocol was less than 2.5 CFU/25 g for S. enterica and 3.3 CFU/25 g for enterohemorrhagic E. coli in spiked ground meat experiments. PMID:24364031
A., Kluber Laurel [Oak Ridge National Laboratory, U.S. Department of Energy, Oak Ridge, Tennessee, U.S.A.; Allen, Samantha A. [Oak Ridge National Laboratory, U.S. Department of Energy, Oak Ridge, Tennessee, U.S.A.; Hendershot, Nicholas [Oak Ridge National Laboratory, U.S. Department of Energy, Oak Ridge, Tennessee, U.S.A.; Hanson, Paul J. [Oak Ridge National Laboratory, U.S. Department of Energy, Oak Ridge, Tennessee, U.S.A.; Schadt, Christopher W. [Oak Ridge National Laboratory, U.S. Department of Energy, Oak Ridge, Tennessee, U.S.A.
2014-09-01
This data set contains the results of a microcosm incubation study on deep peat collected from the SPRUCE experimental site in the S1 Bog in September 2014. Microcosms were monitored for CO2 and CH4 production, and microbial community dynamics were assessed using qPCR and amplicon sequencing.The experiment was designed with a full factorial design with elevated temperature, nitrogen (N), (P), and pH treatments was used with samples from each transect serving replicates. In all, 96 microcosms were constructed to account for the 16 treatment combinations (N x P x pH x temperature), 2 time points, and 3 replicates. Temperature treatments were 6 °C, to mimic the SPRUCE ambient plot temperatures, and 15 °C to mimic the SPRUCE +9 °C treatment.
Ayinmode, Adekunle Bamidele; Ogbonna, Nkeiruka Fortunate; Widmer, Giovanni
To study the occurrence of Cryptosporidium infection in laboratory rats (Rattus norvegicus) raised for experimental usage, 134 faecal samples were obtained from two rearing houses in Ibadan and examined for the presence of Cryptosporidium oocyst using the modified acid fast staining technique. Cryptosporidium species in 2 samples positive for microscopy were further characterized by a nested polymerase chain reaction (PCR) amplifying the 18S rRNA gene. Two of 134 samples were positive for the Cryptosporidium oocysts. Sequencing of the small-subunit rRNA amplicons identified the species in the two PCR positive samples as Cryptosporidium andersoni and Cryptosporidium rat genotype. These findings showed that laboratory rat is a potential reservoir for diverse Cryptosporidium species and suggests that laboratory rats should be screened for Cryptosporidium infection prior to experiments, especially where pathogen free animals are not available. This the first report to identify Cryptosporidium species infecting laboratory rats in Nigeria.
Batt, Sarah L.; Charalambous, Bambos M.; McHugh, Timothy D.; Martin, Siobhan; Gillespie, Stephen H.
2005-01-01
Serotyping Streptococcus pneumoniae is a technique generally confined to reference laboratories, as purchasing pneumococcal antisera is a huge investment. Many attempts have been made to modify serological agglutination techniques to make them more accessible, and more recently developments in serotyping have focused on molecular techniques. This paper describes a PCR assay which amplifies the entire capsulation locus between dexB and aliA. Amplicons are digested to produce serotype-specific patterns. We have shown, using 81 epidemiologically unrelated strains representing 46 different serotypes, that the patterns correlate with a 90 to 100% similarity range for the same serotype or serogroup. Prospective testing of 73 isolates of unknown serotype confirmed reliable serotype attribution, and serotype profiles are reproducible on repeated testing. Once our database contains all 90 serotypes, this technique should be fully portable, cost-effective, and useful in any laboratory with sufficient molecular experience. PMID:15956380
Maternal lineages of peach genotypes
USDA-ARS?s Scientific Manuscript database
Simple sequence repeats (SSRs) in chloroplast genomes are useful markers to determine maternal lineages. The SSR mining results revealed that most chloroplast SSRs among three Prunus chloroplast genomes were conserved in locations and motif types, but polymorphic in motif and/or amplicon lengths. Fi...
Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes.
Haiminen, Niina; Feltus, F Alex; Parida, Laxmi
2011-04-15
We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies.
Christe, Camille; Stölting, Kai N; Bresadola, Luisa; Fussi, Barbara; Heinze, Berthold; Wegmann, Daniel; Lexer, Christian
2016-06-01
Natural hybrid zones have proven to be precious tools for understanding the origin and maintenance of reproductive isolation (RI) and therefore species. Most available genomic studies of hybrid zones using whole- or partial-genome resequencing approaches have focused on comparisons of the parental source populations involved in genome admixture, rather than exploring fine-scale patterns of chromosomal ancestry across the full admixture gradient present between hybridizing species. We have studied three well-known European 'replicate' hybrid zones of Populus alba and P. tremula, two widespread, ecologically divergent forest trees, using up to 432 505 single-nucleotide polymorphisms (SNPs) from restriction site-associated DNA (RAD) sequencing. Estimates of fine-scale chromosomal ancestry, genomic divergence and differentiation across all 19 poplar chromosomes revealed strikingly contrasting results, including an unexpected preponderance of F1 hybrids in the centre of genomic clines on the one hand, and genomically localized, spatially variable shared variants consistent with ancient introgression between the parental species on the other. Genetic ancestry had a significant effect on survivorship of hybrid seedlings in a common garden trial, pointing to selection against early-generation recombinants. Our results indicate a role for selection against recombinant genotypes in maintaining RI in the face of apparent F1 fertility, consistent with the intragenomic 'coadaptation' model of barriers to introgression upon secondary contact. Whole-genome resequencing of hybridizing populations will clarify the roles of specific genetic pathways in RI between these model forest trees and may reveal which loci are affected most strongly by its cyclic breakdown. © 2016 John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Leski, T. A.; Ansumana, R.; Jimmy, D. H.; Bangura, U.; Malanoski, A. P.; Lin, B.; Stenger, D. A.
2011-06-01
Multiplexed microbial diagnostic assays are a promising method for detection and identification of pathogens causing syndromes characterized by nonspecific symptoms in which traditional differential diagnosis is difficult. Also such assays can play an important role in outbreak investigations and environmental screening for intentional or accidental release of biothreat agents, which requires simultaneous testing for hundreds of potential pathogens. The resequencing pathogen microarray (RPM) is an emerging technological platform, relying on a combination of massively multiplex PCR and high-density DNA microarrays for rapid detection and high-resolution identification of hundreds of infectious agents simultaneously. The RPM diagnostic system was deployed in Sierra Leone, West Africa in collaboration with Njala University and Mercy Hospital Research Laboratory located in Bo. We used the RPM-Flu microarray designed for broad-range detection of human respiratory pathogens, to investigate a suspected outbreak of avian influenza in a number of poultry farms in which significant mortality of chickens was observed. The microarray results were additionally confirmed by influenza specific real-time PCR. The results of the study excluded the possibility that the outbreak was caused by influenza, but implicated Klebsiella pneumoniae as a possible pathogen. The outcome of this feasibility study confirms that application of broad-spectrum detection platforms for outbreak investigation in low-resource locations is possible and allows for rapid discovery of the responsible agents, even in cases when different agents are suspected. This strategy enables quick and cost effective detection of low probability events such as outbreak of a rare disease or intentional release of a biothreat agent.
Novak, Rachel L; Harper, David P; Caudell, David; Slape, Christopher; Beachy, Sarah H; Aplan, Peter D
2012-12-01
NUP98-HOXD13 (NHD13) and CALM-AF10 (CA10) are oncogenic fusion proteins produced by recurrent chromosomal translocations in patients with acute myeloid leukemia (AML). Transgenic mice that express these fusions develop AML with a long latency and incomplete penetrance, suggesting that collaborating genetic events are required for leukemic transformation. We employed genetic techniques to identify both preleukemic abnormalities in healthy transgenic mice as well as collaborating events leading to leukemic transformation. Candidate gene resequencing revealed that 6 of 27 (22%) CA10 AMLs spontaneously acquired a Ras pathway mutation and 8 of 27 (30%) acquired an Flt3 mutation. Two CA10 AMLs acquired an Flt3 internal-tandem duplication, demonstrating that these mutations can be acquired in murine as well as human AML. Gene expression profiles revealed a marked upregulation of Hox genes, particularly Hoxa5, Hoxa9, and Hoxa10 in both NHD13 and CA10 mice. Furthermore, mir196b, which is embedded within the Hoxa locus, was overexpressed in both CA10 and NHD13 samples. In contrast, the Hox cofactors Meis1 and Pbx3 were differentially expressed; Meis1 was increased in CA10 AMLs but not NHD13 AMLs, whereas Pbx3 was consistently increased in NHD13 but not CA10 AMLs. Silencing of Pbx3 in NHD13 cells led to decreased proliferation, increased apoptosis, and decreased colony formation in vitro, suggesting a previously unexpected role for Pbx3 in leukemic transformation. Published by Elsevier Inc.
Stafuzza, Nedenia Bonvino; Zerlotini, Adhemar; Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto
2017-01-01
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.
Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J.; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto
2017-01-01
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs. PMID:28323836
Improved hybrid de novo genome assembly of domesticated apple (Malus x domestica).
Li, Xuewei; Kui, Ling; Zhang, Jing; Xie, Yinpeng; Wang, Liping; Yan, Yan; Wang, Na; Xu, Jidi; Li, Cuiying; Wang, Wen; van Nocker, Steve; Dong, Yang; Ma, Fengwang; Guan, Qingmei
2016-08-08
Domesticated apple (Malus × domestica Borkh) is a popular temperate fruit with high nutrient levels and diverse flavors. In 2012, global apple production accounted for at least one tenth of all harvested fruits. A high-quality apple genome assembly is crucial for the selection and breeding of new cultivars. Currently, a single reference genome is available for apple, assembled from 16.9 × genome coverage short reads via Sanger and 454 sequencing technologies. Although a useful resource, this assembly covers only ~89 % of the non-repetitive portion of the genome, and has a relatively short (16.7 kb) contig N50 length. These downsides make it difficult to apply this reference in transcriptive or whole-genome re-sequencing analyses. Here we present an improved hybrid de novo genomic assembly of apple (Golden Delicious), which was obtained from 76 Gb (~102 × genome coverage) Illumina HiSeq data and 21.7 Gb (~29 × genome coverage) PacBio data. The final draft genome is approximately 632.4 Mb, representing ~ 90 % of the estimated genome. The contig N50 size is 111,619 bp, representing a 7 fold improvement. Further annotation analyses predicted 53,922 protein-coding genes and 2,765 non-coding RNA genes. The new apple genome assembly will serve as a valuable resource for investigating complex apple traits at the genomic level. It is not only suitable for genome editing and gene cloning, but also for RNA-seq and whole-genome re-sequencing studies.
The 3,000 rice genomes project
2014-01-01
Background Rice, Oryza sativa L., is the staple food for half the world’s population. By 2030, the production of rice must increase by at least 25% in order to keep up with global population growth and demand. Accelerated genetic gains in rice improvement are needed to mitigate the effects of climate change and loss of arable land, as well as to ensure a stable global food supply. Findings We resequenced a core collection of 3,000 rice accessions from 89 countries. All 3,000 genomes had an average sequencing depth of 14×, with average genome coverages and mapping rates of 94.0% and 92.5%, respectively. From our sequencing efforts, approximately 18.9 million single nucleotide polymorphisms (SNPs) in rice were discovered when aligned to the reference genome of the temperate japonica variety, Nipponbare. Phylogenetic analyses based on SNP data confirmed differentiation of the O. sativa gene pool into 5 varietal groups – indica, aus/boro, basmati/sadri, tropical japonica and temperate japonica. Conclusions Here, we report an international resequencing effort of 3,000 rice genomes. This data serves as a foundation for large-scale discovery of novel alleles for important rice phenotypes using various bioinformatics and/or genetic approaches. It also serves to understand the genomic diversity within O. sativa at a higher level of detail. With the release of the sequencing data, the project calls for the global rice community to take advantage of this data as a foundation for establishing a global, public rice genetic/genomic database and information platform for advancing rice breeding technology for future rice improvement. PMID:24872877
Fu, Chong-Yun; Liu, Wu-Ge; Liu, Di-Lin; Li, Ji-Hua; Zhu, Man-Shan; Liao, Yi-Long; Liu, Zhen-Rong; Zeng, Xue-Qin; Wang, Feng
2016-03-01
Next-generation sequencing technologies provide opportunities to further understand genetic variation, even within closely related cultivars. We performed whole genome resequencing of two elite indica rice varieties, RGD-7S and Taifeng B, whose F1 progeny showed hybrid weakness and hybrid vigor when grown in the early- and late-cropping seasons, respectively. Approximately 150 million 100-bp pair-end reads were generated, which covered ∼86% of the rice (Oryza sativa L. japonica 'Nipponbare') reference genome. A total of 2,758,740 polymorphic sites including 2,408,845 SNPs and 349,895 InDels were detected in RGD-7S and Taifeng B, respectively. Applying stringent parameters, we identified 961,791 SNPs and 46,640 InDels between RGD-7S and Taifeng B (RGD-7S/Taifeng B). The density of DNA polymorphisms was 256.8 SNPs and 12.5 InDels per 100 kb for RGD-7S/Taifeng B. Copy number variations (CNVs) were also investigated. In RGD-7S, 1989 of 2727 CNVs were overlapped in 218 genes, and 1231 of 2010 CNVs were annotated in 175 genes in Taifeng B. In addition, we verified a subset of InDels in the interval of hybrid weakness genes, Hw3 and Hw4, and obtained some polymorphic InDel markers, which will provide a sound foundation for cloning hybrid weakness genes. Analysis of genomic variations will also contribute to understanding the genetic basis of hybrid weakness and heterosis.
Moraes, Luis E.; Blow, Matthew J.; Hawley, Erik R.; ...
2017-02-16
Cyanobacteria have the potential to produce bulk and fine chemicals and members belonging to Nostoc sp. have received particular attention due to their relatively fast growth rate and the relative ease with which they can be harvested. Nostoc punctiforme is an aerobic, motile, Gram-negative, filamentous cyanobacterium that has been studied intensively to enhance our understanding of microbial carbon and nitrogen fixation. The genome of the type strain N. punctiforme ATCC 29133 was sequenced in 2001 and the scientific community has used these genome data extensively since then. Advances in bioinformatics tools for sequence annotation and the importance of this organismmore » prompted us to resequence and reanalyze its genome and to make both, the initial and improved annotation, available to the scientific community. The new draft genome has a total size of 9.1 Mbp and consists of 65 contiguous pieces of DNA with a GC content of 41.38% and 7664 protein-coding genes. Furthermore, the resequenced genome is slightly (5152 bp) larger and contains 987 more genes with functional prediction when compared to the previously published version. We deposited the annotation of both genomes in the Department of Energy’s IMG database to facilitate easy genome exploration by the scientific community without the need of in-depth bioinformatics skills. We expect that an facilitated access and ability to search the N. punctiforme ATCC 29133 for genes of interest will significantly facilitate metabolic engineering and genome prospecting efforts and ultimately the synthesis of biofuels and natural products from this keystone organism and closely related cyanobacteria.« less
Lan, Daoliang; Xiong, Xianrong; Mipam, Tserang-Donko; Fu, Changxiu; Li, Qiang; Ai, Yi; Hou, Dingchao; Chai, Zhixin; Zhong, Jincheng; Li, Jian
2018-01-01
Jinchuan yak, a newly discovered yak breed, not only possesses a large proportion of multi-ribs but also exhibits many good characteristics, such as high meat production, milk yield, and reproductive performance. However, there is limited information about its overall genetic structure, relationship with yaks in other areas, and possible origins and evolutionary processes. In this study, 7,693,689 high-quality single-nucleotide polymorphisms were identified by resequencing the genome of Jinchuan yak. Principal component and population genetic structure analyses showed that Jinchuan yak could be distinguished as an independent population among the domestic yak population. Linkage disequilibrium analysis showed that the decay rate of Jinchuan yak was the lowest of the domestic yak breeds, indicating that the degree of domestication and selection intensity of Jinchuan yak were higher than those of other yak breeds. Combined with archaeological data, we speculated that the origin of domestication of Jinchuan yak was ∼6000 yr ago (4000–10,000 yr ago). The quantitative dynamics of population growth history in Jinchuan yak was similar to that of other breeds of domestic and wild yaks, but was closer to that of the wild yak. No significant gene exchange between Jinchuan and other domestic yaks occurred. Compared with other domestic yaks, Jinchuan yak possessed 339 significantly and positively selected genes, several of which relate to physiological rhythm, histones, and the breed’s excellent production characteristics. Our results provide a basis for the discovery of the evolution, molecular origin, and unique traits of Jinchuan yak. PMID:29339406
Lan, Daoliang; Xiong, Xianrong; Mipam, Tserang-Donko; Fu, Changxiu; Li, Qiang; Ai, Yi; Hou, Dingchao; Chai, Zhixin; Zhong, Jincheng; Li, Jian
2018-03-02
Jinchuan yak, a newly discovered yak breed, not only possesses a large proportion of multi-ribs but also exhibits many good characteristics, such as high meat production, milk yield, and reproductive performance. However, there is limited information about its overall genetic structure, relationship with yaks in other areas, and possible origins and evolutionary processes. In this study, 7,693,689 high-quality single-nucleotide polymorphisms were identified by resequencing the genome of Jinchuan yak. Principal component and population genetic structure analyses showed that Jinchuan yak could be distinguished as an independent population among the domestic yak population. Linkage disequilibrium analysis showed that the decay rate of Jinchuan yak was the lowest of the domestic yak breeds, indicating that the degree of domestication and selection intensity of Jinchuan yak were higher than those of other yak breeds. Combined with archaeological data, we speculated that the origin of domestication of Jinchuan yak was ∼6000 yr ago (4000-10,000 yr ago). The quantitative dynamics of population growth history in Jinchuan yak was similar to that of other breeds of domestic and wild yaks, but was closer to that of the wild yak. No significant gene exchange between Jinchuan and other domestic yaks occurred. Compared with other domestic yaks, Jinchuan yak possessed 339 significantly and positively selected genes, several of which relate to physiological rhythm, histones, and the breed's excellent production characteristics. Our results provide a basis for the discovery of the evolution, molecular origin, and unique traits of Jinchuan yak. Copyright © 2018 Lan et al.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moraes, Luis E.; Blow, Matthew J.; Hawley, Erik R.
Cyanobacteria have the potential to produce bulk and fine chemicals and members belonging to Nostoc sp. have received particular attention due to their relatively fast growth rate and the relative ease with which they can be harvested. Nostoc punctiforme is an aerobic, motile, Gram-negative, filamentous cyanobacterium that has been studied intensively to enhance our understanding of microbial carbon and nitrogen fixation. The genome of the type strain N. punctiforme ATCC 29133 was sequenced in 2001 and the scientific community has used these genome data extensively since then. Advances in bioinformatics tools for sequence annotation and the importance of this organismmore » prompted us to resequence and reanalyze its genome and to make both, the initial and improved annotation, available to the scientific community. The new draft genome has a total size of 9.1 Mbp and consists of 65 contiguous pieces of DNA with a GC content of 41.38% and 7664 protein-coding genes. Furthermore, the resequenced genome is slightly (5152 bp) larger and contains 987 more genes with functional prediction when compared to the previously published version. We deposited the annotation of both genomes in the Department of Energy’s IMG database to facilitate easy genome exploration by the scientific community without the need of in-depth bioinformatics skills. We expect that an facilitated access and ability to search the N. punctiforme ATCC 29133 for genes of interest will significantly facilitate metabolic engineering and genome prospecting efforts and ultimately the synthesis of biofuels and natural products from this keystone organism and closely related cyanobacteria.« less
Kazama, Yusuke; Ishii, Kotaro; Hirano, Tomonari; Wakana, Taeko; Yamada, Mieko; Ohbu, Sumie; Abe, Tomoko
2017-12-01
Heavy-ion irradiation is a powerful mutagen that possesses high linear energy transfer (LET). Several studies have indicated that the value of LET affects DNA lesion formation in several ways, including the efficiency and the density of double-stranded break induction along the particle path. We assumed that the mutation type can be altered by selecting an appropriate LET value. Here, we quantitatively demonstrate differences in the mutation type induced by irradiation with two representative ions, Ar ions (LET: 290 keV μm -1 ) and C ions (LET: 30.0 keV μm -1 ), by whole-genome resequencing of the Arabidopsis mutants produced by these irradiations. Ar ions caused chromosomal rearrangements or large deletions (≥100 bp) more frequently than C ions, with 10.2 and 2.3 per mutant genome under Ar- and C-ion irradiation, respectively. Conversely, C ions induced more single-base substitutions and small indels (<100 bp) than Ar ions, with 28.1 and 56.9 per mutant genome under Ar- and C-ion irradiation, respectively. Moreover, the rearrangements induced by Ar-ion irradiation were more complex than those induced by C-ion irradiation, and tended to accompany single base substitutions or small indels located close by. In conjunction with the detection of causative genes through high-throughput sequencing, selective irradiation by beams with different effects will be a powerful tool for forward genetics as well as studies on chromosomal rearrangements. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Analysis of dust samples from the Middle East using high-density resequencing micro-array RPM-TEI
NASA Astrophysics Data System (ADS)
Leski, T. A.; Gregory, M. J.; Malanoski, A. P.; Smith, J. P.; Glaven, R. H.; Wang, Z.; Stenger, D. A.; Lin, B.
2010-04-01
A previously developed resequencing microarray, "Tropical and Emerging Infections (RPM-TEI v.1.0 chip)", designed to identify and discriminate between tropical diseases and other potential biothreat agents, their near-neighbor species, and/or potential confounders, was used to characterize the microbes present in the silt/clay fraction of surface soils and airborne dust collected from the Middle East. Local populations and U.S. military personnel deployed to the Middle East are regularly subjected to high levels of airborne desert dust containing a significant fraction of inhalable particles and some portion require clinical aid. Not all of the clinical symptoms can be directly attributed to the physical action of material in the human respiratory tract. To better understand the potential health effects of the airborne dust, the composition of the microbial communities associated with surface soil and/or airborne dust (air filter) samples from 19 different sites in Iraq and Kuwait was identified using RPM-TEI v.1.0. Results indicated that several microorganisms including a class of rapidly growing Mycobacterium, Bacillus, Brucella, Clostridium and Coxiella burnetti, were present in the samples. The presence of these organisms in the surface soils and the inhalable fraction of airborne dust analyzed may pose a human health risk and warrants further investigation. Better understanding of the factors influencing the composition of these microbial communities is important to address questions related to human health and is critical to achieving Force Health Protection for the Warfighter operating in the Middle East, Afghanistan, North Africa and other arid regions.
Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Kumar, Vinay; Kale, Sandip M; Sinha, Pallavi; Chitikineni, Annapurna; Pazhamala, Lekha T; Garg, Vanika; Sharma, Mamta; Sameer Kumar, Chanda Venkata; Parupalli, Swathi; Vechalapu, Suryanarayana; Patil, Suyash; Muniswamy, Sonnappa; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Dharmaraj, Pallavi Subbanna; Varshney, Rajeev K
2016-05-01
To map resistance genes for Fusarium wilt (FW) and sterility mosaic disease (SMD) in pigeonpea, sequencing-based bulked segregant analysis (Seq-BSA) was used. Resistant (R) and susceptible (S) bulks from the extreme recombinant inbred lines of ICPL 20096 × ICPL 332 were sequenced. Subsequently, SNP index was calculated between R- and S-bulks with the help of draft genome sequence and reference-guided assembly of ICPL 20096 (resistant parent). Seq-BSA has provided seven candidate SNPs for FW and SMD resistance in pigeonpea. In parallel, four additional genotypes were re-sequenced and their combined analysis with R- and S-bulks has provided a total of 8362 nonsynonymous (ns) SNPs. Of 8362 nsSNPs, 60 were found within the 2-Mb flanking regions of seven candidate SNPs identified through Seq-BSA. Haplotype analysis narrowed down to eight nsSNPs in seven genes. These eight nsSNPs were further validated by re-sequencing 11 genotypes that are resistant and susceptible to FW and SMD. This analysis revealed association of four candidate nsSNPs in four genes with FW resistance and four candidate nsSNPs in three genes with SMD resistance. Further, In silico protein analysis and expression profiling identified two most promising candidate genes namely C.cajan_01839 for SMD resistance and C.cajan_03203 for FW resistance. Identified candidate genomic regions/SNPs will be useful for genomics-assisted breeding in pigeonpea. © 2015 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Tavtigian, Sean V; Byrnes, Graham B; Goldgar, David E; Thomas, Alun
2008-11-01
Many individually rare missense substitutions are encountered during deep resequencing of candidate susceptibility genes and clinical mutation screening of known susceptibility genes. BRCA1 and BRCA2 are among the most resequenced of all genes, and clinical mutation screening of these genes provides an extensive data set for analysis of rare missense substitutions. Align-GVGD is a mathematically simple missense substitution analysis algorithm, based on the Grantham difference, which has already contributed to classification of missense substitutions in BRCA1, BRCA2, and CHEK2. However, the distribution of genetic risk as a function of Align-GVGD's output variables Grantham variation (GV) and Grantham deviation (GD) has not been well characterized. Here, we used data from the Myriad Genetic Laboratories database of nearly 70,000 full-sequence tests plus two risk estimates, one approximating the odds ratio and the other reflecting strength of selection, to display the distribution of risk in the GV-GD plane as a series of surfaces. We abstracted contours from the surfaces and used the contours to define a sequence of missense substitution grades ordered from greatest risk to least risk. The grades were validated internally using a third, personal and family history-based, measure of risk. The Align-GVGD grades defined here are applicable to both the genetic epidemiology problem of classifying rare missense substitutions observed in known susceptibility genes and the molecular epidemiology problem of analyzing rare missense substitutions observed during case-control mutation screening studies of candidate susceptibility genes. (c) 2008 Wiley-Liss, Inc.
Kawakami, Takeshi; Backström, Niclas; Burri, Reto; Husby, Arild; Olason, Pall; Rice, Amber M; Ålund, Murielle; Qvarnström, Anna; Ellegren, Hans
2014-01-01
With the access to draft genome sequence assemblies and whole-genome resequencing data from population samples, molecular ecology studies will be able to take truly genome-wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million single-nucleotide polymorphism (SNP)s in population resequencing of this species and its sister species, pied flycatcher. Here, we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. Using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. Linkage disequilibrium declined quickly to the background level at an average distance of 17 kb, but the extent of LD varied markedly within the genome and was more than 10-fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later-generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from much fewer number of markers. With an estimated divergence time as recently as <1 Ma, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system. PMID:24784959
Silva-Junior, Orzenil B; Grattapaglia, Dario
2015-11-01
We used high-density single nucleotide polymorphism (SNP) data and whole-genome pooled resequencing to examine the landscape of population recombination (ρ) and nucleotide diversity (ϴw ), assess the extent of linkage disequilibrium (r(2) ) and build the highest density linkage maps for Eucalyptus. At the genome-wide level, linkage disequilibrium (LD) decayed within c. 4-6 kb, slower than previously reported from candidate gene studies, but showing considerable variation from absence to complete LD up to 50 kb. A sharp decrease in the estimate of ρ was seen when going from short to genome-wide inter-SNP distances, highlighting the dependence of this parameter on the scale of observation adopted. Recombination was correlated with nucleotide diversity, gene density and distance from the centromere, with hotspots of recombination enriched for genes involved in chemical reactions and pathways of the normal metabolic processes. The high nucleotide diversity (ϴw = 0.022) of E. grandis revealed that mutation is more important than recombination in shaping its genomic diversity (ρ/ϴw = 0.645). Chromosome-wide ancestral recombination graphs allowed us to date the split of E. grandis (1.7-4.8 million yr ago) and identify a scenario for the recent demographic history of the species. Our results have considerable practical importance to Genome Wide Association Studies (GWAS), while indicating bright prospects for genomic prediction of complex phenotypes in eucalypt breeding. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Re-sequencing and genetic variation identification of a rice line with ideal plant architecture.
Li, Shuangcheng; Xie, Kailong; Li, Wenbo; Zou, Ting; Ren, Yun; Wang, Shiquan; Deng, Qiming; Zheng, Aiping; Zhu, Jun; Liu, Huainian; Wang, Lingxia; Ai, Peng; Gao, Fengyan; Huang, Bin; Cao, Xuemei; Li, Ping
2012-12-01
The ideal plant architecture (IPA) includes several important characteristics such as low tiller numbers, few or no unproductive tillers, more grains per panicle, and thick and sturdy stems. We have developed an indica restorer line 7302R that displays the IPA phenotype in terms of tiller number, grain number, and stem strength. However, its mechanism had to be clarified. We performed re-sequencing and genome-wide variation analysis of 7302R using the Solexa sequencing technology. With the genomic sequence of the indica cultivar 9311 as reference, 307 627 SNPs, 57 372 InDels, and 3 096 SVs were identified in the 7302R genome. The 7302R-specific variations were investigated via the synteny analysis of all the SNPs of 7302R with those of the previous sequenced none-IPA-type lines IR24, MH63, and SH527. Moreover, we found 178 168 7302R-specific SNPs across the whole genome and 30 239 SNPs in the predicted mRNA regions, among which 8 517 were Non-syn CDS. In addition, 263 large-effect SNPs that were expected to affect the integrity of encoded proteins were identified from the 7302R-specific SNPs. SNPs of several important previously cloned rice genes were also identified by aligning the 7302R sequence with other sequence lines. Our results provided several candidates account for the IPA phenotype of 7302R. These results therefore lay the groundwork for long-term efforts to uncover important genes and alleles for rice plant architecture construction, also offer useful data resources for future genetic and genomic studies in rice.
Zhang, Qian; Gou, Wenyu; Wang, Xiaotong; Zhang, Yawen; Ma, Jun; Zhang, Hongliang; Zhang, Ying; Zhang, Hao
2016-01-01
Tibetan chicken, unlike their lowland counterparts, exhibit specific adaptations to high-altitude conditions. The genetic mechanisms of such adaptations in highland chickens were determined by resequencing the genomes of four highland (Tibetan and Lhasa White) and four lowland (White Leghorn, Lindian, and Chahua) chicken populations. Our results showed an evident genetic admixture in Tibetan chickens, suggesting a history of introgression from lowland gene pools. Genes showing positive selection in highland populations were related to cardiovascular and respiratory system development, DNA repair, response to radiation, inflammation, and immune responses, indicating a strong adaptation to oxygen scarcity and high-intensity solar radiation. The distribution of allele frequencies of nonsynonymous single nucleotide polymorphisms between highland and lowland populations was analyzed using chi-square test, which showed that several differentially distributed genes with missense mutations were enriched in several functional categories, especially in blood vessel development and adaptations to hypoxia and intense radiation. RNA sequencing revealed that several differentially expressed genes were enriched in gene ontology terms related to blood vessel and respiratory system development. Several candidate genes involved in the development of cardiorespiratory system (FGFR1, CTGF, ADAM9, JPH2, SATB1, BMP4, LOX, LPR, ANGPTL4, and HYAL1), inflammation and immune responses (AIRE, MYO1F, ZAP70, DDX60, CCL19, CD47, JSC, and FAS), DNA repair, and responses to radiation (VCP, ASH2L, and FANCG) were identified to play key roles in the adaptation to high-altitude conditions. Our data provide new insights into the unique adaptations of highland animals to extreme environments. PMID:26907498
Genomic variants in an inbred mouse model predict mania-like behaviors.
Saul, Michael C; Stevenson, Sharon A; Zhao, Changjiu; Driessen, Terri M; Eisinger, Brian E; Gammie, Stephen C
2018-01-01
Contemporary rodent models for bipolar disorders split the bipolar spectrum into complimentary behavioral endophenotypes representing mania and depression. Widely accepted mania models typically utilize single gene transgenics or pharmacological manipulations, but inbred rodent strains show great potential as mania models. Their acceptance is often limited by the lack of genotypic data needed to establish construct validity. In this study, we used a unique strategy to inexpensively explore and confirm population allele differences in naturally occurring candidate variants in a manic rodent strain, the Madison (MSN) mouse strain. Variants were identified using whole exome resequencing on a small population of animals. Interesting candidate variants were confirmed in a larger population with genotyping. We enriched these results with observations of locomotor behavior from a previous study. Resequencing identified 447 structural variants that are mostly fixed in the MSN strain relative to control strains. After filtering and annotation, we found 11 non-synonymous MSN variants that we believe alter protein function. The allele frequencies for 6 of these variants were consistent with explanatory variants for the Madison strain's phenotype. The variants are in the Npas2, Cp, Polr3c, Smarca4, Trpv1, and Slc5a7 genes, and many of these genes' products are in pathways implicated in human bipolar disorders. Variants in Smarca4 and Polr3c together explained over 40% of the variance in locomotor behavior in the Hsd:ICR founder strain. These results enhance the MSN strain's construct validity and implicate altered nucleosome structure and transcriptional regulation as a chief molecular system underpinning behavior.
Kawakami, Hiroshi; Ogimoto, Akiyoshi; Tokunaga, Naohito; Nishimura, Kazuhisa; Kawakami, Hideo; Higashi, Haruhiko; Iio, Chiharuko; Kono, Tamami; Aono, Jun; Uetani, Teruyoshi; Nagai, Takayuki; Inoue, Katsuji; Suzuki, Jun; Ikeda, Shuntaro; Okura, Takafumi; Ohyagi, Yasumasa; Tabara, Yasuharu; Higaki, Jitsuo
2018-05-30
The cardiac phenotype of laminopathies is characterized by cardiac conduction disorders (CCDs) and dilated cardiomyopathy (DCM). Although laminopathies have been considered monogenic, they exhibit a remarkable degree of clinical variability. This case series aimed to detect the causal mutation and to investigate the causes of clinical variability in a Japanese family with inherited CCD and DCM.Of the five family members investigated, four had either CCD/DCM or CCD alone, while one subject had no cardiovascular disease and acted as a normal control. We performed targeted resequencing of 174 inherited cardiovascular disease-associated genes in this family and pathological mutations were confirmed using Sanger sequencing. The degree of clinical severity and variability were also evaluated using long-term medical records. We discovered a novel heterozygous truncating lamin A/C (LMNA) mutation (c.774delG) in all four subjects with CCD. Because this mutation was predicted to cause a frameshift mutation and premature termination (p.Gln258HisfsTer222) in LMNA, we believe that this LMNA mutation was the causal mutation in this family with CCD and laminopathies. In addition, gender-specific intra-familiar clinical variability was observed in this Japanese family where affected males exhibited an earlier onset of CCD and more severe DCM compared to affected females. Using targeted resequencing, we discovered a novel truncating LMNA mutation associated with CCD and DCM in this family characterized by gender differences in clinical severity in LMNA carriers. Our results suggest that in patients with laminopathy, clinical severity may be the result of multiple factors.
Weller, Andreas M.; Rödelsperger, Christian; Eberhardt, Gabi; Molnar, Ruxandra I.; Sommer, Ralf J.
2014-01-01
Base substitution mutations are a major source of genetic novelty and mutation accumulation line (MAL) studies revealed a nearly universal AT bias in de novo mutation spectra. While a comparison of de novo mutation spectra with the actual nucleotide composition in the genome suggests the existence of general counterbalancing mechanisms, little is known about the evolutionary and historical details of these opposing forces. Here, we correlate MAL-derived mutation spectra with patterns observed from population resequencing. Variation observed in natural populations has already been subject to evolutionary forces. Distinction between rare and common alleles, the latter of which are close to fixation and of presumably older age, can provide insight into mutational processes and their influence on genome evolution. We provide a genome-wide analysis of de novo mutations in 22 MALs of the nematode Pristionchus pacificus and compare the spectra with natural variants observed in resequencing of 104 natural isolates. MALs show an AT bias of 5.3, one of the highest values observed to date. In contrast, the AT bias in natural variants is much lower. Specifically, rare derived alleles show an AT bias of 2.4, whereas common derived alleles close to fixation show no AT bias at all. These results indicate the existence of a strong opposing force and they suggest that the GC content of the P. pacificus genome is in equilibrium. We discuss GC-biased gene conversion as a potential mechanism acting against AT-biased mutations. This study provides insight into genome evolution by combining MAL studies with natural variation. PMID:24414549
GWASeq: targeted re-sequencing follow up to GWAS.
Salomon, Matthew P; Li, Wai Lok Sibon; Edlund, Christopher K; Morrison, John; Fortini, Barbara K; Win, Aung Ko; Conti, David V; Thomas, Duncan C; Duggan, David; Buchanan, Daniel D; Jenkins, Mark A; Hopper, John L; Gallinger, Steven; Le Marchand, Loïc; Newcomb, Polly A; Casey, Graham; Marjoram, Paul
2016-03-03
For the last decade the conceptual framework of the Genome-Wide Association Study (GWAS) has dominated the investigation of human disease and other complex traits. While GWAS have been successful in identifying a large number of variants associated with various phenotypes, the overall amount of heritability explained by these variants remains small. This raises the question of how best to follow up on a GWAS, localize causal variants accounting for GWAS hits, and as a consequence explain more of the so-called "missing" heritability. Advances in high throughput sequencing technologies now allow for the efficient and cost-effective collection of vast amounts of fine-scale genomic data to complement GWAS. We investigate these issues using a colon cancer dataset. After QC, our data consisted of 1993 cases, 899 controls. Using marginal tests of associations, we identify 10 variants distributed among six targeted regions that are significantly associated with colorectal cancer, with eight of the variants being novel to this study. Additionally, we perform so-called 'SNP-set' tests of association and identify two sets of variants that implicate both common and rare variants in the etiology of colorectal cancer. Here we present a large-scale targeted re-sequencing resource focusing on genomic regions implicated in colorectal cancer susceptibility previously identified in several GWAS, which aims to 1) provide fine-scale targeted sequencing data for fine-mapping and 2) provide data resources to address methodological questions regarding the design of sequencing-based follow-up studies to GWAS. Additionally, we show that this strategy successfully identifies novel variants associated with colorectal cancer susceptibility and can implicate both common and rare variants.
Zapata, Luis; Ding, Jia; Willing, Eva-Maria; Hartwig, Benjamin; Bezdan, Daniela; Jiao, Wen-Biao; Patel, Vipul; Velikkakam James, Geo; Koornneef, Maarten; Ossowski, Stephan; Schneeberger, Korbinian
2016-07-12
Resequencing or reference-based assemblies reveal large parts of the small-scale sequence variation. However, they typically fail to separate such local variation into colinear and rearranged variation, because they usually do not recover the complement of large-scale rearrangements, including transpositions and inversions. Besides the availability of hundreds of genomes of diverse Arabidopsis thaliana accessions, there is so far only one full-length assembled genome: the reference sequence. We have assembled 117 Mb of the A. thaliana Landsberg erecta (Ler) genome into five chromosome-equivalent sequences using a combination of short Illumina reads, long PacBio reads, and linkage information. Whole-genome comparison against the reference sequence revealed 564 transpositions and 47 inversions comprising ∼3.6 Mb, in addition to 4.1 Mb of nonreference sequence, mostly originating from duplications. Although rearranged regions are not different in local divergence from colinear regions, they are drastically depleted for meiotic recombination in heterozygotes. Using a 1.2-Mb inversion as an example, we show that such rearrangement-mediated reduction of meiotic recombination can lead to genetically isolated haplotypes in the worldwide population of A. thaliana Moreover, we found 105 single-copy genes, which were only present in the reference sequence or the Ler assembly, and 334 single-copy orthologs, which showed an additional copy in only one of the genomes. To our knowledge, this work gives first insights into the degree and type of variation, which will be revealed once complete assemblies will replace resequencing or other reference-dependent methods.
Carrot Juice Fermentations as Man-Made Microbial Ecosystems Dominated by Lactic Acid Bacteria.
Wuyts, Sander; Van Beeck, Wannes; Oerlemans, Eline F M; Wittouck, Stijn; Claes, Ingmar J J; De Boeck, Ilke; Weckx, Stefan; Lievens, Bart; De Vuyst, Luc; Lebeer, Sarah
2018-06-15
Spontaneous vegetable fermentations, with their rich flavors and postulated health benefits, are regaining popularity. However, their microbiology is still poorly understood, therefore raising concerns about food safety. In addition, such spontaneous fermentations form interesting cases of man-made microbial ecosystems. Here, samples from 38 carrot juice fermentations were collected through a citizen science initiative, in addition to three laboratory fermentations. Culturing showed that Enterobacteriaceae were outcompeted by lactic acid bacteria (LAB) between 3 and 13 days of fermentation. Metabolite-target analysis showed that lactic acid and mannitol were highly produced, as well as the biogenic amine cadaverine. High-throughput 16S rRNA gene sequencing revealed that mainly species of Leuconostoc and Lactobacillus (as identified by 8 and 20 amplicon sequence variants [ASVs], respectively) mediated the fermentations in subsequent order. The analyses at the DNA level still detected a high number of Enterobacteriaceae , but their relative abundance was low when RNA-based sequencing was performed to detect presumptive metabolically active bacterial cells. In addition, this method greatly reduced host read contamination. Phylogenetic placement indicated a high LAB diversity, with ASVs from nine different phylogenetic groups of the Lactobacillus genus complex. However, fermentation experiments with isolates showed that only strains belonging to the most prevalent phylogenetic groups preserved the fermentation dynamics. The carrot juice fermentation thus forms a robust man-made microbial ecosystem suitable for studies on LAB diversity and niche specificity. IMPORTANCE The usage of fermented food products by professional chefs is steadily growing worldwide. Meanwhile, this interest has also increased at the household level. However, many of these artisanal food products remain understudied. Here, an extensive microbial analysis was performed of spontaneous fermented carrot juices which are used as nonalcoholic alternatives for wine in a Belgian Michelin star restaurant. Samples were collected through an active citizen science approach with 38 participants, in addition to three laboratory fermentations. Identification of the main microbial players revealed that mainly species of Leuconostoc and Lactobacillus mediated the fermentations in subsequent order. In addition, a high diversity of lactic acid bacteria was found; however, fermentation experiments with isolates showed that only strains belonging to the most prevalent lactic acid bacteria preserved the fermentation dynamics. Finally, this study showed that the usage of RNA-based 16S rRNA amplicon sequencing greatly reduces host read contamination. Copyright © 2018 American Society for Microbiology.
Rinke, Jenny; Schäfer, Vivien; Schmidt, Mathias; Ziermann, Janine; Kohlmann, Alexander; Hochhaus, Andreas; Ernst, Thomas
2013-08-01
We sought to establish a convenient, sensitive next-generation sequencing (NGS) method for genotyping the 26 most commonly mutated leukemia-associated genes in a single work flow and to optimize this method for low amounts of input template DNA. We designed 184 PCR amplicons that cover all of the candidate genes. NGS was performed with genomic DNA (gDNA) from a cohort of 10 individuals with chronic myelomonocytic leukemia. The results were compared with NGS data obtained from sequencing of DNA generated by whole-genome amplification (WGA) of 20 ng template gDNA. Differences between gDNA and WGA samples in variant frequencies were determined for 2 different WGA kits. For gDNA samples, 25 of 26 genes were successfully sequenced with a sensitivity of 5%, which was achieved by a median coverage of 492 reads (range, 308-636 reads) per amplicon. We identified 24 distinct mutations in 11 genes. With WGA samples, we reliably detected all mutations above 5% sensitivity with a median coverage of 506 reads (range, 256-653 reads) per amplicon. With all variants included in the analysis, WGA amplification by the 2 kits tested yielded differences in variant frequencies that ranged from -28.19% to +9.94% [mean (SD) difference, -0.2% (4.08%)] and from -35.03% to +18.67% [mean difference, -0.75% (5.12%)]. Our method permits simultaneous analysis of a wide range of leukemia-associated target genes in a single sequencing run. NGS can be performed after WGA of template DNA for reliable detection of variants without introducing appreciable bias.
Sie, Daoud; Snijders, Peter J F; Meijer, Gerrit A; Doeleman, Marije W; van Moorsel, Marinda I H; van Essen, Hendrik F; Eijk, Paul P; Grünberg, Katrien; van Grieken, Nicole C T; Thunnissen, Erik; Verheul, Henk M; Smit, Egbert F; Ylstra, Bauke; Heideman, Daniëlle A M
2014-10-01
Next generation DNA sequencing (NGS) holds promise for diagnostic applications, yet implementation in routine molecular pathology practice requires performance evaluation on DNA derived from routine formalin-fixed paraffin-embedded (FFPE) tissue specimens. The current study presents a comprehensive analysis of TruSeq Amplicon Cancer Panel-based NGS using a MiSeq Personal sequencer (TSACP-MiSeq-NGS) for somatic mutation profiling. TSACP-MiSeq-NGS (testing 212 hotspot mutation amplicons of 48 genes) and a data analysis pipeline were evaluated in a retrospective learning/test set approach (n = 58/n = 45 FFPE-tumor DNA samples) against 'gold standard' high-resolution-melting (HRM)-sequencing for the genes KRAS, EGFR, BRAF and PIK3CA. Next, the performance of the validated test algorithm was assessed in an independent, prospective cohort of FFPE-tumor DNA samples (n = 75). In the learning set, a number of minimum parameter settings was defined to decide whether a FFPE-DNA sample is qualified for TSACP-MiSeq-NGS and for calling mutations. The resulting test algorithm revealed 82% (37/45) compliance to the quality criteria and 95% (35/37) concordant assay findings for KRAS, EGFR, BRAF and PIK3CA with HRM-sequencing (kappa = 0.92; 95% CI = 0.81-1.03) in the test set. Subsequent application of the validated test algorithm to the prospective cohort yielded a success rate of 84% (63/75), and a high concordance with HRM-sequencing (95% (60/63); kappa = 0.92; 95% CI = 0.84-1.01). TSACP-MiSeq-NGS detected 77 mutations in 29 additional genes. TSACP-MiSeq-NGS is suitable for diagnostic gene mutation profiling in oncopathology.
Shinozuka, Hiroshi; Cogan, Noel O I; Shinozuka, Maiko; Marshall, Alexis; Kay, Pippa; Lin, Yi-Han; Spangenberg, German C; Forster, John W
2015-04-11
Fragmentation at random nucleotide locations is an essential process for preparation of DNA libraries to be used on massively parallel short-read DNA sequencing platforms. Although instruments for physical shearing, such as the Covaris S2 focused-ultrasonicator system, and products for enzymatic shearing, such as the Nextera technology and NEBNext dsDNA Fragmentase kit, are commercially available, a simple and inexpensive method is desirable for high-throughput sequencing library preparation. MspJI is a recently characterised restriction enzyme which recognises the sequence motif CNNR (where R = G or A) when the first base is modified to 5-methylcytosine or 5-hydroxymethylcytosine. A semi-random enzymatic DNA amplicon fragmentation method was developed based on the unique cleavage properties of MspJI. In this method, random incorporation of 5-methyl-2'-deoxycytidine-5'-triphosphate is achieved through DNA amplification with DNA polymerase, followed by DNA digestion with MspJI. Due to the recognition sequence of the enzyme, DNA amplicons are fragmented in a relatively sequence-independent manner. The size range of the resulting fragments was capable of control through optimisation of 5-methyl-2'-deoxycytidine-5'-triphosphate concentration in the reaction mixture. A library suitable for sequencing using the Illumina MiSeq platform was prepared and processed using the proposed method. Alignment of generated short reads to a reference sequence demonstrated a relatively high level of random fragmentation. The proposed method may be performed with standard laboratory equipment. Although the uniformity of coverage was slightly inferior to the Covaris physical shearing procedure, due to efficiencies of cost and labour, the method may be more suitable than existing approaches for implementation in large-scale sequencing activities, such as bacterial artificial chromosome (BAC)-based genome sequence assembly, pan-genomic studies and locus-targeted genotyping-by-sequencing.
Donin, Daiane Güllich; de Arruda Leme, Raquel; Alfieri, Alice Fernandes; Alberton, Geraldo Camilo; Alfieri, Amauri Alcindo
2014-03-01
Porcine teschovirus (PTV), Porcine sapelovirus (PSV) and Enterovirus G (EV-G) have been associated with enteric, respiratory, reproductive and neurological disorders. Although Brazil is the world's fourth largest producer and exporter of pork, no information on the occurrence of PTV, PSV and EV-G infections is available for Brazilian pig herds. This study aimed to investigate the occurrence of Porcine enteric picornavirus infections in pig farms located in three distinct geographical regions of Brazil. Forty randomly selected diarrhoeic and normal consistency faeces of suckling (n = 22) and nursery (n = 18) pigs from farms located in 21 distinct cities of the Southern, Southeast, and Midwest regions of Brazil were evaluated by nested-RT-PCR assays. Suckling piglets presented the expected amplicon size for PTV (158 bp) and EV-G (313 bp) in single and mixed infections in 40.9 % (9/22) of the faecal samples. PSV amplicon (212 bp) was not detected in this age group. For nursery pigs, Porcine enteric picornaviruses amplicons were present in 77.8 % (14/18) of the faecal samples. PTV and EV-G were detected in single and mixed infections, while PSV was detected only in two samples in co-infection with PTV and EV-G in this age group. The Brazilian regions evaluated presented at least two of the tested viruses. Sequencing analysis revealed high similarities to the related viruses (95.3 to 99.2 % for PTV, 94.2 to 98.5 % for PSV and 86 to 100 % for EV-G). For the first time PTV, PSV and EV-G have been molecularly detected and characterised in pig faecal samples in Brazil.
Akimoto, Chizuru; Volk, Alexander E; van Blitterswijk, Marka; Van den Broeck, Marleen; Leblond, Claire S; Lumbroso, Serge; Camu, William; Neitzel, Birgit; Onodera, Osamu; van Rheenen, Wouter; Pinto, Susana; Weber, Markus; Smith, Bradley; Proven, Melanie; Talbot, Kevin; Keagle, Pamela; Chesi, Alessandra; Ratti, Antonia; van der Zee, Julie; Alstermark, Helena; Birve, Anna; Calini, Daniela; Nordin, Angelica; Tradowsky, Daniela C; Just, Walter; Daoud, Hussein; Angerbauer, Sabrina; DeJesus-Hernandez, Mariely; Konno, Takuya; Lloyd-Jani, Anjali; de Carvalho, Mamede; Mouzat, Kevin; Landers, John E; Veldink, Jan H; Silani, Vincenzo; Gitler, Aaron D; Shaw, Christopher E; Rouleau, Guy A; van den Berg, Leonard H; Van Broeckhoven, Christine; Rademakers, Rosa; Andersen, Peter M; Kubisch, Christian
2014-01-01
Background The GGGGCC-repeat expansion in C9orf72 is the most frequent mutation found in patients with amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Most of the studies on C9orf72 have relied on repeat-primed PCR (RP-PCR) methods for detection of the expansions. To investigate the inherent limitations of this technique, we compared methods and results of 14 laboratories. Methods The 14 laboratories genotyped DNA from 78 individuals (diagnosed with ALS or FTD) in a blinded fashion. Eleven laboratories used a combination of amplicon-length analysis and RP-PCR, whereas three laboratories used RP-PCR alone; Southern blotting techniques were used as a reference. Results Using PCR-based techniques, 5 of the 14 laboratories got results in full accordance with the Southern blotting results. Only 50 of the 78 DNA samples got the same genotype result in all 14 laboratories. There was a high degree of false positive and false negative results, and at least one sample could not be genotyped at all in 9 of the 14 laboratories. The mean sensitivity of a combination of amplicon-length analysis and RP-PCR was 95.0% (73.9–100%), and the mean specificity was 98.0% (87.5–100%). Overall, a sensitivity and specificity of more than 95% was observed in only seven laboratories. Conclusions Because of the wide range seen in genotyping results, we recommend using a combination of amplicon-length analysis and RP-PCR as a minimum in a research setting. We propose that Southern blotting techniques should be the gold standard, and be made obligatory in a clinical diagnostic setting. PMID:24706941
Heitlinger, Emanuel; Ferreira, Susana C M; Thierer, Dagmar; Hofer, Heribert; East, Marion L
2017-01-01
In mammals, two factors likely to affect the diversity and composition of intestinal bacteria (bacterial microbiome) and eukaryotes (eukaryome) are social status and age. In species in which social status determines access to resources, socially dominant animals maintain better immune processes and health status than subordinates. As high species diversity is an index of ecosystem health, the intestinal biome of healthier, socially dominant animals should be more diverse than those of subordinates. Gradual colonization of the juvenile intestine after birth predicts lower intestinal biome diversity in juveniles than adults. We tested these predictions on the effect of: (1) age (juvenile/adult) and (2) social status (low/high) on bacterial microbiome and eukaryome diversity and composition in the spotted hyena ( Crocuta crocuta ), a highly social, female-dominated carnivore in which social status determines access to resources. We comprehensively screened feces from 35 individually known adult females and 7 juveniles in the Serengeti ecosystem for bacteria and eukaryotes, using a set of 48 different amplicons (4 for bacterial 16S, 44 for eukaryote 18S) in a multi-amplicon sequencing approach. We compared sequence abundances to classical coprological egg or oocyst counts. For all parasite taxa detected in more than six samples, the number of sequence reads significantly predicted the number of eggs or oocysts counted, underscoring the value of an amplicon sequencing approach for quantitative measurements of parasite load. In line with our predictions, our results revealed a significantly less diverse microbiome in juveniles than adults and a significantly higher diversity of eukaryotes in high-ranking than low-ranking animals. We propose that free-ranging wildlife can provide an intriguing model system to assess the adaptive value of intestinal biome diversity for both bacteria and eukaryotes.
Heitlinger, Emanuel; Ferreira, Susana C. M.; Thierer, Dagmar; Hofer, Heribert; East, Marion L.
2017-01-01
In mammals, two factors likely to affect the diversity and composition of intestinal bacteria (bacterial microbiome) and eukaryotes (eukaryome) are social status and age. In species in which social status determines access to resources, socially dominant animals maintain better immune processes and health status than subordinates. As high species diversity is an index of ecosystem health, the intestinal biome of healthier, socially dominant animals should be more diverse than those of subordinates. Gradual colonization of the juvenile intestine after birth predicts lower intestinal biome diversity in juveniles than adults. We tested these predictions on the effect of: (1) age (juvenile/adult) and (2) social status (low/high) on bacterial microbiome and eukaryome diversity and composition in the spotted hyena (Crocuta crocuta), a highly social, female-dominated carnivore in which social status determines access to resources. We comprehensively screened feces from 35 individually known adult females and 7 juveniles in the Serengeti ecosystem for bacteria and eukaryotes, using a set of 48 different amplicons (4 for bacterial 16S, 44 for eukaryote 18S) in a multi-amplicon sequencing approach. We compared sequence abundances to classical coprological egg or oocyst counts. For all parasite taxa detected in more than six samples, the number of sequence reads significantly predicted the number of eggs or oocysts counted, underscoring the value of an amplicon sequencing approach for quantitative measurements of parasite load. In line with our predictions, our results revealed a significantly less diverse microbiome in juveniles than adults and a significantly higher diversity of eukaryotes in high-ranking than low-ranking animals. We propose that free-ranging wildlife can provide an intriguing model system to assess the adaptive value of intestinal biome diversity for both bacteria and eukaryotes. PMID:28670573
Heilig, Hans G.H.J.; Zoetendal, Erwin G.; Vaughan, Elaine E.; Marteau, Philippe; Akkermans, Antoon D.L.; de Vos, Willem M.
2002-01-01
A Lactobacillus group-specific PCR primer, S-G-Lab-0677-a-A-17, was developed to selectively amplify 16S ribosomal DNA (rDNA) from lactobacilli and related lactic acid bacteria, including members of the genera Leuconostoc, Pediococcus, and Weissella. Amplicons generated by PCR from a variety of gastrointestinal (GI) tract samples, including those originating from feces and cecum, resulted predominantly in Lactobacillus-like sequences, of which ca. 28% were most similar to the 16S rDNA of Lactobacillus ruminis. Moreover, four sequences of Leuconostoc species were retrieved that, so far, have only been detected in environments other than the GI tract, such as fermented food products. The validity of the primer was further demonstrated by using Lactobacillus-specific PCR and denaturing gradient gel electrophoresis (DGGE) of the 16S rDNA amplicons of fecal and cecal origin from different age groups. The stability of the GI-tract bacterial community in different age groups over various time periods was studied. The Lactobacillus community in three adults over a 2-year period showed variation in composition and stability depending on the individual, while successional change of the Lactobacillus community was observed during the first 5 months of an infant’s life. Furthermore, the specific PCR and DGGE approach was tested to study the retention in fecal samples of a Lactobacillus strain administered during a clinical trial. In conclusion, the combination of specific PCR and DGGE analysis of 16S rDNA amplicons allows the diversity of important groups of bacteria that are present in low numbers in specific ecosystems to be characterized, such as the lactobacilli in the human GI tract. PMID:11772617
Lagares, Antonio; Agaras, Betina; Bettiol, Marisa P; Gatti, Blanca M; Valverde, Claudio
2015-07-01
Species-specific genetic markers are crucial to develop faithful and sensitive molecular methods for the detection and identification of Pseudomonas aeruginosa (Pa). We have previously set up a PCR-RFLP protocol targeting oprF, the gene encoding the genus-specific outer membrane porin F, whose strong conservation and marked sequence diversity allowed detection and differentiation of environmental isolates (Agaras et al., 2012). Here, we evaluated the ability of the PCR-RFLP assay to genotype clinical isolates previously identified as Pa by conventional microbiological methods within a collection of 62 presumptive Pa isolates from different pediatric clinical samples and different sections of the Hospital de Niños "Sor María Ludovica" from La Plata, Argentina. All isolates, but one, gave an oprF amplicon consistent with that from reference Pa strains. The sequence of the smaller-sized amplicon revealed that the isolate was in fact a mendocina Pseudomonas strain. The oprF RFLP pattern generated with TaqI or HaeIII nucleases matched those of reference Pa strains for 59 isolates (96%). The other two Pa isolates (4%) revealed a different RFLP pattern based on HaeIII digestion, although oprF sequencing confirmed that Pa identification was correct. We next tested the effectiveness of the PCR-RFLP to detect pseudomonads on clinical samples of pediatric fibrocystic patients directly without sample cultivation. The expected amplicon and its cognate RFLP profile were obtained for all samples in which Pa was previously detected by cultivation-dependent methods. Altogether, these results provide the basis for the application of the oprF PCR-RFLP protocol to directly detect and identify Pa and other non-Pa pseudomonads in fibrocystic clinical samples. Copyright © 2015 Elsevier B.V. All rights reserved.
Moralli, Daniela; Monaco, Zoia L
2015-02-01
De novo artificial chromosomes expressing genes have been generated in human embryonic stem cells (hESc) and are maintained following differentiation into other cell types. Human artificial chromosomes (HAC) are small, functional, extrachromosomal elements, which behave as normal chromosomes in human cells. De novo HAC are generated following delivery of alpha satellite DNA into target cells. HAC are characterized by high levels of mitotic stability and are used as models to study centromere formation and chromosome organisation. They are successful and effective as gene expression vectors since they remain autonomous and can accommodate larger genes and regulatory regions for long-term expression studies in cells unlike other viral gene delivery vectors currently used. Transferring the essential DNA sequences for HAC formation intact across the cell membrane has been challenging for a number of years. A highly efficient delivery system based on HSV-1 amplicons has been used to target DNA directly to the ES cell nucleus and HAC stably generated in human embryonic stem cells (hESc) at high frequency. HAC were detected using an improved protocol for hESc chromosome harvesting, which consistently produced high-quality metaphase spreads that could routinely detect HAC in hESc. In tumour cells, the input DNA often integrated in the host chromosomes, but in the host ES genome, it remained intact. The hESc containing the HAC formed embryoid bodies, generated teratoma in mice, and differentiated into neuronal cells where the HAC were maintained. The HAC structure and chromatin composition was similar to the endogenous hESc chromosomes. This review will discuss the technological advances in HAC vector delivery using HSV-1 amplicons and the improvements in the identification of de novo HAC in hESc.
Walker, Andreas; Bergmann, Matthias; Camdereli, Jennifer; Kaiser, Rolf; Lübke, Nadine; Timm, Jörg
2017-06-01
HCV treatment options and cure rates have tremendously increased in the last decade. Although a pan-genotype HCV treatment has recently been approved, most DAA therapies are still genotype specific. Resistance-associated variants (RAVs) can limit the efficacy of DAA therapy and are associated with increased risk for therapy failure. With the approval of DAA regimens that recommend resistance testing prior to therapy, correct assessment of the genotype and testing for viruses with RAVs is clinically relevant. However, genotyping and resistance testing is generally done in costly and laborious separate reactions. The aim of the study was to establish a genotype-independent full-genome reverse transcription protocol to generate a template for both genotyping and resistance testing and to implement it into our routine diagnostic setup. The complete HCV genome was reverse transcribed with a pan-genotype primer binding at the 3'end of the viral RNA. This cDNA served as template for transcription of the genotyping amplicon in the core region as well as for the resistance testing of NS3, NS5A, and NS5B. With the established RT-protocol the HCV core region was successfully amplified and genotyped from 124 out of 125 (99.2%) HCV-positive samples. The amplification efficiency of RAV containing regions in NS3, NS5A, NS5B was 96.2%, 96.6% and 94.4%, respectively. We developed a method for HCV full-genome cDNA synthesis and implemented it into a routine diagnostic setup. This cDNA can be used as template for genotyping amplicons covering the core or NS5B region as well as for resistance testing amplicons in NS3, NS5A and NS5B. Copyright © 2017 Elsevier B.V. All rights reserved.