Oh, Hye-Seon; Kwon, Hyemi; Park, Suyeon; Kim, Mijin; Jeon, Min Ji; Kim, Tae Yong; Shong, Young Kee; Kim, Won Bae; Choi, Jene
2018-01-01
Background The BRAFV600E mutation is the most common genetic alteration identified in papillary thyroid carcinoma (PTC). Because of its costs effectiveness and sensitivity, direct Sanger sequencing has several limitations. The aim of this study was to evaluate the efficiency of immunohistochemistry (IHC) as an alternative method to detect the BRAFV600E mutation in preoperative and postoperative tissue samples. Methods We evaluated 71 patients who underwent thyroid surgery with the result of direct sequencing of the BRAFV600E mutation. IHC staining of the BRAFV600E mutation was performed in 49 preoperative and 23 postoperative thyroid specimens. Results Sixty-two patients (87.3%) had PTC, and of these, BRAFV600E was confirmed by direct sequencing in 57 patients (91.9%). In 23 postoperative tissue samples, the BRAFV600E mutation was detected in 16 samples (70%) by direct sequencing and 18 samples (78%) by IHC. In 24 fine needle aspiration (FNA) samples, BRAFV600E was detected in 18 samples (75%) by direct sequencing and 16 samples (67%) by IHC. In 25 core needle biopsy (CNB) samples, the BRAFV600E mutation was detected in 15 samples (60%) by direct sequencing and 16 samples (64%) by IHC. The sensitivity and specificity of IHC for detecting the BRAFV600E mutation were 77.8% and 66.7% in FNA samples and 99.3% and 80.0% in CNB samples. Conclusion IHC could be an alternative method to direct Sanger sequencing for BRAFV600E mutation detection both in postoperative and preoperative samples. However, application of IHC to detect the BRAFV600E mutation in FNA samples is of limited value compared with direct sequencing. PMID:29388401
Tzou, Philip L; Ariyaratne, Pramila; Varghese, Vici; Lee, Charlie; Rakhmanaliev, Elian; Villy, Carolin; Yee, Meiqi; Tan, Kevin; Michel, Gerd; Pinsky, Benjamin A; Shafer, Robert W
2018-06-01
The ability of next-generation sequencing (NGS) technologies to detect low frequency HIV-1 drug resistance mutations (DRMs) not detected by dideoxynucleotide Sanger sequencing has potential advantages for improved patient outcomes. We compared the performance of an in vitro diagnostic (IVD) NGS assay, the Sentosa SQ HIV genotyping assay for HIV-1 genotypic resistance testing, with Sanger sequencing on 138 protease/reverse transcriptase (RT) and 39 integrase sequences. The NGS assay used a 5% threshold for reporting low-frequency variants. The level of complete plus partial nucleotide sequence concordance between Sanger sequencing and NGS was 99.9%. Among the 138 protease/RT sequences, a mean of 6.4 DRMs was identified by both Sanger and NGS, a mean of 0.5 DRM was detected by NGS alone, and a mean of 0.1 DRM was detected by Sanger sequencing alone. Among the 39 integrase sequences, a mean of 1.6 DRMs was detected by both Sanger sequencing and NGS and a mean of 0.15 DRM was detected by NGS alone. Compared with Sanger sequencing, NGS estimated higher levels of resistance to one or more antiretroviral drugs for 18.2% of protease/RT sequences and 5.1% of integrase sequences. There was little evidence for technical artifacts in the NGS sequences, but the G-to-A hypermutation was detected in three samples. In conclusion, the IVD NGS assay evaluated in this study was highly concordant with Sanger sequencing. At the 5% threshold for reporting minority variants, NGS appeared to attain a modestly increased sensitivity for detecting low-frequency DRMs without compromising sequence accuracy. Copyright © 2018 American Society for Microbiology.
Rapid DNA Sequencing by Direct Nanoscale Reading of Nucleotide Bases on Individual DNA Chains
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, James Weifu; Meller, Amit
2007-01-01
Since the independent invention of DNA sequencing by Sanger and by Gilbert 30 years ago, it has grown from a small scale technique capable of reading several kilobase-pair of sequence per day into today's multibillion dollar industry. This growth has spurred the development of new sequencing technologies that do not involve either electrophoresis or Sanger sequencing chemistries. Sequencing by Synthesis (SBS) involves multiple parallel micro-sequencing addition events occurring on a surface, where data from each round is detected by imaging. New High Throughput Technologies for DNA Sequencing and Genomics is the second volume in the Perspectives in Bioanalysis series, whichmore » looks at the electroanalytical chemistry of nucleic acids and proteins, development of electrochemical sensors and their application in biomedicine and in the new fields of genomics and proteomics. The authors have expertly formatted the information for a wide variety of readers, including new developments that will inspire students and young scientists to create new tools for science and medicine in the 21st century. Reviews of complementary developments in Sanger and SBS sequencing chemistries, capillary electrophoresis and microdevice integration, MS sequencing and applications set the framework for the book.« less
Lee, Sung Hak; Chung, Arthur Minwoo; Lee, Ahwon; Oh, Woo Jin; Choi, Yeong Jin; Lee, Youn-Soo; Jung, Eun Sun
2017-01-01
Mutations in the KRAS gene have been identified in approximately 50% of colorectal cancers (CRCs). KRAS mutations are well established biomarkers in anti-epidermal growth factor receptor therapy. Therefore, assessment of KRAS mutations is needed in CRC patients to ensure appropriate treatment. We compared the analytical performance of the cobas test to Sanger sequencing in 264 CRC cases. In addition, discordant specimens were evaluated by 454 pyrosequencing. KRAS mutations for codons 12/13 were detected in 43.2% of cases (114/264) by Sanger sequencing. Of 257 evaluable specimens for comparison, KRAS mutations were detected in 112 cases (43.6%) by Sanger sequencing and 118 cases (45.9%) by the cobas test. Concordance between the cobas test and Sanger sequencing for each lot was 93.8% positive percent agreement (PPA) and 91.0% negative percent agreement (NPA) for codons 12/13. Results from the cobas test and Sanger sequencing were discordant for 20 cases (7.8%). Twenty discrepant cases were subsequently subjected to 454 pyrosequencing. After comprehensive analysis of the results from combined Sanger sequencing-454 pyrosequencing and the cobas test, PPA was 97.5% and NPA was 100%. The cobas test is an accurate and sensitive test for detecting KRAS -activating mutations and has analytical power equivalent to Sanger sequencing. Prescreening using the cobas test with subsequent application of Sanger sequencing is the best strategy for routine detection of KRAS mutations in CRC.
Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc'h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine
2017-01-01
Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus's but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies.
Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc’h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine
2017-01-01
Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus’s but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies. PMID:28362878
Inferring Short-Range Linkage Information from Sequencing Chromatograms
Beggel, Bastian; Neumann-Fraune, Maria; Kaiser, Rolf; Verheyen, Jens; Lengauer, Thomas
2013-01-01
Direct Sanger sequencing of viral genome populations yields multiple ambiguous sequence positions. It is not straightforward to derive linkage information from sequencing chromatograms, which in turn hampers the correct interpretation of the sequence data. We present a method for determining the variants existing in a viral quasispecies in the case of two nearby ambiguous sequence positions by exploiting the effect of sequence context-dependent incorporation of dideoxynucleotides. The computational model was trained on data from sequencing chromatograms of clonal variants and was evaluated on two test sets of in vitro mixtures. The approach achieved high accuracies in identifying the mixture components of 97.4% on a test set in which the positions to be analyzed are only one base apart from each other, and of 84.5% on a test set in which the ambiguous positions are separated by three bases. In silico experiments suggest two major limitations of our approach in terms of accuracy. First, due to a basic limitation of Sanger sequencing, it is not possible to reliably detect minor variants with a relative frequency of no more than 10%. Second, the model cannot distinguish between mixtures of two or four clonal variants, if one of two sets of linear constraints is fulfilled. Furthermore, the approach requires repetitive sequencing of all variants that might be present in the mixture to be analyzed. Nevertheless, the effectiveness of our method on the two in vitro test sets shows that short-range linkage information of two ambiguous sequence positions can be inferred from Sanger sequencing chromatograms without any further assumptions on the mixture composition. Additionally, our model provides new insights into the established and widely used Sanger sequencing technology. The source code of our method is made available at http://bioinf.mpi-inf.mpg.de/publications/beggel/linkageinformation.zip. PMID:24376502
Paul, Fiona; Otte, Jürgen; Schmitt, Imke; Dal Grande, Francesco
2018-06-05
The implementation of HTS (high-throughput sequencing) approaches is rapidly changing our understanding of the lichen symbiosis, by uncovering high bacterial and fungal diversity, which is often host-specific. Recently, HTS methods revealed the presence of multiple photobionts inside a single thallus in several lichen species. This differs from Sanger technology, which typically yields a single, unambiguous algal sequence per individual. Here we compared HTS and Sanger methods for estimating the diversity of green algal symbionts within lichen thalli using 240 lichen individuals belonging to two species of lichen-forming fungi. According to HTS data, Sanger technology consistently yielded the most abundant photobiont sequence in the sample. However, if the second most abundant photobiont exceeded 30% of the total HTS reads in a sample, Sanger sequencing generally failed. Our results suggest that most lichen individuals in the two analyzed species, Lasallia hispanica and L. pustulata, indeed contain a single, predominant green algal photobiont. We conclude that Sanger sequencing is a valid approach to detect the dominant photobionts in lichen individuals and populations. We discuss which research areas in lichen ecology and evolution will continue to benefit from Sanger sequencing, and which areas will profit from HTS approaches to assessing symbiont diversity.
Williams, Emma L; Bagg, Eleanor A L; Mueller, Michael; Vandrovcova, Jana; Aitman, Timothy J; Rumsby, Gill
2015-01-01
Definitive diagnosis of primary hyperoxaluria (PH) currently utilizes sequential Sanger sequencing of the AGXT, GRPHR, and HOGA1 genes but efficacy is unproven. This analysis is time-consuming, relatively expensive, and delays in diagnosis and inappropriate treatment can occur if not pursued early in the diagnostic work-up. We reviewed testing outcomes of Sanger sequencing in 200 consecutive patient samples referred for analysis. In addition, the Illumina Truseq custom amplicon system was evaluated for paralleled next-generation sequencing (NGS) of AGXT,GRHPR, and HOGA1 in 90 known PH patients. AGXT sequencing was requested in all patients, permitting a diagnosis of PH1 in 50%. All remaining patients underwent targeted exon sequencing of GRHPR and HOGA1 with 8% diagnosed with PH2 and 8% with PH3. Complete sequencing of both GRHPR and HOGA1 was not requested in 25% of patients referred leaving their diagnosis in doubt. NGS analysis showed 98% agreement with Sanger sequencing and both approaches had 100% diagnostic specificity. Diagnostic sensitivity of Sanger sequencing was 98% and for NGS it was 97%. NGS has comparable diagnostic performance to Sanger sequencing for the diagnosis of PH and, if implemented, would screen for all forms of PH simultaneously ensuring prompt diagnosis at decreased cost. PMID:25629080
Detection of Rare Mutations in EGFR-ARMS-PCR-Negative Lung Adenocarcinoma by Sanger Sequencing.
Liang, Chaoyue; Wu, Zhuolin; Gan, Xiaohong; Liu, Yuanbin; You, You; Liu, Chenxian; Zhou, Chengzhi; Liang, Ying; Mo, Haiyun; Chen, Allen M; Zhang, Jiexia
2018-01-01
This study aimed to identify potential epidermal growth factor receptor (EGFR) gene mutations in non-small cell lung cancer that went undetected by amplification refractory mutation system-Scorpion real-time PCR (ARMS-PCR). A total of 200 specimens were obtained from the First Affiliated Hospital of Guangzhou Medical University from August 2014 to August 2015. In total, 100 ARMS-negative and 100 ARMS-positive specimens were evaluated for EGFR gene mutations by Sanger sequencing. The methodology and sensitivity of each method and the outcomes of EGFR-tyrosine kinase inhibitor (TKI) therapy were analyzed. Among the 100 ARMS-PCR-positive samples, 90 were positive by Sanger sequencing, while 10 cases were considered negative, because the mutation abundance was less than 10%. Among the 100 negative cases, three were positive for a rare EGFR mutation by Sanger sequencing. In the curative effect analysis of EGFR-TKIs, the progression-free survival (PFS) analysis based on ARMS and Sanger sequencing results showed no difference. However, the PFS of patients with a high abundance of EGFR mutation was 12.4 months [95% confidence interval (CI), 11.6-12.4 months], which was significantly higher than that of patients with a low abundance of mutations detected by Sanger sequencing (95% CI, 10.7-11.3 months) (p<0.001). The ARMS method demonstrated higher sensitivity than Sanger sequencing, but was prone to missing mutations due to primer design. Sanger sequencing was able to detect rare EGFR mutations and deemed applicable for confirming EGFR status. A clinical trial evaluating the efficacy of EGFR-TKIs in patients with rare EGFR mutations is needed. © Copyright: Yonsei University College of Medicine 2018
Kiesler, Kevin M; Coble, Michael D; Hall, Thomas A; Vallone, Peter M
2014-01-01
A set of 711 samples from four U.S. population groups was analyzed using a novel mass spectrometry based method for mitochondrial DNA (mtDNA) base composition profiling. Comparison of the mass spectrometry results with Sanger sequencing derived data yielded a concordance rate of 99.97%. Length heteroplasmy was identified in 46% of samples and point heteroplasmy was observed in 6.6% of samples in the combined mass spectral and Sanger data set. Using discrimination capacity as a metric, Sanger sequencing of the full control region had the highest discriminatory power, followed by the mass spectrometry base composition method, which was more discriminating than Sanger sequencing of just the hypervariable regions. This trend is in agreement with the number of nucleotides covered by each of the three assays. Published by Elsevier Ireland Ltd.
Mu, Wenbo; Lu, Hsiao-Mei; Chen, Jefferey; Li, Shuwei; Elliott, Aaron M
2016-11-01
Next-generation sequencing (NGS) has rapidly replaced Sanger sequencing as the method of choice for diagnostic gene-panel testing. For hereditary-cancer testing, the technical sensitivity and specificity of the assay are paramount as clinicians use results to make important clinical management and treatment decisions. There is significant debate within the diagnostics community regarding the necessity of confirming NGS variant calls by Sanger sequencing, considering that numerous laboratories report having 100% specificity from the NGS data alone. Here we report our results from 20,000 hereditary-cancer NGS panels spanning 47 genes, in which all 7845 nonpolymorphic variants were Sanger- sequenced. Of these, 98.7% were concordant between NGS and Sanger sequencing and 1.3% were identified as NGS false-positives, located mainly in complex genomic regions (A/T-rich regions, G/C-rich regions, homopolymer stretches, and pseudogene regions). Simulating a false-positive rate of zero by adjusting the variant-calling quality-score thresholds decreased the sensitivity of the assay from 100% to 97.8%, resulting in the missed detection of 176 Sanger-confirmed variants, the majority in complex genomic regions (n = 114) and mosaic mutations (n = 7). The data illustrate the importance of setting quality thresholds for panel testing only after thousands of samples have been processed and the necessity of Sanger confirmation of NGS variants to maintain the highest possible sensitivity. Copyright © 2016 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Zopf, Agnes; Raim, Roman; Danzer, Martin; Niklas, Norbert; Spilka, Rita; Pröll, Johannes; Gabriel, Christian; Nechansky, Andreas; Roucka, Markus
2015-03-01
The detection of KRAS mutations in codons 12 and 13 is critical for anti-EGFR therapy strategies; however, only those methodologies with high sensitivity, specificity, and accuracy as well as the best cost and turnaround balance are suitable for routine daily testing. Here we compared the performance of compact sequencing using the novel hybcell technology with 454 next-generation sequencing (454-NGS), Sanger sequencing, and pyrosequencing, using an evaluation panel of 35 specimens. A total of 32 mutations and 10 wild-type cases were reported using 454-NGS as the reference method. Specificity ranged from 100% for Sanger sequencing to 80% for pyrosequencing. Sanger sequencing and hybcell-based compact sequencing achieved a sensitivity of 96%, whereas pyrosequencing had a sensitivity of 88%. Accuracy was 97% for Sanger sequencing, 85% for pyrosequencing, and 94% for hybcell-based compact sequencing. Quantitative results were obtained for 454-NGS and hybcell-based compact sequencing data, resulting in a significant correlation (r = 0.914). Whereas pyrosequencing and Sanger sequencing were not able to detect multiple mutated cell clones within one tumor specimen, 454-NGS and the hybcell-based compact sequencing detected multiple mutations in two specimens. Our comparison shows that the hybcell-based compact sequencing is a valuable alternative to state-of-the-art methodologies used for detection of clinically relevant point mutations.
Sanger sequencing as a first-line approach for molecular diagnosis of Andersen-Tawil syndrome.
Totomoch-Serra, Armando; Marquez, Manlio F; Cervantes-Barragán, David E
2017-01-01
In 1977, Frederick Sanger developed a new method for DNA sequencing based on the chain termination method, now known as the Sanger sequencing method (SSM). Recently, massive parallel sequencing, better known as next-generation sequencing (NGS), is replacing the SSM for detecting mutations in cardiovascular diseases with a genetic background. The present opinion article wants to remark that "targeted" SSM is still effective as a first-line approach for the molecular diagnosis of some specific conditions, as is the case for Andersen-Tawil syndrome (ATS). ATS is described as a rare multisystemic autosomal dominant channelopathy syndrome caused mainly by a heterozygous mutation in the KCNJ2 gene . KCJN2 has particular characteristics that make it attractive for "directed" SSM. KCNJ2 has a sequence of 17,510 base pairs (bp), and a short coding region with two exons (exon 1=166 bp and exon 2=5220 bp), half of the mutations are located in the C-terminal cytosolic domain, a mutational hotspot has been described in residue Arg218, and this gene explains the phenotype in 60% of ATS cases that fulfill all the clinical criteria of the disease. In order to increase the diagnosis of ATS we urge cardiologists to search for facial and muscular abnormalities in subjects with frequent ventricular arrhythmias (especially bigeminy) and prominent U waves on the electrocardiogram.
Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database.
Carver, Tim; Berriman, Matthew; Tivey, Adrian; Patel, Chinmay; Böhme, Ulrike; Barrell, Barclay G; Parkhill, Julian; Rajandream, Marie-Adèle
2008-12-01
Artemis and Artemis Comparison Tool (ACT) have become mainstream tools for viewing and annotating sequence data, particularly for microbial genomes. Since its first release, Artemis has been continuously developed and supported with additional functionality for editing and analysing sequences based on feedback from an active user community of laboratory biologists and professional annotators. Nevertheless, its utility has been somewhat restricted by its limitation to reading and writing from flat files. Therefore, a new version of Artemis has been developed, which reads from and writes to a relational database schema, and allows users to annotate more complex, often large and fragmented, genome sequences. Artemis and ACT have now been extended to read and write directly to the Generic Model Organism Database (GMOD, http://www.gmod.org) Chado relational database schema. In addition, a Gene Builder tool has been developed to provide structured forms and tables to edit coordinates of gene models and edit functional annotation, based on standard ontologies, controlled vocabularies and free text. Artemis and ACT are freely available (under a GPL licence) for download (for MacOSX, UNIX and Windows) at the Wellcome Trust Sanger Institute web sites: http://www.sanger.ac.uk/Software/Artemis/ http://www.sanger.ac.uk/Software/ACT/
Paparini, Andrea; Gofton, Alexander; Yang, Rongchang; White, Nicole; Bunce, Michael; Ryan, Una M
2015-01-01
Cryptosporidium is an important enteric pathogen that infects a wide range of humans and animals. Rapid and reliable detection and characterisation methods are essential for understanding the transmission dynamics of the parasite. Sanger sequencing, and high-throughput sequencing (HTS) on an Ion Torrent platform, were compared with each other for their sensitivity and accuracy in detecting and characterising 25 Cryptosporidium-positive human and animal faecal samples. Ion Torrent reads (n = 123,857) were obtained at both 18S rRNA and actin loci for 21 of the 25 samples. Of these, one isolate at the actin locus (Cattle 05) and three at the 18S rRNA locus (HTS 10, HTS 11 and HTS 12), suffered PCR drop-out (i.e. PCR failures) when using fusion-tagged PCR. Sanger sequences were obtained for both loci for 23 of the 25 samples and showed good agreement with Ion Torrent-based genotyping. Two samples both from pythons (SK 02 and SK 05) produced mixed 18S and actin chromatograms by Sanger sequencing but were clearly identified by Ion Torrent sequencing as C. muris. One isolate (SK 03) was typed as C. muris by Sanger sequencing but was identified as a mixed C. muris and C. tyzzeri infection by HTS. 18S rRNA Type B sequences were identified in 4/6 C. parvum isolates when deep sequenced but were undetected in Sanger sequencing. Sanger was cheaper than Ion Torrent when sequencing a small numbers of samples, but when larger numbers of samples are considered (n = 60), the costs were comparative. Fusion-tagged amplicon based approaches are a powerful way of approaching mixtures, the only draw-back being the loss of PCR efficiency on low-template samples when using primers coupled to MID tags and adaptors. Taken together these data show that HTS has excellent potential for revealing the "true" composition of species/types in a Cryptosporidium infection, but that HTS workflows need to be carefully developed to ensure sensitivity, accuracy and contamination are controlled. Copyright © 2015 Elsevier Inc. All rights reserved.
Pandey, Ram Vinay; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas
2016-01-01
Traditional Sanger sequencing as well as Next-Generation Sequencing have been used for the identification of disease causing mutations in human molecular research. The majority of currently available tools are developed for research and explorative purposes and often do not provide a complete, efficient, one-stop solution. As the focus of currently developed tools is mainly on NGS data analysis, no integrative solution for the analysis of Sanger data is provided and consequently a one-stop solution to analyze reads from both sequencing platforms is not available. We have therefore developed a new pipeline called MutAid to analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and analysis of Sanger and NGS data under a single platform. It is capable of analyzing reads from multiple patients in a single run to create a list of potential disease causing base substitutions as well as insertions and deletions. MutAid has been developed for expert and non-expert users and supports four sequencing platforms including Sanger, Illumina, 454 and Ion Torrent. Furthermore, for NGS data analysis, five read mappers including BWA, TMAP, Bowtie, Bowtie2 and GSNAP and four variant callers including GATK-HaplotypeCaller, SAMTOOLS, Freebayes and VarScan2 pipelines are supported. MutAid is freely available at https://sourceforge.net/projects/mutaid.
Pandey, Ram Vinay; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas
2016-01-01
Traditional Sanger sequencing as well as Next-Generation Sequencing have been used for the identification of disease causing mutations in human molecular research. The majority of currently available tools are developed for research and explorative purposes and often do not provide a complete, efficient, one-stop solution. As the focus of currently developed tools is mainly on NGS data analysis, no integrative solution for the analysis of Sanger data is provided and consequently a one-stop solution to analyze reads from both sequencing platforms is not available. We have therefore developed a new pipeline called MutAid to analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and analysis of Sanger and NGS data under a single platform. It is capable of analyzing reads from multiple patients in a single run to create a list of potential disease causing base substitutions as well as insertions and deletions. MutAid has been developed for expert and non-expert users and supports four sequencing platforms including Sanger, Illumina, 454 and Ion Torrent. Furthermore, for NGS data analysis, five read mappers including BWA, TMAP, Bowtie, Bowtie2 and GSNAP and four variant callers including GATK-HaplotypeCaller, SAMTOOLS, Freebayes and VarScan2 pipelines are supported. MutAid is freely available at https://sourceforge.net/projects/mutaid. PMID:26840129
Altimari, Annalisa; de Biase, Dario; De Maglio, Giovanna; Gruppioni, Elisa; Capizzi, Elisa; Degiovanni, Alessio; D’Errico, Antonia; Pession, Annalisa; Pizzolitto, Stefano; Fiorentino, Michelangelo; Tallini, Giovanni
2013-01-01
Detection of KRAS mutations in archival pathology samples is critical for therapeutic appropriateness of anti-EGFR monoclonal antibodies in colorectal cancer. We compared the sensitivity, specificity, and accuracy of Sanger sequencing, ARMS-Scorpion (TheraScreen®) real-time polymerase chain reaction (PCR), pyrosequencing, chip array hybridization, and 454 next-generation sequencing to assess KRAS codon 12 and 13 mutations in 60 nonconsecutive selected cases of colorectal cancer. Twenty of the 60 cases were detected as wild-type KRAS by all methods with 100% specificity. Among the 40 mutated cases, 13 were discrepant with at least one method. The sensitivity was 85%, 90%, 93%, and 92%, and the accuracy was 90%, 93%, 95%, and 95% for Sanger sequencing, TheraScreen real-time PCR, pyrosequencing, and chip array hybridization, respectively. The main limitation of Sanger sequencing was its low analytical sensitivity, whereas TheraScreen real-time PCR, pyrosequencing, and chip array hybridization showed higher sensitivity but suffered from the limitations of predesigned assays. Concordance between the methods was k = 0.79 for Sanger sequencing and k > 0.85 for the other techniques. Tumor cell enrichment correlated significantly with the abundance of KRAS-mutated deoxyribonucleic acid (DNA), evaluated as ΔCt for TheraScreen real-time PCR (P = 0.03), percentage of mutation for pyrosequencing (P = 0.001), ratio for chip array hybridization (P = 0.003), and percentage of mutation for 454 next-generation sequencing (P = 0.004). Also, 454 next-generation sequencing showed the best cross correlation for quantification of mutation abundance compared with all the other methods (P < 0.001). Our comparison showed the superiority of next-generation sequencing over the other techniques in terms of sensitivity and specificity. Next-generation sequencing will replace Sanger sequencing as the reference technique for diagnostic detection of KRAS mutation in archival tumor tissues. PMID:23950653
Haemonchus contortus: Genome Structure, Organization and Comparative Genomics.
Laing, R; Martinelli, A; Tracey, A; Holroyd, N; Gilleard, J S; Cotton, J A
2016-01-01
One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. Copyright © 2016 Elsevier Ltd. All rights reserved.
[Molecular and prenatal diagnosis of a family with Fanconi anemia by next generation sequencing].
Gong, Zhuwen; Yu, Yongguo; Zhang, Qigang; Gu, Xuefan
2015-04-01
To provide prenatal diagnosis for a pregnant woman who had given birth to a child with Fanconi anemia with combined next-generation sequencing (NGS) and Sanger sequencing. For the affected child, potential mutations of the FANCA gene were analyzed with NGS. Suspected mutation was verified with Sanger sequencing. For prenatal diagnosis, genomic DNA was extracted from cultured fetal amniotic fluid cells and subjected to analysis of the same mutations. A low-frequency frameshifting mutation c.989_995del7 (p.H330LfsX2, inherited from his father) and a truncating mutation c.3971C>T (p.P1324L, inherited from his mother) have been identified in the affected child and considered to be pathogenic. The two mutations were subsequently verified by Sanger sequencing. Upon prenatal diagnosis, the fetus was found to carry two mutations. The combined next-generation sequencing and Sanger sequencing can reduce the time for diagnosis and identify subtypes of Fanconi anemia and the mutational sites, which has enabled reliable prenatal diagnosis of this disease.
Abdelrahman, Tamer; Hughes, Joseph; Main, Janice; McLauchlan, John; Thursz, Mark; Thomson, Emma
2015-01-01
High rates of sexually transmitted infection and reinfection with hepatitis C virus (HCV) have recently been reported in human immunodeficiency virus (HIV)-infected men who have sex with men and reinfection has also been described in monoinfected injecting drug users. The diagnosis of reinfection has traditionally been based on direct Sanger sequencing of samples pre- and posttreatment, but not on more sensitive deep sequencing techniques. We studied viral quasispecies dynamics in patients who failed standard of care therapy in a high-risk HIV-infected cohort of patients with early HCV infection to determine whether treatment failure was associated with reinfection or recrudescence of preexisting infection. Paired sequences (pre- and posttreatment) were analyzed. The HCV E2 hypervariable region-1 was amplified using nested reverse-transcription polymerase chain reaction (RT-PCR) with indexed genotype-specific primers and the same products were sequenced using both Sanger and 454 pyrosequencing approaches. Of 99 HIV-infected patients with acute HCV treated with 24-48 weeks of pegylated interferon alpha and ribavirin, 15 failed to achieve a sustained virological response (six relapsed, six had a null response, and three had a partial response). Using direct sequencing, 10/15 patients (66%) had evidence of a previously undetected strain posttreatment; in many studies, this is interpreted as reinfection. However, pyrosequencing revealed that 15/15 (100%) of patients had evidence of persisting infection; 6/15 (40%) patients had evidence of a previously undetected variant present in the posttreatment sample in addition to a variant that was detected at baseline. This could represent superinfection or a limitation of the sensitivity of pyrosequencing. In this high-risk group, the emergence of new viral strains following treatment failure is most commonly associated with emerging dominance of preexisting minority variants rather than reinfection. Superinfection may occur in this cohort but reinfection is overestimated by Sanger sequencing. © 2014 The Authors. Hepatology published by Wiley on behalf of the American Association for the Study of Liver Diseases.
Detection of a divergent variant of grapevine virus F by next-generation sequencing.
Molenaar, Nicholas; Burger, Johan T; Maree, Hans J
2015-08-01
The complete genome sequence of a South African isolate of grapevine virus F (GVF) is presented. It was first detected by metagenomic next-generation sequencing of field samples and validated through direct Sanger sequencing. The genome sequence of GVF isolate V5 consists of 7539 nucleotides and contains a poly(A) tail. It has a typical vitivirus genome arrangement that comprises five open reading frames (ORFs), which share only 88.96 % nucleotide sequence identity with the existing complete GVF genome sequence (JX105428).
Neupauerová, Jana; Grečmalová, Dagmar; Seeman, Pavel; Laššuthová, Petra
2016-05-01
We describe a patient with early onset severe axonal Charcot-Marie-Tooth disease (CMT2) with dominant inheritance, in whom Sanger sequencing failed to detect a mutation in the mitofusin 2 (MFN2) gene because of a single nucleotide polymorphism (rs2236057) under the PCR primer sequence. The severe early onset phenotype and the family history with severely affected mother (died after delivery) was very suggestive of CMT2A and this suspicion was finally confirmed by a MFN2 mutation. The mutation p.His361Tyr was later detected in the patient by massively parallel sequencing with a gene panel for hereditary neuropathies. According to this information, new primers for amplification and sequencing were designed which bind away from the polymorphic sites of the patient's DNA. Sanger sequencing with these new primers then confirmed the heterozygous mutation in the MFN2 gene in this patient. This case report shows that massively parallel sequencing may in some rare cases be more sensitive than Sanger sequencing and highlights the importance of accurate primer design which requires special attention. © 2016 John Wiley & Sons Ltd/University College London.
Pitfalls in genetic testing: the story of missed SCN1A mutations.
Djémié, Tania; Weckhuysen, Sarah; von Spiczak, Sarah; Carvill, Gemma L; Jaehn, Johanna; Anttonen, Anna-Kaisa; Brilstra, Eva; Caglayan, Hande S; de Kovel, Carolien G; Depienne, Christel; Gaily, Eija; Gennaro, Elena; Giraldez, Beatriz G; Gormley, Padhraig; Guerrero-López, Rosa; Guerrini, Renzo; Hämäläinen, Eija; Hartmann, Corinna; Hernandez-Hernandez, Laura; Hjalgrim, Helle; Koeleman, Bobby P C; Leguern, Eric; Lehesjoki, Anna-Elina; Lemke, Johannes R; Leu, Costin; Marini, Carla; McMahon, Jacinta M; Mei, Davide; Møller, Rikke S; Muhle, Hiltrud; Myers, Candace T; Nava, Caroline; Serratosa, Jose M; Sisodiya, Sanjay M; Stephani, Ulrich; Striano, Pasquale; van Kempen, Marjan J A; Verbeek, Nienke E; Usluer, Sunay; Zara, Federico; Palotie, Aarno; Mefford, Heather C; Scheffer, Ingrid E; De Jonghe, Peter; Helbig, Ingo; Suls, Arvid
2016-07-01
Sanger sequencing, still the standard technique for genetic testing in most diagnostic laboratories and until recently widely used in research, is gradually being complemented by next-generation sequencing (NGS). No single mutation detection technique is however perfect in identifying all mutations. Therefore, we wondered to what extent inconsistencies between Sanger sequencing and NGS affect the molecular diagnosis of patients. Since mutations in SCN1A, the major gene implicated in epilepsy, are found in the majority of Dravet syndrome (DS) patients, we focused on missed SCN1A mutations. We sent out a survey to 16 genetic centers performing SCN1A testing. We collected data on 28 mutations initially missed using Sanger sequencing. All patients were falsely reported as SCN1A mutation-negative, both due to technical limitations and human errors. We illustrate the pitfalls of Sanger sequencing and most importantly provide evidence that SCN1A mutations are an even more frequent cause of DS than already anticipated.
Comparison of the Equine Reference Sequence with Its Sanger Source Data and New Illumina Reads
Rebolledo-Mendez, Jovan; Hestand, Matthew S.; Coleman, Stephen J.; Zeng, Zheng; Orlando, Ludovic; MacLeod, James N.; Kalbfleisch, Ted
2015-01-01
The reference assembly for the domestic horse, EquCab2, published in 2009, was built using approximately 30 million Sanger reads from a Thoroughbred mare named Twilight. Contiguity in the assembly was facilitated using nearly 315 thousand BAC end sequences from Twilight’s half brother Bravo. Since then, it has served as the foundation for many genome-wide analyses that include not only the modern horse, but ancient horses and other equid species as well. As data mapped to this reference has accumulated, consistent variation between mapped datasets and the reference, in terms of regions with no read coverage, single nucleotide variants, and small insertions/deletions have become apparent. In many cases, it is not clear whether these differences are the result of true sequence variation between the research subjects’ and Twilight’s genome or due to errors in the reference. EquCab2 is regarded as “The Twilight Assembly.” The objective of this study was to identify inconsistencies between the EquCab2 assembly and the source Twilight Sanger data used to build it. To that end, the original Sanger and BAC end reads have been mapped back to this equine reference and assessed with the addition of approximately 40X coverage of new Illumina Paired-End sequence data. The resulting mapped datasets identify those regions with low Sanger read coverage, as well as variation in genomic content that is not consistent with either the original Twilight Sanger data or the new genomic sequence data generated from Twilight on the Illumina platform. As the haploid EquCab2 reference assembly was created using Sanger reads derived largely from a single individual, the vast majority of variation detected in a mapped dataset comprised of those same Sanger reads should be heterozygous. In contrast, homozygous variations would represent either errors in the reference or contributions from Bravo's BAC end sequences. Our analysis identifies 720,843 homozygous discrepancies between new, high throughput genomic sequence data generated for Twilight and the EquCab2 reference assembly. Most of these represent errors in the assembly, while approximately 10,000 are demonstrated to be contributions from another horse. Other results are presented that include the binary alignment map file of the mapped Sanger reads, a list of variants identified as discrepancies between the source data and resulting reference, and a BED annotation file that lists the regions of the genome whose consensus was likely derived from low coverage alignments. PMID:26107638
LipidSeq: a next-generation clinical resequencing panel for monogenic dyslipidemias.
Johansen, Christopher T; Dubé, Joseph B; Loyzer, Melissa N; MacDonald, Austin; Carter, David E; McIntyre, Adam D; Cao, Henian; Wang, Jian; Robinson, John F; Hegele, Robert A
2014-04-01
We report the design of a targeted resequencing panel for monogenic dyslipidemias, LipidSeq, for the purpose of replacing Sanger sequencing in the clinical detection of dyslipidemia-causing variants. We also evaluate the performance of the LipidSeq approach versus Sanger sequencing in 84 patients with a range of phenotypes including extreme blood lipid concentrations as well as additional dyslipidemias and related metabolic disorders. The panel performs well, with high concordance (95.2%) in samples with known mutations based on Sanger sequencing and a high detection rate (57.9%) of mutations likely to be causative for disease in samples not previously sequenced. Clinical implementation of LipidSeq has the potential to aid in the molecular diagnosis of patients with monogenic dyslipidemias with a high degree of speed and accuracy and at lower cost than either Sanger sequencing or whole exome sequencing. Furthermore, LipidSeq will help to provide a more focused picture of monogenic and polygenic contributors that underlie dyslipidemia while excluding the discovery of incidental pathogenic clinically actionable variants in nonmetabolism-related genes, such as oncogenes, that would otherwise be identified by a whole exome approach, thus minimizing potential ethical issues.
LipidSeq: a next-generation clinical resequencing panel for monogenic dyslipidemias[S
Johansen, Christopher T.; Dubé, Joseph B.; Loyzer, Melissa N.; MacDonald, Austin; Carter, David E.; McIntyre, Adam D.; Cao, Henian; Wang, Jian; Robinson, John F.; Hegele, Robert A.
2014-01-01
We report the design of a targeted resequencing panel for monogenic dyslipidemias, LipidSeq, for the purpose of replacing Sanger sequencing in the clinical detection of dyslipidemia-causing variants. We also evaluate the performance of the LipidSeq approach versus Sanger sequencing in 84 patients with a range of phenotypes including extreme blood lipid concentrations as well as additional dyslipidemias and related metabolic disorders. The panel performs well, with high concordance (95.2%) in samples with known mutations based on Sanger sequencing and a high detection rate (57.9%) of mutations likely to be causative for disease in samples not previously sequenced. Clinical implementation of LipidSeq has the potential to aid in the molecular diagnosis of patients with monogenic dyslipidemias with a high degree of speed and accuracy and at lower cost than either Sanger sequencing or whole exome sequencing. Furthermore, LipidSeq will help to provide a more focused picture of monogenic and polygenic contributors that underlie dyslipidemia while excluding the discovery of incidental pathogenic clinically actionable variants in nonmetabolism-related genes, such as oncogenes, that would otherwise be identified by a whole exome approach, thus minimizing potential ethical issues. PMID:24503134
Fantin, Yuri S.; Neverov, Alexey D.; Favorov, Alexander V.; Alvarez-Figueroa, Maria V.; Braslavskaya, Svetlana I.; Gordukova, Maria A.; Karandashova, Inga V.; Kuleshov, Konstantin V.; Myznikova, Anna I.; Polishchuk, Maya S.; Reshetov, Denis A.; Voiciehovskaya, Yana A.; Mironov, Andrei A.; Chulanov, Vladimir P.
2013-01-01
Sanger sequencing is a common method of reading DNA sequences. It is less expensive than high-throughput methods, and it is appropriate for numerous applications including molecular diagnostics. However, sequencing mixtures of similar DNA of pathogens with this method is challenging. This is important because most clinical samples contain such mixtures, rather than pure single strains. The traditional solution is to sequence selected clones of PCR products, a complicated, time-consuming, and expensive procedure. Here, we propose the base-calling with vocabulary (BCV) method that computationally deciphers Sanger chromatograms obtained from mixed DNA samples. The inputs to the BCV algorithm are a chromatogram and a dictionary of sequences that are similar to those we expect to obtain. We apply the base-calling function on a test dataset of chromatograms without ambiguous positions, as well as one with 3–14% sequence degeneracy. Furthermore, we use BCV to assemble a consensus sequence for an HIV genome fragment in a sample containing a mixture of viral DNA variants and to determine the positions of the indels. Finally, we detect drug-resistant Mycobacterium tuberculosis strains carrying frameshift mutations mixed with wild-type bacteria in the pncA gene, and roughly characterize bacterial communities in clinical samples by direct 16S rRNA sequencing. PMID:23382983
Experience of targeted Usher exome sequencing as a clinical test
Besnard, Thomas; García-García, Gema; Baux, David; Vaché, Christel; Faugère, Valérie; Larrieu, Lise; Léonard, Susana; Millan, Jose M; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise
2014-01-01
We show that massively parallel targeted sequencing of 19 genes provides a new and reliable strategy for molecular diagnosis of Usher syndrome (USH) and nonsyndromic deafness, particularly appropriate for these disorders characterized by a high clinical and genetic heterogeneity and a complex structure of several of the genes involved. A series of 71 patients including Usher patients previously screened by Sanger sequencing plus newly referred patients was studied. Ninety-eight percent of the variants previously identified by Sanger sequencing were found by next-generation sequencing (NGS). NGS proved to be efficient as it offers analysis of all relevant genes which is laborious to reach with Sanger sequencing. Among the 13 newly referred Usher patients, both mutations in the same gene were identified in 77% of cases (10 patients) and one candidate pathogenic variant in two additional patients. This work can be considered as pilot for implementing NGS for genetically heterogeneous diseases in clinical service. PMID:24498627
Deng, Yi-Mo; Spirason, Natalie; Iannello, Pina; Jelley, Lauren; Lau, Hilda; Barr, Ian G
2015-07-01
Full genome sequencing of influenza A viruses (IAV), including those that arise from annual influenza epidemics, is undertaken to determine if reassorting has occurred or if other pathogenic traits are present. Traditionally IAV sequencing has been biased toward the major surface glycoproteins haemagglutinin and neuraminidase, while the internal genes are often ignored. Despite the development of next generation sequencing (NGS), many laboratories are still reliant on conventional Sanger sequencing to sequence IAV. To develop a minimal and robust set of primers for Sanger sequencing of the full genome of IAV currently circulating in humans. A set of 13 primer pairs was designed that enabled amplification of the six internal genes of multiple human IAV subtypes including the recent avian influenza A(H7N9) virus from China. Specific primers were designed to amplify the HA and NA genes of each IAV subtype of interest. Each of the primers also incorporated a binding site at its 5'-end for either a forward or reverse M13 primer, such that only two M13 primers were required for all subsequent sequencing reactions. This minimal set of primers was suitable for sequencing the six internal genes of all currently circulating human seasonal influenza A subtypes as well as the avian A(H7N9) viruses that have infected humans in China. This streamlined Sanger sequencing protocol could be used to generate full genome sequence data more rapidly and easily than existing influenza genome sequencing protocols. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Mottishaw, Jeffery D.; Erck, Adam R.; Kramer, Jordan H.; Sun, Haoran; Koppang, Miles
2015-01-01
Frederick Sanger's early work on protein sequencing through the use of colorimetric labeling combined with liquid chromatography involves an important nucleophilic aromatic substitution (S[subscript N]Ar) reaction in which the N-terminus of a protein is tagged with Sanger's reagent. Understanding the inherent differences between this S[subscript…
Single-cell genomic sequencing using Multiple Displacement Amplification.
Lasken, Roger S
2007-10-01
Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
Next-generation sequencing for genetic testing of familial colorectal cancer syndromes.
Simbolo, Michele; Mafficini, Andrea; Agostini, Marco; Pedrazzani, Corrado; Bedin, Chiara; Urso, Emanuele D; Nitti, Donato; Turri, Giona; Scardoni, Maria; Fassan, Matteo; Scarpa, Aldo
2015-01-01
Genetic screening in families with high risk to develop colorectal cancer (CRC) prevents incurable disease and permits personalized therapeutic and follow-up strategies. The advancement of next-generation sequencing (NGS) technologies has revolutionized the throughput of DNA sequencing. A series of 16 probands for either familial adenomatous polyposis (FAP; 8 cases) or hereditary nonpolyposis colorectal cancer (HNPCC; 8 cases) were investigated for intragenic mutations in five CRC familial syndromes-associated genes (APC, MUTYH, MLH1, MSH2, MSH6) applying both a custom multigene Ion AmpliSeq NGS panel and conventional Sanger sequencing. Fourteen pathogenic variants were detected in 13/16 FAP/HNPCC probands (81.3 %); one FAP proband presented two co-existing pathogenic variants, one in APC and one in MUTYH. Thirteen of these 14 pathogenic variants were detected by both NGS and Sanger, while one MSH2 mutation (L280FfsX3) was identified only by Sanger sequencing. This is due to a limitation of the NGS approach in resolving sequences close or within homopolymeric stretches of DNA. To evaluate the performance of our NGS custom panel we assessed its capability to resolve the DNA sequences corresponding to 2225 pathogenic variants reported in the COSMIC database for APC, MUTYH, MLH1, MSH2, MSH6. Our NGS custom panel resolves the sequences where 2108 (94.7 %) of these variants occur. The remaining 117 mutations reside inside or in close proximity to homopolymer stretches; of these 27 (1.2 %) are imprecisely identified by the software but can be resolved by visual inspection of the region, while the remaining 90 variants (4.0 %) are blind spots. In summary, our custom panel would miss 4 % (90/2225) of pathogenic variants that would need a small set of Sanger sequencing reactions to be solved. The multiplex NGS approach has the advantage of analyzing multiple genes in multiple samples simultaneously, requiring only a reduced number of Sanger sequences to resolve homopolymeric DNA regions not adequately assessed by NGS. The implementation of NGS approaches in routine diagnostics of familial CRC is cost-effective and significantly reduces diagnostic turnaround times.
Janecek, Elisabeth; Streichan, Sabine; Strube, Christina
2012-10-18
Rickettsioses are caused by pathogenic species of the genus Rickettsia and play an important role as emerging diseases. The bacteria are transmitted to mammal hosts including humans by arthropod vectors. Since detection, especially in tick vectors, is usually based on PCR with genus-specific primers to include different occurring Rickettsia species, subsequent species identification is mainly achieved by Sanger sequencing. In the present study a real-time pyrosequencing approach was established with the objective to differentiate between species occurring in German Ixodes ticks, which are R. helvetica, R. monacensis, R. massiliae, and R. felis. Tick material from a quantitative real-time PCR (qPCR) based study on Rickettsia-infections in I. ricinus allowed direct comparison of both sequencing techniques, Sanger and real-time pyrosequencing. A sequence stretch of rickettsial citrate synthase (gltA) gene was identified to contain divergent single nucleotide polymorphism (SNP) sites suitable for Rickettsia species differentiation. Positive control plasmids inserting the respective target sequence of each Rickettsia species of interest were constructed for initial establishment of the real-time pyrosequencing approach using Qiagen's PSQ 96MA Pyrosequencing System operating in a 96-well format. The approach included an initial amplification reaction followed by the actual pyrosequencing, which is traceable by pyrograms in real-time. Afterwards, real-time pyrosequencing was applied to 263 Ixodes tick samples already detected Rickettsia-positive in previous qPCR experiments. Establishment of real-time pyrosequencing using positive control plasmids resulted in accurate detection of all SNPs in all included Rickettsia species. The method was then applied to 263 Rickettsia-positive Ixodes ricinus samples, of which 153 (58.2%) could be identified for their species (151 R. helvetica and 2 R. monacensis) by previous custom Sanger sequencing. Real-time pyrosequencing identified all Sanger-determined ticks as well as 35 previously undifferentiated ticks resulting in a total number of 188 (71.5%) identified samples. Pyrosequencing sensitivity was found to be strongly dependent on gltA copy numbers in the reaction setup. Whereas less than 101 copies in the initial amplification reaction resulted in identification of 15.1% of the samples only, the percentage increased to 54.2% at 101-102 copies, to 95.6% at >102-103 copies and reached 100% samples identified for their Rickettsia species if more than 103 copies were present in the template. The established real-time pyrosequencing approach represents a reliable method for detection and differentiation of Rickettsia spp. present in I. ricinus diagnostic material and prevalence studies. Furthermore, the method proved to be faster, more cost-effective as well as more sensitive than custom Sanger sequencing with simultaneous high specificity.
Thomas, Vincent; Mazard, Blandine; Garcia, Caroline; Lacan, Philippe; Gagnieu, Marie-Claude; Joly, Philippe
2013-09-23
Minucci et al. have proposed in 2010 a rapid, simple and cost-effective HRM method on the LightCycler 480® apparatus (Roche) for the determination of the 6/6, 6/7 and 7/7 genotypes of the (TA)n UGT1A1 promoter polymorphism. However, they have not studied the n=5 and n=8 alleles which can be quite frequent in sickle-cell disease patients. The aim of our study was to test this HRM protocol to all the 10 possible (TA)n UGT1A1 genotypes (i.e. 5/5, 5/6, 5/7, 5/8, 6/6, 6/7, 6/8, 7/7, 7/8 and 8/8) by using our SCD cohort of patients. All genotypes could be unambiguously identified except 6/7 and 6/8 which give a similar HRM profile. For those two genotypes, the differentiation necessitates either a direct Sanger sequencing or a second PCR protocol followed by a 3% agarose gel migration. For the (TA)n UGT1A1 promoter genotyping of African patients, each lab has to wonder what is the best way between (i) direct Sanger sequencing of all patients and (ii) HRM protocol for all patients followed by a complementary analysis to differentiate the 6/7 and 6/8 genotypes. © 2013. Published by Elsevier B.V. All rights reserved.
Riman, Sarah; Kiesler, Kevin M; Borsuk, Lisa A; Vallone, Peter M
2017-07-01
Standard Reference Materials SRM 2392 and 2392-I are intended to provide quality control when amplifying and sequencing human mitochondrial genome sequences. The National Institute of Standards and Technology (NIST) offers these SRMs to laboratories performing DNA-based forensic human identification, molecular diagnosis of mitochondrial diseases, mutation detection, evolutionary anthropology, and genetic genealogy. The entire mtGenome (∼16569bp) of SRM 2392 and 2392-I have previously been characterized at NIST by Sanger sequencing. Herein, we used the sensitivity, specificity, and accuracy offered by next generation sequencing (NGS) to: (1) re-sequence the certified values of the SRM 2392 and 2392-I; (2) confirm Sanger data with a high coverage new sequencing technology; (3) detect lower level heteroplasmies (<20%); and thus (4) support mitochondrial sequencing communities in the adoption of NGS methods. To obtain a consensus sequence for the SRMs as well as identify and control any bias, sequencing was performed using two NGS platforms and data was analyzed using different bioinformatics pipelines. Our results confirm five low level heteroplasmy sites that were not previously observed with Sanger sequencing: three sites in the GM09947A template in SRM 2392 and two sites in the HL-60 template in SRM 2392-I. Copyright © 2017 Elsevier B.V. All rights reserved.
van den Akker, Jeroen; Mishne, Gilad; Zimmer, Anjali D; Zhou, Alicia Y
2018-04-17
Next generation sequencing (NGS) has become a common technology for clinical genetic tests. The quality of NGS calls varies widely and is influenced by features like reference sequence characteristics, read depth, and mapping accuracy. With recent advances in NGS technology and software tools, the majority of variants called using NGS alone are in fact accurate and reliable. However, a small subset of difficult-to-call variants that still do require orthogonal confirmation exist. For this reason, many clinical laboratories confirm NGS results using orthogonal technologies such as Sanger sequencing. Here, we report the development of a deterministic machine-learning-based model to differentiate between these two types of variant calls: those that do not require confirmation using an orthogonal technology (high confidence), and those that require additional quality testing (low confidence). This approach allows reliable NGS-based calling in a clinical setting by identifying the few important variant calls that require orthogonal confirmation. We developed and tested the model using a set of 7179 variants identified by a targeted NGS panel and re-tested by Sanger sequencing. The model incorporated several signals of sequence characteristics and call quality to determine if a variant was identified at high or low confidence. The model was tuned to eliminate false positives, defined as variants that were called by NGS but not confirmed by Sanger sequencing. The model achieved very high accuracy: 99.4% (95% confidence interval: +/- 0.03%). It categorized 92.2% (6622/7179) of the variants as high confidence, and 100% of these were confirmed to be present by Sanger sequencing. Among the variants that were categorized as low confidence, defined as NGS calls of low quality that are likely to be artifacts, 92.1% (513/557) were found to be not present by Sanger sequencing. This work shows that NGS data contains sufficient characteristics for a machine-learning-based model to differentiate low from high confidence variants. Additionally, it reveals the importance of incorporating site-specific features as well as variant call features in such a model.
Martinuzzi, Claudia; Pastorino, Lorenza; Andreotti, Virginia; Garuti, Anna; Minuto, Michele; Fiocca, Roberto; Bianchi-Scarrà, Giovanna; Ghiorzo, Paola; Grillo, Federica; Mastracci, Luca
2016-09-01
The optimal method for BRAF mutation detection remains to be determined despite advances in molecular detection techniques. The aim of this study was to compare, against classical Sanger sequencing, the diagnostic performance of two of the most recently developed, highly sensitive methods: BRAF V600E immunohistochemistry (IHC) and peptide nucleic-acid (PNA)-clamp qPCR. BRAF exon 15 mutations were searched in formalin-fixed paraffin-embedded tissues from 86 papillary thyroid carcinoma using the three methods. The limits of detection of Sanger sequencing in borderline or discordant cases were quantified by next generation sequencing. BRAF mutations were found in 74.4 % of cases by PNA, in 71 % of cases by IHC, and in 64 % of cases by Sanger sequencing. Complete concordance for the three methods was observed in 80 % of samples. Better concordance was observed with the combination of two methods, particularly PNA and IHC (59/64) (92 %), while the combination of PNA and Sanger was concordant in 55 cases (86 %). Sensitivity of the three methods was 99 % for PNA, 94.2 % for IHC, and 89.5 % for Sanger. Our data show that IHC could be used as a cost-effective, first-line method for BRAF V600E detection in daily practice, followed by PNA analysis in negative or uninterpretable cases, as the most efficient method. PNA-clamp quantitative PCR is highly sensitive and complementary to IHC as it also recognizes other mutations besides V600E and it is suitable for diagnostic purposes.
Singh, Aditya; Bhatia, Prateek
2016-12-01
Sanger sequencing platforms, such as applied biosystems instruments, generate chromatogram files. Generally, for 1 region of a sequence, we use both forward and reverse primers to sequence that area, in that way, we have 2 sequences that need to be aligned and a consensus generated before mutation detection studies. This work is cumbersome and takes time, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. In full capabilities of Automated Mutation Analysis Pipeline (ASAP), it is able to read "*.ab1" chromatogram files through command line interface, convert it to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All files are created and can be used for further analysis. ASAP is available as Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
Huang, Hui; Chen, Yanhua; Chen, Huishuang; Ma, Yuanyuan; Chiang, Pei-Wen; Zhong, Jing; Liu, Xuyang; Asan; Wu, Jing; Su, Yan; Li, Xin; Deng, Jianlian; Huang, Yingping; Zhang, Xinxin; Li, Yang; Fan, Ning; Wang, Ying; Tang, Lihui; Shen, Jinting; Chen, Meiyan; Zhang, Xiuqing; Te, Deng; Banerjee, Santasree; Liu, Hui; Qi, Ming; Yi, Xin
2018-01-01
Inherited eye diseases are major causes of vision loss in both children and adults. Inherited eye diseases are characterized by clinical variability and pronounced genetic heterogeneity. Genetic testing may provide an accurate diagnosis for ophthalmic genetic disorders and allow gene therapy for specific diseases. A targeted gene capture panel was designed to capture exons of 283 inherited eye disease genes including 58 known causative retinitis pigmentosa (RP) genes. 180 samples were tested with this panel, 68 were previously tested by Sanger sequencing. Systematic evaluation of our method and comprehensive molecular diagnosis were carried on 99 RP patients. 96.85% targeted regions were covered by at least 20 folds, the accuracy of variants detection was 99.994%. In 4 of the 68 samples previously tested by Sanger sequencing, mutations of other diseases not consisting with the clinical diagnosis were detected by next-generation sequencing (NGS) not Sanger. Among the 99 RP patients, 64 (64.6%) were detected with pathogenic mutations, while in 3 patients, it was inconsistent between molecular diagnosis and their initial clinical diagnosis. After revisiting, one patient's clinical diagnosis was reclassified. In addition, 3 patients were found carrying large deletions. We have systematically evaluated our method and compared it with Sanger sequencing, and have identified a large number of novel mutations in a cohort of 99 RP patients. The results showed a sufficient accuracy of our method and suggested the importance of molecular diagnosis in clinical diagnosis.
Ma, Yuanyuan; Chiang, Pei-Wen; Zhong, Jing; Liu, Xuyang; Asan; Wu, Jing; Su, Yan; Li, Xin; Deng, Jianlian; Huang, Yingping; Zhang, Xinxin; Li, Yang; Fan, Ning; Wang, Ying; Tang, Lihui; Shen, Jinting; Chen, Meiyan; Zhang, Xiuqing; Te, Deng; Banerjee, Santasree; Liu, Hui; Qi, Ming; Yi, Xin
2018-01-01
Background Inherited eye diseases are major causes of vision loss in both children and adults. Inherited eye diseases are characterized by clinical variability and pronounced genetic heterogeneity. Genetic testing may provide an accurate diagnosis for ophthalmic genetic disorders and allow gene therapy for specific diseases. Methods A targeted gene capture panel was designed to capture exons of 283 inherited eye disease genes including 58 known causative retinitis pigmentosa (RP) genes. 180 samples were tested with this panel, 68 were previously tested by Sanger sequencing. Systematic evaluation of our method and comprehensive molecular diagnosis were carried on 99 RP patients. Results 96.85% targeted regions were covered by at least 20 folds, the accuracy of variants detection was 99.994%. In 4 of the 68 samples previously tested by Sanger sequencing, mutations of other diseases not consisting with the clinical diagnosis were detected by next-generation sequencing (NGS) not Sanger. Among the 99 RP patients, 64 (64.6%) were detected with pathogenic mutations, while in 3 patients, it was inconsistent between molecular diagnosis and their initial clinical diagnosis. After revisiting, one patient’s clinical diagnosis was reclassified. In addition, 3 patients were found carrying large deletions. Conclusions We have systematically evaluated our method and compared it with Sanger sequencing, and have identified a large number of novel mutations in a cohort of 99 RP patients. The results showed a sufficient accuracy of our method and suggested the importance of molecular diagnosis in clinical diagnosis. PMID:29641573
Buttitta, Fiamma; Felicioni, Lara; Del Grammastro, Maela; Filice, Giampaolo; Di Lorito, Alessia; Malatesta, Sara; Viola, Patrizia; Centi, Irene; D'Antuono, Tommaso; Zappacosta, Roberta; Rosini, Sandra; Cuccurullo, Franco; Marchetti, Antonio
2013-02-01
The therapeutic choice for patients with lung adenocarcinoma depends on the presence of EGF receptor (EGFR) mutations. In many cases, only cytologic samples are available for molecular diagnosis. Bronchoalveolar lavage (BAL) and pleural fluid, which represent a considerable proportion of cytologic specimens, cannot always be used for molecular testing because of low rate of tumor cells. We tested the feasibility of EGFR mutation analysis on BAL and pleural fluid samples by next-generation sequencing (NGS), an innovative and extremely sensitive platform. The study was devised to extend the EGFR test to those patients who could not get it due to the paucity of biologic material. A series of 830 lung cytology specimens was used to select 48 samples (BAL and pleural fluid) from patients with EGFR mutations in resected tumors. These samples included 36 cases with 0.3% to 9% of neoplastic cells (series A) and 12 cases without evidence of tumor (series B). All samples were analyzed by Sanger sequencing and NGS on 454 Roche platform. A mean of 21,130 ± 2,370 sequences per sample were obtained by NGS. In series A, EGFR mutations were detected in 16% of cases by Sanger sequencing and in 81% of cases by NGS. Seventy-seven percent of cases found to be negative by Sanger sequencing showed mutations by NGS. In series B, all samples were negative for EGFR mutation by Sanger sequencing whereas 42% of them were positive by NGS. The very sensitive EGFR-NGS assay may open up to the possibility of specific treatments for patients otherwise doomed to re-biopsies or nontargeted therapies.
Parson, Walther; Strobl, Christina; Huber, Gabriela; Zimmermann, Bettina; Gomes, Sibylle M.; Souto, Luis; Fendt, Liane; Delport, Rhena; Langit, Reina; Wootton, Sharon; Lagacé, Robert; Irwin, Jodi
2013-01-01
Insights into the human mitochondrial phylogeny have been primarily achieved by sequencing full mitochondrial genomes (mtGenomes). In forensic genetics (partial) mtGenome information can be used to assign haplotypes to their phylogenetic backgrounds, which may, in turn, have characteristic geographic distributions that would offer useful information in a forensic case. In addition and perhaps even more relevant in the forensic context, haplogroup-specific patterns of mutations form the basis for quality control of mtDNA sequences. The current method for establishing (partial) mtDNA haplotypes is Sanger-type sequencing (STS), which is laborious, time-consuming, and expensive. With the emergence of Next Generation Sequencing (NGS) technologies, the body of available mtDNA data can potentially be extended much more quickly and cost-efficiently. Customized chemistries, laboratory workflows and data analysis packages could support the community and increase the utility of mtDNA analysis in forensics. We have evaluated the performance of mtGenome sequencing using the Personal Genome Machine (PGM) and compared the resulting haplotypes directly with conventional Sanger-type sequencing. A total of 64 mtGenomes (>1 million bases) were established that yielded high concordance with the corresponding STS haplotypes (<0.02% differences). About two-thirds of the differences were observed in or around homopolymeric sequence stretches. In addition, the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes. Further development of alignment software would be desirable to facilitate the application of NGS in mtDNA forensic genetics. PMID:23948325
Next-generation sequencing: the future of molecular genetics in poultry production and food safety.
Diaz-Sanchez, S; Hanning, I; Pendleton, Sean; D'Souza, Doris
2013-02-01
The era of molecular biology and automation of the Sanger chain-terminator sequencing method has led to discovery and advances in diagnostics and biotechnology. The Sanger methodology dominated research for over 2 decades, leading to significant accomplishments and technological improvements in DNA sequencing. Next-generation high-throughput sequencing (HT-NGS) technologies were developed subsequently to overcome the limitations of this first generation technology that include higher speed, less labor, and lowered cost. Various platforms developed include sequencing-by-synthesis 454 Life Sciences, Illumina (Solexa) sequencing, SOLiD sequencing (among others), and the Ion Torrent semiconductor sequencing technologies that use different detection principles. As technology advances, progress made toward third generation sequencing technologies are being reported, which include Nanopore Sequencing and real-time monitoring of PCR activity through fluorescent resonant energy transfer. The advantages of these technologies include scalability, simplicity, with increasing DNA polymerase performance and yields, being less error prone, and even more economically feasible with the eventual goal of obtaining real-time results. These technologies can be directly applied to improve poultry production and enhance food safety. For example, sequence-based (determination of the gut microbial community, genes for metabolic pathways, or presence of plasmids) and function-based (screening for function such as antibiotic resistance, or vitamin production) metagenomic analysis can be carried out. Gut microbialflora/communities of poultry can be sequenced to determine the changes that affect health and disease along with efficacy of methods to control pathogenic growth. Thus, the purpose of this review is to provide an overview of the principles of these current technologies and their potential application to improve poultry production and food safety as well as public health.
Valenzuela-González, Fabiola; Martínez-Porchas, Marcel; Villalpando-Canchola, Enrique; Vargas-Albores, Francisco
2016-03-01
Ultrafast-metagenomic sequence classification using exact alignments (Kraken) is a novel approach to classify 16S rDNA sequences. The classifier is based on mapping short sequences to the lowest ancestor and performing alignments to form subtrees with specific weights in each taxon node. This study aimed to evaluate the classification performance of Kraken with long 16S rDNA random environmental sequences produced by cloning and then Sanger sequenced. A total of 480 clones were isolated and expanded, and 264 of these clones formed contigs (1352 ± 153 bp). The same sequences were analyzed using the Ribosomal Database Project (RDP) classifier. Deeper classification performance was achieved by Kraken than by the RDP: 73% of the contigs were classified up to the species or variety levels, whereas 67% of these contigs were classified no further than the genus level by the RDP. The results also demonstrated that unassembled sequences analyzed by Kraken provide similar or inclusively deeper information. Moreover, sequences that did not form contigs, which are usually discarded by other programs, provided meaningful information when analyzed by Kraken. Finally, it appears that the assembly step for Sanger sequences can be eliminated when using Kraken. Kraken cumulates the information of both sequence senses, providing additional elements for the classification. In conclusion, the results demonstrate that Kraken is an excellent choice for use in the taxonomic assignment of sequences obtained by Sanger sequencing or based on third generation sequencing, of which the main goal is to generate larger sequences. Copyright © 2016 Elsevier B.V. All rights reserved.
[Two novel pathogenic mutations of GAN gene identified in a patient with giant axonal neuropathy].
Wang, Juan; Ma, Qingwen; Cai, Qin; Liu, Yanna; Wang, Wei; Ren, Zhaorui
2016-06-01
To explore the disease-causing mutations in a patient suspected for giant axonal neuropathy(GAN). Target sequence capture sequencing was used to screen potential mutations in genomic DNA extracted from peripheral blood sample of the patient. Sanger sequencing was applied to confirm the detected mutation. The mutation was verified among 400 GAN alleles from 200 healthy individuals by Sanger sequencing. The function of the mutations was predicted by bioinformatics analysis. The patient was identified as a compound heterozygote carrying two novel pathogenic GAN mutations, i.e., c.778G>T (p.Glu260Ter) and c.277G>A (p.Gly93Arg). Sanger sequencing confirmed that the c.778G>T (p.Glu260Ter) mutation was inherited from his father, while c.277G>A (p.Gly93Arg) was inherited from his mother. The same mutations was not found in the 200 healthy individuals. Bioinformatics analysis predicted that the two mutations probably caused functional abnormality of gigaxonin. Two novel GAN mutations were detected in a patient with GAN. Both mutations are pathogenic and can cause abnormalities of gigaxonin structure and function, leading to pathogenesis of GAN. The results may also offer valuable information for similar diseases.
Revollo, Javier; Wang, Yiying; McKinzie, Page; Dad, Azra; Pearce, Mason; Heflich, Robert H; Dobrovolsky, Vasily N
2017-12-01
We used Sanger sequencing and next generation sequencing (NGS) for analysis of mutations in the endogenous X-linked Pig-a gene of clonally expanded L5178YTk +/- cells. The clones developed from single cells that were sorted on a flow cytometer based upon the expression pattern of the GPI-anchored marker, CD90, on their surface. CD90-deficient and CD90-proficient cells were sorted from untreated cultures and CD90-deficient cells were sorted from cultures treated with benzo[a]pyrene (B[a]P). Pig-a mutations were identified in all clones developed from CD90-deficient cells; no Pig-a mutations were found in clones of CD90-proficient cells. The spectrum of B[a]P-induced Pig-a mutations was dominated by basepair substitutions, small insertions and deletions at G:C, or at sequences rich in G:C content. We observed high concordance between Pig-a mutations determined by Sanger sequencing and by NGS, but NGS was able to identify mutations in samples that were difficult to analyze by Sanger sequencing (e.g., mixtures of two mutant clones). Overall, the NGS method is a cost and labor efficient high throughput approach for analysis of a large number of mutant clones. Published by Elsevier B.V.
[Genetic analysis of two children patients affected with CHARGE syndrome].
Li, Guoqiang; Li, Niu; Xu, Yufei; Li, Juan; Ding, Yu; Shen, Yiping; Wang, Xiumin; Wang, Jian
2018-04-10
To analyze two Chinese pediatric patients with multiple malformations and growth and development delay. Both patients were subjected to targeted gene sequencing, and the results were analyzed with Ingenuity Variant Analysis software. Suspected pathogenic variations were verified by Sanger sequencing. High-throughput sequencing showed that both patients have carried heterozygous variants of the CHD7 gene. Patient 1 carried a nonsense mutation in exon 36 (c.7957C>T, p.Arg2653*), while patient 2 carried a nonsense mutation of exon 2 (c.718C>T, p.Gln240*). Sanger sequencing confirmed the above mutations in both patients, while their parents were of wild-type for the corresponding sites, indicating that the two mutations have happened de novo. Two patients were diagnosed with CHARGE syndrome by high-throughput sequencing.
Anderson, Steven; Bloom, Kenneth J; Vallera, Dino U; Rueschoff, Josef; Meldrum, Cliff; Schilling, Robert; Kovach, Barbara; Lee, Ju Ruey-Jiuan; Ochoa, Pam; Langland, Rachel; Halait, Harkanwal; Lawrence, H Jeffrey; Dugan, Michael C
2012-11-01
A polymerase chain reaction-based companion diagnostic (cobas 4800 BRAF V600 Mutation Test) was recently approved by the US Food and Drug Administration to select patients with BRAF-mutant metastatic melanoma for treatment with the BRAF inhibitor vemurafenib. (1) To compare the analytic performance of the cobas test to Sanger sequencing by using screening specimens from phase II and phase III trials of vemurafenib, and (2) to assess the reproducibility of the cobas test at different testing sites. Specimens from 477 patients were used to determine positive and negative percent agreements between the cobas test and Sanger sequencing for detecting V600E (1799T>A) mutations. Specimens were evaluated with a massively parallel pyrosequencing method (454) to resolve discordances between polymerase chain reaction and Sanger results. Reproducibility of the cobas test was assessed at 3 sites by using 3 reagent lots and an 8-member panel of melanoma samples. A valid cobas result was obtained for all eligible patients. Sanger sequencing had a failure rate of 9.2% (44 of 477). For the remaining 433 specimens, positive percent agreement was 96.4% (215 of 223) and negative percent agreement, 80% (168 of 210). Among 42 cobas mutation-positive/Sanger V600E-negative specimens, 17 were V600E positive and 24 were V600K positive by 454. The cobas test detected 70% of V600K mutations. In the reproducibility study, a correct interpretation was made for 100% of wild-type specimens and specimens with greater than 5% mutant alleles; V600E mutations were detected in 90% of specimens with less than 5% mutant alleles. The cobas test (1) had a lower assay failure rate than that of Sanger, (2) was more sensitive in detecting V600E mutations, (3) detected most V600K mutations, and (4) was highly reproducible.
d’Avila-Levy, Claudia Masini; Boucinha, Carolina; Kostygov, Alexei; Santos, Helena Lúcia Carneiro; Morelli, Karina Alessandra; Grybchuk-Ieremenko, Anastasiia; Duval, Linda; Votýpka, Jan; Yurchenko, Vyacheslav; Grellier, Philippe; Lukeš, Julius
2015-01-01
The class Kinetoplastea encompasses both free-living and parasitic species from a wide range of hosts. Several representatives of this group are responsible for severe human diseases and for economic losses in agriculture and livestock. While this group encompasses over 30 genera, most of the available information has been derived from the vertebrate pathogenic genera Leishmaniaand Trypanosoma. Recent studies of the previously neglected groups of Kinetoplastea indicated that the actual diversity is much higher than previously thought. This article discusses the known segment of kinetoplastid diversity and how gene-directed Sanger sequencing and next-generation sequencing methods can help to deepen our knowledge of these interesting protists. PMID:26602872
Genome Sequencing and Assembly by Long Reads in Plants
Li, Changsheng; Lin, Feng; An, Dong; Huang, Ruidong
2017-01-01
Plant genomes generated by Sanger and Next Generation Sequencing (NGS) have provided insight into species diversity and evolution. However, Sanger sequencing is limited in its applications due to high cost, labor intensity, and low throughput, while NGS reads are too short to resolve abundant repeats and polyploidy, leading to incomplete or ambiguous assemblies. The advent and improvement of long-read sequencing by Third Generation Sequencing (TGS) methods such as PacBio and Nanopore have shown promise in producing high-quality assemblies for complex genomes. Here, we review the development of sequencing, introducing the application as well as considerations of experimental design in TGS of plant genomes. We also introduce recent revolutionary scaffolding technologies including BioNano, Hi-C, and 10× Genomics. We expect that the informative guidance for genome sequencing and assembly by long reads will benefit the initiation of scientists’ projects. PMID:29283420
Development of a Web Tool for Escherichia coli Subtyping Based on fimH Alleles.
Roer, Louise; Tchesnokova, Veronika; Allesøe, Rosa; Muradova, Mariya; Chattopadhyay, Sujay; Ahrenfeldt, Johanne; Thomsen, Martin C F; Lund, Ole; Hansen, Frank; Hammerum, Anette M; Sokurenko, Evgeni; Hasman, Henrik
2017-08-01
The aim of this study was to construct a valid publicly available method for in silico fimH subtyping of Escherichia coli particularly suitable for differentiation of fine-resolution subgroups within clonal groups defined by standard multilocus sequence typing (MLST). FimTyper was constructed as a FASTA database containing all currently known fimH alleles. The software source code is publicly available at https://bitbucket.org/genomicepidemiology/fimtyper, the database is freely available at https://bitbucket.org/genomicepidemiology/fimtyper_db, and a service implementing the software is available at https://cge.cbs.dtu.dk/services/FimTyper FimTyper was validated on three data sets: one containing Sanger sequences of fimH alleles of 42 E. coli isolates generated prior to the current study (data set 1), one containing whole-genome sequence (WGS) data of 243 third-generation-cephalosporin-resistant E. coli isolates (data set 2), and one containing a randomly chosen subset of 40 E. coli isolates from data set 2 that were subjected to conventional fimH subtyping (data set 3). The combination of the three data sets enabled an evaluation and comparison of FimTyper on both Sanger sequences and WGS data. FimTyper correctly predicted all 42 fimH subtypes from the Sanger sequences from data set 1 and successfully analyzed all 243 draft genomes from data set 2. FimTyper subtyping of the Sanger sequences and WGS data from data set 3 were in complete agreement. Additionally, fimH subtyping was evaluated on a phylogenetic network of 122 sequence type 131 (ST131) E. coli isolates. There was perfect concordance between the typology and fimH -based subclones within ST131, with accurate identification of the pandemic multidrug-resistant clonal subgroup ST131- H 30. FimTyper provides a standardized tool, as a rapid alternative to conventional fimH subtyping, highly suitable for surveillance and outbreak detection. Copyright © 2017 American Society for Microbiology.
van Huet, Ramon A. C.; Pierrache, Laurence H.M.; Meester-Smoor, Magda A.; Klaver, Caroline C.W.; van den Born, L. Ingeborgh; Hoyng, Carel B.; de Wijs, Ilse J.; Collin, Rob W. J.; Hoefsloot, Lies H.
2015-01-01
Purpose To determine the efficacy of multiple versions of a commercially available arrayed primer extension (APEX) microarray chip for autosomal recessive retinitis pigmentosa (arRP). Methods We included 250 probands suspected of arRP who were genetically analyzed with the APEX microarray between January 2008 and November 2013. The mode of inheritance had to be autosomal recessive according to the pedigree (including isolated cases). If the microarray identified a heterozygous mutation, we performed Sanger sequencing of exons and exon–intron boundaries of that specific gene. The efficacy of this microarray chip with the additional Sanger sequencing approach was determined by the percentage of patients that received a molecular diagnosis. We also collected data from genetic tests other than the APEX analysis for arRP to provide a detailed description of the molecular diagnoses in our study cohort. Results The APEX microarray chip for arRP identified the molecular diagnosis in 21 (8.5%) of the patients in our cohort. Additional Sanger sequencing yielded a second mutation in 17 patients (6.8%), thereby establishing the molecular diagnosis. In total, 38 patients (15.2%) received a molecular diagnosis after analysis using the microarray and additional Sanger sequencing approach. Further genetic analyses after a negative result of the arRP microarray (n = 107) resulted in a molecular diagnosis of arRP (n = 23), autosomal dominant RP (n = 5), X-linked RP (n = 2), and choroideremia (n = 1). Conclusions The efficacy of the commercially available APEX microarray chips for arRP appears to be low, most likely caused by the limitations of this technique and the genetic and allelic heterogeneity of RP. Diagnostic yields up to 40% have been reported for next-generation sequencing (NGS) techniques that, as expected, thereby outperform targeted APEX analysis. PMID:25999674
Detection of MPL mutations by a novel allele-specific PCR-based strategy.
Furtado, Larissa V; Weigelin, Helmut C; Elenitoba-Johnson, Kojo S J; Betz, Bryan L
2013-11-01
MPL mutation testing is recommended in patients with suspected primary myelofibrosis or essential thrombocythemia who lack the JAK2 V617F mutation. MPL mutations can occur at allelic levels below 15%, which may escape detection by commonly used mutation screening methods such as Sanger sequencing. We developed a novel multiplexed allele-specific PCR assay capable of detecting most recurrent MPL exon 10 mutations associated with primary myelofibrosis and essential thrombocythemia (W515L, W515K, W515A, and S505N) down to a sensitivity of 2.5% mutant allele. Test results were reviewed from 15 reference cases and 1380 consecutive specimens referred to our laboratory for testing. Assay performance was compared to Sanger sequencing across a series of 58 specimens with MPL mutations. Positive cases consisted of 45 with W515L, 6 with S505N, 5 with W515K, 1 with W515A, and 1 with both W515L and S505N. Seven cases had mutations below 5% that were undetected by Sanger sequencing. Ten additional cases had mutation levels between 5% and 15% that were not consistently detected by sequencing. All results were easily interpreted in the allele-specific test. This assay offers a sensitive and reliable solution for MPL mutation testing. Sanger sequencing appears insufficiently sensitive for robust MPL mutation detection. Our data also suggest the relative frequency of S505N mutations may be underestimated, highlighting the necessity for inclusion of this mutation in MPL test platforms. Copyright © 2013 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Paparini, Andrea; Yang, Rongchang; Chen, Linda; Tong, Kaising; Gibson-Kueh, Susan; Lymbery, Alan; Ryan, Una M
2017-11-01
Currently, the systematics, biology and epidemiology of piscine Cryptosporidium species are poorly understood. Here, we compared Sanger ‒ and next-generation ‒ sequencing (NGS), of piscine Cryptosporidium, at the 18S rRNA and actin genes. The hosts comprised 11 ornamental fish species, spanning four orders and eight families. The objectives were: to (i) confirm the rich genetic diversity of the parasite and the high frequency of mixed infections; and (ii) explore the potential of NGS in the presence of complex genetic mixtures. By Sanger sequencing, four main genotypes were obtained at the actin locus, while for the 18S locus, seven genotypes were identified. At both loci, NGS revealed frequent mixed infections, consisting of one highly dominant variant plus substantially rarer genotypes. Both sequencing methods detected novel Cryptosporidium genotypes at both loci, including a novel and highly abundant actin genotype that was identified by both Sanger sequencing and NGS. Importantly, this genotype accounted for 68·9% of all NGS reads from all samples (249 585/362 372). The present study confirms that aquarium fish can harbour a large and unexplored Cryptosporidium genetic diversity. Although commonly used in molecular parasitology studies, nested PCR prevents quantitative comparisons and thwarts the advantages of NGS, when this latter approach is used to investigate multiple infections.
French, Deborah; Smith, Andrew; Powers, Martin P; Wu, Alan H B
2011-08-17
Binding of a ligand to the epidermal growth factor receptor (EGFR) stimulates various intracellular signaling pathways resulting in cell cycle progression, proliferation, angiogenesis and apoptosis inhibition. KRAS is involved in signaling pathways including RAF/MAPK and PI3K and mutations in this gene result in constitutive activation of these pathways, independent of EGFR activation. Seven mutations in codons 12 and 13 of KRAS comprise around 95% of the observed human mutations, rendering monoclonal antibodies against EGFR (e.g. cetuximab and panitumumab) useless in treatment of colorectal cancer. KRAS mutation testing by two different methodologies was compared; Sanger sequencing and AutoGenomics INFINITI® assay, on DNA extracted from colorectal cancers. Out of 29 colorectal tumor samples tested, 28 were concordant between the two methodologies for the KRAS mutations that were detected in both assays with the INFINITI® assay detecting a mutation in one sample that was indeterminate by Sanger sequencing and a third methodology; single nucleotide primer extension. This study indicates the utility of the AutoGenomics INFINITI® methodology in a clinical laboratory setting where technical expertise or access to equipment for DNA sequencing does not exist. Copyright © 2011 Elsevier B.V. All rights reserved.
2018-01-01
New, as yet undiscovered aptamers for Protein A were identified by applying next generation sequencing (NGS) to a previously selected aptamer pool. This pool was obtained in a classical SELEX (Systematic Evolution of Ligands by EXponential enrichment) experiment using the FluMag-SELEX procedure followed by cloning and Sanger sequencing. PA#2/8 was identified as the only Protein A-binding aptamer from the Sanger sequence pool, and was shown to be able to bind intact cells of Staphylococcus aureus. In this study, we show the extension of the SELEX results by re-sequencing of the same aptamer pool using a medium throughput NGS approach and data analysis. Both data pools were compared. They confirm the selection of a highly complex and heterogeneous oligonucleotide pool and show consistently a high content of orphans as well as a similar relative frequency of certain sequence groups. But in contrast to the Sanger data pool, the NGS pool was clearly dominated by one sequence group containing the known Protein A-binding aptamer PA#2/8 as the most frequent sequence in this group. In addition, we found two new sequence groups in the NGS pool represented by PA-C10 and PA-C8, respectively, which also have high specificity for Protein A. Comparative affinity studies reveal differences between the aptamers and confirm that PA#2/8 remains the most potent sequence within the selected aptamer pool reaching affinities in the low nanomolar range of KD = 20 ± 1 nM. PMID:29495282
Stoltenburg, Regina; Strehlitz, Beate
2018-02-24
New, as yet undiscovered aptamers for Protein A were identified by applying next generation sequencing (NGS) to a previously selected aptamer pool. This pool was obtained in a classical SELEX (Systematic Evolution of Ligands by EXponential enrichment) experiment using the FluMag-SELEX procedure followed by cloning and Sanger sequencing. PA#2/8 was identified as the only Protein A-binding aptamer from the Sanger sequence pool, and was shown to be able to bind intact cells of Staphylococcus aureus . In this study, we show the extension of the SELEX results by re-sequencing of the same aptamer pool using a medium throughput NGS approach and data analysis. Both data pools were compared. They confirm the selection of a highly complex and heterogeneous oligonucleotide pool and show consistently a high content of orphans as well as a similar relative frequency of certain sequence groups. But in contrast to the Sanger data pool, the NGS pool was clearly dominated by one sequence group containing the known Protein A-binding aptamer PA#2/8 as the most frequent sequence in this group. In addition, we found two new sequence groups in the NGS pool represented by PA-C10 and PA-C8, respectively, which also have high specificity for Protein A. Comparative affinity studies reveal differences between the aptamers and confirm that PA#2/8 remains the most potent sequence within the selected aptamer pool reaching affinities in the low nanomolar range of K D = 20 ± 1 nM.
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.
Militello, Kevin T; Lazatin, Justine C
2017-05-01
Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017. © 2016 The International Union of Biochemistry and Molecular Biology.
Liu, Shanlin; Yang, Chentao; Zhou, Chengran; Zhou, Xin
2017-12-01
Over the past decade, biodiversity researchers have dedicated tremendous efforts to constructing DNA reference barcodes for rapid species registration and identification. Although analytical cost for standard DNA barcoding has been significantly reduced since early 2000, further dramatic reduction in barcoding costs is unlikely because Sanger sequencing is approaching its limits in throughput and chemistry cost. Constraints in barcoding cost not only led to unbalanced barcoding efforts around the globe, but also prevented high-throughput sequencing (HTS)-based taxonomic identification from applying binomial species names, which provide crucial linkages to biological knowledge. We developed an Illumina-based pipeline, HIFI-Barcode, to produce full-length Cytochrome c oxidase subunit I (COI) barcodes from pooled polymerase chain reaction amplicons generated by individual specimens. The new pipeline generated accurate barcode sequences that were comparable to Sanger standards, even for different haplotypes of the same species that were only a few nucleotides different from each other. Additionally, the new pipeline was much more sensitive in recovering amplicons at low quantity. The HIFI-Barcode pipeline successfully recovered barcodes from more than 78% of the polymerase chain reactions that didn't show clear bands on the electrophoresis gel. Moreover, sequencing results based on the single molecular sequencing platform Pacbio confirmed the accuracy of the HIFI-Barcode results. Altogether, the new pipeline can provide an improved solution to produce full-length reference barcodes at about one-tenth of the current cost, enabling construction of comprehensive barcode libraries for local fauna, leading to a feasible direction for DNA barcoding global biomes. © The Authors 2017. Published by Oxford University Press.
Sanger sequencing as a first-line approach for molecular diagnosis of Andersen-Tawil syndrome
Totomoch-Serra, Armando; Marquez, Manlio F.; Cervantes-Barragán, David E.
2017-01-01
In 1977, Frederick Sanger developed a new method for DNA sequencing based on the chain termination method, now known as the Sanger sequencing method (SSM). Recently, massive parallel sequencing, better known as next-generation sequencing (NGS), is replacing the SSM for detecting mutations in cardiovascular diseases with a genetic background. The present opinion article wants to remark that “targeted” SSM is still effective as a first-line approach for the molecular diagnosis of some specific conditions, as is the case for Andersen-Tawil syndrome (ATS). ATS is described as a rare multisystemic autosomal dominant channelopathy syndrome caused mainly by a heterozygous mutation in the KCNJ2 gene . KCJN2 has particular characteristics that make it attractive for “directed” SSM. KCNJ2 has a sequence of 17,510 base pairs (bp), and a short coding region with two exons (exon 1=166 bp and exon 2=5220 bp), half of the mutations are located in the C-terminal cytosolic domain, a mutational hotspot has been described in residue Arg218, and this gene explains the phenotype in 60% of ATS cases that fulfill all the clinical criteria of the disease. In order to increase the diagnosis of ATS we urge cardiologists to search for facial and muscular abnormalities in subjects with frequent ventricular arrhythmias (especially bigeminy) and prominent U waves on the electrocardiogram. PMID:29093808
STINGRAY: system for integrated genomic resources and analysis.
Wagner, Glauber; Jardim, Rodrigo; Tschoeke, Diogo A; Loureiro, Daniel R; Ocaña, Kary A C S; Ribeiro, Antonio C B; Emmel, Vanessa E; Probst, Christian M; Pitaluga, André N; Grisard, Edmundo C; Cavalcanti, Maria C; Campos, Maria L M; Mattoso, Marta; Dávila, Alberto M R
2014-03-07
The STINGRAY system has been conceived to ease the tasks of integrating, analyzing, annotating and presenting genomic and expression data from Sanger and Next Generation Sequencing (NGS) platforms. STINGRAY includes: (a) a complete and integrated workflow (more than 20 bioinformatics tools) ranging from functional annotation to phylogeny; (b) a MySQL database schema, suitable for data integration and user access control; and (c) a user-friendly graphical web-based interface that makes the system intuitive, facilitating the tasks of data analysis and annotation. STINGRAY showed to be an easy to use and complete system for analyzing sequencing data. While both Sanger and NGS platforms are supported, the system could be faster using Sanger data, since the large NGS datasets could potentially slow down the MySQL database usage. STINGRAY is available at http://stingray.biowebdb.org and the open source code at http://sourceforge.net/projects/stingray-biowebdb/.
STINGRAY: system for integrated genomic resources and analysis
2014-01-01
Background The STINGRAY system has been conceived to ease the tasks of integrating, analyzing, annotating and presenting genomic and expression data from Sanger and Next Generation Sequencing (NGS) platforms. Findings STINGRAY includes: (a) a complete and integrated workflow (more than 20 bioinformatics tools) ranging from functional annotation to phylogeny; (b) a MySQL database schema, suitable for data integration and user access control; and (c) a user-friendly graphical web-based interface that makes the system intuitive, facilitating the tasks of data analysis and annotation. Conclusion STINGRAY showed to be an easy to use and complete system for analyzing sequencing data. While both Sanger and NGS platforms are supported, the system could be faster using Sanger data, since the large NGS datasets could potentially slow down the MySQL database usage. STINGRAY is available at http://stingray.biowebdb.org and the open source code at http://sourceforge.net/projects/stingray-biowebdb/. PMID:24606808
Low-Cost, High-Throughput Sequencing of DNA Assemblies Using a Highly Multiplexed Nextera Process.
Shapland, Elaine B; Holmes, Victor; Reeves, Christopher D; Sorokin, Elena; Durot, Maxime; Platt, Darren; Allen, Christopher; Dean, Jed; Serber, Zach; Newman, Jack; Chandran, Sunil
2015-07-17
In recent years, next-generation sequencing (NGS) technology has greatly reduced the cost of sequencing whole genomes, whereas the cost of sequence verification of plasmids via Sanger sequencing has remained high. Consequently, industrial-scale strain engineers either limit the number of designs or take short cuts in quality control. Here, we show that over 4000 plasmids can be completely sequenced in one Illumina MiSeq run for less than $3 each (15× coverage), which is a 20-fold reduction over using Sanger sequencing (2× coverage). We reduced the volume of the Nextera tagmentation reaction by 100-fold and developed an automated workflow to prepare thousands of samples for sequencing. We also developed software to track the samples and associated sequence data and to rapidly identify correctly assembled constructs having the fewest defects. As DNA synthesis and assembly become a centralized commodity, this NGS quality control (QC) process will be essential to groups operating high-throughput pipelines for DNA construction.
Richter, Anna; Grieu, Fabienne; Carrello, Amerigo; Amanuel, Benhur; Namdarian, Kateh; Rynska, Aleksandra; Lucas, Amanda; Michael, Victoria; Bell, Anthony; Fox, Stephen B.; Hewitt, Chelsee A.; Do, Hongdo; McArthur, Grant A.; Wong, Stephen Q.; Dobrovic, Alexander; Iacopetta, Barry
2013-01-01
Melanoma patients with BRAF mutations respond to treatment with vemurafenib, thus creating a need for accurate testing of BRAF mutation status. We carried out a blinded study to evaluate various BRAF mutation testing methodologies in the clinical setting. Formalin-fixed, paraffin-embedded melanoma samples were macrodissected before screening for mutations using Sanger sequencing, single-strand conformation analysis (SSCA), high resolution melting analysis (HRM) and competitive allele-specific TaqMan® PCR (CAST-PCR). Concordance of 100% was observed between the Sanger sequencing, SSCA and HRM techniques. CAST-PCR gave rapid and accurate results for the common V600E and V600K mutations, however additional assays are required to detect rarer BRAF mutation types found in 3–4% of melanomas. HRM and SSCA followed by Sanger sequencing are effective two-step strategies for the detection of BRAF mutations in the clinical setting. CAST-PCR was useful for samples with low tumour purity and may also be a cost-effective and robust method for routine diagnostics. PMID:23584600
A high-throughput Sanger strategy for human mitochondrial genome sequencing
2013-01-01
Background A population reference database of complete human mitochondrial genome (mtGenome) sequences is needed to enable the use of mitochondrial DNA (mtDNA) coding region data in forensic casework applications. However, the development of entire mtGenome haplotypes to forensic data quality standards is difficult and laborious. A Sanger-based amplification and sequencing strategy that is designed for automated processing, yet routinely produces high quality sequences, is needed to facilitate high-volume production of these mtGenome data sets. Results We developed a robust 8-amplicon Sanger sequencing strategy that regularly produces complete, forensic-quality mtGenome haplotypes in the first pass of data generation. The protocol works equally well on samples representing diverse mtDNA haplogroups and DNA input quantities ranging from 50 pg to 1 ng, and can be applied to specimens of varying DNA quality. The complete workflow was specifically designed for implementation on robotic instrumentation, which increases throughput and reduces both the opportunities for error inherent to manual processing and the cost of generating full mtGenome sequences. Conclusions The described strategy will assist efforts to generate complete mtGenome haplotypes which meet the highest data quality expectations for forensic genetic and other applications. Additionally, high-quality data produced using this protocol can be used to assess mtDNA data developed using newer technologies and chemistries. Further, the amplification strategy can be used to enrich for mtDNA as a first step in sample preparation for targeted next-generation sequencing. PMID:24341507
Optimization of conditions to sequence long cDNAs from viruses
USDA-ARS?s Scientific Manuscript database
Fourth generation sequencing with the Minion nanopore sequencer provides opportunity to obtain deep coverage and long read for single molecules. This will benefit studies on RNA viruses. In the past, Sanger, Illumina, and Ion Torrent sequencing have been utilized to study RNA viruses. Both technique...
[The principle and application of the single-molecule real-time sequencing technology].
Yanhu, Liu; Lu, Wang; Li, Yu
2015-03-01
Last decade witnessed the explosive development of the third-generation sequencing strategy, including single-molecule real-time sequencing (SMRT), true single-molecule sequencing (tSMSTM) and the single-molecule nanopore DNA sequencing. In this review, we summarize the principle, performance and application of the SMRT sequencing technology. Compared with the traditional Sanger method and the next-generation sequencing (NGS) technologies, the SMRT approach has several advantages, including long read length, high speed, PCR-free and the capability of direct detection of epigenetic modifications. However, the disadvantage of its low accuracy, most of which resulted from insertions and deletions, is also notable. So, the raw sequence data need to be corrected before assembly. Up to now, the SMRT is a good fit for applications in the de novo genomic sequencing and the high-quality assemblies of small genomes. In the future, it is expected to play an important role in epigenetics, transcriptomic sequencing, and assemblies of large genomes.
Pena, Loren D M; Jiang, Yong-Hui; Schoch, Kelly; Spillmann, Rebecca C; Walley, Nicole; Stong, Nicholas; Rapisardo Horn, Sarah; Sullivan, Jennifer A; McConkie-Rosell, Allyn; Kansagra, Sujay; Smith, Edward C; El-Dairi, Mays; Bellet, Jane; Keels, Martha Ann; Jasien, Joan; Kranz, Peter G; Noel, Richard; Nagaraj, Shashi K; Lark, Robert K; Wechsler, Daniel S G; Del Gaudio, Daniela; Leung, Marco L; Hendon, Laura G; Parker, Collette C; Jones, Kelly L; Goldstein, David B; Shashi, Vandana
2018-04-01
PurposeTo describe examples of missed pathogenic variants on whole-exome sequencing (WES) and the importance of deep phenotyping for further diagnostic testing.MethodsGuided by phenotypic information, three children with negative WES underwent targeted single-gene testing.ResultsIndividual 1 had a clinical diagnosis consistent with infantile systemic hyalinosis, although WES and a next-generation sequencing (NGS)-based ANTXR2 test were negative. Sanger sequencing of ANTXR2 revealed a homozygous single base pair insertion, previously missed by the WES variant caller software. Individual 2 had neurodevelopmental regression and cerebellar atrophy, with no diagnosis on WES. New clinical findings prompted Sanger sequencing and copy number testing of PLA2G6. A novel homozygous deletion of the noncoding exon 1 (not included in the WES capture kit) was detected, with extension into the promoter, confirming the clinical suspicion of infantile neuroaxonal dystrophy. Individual 3 had progressive ataxia, spasticity, and magnetic resonance image changes of vanishing white matter leukoencephalopathy. An NGS leukodystrophy gene panel and WES showed a heterozygous pathogenic variant in EIF2B5; no deletions/duplications were detected. Sanger sequencing of EIF2B5 showed a frameshift indel, probably missed owing to failure of alignment.ConclusionThese cases illustrate potential pitfalls of WES/NGS testing and the importance of phenotype-guided molecular testing in yielding diagnoses.
MPRAnator: a web-based tool for the design of massively parallel reporter assay experiments
Georgakopoulos-Soares, Ilias; Jain, Naman; Gray, Jesse M; Hemberg, Martin
2017-01-01
Motivation: With the rapid advances in DNA synthesis and sequencing technologies and the continuing decline in the associated costs, high-throughput experiments can be performed to investigate the regulatory role of thousands of oligonucleotide sequences simultaneously. Nevertheless, designing high-throughput reporter assay experiments such as massively parallel reporter assays (MPRAs) and similar methods remains challenging. Results: We introduce MPRAnator, a set of tools that facilitate rapid design of MPRA experiments. With MPRA Motif design, a set of variables provides fine control of how motifs are placed into sequences, thereby allowing the investigation of the rules that govern transcription factor (TF) occupancy. MPRA single-nucleotide polymorphism design can be used to systematically examine the functional effects of single or combinations of single-nucleotide polymorphisms at regulatory sequences. Finally, the Transmutation tool allows for the design of negative controls by permitting scrambling, reversing, complementing or introducing multiple random mutations in the input sequences or motifs. Availability and implementation: MPRAnator tool set is implemented in Python, Perl and Javascript and is freely available at www.genomegeek.com and www.sanger.ac.uk/science/tools/mpranator. The source code is available on www.github.com/hemberg-lab/MPRAnator/ under the MIT license. The REST API allows programmatic access to MPRAnator using simple URLs. Contact: igs@sanger.ac.uk or mh26@sanger.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27605100
MPRAnator: a web-based tool for the design of massively parallel reporter assay experiments.
Georgakopoulos-Soares, Ilias; Jain, Naman; Gray, Jesse M; Hemberg, Martin
2017-01-01
With the rapid advances in DNA synthesis and sequencing technologies and the continuing decline in the associated costs, high-throughput experiments can be performed to investigate the regulatory role of thousands of oligonucleotide sequences simultaneously. Nevertheless, designing high-throughput reporter assay experiments such as massively parallel reporter assays (MPRAs) and similar methods remains challenging. We introduce MPRAnator, a set of tools that facilitate rapid design of MPRA experiments. With MPRA Motif design, a set of variables provides fine control of how motifs are placed into sequences, thereby allowing the investigation of the rules that govern transcription factor (TF) occupancy. MPRA single-nucleotide polymorphism design can be used to systematically examine the functional effects of single or combinations of single-nucleotide polymorphisms at regulatory sequences. Finally, the Transmutation tool allows for the design of negative controls by permitting scrambling, reversing, complementing or introducing multiple random mutations in the input sequences or motifs. MPRAnator tool set is implemented in Python, Perl and Javascript and is freely available at www.genomegeek.com and www.sanger.ac.uk/science/tools/mpranator The source code is available on www.github.com/hemberg-lab/MPRAnator/ under the MIT license. The REST API allows programmatic access to MPRAnator using simple URLs. igs@sanger.ac.uk or mh26@sanger.ac.ukSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Zhou, Chengran
2017-01-01
Abstract Over the past decade, biodiversity researchers have dedicated tremendous efforts to constructing DNA reference barcodes for rapid species registration and identification. Although analytical cost for standard DNA barcoding has been significantly reduced since early 2000, further dramatic reduction in barcoding costs is unlikely because Sanger sequencing is approaching its limits in throughput and chemistry cost. Constraints in barcoding cost not only led to unbalanced barcoding efforts around the globe, but also prevented high-throughput sequencing (HTS)–based taxonomic identification from applying binomial species names, which provide crucial linkages to biological knowledge. We developed an Illumina-based pipeline, HIFI-Barcode, to produce full-length Cytochrome c oxidase subunit I (COI) barcodes from pooled polymerase chain reaction amplicons generated by individual specimens. The new pipeline generated accurate barcode sequences that were comparable to Sanger standards, even for different haplotypes of the same species that were only a few nucleotides different from each other. Additionally, the new pipeline was much more sensitive in recovering amplicons at low quantity. The HIFI-Barcode pipeline successfully recovered barcodes from more than 78% of the polymerase chain reactions that didn’t show clear bands on the electrophoresis gel. Moreover, sequencing results based on the single molecular sequencing platform Pacbio confirmed the accuracy of the HIFI-Barcode results. Altogether, the new pipeline can provide an improved solution to produce full-length reference barcodes at about one-tenth of the current cost, enabling construction of comprehensive barcode libraries for local fauna, leading to a feasible direction for DNA barcoding global biomes. PMID:29077841
Sun, Shumei; Zhou, Hao; Zhou, Bin; Hu, Ziyou; Hou, Jinlin; Sun, Jian
2012-05-01
To evaluate the sensitivity and specificity of nested PCR combined with pyrosequencing in the detection of HBV drug-resistance gene. RtM204I (ATT) mutant and rtM204 (ATG) nonmutant plasmids mixed at different ratios were detected for mutations using nested-PCR combined with pyrosequencing, and the results were compared with those by conventional PCR pyrosequencing to analyze the linearity and consistency of the two methods. Clinical specimens with different viral loads were examined for drug-resistant mutations using nested PCR pyrosequencing and nested PCR combined with dideoxy sequencing (Sanger) for comparison of the detection sensitivity and specificity. The fitting curves demonstrated good linearity of both conventional PCR pyrosequencing and nested PCR pyrosequencing (R(2)>0.99, P<0.05). Nested PCR showed a better consistency with the predicted value than conventional PCR, and was superior to conventional PCR for detection of samples containing 90% mutant plasmid. In the detection of clinical specimens, Sanger sequencing had a significantly lower sensitivity than nested PCR pyrosequencing (92% vs 100%, P<0.01). The detection sensitivity of Sanger sequencing varied with the viral loads, especially in samples with low viral copies (HBV DNA ≤3log10 copies/ml), where the sensitivity was 78%, significantly lower than that of pyrosequencing (100%, P<0.01). Neither of the two methods yielded positive results for the negative control samples, suggesting their good specificity. Compared with nested PCR and Sanger sequencing method, nested PCR pyrosequencing has a higher sensitivity especially in clinical specimens with low viral copies, which can be important for early detection of HBV mutant strains and hence more effective clinical management.
USDA-ARS?s Scientific Manuscript database
The current pig reference genome sequence (Sscrofa10.2) was established using Sanger sequencing and following the clone-by-clone hierarchical shotgun sequencing approach used in the public human genome project. However, as sequence coverage was low (4-6x) the resulting assembly was only of draft qua...
The Dynamics of DNA Sequencing.
ERIC Educational Resources Information Center
Morvillo, Nancy
1997-01-01
Describes a paper-and-pencil activity that helps students understand DNA sequencing and expands student understanding of DNA structure, replication, and gel electrophoresis. Appropriate for advanced biology students who are familiar with the Sanger method. (DDR)
Meng, Lanlan; Du, Juan; Li, Wen; Lu, Guangxiu; Tan, Yueqiu
2017-08-10
To determine the molecular etiology for a Chinese pedigree affected with epidermolysis bullosa simplex (EBS). Target region sequencing using a hereditary epidermolysis bullosa capture array combined with Sanger sequencing and bioinformatics analysis were used. Mutation taster, PolyPhen-2, Provean, and SIFT software and NCBI online were employed to assess the pathogenicity and conservation of detected mutations. One hundred healthy unrelated individuals were used as controls. Target region sequencing showed that the proband has carried a unreported heterozygous c.1234A>G (p.Ile412Val) mutation of the KRT14 gene, which was confirmed by Sanger sequencing in other 8 affected individuals but not among healthy members of the pedigree. Bioinformatics analysis indicated that the mutation is highly pathogenic. Remarkably, 3 members of the family (2 affected and 1 unaffected) have carried a heterozygous c.1237G>A (p.Ala413Thr) mutation of the KRT14 gene, which was collected in Human Gene Mutation Database (HGMD). Bioinformatics analysis indicated that the mutation may not be pathogenic. Both mutations were not detected among the 100 healthy controls. The novel c.1234A>G(p.Ile412Val) mutation of the KRT14 gene is probably responsible for the disease, while c.1237G>A (p.Ala413Thr) mutation of KRT14 gene may be a polymorphism. Compared with Sanger sequencing, target region capture sequencing is more efficient and can significantly reduce the cost of genetic testing for EBS.
1996-02-01
JOICFP President Shidzue Kato was influenced by and friends with Margaret Sanger from their initial meeting in 1920 to Sanger's death in 1966. Bruce Alfred is currently directing and producing a 90-minute documentary film about Sanger and her pioneering work in promoting the development and use of family planning. Once completed in the Spring of 1997, the film will be broadcast nationally in the US on Public Television. It is being produced with support from the National Endowment for the Humanities and several private foundations. Alfred interviewed Kato for three hours to gain insight into the life and legacy of Margaret Sanger. Sanger inspired Kato to make birth control her life's work. Kato spoke about how the prewar, pronatalist Japanese government allowed Sanger to visit Japan in 1922 only on the condition that she not speak about birth control. This official opposition and the subsequent reaction, however, actually fueled interest in Sanger and her message, and caused her ideas to become widely known among the Japanese public. While in Japan, Sanger did manage to discuss family planning, but in English in a closed meeting. Oddly enough, the government honored Sanger after the second World War with the highest award presented to non-Japanese. Kato noted how unfortunate it was that Sanger died in 1966 without witnessing the realization of the UNFPA.
Whole-exome sequencing identifies USH2A mutations in a pseudo-dominant Usher syndrome family.
Zheng, Sui-Lian; Zhang, Hong-Liang; Lin, Zhen-Lang; Kang, Qian-Yan
2015-10-01
Usher syndrome (USH) is an autosomal recessive (AR) multi-sensory degenerative disorder leading to deaf-blindness. USH is clinically subdivided into three subclasses, and 10 genes have been identified thus far. Clinical and genetic heterogeneities in USH make a precise diagnosis difficult. A dominant‑like USH family in successive generations was identified, and the present study aimed to determine the genetic predisposition of this family. Whole‑exome sequencing was performed in two affected patients and an unaffected relative. Systematic data were analyzed by bioinformatic analysis to remove the candidate mutations via step‑wise filtering. Direct Sanger sequencing and co‑segregation analysis were performed in the pedigree. One novel and two known mutations in the USH2A gene were identified, and were further confirmed by direct sequencing and co‑segregation analysis. The affected mother carried compound mutations in the USH2A gene, while the unaffected father carried a heterozygous mutation. The present study demonstrates that whole‑exome sequencing is a robust approach for the molecular diagnosis of disorders with high levels of genetic heterogeneity.
Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting
NASA Astrophysics Data System (ADS)
Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.
1997-05-01
Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate laser mass spectrometry can possible be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic disease can be a very efficient step to reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.
BMPR1B mutation causes Pierre Robin sequence
Yao, Xu; Zhang, Rong; Yang, Hui; Zhao, Rui; Guo, Jihong; Jin, Ke; Mei, Haibo; Luo, Yongqi; Zhao, Liu; Tu, Ming; Zhu, Yimin
2017-01-01
Background We investigated a large family with Pierre Robin sequence (PRS). Aim of the study This study aims to determine the genetic cause of PRS. Results The reciprocal translocation t(4;6)(q22;p21) was identified to be segregated with PRS in a three-generation family. Whole-genome sequencing and Sanger sequencing successfully detected breakpoints in the intragenic regions of BMRP1B and GRM4. We hypothesized that PRS in this family was caused by (i) haploinsufficiency for BMPR1B or (ii) a gain of function mechanism mediated by the BMPR1B-GRM4 fusion gene. In an unrelated family, we identified another BMPR1B-splicing mutation that co-segregated with PRS. Conclusion We detected two BMPR1B mutations in two unrelated PRS families, suggesting that BMPR1B disruption is probably a cause of human PRS. Methods GTG banding, comparative genomic hybridization, whole-genome sequencing, and Sanger sequencing were performed to identify the gene causing PRS. PMID:28418932
Zhang, Xun; Wang, Yuehua; Gao, Ning; Wang, Jinfen
2014-02-01
To compare the application values of real-time quantitative PCR-Sanger sequencing and TaqMan probe method in the detection of KRAS and BRAF mutations, and to correlate KRAS/BRAF mutations with the clinicopathological characteristics in colorectal carcinomas. Genomic DNA of the tumor cells was extracted from formalin fixed paraffin embedded (FFPE) tissue samples of 344 colorectal carcinomas by microdissection. Real-time quantitative PCR-Sanger sequencing and TaqMan probe method were performed to detect the KRAS/BRAF mutations. The frequency and types of KRAS/BRAF mutations, clinicopathological characteristics and survival time were analyzed. KRAS mutations were detected in 39.8% (137/344) and 38.7% (133/344) of 344 colorectal carcinomas by using real-time quantitative PCR-Sanger sequencing and TaqMan probe method, respectively. BRAF mutation was detected in 4.7% (16/344) and 4.1% (14/344), respectively. There was no significant correlation between the two methods. The frequency of the KRAS mutation in female was higher than that in male (P < 0.05). The frequency of the BRAF mutation in colon was higher than that in rectum. The frequency of the BRAF mutation in stage III-IV cases was higher than that in stageI-II cases. The frequency of the BRAF mutation in signet ring cell carcinoma was higher than that in mucinous carcinoma and nonspecific adenocarcinoma had the lowest mutation rate. The frequency of the BRAF mutation in grade III cases was higher than that in grade II cases (P < 0.05). The overall concordance for the two methods of KRAS/BRAF mutation detection was 98.8% (kappa = 0.976). There was statistic significance between BRAF and KRAS mutations for the survival time of colorectal carcinomas (P = 0.039). There were no statistic significance between BRAF mutation type and BRAF/KRAS wild type (P = 0.058). (1) Compared with real-time quantitative PCR-Sanger sequencing, TaqMan probe method is better with regard to handling time, efficiency, repeatability, cost and equipment. (2) The frequency of the KRAS mutation is correlated with gender. BRAF mutation is correlated with primary tumor site, TNM stage, histological types and histological grades.(3) BRAF gene mutation is an independent prognostic marker for colorectal carcinomas.
USDA-ARS?s Scientific Manuscript database
The complete nucleotide sequence of a recently discovered Florida (FL) isolate of Hibiscus infecting Cilevirus (HiCV) was determined by Sanger sequencing. The movement- and coat- protein gene sequences of the HiCV-FL isolate are more divergent than other genes of the previously sequenced HiCV-HA (Ha...
Next Generation Sequencing at the University of Chicago Genomics Core
DOE Office of Scientific and Technical Information (OSTI.GOV)
Faber, Pieter
2013-04-24
The University of Chicago Genomics Core provides University of Chicago investigators (and external clients) access to State-of-the-Art genomics capabilities: next generation sequencing, Sanger sequencing / genotyping and micro-arrays (gene expression, genotyping, and methylation). The current presentation will highlight our capabilities in the area of ultra-high throughput sequencing analysis.
Chin, Ephrem L H; da Silva, Cristina; Hegde, Madhuri
2013-02-19
Detecting mutations in disease genes by full gene sequence analysis is common in clinical diagnostic laboratories. Sanger dideoxy terminator sequencing allows for rapid development and implementation of sequencing assays in the clinical laboratory, but it has limited throughput, and due to cost constraints, only allows analysis of one or at most a few genes in a patient. Next-generation sequencing (NGS), on the other hand, has evolved rapidly, although to date it has mainly been used for large-scale genome sequencing projects and is beginning to be used in the clinical diagnostic testing. One advantage of NGS is that many genes can be analyzed easily at the same time, allowing for mutation detection when there are many possible causative genes for a specific phenotype. In addition, regions of a gene typically not tested for mutations, like deep intronic and promoter mutations, can also be detected. Here we use 20 previously characterized Sanger-sequenced positive controls in disease-causing genes to demonstrate the utility of NGS in a clinical setting using standard PCR based amplification to assess the analytical sensitivity and specificity of the technology for detecting all previously characterized changes (mutations and benign SNPs). The positive controls chosen for validation range from simple substitution mutations to complex deletion and insertion mutations occurring in autosomal dominant and recessive disorders. The NGS data was 100% concordant with the Sanger sequencing data identifying all 119 previously identified changes in the 20 samples. We have demonstrated that NGS technology is ready to be deployed in clinical laboratories. However, NGS and associated technologies are evolving, and clinical laboratories will need to invest significantly in staff and infrastructure to build the necessary foundation for success.
Problem-Solving Test: Pyrosequencing
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2013-01-01
Terms to be familiar with before you start to solve the test: Maxam-Gilbert sequencing, Sanger sequencing, gel electrophoresis, DNA synthesis reaction, polymerase chain reaction, template, primer, DNA polymerase, deoxyribonucleoside triphosphates, orthophosphate, pyrophosphate, nucleoside monophosphates, luminescence, acid anhydride bond,…
Fernández-Caballero Rico, Jose Ángel; Chueca Porcuna, Natalia; Álvarez Estévez, Marta; Mosquera Gutiérrez, María Del Mar; Marcos Maeso, María Ángeles; García, Federico
2018-02-01
To show how to generate a consensus sequence from the information of massive parallel sequences data obtained from routine HIV anti-retroviral resistance studies, and that may be suitable for molecular epidemiology studies. Paired Sanger (Trugene-Siemens) and next-generation sequencing (NGS) (454 GSJunior-Roche) HIV RT and protease sequences from 62 patients were studied. NGS consensus sequences were generated using Mesquite, using 10%, 15%, and 20% thresholds. Molecular evolutionary genetics analysis (MEGA) was used for phylogenetic studies. At a 10% threshold, NGS-Sanger sequences from 17/62 patients were phylogenetically related, with a median bootstrap-value of 88% (IQR83.5-95.5). Association increased to 36/62 sequences, median bootstrap 94% (IQR85.5-98)], using a 15% threshold. Maximum association was at the 20% threshold, with 61/62 sequences associated, and a median bootstrap value of 99% (IQR98-100). A safe method is presented to generate consensus sequences from HIV-NGS data at 20% threshold, which will prove useful for molecular epidemiological studies. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
da Fonseca, Allex Jardim; Galvão, Renata Silva; Miranda, Angelica Espinosa; Ferreira, Luiz Carlos de Lima; Chen, Zigui
2016-05-01
To compare the diagnostic performance for HPV infection using three laboratorial techniques. Ninty-five cervicovaginal samples were randomly selected; each was tested for HPV DNA and genotypes using 3 methods in parallel: Multiplex-PCR, the Nested PCR followed by Sanger sequencing, and the Next_Gen Sequencing (NGS) with two assays (NGS-A1, NGS-A2). The study was approved by the Brazilian National IRB (CONEP protocol 16,800). The prevalence of HPV by the NGS assays was higher than that using the Multiplex-PCR (64.2% vs. 45.2%, respectively; P = 0.001) and the Nested-PCR (64.2% vs. 49.5%, respectively; P = 0.003). NGS also showed better performance in detecting high-risk HPV (HR-HPV) and HPV16. There was a weak interobservers agreement between the results of Multiplex-PCR and Nested-PCR in relation to NGS for the diagnosis of HPV infection, and a moderate correlation for HR-HPV detection. Both NGS assays showed a strong correlation for detection of HPVs (k = 0.86), HR-HPVs (k = 0.91), HPV16 (k = 0.92) and HPV18 (k = 0.91). NGS is more sensitive than the traditional Sanger sequencing and the Multiplex PCR to genotype HPVs, with promising ability to detect multiple infections, and may have the potential to establish an alternative method for the diagnosis and genotyping of HPV. © 2015 Wiley Periodicals, Inc.
Ammann, Sandra; Lehmberg, Kai; Zur Stadt, Udo; Klemann, Christian; Bode, Sebastian F N; Speckmann, Carsten; Janka, Gritta; Wustrau, Katharina; Rakhmanov, Mirzokhid; Fuchs, Ilka; Hennies, Hans C; Ehl, Stephan
2017-11-01
We report our experience in using flow cytometry-based immunological screening prospectively as a decision tool for the use of genetic studies in the diagnostic approach to patients with hemophagocytic lymphohistiocytosis (HLH). We restricted genetic analysis largely to patients with abnormal immunological screening, but included whole exome sequencing (WES) for those with normal findings upon Sanger sequencing. Among 290 children with suspected HLH analyzed between 2010 and 2014 (including 17 affected, but asymptomatic siblings), 87/162 patients with "full" HLH and 79/111 patients with "incomplete/atypical" HLH had normal immunological screening results. In 10 patients, degranulation could not be tested. Among the 166 patients with normal screening, genetic analysis was not performed in 107 (all with uneventful follow-up), while 154 single gene tests by Sanger sequencing in the remaining 59 patients only identified a single atypical CHS patient. Flow cytometry correctly predicted all 29 patients with FHL-2, XLP1 or 2. Among 85 patients with defective NK degranulation (including 13 asymptomatic siblings), 70 were Sanger sequenced resulting in a genetic diagnosis in 55 (79%). Eight patients underwent WES, revealing mutations in two known and one unknown cytotoxicity genes and one metabolic disease. FHL3 was the most frequent genetic diagnosis. Immunological screening provided an excellent decision tool for the need and depth of genetic analysis of HLH patients and provided functionally relevant information for rapid patient classification, contributing to a significant reduction in the time from diagnosis to transplantation in recent years.
Tu, Bin; Masaberg, Carly; Hou, Lihua; Behm, Daniel; Brescia, Peter; Cha, Nuri; Kariyawasam, Kanthi; Lee, Jar How; Nong, Thoa; Sells, John; Tausch, Paul; Yang, Ruyan; Ng, Jennifer; Hurley, Carolyn Katovich
2017-02-01
Sanger-based DNA sequencing of exons 2+3 of HLA class I alleles from a heterozygote frequently results in two or more alternative genotypes. This study was undertaken to reduce the time and effort required to produce a single high resolution HLA genotype. Samples were typed in parallel by Sanger sequencing and oligonucleotide probe hybridization. This workflow, together with optimization of analysis software, was tested and refined during the typing of over 42,000 volunteers for an unrelated hematopoietic progenitor cell donor registry. Next generation DNA sequencing (NGS) was applied to over 1000 of these samples to identify the alleles present within the G group designations. Single genotypes at G level resolution were obtained for over 95% of the loci without additional assays. The vast majority of alleles identified (>99%) were the primary allele giving the G groups their name. Only 0.7% of the alleles identified encoded protein variants that were not detected by a focus on the antigen recognition domain (ARD)-encoding exons. Our combined method routinely provides biologically relevant typing resolution at the level of the ARD. It can be applied to both single samples or to large volume typing supporting either bone marrow or solid organ transplantation using technologies currently available in many HLA laboratories. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
BRAF mutation testing in solid tumors: a methodological comparison.
Weyant, Grace W; Wisotzkey, Jeffrey D; Benko, Floyd A; Donaldson, Keri J
2014-09-01
Solid tumor genotyping has become standard of care for the characterization of proto-oncogene mutational status, which has traditionally been accomplished with Sanger sequencing. However, companion diagnostic assays and comparable laboratory-developed tests are becoming increasingly popular, such as the cobas 4800 BRAF V600 Mutation Test and the INFINITI KRAS-BRAF assay, respectively. This study evaluates and validates the analytical performance of the INFINITI KRAS-BRAF assay and compares concordance of BRAF status with two reference assays, the cobas test and Sanger sequencing. DNA extraction from FFPE tissue specimens was performed followed by multiplex PCR amplification and fluorescent label incorporation using allele-specific primer extension. Hybridization to a microarray, signal detection, and analysis were then performed. The limits of detection were determined by testing dilutions of mutant BRAF alleles within wild-type background DNA, and accuracy was calculated based on these results. The INFINITI KRAS-BRAF assay produced 100% concordance with the cobas test and Sanger sequencing and had sensitivity equivalent to the cobas assay. The INFINITI assay is repeatable with at least 95% accuracy in the detection of mutant and wild-type BRAF alleles. These results confirm that the INFINITI KRAS-BRAF assay is comparable to traditional sequencing and the Food and Drug Administration-approved companion diagnostic assay for the detection of BRAF mutations. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Pena, Loren DM; Jiang, Yong-Hui; Schoch, Kelly; Spillmann, Rebecca C.; Walley, Nicole; Stong, Nicholas; Horn, Sarah Rapisardo; Sullivan, Jennifer A.; McConkie-Rosell, Allyn; Kansagra, Sujay; Smith, Edward C.; El-Dairi, Mays; Bellet, Jane; Ann Keels, Martha; Jasien, Joan; Kranz, Peter G.; Noel, Richard; Nagaraj, Shashi K.; Lark, Robert K.; Wechsler, Daniel SG; del Gaudio, Daniela; Leung, Marco L.; Hendon, Laura G.; Parker, Collette C.; Jones, Kelly L.; Goldstein, David B.; Shashi, Vandana
2017-01-01
Purpose To describe examples of missed pathogenic variants on whole exome sequencing (WES) and the importance of deep phenotyping for further diagnostic testing. Methods Guided by phenotypic information, three children with negative WES underwent targeted single gene testing. Results Individual 1 had a clinical diagnosis consistent with infantile systemic hyalinosis, although WES and an NGS-based ANTXR2 test were negative. Sanger sequencing of ANTXR2 revealed a homozygous single base pair insertion, previously missed by the WES variant caller software. Individual 2 had neurodevelopmental regression and cerebellar atrophy, with no diagnosis on WES. New clinical findings prompted Sanger sequencing and copy number testing of PLA2G6. A novel homozygous deletion of the non-coding exon 1 (not included in the WES capture kit) was detected, with extension into the promoter, confirming the clinical suspicion of infantile neuroaxonal dystrophy. Individual 3 had progressive ataxia, spasticity and MRI changes of vanishing white matter leukoencephalopathy. An NGS leukodystrophy gene panel and WES showed a heterozygous pathogenic variant in EIF2B5; no deletions/duplications were detected. Sanger sequencing of EIF2B5 showed a frameshift indel, likely missed due to failure of alignment. Conclusions These cases illustrate potential pitfalls of WES/NGS testing, and the importance of phenotype-guided molecular testing in yielding diagnoses. PMID:28914269
Wu, Tonghua; Yin, Biao; Zhu, Yuanchang; Li, Guangui; Ye, Lijun; Liang, Desheng; Zeng, Yong
2017-12-01
To investigate the etiology of X-linked hypohidrotic ectodermal dysplasia (XLHED) in a family with an inversion of the X chromosome [inv(X)(p21q13)] and to achieve a healthy birth following preimplantation genetic diagnosis (PGD). Next generation sequencing (NGS) and Sanger sequencing analysis were carried out to define the inversion breakpoint. Multiple displacement amplification, amplification of breakpoint junction fragments, Sanger sequencing of exon 1 of ED1, haplotyping of informative short tandem repeat markers and gender determination were performed for PGD. NGS data of the proband sample revealed that the size of the possible inverted fragment was over 42Mb, spanning from position 26, 814, 206 to position 69, 231, 915 on the X chromosome. The breakpoints were confirmed by Sanger sequencing. A total of 5 blastocyst embryos underwent trophectoderm biopsy. Two embryos were diagnosed as carriers and three were unaffected. Two unaffected blastocysts were transferred and a singleton pregnancy was achieved. Following confirmation by prenatal diagnosis, a healthy baby was delivered. This is the first report of an XLHED family with inv(X). ED1 is disrupted by the X chromosome inversion in this XLHED family and embryos with the X chromosomal abnormality can be accurately identified by means of PGD. Copyright © 2017. Published by Elsevier B.V.
First report of Cocksfoot mottle virus infecting wheat (Triticum aestivum) in Ohio
USDA-ARS?s Scientific Manuscript database
Cocksfoot mottle virus (CfMV) was discovered in Ohio wheat during a 2016 field survey utilizing RNA-Seq to identify virus-like sequences. Virus sequences were confirmed by reverse transcriptase-polymerase chain reaction (RT-PCR) and Sanger sequencing, and CfMV was transmitted to orchardgrass and pas...
Next generation sequencing as a useful tool in the diagnostics of mosaicism in Alport syndrome.
Beicht, Sonja; Strobl-Wildemann, Gertrud; Rath, Sabine; Wachter, Oliver; Alberer, Martin; Kaminsky, Elke; Weber, Lutz T; Hinrichsen, Tanja; Klein, Hanns-Georg; Hoefele, Julia
2013-09-10
Alport syndrome (ATS) is a progressive hereditary nephropathy characterized by hematuria and/or proteinuria with structural defects of the glomerular basement membrane. It can be associated with extrarenal manifestations (high-tone sensorineural hearing loss and ocular abnormalities). Somatic mutations in COL4A5 (X-linked), COL4A3 and COL4A4 genes (both autosomal recessive and autosomal dominant) cause Alport syndrome. Somatic mosaicism in Alport patients is very rare. The reason for this may be due to the difficulty of detection. We report the case of a boy and his mother who presented with Alport syndrome. Mutational analysis showed the novel hemizygote pathogenic mutation c.2396-1G>A (IVS29-1G>A) at the splice acceptor site of the intron 29 exon 30 boundary of the COL4A5 gene in the boy. The mutation in the mother would not have been detected by Sanger sequencing without the knowledge of the mutational analysis result of her son. Further investigation of the mother using next generation sequencing showed somatic mosaicism and implied potential germ cell mosaicism. The mutation in the mother has most likely occurred during early embryogenesis. Analysis of tissue of different embryonic origin in the mother confirmed mosaicism in both mesoderm and ectoderm. Low grade mosaicism is very difficult to detect by Sanger sequencing. Next generation sequencing is increasingly used in the diagnostics and might improve the detection of mosaicism. In the case of definite clinical symptoms of ATS and missing detection of a mutation by Sanger sequencing, mutational analysis should be performed by next generation sequencing. Copyright © 2013 Elsevier B.V. All rights reserved.
Simpson, Jared
2018-01-24
Wellcome Trust Sanger Institute's Jared Simpson on Memory efficient sequence analysis using compressed data structures at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.
USDA-ARS?s Scientific Manuscript database
Nine different regions totaling 9.7 Mb of the 4.02 Gb Aegilops tauschii genome were sequenced using the Sanger sequencing technology and compared with orthologous Brachypodium distachyon, Oryza sativa (rice) and Sorghum bicolor (sorghum) genomic sequences. The ancestral gene content in these regio...
Dr. Sanger's Apprentice: A Computer-Aided Instruction to Protein Sequencing.
ERIC Educational Resources Information Center
Schmidt, Thomas G.; Place, Allen R.
1985-01-01
Modeled after the program "Mastermind," this program teaches students the art of protein sequencing. The program (written in Turbo Pascal for the IBM PC, requiring 128K, a graphics adapter, and an 8070 mathematics coprocessor) generates a polypeptide whose sequence and length can be user-defined (for practice) or computer-generated (for…
Mou, Yi; Athar, Muhammad Ammar; Wu, Yuzhen; Xu, Ye; Wu, Jianhua; Xu, Zhenxing; Hayder, Zulfiqar; Khan, Saeed; Idrees, Muhammad; Nasir, Muhammad Israr; Liao, Yiqun; Li, Qingge
2016-11-01
Detection of anti-hepatitis B virus (HBV) drug resistance mutations is critical for therapeutic decisions for chronic hepatitis B virus infection. We describe a real-time PCR-based assay using multicolor melting curve analysis (MMCA) that could accurately detect 24 HBV nucleotide mutations at 10 amino acid positions in the reverse transcriptase region of the HBV polymerase gene. The two-reaction assay had a limit of detection of 5 copies per reaction and could detect a minor mutant population (5% of the total population) with the reverse transcriptase M204V amino acid mutation in the presence of the major wild-type population when the overall concentration was 10 4 copies/μl. The assay could be finished within 3 h, and the cost of materials for each sample was less than $10. Clinical validation studies using three groups of samples from both nucleos(t)ide analog-treated and -untreated patients showed that the results for 99.3% (840/846) of the samples and 99.9% (8,454/8,460) of the amino acids were concordant with those of Sanger sequencing of the PCR amplicon from the HBV reverse transcriptase region (PCR Sanger sequencing). HBV DNA in six samples with mixed infections consisting of minor mutant subpopulations was undetected by the PCR Sanger sequencing method but was detected by MMCA, and the results were confirmed by coamplification at a lower denaturation temperature-PCR Sanger sequencing. Among the treated patients, 48.6% (103/212) harbored viruses that displayed lamivudine monoresistance, adefovir monoresistance, entecavir resistance, or lamivudine and adefovir resistance. Among the untreated patients, the Chinese group had more mutation-containing samples than did the Pakistani group (3.3% versus 0.56%). Because of its accuracy, rapidness, wide-range coverage, and cost-effectiveness, the real-time PCR assay could be a robust tool for the detection if anti-HBV drug resistance mutations in resource-limited countries. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Diagnostic Applications of Next Generation Sequencing in Immunogenetics and Molecular Oncology
Grumbt, Barbara; Eck, Sebastian H.; Hinrichsen, Tanja; Hirv, Kaimo
2013-01-01
Summary With the introduction of the next generation sequencing (NGS) technologies, remarkable new diagnostic applications have been established in daily routine. Implementation of NGS is challenging in clinical diagnostics, but definite advantages and new diagnostic possibilities make the switch to the technology inevitable. In addition to the higher sequencing capacity, clonal sequencing of single molecules, multiplexing of samples, higher diagnostic sensitivity, workflow miniaturization, and cost benefits are some of the valuable features of the technology. After the recent advances, NGS emerged as a proven alternative for classical Sanger sequencing in the typing of human leukocyte antigens (HLA). By virtue of the clonal amplification of single DNA molecules ambiguous typing results can be avoided. Simultaneously, a higher sample throughput can be achieved by tagging of DNA molecules with multiplex identifiers and pooling of PCR products before sequencing. In our experience, up to 380 samples can be typed for HLA-A, -B, and -DRB1 in high-resolution during every sequencing run. In molecular oncology, NGS shows a markedly increased sensitivity in comparison to the conventional Sanger sequencing and is developing to the standard diagnostic tool in detection of somatic mutations in cancer cells with great impact on personalized treatment of patients. PMID:23922545
Hwang, Sang Mee; Lee, Ki Chan; Lee, Min Seob; Park, Kyoung Un
2018-01-01
Transition to next generation sequencing (NGS) for BRCA1 / BRCA2 analysis in clinical laboratories is ongoing but different platforms and/or data analysis pipelines give different results resulting in difficulties in implementation. We have evaluated the Ion Personal Genome Machine (PGM) Platforms (Ion PGM, Ion PGM Dx, Thermo Fisher Scientific) for the analysis of BRCA1 /2. The results of Ion PGM with OTG-snpcaller, a pipeline based on Torrent mapping alignment program and Genome Analysis Toolkit, from 75 clinical samples and 14 reference DNA samples were compared with Sanger sequencing for BRCA1 / BRCA2 . Ten clinical samples and 14 reference DNA samples were additionally sequenced by Ion PGM Dx with Torrent Suite. Fifty types of variants including 18 pathogenic or variants of unknown significance were identified from 75 clinical samples and known variants of the reference samples were confirmed by Sanger sequencing and/or NGS. One false-negative results were present for Ion PGM/OTG-snpcaller for an indel variant misidentified as a single nucleotide variant. However, eight discordant results were present for Ion PGM Dx/Torrent Suite with both false-positive and -negative results. A 40-bp deletion, a 4-bp deletion and a 1-bp deletion variant was not called and a false-positive deletion was identified. Four other variants were misidentified as another variant. Ion PGM/OTG-snpcaller showed acceptable performance with good concordance with Sanger sequencing. However, Ion PGM Dx/Torrent Suite showed many discrepant results not suitable for use in a clinical laboratory, requiring further optimization of the data analysis for calling variants.
Albitar, Adam; Ma, Wanlong; DeDios, Ivan; Estella, Jeffrey; Ahn, Inhye; Farooqui, Mohammed; Wiestner, Adrian; Albitar, Maher
2017-03-14
Patients with chronic lymphocytic leukemia (CLL) that develop resistance to Bruton tyrosine kinase (BTK) inhibitors are typically positive for mutations in BTK or phospholipase c gamma 2 (PLCγ2). We developed a high sensitivity (HS) assay utilizing wild-type blocking polymerase chain reaction achieved via bridged and locked nucleic acids. We used this high sensitivity assay in combination with Sanger sequencing and next generation sequencing (NGS) and tested cellular DNA and cell-free DNA (cfDNA) from patients with CLL treated with the BTK inhibitor, ibrutinib. We also tested ibrutinib-naïve patients with CLL. HS testing achieved 100x greater sensitivity than Sanger. HS Sanger sequencing was capable of detecting < 1 mutant allele in background of 1000 wild-type alleles (1:1000). Similar sensitivity was achieved with HS NGS. No BTK or PLCγ2 mutations were detected in any of the 44 ibrutinib-naïve CLL patients. We demonstrate that without the HS testing 56% of positive samples would have been missed for BTK and 85% of PLCγ2 would have been missed. With the use of HS, we were able to detect multiple mutant clones in the same sample in 37.5% of patients; most would have been missed without HS testing. We also demonstrate that with HS sequencing, plasma cfDNA is more reliable than cellular DNA in detecting mutations. Our studies indicate that wild-type blocking and HS sequencing is necessary for proper and early detection of BTK or PLCγ2 mutations in monitoring patients treated with BTK inhibitors. Furthermore, cfDNA from plasma is very reliable sample-type for testing.
Granato, Paul A; DeGilio, Marcia A; Wilson, Elsie M
2016-11-01
The Lyra™ Direct HSV 1+2/VZV Assay is a moderately complex, multiplex PCR assay that qualitatively detects the presence of HSV 1, HSV 2, and VZV DNA in cutaneous and mucocutaneous specimens with a time-to-result of less than 60min. To report a one-year laboratory experience using Lyra assay for testing cutaneous and mucocutaneous specimens for HSV and VZV that resulted in the unexpected detection of VZV in 14 male and female genital specimens. Over a one-year period, 2113 cutaneous and mucocutaneous specimens from male and female patients were submitted for testing using the Lyra assay. An unexpected 14 genital specimens were positive for the presence VZV DNA. Eleven of the 14 specimens were available for confirmatory testing using two alternative molecular methods and Sanger sequencing. Fourteen male and female genital specimens were positive for the presence of VZV DNA. All of the 11 specimens (9 female and 2 male) that were available for confirmatory testing by the alternative molecular method and Sanger sequencing were confirmed as containing VZV DNA. Using of the Lyra assay over a one-year time period, VZV DNA was detected in 126 specimens of which 14 (11.1%) were from male and female genital sites. This rare and unexpected finding suggests that the appearance of zoster lesions in the genital area may not be as uncommon as previously thought and that this finding would have considerable impact on patient counseling and public health considerations. Copyright © 2016 Elsevier B.V. All rights reserved.
Ten Broek, Roel W; Bekers, Elise M; de Leng, Wendy W J; Strengman, Eric; Tops, Bastiaan B J; Kutzner, Heinz; Leeuwis, Jan Willem; van Gorp, Joost M; Creytens, David H; Mentzel, Thomas; van Diest, Paul J; Eijkelenboom, Astrid; Flucke, Uta
2017-12-01
Spindle cell hemangioma (SCH) is a distinct vascular soft-tissue lesion characterized by cavernous blood vessels and a spindle cell component mainly occurring in the distal extremities of young adults. The majority of cases harbor heterozygous mutations in IDH1/2 sporadically or rarely in association with Maffucci syndrome. However, based on mosaicism and accordingly a low percentage of lesional cells harboring a mutant allele, detection can be challenging. We tested 19 sporadic SCHs by Sanger sequencing, multiplex ligation-dependent probe amplification (MLPA), conventional next generation sequencing (NGS), and NGS using a single molecule molecular inversion probes (smMIP)-based library preparation to compare their diagnostic value. Out of 10 cases tested by Sanger sequencing and 2 analyzed using MLPA, 4 and 1, respectively, revealed a mutation in IDH1 (p.R132C). The 7 remaining negative cases and additional 6 cases were investigated using smMIP/NGS, showing hot spot mutations in IDH1 (p.R132C) (8 cases) and IDH2 (3 cases; twice p.R172S and once p.R172G, respectively). One case was negative. Owing to insufficient DNA quality and insufficient coverage, 2 cases were excluded. In total, in 16 out of 17 cases successfully tested, an IDH1/2 mutation was found. Given that IDH1/2 mutations were absent in 161 other vascular lesions tested by smMIP/NGS, the mutation can be considered as highly specific for SCH. © 2017 Wiley Periodicals, Inc.
New single-copy nuclear genes for scale insect systematics
USDA-ARS?s Scientific Manuscript database
Despite the advent of next-generation sequencing, the polymerase chain reaction (PCR) and Sanger sequencing remain useful tools for molecular identification and systematics. To date, molecular systematics of scale insects has been constrained by the paucity of loci that researchers have been able to...
Nanopore Kinetic Proofreading of DNA Sequences
NASA Astrophysics Data System (ADS)
Ling, Xinsheng Sean
The concept of DNA sequencing using the time dependence of the nanopore ionic current was proposed in 1996 by Kasianowicz, Brandin, Branton, and Deamer (KBBD). The KBBD concept has generated tremendous amount interests in recent decade. In this talk, I will review the current understanding of the DNA ``translocation'' dynamics and how it can be described by Schrodinger's 1915 paper on first-passage-time distribution function. Schrodinger's distribution function can be used to give a rigorous criterion for achieving nanopore DNA sequencing which turns out to be identical to that of gel electrophoresis used by Sanger in the first-generation Sanger method. A nanopore DNA sequencing technology also requires discrimination of bases with high accuracies. I will describe a solid-state nanopore sandwich structure that can function as a proofreading device capable of discriminating between correct and incorrect hybridization probes with an accuracy rivaling that of high-fidelity DNA polymerases. The latest results from Nanjing will be presented. This work is supported by China 1000-Talent Program at Southeast University, Nanjing, China.
Tracking B-Cell Repertoires and Clonal Histories in Normal and Malignant Lymphocytes.
Weston-Bell, Nicola J; Cowan, Graeme; Sahota, Surinder S
2017-01-01
Methods for tracking B-cell repertoires and clonal history in normal and malignant B-cells based on immunoglobulin variable region (IGV) gene analysis have developed rapidly with the advent of massive parallel next-generation sequencing (mpNGS) protocols. mpNGS permits a depth of analysis of IGV genes not hitherto feasible, and presents challenges of bioinformatics analysis, which can be readily met by current pipelines. This strategy offers a potential resolution of B-cell usage at a depth that may capture fully the natural state, in a given biological setting. Conventional methods based on RT-PCR amplification and Sanger sequencing are also available where mpNGS is not accessible. Each method offers distinct advantages. Conventional methods for IGV gene sequencing are readily adaptable to most laboratories and provide an ease of analysis to capture salient features of B-cell use. This chapter describes two methods in detail for analysis of IGV genes, mpNGS and conventional RT-PCR with Sanger sequencing.
Hanriot, Lucie; Keime, Céline; Gay, Nadine; Faure, Claudine; Dossat, Carole; Wincker, Patrick; Scoté-Blachon, Céline; Peyron, Christelle; Gandrillon, Olivier
2008-01-01
Background "Open" transcriptome analysis methods allow to study gene expression without a priori knowledge of the transcript sequences. As of now, SAGE (Serial Analysis of Gene Expression), LongSAGE and MPSS (Massively Parallel Signature Sequencing) are the mostly used methods for "open" transcriptome analysis. Both LongSAGE and MPSS rely on the isolation of 21 pb tag sequences from each transcript. In contrast to LongSAGE, the high throughput sequencing method used in MPSS enables the rapid sequencing of very large libraries containing several millions of tags, allowing deep transcriptome analysis. However, a bias in the complexity of the transcriptome representation obtained by MPSS was recently uncovered. Results In order to make a deep analysis of mouse hypothalamus transcriptome avoiding the limitation introduced by MPSS, we combined LongSAGE with the Solexa sequencing technology and obtained a library of more than 11 millions of tags. We then compared it to a LongSAGE library of mouse hypothalamus sequenced with the Sanger method. Conclusion We found that Solexa sequencing technology combined with LongSAGE is perfectly suited for deep transcriptome analysis. In contrast to MPSS, it gives a complex representation of transcriptome as reliable as a LongSAGE library sequenced by the Sanger method. PMID:18796152
Liang, Chanjuan; van Dijk, Jeroen P; Scholtens, Ingrid M J; Staats, Martijn; Prins, Theo W; Voorhuijzen, Marleen M; da Silva, Andrea M; Arisi, Ana Carolina Maisonnave; den Dunnen, Johan T; Kok, Esther J
2014-04-01
The growing number of biotech crops with novel genetic elements increasingly complicates the detection of genetically modified organisms (GMOs) in food and feed samples using conventional screening methods. Unauthorized GMOs (UGMOs) in food and feed are currently identified through combining GMO element screening with sequencing the DNA flanking these elements. In this study, a specific and sensitive qPCR assay was developed for vip3A element detection based on the vip3Aa20 coding sequences of the recently marketed MIR162 maize and COT102 cotton. Furthermore, SiteFinding-PCR in combination with Sanger, Illumina or Pacific BioSciences (PacBio) sequencing was performed targeting the flanking DNA of the vip3Aa20 element in MIR162. De novo assembly and Basic Local Alignment Search Tool searches were used to mimic UGMO identification. PacBio data resulted in relatively long contigs in the upstream (1,326 nucleotides (nt); 95 % identity) and downstream (1,135 nt; 92 % identity) regions, whereas Illumina data resulted in two smaller contigs of 858 and 1,038 nt with higher sequence identity (>99 % identity). Both approaches outperformed Sanger sequencing, underlining the potential for next-generation sequencing in UGMO identification.
Sutton, Lesley-Ann; Ljungström, Viktor; Mansouri, Larry; Young, Emma; Cortese, Diego; Navrkalova, Veronika; Malcikova, Jitka; Muggen, Alice F; Trbusek, Martin; Panagiotidis, Panagiotis; Davi, Frederic; Belessi, Chrysoula; Langerak, Anton W; Ghia, Paolo; Pospisilova, Sarka; Stamatopoulos, Kostas; Rosenquist, Richard
2015-03-01
Next-generation sequencing has revealed novel recurrent mutations in chronic lymphocytic leukemia, particularly in patients with aggressive disease. Here, we explored targeted re-sequencing as a novel strategy to assess the mutation status of genes with prognostic potential. To this end, we utilized HaloPlex targeted enrichment technology and designed a panel including nine genes: ATM, BIRC3, MYD88, NOTCH1, SF3B1 and TP53, which have been linked to the prognosis of chronic lymphocytic leukemia, and KLHL6, POT1 and XPO1, which are less characterized but were found to be recurrently mutated in various sequencing studies. A total of 188 chronic lymphocytic leukemia patients with poor prognostic features (unmutated IGHV, n=137; IGHV3-21 subset #2, n=51) were sequenced on the HiSeq 2000 and data were analyzed using well-established bioinformatics tools. Using a conservative cutoff of 10% for the mutant allele, we found that 114/180 (63%) patients carried at least one mutation, with mutations in ATM, BIRC3, NOTCH1, SF3B1 and TP53 accounting for 149/177 (84%) of all mutations. We selected 155 mutations for Sanger validation (variant allele frequency, 10-99%) and 93% (144/155) of mutations were confirmed; notably, all 11 discordant variants had a variant allele frequency between 11-27%, hence at the detection limit of conventional Sanger sequencing. Technical precision was assessed by repeating the entire HaloPlex procedure for 63 patients; concordance was found for 77/82 (94%) mutations. In summary, this study demonstrates that targeted next-generation sequencing is an accurate and reproducible technique potentially suitable for routine screening, eventually as a stand-alone test without the need for confirmation by Sanger sequencing. Copyright© Ferrata Storti Foundation.
Thurber, Mary I; Ghai, Ria R; Hyeroba, David; Weny, Geoffrey; Tumukunde, Alex; Chapman, Colin A; Wiseman, Roger W; Dinis, Jorge; Steeil, James; Greiner, Ellis C; Friedrich, Thomas C; O'Connor, David H; Goldberg, Tony L
2013-07-01
Hemoparasites of the apicomplexan family Plasmodiidae include the etiological agents of malaria, as well as a suite of non-human primate parasites from which the human malaria agents evolved. Despite the significance of these parasites for global health, little information is available about their ecology in multi-host communities. Primates were investigated in Kibale National Park, Uganda, where ecological relationships among host species are well characterized. Blood samples were examined for parasites of the genera Plasmodium and Hepatocystis using microscopy and PCR targeting the parasite mitochondrial cytochrome b gene, followed by Sanger sequencing. To assess co-infection, "deep sequencing" of a variable region within cytochrome b was performed. Out of nine black-and-white colobus (Colobus guereza), one blue guenon (Cercopithecus mitis), five grey-cheeked mangabeys (Lophocebus albigena), 23 olive baboons (Papio anubis), 52 red colobus (Procolobus rufomitratus) and 12 red-tailed guenons (Cercopithecus ascanius), 79 infections (77.5%) were found, all of which were Hepatocystis spp. Sanger sequencing revealed 25 different parasite haplotypes that sorted phylogenetically into six species-specific but morphologically similar lineages. "Deep sequencing" revealed mixed-lineage co-infections in baboons and red colobus (41.7% and 64.7% of individuals, respectively) but not in other host species. One lineage infecting red colobus also infected baboons, but always as the minor variant, suggesting directional cross-species transmission. Hepatocystis parasites in this primate community are a diverse assemblage of cryptic lineages, some of which co-infect hosts and at least one of which can cross primate species barriers. Copyright © 2013 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
The quest for rare variants: pooled multiplexed next generation sequencing in plants.
Marroni, Fabio; Pinosio, Sara; Morgante, Michele
2012-01-01
Next generation sequencing (NGS) instruments produce an unprecedented amount of sequence data at contained costs. This gives researchers the possibility of designing studies with adequate power to identify rare variants at a fraction of the economic and labor resources required by individual Sanger sequencing. As of today, few research groups working in plant sciences have exploited this potentiality, showing that pooled NGS provides results in excellent agreement with those obtained by individual Sanger sequencing. The aim of this review is to convey to the reader the general ideas underlying the use of pooled NGS for the identification of rare variants. To facilitate a thorough understanding of the possibilities of the method, we will explain in detail the possible experimental and analytical approaches and discuss their advantages and disadvantages. We will show that information on allele frequency obtained by pooled NGS can be used to accurately compute basic population genetics indexes such as allele frequency, nucleotide diversity, and Tajima's D. Finally, we will discuss applications and future perspectives of the multiplexed NGS approach.
Ashrafi, Hamid; Hill, Theresa; Stoffel, Kevin; Kozik, Alexander; Yao, Jiqiang; Chin-Wo, Sebastian Reyes; Van Deynze, Allen
2012-10-30
Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80-120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project.
Mutational Signature Mark Cancer’s Smoking Gun
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alexandrov, Ludmil
A broad computational study of cancer genome sequences by Los Alamos National Laboratory with the UK’s Wellcome Trust Sanger Institute and other collaborators identifies telltale mutational signatures associated with smoking tobacco. The research demonstrates, for the first time, that smoking increases cancer risk by causing somatic mutations in tissues directly and indirectly exposed to tobacco smoke. The international study was published in the November 4 issue of Science. The analysis shows that tobacco smoking causes mutations leading to cancer by multiple distinct mechanisms, including by damaging DNA in organs and by speeding up a mutational cellular clock.
Mutational Signature Mark Cancerâs Smoking Gun
Alexandrov, Ludmil
2018-06-13
A broad computational study of cancer genome sequences by Los Alamos National Laboratory with the UKâs Wellcome Trust Sanger Institute and other collaborators identifies telltale mutational signatures associated with smoking tobacco. The research demonstrates, for the first time, that smoking increases cancer risk by causing somatic mutations in tissues directly and indirectly exposed to tobacco smoke. The international study was published in the November 4 issue of Science. The analysis shows that tobacco smoking causes mutations leading to cancer by multiple distinct mechanisms, including by damaging DNA in organs and by speeding up a mutational cellular clock.
Nanopore Technology: A Simple, Inexpensive, Futuristic Technology for DNA Sequencing.
Gupta, P D
2016-10-01
In health care, importance of DNA sequencing has been fully established. Sanger's Capillary Electrophoresis DNA sequencing methodology is time consuming, cumbersome, hence become more expensive. Lately, because of its versatility DNA sequencing became house hold name, and therefore, there is an urgent need of simple, fast, inexpensive, DNA sequencing technology. In the beginning of this century efforts were made, and Nanopore DNA sequencing technology was developed; still it is infancy, nevertheless, it is the futuristic technology.
Transcriptome assembly, gene annotation and tissue gene expression atlas of the rainbow trout
USDA-ARS?s Scientific Manuscript database
Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complimented by transcriptome information that will enhance genome assembly and annotation. Previously, we reported a transcriptome reference sequence using a 19X coverage of Sanger and 454-pyrosequencing dat...
Liu, Yong; Cao, Yu; Li, Yaxiong; Lei, Dongyun; Li, Lin; Hou, Zong Liu; Han, Shen; Meng, Mingyao; Shi, Jianlin; Zhang, Yayong; Wang, Yi; Niu, Zhaoyi; Xie, Yanhua; Xiao, Benshan; Wang, Yuanfei; Li, Xiao; Yang, Lirong
2018-01-01
Background Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. Material/Methods Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. Results From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (P<0.05); variants in FOXL2 and MYH6 were found in patients with isolated, sporadic ASD (P<5×10−4). Conclusions This was the first study that demonstrated variants in FOXL2 and HYDIN associated with sporadic ASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations. PMID:29505555
Liu, Yong; Cao, Yu; Li, Yaxiong; Lei, Dongyun; Li, Lin; Hou, Zong Liu; Han, Shen; Meng, Mingyao; Shi, Jianlin; Zhang, Yayong; Wang, Yi; Niu, Zhaoyi; Xie, Yanhua; Xiao, Benshan; Wang, Yuanfei; Li, Xiao; Yang, Lirong; Wang, Wenju; Jiang, Lihong
2018-03-05
BACKGROUND Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. MATERIAL AND METHODS Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. RESULTS From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (P<0.05); variants in FOXL2 and MYH6 were found in patients with isolated, sporadic ASD (P<5×10^-4). CONCLUSIONS This was the first study that demonstrated variants in FOXL2 and HYDIN associated with sporadic ASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations.
Mutation analysis of seven known glaucoma-associated genes in Chinese patients with glaucoma.
Huang, Xiaobo; Li, Miaoling; Guo, Xiangming; Li, Shiqiang; Xiao, Xueshan; Jia, Xiaoyun; Liu, Xing; Zhang, Qingjiong
2014-05-13
To evaluate mutations in the MYOC, WDR36, OPTN, OPA1, NTF4, CYP1B1, and LTBP2 genes in a cohort of Chinese patients with primary glaucoma. Genomic DNA was prepared from 683 unrelated patients, including 50 with primary congenital glaucoma, 104 with juvenile open-angle glaucoma (JOAG), 186 with primary open-angle glaucoma (POAG), and 343 with primary angle-closure glaucoma (PACG). Mutations in the seven genes in 257 patients (36 with JOAG, 89 with POAG, and 132 with PACG) were initially analyzed by exome sequencing and then confirmed by Sanger sequencing. In addition, Sanger sequencing was used to detect MYOC mutations in the remaining 426 patients. Exome sequencing identified 19 mutations (6 in MYOC, 9 in WDR36, 3 in OPA1, and 1 in OPTN) in 20 of 257 patients, including 4 patients with JOAG, 8 patients with POAG, and 8 patients with PACG. No mutation was detected in the other three genes. In addition, Sanger sequencing detected additional MYOC mutations in 5 of the remaining 426 patients, including 3 patients with JOAG and 2 patients with POAG. Twenty-two mutations in MYOC, WDR36, OPA1, and OPTN were detected in 25 of the 683 patients with primary glaucoma, including nine MYOC mutations in 11 patients, nine WDR36 mutations in 11 patients, three OPA1 mutations in 3 patients, and one OPTN mutation in a patient who also carried a MYOC mutation. Eight mutations in MYOC, WDR36, and OPA1 in 8 of the 343 PACG patients are of uncertain significance and need to be analyzed further. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.
CRISPR/Cas9-Based Multiplex Genome Editing in Monocot and Dicot Plants.
Ma, Xingliang; Liu, Yao-Guang
2016-07-01
The clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-mediated genome targeting system has been applied to a variety of organisms, including plants. Compared to other genome-targeting technologies such as zinc-finger nucleases (ZFNs) and transcription activator-like effector nucleases (TALENs), the CRISPR/Cas9 system is easier to use and has much higher editing efficiency. In addition, multiple "single guide RNAs" (sgRNAs) with different target sequences can be designed to direct the Cas9 protein to multiple genomic sites for simultaneous multiplex editing. Here, we present a procedure for highly efficient multiplex genome targeting in monocot and dicot plants using a versatile and robust CRISPR/Cas9 vector system, emphasizing the construction of binary constructs with multiple sgRNA expression cassettes in one round of cloning using Golden Gate ligation. We also describe the genotyping of targeted mutations in transgenic plants by direct Sanger sequencing followed by decoding of superimposed sequencing chromatograms containing biallelic or heterozygous mutations using the Web-based tool DSDecode. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.
Unlocking Short Read Sequencing for Metagenomics
Rodrigue, Sébastien; Materna, Arne C.; Timberlake, Sonia C.; ...
2010-07-28
We describe an experimental and computational pipeline yielding millions of reads that can exceed 200 bp with quality scores approaching that of traditional Sanger sequencing. The method combines an automatable gel-less library construction step with paired-end sequencing on a short-read instrument. With appropriately sized library inserts, mate-pair sequences can overlap, and we describe the SHERA software package that joins them to form a longer composite read.
Mikkelsen, Martin; Frank-Hansen, Rune; Hansen, Anders J; Morling, Niels
2014-09-01
of sequencing of whole mitochondrial genome, HV1 and HV2 DNA with the second generation system (SGS) Roche 454 GS Junior were compared with results of Sanger sequencing and SNP typing with SNaPshot single base extension detected with MALDI-TOF and capillary electrophoresis. We investigated the performance of the software analysis of the data, reproducibility, ability to sequence homopolymeric regions, detection of mixtures and heteroplasmy as well as the implications of the depth of coverage. We found full reproducibility between samples sequenced twice with SGS. We found close to full concordance between the mtDNA sequences of 26 samples obtained with (1) the 454 SGS method using a depth of coverage above 100 and (2) Sanger sequencing and SNP typing. The discrepancies were primarily observed in homopolymeric regions. The 454 SGS method was able to sequence 95% of the reads correctly in homopolymers up to 4 bases, and up to 6 bases could be sequenced with similar success if the results were carefully, visually inspected. The 454 technology was able to detect mixtures or heteroplasmy of approximately 10%. We detected previously unreported heteroplasmy in the GM9947A component of the NIST human mitochondrial DNA SRM-2392 standard reference material. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Xu, Peiwen; Zou, Yang; Li, Jie; Huang, Sexin; Gao, Ming; Kang, Ranran; Xie, Hongqiang; Wang, Lijuan; Yan, Junhao; Gao, Yuan
2018-04-10
To assess the value of droplet digital PCR (ddPCR) for non-invasive prenatal diagnosis of single gene disease in two families. Paternal mutation in cell-free DNA derived from the maternal blood and amniotic fluid DNA was detected by ddPCR. Suspected mutation in the amniotic fluid DNA was verified with Sanger sequencing. The result of ddPCR and Sanger sequencing indicated that the fetuses have carried pathogenic mutations from the paternal side in both families. Droplet digital PCR can accurately detect paternal mutation carried by the fetus, and it is sensitive and reliable for analyzing trace samples. This method may be applied for the diagnosis of single gene diseases caused by paternal mutation using peripheral blood sample derived from the mother.
Bai, Y; Liu, N; Kong, X D; Yan, J; Qin, Z B; Wang, B
2016-12-07
Objective: To analyze the mutations of PAX3 gene in two Waardenburg syndrome type Ⅰ (WS1) pedigrees and make prenatal diagnosis for the high-risk 18-week-old fetus. Methods: PAX3 gene was first analyzed by Sanger sequencing and multiplex ligation-dependent probe amplification(MLPA) for detecting pathogenic mutation of the probands of the two pedigrees. The mutations were confirmed by MLPA and Sanger in parents and unrelated healthy individuals.Prenatal genetic diagnosis for the high-risk fetus was performed by amniotic fluid cell after genotyping. Results: A heterozygous PAX3 gene gross deletion (E7 deletion) was identified in all patients from WS1-01 family, and not found in 20 healthy individuals.Prenatal diagnosis in WS1-01 family indicated that the fetus was normal. Molecular studies identified a novel deletion mutation c. 1385_1386delCT within the PAX3 gene in all affected WS1-02 family members, but in none of the unaffected relatives and 200 healthy individuals. Conclusions: PAX3 gene mutation is etiological for two WS1 families. Sanger sequencing plus MLPA is effective and accurate for making gene diagnosis and prenatal diagnosis.
Smeele, Zoe E; Ainley, David G; Varsani, Arvind
2018-01-02
The Antarctic, sub-Antarctic islands and surrounding sea-ice provide a unique environment for the existence of organisms. Nonetheless, birds and seals of a variety of species inhabit them, particularly during their breeding seasons. Early research on Antarctic wildlife health, using serology-based assays, showed exposure to viruses in the families Birnaviridae, Flaviviridae, Herpesviridae, Orthomyxoviridae and Paramyxoviridae circulating in seals (Phocidae), penguins (Spheniscidae), petrels (Procellariidae) and skuas (Stercorariidae). It is only during the last decade or so that polymerase chain reaction-based assays have been used to characterize viruses associated with Antarctic animals. Furthermore, it is only during the last five years that full/whole genomes of viruses (adenoviruses, anelloviruses, orthomyxoviruses, a papillomavirus, paramyoviruses, polyomaviruses and a togavirus) have been sequenced using Sanger sequencing or high throughput sequencing (HTS) approaches. This review summaries the knowledge of animal Antarctic virology and discusses potential future directions with the advent of HTS in virus discovery and ecology. Copyright © 2017 Elsevier B.V. All rights reserved.
Bai, D Y; Zhang, H P; Zhong, S; Suo, W H; Gao, D H; Ding, Y; Tu, J H
2016-12-23
Objective: To investigate the clinical application value of combined detection of ALK fusion gene and c-ros oncogene 1 receptor tyrosine kinase (ROS1) fusion gene in non-small cell lung cancer (NSCLC) using real-time fluorescent PCR. Methods: A kit for combined detection of ALK fusion gene and ROS1 fusion gene based on fluorescent PCR was used to simultaneously detect the two fusion genes in 302 cases of NSCLC specimens. The results were validated through Sanger sequencing. The consistency of the two detection methods was analyzed. Results: All 302 cases of NSCLC specimens were successfully analyzed through fluorescent PCR (302/302). 12 cases (4.0%) were found to contain ALK fusion gene, including 3 cases with ALK-M1, 3 with ALK-M2, 3 with ALK-M3, 1 with ALK-M4, and 2 with ALK-M6 fusion gene.12 cases (4.0%) were found to contain ROS1 fusion gene, including 1 case with ROS1-M7, 8 cases with ROS1-M8, 1 case with ROS1-M12, 1 case with ROS1-M14, and 1 case with double-positive ROS1-M3 and ROS1-M8 fusion genes. The total detection rate of ALK fusion gene and ROS1 fusion gene was 7.9% (24/302) and 278 cases showed to be negative for ALK fusion gene and ROS1 fusion gene. The successful detection rates for Sanger DNA sequencing were also 100%. The positive, negative and total coincidence rates obtained by real-time fluorescent PCR and by Sanger DNA sequencing were all 100%. Conclusions: The results of Sanger DNA sequencing demonstrate that the real-time fluorescent PCR assay is equally effective in detecting ALK and ROS1 fusion genes in NSCLC tissues. Furthermore, real-time fluorescent PCR assay can be used to detect trace ALK and ROS1 fusion gene simultaneously in tiny samples, and can save time and avoid repeated sampling. It is worthy of recommendation as a rapid and reliable detection technique.
The tomato genome sequence provides insight into fleshy fruit evolution
USDA-ARS?s Scientific Manuscript database
The genome of the inbred tomato cultivar ‘Heinz 1706’ was sequenced and assembled using a combination of Sanger and “next generation” technologies. The predicted genome size is ~900 Mb, consistent with prior estimates, of which 760 Mb were assembled in 91 scaffolds aligned to the 12 tomato chromosom...
USDA-ARS?s Scientific Manuscript database
Marek’s disease virus (MDV-1) is a cell-associated alphaherpesvirus that induces rapid-onset T-cell lymphomas in poultry. The genomes of 6 strains have been sequenced using both Sanger didoxy sequencing and 454 Life Science pyrosequencing. These genomes largely represent cell culture adapted strains...
Tumiotto, Camille; Riviere, Lionel; Bellecave, Pantxika; Recordon-Pinson, Patricia; Vilain-Parce, Alice; Guidicelli, Gwenda-Line; Fleury, Hervé
2017-01-01
One of the strategies for curing viral HIV-1 is a therapeutic vaccine involving the stimulation of cytotoxic CD8-positive T cells (CTL) that are Human Leucocyte Antigen (HLA)-restricted. The lack of efficiency of previous vaccination strategies may have been due to the immunogenic peptides used, which could be different from a patient's virus epitopes and lead to a poor CTL response. To counteract this lack of specificity, conserved epitopes must be targeted. One alternative is to gather as many data as possible from a large number of patients on their HIV-1 proviral archived epitope variants, taking into account their genetic background to select the best presented CTL epitopes. In order to process big data generated by Next-Generation Sequencing (NGS) of the DNA of HIV-infected patients, we have developed a software package called TutuGenetics. This tool combines an alignment derived either from Sanger or NGS files, HLA typing, target gene and a CTL epitope list as input files. It allows automatic translation after correction of the alignment obtained between the HxB2 reference and the reads, followed by automatic calculation of the MHC IC50 value for each epitope variant and the HLA allele of the patient by using NetMHCpan 3.0, resulting in a csv file as output result. We validated this new tool by comparing Sanger and NGS (454, Roche) sequences obtained from the proviral DNA of patients at success of ART included in the Provir Latitude 45 study and showed a 90% correlation between the quantitative results of NGS and Sanger. This automated analysis combined with complementary samples should yield more data regarding the archived CTL epitopes according to the patients' HLA alleles and will be useful for screening epitopes that in theory are presented efficiently to the HLA groove, thus constituting promising immunogenic peptides for a therapeutic vaccine.
Primer-independent RNA sequencing with bacteriophage phi6 RNA polymerase and chain terminators.
Makeyev, E V; Bamford, D H
2001-05-01
Here we propose a new general method for directly determining RNA sequence based on the use of the RNA-dependent RNA polymerase from bacteriophage phi6 and the chain terminators (RdRP sequencing). The following properties of the polymerase render it appropriate for this application: (1) the phi6 polymerase can replicate a number of single-stranded RNA templates in vitro. (2) In contrast to the primer-dependent DNA polymerases utilized in the sequencing procedure by Sanger et al. (Proc Natl Acad Sci USA, 1977, 74:5463-5467), it initiates nascent strand synthesis without a primer, starting the polymerization on the very 3'-terminus of the template. (3) The polymerase can incorporate chain-terminating nucleotide analogs into the nascent RNA chain to produce a set of base-specific termination products. Consequently, 3' proximal or even complete sequence of many target RNA molecules can be rapidly deduced without prior sequence information. The new technique proved useful for sequencing several synthetic ssRNA templates. Furthermore, using genomic segments of the bluetongue virus we show that RdRP sequencing can also be applied to naturally occurring dsRNA templates. This suggests possible uses of the method in the RNA virus research and diagnostics.
Onuț-Brännström, Ioana; Benjamin, Mitchell; Scofield, Douglas G; Heiðmarsson, Starri; Andersson, Martin G I; Lindström, Eva S; Johannesson, Hanna
2018-03-13
In this study, we explored the diversity of green algal symbionts (photobionts) in sympatric populations of the cosmopolitan lichen-forming fungi Thamnolia and Cetraria. We sequenced with both Sanger and Ion Torrent High-Throughput Sequencing technologies the photobiont ITS-region of 30 lichen thalli from two islands: Iceland and Öland. While Sanger recovered just one photobiont genotype from each thallus, the Ion Torrent data recovered 10-18 OTUs for each pool of 5 lichen thalli, suggesting that individual lichens can contain heterogeneous photobiont populations. Both methods showed evidence for photobiont sharing between Thamnolia and Cetraria on Iceland. In contrast, our data suggest that on Öland the two mycobionts associate with distinct photobiont communities, with few shared OTUs revealed by Ion Torrent sequencing. Furthermore, by comparing our sequences with public data, we identified closely related photobionts from geographically distant localities. Taken together, we suggest that the photobiont composition in Thamnolia and Cetraria results from both photobiont-mycobiont codispersal and local acquisition during mycobiont establishment and/or lichen growth. We hypothesize that this is a successful strategy for lichens to be flexible in the use of the most adapted photobiont for the environment.
Mandelker, Diana; Schmidt, Ryan J; Ankala, Arunkanth; McDonald Gibson, Kristin; Bowser, Mark; Sharma, Himanshu; Duffy, Elizabeth; Hegde, Madhuri; Santani, Avni; Lebo, Matthew; Funke, Birgit
2016-12-01
Next-generation sequencing (NGS) is now routinely used to interrogate large sets of genes in a diagnostic setting. Regions of high sequence homology continue to be a major challenge for short-read technologies and can lead to false-positive and false-negative diagnostic errors. At the scale of whole-exome sequencing (WES), laboratories may be limited in their knowledge of genes and regions that pose technical hurdles due to high homology. We have created an exome-wide resource that catalogs highly homologous regions that is tailored toward diagnostic applications. This resource was developed using a mappability-based approach tailored to current Sanger and NGS protocols. Gene-level and exon-level lists delineate regions that are difficult or impossible to analyze via standard NGS. These regions are ranked by degree of affectedness, annotated for medical relevance, and classified by the type of homology (within-gene, different functional gene, known pseudogene, uncharacterized noncoding region). Additionally, we provide a list of exons that cannot be analyzed by short-amplicon Sanger sequencing. This resource can help guide clinical test design, supplemental assay implementation, and results interpretation in the context of high homology.Genet Med 18 12, 1282-1289.
The utility of Next Generation Sequencing for molecular diagnostics in Rett syndrome.
Vidal, Silvia; Brandi, Núria; Pacheco, Paola; Gerotina, Edgar; Blasco, Laura; Trotta, Jean-Rémi; Derdak, Sophia; Del Mar O'Callaghan, Maria; Garcia-Cazorla, Àngels; Pineda, Mercè; Armstrong, Judith
2017-09-25
Rett syndrome (RTT) is an early-onset neurodevelopmental disorder that almost exclusively affects girls and is totally disabling. Three genes have been identified that cause RTT: MECP2, CDKL5 and FOXG1. However, the etiology of some of RTT patients still remains unknown. Recently, next generation sequencing (NGS) has promoted genetic diagnoses because of the quickness and affordability of the method. To evaluate the usefulness of NGS in genetic diagnosis, we present the genetic study of RTT-like patients using different techniques based on this technology. We studied 1577 patients with RTT-like clinical diagnoses and reviewed patients who were previously studied and thought to have RTT genes by Sanger sequencing. Genetically, 477 of 1577 patients with a RTT-like suspicion have been diagnosed. Positive results were found in 30% by Sanger sequencing, 23% with a custom panel, 24% with a commercial panel and 32% with whole exome sequencing. A genetic study using NGS allows the study of a larger number of genes associated with RTT-like symptoms simultaneously, providing genetic study of a wider group of patients as well as significantly reducing the response time and cost of the study.
Zhang, Xinxin; Ma, Dehua; Zou, Wei; Ding, Yibing; Zhu, Chengchu; Min, Haiyan; Zhang, Bin; Wang, Wei; Chen, Baofu; Ye, Minhua; Cai, Minghui; Pan, Yanqing; Cao, Lei; Wan, Yueming; Jin, Yu; Gao, Qian; Yi, Long
2016-05-27
Primary spontaneous pneumothorax (PSP) or pulmonary cysts is one of the manifestations of Birt-Hogg-Dube syndrome (BHDS) that is caused by heterozygous mutations in FLCN gene. Most of the mutations are SNVs and small indels, and there are also approximately 10 % large intragenic deletions and duplications of the mutations. These molecular findings are generally obtained by disparate methods including Sanger sequencing and Multiple Ligation-dependent Probe Amplification in the clinical laboratory. In addition, as a genetically heterogeneous disorder, PSP may be caused by mutations in multiple genes include FBN1, COL3A1, CBS, SERPINA1 and TSC1/TSC2 genes. For differential diagnosis, these genes should also be screened which makes the diagnostic procedure more time-consuming and labor-intensive. Forty PSP patients were divided into 2 groups. Nineteen patients with different pathogenic mutations of FLCN previously identified by conventional Sanger sequencing and MLPA were included in test group, 21 random PSP patients without any genetic screening were included in blinded sample group. 7 PSP genes including FLCN, FBN1, COL3A1, CBS, SERPINA1 and TSC1/TSC2 were designed and enriched by Haloplex system, sequenced on a Miseq platform and analyzed in the 40 patients to evaluate the performance of the targeted-NGS method. We demonstrated that the full spectrum of genes associated with pneumothorax including FLCN gene mutations can be identified simultaneously in multiplexed sequence data. Noteworthy, by our in-house copy number analysis of the sequence data, we could not only detect intragenic deletions, but also determine approximate deletion junctions simultaneously. NGS based Haloplex target enrichment technology is proved to be a rapid and cost-effective screening strategy for the comprehensive molecular diagnosis of BHDS in PSP patients, as it can replace Sanger sequencing and MLPA by simultaneously detecting exonic and intronic SNVs, small indels, large intragenic deletions and determining deletion junctions in PSP-related genes.
Finishing Using Next Generation Technologies
Van Tonder, Andries
2018-01-16
Andries van Tonder of Wellcome Trust Sanger Institute discusses a pipeline for finishing genomes to the gold standard on June 3, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.
Sukalo, Maja; Schäflein, Eva; Schanze, Ina; Everman, David B; Rezaei, Nima; Argente, Jesús; Lorda-Sanchez, Isabel; Deshpande, Charu; Takahashi, Tsutomu; Kleger, Alexander; Zenker, Martin
2017-11-01
Johanson-Blizzard syndrome (JBS, MIM #243800) is a very rare autosomal recessive disorder characterized by exocrine pancreatic insufficiency, nasal wing hypoplasia, hypodontia, and other abnormalities. JBS is caused by mutations of the UBR1 gene (MIM *605981), encoding a ubiquitin ligase of the N-end rule pathway. Molecular findings in a total of 65 unrelated patients with a clinical diagnosis of JBS who were previously screened for UBR1 mutations by Sanger sequencing were reviewed and cases lacking a disease-causing UBR1 mutation on either one or both alleles were included in this study. In order to discover mutations that are not detectable by Sanger sequencing, we designed a probe set for multiplex ligation-dependent probe amplification (MLPA) analysis of the UBR1 gene and analyzed the copy number status of all 47 UBR1 exons. Our previous studies using Sanger sequencing could detect mutations in 93.1% of 130 disease-associated UBR1 alleles. Six patients with a highly suggestive clinical diagnosis of JBS and unsolved genotype were included in this study. MLPA analysis detected six alleles harboring exon deletions/duplications, thereby raising the mutation detection rate in the entire cohort to 97.7% (127/130 alleles). We conclude that single or multi-exon deletions or duplications account for a substantial proportion of JBS-associated UBR1 mutations. © 2017 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.
Zhong, Shan; Zhang, Hai-ping; Zheng, Jie; Bai, Dong-yu; Fu, Li; Chen, Pei-qiong
2013-04-01
To investigate the frequency of EML4-ALK fusion gene in non-small-cell lung cancer (NSCLC) patients, and its correlation with clinicopathologic features. Real-time PCR was used to detect the presence of EML4-ALK fusion gene in 268 cases of NSCLCs using paraffin-embedded tissue samples(among which 164 samples were re-validated by Sanger sequencing). Related clinicopathological correlation was analyzed. EML4-ALK fusion gene was found in 4.1% (11/268) of the cases. One hundred and sixty four samples were verified by Sanger sequencing, and the overall coincidence of the results of two methods (Sanger sequencing and Real-time PCR) was 100%. Female patients (5.9%, 5/85), ≤ 60 years of age (4.3%, 6/140), non-smokers (6.8%, 8/118) and adenocarcinomas (7.6%, 10/132) had a higher mutation rate than that in male patients (3.3%, 6/183), > 60 years of age (4.0%, 5/124), smokers (1.6%, 2/132) and squamous cell carcinomas (1.3%, 1/79), although no statistical significance in age (P = 0.918), gender (P = 0.503), smoking history (P = 0.092) and histological type (P = 0.094). Chinese NSCLC patients have a 4.1% detection rate of EML4-ALK fusion gene in the tumor tissues. Female, non-smoker and adenocarcinoma histological subtype tend to be associated with a higher rate of EML4-ALK gene fusion.
2012-01-01
Background Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Results Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80–120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Conclusions Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project. PMID:23110314
Microsatellite DNA capture from enriched libraries.
Gonzalez, Elena G; Zardoya, Rafael
2013-01-01
Microsatellites are DNA sequences of tandem repeats of one to six nucleotides, which are highly polymorphic, and thus the molecular markers of choice in many kinship, population genetic, and conservation studies. There have been significant technical improvements since the early methods for microsatellite isolation were developed, and today the most common procedures take advantage of the hybrid capture methods of enriched-targeted microsatellite DNA. Furthermore, recent advents in sequencing technologies (i.e., next-generation sequencing, NGS) have fostered the mining of microsatellite markers in non-model organisms, affording a cost-effective way of obtaining a large amount of sequence data potentially useful for loci characterization. The rapid improvements of NGS platforms together with the increase in available microsatellite information open new avenues to the understanding of the evolutionary forces that shape genetic structuring in wild populations. Here, we provide detailed methodological procedures for microsatellite isolation based on the screening of GT microsatellite-enriched libraries, either by cloning and Sanger sequencing of positive clones or by direct NGS. Guides for designing new species-specific primers and basic genotyping are also given.
The Mouse Genomes Project: a repository of inbred laboratory mouse strain genomes.
Adams, David J; Doran, Anthony G; Lilue, Jingtao; Keane, Thomas M
2015-10-01
The Mouse Genomes Project was initiated in 2009 with the goal of using next-generation sequencing technologies to catalogue molecular variation in the common laboratory mouse strains, and a selected set of wild-derived inbred strains. The initial sequencing and survey of sequence variation in 17 inbred strains was completed in 2011 and included comprehensive catalogue of single nucleotide polymorphisms, short insertion/deletions, larger structural variants including their fine scale architecture and landscape of transposable element variation, and genomic sites subject to post-transcriptional alteration of RNA. From this beginning, the resource has expanded significantly to include 36 fully sequenced inbred laboratory mouse strains, a refined and updated data processing pipeline, and new variation querying and data visualisation tools which are available on the project's website ( http://www.sanger.ac.uk/resources/mouse/genomes/ ). The focus of the project is now the completion of de novo assembled chromosome sequences and strain-specific gene structures for the core strains. We discuss how the assembled chromosomes will power comparative analysis, data access tools and future directions of mouse genetics.
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].
Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y
2017-08-01
To analyze and detect the whole genome sequence of human mitochondrial DNA (mtDNA) by Ion Torrent PGM™ platform and to study the differences of mtDNA sequence in different tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costicartilage, nail, skeletal muscle and oral epithelium. Amplification of whole genome sequence of mtDNA was performed by 4 pairs of primer. Libraries were constructed with Ion Shear™ Plus Reagents kit and Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed using Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions on HVⅠ region. The whole genome sequence of mtDNA from all samples were amplified successfully. Six unrelated individuals belonged to 6 different haplotypes. Different tissues in one individual had heteroplasmy difference. The heteroplasmy positions and the mutation positions on HVⅠ region were verified by Sanger sequencing. After a consistency check by the Kappa method, it was found that the results of mtDNA sequence had a high consistency in different tissues. The testing method used in present study for sequencing the whole genome sequence of human mtDNA can detect the heteroplasmy difference in different tissues, which have good consistency. The results provide guidance for the further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine
Lim, Hassol; Park, Young-Mi; Lee, Jong-Keuk; Taek Lim, Hyun
2016-10-01
To present an efficient and successful application of a single-exome sequencing study in a family clinically diagnosed with X-linked retinitis pigmentosa. Exome sequencing study based on clinical examination data. An 8-year-old proband and his family. The proband and his family members underwent comprehensive ophthalmologic examinations. Exome sequencing was undertaken in the proband using Agilent SureSelect Human All Exon Kit and Illumina HiSeq 2000 platform. Bioinformatic analysis used Illumina pipeline with Burrows-Wheeler Aligner-Genome Analysis Toolkit (BWA-GATK), followed by ANNOVAR to perform variant functional annotation. All variants passing filter criteria were validated by Sanger sequencing to confirm familial segregation. Analysis of exome sequence data identified a novel frameshift mutation in RP2 gene resulting in a premature stop codon (c.665delC, p.Pro222fsTer237). Sanger sequencing revealed this mutation co-segregated with the disease phenotype in the child's family. We identified a novel causative mutation in RP2 from a single proband's exome sequence data analysis. This study highlights the effectiveness of the whole-exome sequencing in the genetic diagnosis of X-linked retinitis pigmentosa, over the conventional sequencing methods. Even using a single exome, exome sequencing technology would be able to pinpoint pathogenic variant(s) for X-linked retinitis pigmentosa, when properly applied with aid of adequate variant filtering strategy. Copyright © 2016 Canadian Ophthalmological Society. Published by Elsevier Inc. All rights reserved.
Steele-Stallard, Heather B; Le Quesne Stabej, Polona; Lenassi, Eva; Luxon, Linda M; Claustres, Mireille; Roux, Anne-Francoise; Webster, Andrew R; Bitner-Glindzicz, Maria
2013-08-08
Usher Syndrome is the leading cause of inherited deaf-blindness. It is divided into three subtypes, of which the most common is Usher type 2, and the USH2A gene accounts for 75-80% of cases. Despite recent sequencing strategies, in our cohort a significant proportion of individuals with Usher type 2 have just one heterozygous disease-causing mutation in USH2A, or no convincing disease-causing mutations across nine Usher genes. The purpose of this study was to improve the molecular diagnosis in these families by screening USH2A for duplications, heterozygous deletions and a common pathogenic deep intronic variant USH2A: c.7595-2144A>G. Forty-nine Usher type 2 or atypical Usher families who had missing mutations (mono-allelic USH2A or no mutations following Sanger sequencing of nine Usher genes) were screened for duplications/deletions using the USH2A SALSA MLPA reagent kit (MRC-Holland). Identification of USH2A: c.7595-2144A>G was achieved by Sanger sequencing. Mutations were confirmed by a combination of reverse transcription PCR using RNA extracted from nasal epithelial cells or fibroblasts, and by array comparative genomic hybridisation with sequencing across the genomic breakpoints. Eight mutations were identified in 23 Usher type 2 families (35%) with one previously identified heterozygous disease-causing mutation in USH2A. These consisted of five heterozygous deletions, one duplication, and two heterozygous instances of the pathogenic variant USH2A: c.7595-2144A>G. No variants were found in the 15 Usher type 2 families with no previously identified disease-causing mutations. In 11 atypical families, none of whom had any previously identified convincing disease-causing mutations, the mutation USH2A: c.7595-2144A>G was identified in a heterozygous state in one family. All five deletions and the heterozygous duplication we report here are novel. This is the first time that a duplication in USH2A has been reported as a cause of Usher syndrome. We found that 8 of 23 (35%) of 'missing' mutations in Usher type 2 probands with only a single heterozygous USH2A mutation detected with Sanger sequencing could be attributed to deletions, duplications or a pathogenic deep intronic variant. Future mutation detection strategies and genetic counselling will need to take into account the prevalence of these types of mutations in order to provide a more comprehensive diagnostic service.
Chen, Zhao; Moran, Kimberly; Richards-Yutz, Jennifer; Toorens, Erik; Gerhart, Daniel; Ganguly, Tapan; Shields, Carol L; Ganguly, Arupa
2014-03-01
Sporadic retinoblastoma (RB) is caused by de novo mutations in the RB1 gene. Often, these mutations are present as mosaic mutations that cannot be detected by Sanger sequencing. Next-generation deep sequencing allows unambiguous detection of the mosaic mutations in lymphocyte DNA. Deep sequencing of the RB1 gene on lymphocyte DNA from 20 bilateral and 70 unilateral RB cases was performed, where Sanger sequencing excluded the presence of mutations. The individual exons of the RB1 gene from each sample were amplified, pooled, ligated to barcoded adapters, and sequenced using semiconductor sequencing on an Ion Torrent Personal Genome Machine. Six low-level mosaic mutations were identified in bilateral RB and four in unilateral RB cases. The incidence of low-level mosaic mutation was estimated to be 30% and 6%, respectively, in sporadic bilateral and unilateral RB cases, previously classified as mutation negative. The frequency of point mutations detectable in lymphocyte DNA increased from 96% to 97% for bilateral RB and from 13% to 18% for unilateral RB. The use of deep sequencing technology increased the sensitivity of the detection of low-level germline mosaic mutations in the RB1 gene. This finding has significant implications for improved clinical diagnosis, genetic counseling, surveillance, and management of RB. © 2013 WILEY PERIODICALS, INC.
A multiplex primer design algorithm for target amplification of continuous genomic regions.
Ozturk, Ahmet Rasit; Can, Tolga
2017-06-19
Targeted Next Generation Sequencing (NGS) assays are cost-efficient and reliable alternatives to Sanger sequencing. For sequencing of very large set of genes, the target enrichment approach is suitable. However, for smaller genomic regions, the target amplification method is more efficient than both the target enrichment method and Sanger sequencing. The major difficulty of the target amplification method is the preparation of amplicons, regarding required time, equipment, and labor. Multiplex PCR (MPCR) is a good solution for the mentioned problems. We propose a novel method to design MPCR primers for a continuous genomic region, following the best practices of clinically reliable PCR design processes. On an experimental setup with 48 different combinations of factors, we have shown that multiple parameters might effect finding the first feasible solution. Increasing the length of the initial primer candidate selection sequence gives better results whereas waiting for a longer time to find the first feasible solution does not have a significant impact. We generated MPCR primer designs for the HBB whole gene, MEFV coding regions, and human exons between 2000 bp to 2100 bp-long. Our benchmarking experiments show that the proposed MPCR approach is able produce reliable NGS assay primers for a given sequence in a reasonable amount of time.
Low Diversity in the Mitogenome of Sperm Whales Revealed by Next-Generation Sequencing
Alexander, Alana; Steel, Debbie; Slikas, Beth; Hoekzema, Kendra; Carraher, Colm; Parks, Matthew; Cronn, Richard; Baker, C. Scott
2013-01-01
Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20 mitogenomes from 17 sperm whales representative of worldwide diversity using Next Generation Sequencing (NGS) technologies (Illumina GAIIx, Roche 454 GS Junior). Resequencing of three individuals with both NGS platforms and partial Sanger sequencing showed low discrepancy rates (454-Illumina: 0.0071%; Sanger-Illumina: 0.0034%; and Sanger-454: 0.0023%) confirming suitability of both NGS platforms for investigating low mitogenomic diversity. Using the 17 sperm whale mitogenomes in a phylogenetic reconstruction with 41 other species, including 11 new dolphin mitogenomes, we tested two hypotheses for the low CR diversity. First, the hypothesis that CR-specific constraints have reduced diversity solely in the CR was rejected as diversity was low throughout the mitogenome, not just in the CR (overall diversity π = 0.096%; protein-coding 3rd codon = 0.22%; CR = 0.35%), and CR phylogenetic signal was congruent with protein-coding regions. Second, the hypothesis that slow substitution rates reduced diversity throughout the sperm whale mitogenome was rejected as sperm whales had significantly higher rates of CR evolution and no evidence of slow coding region evolution relative to other cetaceans. The estimated time to most recent common ancestor for sperm whale mitogenomes was 72,800 to 137,400 years ago (95% highest probability density interval), consistent with previous hypotheses of a bottleneck or selective sweep as likely causes of low mitogenome diversity. PMID:23254394
Low diversity in the mitogenome of sperm whales revealed by next-generation sequencing.
Alexander, Alana; Steel, Debbie; Slikas, Beth; Hoekzema, Kendra; Carraher, Colm; Parks, Matthew; Cronn, Richard; Baker, C Scott
2013-01-01
Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20 mitogenomes from 17 sperm whales representative of worldwide diversity using Next Generation Sequencing (NGS) technologies (Illumina GAIIx, Roche 454 GS Junior). Resequencing of three individuals with both NGS platforms and partial Sanger sequencing showed low discrepancy rates (454-Illumina: 0.0071%; Sanger-Illumina: 0.0034%; and Sanger-454: 0.0023%) confirming suitability of both NGS platforms for investigating low mitogenomic diversity. Using the 17 sperm whale mitogenomes in a phylogenetic reconstruction with 41 other species, including 11 new dolphin mitogenomes, we tested two hypotheses for the low CR diversity. First, the hypothesis that CR-specific constraints have reduced diversity solely in the CR was rejected as diversity was low throughout the mitogenome, not just in the CR (overall diversity π = 0.096%; protein-coding 3rd codon = 0.22%; CR = 0.35%), and CR phylogenetic signal was congruent with protein-coding regions. Second, the hypothesis that slow substitution rates reduced diversity throughout the sperm whale mitogenome was rejected as sperm whales had significantly higher rates of CR evolution and no evidence of slow coding region evolution relative to other cetaceans. The estimated time to most recent common ancestor for sperm whale mitogenomes was 72,800 to 137,400 years ago (95% highest probability density interval), consistent with previous hypotheses of a bottleneck or selective sweep as likely causes of low mitogenome diversity.
SNP discovery in the bovine milk transcriptome using RNA-Seq technology.
Cánovas, Angela; Rincon, Gonzalo; Islas-Trejo, Alma; Wickramasinghe, Saumya; Medrano, Juan F
2010-12-01
High-throughput sequencing of RNA (RNA-Seq) was developed primarily to analyze global gene expression in different tissues. However, it also is an efficient way to discover coding SNPs. The objective of this study was to perform a SNP discovery analysis in the milk transcriptome using RNA-Seq. Seven milk samples from Holstein cows were analyzed by sequencing cDNAs using the Illumina Genome Analyzer system. We detected 19,175 genes expressed in milk samples corresponding to approximately 70% of the total number of genes analyzed. The SNP detection analysis revealed 100,734 SNPs in Holstein samples, and a large number of those corresponded to differences between the Holstein breed and the Hereford bovine genome assembly Btau4.0. The number of polymorphic SNPs within Holstein cows was 33,045. The accuracy of RNA-Seq SNP discovery was tested by comparing SNPs detected in a set of 42 candidate genes expressed in milk that had been resequenced earlier using Sanger sequencing technology. Seventy of 86 SNPs were detected using both RNA-Seq and Sanger sequencing technologies. The KASPar Genotyping System was used to validate unique SNPs found by RNA-Seq but not observed by Sanger technology. Our results confirm that analyzing the transcriptome using RNA-Seq technology is an efficient and cost-effective method to identify SNPs in transcribed regions. This study creates guidelines to maximize the accuracy of SNP discovery and prevention of false-positive SNP detection, and provides more than 33,000 SNPs located in coding regions of genes expressed during lactation that can be used to develop genotyping platforms to perform marker-trait association studies in Holstein cattle.
Park, Ji Hye
2018-01-01
Estimation of postmortem interval (PMI) is paramount in modern forensic investigation. After the disappearance of the early postmortem phenomena conventionally used to estimate PMI, entomologic evidence provides important indicators for PMI estimation. The age of the oldest fly larvae or pupae can be estimated to pinpoint the time of oviposition, which is considered the minimum PMI (PMImin). The development rate of insects is usually temperature dependent and species specific. Therefore, species identification is mandatory for PMImin estimation using entomological evidence. The classical morphological identification method cannot be applied when specimens are damaged or have not yet matured. To overcome this limitation, some investigators employ molecular identification using mitochondrial cytochrome c oxidase subunit I (COI) nucleotide sequences. The molecular identification method commonly uses Sanger's nucleotide sequencing and molecular phylogeny, which are complex and time consuming and constitute another obstacle for forensic investigators. In this study, instead of using conventional Sanger's nucleotide sequencing, single-nucleotide polymorphisms (SNPs) in the COI gene region, which are unique between fly species, were selected and targeted for single-base extension (SBE) technology. These SNPs were genotyped using a SNaPshot® kit. Eleven Calliphoridae and seven Sarcophagidae species were covered. To validate this genotyping, fly DNA samples (103 adults, 84 larvae, and 4 pupae) previously confirmed by DNA barcoding were used. This method worked quickly with minimal DNA, providing a potential alternative to conventional DNA barcoding. Consisting of only a few simple electropherogram peaks, the results were more straightforward compared with those of the conventional DNA barcoding produced by Sanger's nucleotide sequencing. PMID:29682531
USDA-ARS?s Scientific Manuscript database
In a collaboration with Purdue University researchers, we sequenced a 143,606 base pair Rhipicephalus microplus BAC library clone that contained the coding region for acetylcholinesterase 1 (AChE1). Sequencing was by Sanger protocols and the final assembly resulted in 15 contigs of varying length, e...
Fong, Wai-Ying; Ho, Chi-Chun; Poon, Wing-Tat
2017-05-12
Thiopurine intolerance and treatment-related toxicity, such as fatal myelosuppression, is related to non-function genetic variants encoding thiopurine S-methyltransferase (TPMT) and Nudix hydrolase 15 (NUDT15). Genetic testing of the common variants NUDT15:NM_018283.2:c.415C>T (Arg139Cys, dbSNP rs116855232 T allele) and TPMT: NM_000367.4:c.719A>G (TPMT*3C, dbSNP rs1142345 G allele) in East Asians including Chinese can potentially prevent treatment-related complications. Two complementary genotyping approaches, real-time PCR-high resolution melt (PCR-HRM) and PCR-restriction fragment length morphism (PCR-RFLP) analysis were evaluated using conventional PCR and Sanger sequencing genotyping as the gold standard. Sixty patient samples were tested, revealing seven patients (11.7%) heterozygous for NUDT15 c.415C>T, one patient homozygous for the variant and one patient heterozygous for the TPMT*3C non-function allele. No patient was found to harbor both variants. In total, nine out of 60 (15%) patients tested had genotypic evidence of thiopurine intolerance, which may require dosage adjustment or alternative medication should they be started on azathioprine, mercaptopurine or thioguanine. The two newly developed assays were more efficient and showed complete concordance (60/60, 100%) compared to the Sanger sequencing results. Accurate and cost-effective genotyping assays by real-time PCR-HRM and PCR-RFLP for NUDT15 c.415C>T and TPMT*3C were successfully developed. Further studies may establish their roles in genotype-informed clinical decision-making in the prevention of morbidity and mortality due to thiopurine intolerance.
Gentile, Caren; Van Deerlin, Vivianna M; Goldberg, David S; Reese, Peter P; Hasz, Richard D; Abt, Peter; Blumberg, Emily; Farooqi, Midhat S
2018-02-01
Given the availability of new highly efficacious anti-HCV therapies, some clinicians have advocated for wider use of kidneys from hepatitis C virus-positive (HCV+) donors, including transplanting them into HCV-negative recipients. As treatment regimens for HCV are commonly guided by genotype, pretransplant HCV genotyping of tissue donors would be beneficial. To our knowledge, donor HCV genotyping has never been reported. We retrieved archived frozen plasma samples for 17 previous organ donors through a local organ procurement organization. We performed HCV genotyping using the eSensor HCVg Direct Test (GenMark Diagnostics) and also by Sanger sequencing, for confirmation (Retrogen). In addition, viral loads were measured using the COBAS AmpliPrep/TaqMan system (Roche Diagnostics). We found that most of the samples (n = 14) were HCV Genotype 1a with the remainder being Genotype 2b (n = 1) or Genotype 3 (n = 2). All genotyping results were concordant with Sanger sequencing. The average HCV viral load in the sample group was ~ 1.6 million IU/mL (range: ~16 000 IU/mL to 7 million IU/mL). We demonstrate that viral RNA from organ donor plasma can be successfully genotyped for HCV. This ability suggests that transplantation of HCV+ kidneys into HCV-negative recipients, followed by genotype-guided antiviral therapy, could be feasible. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Analysis of Chromatin Organisation
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2011-01-01
Terms to be familiar with before you start to solve the test: chromatin, nucleases, sucrose density gradient centrifugation, melting point, gel electrophoresis, ethidium bromide, autoradiography, Southern blotting, Northern blotting, Sanger sequencing, restriction endonucleases, exonucleases, linker DNA, chloroform extraction, nucleosomes,…
Sequencing Centers Panel at SFAF
Schilkey, Faye; Ali, Johar; Grafham, Darren; Muzny, Donna; Fulton, Bob; Fitzgerald, Mike; Hostetler, Jessica; Daum, Chris
2018-02-13
From left to right: Faye Schilkey of NCGR, Johar Ali of OICR, Darren Grafham of Wellcome Trust Sanger Institute, Donna Muzny of the Baylor College of Medicine, Bob Fulton of Washington University, Mike Fitzgerald of the Broad Institute, Jessica Hostetler of the J. Craig Venter Institute and Chris Daum of the DOE Joint Genome Institute discuss sequencing technologies, applications and pipelines on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.
Sequencing Centers Panel at SFAF
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schilkey, Faye; Ali, Johar; Grafham, Darren
From left to right: Faye Schilkey of NCGR, Johar Ali of OICR, Darren Grafham of Wellcome Trust Sanger Institute, Donna Muzny of the Baylor College of Medicine, Bob Fulton of Washington University, Mike Fitzgerald of the Broad Institute, Jessica Hostetler of the J. Craig Venter Institute and Chris Daum of the DOE Joint Genome Institute discuss sequencing technologies, applications and pipelines on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.
Cecconi, Massimiliano; Parodi, Maria I.; Formisano, Francesco; Spirito, Paolo; Autore, Camillo; Musumeci, Maria B.; Favale, Stefano; Forleo, Cinzia; Rapezzi, Claudio; Biagini, Elena; Davì, Sabrina; Canepa, Elisabetta; Pennese, Loredana; Castagnetta, Mauro; Degiorgio, Dario; Coviello, Domenico A.
2016-01-01
Hypertrophic cardiomyopathy (HCM) is mainly associated with myosin, heavy chain 7 (MYH7) and myosin binding protein C, cardiac (MYBPC3) mutations. In order to better explain the clinical and genetic heterogeneity in HCM patients, in this study, we implemented a target-next generation sequencing (NGS) assay. An Ion AmpliSeq™ Custom Panel for the enrichment of 19 genes, of which 9 of these did not encode thick/intermediate and thin myofilament (TTm) proteins and, among them, 3 responsible of HCM phenocopy, was created. Ninety-two DNA samples were analyzed by the Ion Personal Genome Machine: 73 DNA samples (training set), previously genotyped in some of the genes by Sanger sequencing, were used to optimize the NGS strategy, whereas 19 DNA samples (discovery set) allowed the evaluation of NGS performance. In the training set, we identified 72 out of 73 expected mutations and 15 additional mutations: the molecular diagnosis was achieved in one patient with a previously wild-type status and the pre-excitation syndrome was explained in another. In the discovery set, we identified 20 mutations, 5 of which were in genes encoding non-TTm proteins, increasing the diagnostic yield by approximately 20%: a single mutation in genes encoding non-TTm proteins was identified in 2 out of 3 borderline HCM patients, whereas co-occuring mutations in genes encoding TTm and galactosidase alpha (GLA) altered proteins were characterized in a male with HCM and multiorgan dysfunction. Our combined targeted NGS-Sanger sequencing-based strategy allowed the molecular diagnosis of HCM with greater efficiency than using the conventional (Sanger) sequencing alone. Mutant alleles encoding non-TTm proteins may aid in the complete understanding of the genetic and phenotypic heterogeneity of HCM: co-occuring mutations of genes encoding TTm and non-TTm proteins could explain the wide variability of the HCM phenotype, whereas mutations in genes encoding only the non-TTm proteins are identifiable in patients with a milder HCM status. PMID:27600940
Next-Generation Sequencing of Aquatic Oligochaetes: Comparison of Experimental Communities
Vivien, Régis; Lejzerowicz, Franck; Pawlowski, Jan
2016-01-01
Aquatic oligochaetes are a common group of freshwater benthic invertebrates known to be very sensitive to environmental changes and currently used as bioindicators in some countries. However, more extensive application of oligochaetes for assessing the ecological quality of sediments in watercourses and lakes would require overcoming the difficulties related to morphology-based identification of oligochaetes species. This study tested the Next-Generation Sequencing (NGS) of a standard cytochrome c oxydase I (COI) barcode as a tool for the rapid assessment of oligochaete diversity in environmental samples, based on mixed specimen samples. To know the composition of each sample we Sanger sequenced every specimen present in these samples. Our study showed that a large majority of OTUs (Operational Taxonomic Unit) could be detected by NGS analyses. We also observed congruence between the NGS and specimen abundance data for several but not all OTUs. Because the differences in sequence abundance data were consistent across samples, we exploited these variations to empirically design correction factors. We showed that such factors increased the congruence between the values of oligochaetes-based indices inferred from the NGS and the Sanger-sequenced specimen data. The validation of these correction factors by further experimental studies will be needed for the adaptation and use of NGS technology in biomonitoring studies based on oligochaete communities. PMID:26866802
Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima
2014-01-01
Background Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Methods Whole exome sequencing followed by expanded familial validation by Sanger sequencing. Results We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Conclusion Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes. PMID:25211151
Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima
2014-01-01
Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Whole exome sequencing followed by expanded familial validation by Sanger sequencing. We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes.
Yu, Hai-Jing; Deng, Hua; Ma, Jian; Huang, Shu-Jun; Yang, Jian-Min; Huang, Yan-Fen; Mu, Xiao-Ping; Zhang, Liang; Wang, Qi
2016-12-01
Granulomatous mastitis (GM) is a chronic inflammatory breast lesion. Its etiology remains incompletely defined. Although mounting evidence suggests the involvement of Corynebacterium in GM, there has been no systematic study of GM bacteriology using -omics technology. The bacterial diversity and relative abundances in breast abscesses from 19 women with GM were investigated using 16S rDNA metagenomic sequencing and Sanger sequencing. A quantitative PCR (qPCR) assay was also developed to identify Corynebacterium kroppenstedtii. A bioinformatic analysis revealed that Corynebacterium was present in the 19 GM patients, with abundances ranging from 1.1% to 58.9%. Of note, Corynebacterium was the most abundant taxon in seven patients (more than a third of the subjects). The predominance of Corynebacterium kroppenstedtii infection (11 of 19 patients, 57.9%) was confirmed with Sanger sequencing and the qPCR assay. This study profiled the microbiota of patients with GM and indicated an important role for Corynebacterium, and in particular C. kroppenstedtii, in the pathogenesis of this disease. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
A rapid screening with direct sequencing from blood samples for the diagnosis of Leigh syndrome.
Shimbo, Hiroko; Takagi, Mariko; Okuda, Mitsuko; Tsuyusaki, Yu; Takano, Kyoko; Iai, Mizue; Yamashita, Sumimasa; Murayama, Kei; Ohtake, Akira; Goto, Yu-Ichi; Aida, Noriko; Osaka, Hitoshi
2014-01-01
Large numbers of genes are responsible for Leigh syndrome (LS), making genetic confirmation of LS difficult. We screened our patients with LS using a limited set of 21 primers encompassing the frequently reported gene for the respiratory chain complexes I (ND1-ND6, and ND4L), IV(SURF1), and V(ATP6) and the pyruvate dehydrogenase E1α-subunit. Of 18 LS patients, we identified mutations in 11 patients, including 7 in mDNA (two with ATP6), 4 in nuclear (three with SURF1). Overall, we identified mutations in 61% of LS patients (11/18 individuals) in this cohort. Sanger sequencing with our limited set of primers allowed us a rapid genetic confirmation of more than half of the LS patients and it appears to be efficient as a primary genetic screening in this cohort.
Hong, Nan; Chen, Yan-hua; Xie, Chen; Xu, Bai-sheng; Huang, Hui; Li, Xin; Yang, Yue-qing; Huang, Ying-ping; Deng, Jian-lian; Qi, Ming; Gu, Yang-shun
2014-08-01
Nance-Horan syndrome (NHS) is a rare X-linked disorder characterized by congenital nuclear cataracts, dental anomalies, and craniofacial dysmorphisms. Mental retardation was present in about 30% of the reported cases. The purpose of this study was to investigate the genetic and clinical features of NHS in a Chinese family. Whole exome sequencing analysis was performed on DNA from an affected male to scan for candidate mutations on the X-chromosome. Sanger sequencing was used to verify these candidate mutations in the whole family. Clinical and ophthalmological examinations were performed on all members of the family. A combination of exome sequencing and Sanger sequencing revealed a nonsense mutation c.322G>T (E108X) in exon 1 of NHS gene, co-segregating with the disease in the family. The nonsense mutation led to the conversion of glutamic acid to a stop codon (E108X), resulting in truncation of the NHS protein. Multiple sequence alignments showed that codon 108, where the mutation (c.322G>T) occurred, was located within a phylogenetically conserved region. The clinical features in all affected males and female carriers are described in detail. We report a nonsense mutation c.322G>T (E108X) in a Chinese family with NHS. Our findings broaden the spectrum of NHS mutations and provide molecular insight into future NHS clinical genetic diagnosis.
Gullapalli, Rama R; Desai, Ketaki V; Santana-Santos, Lucas; Kant, Jeffrey A; Becich, Michael J
2012-01-01
The Human Genome Project (HGP) provided the initial draft of mankind's DNA sequence in 2001. The HGP was produced by 23 collaborating laboratories using Sanger sequencing of mapped regions as well as shotgun sequencing techniques in a process that occupied 13 years at a cost of ~$3 billion. Today, Next Generation Sequencing (NGS) techniques represent the next phase in the evolution of DNA sequencing technology at dramatically reduced cost compared to traditional Sanger sequencing. A single laboratory today can sequence the entire human genome in a few days for a few thousand dollars in reagents and staff time. Routine whole exome or even whole genome sequencing of clinical patients is well within the realm of affordability for many academic institutions across the country. This paper reviews current sequencing technology methods and upcoming advancements in sequencing technology as well as challenges associated with data generation, data manipulation and data storage. Implementation of routine NGS data in cancer genomics is discussed along with potential pitfalls in the interpretation of the NGS data. The overarching importance of bioinformatics in the clinical implementation of NGS is emphasized.[7] We also review the issue of physician education which also is an important consideration for the successful implementation of NGS in the clinical workplace. NGS technologies represent a golden opportunity for the next generation of pathologists to be at the leading edge of the personalized medicine approaches coming our way. Often under-emphasized issues of data access and control as well as potential ethical implications of whole genome NGS sequencing are also discussed. Despite some challenges, it's hard not to be optimistic about the future of personalized genome sequencing and its potential impact on patient care and the advancement of knowledge of human biology and disease in the near future.
Gullapalli, Rama R.; Desai, Ketaki V.; Santana-Santos, Lucas; Kant, Jeffrey A.; Becich, Michael J.
2012-01-01
The Human Genome Project (HGP) provided the initial draft of mankind's DNA sequence in 2001. The HGP was produced by 23 collaborating laboratories using Sanger sequencing of mapped regions as well as shotgun sequencing techniques in a process that occupied 13 years at a cost of ~$3 billion. Today, Next Generation Sequencing (NGS) techniques represent the next phase in the evolution of DNA sequencing technology at dramatically reduced cost compared to traditional Sanger sequencing. A single laboratory today can sequence the entire human genome in a few days for a few thousand dollars in reagents and staff time. Routine whole exome or even whole genome sequencing of clinical patients is well within the realm of affordability for many academic institutions across the country. This paper reviews current sequencing technology methods and upcoming advancements in sequencing technology as well as challenges associated with data generation, data manipulation and data storage. Implementation of routine NGS data in cancer genomics is discussed along with potential pitfalls in the interpretation of the NGS data. The overarching importance of bioinformatics in the clinical implementation of NGS is emphasized.[7] We also review the issue of physician education which also is an important consideration for the successful implementation of NGS in the clinical workplace. NGS technologies represent a golden opportunity for the next generation of pathologists to be at the leading edge of the personalized medicine approaches coming our way. Often under-emphasized issues of data access and control as well as potential ethical implications of whole genome NGS sequencing are also discussed. Despite some challenges, it's hard not to be optimistic about the future of personalized genome sequencing and its potential impact on patient care and the advancement of knowledge of human biology and disease in the near future. PMID:23248761
A vertebrate case study of the quality of assemblies derived from next-generation sequences
2011-01-01
The unparalleled efficiency of next-generation sequencing (NGS) has prompted widespread adoption, but significant problems remain in the use of NGS data for whole genome assembly. We explore the advantages and disadvantages of chicken genome assemblies generated using a variety of sequencing and assembly methodologies. NGS assemblies are equivalent in some ways to a Sanger-based assembly yet deficient in others. Nonetheless, these assemblies are sufficient for the identification of the majority of genes and can reveal novel sequences when compared to existing assembly references. PMID:21453517
Ribas, Laia; Pardo, Belén G; Fernández, Carlos; Alvarez-Diós, José Antonio; Gómez-Tato, Antonio; Quiroga, María Isabel; Planas, Josep V; Sitjà-Bobadilla, Ariadna; Martínez, Paulino; Piferrer, Francesc
2013-03-15
Genomic resources for plant and animal species that are under exploitation primarily for human consumption are increasingly important, among other things, for understanding physiological processes and for establishing adequate genetic selection programs. Current available techniques for high-throughput sequencing have been implemented in a number of species, including fish, to obtain a proper description of the transcriptome. The objective of this study was to generate a comprehensive transcriptomic database in turbot, a highly priced farmed fish species in Europe, with potential expansion to other areas of the world, for which there are unsolved production bottlenecks, to understand better reproductive- and immune-related functions. This information is essential to implement marker assisted selection programs useful for the turbot industry. Expressed sequence tags were generated by Sanger sequencing of cDNA libraries from different immune-related tissues after several parasitic challenges. The resulting database ("Turbot 2 database") was enlarged with sequences generated from a 454 sequencing run of brain-hypophysis-gonadal axis-derived RNA obtained from turbot at different development stages. The assembly of Sanger and 454 sequences generated 52,427 consensus sequences ("Turbot 3 database"), of which 23,661 were successfully annotated. A total of 1,410 sequences were confirmed to be related to reproduction and key genes involved in sex differentiation and maturation were identified for the first time in turbot (AR, AMH, SRY-related genes, CYP19A, ZPGs, STAR FSHR, etc.). Similarly, 2,241 sequences were related to the immune system and several novel key immune genes were identified (BCL, TRAF, NCK, CD28 and TOLLIP, among others). The number of genes of many relevant reproduction- and immune-related pathways present in the database was 50-90% of the total gene count of each pathway. In addition, 1,237 microsatellites and 7,362 single nucleotide polymorphisms (SNPs) were also compiled. Further, 2,976 putative natural antisense transcripts (NATs) including microRNAs were also identified. The combined sequencing strategies employed here significantly increased the turbot genomic resources available, including 34,400 novel sequences. The generated database contains a larger number of genes relevant for reproduction- and immune-associated studies, with an excellent coverage of most genes present in many relevant physiological pathways. This database also allowed the identification of many microsatellites and SNP markers that will be very useful for population and genome screening and a valuable aid in marker assisted selection programs.
2013-01-01
Background Genomic resources for plant and animal species that are under exploitation primarily for human consumption are increasingly important, among other things, for understanding physiological processes and for establishing adequate genetic selection programs. Current available techniques for high-throughput sequencing have been implemented in a number of species, including fish, to obtain a proper description of the transcriptome. The objective of this study was to generate a comprehensive transcriptomic database in turbot, a highly priced farmed fish species in Europe, with potential expansion to other areas of the world, for which there are unsolved production bottlenecks, to understand better reproductive- and immune-related functions. This information is essential to implement marker assisted selection programs useful for the turbot industry. Results Expressed sequence tags were generated by Sanger sequencing of cDNA libraries from different immune-related tissues after several parasitic challenges. The resulting database (“Turbot 2 database”) was enlarged with sequences generated from a 454 sequencing run of brain-hypophysis-gonadal axis-derived RNA obtained from turbot at different development stages. The assembly of Sanger and 454 sequences generated 52,427 consensus sequences (“Turbot 3 database”), of which 23,661 were successfully annotated. A total of 1,410 sequences were confirmed to be related to reproduction and key genes involved in sex differentiation and maturation were identified for the first time in turbot (AR, AMH, SRY-related genes, CYP19A, ZPGs, STAR FSHR, etc.). Similarly, 2,241 sequences were related to the immune system and several novel key immune genes were identified (BCL, TRAF, NCK, CD28 and TOLLIP, among others). The number of genes of many relevant reproduction- and immune-related pathways present in the database was 50–90% of the total gene count of each pathway. In addition, 1,237 microsatellites and 7,362 single nucleotide polymorphisms (SNPs) were also compiled. Further, 2,976 putative natural antisense transcripts (NATs) including microRNAs were also identified. Conclusions The combined sequencing strategies employed here significantly increased the turbot genomic resources available, including 34,400 novel sequences. The generated database contains a larger number of genes relevant for reproduction- and immune-associated studies, with an excellent coverage of most genes present in many relevant physiological pathways. This database also allowed the identification of many microsatellites and SNP markers that will be very useful for population and genome screening and a valuable aid in marker assisted selection programs. PMID:23497389
Thurber, Mary I.; Ghai, Ria R.; Hyeroba, Hyeroba; Weny, Geoffrey; Tumukunde, Alex; Chapman, Colin A.; Wiseman, Roger W.; Dinis, Jorge; Steeil, James; Greiner, Ellis C.; Friedrich, Thomas C.; O’Connor, David H.; Goldberg, Tony L.
2013-01-01
Hemoparasites of the apicomplexan family Plasmodiidae include the etiological agents of malaria, as well as a suite of non-human primate parasites from which the human malaria agents evolved. Despite the significance of these parasites for global health, little information is available about their ecology in multi-host communities. Primates were investigated in Kibale National Park, Uganda, where ecological relationships among host species are well characterized. Blood samples were examined for parasites of the genera Plasmodium and Hepatocystis using microscopy and PCR targeting the parasite mitochondrial cytochrome b gene, followed by Sanger sequencing. To assess co-infection, “deep sequencing” of a variable region within cytochrome b was performed. Out of nine black-and-white colobus (Colobus guereza), one blue guenon (Cercopithecus mitis), five grey-cheeked mangabeys (Lophocebus albigena), 23 olive baboons (Papio anubis), 52 red colobus (Procolobus rufomitratus) and 12 red-tailed guenons (Cercopithecus ascanius), 79 infections (77.5%) were found, all of which were Hepatocystis spp. Sanger sequencing revealed 25 different parasite haplotypes that sorted phylogenetically into six species-specific but morphologically similar lineages. “Deep sequencing” revealed mixed-lineage co-infections in baboons and red colobus (41.7% and 64.7% of individuals, respectively) but not in other host species. One lineage infecting red colobus also infected baboons, but always as the minor variant, suggesting directional cross-species transmission. Hepatocystis parasites in this primate community are a diverse assemblage of cryptic lineages, some of which co-infect hosts and at least one of which can cross primate species barriers. PMID:23603520
Song, Ju Yeon; Jeong, Haeyoung; Yu, Dong Su; Fischbach, Michael A.; Park, Hong-Seog; Kim, Jae Jong; Seo, Jeong-Sun; Jensen, Susan E.; Oh, Tae Kwang; Lee, Kye Joon; Kim, Jihyun F.
2010-01-01
Streptomyces clavuligerus is an important industrial strain that produces a number of antibiotics, including clavulanic acid and cephamycin C. A high-quality draft genome sequence of the S. clavuligerus NRRL 3585 strain was produced by employing a hybrid approach that involved Sanger sequencing, Roche/454 pyrosequencing, optical mapping, and partial finishing. Its genome, comprising four linear replicons, one chromosome, and four plasmids, carries numerous sets of genes involved in the biosynthesis of secondary metabolites, including a variety of antibiotics. PMID:20889745
Transcriptome assembly and digital gene expression atlas of the rainbow trout
USDA-ARS?s Scientific Manuscript database
Background: Transcriptome analysis is a preferred method for gene discovery, marker development and gene expression profiling in non-model organisms. Previously, we sequenced a transcriptome reference using Sanger-based and 454-pyrosequencing, however, a transcriptome assembly is still incomplete an...
"First generation" automated DNA sequencing technology.
Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M
2011-10-01
Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.
A novel COL4A3 mutation causes autosomal-recessive Alport syndrome in a large Turkish family.
Uzak, Asli Subasioglu; Tokgoz, Bulent; Dundar, Munis; Tekin, Mustafa
2013-03-01
Alport syndrome (AS) is a genetically heterogeneous disorder that is characterized by hematuria, progressive renal failure typically resulting in end-stage renal disease, sensorineural hearing loss, and variable ocular abnormalities. Only 15% of cases with AS are autosomal recessive and are caused by mutations in the COL4A3 or COL4A4 genes, encoding type IV collagen. Clinical data in a large consanguineous family with four affected members were reviewed, and genomic DNA was extracted. For mapping, 15 microsatellite markers flanking COL4A3, COL4A4, and COL4A5 in 16 family members were typed. For mutation screening, all coding exons of COL4A3 were polymerase chain reaction- amplified and Sanger-sequenced from genomic DNA. The disease locus was mapped to chromosome 2q36.3, where COL4A3 and COL4A4 reside. Sanger sequencing revealed a novel mis-sense mutation (c.2T>C; p.M1T) in exon 1 of COL4A3. The identified nucleotide change was not found in 100 healthy ethnicity-matched controls via Sanger sequencing. We present a large consanguineous Turkish family with AS that was found to have a COL4A3 mutation as the cause of the disease. Although the relationship between the various genotypes and phenotypes in AS has not been fully elucidated, detailed clinical and molecular analyses are helpful for providing data to be used in genetic counseling. It is important to identify new mutations to clarify their clinical importance, to assess the prognosis of the disease, and to avoid renal biopsy for final diagnosis.
Madeira, Joao Lo; Nishi, Mirian Y; Nakaguma, Marilena; Benedetti, Anna F; Biscotto, Isabela Peixoto; Fernandes, Thamiris; Pequeno, Thiago; Figueiredo, Thalita; Franca, Marcela M; Correa, Fernanda A; Otto, Aline P; Abrão, Milena; Miras, Mirta B; Santos, Silvana; Jorge, Alexander Al; Costalonga, Everlayny F; Mendonca, Berenice B; Arnhold, Ivo Jp; Carvalho, Luciani R
2017-12-01
Mutations in PROP1, HESX1 and LHX3 are associated with combined pituitary hormone deficiency (CPHD) and orthotopic posterior pituitary lobe (OPP). To identify mutations in PROP1, HESX1 and LHX3 in a large cohort of patients with CPHD and OPP (35 Brazilian, two Argentinian). We studied 23 index patients with CPHD and OPP (six familial and 17 sporadic) as well as 14 relatives. PROP1 was sequenced by the Sanger method in all except one sporadic case studied using a candidate gene panel. Multiplex ligation-dependent probe amplification (MLPA) was applied to one familial case in whom PROP1 failed to amplify by PCR. In the 13 patients without PROP1 mutations, HESX1 and LHX3 were sequenced by the Sanger method. We identified PROP1 mutations in 10 index cases. Three mutations were novel: one affecting the initiation codon (c.1A>G) and two affecting splicing sites, c.109+1G>A and c.342+1G>C. The known mutations, c.150delA (p.Arg53Aspfs*112), c.218G>A (p.Arg73His), c.263T>C (p.Phe88Ser) and c.301_302delAG (p.Leu102Cysfs*8), were also detected. MLPA confirmed complete PROP1 deletion in one family. We did not identify HESX1 and LHX3 mutations by Sanger. PROP1 mutations are a prevalent cause of congenital CPHD with OPP, and therefore, PROP1 sequencing must be the first step of molecular investigation in patients with CPHD and OPP, especially in populations with a high frequency of PROP1 mutations. In the absence of mutations, massively parallel sequencing is a promising approach. The high prevalence and diversity of PROP1 mutations is associated with the ethnic background of this cohort. © 2017 John Wiley & Sons Ltd.
Identification of a novel MYO7A mutation in Usher syndrome type 1.
Cheng, Ling; Yu, Hongsong; Jiang, Yan; He, Juan; Pu, Sisi; Li, Xin; Zhang, Li
2018-01-05
Usher syndrome (USH) is an autosomal recessive disease characterized by deafness and retinitis pigmentosa. In view of the high phenotypic and genetic heterogeneity in USH, performing genetic screening with traditional methods is impractical. In the present study, we carried out targeted next-generation sequencing (NGS) to uncover the underlying gene in an USH family (2 USH patients and 15 unaffected relatives). One hundred and thirty-five genes associated with inherited retinal degeneration were selected for deep exome sequencing. Subsequently, variant analysis, Sanger validation and segregation tests were utilized to identify the disease-causing mutations in this family. All affected individuals had a classic USH type I (USH1) phenotype which included deafness, vestibular dysfunction and retinitis pigmentosa. Targeted NGS and Sanger sequencing validation suggested that USH1 patients carried an unreported splice site mutation, c.5168+1G>A, as a compound heterozygous mutation with c.6070C>T (p.R2024X) in the MYO7A gene. A functional study revealed decreased expression of the MYO7A gene in the individuals carrying heterozygous mutations. In conclusion, targeted next-generation sequencing provided a comprehensive and efficient diagnosis for USH1. This study revealed the genetic defects in the MYO7A gene and expanded the spectrum of clinical phenotypes associated with USH1 mutations.
Telele, Nigus Fikrie; Kalu, Amare Worku; Gebre-Selassie, Solomon; Fekade, Daniel; Abdurahman, Samir; Marrone, Gaetano; Neogi, Ujjwal; Tegbaru, Belete; Sönnerborg, Anders
2018-05-15
Baseline plasma samples of 490 randomly selected antiretroviral therapy (ART) naïve patients from seven hospitals participating in the first nationwide Ethiopian HIV-1 cohort were analysed for surveillance drug resistance mutations (sDRM) by population based Sanger sequencing (PBSS). Also next generation sequencing (NGS) was used in a subset of 109 baseline samples of patients. Treatment outcome after 6- and 12-months was assessed by on-treatment (OT) and intention-to-treat (ITT) analyses. Transmitted drug resistance (TDR) was detected in 3.9% (18/461) of successfully sequenced samples by PBSS. However, NGS detected sDRM more often (24%; 26/109) than PBSS (6%; 7/109) (p = 0.0001) and major integrase strand transfer inhibitors (INSTI) DRMs were also found in minor viral variants from five patients. Patients with sDRM had more frequent treatment failure in both OT and ITT analyses. The high rate of TDR by NGS and the identification of preexisting INSTI DRMs in minor wild-type HIV-1 subtype C viral variants infected Ethiopian patients underscores the importance of TDR surveillance in low- and middle-income countries and shows added value of high-throughput NGS in such studies.
Severe infantile leigh syndrome associated with a rare mitochondrial ND6 mutation, m.14487T>C.
Tarnopolsky, Mark; Meaney, Brandon; Robinson, Brian; Sheldon, Katherine; Boles, Richard G
2013-08-01
We describe a case of severe infantile-onset complex I deficiency in association with an apparent de novo near-homoplasmic mutation (m.14487T>C) in the mitochondrial ND6 gene, which was previously associated with Leigh syndrome and other neurological disorders. The mutation was near-homoplasmic in muscle by NextGen sequencing (99.4% mutant), homoplasmic in muscle by Sanger sequencing, and it was associated with a severe complex I deficiency in both muscle and fibroblasts. This supports previous data regarding Leigh syndrome being on the severe end of a phenotypic spectrum including progressive myoclonic epilepsy, childhood-onset dystonia, bilateral striatal necrosis, and optic atrophy, depending on the proportion of mutant heteroplasmy. While the mother in all previously reported cases was heteroplasmic, the mother and brother of this case were homoplasmic for the wild-type, m.14487T. Importantly, the current data demonstrate the potential for cases of mutations that were previously reported to be homoplasmic by Sanger sequencing to be less homoplasmic by NextGen sequencing. This case underscores the importance of considering mitochondrial DNA mutations in families with a negative family history, even in offspring of those who have tested negative for a specific mtDNA mutation. Copyright © 2013 Wiley Periodicals, Inc.
Characterization of the rainbow trout transcriptome using Sanger and 454-Pyrosequencing approaches
USDA-ARS?s Scientific Manuscript database
BACKGROUND: Rainbow trout is an important fish species for aquaculture and a model species for research investigations associated with carcinogenesis, comparative immunology, toxicology and the evolutionary biology. However, to date there is no genome reference sequence to facilitate the development...
Characterization of the rainbow trout transcriptome using Sanger and 454-pyrosequencing approaches
USDA-ARS?s Scientific Manuscript database
Background: Rainbow trout is an important fish for aquaculture and recreational fisheries and serves as a model species for research investigations associated with carcinogenesis, comparative immunology, toxicology and the evolutionary biology. However, to date there is no genome reference sequence...
[Rapid detection of hot spot mutations of FGFR3 gene with PCR-high resolution melting assay].
Li, Shan; Wang, Han; Su, Hua; Gao, Jinsong; Zhao, Xiuli
2017-08-10
To identify the causative mutations in five individuals affected with dyschondroplasia and develop an efficient procedure for detecting hot spot mutations of the FGFR3 gene. Genomic DNA was extracted from peripheral blood samples with a standard phenol/chloroform method. PCR-Sanger sequencing was used to analyze the causative mutations in the five probands. PCR-high resolution melting (HRM) was developed to detect the identified mutations. A c.1138G>A mutation in exon 8 was found in 4 probands, while a c.1620C>G mutation was found in exon 11 of proband 5 whom had a mild phenotype. All patients were successfully distinguished from healthy controls with the PCR-HRM method. The results of HRM analysis were highly consistent with that of Sanger sequencing. The Gly380Arg and Asn540Lys are hot spot mutations of the FGFR3 gene among patients with ACH/HCH. PCR-HRM analysis is more efficient for detecting hot spot mutations of the FGFR3 gene.
High-Throughput Next-Generation Sequencing of Polioviruses
Montmayeur, Anna M.; Schmidt, Alexander; Zhao, Kun; Magaña, Laura; Iber, Jane; Castro, Christina J.; Chen, Qi; Henderson, Elizabeth; Ramos, Edward; Shaw, Jing; Tatusov, Roman L.; Dybdahl-Sissoko, Naomi; Endegue-Zanga, Marie Claire; Adeniji, Johnson A.; Oberste, M. Steven; Burns, Cara C.
2016-01-01
ABSTRACT The poliovirus (PV) is currently targeted for worldwide eradication and containment. Sanger-based sequencing of the viral protein 1 (VP1) capsid region is currently the standard method for PV surveillance. However, the whole-genome sequence is sometimes needed for higher resolution global surveillance. In this study, we optimized whole-genome sequencing protocols for poliovirus isolates and FTA cards using next-generation sequencing (NGS), aiming for high sequence coverage, efficiency, and throughput. We found that DNase treatment of poliovirus RNA followed by random reverse transcription (RT), amplification, and the use of the Nextera XT DNA library preparation kit produced significantly better results than other preparations. The average viral reads per total reads, a measurement of efficiency, was as high as 84.2% ± 15.6%. PV genomes covering >99 to 100% of the reference length were obtained and validated with Sanger sequencing. A total of 52 PV genomes were generated, multiplexing as many as 64 samples in a single Illumina MiSeq run. This high-throughput, sequence-independent NGS approach facilitated the detection of a diverse range of PVs, especially for those in vaccine-derived polioviruses (VDPV), circulating VDPV, or immunodeficiency-related VDPV. In contrast to results from previous studies on other viruses, our results showed that filtration and nuclease treatment did not discernibly increase the sequencing efficiency of PV isolates. However, DNase treatment after nucleic acid extraction to remove host DNA significantly improved the sequencing results. This NGS method has been successfully implemented to generate PV genomes for molecular epidemiology of the most recent PV isolates. Additionally, the ability to obtain full PV genomes from FTA cards will aid in facilitating global poliovirus surveillance. PMID:27927929
Henderson, James B.; Sellas, Anna B.; Fuchs, Jérôme; Bowie, Rauri C.K.; Dumbacher, John P.
2017-01-01
We report here the successful assembly of the complete mitochondrial genomes of the northern spotted owl (Strix occidentalis caurina) and the barred owl (S. varia). We utilized sequence data from two sequencing methodologies, Illumina paired-end sequence data with insert lengths ranging from approximately 250 nucleotides (nt) to 9,600 nt and read lengths from 100–375 nt and Sanger-derived sequences. We employed multiple assemblers and alignment methods to generate the final assemblies. The circular genomes of S. o. caurina and S. varia are comprised of 19,948 nt and 18,975 nt, respectively. Both code for two rRNAs, twenty-two tRNAs, and thirteen polypeptides. They both have duplicated control region sequences with complex repeat structures. We were not able to assemble the control regions solely using Illumina paired-end sequence data. By fully spanning the control regions, Sanger-derived sequences enabled accurate and complete assembly of these mitochondrial genomes. These are the first complete mitochondrial genome sequences of owls (Aves: Strigiformes) possessing duplicated control regions. We searched the nuclear genome of S. o. caurina for copies of mitochondrial genes and found at least nine separate stretches of nuclear copies of gene sequences originating in the mitochondrial genome (Numts). The Numts ranged from 226–19,522 nt in length and included copies of all mitochondrial genes except tRNAPro, ND6, and tRNAGlu. Strix occidentalis caurina and S. varia exhibited an average of 10.74% (8.68% uncorrected p-distance) divergence across the non-tRNA mitochondrial genes. PMID:29038757
Algorithms for accelerated convergence of adaptive PCA.
Chatterjee, C; Kang, Z; Roychowdhury, V P
2000-01-01
We derive and discuss new adaptive algorithms for principal component analysis (PCA) that are shown to converge faster than the traditional PCA algorithms due to Oja, Sanger, and Xu. It is well known that traditional PCA algorithms that are derived by using gradient descent on an objective function are slow to converge. Furthermore, the convergence of these algorithms depends on appropriate choices of the gain sequences. Since online applications demand faster convergence and an automatic selection of gains, we present new adaptive algorithms to solve these problems. We first present an unconstrained objective function, which can be minimized to obtain the principal components. We derive adaptive algorithms from this objective function by using: 1) gradient descent; 2) steepest descent; 3) conjugate direction; and 4) Newton-Raphson methods. Although gradient descent produces Xu's LMSER algorithm, the steepest descent, conjugate direction, and Newton-Raphson methods produce new adaptive algorithms for PCA. We also provide a discussion on the landscape of the objective function, and present a global convergence proof of the adaptive gradient descent PCA algorithm using stochastic approximation theory. Extensive experiments with stationary and nonstationary multidimensional Gaussian sequences show faster convergence of the new algorithms over the traditional gradient descent methods.We also compare the steepest descent adaptive algorithm with state-of-the-art methods on stationary and nonstationary sequences.
Hong, Nan; Chen, Yan-hua; Xie, Chen; Xu, Bai-sheng; Huang, Hui; Li, Xin; Yang, Yue-qing; Huang, Ying-ping; Deng, Jian-lian; Qi, Ming; Gu, Yang-shun
2014-01-01
Objective: Nance-Horan syndrome (NHS) is a rare X-linked disorder characterized by congenital nuclear cataracts, dental anomalies, and craniofacial dysmorphisms. Mental retardation was present in about 30% of the reported cases. The purpose of this study was to investigate the genetic and clinical features of NHS in a Chinese family. Methods: Whole exome sequencing analysis was performed on DNA from an affected male to scan for candidate mutations on the X-chromosome. Sanger sequencing was used to verify these candidate mutations in the whole family. Clinical and ophthalmological examinations were performed on all members of the family. Results: A combination of exome sequencing and Sanger sequencing revealed a nonsense mutation c.322G>T (E108X) in exon 1 of NHS gene, co-segregating with the disease in the family. The nonsense mutation led to the conversion of glutamic acid to a stop codon (E108X), resulting in truncation of the NHS protein. Multiple sequence alignments showed that codon 108, where the mutation (c.322G>T) occurred, was located within a phylogenetically conserved region. The clinical features in all affected males and female carriers are described in detail. Conclusions: We report a nonsense mutation c.322G>T (E108X) in a Chinese family with NHS. Our findings broaden the spectrum of NHS mutations and provide molecular insight into future NHS clinical genetic diagnosis. PMID:25091991
Hong, Yoonki; Kim, Woo Jin; Bang, Chi Young; Lee, Jae Cheol; Oh, Yeon-Mok
2016-04-01
Lung cancer is the most common cause of cancer related death. Alterations in gene sequence, structure, and expression have an important role in the pathogenesis of lung cancer. Fusion genes and alternative splicing of cancer-related genes have the potential to be oncogenic. In the current study, we performed RNA-sequencing (RNA-seq) to investigate potential fusion genes and alternative splicing in non-small cell lung cancer. RNA was isolated from lung tissues obtained from 86 subjects with lung cancer. The RNA samples from lung cancer and normal tissues were processed with RNA-seq using the HiSeq 2000 system. Fusion genes were evaluated using Defuse and ChimeraScan. Candidate fusion transcripts were validated by Sanger sequencing. Alternative splicing was analyzed using multivariate analysis of transcript sequencing and validated using quantitative real time polymerase chain reaction. RNA-seq data identified oncogenic fusion genes EML4-ALK and SLC34A2-ROS1 in three of 86 normal-cancer paired samples. Nine distinct fusion transcripts were selected using DeFuse and ChimeraScan; of which, four fusion transcripts were validated by Sanger sequencing. In 33 squamous cell carcinoma, 29 tumor specific skipped exon events and six mutually exclusive exon events were identified. ITGB4 and PYCR1 were top genes that showed significant tumor specific splice variants. In conclusion, RNA-seq data identified novel potential fusion transcripts and splice variants. Further evaluation of their functional significance in the pathogenesis of lung cancer is required.
Correa, Fernanda A; França, Marcela M; Fang, Qing; Ma, Qianyi; Bachega, Tania A; Rodrigues, Andresa; Ozel, Bilge A; Li, Jun Z; Mendonca, Berenice B; Jorge, Alexander A L; Carvalho, Luciani R; Camper, Sally A; Arnhold, Ivo J P
2017-12-01
Isolated growth hormone deficiency (IGHD) is the most common pituitary hormone deficiency and, clinically, patients have delayed bone age. High sequence similarity between CYP21A2 gene and CYP21A1P pseudogene poses difficulties for exome sequencing interpretation. A 7.5 year-old boy born to second-degree cousins presented with severe short stature (height SDS -3.7) and bone age of 6 years. Clonidine and combined pituitary stimulation tests revealed GH deficiency. Pituitary MRI was normal. The patient was successfully treated with rGH. Surprisingly, at 10.8 years, his bone age had advanced to 13 years, but physical exam, LH and testosterone levels remained prepubertal. An ACTH stimulation test disclosed a non-classic congenital adrenal hyperplasia due to 21-hydroxylase deficiency explaining the bone age advancement and, therefore, treatment with cortisone acetate was added. The genetic diagnosis of a homozygous mutation in GHRHR (p.Leu144His), a homozygous CYP21A2 mutation (p.Val282Leu) and CYP21A1P pseudogene duplication was established by Sanger sequencing, MLPA and whole-exome sequencing. We report the unusual clinical presentation of a patient born to consanguineous parents with two recessive endocrine diseases: non-classic congenital adrenal hyperplasia modifying the classical GH deficiency phenotype. We used a method of paired read mapping aided by neighbouring mis-matches to overcome the challenges of exome-sequencing in the presence of a pseudogene.
Shu, Hai-Rong; Bi, Huai; Pan, Yang-Chun; Xu, Hang-Yu; Song, Jian-Xin; Hu, Jie
2015-09-16
Usher syndrome (USH) is an autosomal recessive disorder characterized by hearing impairment and vision dysfunction due to retinitis pigmentosa. Phenotypic and genetic heterogeneities of this disease make it impractical to obtain a genetic diagnosis by conventional Sanger sequencing. In this study, we applied a next-generation sequencing approach to detect genetic abnormalities in patients with USH. Two unrelated Chinese families were recruited, consisting of two USH afflicted patients and four unaffected relatives. We selected 199 genes related to inherited retinal diseases as targets for deep exome sequencing. Through systematic data analysis using an established bioinformatics pipeline, all variants that passed filter criteria were validated by Sanger sequencing and co-segregation analysis. A homozygous frameshift mutation (c.4382delA, p.T1462Lfs*2) was revealed in exon20 of gene USH2A in the F1 family. Two compound heterozygous mutations, IVS47 + 1G > A and c.13156A > T (p.I4386F), located in intron 48 and exon 63 respectively, of USH2A, were identified as causative mutations for the F2 family. Of note, the missense mutation c.13156A > T has not been reported so far. In conclusion, targeted exome sequencing precisely and rapidly identified the genetic defects in two Chinese USH families and this technique can be applied as a routine examination for these disorders with significant clinical and genetic heterogeneity.
A comprehensive characterization of rare mitochondrial DNA variants in neuroblastoma.
Calabrese, Francesco Maria; Clima, Rosanna; Pignataro, Piero; Lasorsa, Vito Alessandro; Hogarty, Michael D; Castellano, Aurora; Conte, Massimo; Tonini, Gian Paolo; Iolascon, Achille; Gasparre, Giuseppe; Capasso, Mario
2016-08-02
Neuroblastoma, a tumor of the developing sympathetic nervous system, is a common childhood neoplasm that is often lethal. Mitochondrial DNA (mtDNA) mutations have been found in most tumors including neuroblastoma. We extracted mtDNA data from a cohort of neuroblastoma samples that had undergone Whole Exome Sequencing (WES) and also used snap-frozen samples in which mtDNA was entirely sequenced by Sanger technology. We next undertook the challenge of determining those mutations that are relevant to, or arisen during tumor development. The bioinformatics pipeline used to extract mitochondrial variants from matched tumor/blood samples was enriched by a set of filters inclusive of heteroplasmic fraction, nucleotide variability, and in silico prediction of pathogenicity. Our in silico multistep workflow applied both on WES and Sanger-sequenced neuroblastoma samples, allowed us to identify a limited burden of somatic and germline mitochondrial mutations with a potential pathogenic impact. The few singleton germline and somatic mitochondrial mutations emerged, according to our in silico analysis, do not appear to impact on the development of neuroblastoma. Our findings are consistent with the hypothesis that most mitochondrial somatic mutations can be considered as 'passengers' and consequently have no discernible effect in this type of cancer.
Bai, Ying; Chen, Yibing; Kong, Xiangdong
2018-02-02
It has been reported that mutations in arginine vasopressin type 2 receptor (AVPR2) cause congenital X-linked nephrogenic diabetes insipidus (NDI). However, only a few cases of AVPR2 deletion have been documented in China. An NDI pedigree was included in this study, including the proband and his mother. All NDI patients had polyuria, polydipsia, and growth retardation. PCR mapping, long range PCR and sanger sequencing were used to identify genetic causes of NDI. A novel 22,110 bp deletion comprising AVPR2 and ARH4GAP4 genes was identified by PCR mapping, long range PCR and sanger sequencing. The deletion happened perhaps due to the 4-bp homologous sequence (TTTT) at the junctions of both 5' and 3' breakpoints. The gross deletion co-segregates with NDI. After analyzing available data of putative clinical signs of AVPR2 and ARH4GAP4 deletion, we reconsider the potential role of AVPR2 deletion in short stature. We identified a novel 22.1-kb deletion leading to X-linked NDI in a Chinese pedigree, which would increase the current knowledge in AVPR2 mutation.
Tahir, Muhammad N; Lockhart, Ben; Grinstead, Samuel; Mollov, Dimitre
2017-04-01
Bermuda grass samples were examined by transmission electron microscopy and 28-30 nm spherical virus particles were observed. Total RNA from these plants was subjected to high-throughput sequencing (HTS). The nearly full genome sequence of a panicovirus was identified from one HTS scaffold. Sanger sequencing was used to confirm the HTS results and complete the genome sequence of 4404 nt. This virus was provisionally named Bermuda grass latent virus (BGLV). Its predicted open reading frames follow the typical arrangement of the genus Panicovirus. Based on sequence comparisons and phylogenetic analyses BGLV differs from other viruses and therefore taxonomically it is a new member of the genus Panicovirus, family Tombusviridae.
Unlocking hidden genomic sequence
Keith, Jonathan M.; Cochran, Duncan A. E.; Lala, Gita H.; Adams, Peter; Bryant, Darryn; Mitchelson, Keith R.
2004-01-01
Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs. PMID:14973330
The Genome of the Cucumber, Cucumis Sativus L
USDA-ARS?s Scientific Manuscript database
Cucumber is an economically important crop as well as a model system for sex determination studies and plant vascular biology. Here we report the draft genome sequence of Cucumis sativus var. sativus L., assembled using a novel combination of traditional Sanger and next-generation Illumina GA sequen...
Booher, Nicholas J.; Carpenter, Sara C. D.; Sebra, Robert P.; Wang, Li; Salzberg, Steven L.; Leach, Jan E.
2015-01-01
Pathogen-injected, direct transcriptional activators of host genes, TAL (transcription activator-like) effectors play determinative roles in plant diseases caused by Xanthomonas spp. A large domain of nearly identical, 33–35 aa repeats in each protein mediates DNA recognition. This modularity makes TAL effectors customizable and thus important also in biotechnology. However, the repeats render TAL effector (tal) genes nearly impossible to assemble using next-generation, short reads. Here, we demonstrate that long-read, single molecule real-time (SMRT) sequencing solves this problem. Taking an ensemble approach to first generate local, tal gene contigs, we correctly assembled de novo the genomes of two strains of the rice pathogen X. oryzae completed previously using the Sanger method and even identified errors in those references. Sequencing two more strains revealed a dynamic genome structure and a striking plasticity in tal gene content. Our results pave the way for population-level studies to inform resistance breeding, improve biotechnology and probe TAL effector evolution. PMID:27148456
Sequence variants in four genes underlying Bardet-Biedl syndrome in consanguineous families
Ullah, Asmat; Umair, Muhammad; Yousaf, Maryam; Khan, Sher Alam; Nazim-ud-din, Muhammad; Shah, Khadim; Ahmad, Farooq; Azeem, Zahid; Ali, Ghazanfar; Alhaddad, Bader; Rafique, Afzal; Jan, Abid; Haack, Tobias B.; Strom, Tim M.; Meitinger, Thomas; Ghous, Tahseen
2017-01-01
Purpose To investigate the molecular basis of Bardet-Biedl syndrome (BBS) in five consanguineous families of Pakistani origin. Methods Linkage in two families (A and B) was established to BBS7 on chromosome 4q27, in family C to BBS8 on chromosome 14q32.1, and in family D to BBS10 on chromosome 12q21.2. Family E was investigated directly with exome sequence analysis. Results Sanger sequencing revealed two novel mutations and three previously reported mutations in the BBS genes. These mutations include two deletions (c.580_582delGCA, c.1592_1597delTTCCAG) in the BBS7 gene, a missense mutation (p.Gln449His) in the BBS8 gene, a frameshift mutation (c.271_272insT) in the BBS10 gene, and a nonsense mutation (p.Ser40*) in the MKKS (BBS6) gene. Conclusions Two novel mutations and three previously reported variants, identified in the present study, further extend the body of evidence implicating BBS6, BBS7, BBS8, and BBS10 in causing BBS. PMID:28761321
Jimenez, Nelson Lopez; Flannick, Jason; Yahyavi, Mani; Li, Jiang; Bardakjian, Tanya; Tonkin, Leath; Schneider, Adele; Sherr, Elliott H; Slavotinek, Anne M
2011-12-28
Anophthalmia/microphthalmia (A/M) is caused by mutations in several different transcription factors, but mutations in each causative gene are relatively rare, emphasizing the need for a testing approach that screens multiple genes simultaneously. We used next-generation sequencing to screen 15 A/M patients for mutations in 9 pathogenic genes to evaluate this technology for screening in A/M. We used a pooled sequencing design, together with custom single nucleotide polymorphism (SNP) calling software. We verified predicted sequence alterations using Sanger sequencing. We verified three mutations - c.542delC in SOX2, resulting in p.Pro181Argfs*22, p.Glu105X in OTX2 and p.Cys240X in FOXE3. We found several novel sequence alterations and SNPs that were likely to be non-pathogenic - p.Glu42Lys in CRYBA4, p.Val201Met in FOXE3 and p.Asp291Asn in VSX2. Our analysis methodology gave one false positive result comprising a mutation in PAX6 (c.1268A > T, predicting p.X423LeuextX*15) that was not verified by Sanger sequencing. We also failed to detect one 20 base pair (bp) deletion and one 3 bp duplication in SOX2. Our results demonstrated the power of next-generation sequencing with pooled sample groups for the rapid screening of candidate genes for A/M as we were correctly able to identify disease-causing mutations. However, next-generation sequencing was less useful for small, intragenic deletions and duplications. We did not find mutations in 10/15 patients and conclude that there is a need for further gene discovery in A/M.
2011-01-01
Background Anophthalmia/microphthalmia (A/M) is caused by mutations in several different transcription factors, but mutations in each causative gene are relatively rare, emphasizing the need for a testing approach that screens multiple genes simultaneously. We used next-generation sequencing to screen 15 A/M patients for mutations in 9 pathogenic genes to evaluate this technology for screening in A/M. Methods We used a pooled sequencing design, together with custom single nucleotide polymorphism (SNP) calling software. We verified predicted sequence alterations using Sanger sequencing. Results We verified three mutations - c.542delC in SOX2, resulting in p.Pro181Argfs*22, p.Glu105X in OTX2 and p.Cys240X in FOXE3. We found several novel sequence alterations and SNPs that were likely to be non-pathogenic - p.Glu42Lys in CRYBA4, p.Val201Met in FOXE3 and p.Asp291Asn in VSX2. Our analysis methodology gave one false positive result comprising a mutation in PAX6 (c.1268A > T, predicting p.X423LeuextX*15) that was not verified by Sanger sequencing. We also failed to detect one 20 base pair (bp) deletion and one 3 bp duplication in SOX2. Conclusions Our results demonstrated the power of next-generation sequencing with pooled sample groups for the rapid screening of candidate genes for A/M as we were correctly able to identify disease-causing mutations. However, next-generation sequencing was less useful for small, intragenic deletions and duplications. We did not find mutations in 10/15 patients and conclude that there is a need for further gene discovery in A/M. PMID:22204637
2017-05-01
TERMS Ovarian cancer, drug resistance, rucaparib, phase 2, DNA repair, homologous recombination, nonhomologous end-joining (NHEJ), poly(ADP-ribose...tissues from AA patients with OC. This should add 50 AA OC patients. We are also requesting anonymized DNA from AA OC patients who participated on...extracts DNA and creates library pretps for DNA sequencing. He performs Sanger sequencing validations. Funding Support: Has there been a change
Zhu, Qihui; Smith, Shavannor M; Ayele, Mulu; Yang, Lixing; Jogi, Ansuya; Chaluvadi, Srinivasa R; Bennetzen, Jeffrey L
2012-11-01
Tef (Eragrostis tef) is a major cereal crop in Ethiopia. Lodging is the primary constraint to increasing productivity in this allotetraploid species, accounting for losses of ∼15-45% in yield each year. As a first step toward identifying semi-dwarf varieties that might have improved lodging resistance, an ∼6× fosmid library was constructed and used to identify both homeologues of the dw3 semi-dwarfing gene of Sorghum bicolor. An EMS mutagenized population, consisting of ∼21,210 tef plants, was planted and leaf materials were collected into 23 superpools. Two dwarfing candidate genes, homeologues of dw3 of sorghum and rht1 of wheat, were sequenced directly from each superpool with 454 technology, and 120 candidate mutations were identified. Out of 10 candidates tested, six independent mutations were validated by Sanger sequencing, including two predicted detrimental mutations in both dw3 homeologues with a potential to improve lodging resistance in tef through further breeding. This study demonstrates that high-throughput sequencing can identify potentially valuable mutations in under-studied plant species like tef and has provided mutant lines that can now be combined and tested in breeding programs for improved lodging resistance.
Zheng, Yu; Wang, Hai-Lin; Li, Jian-Kang; Xu, Li; Tellier, Laurent; Li, Xiao-Lin; Huang, Xiao-Yan; Li, Wei; Niu, Tong-Tong; Yang, Huan-Ming; Zhang, Jian-Guo; Liu, Dong-Ning
2018-01-01
To study the genes responsible for retinitis pigmentosa. A total of 15 Chinese families with retinitis pigmentosa, containing 94 sporadically afflicted cases, were recruited. The targeted sequences were captured using the Target_Eye_365_V3 chip and sequenced using the BGISEQ-500 sequencer, according to the manufacturer's instructions. Data were aligned to UCSC Genome Browser build hg19, using the Burroughs Wheeler Aligner MEM algorithm. Local realignment was performed with the Genome Analysis Toolkit (GATK v.3.3.0) IndelRealigner, and variants were called with the Genome Analysis Toolkit Haplotypecaller, without any use of imputation. Variants were filtered against a panel derived from 1000 Genomes Project, 1000G_ASN, ESP6500, ExAC and dbSNP138. In all members of Family ONE and Family TWO with available DNA samples, the genetic variant was validated using Sanger sequencing. A novel, pathogenic variant of retinitis pigmentosa, c.357_358delAA (p.Ser119SerfsX5) was identified in PRPF31 in 2 of 15 autosomal-dominant retinitis pigmentosa (ADRP) families, as well as in one, sporadic case. Sanger sequencing was performed upon probands, as well as upon other family members. This novel, pathogenic genotype co-segregated with retinitis pigmentosa phenotype in these two families. ADRP is a subtype of retinitis pigmentosa, defined by its genotype, which accounts for 20%-40% of the retinitis pigmentosa patients. Our study thus expands the spectrum of PRPF31 mutations known to occur in ADRP, and provides further demonstration of the applicability of the BGISEQ500 sequencer for genomics research.
Statistical method to compare massive parallel sequencing pipelines.
Elsensohn, M H; Leblay, N; Dimassi, S; Campan-Fournier, A; Labalme, A; Roucher-Boulez, F; Sanlaville, D; Lesca, G; Bardel, C; Roy, P
2017-03-01
Today, sequencing is frequently carried out by Massive Parallel Sequencing (MPS) that cuts drastically sequencing time and expenses. Nevertheless, Sanger sequencing remains the main validation method to confirm the presence of variants. The analysis of MPS data involves the development of several bioinformatic tools, academic or commercial. We present here a statistical method to compare MPS pipelines and test it in a comparison between an academic (BWA-GATK) and a commercial pipeline (TMAP-NextGENe®), with and without reference to a gold standard (here, Sanger sequencing), on a panel of 41 genes in 43 epileptic patients. This method used the number of variants to fit log-linear models for pairwise agreements between pipelines. To assess the heterogeneity of the margins and the odds ratios of agreement, four log-linear models were used: a full model, a homogeneous-margin model, a model with single odds ratio for all patients, and a model with single intercept. Then a log-linear mixed model was fitted considering the biological variability as a random effect. Among the 390,339 base-pairs sequenced, TMAP-NextGENe® and BWA-GATK found, on average, 2253.49 and 1857.14 variants (single nucleotide variants and indels), respectively. Against the gold standard, the pipelines had similar sensitivities (63.47% vs. 63.42%) and close but significantly different specificities (99.57% vs. 99.65%; p < 0.001). Same-trend results were obtained when only single nucleotide variants were considered (99.98% specificity and 76.81% sensitivity for both pipelines). The method allows thus pipeline comparison and selection. It is generalizable to all types of MPS data and all pipelines.
Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuo, Alan; Grigoriev, Igor
2009-04-17
Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentousmore » ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.« less
Understanding the mechanism of resistance breaking on tomato by Tomato mottle mosaic virus
USDA-ARS?s Scientific Manuscript database
Tomato mottle mosaic virus (ToMMV) has broadened it’s distribution around the world. In our previous work, we observed a partial resistance breaking by ToMMV on tomato. To understand the mechanism of this resistance breaking, we carried out comparative analysis through Sanger sequencing, genotyping ...
Lu, Chaoxia; Wu, Wei; Xiao, Jifang; Meng, Yan; Zhang, Shuyang; Zhang, Xue
2013-06-01
To detect pathogenic mutations in Marfan syndrome (MFS) using an Ion Torrent Personal Genome Machine (PGM) and to validate the result of targeted next-generation semiconductor sequencing for the diagnosis of genetic disorders. Peripheral blood samples were collected from three MFS patients and a normal control with informed consent. Genomic DNA was isolated by standard method and then subjected to targeted sequencing using an Ion Ampliseq(TM) Inherited Disease Panel. Three multiplex PCR reactions were carried out to amplify the coding exons of 328 genes including FBN1, TGFBR1 and TGFBR2. DNA fragments from different samples were ligated with barcoded sequencing adaptors. Template preparation and emulsion PCR, and Ion Sphere Particles enrichment were carried out using an Ion One Touch system. The ion sphere particles were sequenced on a 318 chip using the PGM platform. Data from the PGM runs were processed using an Ion Torrent Suite 3.2 software to generate sequence reads. After sequence alignment and extraction of SNPs and indels, all the variants were filtered against dbSNP137. DNA sequences were visualized with an Integrated Genomics Viewer. The most likely disease-causing variants were analyzed by Sanger sequencing. The PGM sequencing has yielded an output of 855.80 Mb, with a > 100 × median sequencing depth and a coverage of > 98% for the targeted regions in all the four samples. After data analysis and database filtering, one known missense mutation (p.E1811K) and two novel premature termination mutations (p.E2264X and p.L871FfsX23) in the FBN1 gene were identified in the three MFS patients. All mutations were verified by conventional Sanger sequencing. Pathogenic FBN1 mutations have been identified in all patients with MFS, indicating that the targeted next-generation sequencing on the PGM sequencers can be applied for accurate and high-throughput testing of genetic disorders.
Trend of different molecular markers in the last decades for studying human migrations.
Kundu, Sharbadeb; Ghosh, Sankar Kumar
2015-02-10
Anatomically modern humans are known to have widely migrated throughout history. Different scientific evidences suggest that the entire human population descended from just several thousand African migrants. About 85,000 years ago, the first wave of human migration was out of Africa, that followed the coasts through the Middle East, into Southern Asia via Sri Lanka, and in due course around Indonesia and into Australia. Another wave of migration between 40,000 and 12,000 years ago brought humans northward into Europe. However, the frozen north limited human expansion in Europe, and created a land bridge, "Bering land bridge", connecting Asia with North America about 25,000 years ago. Although fossil data give the most direct information about our past, it has certain anomalies. So, molecular archeologists are now using different molecular markers to trace the "most recent common ancestor" and also the migration pattern of modern humans. In this study, we have studied the trend of molecular markers and also the methodologies implemented in the last decades (2003-2014). From our observation, we can say that D-loop region of mtDNA and Y chromosome based markers are predominant. Nevertheless, mtDNA, especially the D-loop region, has some unique features, which makes it a more effective marker for tracing prehistoric footprints of modern human populations. Although, natural selection should also be taken into account in studying mtDNA based human migration. As per technology is concerned, Sanger sequencing is the major technique that is being used in almost all studies. But, the emergence of different cost-effective-and-easy-to-handle NGS platforms has increased its popularity over Sanger sequencing in studying human migration. Copyright © 2014. Published by Elsevier B.V.
Mu, John C.; Tootoonchi Afshar, Pegah; Mohiyuddin, Marghoob; Chen, Xi; Li, Jian; Bani Asadi, Narges; Gerstein, Mark B.; Wong, Wing H.; Lam, Hugo Y. K.
2015-01-01
A high-confidence, comprehensive human variant set is critical in assessing accuracy of sequencing algorithms, which are crucial in precision medicine based on high-throughput sequencing. Although recent works have attempted to provide such a resource, they still do not encompass all major types of variants including structural variants (SVs). Thus, we leveraged the massive high-quality Sanger sequences from the HuRef genome to construct by far the most comprehensive gold set of a single individual, which was cross validated with deep Illumina sequencing, population datasets, and well-established algorithms. It was a necessary effort to completely reanalyze the HuRef genome as its previously published variants were mostly reported five years ago, suffering from compatibility, organization, and accuracy issues that prevent their direct use in benchmarking. Our extensive analysis and validation resulted in a gold set with high specificity and sensitivity. In contrast to the current gold sets of the NA12878 or HS1011 genomes, our gold set is the first that includes small variants, deletion SVs and insertion SVs up to a hundred thousand base-pairs. We demonstrate the utility of our HuRef gold set to benchmark several published SV detection tools. PMID:26412485
Exome Sequencing of 18 Chinese Families with Congenital Cataracts: A New Sight of the NHS Gene
Sun, Wenmin; Xiao, Xueshan; Li, Shiqiang; Guo, Xiangming; Zhang, Qingjiong
2014-01-01
Purpose The aim of this study was to investigate the mutation spectrum and frequency of 34 known genes in 18 Chinese families with congenital cataracts. Methods Genomic DNA and clinical data was collected from 18 families with congenital cataracts. Variations in 34 cataract-associated genes were screened by whole exome sequencing and then validated by Sanger sequencing. Results Eleven candidate variants in seven of the 34 genes were detected by exome sequencing and then confirmed by Sanger sequencing, including two variants predicted to be benign and the other pathogenic mutations. The nine mutations were present in 9 of the 18 (50%) families with congenital cataracts. Of the four families with mutations in the X-linked NHS gene, no other abnormalities were recorded except for cataract, in which a pseudo-dominant inheritance form was suggested, as female carriers also had different forms of cataracts. Conclusion This study expands the mutation spectrum and frequency of genes responsible for congenital cataract. Mutation in NHS is a common cause of nonsyndromic congenital cataract with pseudo-autosomal dominant inheritance. Combined with our previous studies, a genetic basis could be identified in 67.6% of families with congenital cataracts in our case series, in which mutations in genes encoding crystallins, genes encoding connexins, and NHS are responsible for 29.4%, 14.7%, and 11.8% of families, respectively. Our results suggest that mutations in NHS are the common cause of congenital cataract, both syndromic and nonsyndromic. PMID:24968223
Lin, Pengfei; Zhang, Dong; Xu, Guangrun; Yan, Chuanzhu
2018-04-01
Spinocerebellar ataxias (SCAs) are a group of autosomal dominant, clinically heterogeneous neurodegenerative disorders. SCA18 is a rare autosomal dominant sensory/motor neuropathy with ataxia (OMIM#607458) associated with a single missense variant c.514 A>G in the interferon related developmental regulator 1 (IFRD1) gene previously reported in a five-generation American family of Irish origin. However, to date, there have been no other reports of the IFRD1 mutation to confirm its role in SCA. Here, we report a Han Chinese family with SCA18; the family members presented with a slowly progressing gait ataxia, pyramidal tract signs, and peripheral neuropathy. We identified a missense variant (c.514 A>G, p.I172V) in IFRD1 gene in the family using targeted next-generation sequencing and Sanger direct sequencing with specific primers. Our results suggest that the IFRD1 gene may be the causative allele for SCA18.
Pesz, Karolina; Pienkowski, Victor Murcia; Pollak, Agnieszka; Gasperowicz, Piotr; Sykulski, Maciej; Kosińska, Joanna; Kiszko, Magdalena; Krzykwa, Bogusława; Bartnik-Głaska, Magdalena; Nowakowska, Beata; Rydzanicz, Małgorzata; Sasiadek, Maria Małgorzata; Płoski, Rafał
2018-04-03
Mapping of de novo balanced chromosomal translocations (BCTs) in patients with sporadic poorly characterized disease(s) is an unbiased method of finding candidate gene(s) responsible for the observed symptoms. We present a paediatric patient suffering from epilepsy, developmental delay (DD) and atrial septal defect IIº (ASD) requiring surgery. Karyotyping indicated an apparently balanced de novo reciprocal translocation 46,XX,t(3;4)(p25.3;q31.1), whereas aCGH did not reveal any copy number changes. Using shallow mate-pair whole genome sequencing and direct Sanger sequencing of breakpoint regions we found that translocation disrupted SLC6A1 and NAA15 genes. Our results confirm two previous reports indicating that loss of function of a single allele of SLC6A1 causes epilepsy. In addition, we extend existing evidence that disruption of NAA15 is associated with DD and with congenital heart defects. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
Oh, Yejin; Song, Ik-Chan; Kim, Jimyung; Kwon, Gye Cheol; Koo, Sun Hoe; Kim, Seon Young
2018-05-01
We developed a pyrosequencing-based method for the quantification of CALR mutations and compared the results using Sanger sequencing, fragment length analysis (FLA), digital-droplet PCR (ddPCR), and next-generation sequencing (NGS). Method validation studies were performed using cloned plasmid controls. Samples from 24 patients with myeloproliferative neoplasms were evaluated. Among the 24 patients, 15 had CALR mutations (7 type 1, 2 type 2, and 6 other mutations). The type 1 or type 2 mutation-positive results from pyrosequencing exhibited 100% concordance with the Sanger sequencing results. One novel CALR mutation was not detected by pyrosequencing. The CALR mutation allele burdens measured by pyrosequencing were slightly lower than those measured by FLA but slightly higher than the results obtained using ddPCR. Pyrosequencing exhibited high correlations with both methods. The mutation allele burdens estimated by NGS were significantly lower than those measured by pyrosequencing. An increased CALR mutation allele burden was associated with overt primary myelofibrosis. Patients with >70% mutation allele burdens in myeloid cells had a significantly longer time from diagnosis (P = 0.007), more bone marrow fibrosis (P = 0.010), and lower hemoglobin (P = 0.007). Pyrosequencing was a useful rapid sequencing method to determine the burden of CALR mutations. Copyright © 2018 Elsevier B.V. All rights reserved.
Meng, Xiaohong; Li, Qiyou; Guo, Hong; Xu, Haiwei; Li, Shiying; Yin, Zhengqin
2017-01-01
To characterize the clinical and molecular genetic characteristics of a large, multigenerational Chinese family showing different phenotypes. A pedigree consisted of 56 individuals in 5 generations was recruited. Comprehensive ophthalmic examinations were performed in 16 family members affected. Mutation screening of CYP4V2 was performed by Sanger sequencing. Next-generation sequencing (NGS) was performed to capture and sequence all exons of 47 known retinal dystrophy-associated genes in two affected family members who had no mutations in CYP4V2 . The detected variants in NGS were validated by Sanger sequencing in the family members. Two compound heterozygous CYP4V2 mutations (c.802-8_810del17insGC and c.992A>C) were detected in the proband who presented typical clinical features of BCD. One missense mutation (c.1482C>T, p.T494M) in the PRPF3 gene was detected in 9 out of 22 affected family members who manifested classical clinical features of RP. Our results showed that two compound heterozygous CYP4V2 mutations caused BCD, and one missense mutation in PRPF3 was responsible for adRP in this large family. This study suggests that accurate phenotypic diagnosis, molecular diagnosis, and genetic counseling are necessary for patients with hereditary retinal degeneration in some large mutigenerational family.
A novel ATTR L32V mutation causes familial amyloid polyneuropathy in a Bolivian family.
Martínez-Ulloa, Pedro L; Vallejo, Manuela; Corral, Iñigo; García-Barragán, Nuria; Alcazar, Alberto; Martínez-Alonso, Emma; Martínez-Poles, Javier; Pian, Hector; Jiménez-Escrig, Adriano
2017-09-01
We report a new transthyretin (ATTR) gene c.272C>G mutation and variant protein, p.Leu32Val, in a kindred of Bolivian origin with a rapid progressive peripheral neuropathy and cardiomyopathy. Three individuals from a kindred with peripheral nerve and cardiac amyloidosis were examined. Analysis of the TTR gene was performed by Sanger direct sequencing. Neuropathologic examination was obtained on the index patient with mass spectrometry study of the ATTR deposition. Direct DNA sequence analysis of exons 2, 3, and 4 of the TTR gene demonstrated a c.272 C>G mutation in exon 2 (p.L32V). Sural nerve biopsy revealed massive amyloid deposition in the perineurium, endoneurium and vasa nervorum. Mass spectrometric analyses of ATTR immunoprecipitated from nerve biopsy showed the presence of both wild-type and variant proteins. The observed mass results for the wild-type and variant proteins were consistent with the predicted values calculated from the genetic analysis data. The ATTR L32V is associated with a severe course. This has implications for treatment of affected individuals and counseling of family members. © 2017 Peripheral Nerve Society.
Abu-Farha, Mohamed; Melhem, Motasem; Abubaker, Jehad; Behbehani, Kazem; Alsmadi, Osama; Elkum, Naser
2016-02-11
ANGPTL8 (betatrophin) has been recently identified as a regulator of lipid metabolism through its interaction with ANGPTL3. A sequence variant in ANGPTL8 has been shown to associate with lower level of Low Density Lipoprotein (LDL) and High Density Lipoprotein (HDL). The objective of this study is to identify sequence variants in ANGPTL8 gene in Arabs and investigate their association with ANGPTL8 plasma level and clinical parameters. A cross sectional study was designed to examine the level of ANGPTL8 in 283 non-diabetic Arabs, and to identify its sequence variants using Sanger sequencing and their association with various clinical parameters. Using Sanger sequencing, we sequenced the full ANGPTL8 gene in 283 Arabs identifying two single nucleotide polymorphisms (SNPs) Rs.892066 and Rs.2278426 in the coding region. Our data shows for the first time that Arabs with the heterozygote form of (c.194C > T Rs.2278426) had higher level of Fasting Blood Glucose (FBG) compared to the CC homozygotes. LDL and HDL level in these subjects did not show significant difference between the two subgroups. Circulation level of ANGPTL8 did not vary between the two forms. No significant changes were observed between the various forms of Rs.892066 variant and FBG, LDL or HDL. Our data shows for the first time that heterozygote form of ANGPTL8 Rs.2278426 variant was associated with higher FBG level in Arabs highlighting the importance of these variants in controlling the function of betatrophin.
Ovine pedomics: the first study of the ovine foot 16S rRNA-based microbiome
USDA-ARS?s Scientific Manuscript database
We report the first study of the bacterial microbiome of ovine interdigital skin based on 16S rRNA by pyrosequencing and conventional cloning with Sanger-sequencing. Ovine foot rot is an infectious, contagious disease of sheep that causes severe lameness and economic loss from decreased flock produc...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kumar, Dibyendu; Buhay, Christian; Van Tonder, Andries
From left to right: Dibyendu Kumar of the University of Florida, Christian Buhay of Baylor College of Medicine, Andries van Tonder of Wellcome Sanger Trust Institute, Anna Montmayeur of the Broad Institute and Karen Davenport of Los Alamos National Laboratory at the Finishing forum on June 3, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.
Zheng, Yu; Wang, Hai-Lin; Li, Jian-Kang; Xu, Li; Tellier, Laurent; Li, Xiao-Lin; Huang, Xiao-Yan; Li, Wei; Niu, Tong-Tong; Yang, Huan-Ming; Zhang, Jian-Guo; Liu, Dong-Ning
2018-01-01
AIM To study the genes responsible for retinitis pigmentosa. METHODS A total of 15 Chinese families with retinitis pigmentosa, containing 94 sporadically afflicted cases, were recruited. The targeted sequences were captured using the Target_Eye_365_V3 chip and sequenced using the BGISEQ-500 sequencer, according to the manufacturer's instructions. Data were aligned to UCSC Genome Browser build hg19, using the Burroughs Wheeler Aligner MEM algorithm. Local realignment was performed with the Genome Analysis Toolkit (GATK v.3.3.0) IndelRealigner, and variants were called with the Genome Analysis Toolkit Haplotypecaller, without any use of imputation. Variants were filtered against a panel derived from 1000 Genomes Project, 1000G_ASN, ESP6500, ExAC and dbSNP138. In all members of Family ONE and Family TWO with available DNA samples, the genetic variant was validated using Sanger sequencing. RESULTS A novel, pathogenic variant of retinitis pigmentosa, c.357_358delAA (p.Ser119SerfsX5) was identified in PRPF31 in 2 of 15 autosomal-dominant retinitis pigmentosa (ADRP) families, as well as in one, sporadic case. Sanger sequencing was performed upon probands, as well as upon other family members. This novel, pathogenic genotype co-segregated with retinitis pigmentosa phenotype in these two families. CONCLUSION ADRP is a subtype of retinitis pigmentosa, defined by its genotype, which accounts for 20%-40% of the retinitis pigmentosa patients. Our study thus expands the spectrum of PRPF31 mutations known to occur in ADRP, and provides further demonstration of the applicability of the BGISEQ500 sequencer for genomics research. PMID:29375987
NASA Astrophysics Data System (ADS)
Govindarajan, A.; Pineda, J.; Purcell, M.; Tradd, K.; Packard, G.; Girard, A.; Dennett, M.; Breier, J. A., Jr.
2016-02-01
We present a new method to estimate the distribution of invertebrate larvae relative to environmental variables such as temperature, salinity, and circulation. A large volume in situ filtering system developed for discrete biogeochemical sampling in the deep-sea (the Suspended Particulate Rosette "SUPR" multisampler) was mounted to the autonomous underwater vehicle REMUS 600 for coastal larval and environmental sampling. We describe the results of SUPR-REMUS deployments conducted in Buzzards Bay, Massachusetts (2014) and west of Martha's Vineyard, Massachusetts (2015). We collected discrete samples cross-shore and from surface, middle, and bottom layers of the water column. Samples were preserved for DNA analysis. Our Buzzards Bay deployment targeted barnacle larvae, which are abundant in late winter and early spring. For these samples, we used morphological analysis and DNA barcodes generated by Sanger sequencing to obtain stage and species-specific cross-shore and vertical distributions. We targeted bivalve larvae in our 2015 deployments, and genetic analysis of larvae from these samples is underway. For these samples, we are comparing species barcode data derived from traditional Sanger sequencing of individuals to those obtained from next generation sequencing (NGS) of bulk plankton samples. Our results demonstrate the utility of autonomous sampling combined with DNA barcoding for studying larval distributions and transport dynamics.
Optimizing the molecular diagnosis of CDKL5 gene-related epileptic encephalopathy in boys.
Mei, Davide; Darra, Francesca; Barba, Carmen; Marini, Carla; Fontana, Elena; Chiti, Laura; Parrini, Elena; Dalla Bernardina, Bernardo; Guerrini, Renzo
2014-11-01
Mutations involving the cyclin-dependent kinase-like 5 (CDKL5) gene cause an early onset epileptic encephalopathy (EE) with severe neurologic impairment and a skewed 12:1 female-to-male ratio. To date, 18 mutations have been described in boys. We analyzed our cohort of boys with early onset EE to assess the diagnostic yield of our molecular approach. We studied 74 boys who presented early onset severe seizures, including infantile spasms and developmental delay, in the setting of EE, using Sanger sequencing, next-generation sequencing (NGS) and multiplex ligation-dependent probe amplification (MLPA). We identified alterations involving CDKL5 in four boys (5.4%) using NGS in one and MLPA in three. Three of four mutations were indicative of somatic mosaicism. CDKL5 gene mutations accounted for 5.4% of boys with early onset EE. Somatic mosaic mutations might be even more represented than germline mutations, probably because their less deleterious effect enhances viability of the male embryo. The molecular approach used for CDKL5 screening remarkably influences the diagnostic yield in boys. Diagnosis is optimized by Sanger sequencing combined with array-based methods or MLPA; alternatively, NGS targeted resequencing designed to also detect copy number alterations, may be performed. Wiley Periodicals, Inc. © 2014 International League Against Epilepsy.
Entamoeba histolytica: construction and applications of subgenomic databases.
Hofer, Margit; Duchêne, Michael
2005-07-01
Knowledge about the influence of environmental stress such as the action of chemotherapeutic agents on gene expression in Entamoeba histolytica is limited. We plan to use oligonucleotide microarray hybridization to approach these questions. As the basis for our array, sequence data from the genome project carried out by the Institute for Genomic Research (TIGR) and the Sanger Institute were used to annotate parts of the parasite genome. Three subgenomic databases containing enzymes, cytoskeleton genes, and stress genes were compiled with the help of the ExPASy proteomics website and the BLAST servers at the two genome project sites. The known sequences from reference species, mostly human and Escherichia coli, were searched against TIGR and Sanger E. histolytica sequence contigs and the homologs were copied into a Microsoft Access database. In a similar way, two additional databases of cytoskeletal genes and stress genes were generated. Metabolic pathways could be assembled from our enzyme database, but sometimes they were incomplete as is the case for the sterol biosynthesis pathway. The raw databases contained a significant number of duplicate entries which were merged to obtain curated non-redundant databases. This procedure revealed that some E. histolytica genes may have several putative functions. Representative examples such as the case of the delta-aminolevulinate synthase/serine palmitoyltransferase are discussed.
Gille, Johan J. P.; Floor, Karijn; Kerkhoven, Lianne; Ameziane, Najim; Joenje, Hans; de Winter, Johan P.
2012-01-01
Fanconi anemia (FA) is a rare inherited disease characterized by developmental defects, short stature, bone marrow failure, and a high risk of malignancies. FA is heterogeneous: 15 genetic subtypes have been distinguished so far. A clinical diagnosis of FA needs to be confirmed by testing cells for sensitivity to cross-linking agents in a chromosomal breakage test. As a second step, DNA testing can be employed to elucidate the genetic subtype of the patient and to identify the familial mutations. This knowledge allows preimplantation genetic diagnosis (PGD) and enables prenatal DNA testing in future pregnancies. Although simultaneous testing of all FA genes by next generation sequencing will be possible in the near future, this technique will not be available immediately for all laboratories. In addition, in populations with strong founder mutations, a limited test using Sanger sequencing and MLPA will be a cost-effective alternative. We describe a strategy and optimized conditions for the screening of FANCA, FANCB, FANCC, FANCE, FANCF, and FANCG and present the results obtained in a cohort of 54 patients referred to our diagnostic service since 2008. In addition, the follow up with respect to genetic counseling and carrier screening in the families is discussed. PMID:22778927
Lasigliè, Denise; Mensa-Vilaro, Anna; Ferrera, Denise; Caorsi, Roberta; Penco, Federica; Santamaria, Giuseppe; Di Duca, Marco; Amico, Giulia; Nakagawa, Kenji; Antonini, Francesca; Tommasini, Alberto; Consolini, Rita; Insalaco, Antonella; Cattalini, Marco; Obici, Laura; Gallizzi, Romina; Santarelli, Francesca; Del Zotto, Genny; Severino, Mariasavina; Rubartelli, Anna; Ravazzolo, Roberto; Martini, Alberto; Ceccherini, Isabella; Nishikomori, Ryuta; Gattorno, Marco; Arostegui, Juan I; Borghini, Silvia
2017-11-01
To evaluate the rate of somatic NLRP3 mosaicism in an Italian cohort of mutation-negative patients with cryopyrin-associated periodic syndrome (CAPS). The study enrolled 14 patients with a clinical phenotype consistent with CAPS in whom Sanger sequencing of the NLRP3 gene yielded negative results. Patients' DNA were subjected to amplicon-based NLRP3 deep sequencing. Low-level somatic NLRP3 mosaicism has been detected in 4 patients, 3 affected with chronic infantile neurological cutaneous and articular syndrome and 1 with Muckle-Wells syndrome. Identified nucleotide substitutions encode for 4 different amino acid exchanges, with 2 of them being novel (p.Y563C and p.G564S). In vitro functional studies confirmed the deleterious behavior of the 4 somatic NLRP3 mutations. Among the different neurological manifestations detected, 1 patient displayed mild loss of white matter volume on brain magnetic resonance imaging. The allele frequency of somatic NLRP3 mutations occurs generally under 15%, considered the threshold of detectability using the Sanger method of DNA sequencing. Consequently, routine genetic diagnostic of CAPS should be currently performed by next-generation techniques ensuring high coverage to identify also low-level mosaicism, whose actual frequency is yet unknown and probably underestimated.
A comprehensive characterization of rare mitochondrial DNA variants in neuroblastoma
Pignataro, Piero; Lasorsa, Vito Alessandro; Hogarty, Michael D.; Castellano, Aurora; Conte, Massimo; Tonini, Gian Paolo; Iolascon, Achille; Gasparre, Giuseppe; Capasso, Mario
2016-01-01
Background Neuroblastoma, a tumor of the developing sympathetic nervous system, is a common childhood neoplasm that is often lethal. Mitochondrial DNA (mtDNA) mutations have been found in most tumors including neuroblastoma. We extracted mtDNA data from a cohort of neuroblastoma samples that had undergone Whole Exome Sequencing (WES) and also used snap-frozen samples in which mtDNA was entirely sequenced by Sanger technology. We next undertook the challenge of determining those mutations that are relevant to, or arisen during tumor development. The bioinformatics pipeline used to extract mitochondrial variants from matched tumor/blood samples was enriched by a set of filters inclusive of heteroplasmic fraction, nucleotide variability, and in silico prediction of pathogenicity. Results Our in silico multistep workflow applied both on WES and Sanger-sequenced neuroblastoma samples, allowed us to identify a limited burden of somatic and germline mitochondrial mutations with a potential pathogenic impact. Conclusions The few singleton germline and somatic mitochondrial mutations emerged, according to our in silico analysis, do not appear to impact on the development of neuroblastoma. Our findings are consistent with the hypothesis that most mitochondrial somatic mutations can be considered as ‘passengers’ and consequently have no discernible effect in this type of cancer. PMID:27351283
Vavrova, Eva; Kantorova, Barbara; Vonkova, Barbara; Kabathova, Jitka; Skuhrova-Francova, Hana; Diviskova, Eva; Letocha, Ondrej; Kotaskova, Jana; Brychtova, Yvona; Doubek, Michael; Mayer, Jiri; Pospisilova, Sarka
2017-09-01
The hotspot c.7541_7542delCT NOTCH1 mutation has been proven to have a negative clinical impact in chronic lymphocytic leukemia (CLL). However, an optimal method for its detection has not yet been specified. The aim of our study was to examine the presence of the NOTCH1 mutation in CLL using three commonly used molecular methods. Sanger sequencing, fragment analysis and allele-specific PCR were compared in the detection of the c.7541_7542delCT NOTCH1 mutation in 201 CLL patients. In 7 patients with inconclusive mutational analysis results, the presence of the NOTCH1 mutation was also confirmed using ultra-deep next generation sequencing. The NOTCH1 mutation was detected in 15% (30/201) of examined patients. Only fragment analysis was able to identify all 30 NOTCH1-mutated patients. Sanger sequencing and allele-specific PCR showed a lower detection efficiency, determining 93% (28/30) and 80% (24/30) of the present NOTCH1 mutations, respectively. Considering these three most commonly used methodologies for c.7541_7542delCT NOTCH1 mutation screening in CLL, we defined fragment analysis as the most suitable approach for detecting the hotspot NOTCH1 mutation. Copyright © 2017 Elsevier Ltd. All rights reserved.
Shen, Tao; Guan, Liping; Li, Shiqiang; Zhang, Jianguo; Xiao, Xueshan; Jiang, Hui; Yang, Jianhua; Guo, Xiangming; Wang, Jun; Zhang, Qingjiong
2015-03-01
The genetic defects underlying approximately half of all retinitis pigmentosa (RP) cases are unknown. A number of genes responsible for Leber congenital amaurosis (LCA) may also cause RP when they are mutated. Our previous study revealed that variants in the most frequently mutated nine exons accounted for approximately half of the mutations detected in a cohort of patients with LCA. The aim of the present study was to detect mutations in LCA-associated genes in patients with RP using two different strategies. Sanger sequencing was used to screen mutations in the nine exons in 293 patients with RP and exome sequencing was used to detect variants in 12 LCA-associated genes in 157 of the 293 patients with RP and then to validate the variants by Sanger sequencing. Potential pathogenic mutations were identified in four patients with early onset RP, including homozygous CRB1 mutations in two patients, compound heterozygous CRB1 mutations in one patient and compound heterozygous CEP290 mutations in one patient. The present study indicated that mutations in CEP290 may also be associated with RP but not with LCA. With the exception of CEP290, the remaining 11 genes known to be associated with LCA but not with RP are unlikely to be a common cause of RP.
Kariminejad, Ariana; Ajeawung, Norbert Fonya; Bozorgmehr, Bita; Dionne-Laporte, Alexandre; Molidperee, Sirinart; Najafi, Kimia; Gibbs, Richard A; Lee, Brendan H; Hennekam, Raoul C; Campeau, Philippe M
2017-04-01
Kaufman oculo-cerebro-facial syndrome (KOS) is caused by recessive UBE3B mutations and presents with microcephaly, ocular abnormalities, distinctive facial morphology, low cholesterol levels and intellectual disability. We describe a child with microcephaly, brachycephaly, hearing loss, ptosis, blepharophimosis, hypertelorism, cleft palate, multiple renal cysts, absent nails, small or absent terminal phalanges, absent speech and intellectual disability. Syndromes that were initially considered include DOORS syndrome, Coffin-Siris syndrome and Dubowitz syndrome. Clinical investigations coupled with karyotype analysis, array-comparative genomic hybridization, exome and Sanger sequencing were performed to characterize the condition in this child. Sanger sequencing was negative for the DOORS syndrome gene TBC1D24 but exome sequencing identified a homozygous deletion in UBE3B (NM_183415:c.3139_3141del, p.1047_1047del) located within the terminal portion of the HECT domain. This finding coupled with the presence of characteristic features such as brachycephaly, ptosis, blepharophimosis, hypertelorism, short palpebral fissures, cleft palate and developmental delay allowed us to make a diagnosis of KOS. In conclusion, our findings highlight the importance of considering KOS as a differential diagnosis for patients under evaluation for DOORS syndrome and expand the phenotype of KOS to include small or absent terminal phalanges, nails, and the presence of hallux varus and multicystic dysplastic kidneys.
Data compression for sequencing data
2013-01-01
Post-Sanger sequencing methods produce tons of data, and there is a general agreement that the challenge to store and process them must be addressed with data compression. In this review we first answer the question “why compression” in a quantitative manner. Then we also answer the questions “what” and “how”, by sketching the fundamental compression ideas, describing the main sequencing data types and formats, and comparing the specialized compression algorithms and tools. Finally, we go back to the question “why compression” and give other, perhaps surprising answers, demonstrating the pervasiveness of data compression techniques in computational biology. PMID:24252160
Authentication of Herbal Supplements Using Next-Generation Sequencing
Braukmann, Thomas W. A.; Borisenko, Alex V.; Zakharov, Evgeny V.
2016-01-01
Background DNA-based testing has been gaining acceptance as a tool for authentication of a wide range of food products; however, its applicability for testing of herbal supplements remains contentious. Methods We utilized Sanger and Next-Generation Sequencing (NGS) for taxonomic authentication of fifteen herbal supplements representing three different producers from five medicinal plants: Echinacea purpurea, Valeriana officinalis, Ginkgo biloba, Hypericum perforatum and Trigonella foenum-graecum. Experimental design included three modifications of DNA extraction, two lysate dilutions, Internal Amplification Control, and multiple negative controls to exclude background contamination. Ginkgo supplements were also analyzed using HPLC-MS for the presence of active medicinal components. Results All supplements yielded DNA from multiple species, rendering Sanger sequencing results for rbcL and ITS2 regions either uninterpretable or non-reproducible between the experimental replicates. Overall, DNA from the manufacturer-listed medicinal plants was successfully detected in seven out of eight dry herb form supplements; however, low or poor DNA recovery due to degradation was observed in most plant extracts (none detected by Sanger; three out of seven–by NGS). NGS also revealed a diverse community of fungi, known to be associated with live plant material and/or the fermentation process used in the production of plant extracts. HPLC-MS testing demonstrated that Ginkgo supplements with degraded DNA contained ten key medicinal components. Conclusion Quality control of herbal supplements should utilize a synergetic approach targeting both DNA and bioactive components, especially for standardized extracts with degraded DNA. The NGS workflow developed in this study enables reliable detection of plant and fungal DNA and can be utilized by manufacturers for quality assurance of raw plant materials, contamination control during the production process, and the final product. Interpretation of results should involve an interdisciplinary approach taking into account the processes involved in production of herbal supplements, as well as biocomplexity of plant-plant and plant-fungal biological interactions. PMID:27227830
Authentication of Herbal Supplements Using Next-Generation Sequencing.
Ivanova, Natalia V; Kuzmina, Maria L; Braukmann, Thomas W A; Borisenko, Alex V; Zakharov, Evgeny V
2016-01-01
DNA-based testing has been gaining acceptance as a tool for authentication of a wide range of food products; however, its applicability for testing of herbal supplements remains contentious. We utilized Sanger and Next-Generation Sequencing (NGS) for taxonomic authentication of fifteen herbal supplements representing three different producers from five medicinal plants: Echinacea purpurea, Valeriana officinalis, Ginkgo biloba, Hypericum perforatum and Trigonella foenum-graecum. Experimental design included three modifications of DNA extraction, two lysate dilutions, Internal Amplification Control, and multiple negative controls to exclude background contamination. Ginkgo supplements were also analyzed using HPLC-MS for the presence of active medicinal components. All supplements yielded DNA from multiple species, rendering Sanger sequencing results for rbcL and ITS2 regions either uninterpretable or non-reproducible between the experimental replicates. Overall, DNA from the manufacturer-listed medicinal plants was successfully detected in seven out of eight dry herb form supplements; however, low or poor DNA recovery due to degradation was observed in most plant extracts (none detected by Sanger; three out of seven-by NGS). NGS also revealed a diverse community of fungi, known to be associated with live plant material and/or the fermentation process used in the production of plant extracts. HPLC-MS testing demonstrated that Ginkgo supplements with degraded DNA contained ten key medicinal components. Quality control of herbal supplements should utilize a synergetic approach targeting both DNA and bioactive components, especially for standardized extracts with degraded DNA. The NGS workflow developed in this study enables reliable detection of plant and fungal DNA and can be utilized by manufacturers for quality assurance of raw plant materials, contamination control during the production process, and the final product. Interpretation of results should involve an interdisciplinary approach taking into account the processes involved in production of herbal supplements, as well as biocomplexity of plant-plant and plant-fungal biological interactions.
Gong, Bo; Wei, Bo; Huang, Lulin; Hao, Jilong; Li, Xiulan; Yang, Yin; Zhou, Yu; Hao, Fang; Cui, Zhihua; Zhang, Dingding; Wang, Le
2015-01-01
Retinitis pigmentosa (RP) is the most important hereditary retinal disease caused by progressive degeneration of the photoreceptor cells. This study is to identify gene mutations responsible for autosomal recessive retinitis pigmentosa (arRP) in a Chinese family using next-generation sequencing technology. A Chinese family with 7 members including two individuals affected with severe early-onset RP was studied. All patients underwent a complete ophthalmic examination. Exome sequencing was performed on a single RP patient (the proband of this family) and direct Sanger sequencing on other family members and normal controls was followed to confirm the causal mutations. A homozygous mutation c.437T
A DNA mini-barcode for land plants.
Little, Damon P
2014-05-01
Small portions of the barcode region - mini-barcodes - may be used in place of full-length barcodes to overcome DNA degradation for samples with poor DNA preservation. 591,491,286 rbcL mini-barcode primer combinations were electronically evaluated for PCR universality, and two novel highly universal sets of priming sites were identified. Novel and published rbcL mini-barcode primers were evaluated for PCR amplification [determined with a validated electronic simulation (n = 2765) and empirically (n = 188)], Sanger sequence quality [determined empirically (n = 188)], and taxonomic discrimination [determined empirically (n = 30,472)]. PCR amplification for all mini-barcodes, as estimated by validated electronic simulation, was successful for 90.2-99.8% of species. Overall Sanger sequence quality for mini-barcodes was very low - the best mini-barcode tested produced sequences of adequate quality (B20 ≥ 0.5) for 74.5% of samples. The majority of mini-barcodes provide correct identifications of families in excess of 70.1% of the time. Discriminatory power noticeably decreased at lower taxonomic levels. At the species level, the discriminatory power of the best mini-barcode was less than 38.2%. For samples believed to contain DNA from only one species, an investigator should attempt to sequence, in decreasing order of utility and probability of success, mini-barcodes F (rbcL1/rbcLB), D (F52/R193) and K (F517/R604). For samples believed to contain DNA from more than one species, an investigator should amplify and sequence mini-barcode D (F52/R193). © 2013 John Wiley & Sons Ltd.
A novel variant of FGFR3 causes proportionate short stature.
Kant, Sarina G; Cervenkova, Iveta; Balek, Lukas; Trantirek, Lukas; Santen, Gijs W E; de Vries, Martine C; van Duyvenvoorde, Hermine A; van der Wielen, Michiel J R; Verkerk, Annemieke J M H; Uitterlinden, André G; Hannema, Sabine E; Wit, Jan M; Oostdijk, Wilma; Krejci, Pavel; Losekoot, Monique
2015-06-01
Mutations of the fibroblast growth factor receptor 3 (FGFR3) cause various forms of short stature, of which the least severe phenotype is hypochondroplasia, mainly characterized by disproportionate short stature. Testing for an FGFR3 mutation is currently not part of routine diagnostic testing in children with short stature without disproportion. A three-generation family A with dominantly transmitted proportionate short stature was studied by whole-exome sequencing to identify the causal gene mutation. Functional studies and protein modeling studies were performed to confirm the pathogenicity of the mutation found in FGFR3. We performed Sanger sequencing in a second family B with dominant proportionate short stature and identified a rare variant in FGFR3. Exome sequencing and/or Sanger sequencing was performed, followed by functional studies using transfection of the mutant FGFR3 into cultured cells; homology modeling was used to construct a three-dimensional model of the two FGFR3 variants. A novel p.M528I mutation in FGFR3 was detected in family A, which segregates with short stature and proved to be activating in vitro. In family B, a rare variant (p.F384L) was found in FGFR3, which did not segregate with short stature and showed normal functionality in vitro compared with WT. Proportionate short stature can be caused by a mutation in FGFR3. Sequencing of this gene can be considered in patients with short stature, especially when there is an autosomal dominant pattern of inheritance. However, functional studies and segregation studies should be performed before concluding that a variant is pathogenic. © 2015 European Society of Endocrinology.
Jurkowska, Monika; Gos, Aleksandra; Ptaszyński, Konrad; Michej, Wanda; Tysarowski, Andrzej; Zub, Renata; Siedlecki, Janusz A; Rutkowski, Piotr
2015-01-01
The study compares detection rates of oncogenic BRAF mutations in a homogenous group of 236 FFPE cutaneous melanoma lymph node metastases, collected in one cancer center. BRAF mutational status was verified by two independent in-house PCR/Sanger sequencing tests, and the Cobas® 4800 BRAF V600 Mutation Test. The best of two sequencing approaches returned results for 230/236 samples. In 140 (60.9%), the mutation in codon 600 of BRAF was found. 91.4% of all mutated cases (128 samples) represented p.V600E. Both Sanger-based tests gave reproducible results although they differed significantly in the percentage of amplifiable samples: 230/236 to 109/143. Cobas generated results in all 236 cases, mutations changing codon V600 were detected in 144 of them (61.0%), including 5 not amplifiable and 5 negative in the standard sequencing. However, 6 cases positive in sequencing turned out to be negative in Cobas. Both tests provided us with the same BRAF V600 mutational status in 219 out of 230 cases with valid results (95.2%). The total BRAF V600 mutation detection rate didn't differ significantly between the two methodological approaches (60.9% vs. 61.0%). Sequencing was a reproducible method of V600 mutation detection and more powerful to detect mutations other than p.V600E, while Cobas test proved to be less susceptible to the poor DNA quality or investigator's bias. The study underlined an important role of pathologists in quality assurance of molecular diagnostics.
Urabe, N; Ishii, Y; Hyodo, Y; Aoki, K; Yoshizawa, S; Saga, T; Murayama, S Y; Sakai, K; Homma, S; Tateda, K
2016-04-01
Between 18 November and 3 December 2011, five renal transplant patients at the Department of Nephrology, Toho University Omori Medical Centre, Tokyo, were diagnosed with Pneumocystis pneumonia (PCP). We used molecular epidemiologic methods to determine whether the patients were infected with the same strain of Pneumocystis jirovecii. DNA extracted from the residual bronchoalveolar lavage fluid from the five outbreak cases and from another 20 cases of PCP between 2007 and 2014 were used for multilocus sequence typing to compare the genetic similarity of the P. jirovecii. DNA base sequencing by the Sanger method showed some regions where two bases overlapped and could not be defined. A next-generation sequencer was used to analyse the types and ratios of these overlapping bases. DNA base sequences of P. jirovecii in the bronchoalveolar lavage fluid from four of the five PCP patients in the 2011 outbreak and from another two renal transplant patients who developed PCP in 2013 were highly homologous. The Sanger method revealed 14 genomic regions where two differing DNA bases overlapped and could not be identified. Analyses of the overlapping bases by a next-generation sequencer revealed that the differing types of base were present in almost identical ratios. There is a strong possibility that the PCP outbreak at the Toho University Omori Medical Centre was caused by the same strain of P. jirovecii. Two different types of base present in some regions may be due to P. jirovecii's being a diploid species. Copyright © 2015 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Pyrosequencing for detection of lamivudine-resistant hepatitis B virus.
Lindström, Anna; Odeberg, Jacob; Albert, Jan
2004-10-01
Chronic hepatitis B virus (HBV) infection can cause severe liver disease, including cirrhosis and hepatocellular carcinoma. Lamivudine is a relatively recent alternative to alpha interferon for the treatment of HBV infection, but unfortunately, resistance to lamivudine commonly develops during monotherapy. Lamivudine-resistant HBV mutants display specific mutations in the YMDD (tyrosine, methionine, aspartate, aspartate) motif of the viral polymerase (reverse transcriptase [rt]), which is the catalytic site of the enzyme, i.e., methionine 204 to isoleucine (rtM204I) or valine (rtM204V). The latter mutation is often accompanied by a compensatory leucine-to-methionine change at codon 180 (rtL180M). In the present study, a novel sequencing method, pyrosequencing, was applied to the detection of lamivudine resistance mutations and was compared with direct Sanger sequencing. The new pyrosequencing method had advantages in terms of throughput. Experiments with mixtures of wild-type and resistant viruses indicated that pyrosequencing can detect minor sequence variants in heterogeneous virus populations. The new pyrosequencing method was evaluated with a small number of patient samples, and the results showed that the method could be a useful tool for the detection of lamivudine resistance in the clinical setting.
Genomics - the new rock and roll?
Dunham, I
2000-10-01
The end of the beginning of the Human Genome Project was announced on 26 June when the working draft or first assembly was announced. Here, Ian Dunham who led the group at the Sanger Centre that produced the first complete sequence of a human chromosome reflects on how it felt to be with the genome project from the beginning.
Gao, Ge; Johnson, Sarah H; Vasmatzis, George; Pauley, Christina E; Tombers, Nicole M; Kasperbauer, Jan L; Smith, David I
2017-01-01
Common fragile sites (CFS) are chromosome regions that are prone to form gaps or breaks in response to DNA replication stress. They are often found as hotspots for sister chromatid exchanges, deletions, and amplifications in different cancers. Many of the CFS regions are found to span genes whose genomic sequence is greater than 1 Mb, some of which have been demonstrated to function as important tumor suppressors. CFS regions are also hotspots for human papillomavirus (HPV) integrations in cervical cancer. We used mate-pair sequencing to examine HPV integration events and chromosomal structural variations in 34 oropharyngeal squamous cell carcinoma (OPSCC). We used endpoint PCR and Sanger sequencing to validate each HPV integration event and found HPV integrations preferentially occurred within CFS regions similar to what is observed in cervical cancer. We also found that many of the chromosomal alterations detected also occurred at or near the cytogenetic location of CFSs. Several large genes were also found to be recurrent targets of rearrangements, independent of HPV integrations, including CSMD1 (2.1Mb), LRP1B (1.9Mb), and LARGE1 (0.7Mb). Sanger sequencing revealed that the nucleotide sequences near to identified junction sites contained repetitive and AT-rich sequences that were shown to have the potential to form stem-loop DNA secondary structures that might stall DNA replication fork progression during replication stress. This could then cause increased instability in these regions which could lead to cancer development in human cells. Our findings suggest that CFSs and some specific large genes appear to play important roles in OPSCC. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Rosenthal, Lisa M; Larsson, Karl-Henrik; Branco, Sara; Chung, Judy A; Glassman, Sydney I; Liao, Hui-Ling; Peay, Kabir G; Smith, Dylan P; Talbot, Jennifer M; Taylor, John W; Vellinga, Else C; Vilgalys, Rytas; Bruns, Thomas D
2017-01-01
The corticioid fungi are commonly encountered, highly diverse, ecologically important, and understudied. We collected specimens in 60 pine and spruce forests across North America to survey corticioid fungal frequency and distribution and to compile an internal transcribed spacer (ITS) database for the group. Sanger sequences from the ITS region of vouchered specimens were compared with sequences on GenBank and UNITE, and with high-throughput sequence data from soil and roots taken at the same sites. Out of 425 high-quality Sanger sequences from vouchered specimens, we recovered 223 distinct operational taxonomic units (OTUs), the majority of which could not be assigned to species by matching to the BLAST database. Corticioid fungi were found to be hyperdiverse, as supported by the observations that nearly two-thirds of our OTUs were represented by single collections and species estimator curves showed steep slopes with no plateaus. We estimate that 14.8-24.7% of our voucher-based OTUs are likely to be ectomycorrhizal (EM). Corticioid fungi recovered from the soil formed a different community assemblage, with EM taxa accounting for 40.5-58.6% of OTUs. We compared basidioma sequences with EM root tips from our data, GenBank, or UNITE, and with this approach, we reiterate existing speculations that Trechispora stellulata is EM. We found that corticioid fungi have a significant distance-decay pattern, adding to the literature supporting fungi as having geographically structured communities. This study provides a first view of the diversity of this important group across North American pine forests, but much of the biology and taxonomy of these diverse, important, and widespread fungi remains unknown.
Fonseca, Luiz Henrique M; Lohmann, Lúcia G
2018-06-01
Combining high-throughput sequencing data with amplicon sequences allows the reconstruction of robust phylogenies based on comprehensive sampling of characters and taxa. Here, we combine Next Generation Sequencing (NGS) and Sanger sequencing data to infer the phylogeny of the "Adenocalymma-Neojobertia" clade (Bignonieae, Bignoniaceae), a diverse lineage of Neotropical plants, using Maximum Likelihood and Bayesian approaches. We used NGS to obtain complete or nearly-complete plastomes of members of this clade, leading to a final dataset with 54 individuals, representing 44 members of ingroup and 10 outgroups. In addition, we obtained Sanger sequences of two plastid markers (ndhF and rpl32-trnL) for 44 individuals (43 ingroup and 1 outgroup) and the nuclear PepC for 64 individuals (63 ingroup and 1 outgroup). Our final dataset includes 87 individuals of members of the "Adenocalymma-Neojobertia" clade, representing 66 species (ca. 90% of the diversity), plus 11 outgroups. Plastid and nuclear datasets recovered congruent topologies and were combined. The combined analysis recovered a monophyletic "Adenocalymma-Neojobertia" clade and a paraphyletic Adenocalymma that also contained a monophyletic Neojobertia plus Pleonotoma albiflora. Relationships are strongly supported in all analyses, with most lineages within the "Adenocalymma-Neojobertia" clade receiving maximum posterior probabilities. Ancestral character state reconstructions using Bayesian approaches identified six morphological synapomorphies of clades namely, prophyll type, petiole and petiolule articulation, tendril ramification, inflorescence ramification, calyx shape, and fruit wings. Other characters such as habit, calyx cupular trichomes, corolla color, and corolla shape evolved multiple times. These characters are putatively related with the clade diversification and can be further explored in diversification studies. Copyright © 2018 Elsevier Inc. All rights reserved.
Aziz, Nazneen; Zhao, Qin; Bry, Lynn; Driscoll, Denise K; Funke, Birgit; Gibson, Jane S; Grody, Wayne W; Hegde, Madhuri R; Hoeltge, Gerald A; Leonard, Debra G B; Merker, Jason D; Nagarajan, Rakesh; Palicki, Linda A; Robetorye, Ryan S; Schrijver, Iris; Weck, Karen E; Voelkerding, Karl V
2015-04-01
The higher throughput and lower per-base cost of next-generation sequencing (NGS) as compared to Sanger sequencing has led to its rapid adoption in clinical testing. The number of laboratories offering NGS-based tests has also grown considerably in the past few years, despite the fact that specific Clinical Laboratory Improvement Amendments of 1988/College of American Pathologists (CAP) laboratory standards had not yet been developed to regulate this technology. To develop a checklist for clinical testing using NGS technology that sets standards for the analytic wet bench process and for bioinformatics or "dry bench" analyses. As NGS-based clinical tests are new to diagnostic testing and are of much greater complexity than traditional Sanger sequencing-based tests, there is an urgent need to develop new regulatory standards for laboratories offering these tests. To develop the necessary regulatory framework for NGS and to facilitate appropriate adoption of this technology for clinical testing, CAP formed a committee in 2011, the NGS Work Group, to deliberate upon the contents to be included in the checklist. Results . -A total of 18 laboratory accreditation checklist requirements for the analytic wet bench process and bioinformatics analysis processes have been included within CAP's molecular pathology checklist (MOL). This report describes the important issues considered by the CAP committee during the development of the new checklist requirements, which address documentation, validation, quality assurance, confirmatory testing, exception logs, monitoring of upgrades, variant interpretation and reporting, incidental findings, data storage, version traceability, and data transfer confidentiality.
De Novo Paternal FBN1 Mutation Detected in Embryos Before Implantation.
Wang, Shuling; Niu, Ziru; Wang, Hui; Ma, Minyue; Zhang, Wei; Fang Wang, Shu; Wang, Jun; Yan, Hong; Liu, Yifan; Duan, Na; Zhang, Xiandong; Yao, Yuanqing
2017-06-26
BACKGROUND Marfan syndrome (MFS) is an autosomal dominant disease caused by mutations in the Fibrillin (FBN)1 gene and characterized by disorders in the cardiovascular, skeletal, and visual systems. The diversity of mutations and phenotypic heterogeneity of MFS make prenatal molecular diagnoses difficult. In this study, we used pre-implantation genetic diagnosis (PGD) to identify the pathogenic mutation in a male patient with MFS and to determine whether his offspring would be free of the disease. MATERIAL AND METHODS The history and pedigree of the proband were analyzed. Mutation analysis was performed on the couple and immediate family members. The couple chose IVF treatment and 4 blastocysts were biopsied. PGD was carried out by targeted high-throughput sequencing of the FBN1 gene in the embryos, along with single-nucleotide polymorphism haplotyping. Sanger sequencing was used to confirm the causative mutation. RESULTS c.2647T>C (p.Trp883Arg) was identified as the de novo likely pathogenic mutation in the proband. Whole-genome amplification and sequencing of the 3 embryos revealed that they did not carry the mutation, and 1 blastocyst was transferred back to the uterus. The amniocentesis test result analyzed by Sanger sequencing confirmed the PGD. A premature but healthy infant free of heart malformations was born. CONCLUSIONS The de novo mutation c.2647T>C (p.Trp883Arg) in FBN1 was identified in a Chinese patient with MFS. Embryos without the mutation were identified by PGD and resulted in a successful pregnancy.
Wilkinson, Samuel L.; John, Shibu; Walsh, Roddy; Novotny, Tomas; Valaskova, Iveta; Gupta, Manu; Game, Laurence; Barton, Paul J R.; Cook, Stuart A.; Ware, James S.
2013-01-01
Background Molecular genetic testing is recommended for diagnosis of inherited cardiac disease, to guide prognosis and treatment, but access is often limited by cost and availability. Recently introduced high-throughput bench-top DNA sequencing platforms have the potential to overcome these limitations. Methodology/Principal Findings We evaluated two next-generation sequencing (NGS) platforms for molecular diagnostics. The protein-coding regions of six genes associated with inherited arrhythmia syndromes were amplified from 15 human samples using parallelised multiplex PCR (Access Array, Fluidigm), and sequenced on the MiSeq (Illumina) and Ion Torrent PGM (Life Technologies). Overall, 97.9% of the target was sequenced adequately for variant calling on the MiSeq, and 96.8% on the Ion Torrent PGM. Regions missed tended to be of high GC-content, and most were problematic for both platforms. Variant calling was assessed using 107 variants detected using Sanger sequencing: within adequately sequenced regions, variant calling on both platforms was highly accurate (Sensitivity: MiSeq 100%, PGM 99.1%. Positive predictive value: MiSeq 95.9%, PGM 95.5%). At the time of the study the Ion Torrent PGM had a lower capital cost and individual runs were cheaper and faster. The MiSeq had a higher capacity (requiring fewer runs), with reduced hands-on time and simpler laboratory workflows. Both provide significant cost and time savings over conventional methods, even allowing for adjunct Sanger sequencing to validate findings and sequence exons missed by NGS. Conclusions/Significance MiSeq and Ion Torrent PGM both provide accurate variant detection as part of a PCR-based molecular diagnostic workflow, and provide alternative platforms for molecular diagnosis of inherited cardiac conditions. Though there were performance differences at this throughput, platforms differed primarily in terms of cost, scalability, protocol stability and ease of use. Compared with current molecular genetic diagnostic tests for inherited cardiac arrhythmias, these NGS approaches are faster, less expensive, and yet more comprehensive. PMID:23861798
Sapientia: accelerating rare disease diagnosis and treatment.
Furness, Mike
2016-09-01
Congenica (Cambridge, UK) is a world leading developer of genome-based discovery and diagnostic technologies. The UK company is a spin-out from the Wellcome Trust Sanger Institute (Cambridge, UK) and was founded by scientists and clinicians at the leading edge of genomic analysis. Congenica's Sapientia™ technology platform allows whole-genome sequence analysis to be easily interpreted and presented within a clinically actionable diagnostic report. It is based on pioneering research from Wellcome Trust Sanger Institute, National Health Service clinicians and regional genetic testing laboratories and validated by Genomics England Ltd (London, UK). Sapientia used for medical diagnosis in hospitals including Great Ormond Street Hospital (London, UK), Manchester Centre for Genomic Medicine (Manchester, UK), Birmingham Women's Hospital (Birmingham, UK) and for new drug development by pharmaceutical companies. This profile follows the journey from proof of concept to clinical diagnosis.
PAVE: program for assembling and viewing ESTs.
Soderlund, Carol; Johnson, Eric; Bomhoff, Matthew; Descour, Anne
2009-08-26
New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information, hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. The PAVE (Program for Assembling and Viewing ESTs) assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly, however it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.
Gao, M L; Zhong, X M; Ma, X; Ning, H J; Zhu, D; Zou, J Z
2016-06-02
To make genetic diagnosis of Alagille syndrome (ALGS) patients using target gene sequence capture and next generation sequencing technology. Target gene sequence capture and next generation sequencing were used to detect ALGS gene of 4 patients. They were hospitalized at the Affiliated Hospital, Capital Institute of Pediatrics between January 2014 and December 2015, referred to clinical diagnosis of ALGS typical and atypical respectively in 2 cases. Blood samples were collected from patients and their parents and genomic DNA was extracted from lymphocytes. Target gene sequence capture and next generation sequencing was detected. Sanger sequencing was used to confirm the results of the patients and their parents. Cholestasis, heart defects, inverted triangular face and butterfly vertebrae were presented as main clinical features in 4 male patients. The first hospital visiting ages ranged from 3 months and 14 days to 3 years and 1 month. The age of onset ranged from 3 days to 42 days (median 23 days). According to the clinical diagnostic criteria of ALGS, patient 1 and patient 2 were considered as typical ALGS. The other 2 patients were considered as atypical ALGS. Four Jagged 1(JAG1) pathogenic mutations were detected. Three different missense mutations were detected in patient 1 to patient 3 with ALGS(c.839C>T(p.W280X), c. 703G>A(p.R235X), c. 1720C>T(p.V574M)). The JAG1 mutation of patient 3 was first reported. Patient 4 had one novel insertion mutation (c.1779_1780insA(p.Ile594AsnfsTer23)). Parental analysis verified that the JAG1 missense mutation of 3 patients were de novo. The results of sanger sequencing was consistent with the results of the next generation sequencing. Target gene sequence capture combined with next generation sequencing can detect two pathogenic genes in ALGS and test genes of other related diseases in infantile cholestatic diseases simultaneously and presents a high throughput, high efficiency and low cost. It may provide molecular diagnosis and treatment for clinicians with good clinical application prospects.
Kelm-Nelson, Cynthia A; Stevenson, Sharon A; Ciucci, Michelle R
2016-09-01
Datasets provided in this article represent the Rattus norvegicus primer design and verification used in Pink1 -/- and wildtype Long Evans brain tissue. Accessible tables include relevant information, accession numbers, sequences, temperatures and product length, describing primer design specific to the transcript amplification use. Additionally, results of Sanger sequencing of qPCR reaction products (FASTA aligned sequences) are presented for genes of interest. Results and further interpretation and discussion can be found in the original research article "Atp13a2 expression in the periaqueductal gray is decreased in the Pink1 -/- rat model of Parkinson disease" [1].
Next-Generation Sequencing Platforms
NASA Astrophysics Data System (ADS)
Mardis, Elaine R.
2013-06-01
Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.
Ciliates and the rare biosphere: a review.
Dunthorn, Micah; Stoeck, Thorsten; Clamp, John; Warren, Alan; Mahé, Frédéric
2014-01-01
Here we provide a brief review of the rare biosphere from the perspective of ciliates and other microbial eukaryotes. We trace research on rarity from its lack of much in-depth focus in morphological and Sanger sequencing projects, to its central importance in analyses using high throughput sequencing strategies. The problem that the rare biosphere is potentially comprised of mostly errors is then discussed in the light of asking community-comparative, novel-diversity, and ecosystem-functioning questions. © 2014 The Author(s) Journal of Eukaryotic Microbiology © 2014 International Society of Protistologists.
A novel PTCH1 mutation underlies non-syndromic cleft lip and/or palate in a Han Chinese family.
Zhao, Huaxiang; Zhong, Wenjie; Leng, Chuntao; Zhang, Jieni; Zhang, Mengqi; Huang, Wenbin; Zhang, Yunfan; Li, Weiran; Jia, Peizeng; Lin, Jiuxiang; Maimaitili, Gulibaha; Chen, Feng
2018-06-16
Cleft lip and/or palate (CL/P) is the most common craniofacial congenital disease, and it has a complex aetiology. This study aimed to identify the causative gene mutation of a Han Chinese family with CL/P. Whole exome sequencing was conducted on the proband and her mother, who exhibited the same phenotype. A Mendelian dominant inheritance model, allele frequency, mutation regions, functional prediction and literature review were used to screen and filter the variants. The candidate was validated by Sanger sequencing. Conservation analysis and homology modelling were conducted. A heterozygous missense mutation c.1175C>T in the PTCH1 gene predicting p.Ala392Val was identified. This variant has not been reported and was predicted to be deleterious. Sanger sequencing verified the variant and the dominant inheritance model in the family. The missense alteration affects an amino acid that is evolutionarily conserved in the first extracellular loop of the PTCH1 protein. The local structure of the mutant protein was significantly altered according to homology modelling. Our findings suggest that c.1175C>T in PTCH1 (NM_000264) may be the causative mutation of this pedigree. Our results add to the evidence that PTCH1 variants play a role in the pathogenesis of orofacial clefts. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Development of allele-specific multiplex PCR to determine the length of poly-T in intron 8 of CFTR
Prada, Anne E.
2014-01-01
Cystic fibrosis transmembrane conductance regulator (CFTR) gene mutation analysis has been implemented for Cystic Fibrosis (CF) carrier screening, and molecular diagnosis of CF and congenital bilateral absence of the vas deferens (CBAVD). Although poly-T allele analysis in intron 8 of CFTR is required when a patient is positive for R117H, it is not recommended for routine carrier screening. Therefore, commercial kits for CFTR mutation analysis were designed either to mask the poly-T allele results, unless a patient is R117H positive, or to have the poly-T analysis as a standalone reflex test using the same commercial platform. There are other standalone assays developed to detect poly-T alleles, such as heteroduplex analysis, High Resolution Melting (HRM) curve analysis, allele-specific PCR (AS-PCR) and Sanger sequencing. In this report, we developed a simple and easy-to-implement multiplex AS-PCR assay using unlabeled standard length primers, which can be used as a reflex or standalone test for CFTR poly-T track analysis. Out of 115 human gDNA samples tested, results from our new AS-PCR matched to the previous known poly-T results or results from Sanger sequencing. PMID:25071991
Cornforth, Michael N; Anur, Pavana; Wang, Nicholas; Robinson, Erin; Ray, F Andrew; Bedford, Joel S; Loucas, Bradford D; Williams, Eli S; Peto, Myron; Spellman, Paul; Kollipara, Rahul; Kittler, Ralf; Gray, Joe W; Bailey, Susan M
2018-05-11
Chromosome rearrangements are large-scale structural variants that are recognized drivers of oncogenic events in cancers of all types. Cytogenetics allows for their rapid, genome-wide detection, but does not provide gene-level resolution. Massively parallel sequencing (MPS) promises DNA sequence-level characterization of the specific breakpoints involved, but is strongly influenced by bioinformatics filters that affect detection efficiency. We sought to characterize the breakpoint junctions of chromosomal translocations and inversions in the clonal derivatives of human cells exposed to ionizing radiation. Here, we describe the first successful use of DNA paired-end analysis to locate and sequence across the breakpoint junctions of a radiation-induced reciprocal translocation. The analyses employed, with varying degrees of success, several well-known bioinformatics algorithms, a task made difficult by the involvement of repetitive DNA sequences. As for underlying mechanisms, the results of Sanger sequencing suggested that the translocation in question was likely formed via microhomology-mediated non-homologous end joining (mmNHEJ). To our knowledge, this represents the first use of MPS to characterize the breakpoint junctions of a radiation-induced chromosomal translocation in human cells. Curiously, these same approaches were unsuccessful when applied to the analysis of inversions previously identified by directional genomic hybridization (dGH). We conclude that molecular cytogenetics continues to provide critical guidance for structural variant discovery, validation and in "tuning" analysis filters to enable robust breakpoint identification at the base pair level.
ALDH1A3 Mutations Cause Recessive Anophthalmia and Microphthalmia
Fares-Taie, Lucas; Gerber, Sylvie; Chassaing, Nicolas; Clayton-Smith, Jill; Hanein, Sylvain; Silva, Eduardo; Serey, Margaux; Serre, Valérie; Gérard, Xavier; Baumann, Clarisse; Plessis, Ghislaine; Demeer, Bénédicte; Brétillon, Lionel; Bole, Christine; Nitschke, Patrick; Munnich, Arnold; Lyonnet, Stanislas; Calvas, Patrick; Kaplan, Josseline; Ragge, Nicola; Rozet, Jean-Michel
2013-01-01
Anophthalmia and microphthalmia (A/M) are early-eye-development anomalies resulting in absent or small ocular globes, respectively. A/M anomalies occur in syndromic or nonsyndromic forms. They are genetically heterogeneous, some mutations in some genes being responsible for both anophthalmia and microphthalmia. Using a combination of homozygosity mapping, exome sequencing, and Sanger sequencing, we identified homozygosity for one splice-site and two missense mutations in the gene encoding the A3 isoform of the aldehyde dehydrogenase 1 (ALDH1A3) in three consanguineous families segregating A/M with occasional orbital cystic, neurological, and cardiac anomalies. ALDH1A3 is a key enzyme in the formation of a retinoic acid gradient along the dorso-ventral axis during early eye development. Transitory expression of mutant ALDH1A3 open reading frames showed that both missense mutations reduce the accumulation of the enzyme, potentially leading to altered retinoic acid synthesis. Although the role of retinoic acid signaling in eye development is well established, our findings provide genetic evidence of a direct link between retinoic-acid-synthesis dysfunction and early-eye-development anomalies in humans. PMID:23312594
Li, Zhoufang; Liu, Guangjie; Tong, Yin; Zhang, Meng; Xu, Ying; Qin, Li; Wang, Zhanhui; Chen, Xiaoping; He, Jiankui
2015-01-01
Profiling immune repertoires by high throughput sequencing enhances our understanding of immune system complexity and immune-related diseases in humans. Previously, cloning and Sanger sequencing identified limited numbers of T cell receptor (TCR) nucleotide sequences in rhesus monkeys, thus their full immune repertoire is unknown. We applied multiplex PCR and Illumina high throughput sequencing to study the TCRβ of rhesus monkeys. We identified 1.26 million TCRβ sequences corresponding to 643,570 unique TCRβ sequences and 270,557 unique complementarity-determining region 3 (CDR3) gene sequences. Precise measurements of CDR3 length distribution, CDR3 amino acid distribution, length distribution of N nucleotide of junctional region, and TCRV and TCRJ gene usage preferences were performed. A comprehensive profile of rhesus monkey immune repertoire might aid human infectious disease studies using rhesus monkeys. PMID:25961410
Embedding gender equality into institutional strategy.
Ahmed, S
2017-01-01
The SiS (Sex in Science) Programme on the WGC (Wellcome Genome Campus) was established in 2011. Key participants include the Wellcome Trust Sanger Institute, EMB-EBI (EMBL-European Bioinformatics Institute), Open Targets and Elixir. The key objectives are to catalyse cultural change, develop partnerships, communicate activities and champion our women in science work at a national and international level (http://www.sanger.ac.uk/about/sex-science). In this paper, we highlight some of the many initiatives that have taken place since 2013, to address gender inequality at the highest levels; the challenges we have faced and how we have overcome these, and the future direction of travel.
Das Bhowmik, Aneek; Gupta, Neerja; Dalal, Ashwin; Kabra, Madhulika
In the present study we report on genetic analysis in a patient with developmental delay, truncal obesity and vision problem, to find the causative mutation. Whole exome sequencing was performed on genomic DNA extracted from whole blood of the patient which revealed a homozygous nonsense variant (c.2816T>A) in exon 8 of ALMS1 gene that results in a stop codon and premature truncation at codon 939 (p.L939Ter) of the protein. The mutation was confirmed by Sanger sequencing. Exome sequencing was helpful in establishing diagnosis of Alstrom syndrome in this patient. This case highlights the utility of exome sequencing in clinical practice. Copyright © 2016 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.
Xu, Yan; Guan, Liping; Xiao, Xueshan; Zhang, Jianguo; Li, Shiqiang; Jiang, Hui; Jia, Xiaoyun; Yang, Jianhua; Guo, Xiangming; Yin, Ye; Wang, Jun; Zhang, Qingjiong
2015-01-01
Mutations in 60 known genes were previously identified by exome sequencing in 79 of 157 families with retinitis pigmentosa (RP). This study analyzed variants in 129 genes associated with other forms of hereditary retinal dystrophy in the same cohort. Apart from the 73 genes previously analyzed, a further 129 genes responsible for other forms of hereditary retinal dystrophy were selected based on RetNet. Variants in the 129 genes determined by whole exome sequencing were selected and filtered by bioinformatics analysis. Candidate variants were confirmed by Sanger sequencing and validated by analysis of available family members and controls. A total of 90 candidate variants were present in the 129 genes. Sanger sequencing confirmed 83 of the 90 variants. Analysis of family members and controls excluded 76 of these 83 variants. The remaining seven variants were considered to be potential pathogenic mutations; these were c.899A>G, c.1814C>G, and c.2107C>T in BBS2; c.1073C>T and c.1669C>T in INPP5E; and c.3582C>G and c.5704-5C>G in CACNA1F. Six of these seven mutations were novel. The mutations were detected in five unrelated patients without a family history, including three patients with homozygous or compound heterozygous mutations in BBS2 and INPP5E, and two patients with hemizygous mutations in CACNA1F. None of the patients had mutations in the genes associated with autosome dominant retinal dystrophy. Only a small portion of patients with RP, about 3% (5/157), had causative mutations in the 129 genes associated with other forms of hereditary retinal dystrophy.
Cefalù, Angelo B; Spina, Rossella; Noto, Davide; Ingrassia, Valeria; Valenti, Vincenza; Giammanco, Antonina; Fayer, Francesca; Misiano, Gabriella; Cocorullo, Gianfranco; Scrimali, Chiara; Palesano, Ornella; Altieri, Grazia I; Ganci, Antonina; Barbagallo, Carlo M; Averna, Maurizio R
Severe hypertriglyceridemia (HTG) may result from mutations in genes affecting the intravascular lipolysis of triglyceride (TG)-rich lipoproteins. The aim of this study was to develop a targeted next-generation sequencing panel for the molecular diagnosis of disorders characterized by severe HTG. We developed a targeted customized panel for next-generation sequencing Ion Torrent Personal Genome Machine to capture the coding exons and intron/exon boundaries of 18 genes affecting the main pathways of TG synthesis and metabolism. We sequenced 11 samples of patients with severe HTG (TG>885 mg/dL-10 mmol/L): 4 positive controls in whom pathogenic mutations had previously been identified by Sanger sequencing and 7 patients in whom the molecular defect was still unknown. The customized panel was accurate, and it allowed to confirm genetic variants previously identified in all positive controls with primary severe HTG. Only 1 patient of 7 with HTG was found to be carrier of a homozygous pathogenic mutation of the third novel mutation of LMF1 gene (c.1380C>G-p.Y460X). The clinical and molecular familial cascade screening allowed the identification of 2 additional affected siblings and 7 heterozygous carriers of the mutation. We showed that our targeted resequencing approach for genetic diagnosis of severe HTG appears to be accurate, less time consuming, and more economical compared with traditional Sanger resequencing. The identification of pathogenic mutations in candidate genes remains challenging and clinical resequencing should mainly intended for patients with strong clinical criteria for monogenic severe HTG. Copyright © 2017 National Lipid Association. Published by Elsevier Inc. All rights reserved.
Generic Amplicon Deep Sequencing to Determine Ilarvirus Species Diversity in Australian Prunus
Kinoti, Wycliff M.; Constable, Fiona E.; Nancarrow, Narelle; Plummer, Kim M.; Rodoni, Brendan
2017-01-01
The distribution of Ilarvirus species populations amongst 61 Australian Prunus trees was determined by next generation sequencing (NGS) of amplicons generated using a genus-based generic RT-PCR targeting a conserved region of the Ilarvirus RNA2 component that encodes the RNA dependent RNA polymerase (RdRp) gene. Presence of Ilarvirus sequences in each positive sample was further validated by Sanger sequencing of cloned amplicons of regions of each of RNA1, RNA2 and/or RNA3 that were generated by species specific PCRs and by metagenomic NGS. Prunus necrotic ringspot virus (PNRSV) was the most frequently detected Ilarvirus, occurring in 48 of the 61 Ilarvirus-positive trees and Prune dwarf virus (PDV) and Apple mosaic virus (ApMV) were detected in three trees and one tree, respectively. American plum line pattern virus (APLPV) was detected in three trees and represents the first report of APLPV detection in Australia. Two novel and distinct groups of Ilarvirus-like RNA2 amplicon sequences were also identified in several trees by the generic amplicon NGS approach. The high read depth from the amplicon NGS of the generic PCR products allowed the detection of distinct RNA2 RdRp sequence variant populations of PNRSV, PDV, ApMV, APLPV and the two novel Ilarvirus-like sequences. Mixed infections of ilarviruses were also detected in seven Prunus trees. Sanger sequencing of specific RNA1, RNA2, and/or RNA3 genome segments of each virus and total nucleic acid metagenomics NGS confirmed the presence of PNRSV, PDV, ApMV and APLPV detected by RNA2 generic amplicon NGS. However, the two novel groups of Ilarvirus-like RNA2 amplicon sequences detected by the generic amplicon NGS could not be associated to the presence of sequence from RNA1 or RNA3 genome segments or full Ilarvirus genomes, and their origin is unclear. This work highlights the sensitivity of genus-specific amplicon NGS in detection of virus sequences and their distinct populations in multiple samples, and the need for a standardized approach to accurately determine what constitutes an active, viable virus infection after detection by molecular based methods. PMID:28713347
Generic Amplicon Deep Sequencing to Determine Ilarvirus Species Diversity in Australian Prunus.
Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan
2017-01-01
The distribution of Ilarvirus species populations amongst 61 Australian Prunus trees was determined by next generation sequencing (NGS) of amplicons generated using a genus-based generic RT-PCR targeting a conserved region of the Ilarvirus RNA2 component that encodes the RNA dependent RNA polymerase (RdRp) gene. Presence of Ilarvirus sequences in each positive sample was further validated by Sanger sequencing of cloned amplicons of regions of each of RNA1, RNA2 and/or RNA3 that were generated by species specific PCRs and by metagenomic NGS. Prunus necrotic ringspot virus (PNRSV) was the most frequently detected Ilarvirus , occurring in 48 of the 61 Ilarvirus -positive trees and Prune dwarf virus (PDV) and Apple mosaic virus (ApMV) were detected in three trees and one tree, respectively. American plum line pattern virus (APLPV) was detected in three trees and represents the first report of APLPV detection in Australia. Two novel and distinct groups of Ilarvirus -like RNA2 amplicon sequences were also identified in several trees by the generic amplicon NGS approach. The high read depth from the amplicon NGS of the generic PCR products allowed the detection of distinct RNA2 RdRp sequence variant populations of PNRSV, PDV, ApMV, APLPV and the two novel Ilarvirus -like sequences. Mixed infections of ilarviruses were also detected in seven Prunus trees. Sanger sequencing of specific RNA1, RNA2, and/or RNA3 genome segments of each virus and total nucleic acid metagenomics NGS confirmed the presence of PNRSV, PDV, ApMV and APLPV detected by RNA2 generic amplicon NGS. However, the two novel groups of Ilarvirus -like RNA2 amplicon sequences detected by the generic amplicon NGS could not be associated to the presence of sequence from RNA1 or RNA3 genome segments or full Ilarvirus genomes, and their origin is unclear. This work highlights the sensitivity of genus-specific amplicon NGS in detection of virus sequences and their distinct populations in multiple samples, and the need for a standardized approach to accurately determine what constitutes an active, viable virus infection after detection by molecular based methods.
Baurand, Amandine; Falcon-Eicher, Sylvie; Laurent, Gabriel; Villain, Elisabeth; Bonnet, Caroline; Thauvin-Robinet, Christel; Jacquot, Caroline; Eicher, Jean-Christophe; Gourraud, Jean-Baptiste; Schmitt, Sébastien; Bézieau, Stéphane; Giraud, Mathilde; Dumont, Solenne; Kuentz, Paul; Probst, Vincent; Burguet, Antoine; Kyndt, Florence; Faivre, Laurence
2017-02-01
Autosomal dominant genetic diseases can occur de novo and in the form of somatic mosaicism, which can give rise to a less severe phenotype, and make diagnosis more difficult given the sensitivity limits of the methods used. We report the case of female child with a history of surgery for syndactyly of the hands and feet, who was admitted at 6 years of age to a pediatric intensive care unit following cardiac arrest. The electrocardiogram (ECG) showed a long QT interval that on occasions reached 500 ms. Despite the absence of facial dysmorphism and the presence of normal psychomotor development, a diagnosis of Timothy syndrome was made given the association of syndactyly and the ECG features. Sanger sequencing of the CACNA1C gene, followed by sequencing of the genes KCNQ1, KCNH2, KCNE1, KCNE2, were negative. The subsequent analysis of a panel of genes responsible for hereditary cardiac rhythm disorders using Haloplex technology revealed a recurrent mosaic p.Gly406Arg missense mutation of the CACNA1C gene in 18% of the cells. This mosaicism can explain the negative Sanger analysis and the less complete phenotype in this patient. Given the other cases in the literature, mosaic mutations in Timothy syndrome appear more common than previously thought. This case demonstrates the importance of using next-generation sequencing to identify mosaic mutations when the clinical picture supports a specific mutation that is not identified using conventional testing. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Ma, Yalin; Xiao, Yun; Zhang, Fengguo; Han, Yuechen; Li, Jianfeng; Xu, Lei; Bai, Xiaohui; Wang, Haibo
2016-04-01
Mutations in MYO7A gene have been reported to be associated with Usher Syndrome type 1B (USH1B) and nonsyndromic hearing loss (DFNB2, DFNA11). Most mutations in MYO7A gene caused USH1B, whereas only a few reported mutations led to DFNB2 and DFNA11. The current study was designed to investigate the mutations among a Chinese family with autosomal recessive hearing loss. In this study, we present the clinical, genetic and molecular characteristics of a Chinese family. Targeted capture of 127 known deafness genes and next-generation sequencing were employed to study the genetic causes of two siblings in the Chinese family. Sanger sequencing was employed to examine those variant mutations in the members of this family and other ethnicity-matched controls. We identified the novel compound heterozygous mutant alleles of MYO7A gene: a novel missense mutation c.3671C>A (p.A1224D) and a reported insert mutation c.390_391insC (p.P131PfsX9). Variants were further confirmed by Sanger sequencing. These two compound heterozygous variants were co-segregated with autosomal recessive hearing loss phenotype. The gene mutation analysis and protein sequence alignment further supported that the novel compound heterozygous mutations were pathogenic. The novel compound heterozygous mutations (c.3671C>A and c.390_391insC) in MYO7A gene identified in this study were responsible for the autosomal recessive sensorineural hearing loss of this Chinese family. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Birt-Hogg-Dubé syndrome in two Chinese families with mutations in the FLCN gene.
Hou, Xiaocan; Zhou, Yuan; Peng, Yun; Qiu, Rong; Xia, Kun; Tang, Beisha; Zhuang, Wei; Jiang, Hong
2018-01-22
Birt-Hogg-Dubé syndrome is an autosomal dominant hereditary condition caused by mutations in the folliculin-encoding gene FLCN (NM_144997). It is associated with skin lesions such as fibrofolliculoma, acrochordon and trichodiscoma; pulmonary lesions including spontaneous pneumothorax and pulmonary cysts and renal cancer. Genomic DNA was extracted from peripheral venous blood samples of the propositi and their family members. Genetic analysis was performed by whole exome sequencing and Sanger sequencing aiming at corresponding exons in FLCN gene to explore the genetic mutations of these two families. In this study, we performed genetic analysis by whole exome sequencing and Sanger sequencing aiming at corresponding exons in FLCN gene to explore the genetic mutations in two Chinese families. Patients from family 1 mostly suffered from pneumothorax and pulmonary cysts, several of whom also mentioned skin lesions or kidney lesions. While in family 2, only thoracic lesions were found in the patients, without any other clinical manifestations. Two FLCN mutations have been identified: One is an insertion mutation (c.1579_1580insA/p.R527Xfs on exon 14) previously reported in three Asian families (one mainland family and two Taiwanese families); while the other is a firstly reviewed mutation in Asian population (c.649C > T / p.Gln217X on exon 7) that ever been detected in a French family. Overall, The detection of these two mutations expands the spectrum of FLCN mutations and will provide insight into genetic diagnosis and counseling of Birt-Hogg-Dubé syndrome.
Huang, Li; Xiao, Xueshan; Li, Shiqiang; Jia, Xiaoyun; Wang, Panfeng; Sun, Wenmin; Xu, Yan; Xin, Wei; Guo, Xiangming; Zhang, Qingjiong
2016-05-01
Cone-rod dystrophy (CORD) is a common form of inherited retinal degeneration. Previously, we have conducted serial mutational analysis in probands with CORD either by Sanger sequencing or whole exome sequencing (WES). In the current study, variants in all genes from RetNet were selected from the whole exome sequencing data of 108 CORD probands (including 61 probands reported here for the first time) and were analyzed by multistep bioinformatics analysis, followed by Sanger sequencing and segregation validation. Data from the previous studies and new data from this study (163 probands in total) were summarized to provide an overview of the molecular genetics of CORD. The following potentially pathogenic mutations were identified in 93 of the 163 (57.1%) probands: CNGA3 (32.5%), ABCA4 (3.8%), ALMS1 (3.1%), GUCY2D (3.1%), CACNA1F (2.5%), CRX (1.8%), PDE6C (1.8%), CNGB3 (1.8%), GUCA1A (1.2%), UNC119 (0.6%), RPGRIP1 (1.2%), RDH12 (0.6%), KCNV2 (0.6%), C21orf2 (0.6%), CEP290 (0.6%), USH2A (0.6%) and SNRNP200 (0.6%). The 17 genes with mutations included 12 known CORD genes and five genes (ALMS1, RDH12, CEP290, USH2A, and SNRNP200) associated with other forms of retinal degeneration. Mutations in CNGA3 is most common in this cohort. This is a systematic molecular genetic analysis of Chinese patients with CORD. Copyright © 2016 Elsevier Ltd. All rights reserved.
Tatsi, Christina; Gkourogianni, Alexandra; Mohnike, Klaus; DeArment, Diana; Witchel, Selma; Andrade, Anenisia C; Markello, Thomas C; Baron, Jeffrey; Nilsson, Ola; Jee, Youn Hee
2017-08-01
Aggrecan, a proteoglycan, is an important component of cartilage extracellular matrix, including that of the growth plate. Heterozygous mutations in ACAN , the gene encoding aggrecan, cause autosomal dominant short stature, accelerated skeletal maturation, and joint disease. The inheritance pattern and the presence of bone age equal to or greater than chronological age have been consistent features, serving as diagnostic clues. From family 1, a 6-year-old boy presented with short stature [height standard deviation score (SDS), -1.75] and bone age advanced by 3 years. There was no family history of short stature (height SDS: father, -0.76; mother, 0.7). Exome sequencing followed by Sanger sequencing identified a de novo novel heterozygous frameshift mutation in ACAN (c.6404delC: p.A2135Dfs). From family 2, a 12-year-old boy was evaluated for short stature (height SDS, -3.9). His bone age at the time of genetic evaluation was approximately 1 year less than his chronological age. Family history was consistent with an autosomal dominant inheritance of short stature, with several affected members also showing early-onset osteoarthritis. Exome sequencing, confirmed by Sanger sequencing, identified a novel nonsense mutation in ACAN (c.4852C>T: p.Q1618X), which cosegregated with the phenotype. In conclusion, patients with ACAN mutations may present with nonfamilial short stature and with bone age less than chronological age. These findings expand the known phenotypic spectrum of heterozygous ACAN mutations and indicate that this diagnosis should be considered in children without a family history of short stature and in children without accelerated skeletal maturation.
Snelling, Timothy J; Genç, Buğra; McKain, Nest; Watson, Mick; Waters, Sinéad M; Creevey, Christopher J; Wallace, R John
2014-01-01
Ruminal archaeomes of two mature sheep grazing in the Scottish uplands were analysed by different sequencing and analysis methods in order to compare the apparent archaeal communities. All methods revealed that the majority of methanogens belonged to the Methanobacteriales order containing the Methanobrevibacter, Methanosphaera and Methanobacteria genera. Sanger sequenced 1.3 kb 16S rRNA gene amplicons identified the main species of Methanobrevibacter present to be a SGMT Clade member Mbb. millerae (≥ 91% of OTUs); Methanosphaera comprised the remainder of the OTUs. The primers did not amplify ruminal Thermoplasmatales-related 16S rRNA genes. Illumina sequenced V6-V8 16S rRNA gene amplicons identified similar Methanobrevibacter spp. and Methanosphaera clades and also identified the Thermoplasmatales-related order as 13% of total archaea. Unusually, both methods concluded that Mbb. ruminantium and relatives from the same clade (RO) were almost absent. Sequences mapping to rumen 16S rRNA and mcrA gene references were extracted from Illumina metagenome data. Mapping of the metagenome data to 16S rRNA gene references produced taxonomic identification to Order level including 2-3% Thermoplasmatales, but was unable to discriminate to species level. Mapping of the metagenome data to mcrA gene references resolved 69% to unclassified Methanobacteriales. Only 30% of sequences were assigned to species level clades: of the sequences assigned to Methanobrevibacter, most mapped to SGMT (16%) and RO (10%) clades. The Sanger 16S amplicon and Illumina metagenome mcrA analyses showed similar species richness (Chao1 Index 19-35), while Illumina metagenome and amplicon 16S rRNA analysis gave lower richness estimates (10-18). The values of the Shannon Index were low in all methods, indicating low richness and uneven species distribution. Thus, although much information may be extracted from the other methods, Illumina amplicon sequencing of the V6-V8 16S rRNA gene would be the method of choice for studying rumen archaeal communities.
Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling.
Zhang, Guoqiang; Wang, Jianfeng; Yang, Jin; Li, Wenjie; Deng, Yutian; Li, Jing; Huang, Jun; Hu, Songnian; Zhang, Bing
2015-08-05
To promote the clinical application of next-generation sequencing, it is important to obtain accurate and consistent variants of target genomic regions at low cost. Ion Proton, the latest updated semiconductor-based sequencing instrument from Life Technologies, is designed to provide investigators with an inexpensive platform for human whole exome sequencing that achieves a rapid turnaround time. However, few studies have comprehensively compared and evaluated the accuracy of variant calling between Ion Proton and Illumina sequencing platforms such as HiSeq 2000, which is the most popular sequencing platform for the human genome. The Ion Proton sequencer combined with the Ion TargetSeq Exome Enrichment Kit together make up TargetSeq-Proton, whereas SureSelect-Hiseq is based on the Agilent SureSelect Human All Exon v4 Kit and the HiSeq 2000 sequencer. Here, we sequenced exonic DNA from four human blood samples using both TargetSeq-Proton and SureSelect-HiSeq. We then called variants in the exonic regions that overlapped between the two exome capture kits (33.6 Mb). The rates of shared variant loci called by two sequencing platforms were from 68.0 to 75.3% in four samples, whereas the concordance of co-detected variant loci reached 99%. Sanger sequencing validation revealed that the validated rate of concordant single nucleotide polymorphisms (SNPs) (91.5%) was higher than the SNPs specific to TargetSeq-Proton (60.0%) or specific to SureSelect-HiSeq (88.3%). With regard to 1-bp small insertions and deletions (InDels), the Sanger sequencing validated rates of concordant variants (100.0%) and SureSelect-HiSeq-specific (89.6%) were higher than those of TargetSeq-Proton-specific (15.8%). In the sequencing of exonic regions, a combination of using of two sequencing strategies (SureSelect-HiSeq and TargetSeq-Proton) increased the variant calling specificity for concordant variant loci and the sensitivity for variant loci called by any one platform. However, for the sequencing of platform-specific variants, the accuracy of variant calling by HiSeq 2000 was higher than that of Ion Proton, specifically for the InDel detection. Moreover, the variant calling software also influences the detection of SNPs and, specifically, InDels in Ion Proton exome sequencing.
ABACAS: algorithm-based automatic contiguation of assembled sequences
Assefa, Samuel; Keane, Thomas M.; Otto, Thomas D.; Newbold, Chris; Berriman, Matthew
2009-01-01
Summary: Due to the availability of new sequencing technologies, we are now increasingly interested in sequencing closely related strains of existing finished genomes. Recently a number of de novo and mapping-based assemblers have been developed to produce high quality draft genomes from new sequencing technology reads. New tools are necessary to take contigs from a draft assembly through to a fully contiguated genome sequence. ABACAS is intended as a tool to rapidly contiguate (align, order, orientate), visualize and design primers to close gaps on shotgun assembled contigs based on a reference sequence. The input to ABACAS is a set of contigs which will be aligned to the reference genome, ordered and orientated, visualized in the ACT comparative browser, and optimal primer sequences are automatically generated. Availability and Implementation: ABACAS is implemented in Perl and is freely available for download from http://abacas.sourceforge.net Contact: sa4@sanger.ac.uk PMID:19497936
Genome sequencing in microfabricated high-density picolitre reactors.
Margulies, Marcel; Egholm, Michael; Altman, William E; Attiya, Said; Bader, Joel S; Bemben, Lisa A; Berka, Jan; Braverman, Michael S; Chen, Yi-Ju; Chen, Zhoutao; Dewell, Scott B; Du, Lei; Fierro, Joseph M; Gomes, Xavier V; Godwin, Brian C; He, Wen; Helgesen, Scott; Ho, Chun Heen; Ho, Chun He; Irzyk, Gerard P; Jando, Szilveszter C; Alenquer, Maria L I; Jarvie, Thomas P; Jirage, Kshama B; Kim, Jong-Bum; Knight, James R; Lanza, Janna R; Leamon, John H; Lefkowitz, Steven M; Lei, Ming; Li, Jing; Lohman, Kenton L; Lu, Hong; Makhijani, Vinod B; McDade, Keith E; McKenna, Michael P; Myers, Eugene W; Nickerson, Elizabeth; Nobile, John R; Plant, Ramona; Puc, Bernard P; Ronan, Michael T; Roth, George T; Sarkis, Gary J; Simons, Jan Fredrik; Simpson, John W; Srinivasan, Maithreyan; Tartaro, Karrie R; Tomasz, Alexander; Vogt, Kari A; Volkmer, Greg A; Wang, Shally H; Wang, Yong; Weiner, Michael P; Yu, Pengguang; Begley, Richard F; Rothberg, Jonathan M
2005-09-15
The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.
Lim, Eileen C P; Brett, Maggie; Lai, Angeline H M; Lee, Siew-Peng; Tan, Ee-Shien; Jamuar, Saumya S; Ng, Ivy S L; Tan, Ene-Choo
2015-12-14
Next-generation sequencing (NGS) has revolutionized genetic research and offers enormous potential for clinical application. Sequencing the exome has the advantage of casting the net wide for all known coding regions while targeted gene panel sequencing provides enhanced sequencing depths and can be designed to avoid incidental findings in adult-onset conditions. A HaloPlex panel consisting of 180 genes within commonly altered chromosomal regions is available for use on both the Ion Personal Genome Machine (PGM) and MiSeq platforms to screen for causative mutations in these genes. We used this Haloplex ICCG panel for targeted sequencing of 15 patients with clinical presentations indicative of an abnormality in one of the 180 genes. Sequencing runs were done using the Ion 318 Chips on the Ion Torrent PGM. Variants were filtered for known polymorphisms and analysis was done to identify possible disease-causing variants before validation by Sanger sequencing. When possible, segregation of variants with phenotype in family members was performed to ascertain the pathogenicity of the variant. More than 97% of the target bases were covered at >20×. There was an average of 9.6 novel variants per patient. Pathogenic mutations were identified in five genes for six patients, with two novel variants. There were another five likely pathogenic variants, some of which were unreported novel variants. In a cohort of 15 patients, we were able to identify a likely genetic etiology in six patients (40%). Another five patients had candidate variants for which further evaluation and segregation analysis are ongoing. Our results indicate that the HaloPlex ICCG panel is useful as a rapid, high-throughput and cost-effective screening tool for 170 of the 180 genes. There is low coverage for some regions in several genes which might have to be supplemented by Sanger sequencing. However, comparing the cost, ease of analysis, and shorter turnaround time, it is a good alternative to exome sequencing for patients whose features are suggestive of a genetic etiology involving one of the genes in the panel.
Barbano, Raffaela; Pasculli, Barbara; Coco, Michelina; Fontana, Andrea; Copetti, Massimiliano; Rendina, Michelina; Valori, Vanna Maria; Graziano, Paolo; Maiello, Evaristo; Fazio, Vito Michele; Parrella, Paola
2015-01-01
BRAF codon 600 mutation testing of melanoma patients is mandatory for the choice of the most appropriate therapy in the clinical setting. Competitive allele specific TaqMan PCR (Cast-PCR) technology allows not only the selective amplification of minor alleles, but it also blocks the amplification of non-mutant allele. We genotyped codon 600 of the BRAF gene in 54 patients’ samples by Cast-PCR and bidirectional direct sequence analysis. All the mutations detected by sequencing were also identified by Cast-PCR. In addition, Cast-PCR assay detected four samples carrying mutations and was able to clearly identify two mutations of uncertain interpretation by Sanger sequencing. The limit of detection of Cast-PCR was evaluated by constructing dilution curves of BRAFV600E and BRAFV600K mutated clinical samples mixed with a not-mutated specimens. Both mutations could be detected until a 1:100 mutated/not mutated ratio. Cloning and sequencing of the clones was used to confirm mutations on representative discrepant cases. Cast PCR performances were not affected by intratumour heterogeneity, and less affected by melanin content. Our results indicate that Cast-PCR is a reliable diagnostic tool for the identification of melanoma patients as eligible to be treated with TKIs and might be implemented in the clinical setting as elective screening method. PMID:26690267
Chiu, Elliott S; Hoover, Edward A; VandeWoude, Sue
2018-01-10
Feline leukemia virus (FeLV) was the first feline retrovirus discovered, and is associated with multiple fatal disease syndromes in cats, including lymphoma. The original research conducted on FeLV employed classical virological techniques. As methods have evolved to allow FeLV genetic characterization, investigators have continued to unravel the molecular pathology associated with this fascinating agent. In this review, we discuss how FeLV classification, transmission, and disease-inducing potential have been defined sequentially by viral interference assays, Sanger sequencing, PCR, and next-generation sequencing. In particular, we highlight the influences of endogenous FeLV and host genetics that represent FeLV research opportunities on the near horizon.
The complete mitochondrial genome of the stonefly Dinocras cephalotes (Plecoptera, Perlidae).
Elbrecht, Vasco; Poettker, Lisa; John, Uwe; Leese, Florian
2015-06-01
The complete mitochondrial genome of the perlid stonefly Dinocras cephalotes (Curtis, 1827) was sequenced using a combined 454 and Sanger sequencing approach using the known sequence of Pteronarcys princeps Banks, 1907 (Pteronarcyidae), to identify homologous 454 reads. The genome is 15,666 bp in length and includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and a control region. Gene order resembles that of basal arthropods. The base composition of the genome is A (33.5%), T (29.0%), C (24.4%) and G (13.1%). This is the second published mitogenome for the order Plecoptera and will be useful in future phylogenetic analysis.
Krunic, Aleksandar L; Stone, Kristina L; Simpson, Michael A; McGrath, John A
2013-01-01
Acral peeling skin syndrome (APSS) is a clinically and genetically heterogeneous disorder. We used whole-exome sequencing to identify the molecular basis of APSS in a consanguineous Jordanian-American pedigree. We identified a homozygous nonsense mutation (p.Lys22X) in the CSTA gene, encoding cystatin A, that was confirmed using Sanger sequencing. Cystatin A is a protease inhibitor found in the cornified cell envelope, and loss-of-function mutations have previously been reported in two cases of exfoliative ichthyosis. Our study expands the molecular pathology of APSS and demonstrates the value of next-generation sequencing in the genetic characterization of inherited skin diseases. © 2013 Wiley Periodicals, Inc.
Deep Sequencing Insights in Therapeutic shRNA Processing and siRNA Target Cleavage Precision.
Denise, Hubert; Moschos, Sterghios A; Sidders, Benjamin; Burden, Frances; Perkins, Hannah; Carter, Nikki; Stroud, Tim; Kennedy, Michael; Fancy, Sally-Ann; Lapthorn, Cris; Lavender, Helen; Kinloch, Ross; Suhy, David; Corbau, Romu
2014-02-04
TT-034 (PF-05095808) is a recombinant adeno-associated virus serotype 8 (AAV8) agent expressing three short hairpin RNA (shRNA) pro-drugs that target the hepatitis C virus (HCV) RNA genome. The cytosolic enzyme Dicer cleaves each shRNA into multiple, potentially active small interfering RNA (siRNA) drugs. Using next-generation sequencing (NGS) to identify and characterize active shRNAs maturation products, we observed that each TT-034-encoded shRNA could be processed into as many as 95 separate siRNA strands. Few of these appeared active as determined by Sanger 5' RNA Ligase-Mediated Rapid Amplification of cDNA Ends (5-RACE) and through synthetic shRNA and siRNA analogue studies. Moreover, NGS scrutiny applied on 5-RACE products (RACE-seq) suggested that synthetic siRNAs could direct cleavage in not one, but up to five separate positions on targeted RNA, in a sequence-dependent manner. These data support an on-target mechanism of action for TT-034 without cytotoxicity and question the accepted precision of substrate processing by the key RNA interference (RNAi) enzymes Dicer and siRNA-induced silencing complex (siRISC).Molecular Therapy-Nucleic Acids (2014) 3, e145; doi:10.1038/mtna.2013.73; published online 4 February 2014.
The mitogenome of Onchocerca volvulus from the Brazilian Amazonia focus.
Crainey, James L; Silva, Túllio R R da; Encinas, Fernando; Marín, Michel A; Vicente, Ana Carolina P; Luz, Sérgio L B
2016-01-01
We report here the first complete mitochondria genome of Onchocerca volvulus from a focus outside of Africa. An O. volvulus mitogenome from the Brazilian Amazonia focus was obtained using a combination of high-throughput and Sanger sequencing technologies. Comparisons made between this mitochondrial genome and publicly available mitochondrial sequences identified 46 variant nucleotide positions and suggested that our Brazilian mitogenome is more closely related to Cameroon-origin mitochondria than West African-origin mitochondria. As well as providing insights into the origins of Latin American onchocerciasis, the Brazilian Amazonia focus mitogenome may also have value as an epidemiological resource.
High-throughput sequencing in veterinary infection biology and diagnostics.
Belák, S; Karlsson, O E; Leijon, M; Granberg, F
2013-12-01
Sequencing methods have improved rapidly since the first versions of the Sanger techniques, facilitating the development of very powerful tools for detecting and identifying various pathogens, such as viruses, bacteria and other microbes. The ongoing development of high-throughput sequencing (HTS; also known as next-generation sequencing) technologies has resulted in a dramatic reduction in DNA sequencing costs, making the technology more accessible to the average laboratory. In this White Paper of the World Organisation for Animal Health (OIE) Collaborating Centre for the Biotechnology-based Diagnosis of Infectious Diseases in Veterinary Medicine (Uppsala, Sweden), several approaches and examples of HTS are summarised, and their diagnostic applicability is briefly discussed. Selected future aspects of HTS are outlined, including the need for bioinformatic resources, with a focus on improving the diagnosis and control of infectious diseases in veterinary medicine.
NASA Astrophysics Data System (ADS)
Li, Ren; Zhou, Mingxing; Li, Jine; Wang, Zihua; Zhang, Weikai; Yue, Chunyan; Ma, Yan; Peng, Hailin; Wei, Zewen; Hu, Zhiyuan
2018-03-01
EGFR mutations companion diagnostics have been proved to be crucial for the efficacy of tyrosine kinase inhibitor targeted cancer therapies. To uncover multiple mutations occurred in minority of EGFR-mutated cells, which may be covered by the noises from majority of un-mutated cells, is currently becoming an urgent clinical requirement. Here we present the validation of a microfluidic-chip-based method for detecting EGFR multi-mutations at single-cell level. By trapping and immunofluorescently imaging single cells in specifically designed silicon microwells, the EGFR-expressed cells were easily identified. By in situ lysing single cells, the cell lysates of EGFR-expressed cells were retrieved without cross-contamination. Benefited from excluding the noise from cells without EGFR expression, the simple and cost-effective Sanger's sequencing, but not the expensive deep sequencing of the whole cell population, was used to discover multi-mutations. We verified the new method with precisely discovering three most important EGFR drug-related mutations from a sample in which EGFR-mutated cells only account for a small percentage of whole cell population. The microfluidic chip is capable of discovering not only the existence of specific EGFR multi-mutations, but also other valuable single-cell-level information: on which specific cells the mutations occurred, or whether different mutations coexist on the same cells. This microfluidic chip constitutes a promising method to promote simple and cost-effective Sanger's sequencing to be a routine test before performing targeted cancer therapy.[Figure not available: see fulltext.
Borsu, Laetitia; Intrieri, Julie; Thampi, Linta; Yu, Helena; Riely, Gregory; Nafa, Khedoudja; Chandramohan, Raghu; Ladanyi, Marc; Arcila, Maria E
2016-11-01
Although next-generation sequencing (NGS) is a robust technology for comprehensive assessment of EGFR-mutant lung adenocarcinomas with acquired resistance to tyrosine kinase inhibitors, it may not provide sufficiently rapid and sensitive detection of the EGFR T790M mutation, the most clinically relevant resistance biomarker. Here, we describe a digital PCR (dPCR) assay for rapid T790M detection on aliquots of NGS libraries prepared for comprehensive profiling, fully maximizing broad genomic analysis on limited samples. Tumor DNAs from patients with EGFR-mutant lung adenocarcinomas and acquired resistance to epidermal growth factor receptor inhibitors were prepared for Memorial Sloan-Kettering-Integrated Mutation Profiling of Actionable Cancer Targets sequencing, a hybrid capture-based assay interrogating 410 cancer-related genes. Precapture library aliquots were used for rapid EGFR T790M testing by dPCR, and results were compared with NGS and locked nucleic acid-PCR Sanger sequencing (reference high sensitivity method). Seventy resistance samples showed 99% concordance with the reference high sensitivity method in accuracy studies. Input as low as 2.5 ng provided a sensitivity of 1% and improved further with increasing DNA input. dPCR on libraries required less DNA and showed better performance than direct genomic DNA. dPCR on NGS libraries is a robust and rapid approach to EGFR T790M testing, allowing most economical utilization of limited material for comprehensive assessment. The same assay can also be performed directly on any limited DNA source and cell-free DNA. Copyright © 2016 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Berger, C; Berger, B; Parson, W
2012-01-01
In recent years, evidence from domestic dogs has increasingly been analyzed by forensic DNA testing. Especially, canine hairs have proved most suitable and practical due to the high rate of hair transfer occurring between dogs and humans. Starting with the description of a contamination-free sample handling procedure, we give a detailed workflow for sequencing hypervariable segments (HVS) of the mtDNA control region from canine evidence. After the hair material is lysed and the DNA extracted by Phenol/Chloroform, the amplification and sequencing strategy comprises the HVS I and II of the canine control region and is optimized for DNA of medium-to-low quality and quantity. The sequencing procedure is based on the Sanger Big-dye deoxy-terminator method and the separation of the sequencing reaction products is performed on a conventional multicolor fluorescence detection capillary electrophoresis platform. Finally, software-aided base calling and sequence interpretation are addressed exemplarily.
Content Is King: Databases Preserve the Collective Information of Science.
Yates, John R
2018-04-01
Databases store sequence information experimentally gathered to create resources that further science. In the last 20 years databases have become critical components of fields like proteomics where they provide the basis for large-scale and high-throughput proteomic informatics. Amos Bairoch, winner of the Association of Biomolecular Resource Facilities Frederick Sanger Award, has created some of the important databases proteomic research depends upon for accurate interpretation of data.
Thulin, Sara; Olcén, Per; Fredlund, Hans; Unemo, Magnus
2008-01-01
A segment of penA in Neisseria meningitidis strains (n = 127), including two nucleotide sites closely associated to reduced susceptibility to penicillins, was amplified and pyrosequenced. All results were in concordance with Sanger sequencing, and a high correlation between alterations in the two Peni-specific sites and reduced susceptibility to penicillins was identified. PMID:18070955
Association of genetic variants of GRIN2B with autism.
Pan, Yongcheng; Chen, Jingjing; Guo, Hui; Ou, Jianjun; Peng, Yu; Liu, Qiong; Shen, Yidong; Shi, Lijuan; Liu, Yalan; Xiong, Zhimin; Zhu, Tengfei; Luo, Sanchuan; Hu, Zhengmao; Zhao, Jingping; Xia, Kun
2015-02-06
Autism (MIM 209850) is a complex neurodevelopmental disorder characterized by social communication impairments and restricted repetitive behaviors. It has a high heritability, although much remains unclear. To evaluate genetic variants of GRIN2B in autism etiology, we performed a system association study of common and rare variants of GRIN2B and autism in cohorts from a Chinese population, involving a total sample of 1,945 subjects. Meta-analysis of a triad family cohort and a case-control cohort identified significant associations of multiple common variants and autism risk (Pmin = 1.73 × 10(-4)). Significantly, the haplotype involved with the top common variants also showed significant association (P = 1.78 × 10(-6)). Sanger sequencing of 275 probands from a triad cohort identified several variants in coding regions, including four common variants and seven rare variants. Two of the common coding variants were located in the autism-related linkage disequilibrium (LD) block, and both were significantly associated with autism (P < 9 × 10(-3)) using an independent control cohort. Burden analysis and case-only analysis of rare coding variants identified by Sanger sequencing did not find this association. Our study for the first time reveals that common variants and related haplotypes of GRIN2B are associated with autism risk.
Genetic and clinical features of cryopyrin-associated periodic syndromes in Turkish children.
Eroglu, Fehime Kara; Kasapcopur, Ozgür; Beşbaş, Nesrin; Ozaltin, Fatih; Bilginer, Yelda; Barut, Kenan; Mensa-Vilaro, Anna; Nakagawa, Kenji; Heike, Toshio; Nishikomori, Ryuta; Arostegui, Juan; Ozen, Seza
2016-01-01
The aim of this study was to present the genetic and clinical data of the largest cohort of Turkish cryopyrin-associated periodic syndromes (CAPS) patients. This is a two-centre descriptive study of Turkish children with clinical diagnosis of CAPS. NLRP3 analyses were performed by Sanger sequencing and by massively parallel sequencing. ASC dependent NF-κB activation and transfection-induced THP-1 cell death assays determined the functional consequences of the detected variants. Disease activity and response to anti interleukin 1 (anti-IL-1) treatment was also assessed. Heterozygous germline NLRP3 mutation was detected in 8 of 14 enrolled patients (57.1%). Two novel somatic mutations Y560H and G307D were found which induced both THP-1 cell death and ASC dependent NF-kB activation. With anti-IL-1 treatment the disease activity was improved in all patients except one. Except two patients with macrophage activation syndrome (MAS) attack, there were no serious adverse events requiring hospitalisation. CAPS should be considered in all patients with typical symptoms even if Sanger-based genetic analysis is negative, since a considerable number of patients have mosaicism. Treatment should be patient-tailored and MAS should be considered as a rare complication.
Nouchi, A; Nguyen, T; Valantin, M A; Simon, A; Sayon, S; Agher, R; Calvez, V; Katlama, C; Marcelin, A G; Soulie, C
2018-05-29
To investigate the dynamics of HIV-1 variants archived in cells harbouring drug resistance-associated mutations (DRAMs) to lamivudine/emtricitabine, etravirine and rilpivirine in patients under effective ART free from selective pressure on these DRAMs, in order to assess the possibility of recycling molecules with resistance history. We studied 25 patients with at least one DRAM to lamivudine/emtricitabine, etravirine and/or rilpivirine identified on an RNA sequence in their history and with virological control for at least 5 years under a regimen excluding all drugs from the resistant class. Longitudinal ultra-deep sequencing (UDS) and Sanger sequencing of the reverse transcriptase region were performed on cell-associated HIV-1 DNA samples taken over the 5 years of follow-up. Viral variants harbouring the analysed DRAMs were no longer detected by UDS over the 5 years in 72% of patients, with viruses susceptible to the molecules of interest found after 5 years in 80% of patients with UDS and in 88% of patients with Sanger. Residual viraemia with <50 copies/mL was detected in 52% of patients. The median HIV DNA level remained stable (2.4 at baseline versus 2.1 log10 copies/106 cells 5 years later). These results show a clear trend towards clearance of archived DRAMs to reverse transcriptase inhibitors in cell-associated HIV-1 DNA after a long period of virological control, free from therapeutic selective pressure on these DRAMs, reflecting probable residual replication in some reservoirs of the fittest viruses and leading to persistent evolution of the archived HIV-1 DNA resistance profile.
Mensa-Vilaro, Anna; Teresa Bosque, María; Magri, Giuliana; Honda, Yoshitaka; Martínez-Banaclocha, Helios; Casorran-Berges, Marta; Sintes, Jordi; González-Roca, Eva; Ruiz-Ortiz, Estibaliz; Heike, Toshio; Martínez-Garcia, Juan J; Baroja-Mazo, Alberto; Cerutti, Andrea; Nishikomori, Ryuta; Yagüe, Jordi; Pelegrín, Pablo; Delgado-Beltran, Concha; Aróstegui, Juan I
2016-12-01
Gain-of-function NLRP3 mutations cause cryopyrin-associated periodic syndrome (CAPS), with gene mosaicism playing a relevant role in the pathogenesis. This study was undertaken to characterize the genetic cause underlying late-onset but otherwise typical CAPS. We studied a 64-year-old patient who presented with recurrent episodes of urticaria-like rash, fever, conjunctivitis, and oligoarthritis at age 56 years. DNA was extracted from both unfractionated blood and isolated leukocyte and CD34+ subpopulations. Genetic studies were performed using both the Sanger method of DNA sequencing and next-generation sequencing (NGS) methods. In vitro and ex vivo analyses were performed to determine the consequences that the presence of the variant have in the normal structure or function of the protein of the detected variant. NGS analyses revealed the novel p.Gln636Glu NLRP3 variant in unfractionated blood, with an allele frequency (18.4%) compatible with gene mosaicism. Sanger sequence chromatograms revealed a small peak corresponding to the variant allele. Amplicon-based deep sequencing revealed somatic NLRP3 mosaicism restricted to myeloid cells (31.8% in monocytes, 24.6% in neutrophils, and 11.2% in circulating CD34+ common myeloid progenitor cells) and its complete absence in lymphoid cells. Functional analyses confirmed the gain-of-function behavior of the gene variant and hyperactivity of the NLRP3 inflammasome in the patient. Treatment with anakinra resulted in good control of the disease. We identified the novel gain-of-function p.Gln636Glu NLRP3 mutation, which was detected as a somatic mutation restricted to myeloid cells, as the cause of late-onset but otherwise typical CAPS. Our results expand the diversity of CAPS toward milder phenotypes than previously reported, including those starting during adulthood. © 2016, American College of Rheumatology.
Prenatal diagnosis for a Chinese family with a de novo DMD gene mutation
Li, Tao; Zhang, Zhao-jing; Ma, Xin; Lv, Xue; Xiao, Hai; Guo, Qian-nan; Liu, Hong-yan; Wang, Hong-dan; Wu, Dong; Lou, Gui-yu; Wang, Xin; Zhang, Chao-yang; Liao, Shi-xiu
2017-01-01
Abstract Background: Patients with Duchenne muscular dystrophy (DMD) usually have severe and fatal symptoms. At present, there is no effective treatment for DMD, thus it is very important to avoid the birth of children with DMD by effective prenatal diagnosis. We identified a de novo DMD gene mutation in a Chinese family, and make a prenatal diagnosis. Methods: First, multiplex ligation-dependent probe amplification (MLPA) was applied to analyze DMD gene exon deletion/duplication in all family members. The coding sequences of 79 exons in DMD gene were analyzed by Sanger sequencing in the patient; and then according to DMD gene exon mutation in the patient, DMD gene sequencing was performed in the family members. On the basis of results above, the pathogenic mutation in DMD gene was identified. Results: MLPA showed no DMD gene exon deletion/duplication in all family members. Sanger sequencing revealed c.2767_2767delT [p.Ser923LeufsX26] mutation in DMD gene of the patient. Heterozygous deletion mutation (T/-) at this locus was observed in the pregnant woman and her mother and younger sister. The analyses of amniotic fluid samples indicated negative Y chromosome sex-determining gene, no DMD gene exon deletion/duplication, no mutations at c.2767 locus, and the inherited maternal X chromosome different from that of the patient. Conclusion: The pathogenic mutation in DMD gene, c.2767_2767delT [p.Ser923LeufsX26], identified in this family is a de novo mutation. On the basis of specific conditions, it is necessary to select suitable methods to make prenatal diagnosis more effective, accurate, and economic. PMID:29390271
Rapid RHD Zygosity Determination Using Digital PCR.
Sillence, Kelly A; Halawani, Amr J; Tounsi, Wajnat A; Clarke, Kirsty A; Kiernan, Michele; Madgett, Tracey E; Avent, Neil D
2017-08-01
Paternal zygosity testing is used for determining homo- or hemizygosity of RHD in pregnancies that are at a risk of hemolytic disease of the fetus and newborn. At present, this is achieved by using real-time PCR or the Rhesus box PCR, which can be difficult to interpret and unreliable, particularly for black African populations. DNA samples extracted from 53 blood donors were analyzed using 2 multiplex reactions for RHD -specific targets against a reference ( AGO1 ) 2 to determine gene dosage by digital PCR. Results were compared with serological data, and the correct genotype for 2 discordant results was determined by long-range PCR (LR-PCR), next-generation sequencing, and conventional Sanger sequencing. The results showed clear and reliable determination of RHD zygosity using digital PCR and revealed that 4 samples did not match the serologically predicted genotype. Sanger sequencing and long-range PCR followed by next-generation sequencing revealed that the correct genotypes for samples 729M and 351D, which were serologically typed as R 1 R 2 (DCe/DcE), were R 2 r' (DcE/dCe) for 729M and R 1 r″ (DCe/dcE), R 0 r y (Dce/dCE), or R Z r (DCE/dce) for 351D, in concordance with the digital PCR data. Digital PCR provides a highly accurate method to rapidly define blood group zygosity and has clinical application in the analysis of Rh phenotyped or genotyped samples. The vast majority of current blood group genotyping platforms are not designed to define zygosity, and thus, this technique may be used to define paternal RH zygosity in pregnancies that are at a risk of hemolytic disease of the fetus and newborn and can distinguish between homo- and hemizygous RHD -positive individuals. © 2017 American Association for Clinical Chemistry.
Karolak, Justyna A; Gambin, Tomasz; Pitarque, Jose A; Molinari, Andrea; Jhangiani, Shalini; Stankiewicz, Pawel; Lupski, James R; Gajecka, Marzena
2017-01-01
Keratoconus (KTCN) is a protrusion and thinning of the cornea, resulting in impairment of visual function. The extreme genetic heterogeneity makes it difficult to discover factors unambiguously influencing the KTCN phenotype. In this study, we used whole-exome sequencing (WES) and Sanger sequencing to reduce the number of candidate genes at the 5q31.1–q35.3 locus and to prioritize other potentially relevant variants in an Ecuadorian family with KTCN. We applied WES in two affected KTCN individuals from the Ecuadorian family that showed a suggestive linkage between the KTCN phenotype and the 5q31.1–q35.3 locus. Putative variants identified by WES were further evaluated in this family using Sanger sequencing. Exome capture discovered a total of 173 rare (minor allele frequency <0.001 in control population) nonsynonymous variants in both affected individuals. Among them, 16 SNVs were selected for further evaluation. Segregation analysis revealed that variants c.475T>G in SKP1, c.671G>A in PROB1, and c.527G>A in IL17B in the 5q31.1–q35.3 linkage region, and c.850G>A in HKDC1 in the 10q22 locus completely segregated with the phenotype in the studied KTCN family. We demonstrate that a combination of various techniques significantly narrowed the studied genomic region and reduced the list of the putative exonic variants. Moreover, since this locus overlapped two other chromosomal regions previously recognized in distinct KTCN studies, our findings suggest that this 5q31.1–q35.3 locus might be linked with KTCN. PMID:27703147
Inzaule, Seth C; Hamers, Ralph L; Paredes, Roger; Yang, Chunfu; Schuurman, Rob; Rinke de Wit, Tobias F
2017-01-01
Global scale-up of antiretroviral treatment has dramatically changed the prospects of HIV/AIDS disease, rendering life-long chronic care and treatment a reality for millions of HIV-infected patients. Affordable technologies to monitor antiretroviral treatment are needed to ensure long-term durability of limited available drug regimens. HIV drug resistance tests can complement existing strategies in optimizing clinical decision-making for patients with treatment failure, in addition to facilitating population-based surveillance of HIV drug resistance. This review assesses the current landscape of HIV drug resistance technologies and discusses the strengths and limitations of existing assays available for expanding testing in resource-limited settings. These include sequencing-based assays (Sanger sequencing assays and nextgeneration sequencing), point mutation assays, and genotype-free data-based prediction systems. Sanger assays are currently considered the gold standard genotyping technology, though only available at a limited number of resource-limited setting reference and regional laboratories, but high capital and test costs have limited their wide expansion. Point mutation assays present opportunities for simplified laboratory assays, but HIV genetic variability, extensive codon redundancy at or near the mutation target sites with limited multiplexing capability have restricted their utility. Next-generation sequencing, despite high costs, may have potential to reduce the testing cost significantly through multiplexing in high-throughput facilities, although the level of bioinformatics expertise required for data analysis is currently still complex and expensive and lacks standardization. Web-based genotype-free prediction systems may provide enhanced antiretroviral treatment decision-making without the need for laboratory testing, but require further clinical field evaluation and implementation scientific research in resource-limited settings.
Implementing a genomic data management system using iRODS in the Wellcome Trust Sanger Institute
2011-01-01
Background Increasingly large amounts of DNA sequencing data are being generated within the Wellcome Trust Sanger Institute (WTSI). The traditional file system struggles to handle these increasing amounts of sequence data. A good data management system therefore needs to be implemented and integrated into the current WTSI infrastructure. Such a system enables good management of the IT infrastructure of the sequencing pipeline and allows biologists to track their data. Results We have chosen a data grid system, iRODS (Rule-Oriented Data management systems), to act as the data management system for the WTSI. iRODS provides a rule-based system management approach which makes data replication much easier and provides extra data protection. Unlike the metadata provided by traditional file systems, the metadata system of iRODS is comprehensive and allows users to customize their own application level metadata. Users and IT experts in the WTSI can then query the metadata to find and track data. The aim of this paper is to describe how we designed and used (from both system and user viewpoints) iRODS as a data management system. Details are given about the problems faced and the solutions found when iRODS was implemented. A simple use case describing how users within the WTSI use iRODS is also introduced. Conclusions iRODS has been implemented and works as the production system for the sequencing pipeline of the WTSI. Both biologists and IT experts can now track and manage data, which could not previously be achieved. This novel approach allows biologists to define their own metadata and query the genomic data using those metadata. PMID:21906284
Mutations in SURF1 are important genetic causes of Leigh syndrome in Slovak patients.
Danis, Daniel; Brennerova, Katarina; Skopkova, Martina; Kurdiova, Timea; Ukropec, Jozef; Stanik, Juraj; Kolnikova, Miriam; Gasperikova, Daniela
2018-04-01
Leigh syndrome is a progressive early onset neurodegenerative disease typically presenting with psychomotor regression, signs of brainstem and/or basal ganglia disease, lactic acidosis, and characteristic magnetic resonance imaging findings. At molecular level, deficiency of respiratory complexes and/or pyruvate dehydrogenase complex is usually observed. Nuclear gene SURF1 encodes an assembly factor for cytochrome c-oxidase complex of the respiratory chain and autosomal recessive mutations in SURF1 are one of the most frequent causes of cytochrome c-oxidase-related Leigh syndrome cases. Here, we aimed to elucidate the genetic basis of Leigh syndrome in three Slovak families. Three probands presenting with Leigh syndrome were selected for DNA analysis. The first proband, presenting with atypical LS onset without abnormal basal ganglia magnetic resonance imaging findings, was analyzed with whole exome sequencing. In the two remaining probands, SURF1 was screened by Sanger sequencing. Four different heterozygous mutations were identified in SURF1: c.312_321delinsAT:p.(Pro104Profs*1), c.588+1G>A, c.823_833+7del:p. (?) and c.845_846del:p.(Ser282Cysfs*9). All the mutations are predicted to have a loss-of-function effect. We identified disease-causing mutations in all three probands, which points to the important role of SURF1 gene in etiology of Leigh syndrome in Slovakia. Our data showed that patients with atypical Leigh syndrome phenotype without lesions in basal ganglia may benefit from the whole exome sequencing method. In the case of probands presenting the typical phenotype, Sanger sequencing of the SURF1 gene seems to be an effective method of DNA analysis.
Ruane, Sara; Raxworthy, Christopher J; Lemmon, Alan R; Lemmon, Emily Moriarty; Burbrink, Frank T
2015-10-12
Using molecular data generated by high throughput next generation sequencing (NGS) platforms to infer phylogeny is becoming common as costs go down and the ability to capture loci from across the genome goes up. While there is a general consensus that greater numbers of independent loci should result in more robust phylogenetic estimates, few studies have compared phylogenies resulting from smaller datasets for commonly used genetic markers with the large datasets captured using NGS. Here, we determine how a 5-locus Sanger dataset compares with a 377-locus anchored genomics dataset for understanding the evolutionary history of the pseudoxyrhophiine snake radiation centered in Madagascar. The Pseudoxyrhophiinae comprise ~86 % of Madagascar's serpent diversity, yet they are poorly known with respect to ecology, behavior, and systematics. Using the 377-locus NGS dataset and the summary statistics species-tree methods STAR and MP-EST, we estimated a well-supported species tree that provides new insights concerning intergeneric relationships for the pseudoxyrhophiines. We also compared how these and other methods performed with respect to estimating tree topology using datasets with varying numbers of loci. Using Sanger sequencing and an anchored phylogenomics approach, we sequenced datasets comprised of 5 and 377 loci, respectively, for 23 pseudoxyrhophiine taxa. For each dataset, we estimated phylogenies using both gene-tree (concatenation) and species-tree (STAR, MP-EST) approaches. We determined the similarity of resulting tree topologies from the different datasets using Robinson-Foulds distances. In addition, we examined how subsets of these data performed compared to the complete Sanger and anchored datasets for phylogenetic accuracy using the same tree inference methodologies, as well as the program *BEAST to determine if a full coalescent model for species tree estimation could generate robust results with fewer loci compared to the summary statistics species tree approaches. We also examined the individual gene trees in comparison to the 377-locus species tree using the program MetaTree. Using the full anchored dataset under a variety of methods gave us the same, well-supported phylogeny for pseudoxyrhophiines. The African pseudoxyrhophiine Duberria is the sister taxon to the Malagasy pseudoxyrhophiines genera, providing evidence for a monophyletic radiation in Madagascar. In addition, within Madagascar, the two major clades inferred correspond largely to the aglyphous and opisthoglyphous genera, suggesting that feeding specializations associated with tooth venom delivery may have played a major role in the early diversification of this radiation. The comparison of tree topologies from the concatenated and species-tree methods using different datasets indicated the 5-locus dataset cannot beused to infer a correct phylogeny for the pseudoxyrhophiines under any method tested here and that summary statistics methods require 50 or more loci to consistently recover the species-tree inferred using the complete anchored dataset. However, as few as 15 loci may infer the correct topology when using the full coalescent species tree method *BEAST. MetaTree analyses of each gene tree from the Sanger and anchored datasets found that none of the individual gene trees matched the 377-locus species tree, and that no gene trees were identical with respect to topology. Our results suggest that ≥50 loci may be necessary to confidently infer phylogenies when using summaryspecies-tree methods, but that the coalescent-based method *BEAST consistently recovers the same topology using only 15 loci. These results reinforce that datasets with small numbers of markers may result in misleading topologies, and further, that the method of inference used to generate a phylogeny also has a major influence on the number of loci necessary to infer robust species trees.
Jasim, Anfal A.; Al-Bustan, Suzanne A.; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda
2018-01-01
Common variants of Apolipoprotein A5 (APOA5) have been associated with lipid levels yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. Sanger method was used in the direct sequencing of the full 3.7 Kb APOA5 and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3′ UTR 3222 C>T polymorphism at genomic position 116660500 based on human genome assembly GRCh37/hg:19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism. PMID:29686695
Jasim, Anfal A; Al-Bustan, Suzanne A; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda
2018-01-01
Common variants of Apolipoprotein A5 ( APOA 5) have been associated with lipid levels yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. Sanger method was used in the direct sequencing of the full 3.7 Kb APOA5 and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3' UTR 3222 C>T polymorphism at genomic position 116660500 based on human genome assembly GRCh37/hg:19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism.
Xin, Min; Zhang, Peipei; Liu, Wenwen; Ren, Yingdang; Cao, Mengji; Wang, Xifeng
2017-10-01
The complete nucleotide sequence of a novel positive single-stranded (+ss) RNA virus, tentatively named watermelon virus A (WVA), was determined using a combination of three methods: RNA sequencing, small RNA sequencing, and Sanger sequencing. The full genome of WVA is comprised of 8,372 nucleotides (nt), excluding the poly (A) tail, and contains four open reading frames (ORFs). The largest ORF, ORF1 encodes a putative replication-associated polyprotein (RP) with three conserved domains. ORF2 and ORF4 encode a movement protein (MP) and coat protein (CP), respectively. The putative product encoded by ORF3, of an estimated molecular mass of 25 kDa, has no significant similarity with other proteins. Identity and phylogenetic analysis indicate that WVA is a new virus, closely related to members of the family Betaflexiviridae. However, the final taxonomic allocation of WVA within the family is yet to be determined.
ALDH1A3 mutations cause recessive anophthalmia and microphthalmia.
Fares-Taie, Lucas; Gerber, Sylvie; Chassaing, Nicolas; Clayton-Smith, Jill; Hanein, Sylvain; Silva, Eduardo; Serey, Margaux; Serre, Valérie; Gérard, Xavier; Baumann, Clarisse; Plessis, Ghislaine; Demeer, Bénédicte; Brétillon, Lionel; Bole, Christine; Nitschke, Patrick; Munnich, Arnold; Lyonnet, Stanislas; Calvas, Patrick; Kaplan, Josseline; Ragge, Nicola; Rozet, Jean-Michel
2013-02-07
Anophthalmia and microphthalmia (A/M) are early-eye-development anomalies resulting in absent or small ocular globes, respectively. A/M anomalies occur in syndromic or nonsyndromic forms. They are genetically heterogeneous, some mutations in some genes being responsible for both anophthalmia and microphthalmia. Using a combination of homozygosity mapping, exome sequencing, and Sanger sequencing, we identified homozygosity for one splice-site and two missense mutations in the gene encoding the A3 isoform of the aldehyde dehydrogenase 1 (ALDH1A3) in three consanguineous families segregating A/M with occasional orbital cystic, neurological, and cardiac anomalies. ALDH1A3 is a key enzyme in the formation of a retinoic acid gradient along the dorso-ventral axis during early eye development. Transitory expression of mutant ALDH1A3 open reading frames showed that both missense mutations reduce the accumulation of the enzyme, potentially leading to altered retinoic acid synthesis. Although the role of retinoic acid signaling in eye development is well established, our findings provide genetic evidence of a direct link between retinoic-acid-synthesis dysfunction and early-eye-development anomalies in humans. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Bastaki, Fatma; Mohamed, Madiha; Nair, Pratibha; Saif, Fatima; Mustafa, Ethar M; Bizzari, Sami; Al-Ali, Mahmoud T; Hamzeh, Abdul Rezzak
2017-05-01
Clinical and molecular heterogeneity is a prominent characteristic of congenital ichthyoses, with the involvement of numerous causative loci. Mutations in these loci feature in autosomal recessive congenital ichthyoses (ARCIs) quite variably, with certain genes/mutations being more frequently uncovered in particular populations. In this study, we used whole exome sequencing as well as direct Sanger sequencing to uncover four novel mutations in ARCI-related genes, which were found in families from the United Arab Emirates. In silico tools such as CADD and SIFT Indel were used to predict the functional consequences of these mutations. The here-presented mutations occurred in three genes (ALOX12B, TGM1, ABCA12), and these are a mixture of missense and indel variants with damaging functional consequences on their encoded proteins. This study presents an overview of the mutations that were found in ARCI-related genes in Arabs and discusses molecular and clinical details pertaining to the above-mentioned Emirati cases and their novel mutations with special emphasis on the resulting protein changes. © 2017 The International Society of Dermatology.
Zhao, Xintao; Wu, Yajie; Chen, Yi; Feng, Xinxing; Song, Ying; Wang, Yilu; Zou, Yubao; Wang, Jizheng; Shao, Yibing; Hui, Rutai; Song, Lei; Wang, Xu
2014-07-01
To identify the casual mutation of a Chinese pedigree with hypertrophic cardiomyopathy (HCM), and to analyze the genotype-phenotype relationship. The coding exons of 26 reported disease genes were sequenced by targeted resequencing in the proband and the identified mutation were detected with bi-directional Sanger sequencing in all family members and 307 healthy controls. The genotype-phenotype correlation was analyzed in the family. A missense mutation (c.2191C > T, p. Pro731Ser) in the 20th exon of MYH7 gene was identified. This mutation was absent in 307 healthy controls and predicted to be pathogenic by PolyPhen-HCM. Totally 13 family members carried this mutation, including 10 patients with HCM and 3 asymptomatic mutation carriers. The proband manifested severe congestive heart failure and 8 patients expressed various clinical manifestations of heart failure, including dyspnea, palpitations, chest pain, amaurosis or syncope. Five patients were diagnosed as HCM at the age of 16 or younger. One family member suffered sudden cardiac death. The Pro731Ser of MYH7 gene mutation is a causal and malignant mutation linked with familiar HCM.
Huang, Tianhong; Yang, Guilin; Dang, Xiao; Ao, Feijian; Li, Jiankang; He, Yizhou; Tang, Qiyuan; He, Qing
2017-11-01
Alagille syndrome (AGS) is a highly variable, autosomal dominant disease that affects multiple structures including the liver, heart, eyes, bones and face. Targeted region capture sequencing focuses on a panel of known pathogenic genes and provides a rapid, cost‑effective and accurate method for molecular diagnosis. In a Chinese family, this method was used on the proband and Sanger sequencing was applied to validate the candidate mutation. A de novo heterozygous mutation (c.3254_3255insT p.Leu1085PhefsX24) of the jagged 1 gene was identified as the potential disease‑causing gene mutation. In conclusion, the present study suggested that target region capture sequencing is an efficient, reliable and accurate approach for the clinical diagnosis of AGS. Furthermore, these results expand on the understanding of the pathogenesis of AGS.
An efficient approach to BAC based assembly of complex genomes.
Visendi, Paul; Berkman, Paul J; Hayashi, Satomi; Golicz, Agnieszka A; Bayer, Philipp E; Ruperao, Pradeep; Hurgobin, Bhavna; Montenegro, Juan; Chan, Chon-Kit Kenneth; Staňková, Helena; Batley, Jacqueline; Šimková, Hana; Doležel, Jaroslav; Edwards, David
2016-01-01
There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, to variable success. We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.
Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D
2017-01-01
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
Kotásková, Iva; Mališová, Barbora; Obručová, Hana; Holá, Veronika; Peroutková, Tereza; Růžička, Filip; Freiberger, Tomáš
2017-01-01
Complex samples are a challenge for sequencing-based broad-range diagnostics. We analysed 19 urinary catheter, ureteral Double-J catheter, and urine samples using 3 methodological approaches. Out of the total 84 operational taxonomic units, 37, 61, and 88% were identified by culture, PCR-DGGE-SS (PCR denaturing gradient gel electrophoresis followed by Sanger sequencing), and PCR-DGGE-RM (PCR- DGGE combined with software chromatogram separation by RipSeq Mixed tool), respectively. The latter approach was shown to be an efficient tool to complement culture in complex sample assessment. © 2017 S. Karger AG, Basel.
Identification of a novel vitivirus from grapevines in New Zealand.
Blouin, Arnaud G; Keenan, Sandi; Napier, Kathryn R; Barrero, Roberto A; MacDiarmid, Robin M
2018-01-01
We report a sequence of a novel vitivirus from Vitis vinifera obtained using two high-throughput sequencing (HTS) strategies on RNA. The initial discovery from small-RNA sequencing was confirmed by HTS of the total RNA and Sanger sequencing. The new virus has a genome structure similar to the one reported for other vitiviruses, with five open reading frames (ORFs) coding for the conserved domains described for members of that genus. Phylogenetic analysis of the complete genome sequence confirmed its affiliation to the genus Vitivirus, with the closest described viruses being grapevine virus E (GVE) and Agave tequilana leaf virus (ATLV). However, the virus we report is distinct and shares only 51% amino acid sequence identity with GVE in the replicase polyprotein and 66.8% amino acid sequence identity with ATLV in the coat protein. This is well below the threshold determined by the ICTV for species demarcation, and we propose that this virus represents a new species. It is provisionally named "grapevine virus G".
Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing.
Hargreaves, Adam D; Mulley, John F
2015-01-01
Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0-2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5' and 3' UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species.
Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing
Hargreaves, Adam D.
2015-01-01
Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0–2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5′ and 3′ UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species. PMID:26623194
Wang, Na; Wang, Chuan; Chen, Xuechao; Sheng, Donglai; Fu, Xi’an; See, Kelvin; Foo, Jia Nee; Low, Huiqi; Liany, Herty; Irwan, Ishak Darryl; Liu, Jian; Yang, Baoqi; Chen, Mingfei; Yu, Yongxiang; Yu, Gongqi; Niu, Guiye; You, Jiabao; Zhou, Yan; Ma, Shanshan; Wang, Ting; Yan, Xiaoxiao; Goh, Boon Kee; Common, John E. A.; Lane, Birgitte E.; Sun, Yonghu; Zhou, Guizhi; Lu, Xianmei; Wang, Zhenhua; Tian, Hongqing; Cao, Yuanhua; Chen, Shumin; Liu, Qiji; Liu, Jianjun; Zhang, Furen
2014-01-01
Background As a genetic disorder of abnormal pigmentation, the molecular basis of dyschromatosis universalis hereditaria (DUH) had remained unclear until recently when ABCB6 was reported as a causative gene of DUH. Methodology We performed genome-wide linkage scan using Illumina Human 660W-Quad BeadChip and exome sequencing analyses using Agilent SureSelect Human All Exon Kits in a multiplex Chinese DUH family to identify the pathogenic mutations and verified the candidate mutations using Sanger sequencing. Quantitative RT-PCR and Immunohistochemistry was performed to verify the expression of the pathogenic gene, Zebrafish was also used to confirm the functional role of ABCB6 in melanocytes and pigmentation. Results Genome-wide linkage (assuming autosomal dominant inheritance mode) and exome sequencing analyses identified ABCB6 as the disease candidate gene by discovering a coding mutation (c.1358C>T; p.Ala453Val) that co-segregates with the disease phenotype. Further mutation analysis of ABCB6 in four other DUH families and two sporadic cases by Sanger sequencing confirmed the mutation (c.1358C>T; p.Ala453Val) and discovered a second, co-segregating coding mutation (c.964A>C; p.Ser322Lys) in one of the four families. Both mutations were heterozygous in DUH patients and not present in the 1000 Genome Project and dbSNP database as well as 1,516 unrelated Chinese healthy controls. Expression analysis in human skin and mutagenesis interrogation in zebrafish confirmed the functional role of ABCB6 in melanocytes and pigmentation. Given the involvement of ABCB6 mutations in coloboma, we performed ophthalmological examination of the DUH carriers of ABCB6 mutations and found ocular abnormalities in them. Conclusion Our study has advanced our understanding of DUH pathogenesis and revealed the shared pathological mechanism between pigmentary DUH and ocular coloboma. PMID:24498303
Hennebique, Aurélie; Bidart, Marie; Jarraud, Sophie; Beraud, Laëtitia; Schwebel, Carole; Maurin, Max; Boisset, Sandrine
2017-09-01
The emergence of fluoroquinolone (FQ)-resistant mutants of Legionella pneumophila in infected humans was previously reported using a next-generation DNA sequencing (NGS) approach. This finding could explain part of the therapeutic failures observed in legionellosis patients treated with these antibiotics. The aim of this study was to develop digital PCR (dPCR) assays allowing rapid and accurate detection and quantification of these resistant mutants in respiratory samples, especially when the proportion of mutants in a wild-type background is low. We designed three dPCRgyrA assays to detect and differentiate the wild-type and one of the three gyrA mutations previously described as associated with FQ resistance in L. pneumophila : at positions 248C→T (T83I), 259G→A (D87N), and 259G→C (D87H). To assess the performance of these assays, mixtures of FQ-resistant and -susceptible strains of L. pneumophila were analyzed, and the results were compared with those obtained with Sanger DNA sequencing and real-time quantitative PCR (qPCR) technologies. The dPCRgyrA assays were able to detect mutated gyrA sequences in the presence of wild-type sequences at up to 1:1,000 resistant/susceptible allele ratios. By comparison, Sanger DNA sequencing and qPCR were less sensitive, allowing the detection of gyrA mutants at up to 1:1 and 1:10 ratios, respectively. When testing 38 respiratory samples from 23 legionellosis patients (69.6% treated with an FQ), dPCRgyrA detected small amounts of gyrA mutants in four (10.5%) samples from three (13.0%) patients. These results demonstrate that dPCR is a highly sensitive alternative to quantify FQ resistance in L. pneumophila , and it could be used in clinical practice to detect patients that could be at higher risk of therapeutic failure. Copyright © 2017 American Society for Microbiology.
A comparative molecular analysis of water-filled limestone sinkholes in north-eastern Mexico.
Sahl, Jason W; Gary, Marcus O; Harris, J Kirk; Spear, John R
2011-01-01
Sistema Zacatón in north-eastern Mexico is host to several deep, water-filled, anoxic, karstic sinkholes (cenotes). These cenotes were explored, mapped, and geochemically and microbiologically sampled by the autonomous underwater vehicle deep phreatic thermal explorer (DEPTHX). The community structure of the filterable fraction of the water column and extensive microbial mats that coat the cenote walls was investigated by comparative analysis of small-subunit (SSU) 16S rRNA gene sequences. Full-length Sanger gene sequence analysis revealed novel microbial diversity that included three putative bacterial candidate phyla and three additional groups that showed high intra-clade distance with poorly characterized bacterial candidate phyla. Limited functional gene sequence analysis in these anoxic environments identified genes associated with methanogenesis, sulfate reduction and anaerobic ammonium oxidation. A directed, barcoded amplicon, multiplex pyrosequencing approach was employed to compare ∼100,000 bacterial SSU gene sequences from water column and wall microbial mat samples from five cenotes in Sistema Zacatón. A new, high-resolution sequence distribution profile (SDP) method identified changes in specific phylogenetic types (phylotypes) in microbial mats at varied depths; Mantel tests showed a correlation of the genetic distances between mat communities in two cenotes and the geographic location of each cenote. Community structure profiles from the water column of three neighbouring cenotes showed distinct variation; statistically significant differences in the concentration of geochemical constituents suggest that the variation observed in microbial communities between neighbouring cenotes are due to geochemical variation. © 2010 Society for Applied Microbiology and Blackwell Publishing Ltd.
Nahku, Ranno; Peebo, Karl; Valgepea, Kaspar; Barrick, Jeffrey E; Adamberg, Kaarel; Vilu, Raivo
2011-09-01
Nutrient-limited continuous cultures in chemostats have been used to study microbial cell physiology for over 60 years. Genome instability and genetic heterogeneity are possible uncontrolled factors in continuous cultivation experiments. We investigated these issues by using high-throughput (HT) DNA sequencing to characterize samples from different phases of a glucose-limited accelerostat (A-stat) experiment with Escherichia coli K-12 MG1655 and a duration regularly used in cell physiology studies (20 generations of continuous cultivation). Seven consensus mutations from the reference sequence and five subpopulations characterized by different mutations were detected in the HT-sequenced samples. This genetic heterogeneity was confirmed to result from the stock culture by Sanger sequencing. All the subpopulations in which allele frequencies increased (betA, cspG/cspH, glyA) during the experiment were also present at the end of replicate A-stats, indicating that no new subpopulations emerged during our experiments. The fact that ~31 % of the cells in our initial cultures obtained directly from a culture stock centre were mutants raises concerns that even if cultivations are started from single colonies, there is a significant chance of picking a mutant clone with an altered phenotype. Our results show that current HT DNA sequencing technology allows accurate subpopulation analysis and demonstrates that a glucose-limited E. coli K-12 MG1655 A-stat experiment with a duration of tens of generations is suitable for studying cell physiology and collecting quantitative data for metabolic modelling without interference from new mutations.
Nahku, Ranno; Peebo, Karl; Valgepea, Kaspar; Barrick, Jeffrey E.; Adamberg, Kaarel
2011-01-01
Nutrient-limited continuous cultures in chemostats have been used to study microbial cell physiology for over 60 years. Genome instability and genetic heterogeneity are possible uncontrolled factors in continuous cultivation experiments. We investigated these issues by using high-throughput (HT) DNA sequencing to characterize samples from different phases of a glucose-limited accelerostat (A-stat) experiment with Escherichia coli K-12 MG1655 and a duration regularly used in cell physiology studies (20 generations of continuous cultivation). Seven consensus mutations from the reference sequence and five subpopulations characterized by different mutations were detected in the HT-sequenced samples. This genetic heterogeneity was confirmed to result from the stock culture by Sanger sequencing. All the subpopulations in which allele frequencies increased (betA, cspG/cspH, glyA) during the experiment were also present at the end of replicate A-stats, indicating that no new subpopulations emerged during our experiments. The fact that ~31 % of the cells in our initial cultures obtained directly from a culture stock centre were mutants raises concerns that even if cultivations are started from single colonies, there is a significant chance of picking a mutant clone with an altered phenotype. Our results show that current HT DNA sequencing technology allows accurate subpopulation analysis and demonstrates that a glucose-limited E. coli K-12 MG1655 A-stat experiment with a duration of tens of generations is suitable for studying cell physiology and collecting quantitative data for metabolic modelling without interference from new mutations. PMID:21700661
Inal-Gültekin, Güldal; Toptaş-Hekimoğlu, Bahar; Görmez, Zeliha; Gelişin, Özlem; Durmuş, Hacer; Ergüner, Bekir; Demirci, Hüseyin; Sağıroğlu, Mahmut Ş; Parman, Yeşim; Deymeer, Feza; Yılmaz-Aydoğan, Hülya; Pençe, Sadrettin; Bekircan-Kurt, Can Ebru; Tan, Ersin; Erdem-Özdamar, Sevim; Üstek, Duran; Giger, Urs; Öztürk, Oğuz; Serdaroğlu-Oflazer, Piraye
2017-11-01
This study aimed to identify PYGM mutations in patients with McArdle disease from Turkey by next generation sequencing (NGS). Genomic DNA was extracted from the blood of the McArdle patients (n = 67) and unrelated healthy volunteers (n = 53). The PYGM gene was sequenced with NGS and the observed mutations were validated by direct Sanger sequencing. A diagnostic algorithm was developed for patients with suspected McArdle disease. A total of 16 deleterious PYGM mutations were identified, of which 5 were novel, including 1 splice-site donor, 1 frame-shift, and 3 non-synonymous variants. The p.Met1Val (27-patients/11-families) was the most common PYGM mutation, followed by p.Arg576* (6/4), c.1827+7A>G (5/4), c.772+2_3delTG (5/3), p.Phe710del (4/2), p.Lys754Asnfs (2/1), and p.Arg50* (1/1). A molecular diagnostic flowchart is proposed for the McArdle patients in Turkey, covering the 6 most common PYGM mutations found in Turkey as well as the most common mutation in Europe. The diagnostic algorithm may alleviate the need for muscle biopsies in 77.6% of future patients. A prevalence of any of the mutations to a geographical region in Turkey was not identified. Furthermore, the NGS approach to sequence the entire PYGM gene was successful in detecting a common missense mutation and discovering novel mutations in this population study. Copyright © 2017 Elsevier B.V. All rights reserved.
Mikalová, Lenka; Strouhal, Michal; Oppelt, Jan; Grange, Philippe Alain; Janier, Michel; Benhaddou, Nadjet; Dupin, Nicolas; Šmajs, David
2017-01-01
Background Treponema pallidum subsp. endemicum (TEN) is the causative agent of endemic syphilis (bejel). An unusual human TEN 11q/j isolate was obtained from a syphilis-like primary genital lesion from a patient that returned to France from Pakistan. Methodology/Principal findings The TEN 11q/j isolate was characterized using nested PCR followed by Sanger sequencing and/or direct Illumina sequencing. Altogether, 44 chromosomal regions were analyzed. Overall, the 11q/j isolate clustered with TEN strains Bosnia A and Iraq B as expected from previous TEN classification of the 11q/j isolate. However, the 11q/j sequence in a 505 bp-long region at the TP0488 locus was similar to Treponema pallidum subsp. pallidum (TPA) strains, but not to TEN Bosnia A and Iraq B sequences, suggesting a recombination event at this locus. Similarly, the 11q/j sequence in a 613 bp-long region at the TP0548 locus was similar to Treponema pallidum subsp. pertenue (TPE) strains, but not to TEN sequences. Conclusions/Significance A detailed analysis of two recombinant loci found in the 11q/j clinical isolate revealed that the recombination event occurred just once, in the TP0488, with the donor sequence originating from a TPA strain. Since TEN Bosnia A and Iraq B were found to contain TPA-like sequences at the TP0548 locus, the recombination at TP0548 took place in a treponeme that was an ancestor to both TEN Bosnia A and Iraq B. The sequence of 11q/j isolate in TP0548 represents an ancestral TEN sequence that is similar to yaws-causing treponemes. In addition to the importance of the 11q/j isolate for reconstruction of the TEN phylogeny, this case emphasizes the possible role of TEN strains in development of syphilis-like lesions. PMID:28263990
Xu, Yan; Xiao, Xueshan; Li, Shiqiang; Jia, Xiaoyun; Xin, Wei; Wang, Panfeng; Sun, Wenmin; Huang, Li; Guo, Xiangming; Zhang, Qingjiong
2016-08-01
Leber congenital amaurosis (LCA) is the most severe form of inherited retinal dystrophy. We have previously performed a mutational analysis of the known LCA-associated genes in probands with LCA by both Sanger and whole exome sequencing. In this study, whole exome sequencing was carried out on 66 new probabds with LCA. In conjunction with these data, the present study provides a comprehensive analysis of the spectrum and frequency of all known genes associated with retinal dystrophy in a total of 159 Chinese probands with LCA. The known genes responsible for all forms hereditary retinal dystrophy were included based on information from RetNet. The candidate variants were filtered by bioinformatics analysis and confirmed by Sanger sequencing. Potentially causative mutations were further validated in available family members. Overall, a total of 118 putative pathogenic mutations from 23 genes were identified in 56.6% (90/159) of probands. These mutations were harbored in 13 LCA-associated genes and in ten genes related to other forms of retinal dystrophy. The most frequently mutated gene in probands with LCA was GUCY2D (10.7%, 17/159). A series of mutational analyses suggests that all known genes associated with retinal dystrophy account for 56.6% of Chinese patients with LCA. A comprehensive molecular genetic analysis of Chinese patients with LCA provides an overview of the spectrum and frequency of ethno-specific mutations of all known genes, as well as indications about other unknown genes in the remaining probands who lacked identified mutations. Copyright © 2016 Elsevier Ltd. All rights reserved.
Variants in the PRPF8 Gene are Associated with Glaucoma.
Micheal, Shazia; Hogewind, Barend F; Khan, Muhammad Imran; Siddiqui, Sorath Noorani; Zafar, Saemah Nuzhat; Akhtar, Farah; Qamar, Raheel; Hoyng, Carel B; den Hollander, Anneke I
2018-05-01
Glaucoma is the cause of irreversible blindness worldwide. Mutations in six genes have been associated with juvenile- and adult-onset familial primary open angle glaucoma (POAG) prior to this report but they explain only a small proportion of the genetic load. The aim of the study is to identify the novel genetic cause of the POAG in the families with adult-onset glaucoma. Whole exome sequencing (WES) was performed on DNA of two affected individuals, and predicted pathogenic variants were evaluated for segregation in four affected and three unaffected Dutch family members by Sanger sequencing. We identified a pathogenic variant (p.Val956Gly) in the PRPF8 gene, which segregates with the disease in Dutch family. Targeted Sanger sequencing of PRPF8 in a panel of 40 POAG families (18 Pakistani and 22 Dutch) revealed two additional nonsynonymous variants (p.Pro13Leu and p.Met25Thr), which segregate with the disease in two other Pakistani families. Both variants were then analyzed in a case-control cohort consisting of Pakistani 320 POAG cases and 250 matched controls. The p.Pro13Leu and p.Met25Thr variants were identified in 14 and 20 cases, respectively, while they were not detected in controls (p values 0.0004 and 0.0001, respectively). Previously, PRPF8 mutations have been associated with autosomal dominant retinitis pigmentosa (RP). The PRPF8 variants associated with POAG are located at the N-terminus, while all RP-associated mutations cluster at the C-terminus, dictating a clear genotype-phenotype correlation.
Ciavarella, Michele; Miccoli, Sara; Prossomariti, Anna; Pippucci, Tommaso; Bonora, Elena; Buscherini, Francesco; Palombo, Flavia; Zuntini, Roberta; Balbi, Tiziana; Ceccarelli, Claudio; Bazzoli, Franco; Ricciardiello, Luigi; Turchetti, Daniela; Piazzi, Giulia
2018-03-01
Germline variants in the APC gene cause familial adenomatous polyposis. Inherited variants in MutYH, POLE, POLD1, NTHL1, and MSH3 genes and somatic APC mosaicism have been reported as alternative causes of polyposis. However, ~30-50% of cases of polyposis remain genetically unsolved. Thus, the aim of this study was to investigate the genetic causes of unexplained adenomatous polyposis. Eight sporadic cases with >20 adenomatous polyps by 35 years of age or >50 adenomatous polyps by 55 years of age, and no causative germline variants in APC and/or MutYH, were enrolled from a cohort of 56 subjects with adenomatous colorectal polyposis. APC gene mosaicism was investigated on DNA from colonic adenomas by Sanger sequencing or Whole Exome Sequencing (WES). Mosaicism extension to other tissues (peripheral blood, saliva, hair follicles) was evaluated using Sanger sequencing and/or digital PCR. APC second hit was investigated in adenomas from mosaic patients. WES was performed on DNA from peripheral blood to identify additional polyposis candidate variants. We identified APC mosaicism in 50% of patients. In three cases mosaicism was restricted to the colon, while in one it also extended to the duodenum and saliva. One patient without APC mosaicism, carrying an APC in-frame deletion of uncertain significance, was found to harbor rare germline variants in OGG1, POLQ, and EXO1 genes. In conclusion, our restrictive selection criteria improved the detection of mosaic APC patients. In addition, we showed for the first time that an oligogenic inheritance of rare variants might have a cooperative role in sporadic colorectal polyposis onset.
Abou Al-Shaar, Hussam; Qadi, Najeeb; Al-Hamed, Mohamed H; Meyer, Brian F; Bohlega, Saeed
2016-08-15
Cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL) is a hereditary microangiopathy caused by mutations in NOTCH3, very rarely homoallelic. To describe the clinical, radiological, and neuropsychological features in an extended CADASIL family including members with either a homozygous or heterozygous NOTCH3 R1231C mutation. The pedigree included 3 generations of a family with 13 affected individuals. The patients were examined clinically and radiologically. Neuropsychological testing was performed on the proband. Sequencing of the entire coding DNA sequence (CDS) and flanking regions of NOTCH3 was undertaken using PCR amplification and direct Sanger sequencing. Homozygous C3769T mutation, predicting R1231C in exon 22 of NOTCH3 was found in 7 family members. Six other family members harbored the same in the heterozygous state. Homozygous individuals showed a slightly more severe clinical and radiological phenotype of earlier onset compared to their heterozygous counterparts. This study reports the largest number of patients with homozygous NOTCH3 mutation. The phenotype and imaging features of homozygous individuals is within the spectrum of CADASIL, although slightly at the severe end when compared to heterozygotes carrying the same mutation. Both genetic modifiers and environmental factors may play an essential role in modification and alteration of the clinical phenotype and white matter changes among CADASIL patients. Copyright © 2016 Elsevier B.V. All rights reserved.
RAS testing in metastatic colorectal cancer: advances in Europe.
Van Krieken, J Han J M; Rouleau, Etienne; Ligtenberg, Marjolijn J L; Normanno, Nicola; Patterson, Scott D; Jung, Andreas
2016-04-01
Personalized medicine shows promise for maximizing efficacy and minimizing toxicity of anti-cancer treatment. KRAS exon 2 mutations are predictive of resistance to epidermal growth factor receptor-directed monoclonal antibodies in patients with metastatic colorectal cancer. Recent studies have shown that broader RAS testing (KRAS and NRAS) is needed to select patients for treatment. While Sanger sequencing is still used, approaches based on various methodologies are available. Few CE-approved kits, however, detect the full spectrum of RAS mutations. More recently, "next-generation" sequencing has been developed for research use, including parallel semiconductor sequencing and reversible termination. These techniques have high technical sensitivities for detecting mutations, although the ideal threshold is currently unknown. Finally, liquid biopsy has the potential to become an additional tool to assess tumor-derived DNA. For accurate and timely RAS testing, appropriate sampling and prompt delivery of material is critical. Processes to ensure efficient turnaround from sample request to RAS evaluation must be implemented so that patients receive the most appropriate treatment. Given the variety of methodologies, external quality assurance programs are important to ensure a high standard of RAS testing. Here, we review technical and practical aspects of RAS testing for pathologists working with metastatic colorectal cancer tumor samples. The extension of markers from KRAS to RAS testing is the new paradigm for biomarker testing in colorectal cancer.
Goossens, Dirk; Moens, Lotte N; Nelis, Eva; Lenaerts, An-Sofie; Glassee, Wim; Kalbe, Andreas; Frey, Bruno; Kopal, Guido; De Jonghe, Peter; De Rijk, Peter; Del-Favero, Jurgen
2009-03-01
We evaluated multiplex PCR amplification as a front-end for high-throughput sequencing, to widen the applicability of massive parallel sequencers for the detailed analysis of complex genomes. Using multiplex PCR reactions, we sequenced the complete coding regions of seven genes implicated in peripheral neuropathies in 40 individuals on a GS-FLX genome sequencer (Roche). The resulting dataset showed highly specific and uniform amplification. Comparison of the GS-FLX sequencing data with the dataset generated by Sanger sequencing confirmed the detection of all variants present and proved the sensitivity of the method for mutation detection. In addition, we showed that we could exploit the multiplexed PCR amplicons to determine individual copy number variation (CNV), increasing the spectrum of detected variations to both genetic and genomic variants. We conclude that our straightforward procedure substantially expands the applicability of the massive parallel sequencers for sequencing projects of a moderate number of amplicons (50-500) with typical applications in resequencing exons in positional or functional candidate regions and molecular genetic diagnostics. 2008 Wiley-Liss, Inc.
Jia, Ying; Li, Xiaoge; Yang, Dong; Xu, Yi; Guo, Ying; Li, Xin
2018-01-01
The current study aims to identify the pathogenic sites in a core pedigree of Usher syndrome (USH). A core pedigree of USH was analyzed by whole exome sequencing (WES). Mutations were verified by polymerase chain reaction (PCR) amplification and Sanger sequencing. Two pathogenic variations (c.849+2T>C and c.5994G>A) in MYO7A were successfully identified and individually separated from parents. One variant (c.849+2T>C) was nonsense mutation, causing the protein terminated in advance, and the other one (c.5994G>A) located near the boundary of exon could cause aberrant splicing. This study provides a meaningful exploration for identification of clinical core genetic pedigrees. Copyright © 2017 Elsevier B.V. All rights reserved.
Creager, Hannah M; Becker, Ericka A; Sandman, Kelly K; Karl, Julie A; Lank, Simon M; Bimber, Benjamin N; Wiseman, Roger W; Hughes, Austin L; O'Connor, Shelby L; O'Connor, David H
2011-09-01
In recent years, the use of cynomolgus macaques in biomedical research has increased greatly. However, with the exception of the Mauritian population, knowledge of the MHC class II genetics of the species remains limited. Here, using cDNA cloning and Sanger sequencing, we identified 127 full-length MHC class II alleles in a group of 12 Indonesian and 12 Vietnamese cynomolgus macaques. Forty two of these were completely novel to cynomolgus macaques while 61 extended the sequence of previously identified alleles from partial to full length. This more than doubles the number of full-length cynomolgus macaque MHC class II alleles available in GenBank, significantly expanding the allele library for the species and laying the groundwork for future evolutionary and functional studies.
Holt, Kathryn E; Teo, Yik Y; Li, Heng; Nair, Satheesh; Dougan, Gordon; Wain, John; Parkhill, Julian
2009-08-15
Here, we present a method for estimating the frequencies of SNP alleles present within pooled samples of DNA using high-throughput short-read sequencing. The method was tested on real data from six strains of the highly monomorphic pathogen Salmonella Paratyphi A, sequenced individually and in a pool. A variety of read mapping and quality-weighting procedures were tested to determine the optimal parameters, which afforded > or =80% sensitivity of SNP detection and strong correlation with true SNP frequency at poolwide read depth of 40x, declining only slightly at read depths 20-40x. The method was implemented in Perl and relies on the opensource software Maq for read mapping and SNP calling. The Perl script is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/pools/.
Genetic heterogeneity of RPMI-8402, a T-acute lymphoblastic leukemia cell line
STOCZYNSKA-FIDELUS, EWELINA; PIASKOWSKI, SYLWESTER; PAWLOWSKA, ROZA; SZYBKA, MALGORZATA; PECIAK, JOANNA; HULAS-BIGOSZEWSKA, KRYSTYNA; WINIECKA-KLIMEK, MARTA; RIESKE, PIOTR
2016-01-01
Thorough examination of genetic heterogeneity of cell lines is uncommon. In order to address this issue, the present study analyzed the genetic heterogeneity of RPMI-8402, a T-acute lymphoblastic leukemia (T-ALL) cell line. For this purpose, traditional techniques such as fluorescence in situ hybridization and immunocytochemistry were used, in addition to more advanced techniques, including cell sorting, Sanger sequencing and massive parallel sequencing. The results indicated that the RPMI-8402 cell line consists of several genetically different cell subpopulations. Furthermore, massive parallel sequencing of RPMI-8402 provided insight into the evolution of T-ALL carcinogenesis, since this cell line exhibited the genetic heterogeneity typical of T-ALL. Therefore, the use of cell lines for drug testing in future studies may aid the progress of anticancer drug research. PMID:26870252
[Analysis of SOX10 gene mutation in a family affected with Waardenburg syndrome type II].
Zheng, Lei; Yan, Yousheng; Chen, Xue; Zhang, Chuan; Zhang, Qinghua; Feng, Xuan; Hao, Shen
2018-02-10
OBJECTIVE To detect potential mutation of SOX10 gene in a pedigree affected with Warrdenburg syndrome type II. METHODS Genomic DNA was extracted from peripheral blood samples of the proband and his family members. Exons and flanking sequences of MITF, PAX3, SOX10, SNAI2, END3 and ENDRB genes were analyzed by chip capturing and high throughput sequencing. Suspected mutations were verified with Sanger sequencing. RESULTS A c.127C>T (p.R43X) mutation of the SOX10 gene was detected in the proband, for which both parents showed a wild-type genotype. CONCLUSION The c.127C>T (p.R43X) mutation of SOX10 gene probably underlies the ocular symptoms and hearing loss of the proband.
Multiplexed capillary electrophoresis system
Yeung, Edward S.; Li, Qingbo; Lu, Xiandan
1998-04-21
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification ("base calling") is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations.
Multiplexed capillary electrophoresis system
Yeung, Edward S.; Chang, Huan-Tsang; Fung, Eliza N.; Li, Qingbo; Lu, Xiandan
1996-12-10
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification ("base calling") is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations.
Rfam: Wikipedia, clans and the “decimal” release
Gardner, Paul P.; Daub, Jennifer; Tate, John; Moore, Benjamin L.; Osuch, Isabelle H.; Griffiths-Jones, Sam; Finn, Robert D.; Nawrocki, Eric P.; Kolbe, Diana L.; Eddy, Sean R.; Bateman, Alex
2011-01-01
The Rfam database aims to catalogue non-coding RNAs through the use of sequence alignments and statistical profile models known as covariance models. In this contribution, we discuss the pros and cons of using the online encyclopedia, Wikipedia, as a source of community-derived annotation. We discuss the addition of groupings of related RNA families into clans and new developments to the website. Rfam is available on the Web at http://rfam.sanger.ac.uk. PMID:21062808
Multiplexed capillary electrophoresis system
Yeung, E.S.; Li, Q.; Lu, X.
1998-04-21
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification (``base calling``) is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations. 19 figs.
Multiplexed capillary electrophoresis system
Yeung, E.S.; Chang, H.T.; Fung, E.N.; Li, Q.; Lu, X.
1996-12-10
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification (``base calling``) is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations. 19 figs.
Xu, Xiaojing; Yang, Xiaoxu; Wu, Qixi; Liu, Aijie; Yang, Xiaoling; Ye, Adam Yongxin; Huang, August Yue; Li, Jiarui; Wang, Meng; Yu, Zhe; Wang, Sheng; Zhang, Zhichao; Wu, Xiru
2015-01-01
ABSTRACT The majority of children with Dravet syndrome (DS) are caused by de novo SCN1A mutations. To investigate the origin of the mutations, we developed and applied a new method that combined deep amplicon resequencing with a Bayesian model to detect and quantify allelic fractions with improved sensitivity. Of 174 SCN1A mutations in DS probands which were considered “de novo” by Sanger sequencing, we identified 15 cases (8.6%) of parental mosaicism. We identified another five cases of parental mosaicism that were also detectable by Sanger sequencing. Fraction of mutant alleles in the 20 cases of parental mosaicism ranged from 1.1% to 32.6%. Thirteen (65% of 20) mutations originated paternally and seven (35% of 20) maternally. Twelve (60% of 20) mosaic parents did not have any epileptic symptoms. Their mutant allelic fractions were significantly lower than those in mosaic parents with epileptic symptoms (P = 0.016). We identified mosaicism with varied allelic fractions in blood, saliva, urine, hair follicle, oral epithelium, and semen, demonstrating that postzygotic mutations could affect multiple somatic cells as well as germ cells. Our results suggest that more sensitive tools for detecting low‐level mosaicism in parents of families with seemingly “de novo” mutations will allow for better informed genetic counseling. PMID:26096185
Abal-Fabeiro, J L; Maside, X; Llovo, J; Bello, X; Torres, M; Treviño, M; Moldes, L; Muñoz, A; Carracedo, A; Bartolomé, C
2014-04-01
The epidemiological study of human cryptosporidiosis requires the characterization of species and subtypes involved in human disease in large sample collections. Molecular genotyping is costly and time-consuming, making the implementation of low-cost, highly efficient technologies increasingly necessary. Here, we designed a protocol based on MALDI-TOF mass spectrometry for the high-throughput genotyping of a panel of 55 single nucleotide variants (SNVs) selected as markers for the identification of common gp60 subtypes of four Cryptosporidium species that infect humans. The method was applied to a panel of 608 human and 63 bovine isolates and the results were compared with control samples typed by Sanger sequencing. The method allowed the identification of species in 610 specimens (90·9%) and gp60 subtype in 605 (90·2%). It displayed excellent performance, with sensitivity and specificity values of 87·3 and 98·0%, respectively. Up to nine genotypes from four different Cryptosporidium species (C. hominis, C. parvum, C. meleagridis and C. felis) were detected in humans; the most common ones were C. hominis subtype Ib, and C. parvum IIa (61·3 and 28·3%, respectively). 96·5% of the bovine samples were typed as IIa. The method performs as well as the widely used Sanger sequencing and is more cost-effective and less time consuming.
Clinical and genetic characteristics of chinese patients with Birt-Hogg-Dubé syndrome.
Liu, Yaping; Xu, Zhiyan; Feng, Ruie; Zhan, Yongzhong; Wang, Jun; Li, Guozhen; Li, Xue; Zhang, Weihong; Hu, Xiaowen; Tian, Xinlun; Xu, Kai-Feng; Zhang, Xue
2017-05-30
Birt-Hogg-Dubé syndrome (BHD) is an autosomal dominant disorder, the main manifestations of which are fibrofolliculomas, renal tumors, pulmonary cysts and recurrent pneumothorax. The known causative gene for BHD syndrome is the folliculin (FLCN) gene on chromosome 17p11.2. Studies of the FLCN mutation for BHD syndrome are less prevalent in Chinese populations than in Caucasian populations. Our study aims to investigate the genotype spectrum in a group of Chinese patients with BHD. We enrolled 51 patients with symptoms highly suggestive of BHD from January 2014 to February 2017. The FLCN gene was examined using PCR and Sanger sequencing in every patient, for those whose Sanger sequencing showed negative mutation results, multiplex ligation-dependent probe amplification (MLPA) testing was conducted to detect any losses of large segments. Among the 51 patients, 27 had FLCN germline mutations. In total, 20 mutations were identified: 14 were novel mutations, including 3 splice acceptor site mutations, 2 different deletions, 6 nonsense mutations, 1 missense mutation, 1 small insertion, and 1 deletion of the whole exon 8. We found a similar genotype spectrum but different mutant loci in Chinese patients with BHD compared with European and American patients, thus providing stronger evidence for the clinical molecular diagnosis of BHD in China. It suggests that mutation analysis of the FLCN gene should be systematically conducted in patients with cystic lung diseases.
Kudapa, Himabindu; Bharti, Arvind K; Cannon, Steven B; Farmer, Andrew D; Mulaosmanovic, Benjamin; Kramer, Robin; Bohra, Abhishek; Weeks, Nathan T; Crow, John A; Tuteja, Reetu; Shah, Trushar; Dutta, Sutapa; Gupta, Deepak K; Singh, Archana; Gaikwad, Kishor; Sharma, Tilak R; May, Gregory D; Singh, Nagendra K; Varshney, Rajeev K
2012-09-01
A comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes. The resultant transcriptome assembly, referred to as CcTA v2, comprised 21 434 transcript assembly contigs (TACs) with an N50 of 1510 bp, the largest one being ~8 kb. Of the 21 434 TACs, 16 622 (77.5%) could be mapped on to the soybean genome build 1.0.9 under fairly stringent alignment parameters. Based on knowledge of intron junctions, 10 009 primer pairs were designed from 5033 TACs for amplifying intron spanning regions (ISRs). By using in silico mapping of BAC-end-derived SSR loci of pigeonpea on the soybean genome as a reference, putative mapping positions at the chromosome level were predicted for 6284 ISR markers, covering all 11 pigeonpea chromosomes. A subset of 128 ISR markers were analyzed on a set of eight genotypes. While 116 markers were validated, 70 markers showed one to three alleles, with an average of 0.16 polymorphism information content (PIC) value. In summary, the CcTA v2 transcript assembly and ISR markers will serve as a useful resource to accelerate genetic research and breeding applications in pigeonpea.
Shin, Saeam; Kim, Juwon; Kim, Yoonjung; Cho, Sun-Mi; Lee, Kyung-A
2017-10-26
EGFR mutation is an emerging biomarker for treatment selection in non-small-cell lung cancer (NSCLC) patients. However, optimal mutation detection is hindered by complications associated with the biopsy procedure, tumor heterogeneity and limited sensitivity of test methodology. In this study, we evaluated the diagnostic utility of real-time PCR using malignant pleural effusion samples. A total of 77 pleural fluid samples from 77 NSCLC patients were tested using the cobas EGFR mutation test (Roche Molecular Systems). Pleural fluid was centrifuged, and separated cell pellets and supernatants were tested in parallel. Results were compared with Sanger sequencing and/or peptide nucleic acid (PNA)-mediated PCR clamping of matched tumor tissue or pleural fluid samples. All samples showed valid real-time PCR results in one or more DNA samples extracted from cell pellets and supernatants. Compared with other molecular methods, the sensitivity of real-time PCR method was 100%. Concordance rate of real-time PCR and Sanger sequencing plus PNA-mediated PCR clamping was 98.7%. We have confirmed that real-time PCR using pleural fluid had a high concordance rate compared to conventional methods, with no failed samples. Our data demonstrated that the parallel real-time PCR testing using supernatant and cell pellet could offer reliable and robust surrogate strategy when tissue is not available.
2010-01-01
Background The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the Quercus family. These oaks are often a source of raw material for biomass wood and fiber. Pedunculate and sessile oaks, are among the most important deciduous forest tree species in Europe. Despite their ecological and economical importance, very few genomic resources have yet been generated for these species. Here, we describe the development of an EST catalogue that will support ecosystem genomics studies, where geneticists, ecophysiologists, molecular biologists and ecologists join their efforts for understanding, monitoring and predicting functional genetic diversity. Results We generated 145,827 sequence reads from 20 cDNA libraries using the Sanger method. Unexploitable chromatograms and quality checking lead us to eliminate 19,941 sequences. Finally a total of 125,925 ESTs were retained from 111,361 cDNA clones. Pyrosequencing was also conducted for 14 libraries, generating 1,948,579 reads, from which 370,566 sequences (19.0%) were eliminated, resulting in 1,578,192 sequences. Following clustering and assembly using TGICL pipeline, 1,704,117 EST sequences collapsed into 69,154 tentative contigs and 153,517 singletons, providing 222,671 non-redundant sequences (including alternative transcripts). We also assembled the sequences using MIRA and PartiGene software and compared the three unigene sets. Gene ontology annotation was then assigned to 29,303 unigene elements. Blast search against the SWISS-PROT database revealed putative homologs for 32,810 (14.7%) unigene elements, but more extensive search with Pfam, Refseq_protein, Refseq_RNA and eight gene indices revealed homology for 67.4% of them. The EST catalogue was examined for putative homologs of candidate genes involved in bud phenology, cuticle formation, phenylpropanoids biosynthesis and cell wall formation. Our results suggest a good coverage of genes involved in these traits. Comparative orthologous sequences (COS) with other plant gene models were identified and allow to unravel the oak paleo-history. Simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 52,834 SSRs and 36,411 SNPs. All of these are available through the Oak Contig Browser http://genotoul-contigbrowser.toulouse.inra.fr:9092/Quercus_robur/index.html. Conclusions This genomic resource provides a unique tool to discover genes of interest, study the oak transcriptome, and develop new markers to investigate functional diversity in natural populations. PMID:21092232
Tsai, Meng-Che; Yu, Hui-Wen; Liu, Tsunglin; Chou, Yen-Yin; Chiou, Yuan-Yow; Chen, Peng-Chieh
2018-01-01
Alström syndrome (AS) is a rare autosomal recessive disorder that shares clinical features with other ciliopathy-related diseases. Genetic mutation analysis is often required in making differential diagnosis but usually costly in time and effort using conventional Sanger sequencing. Herein we describe a Taiwanese patient presenting cone-rod dystrophy and early-onset obesity that progressed to diabetes mellitus with marked insulin resistance during adolescence. Whole exome sequencing of the patient's genomic DNA identified a novel frameshift mutation in exons 15 (c.10290_10291delTA, p.Lys3431Serfs * 10) and a rare mutation in 16 (c.10823_10824delAG, p.Arg3609Alafs * 6) of ALMS1 gene. The compound heterozygous mutations were predicted to render truncated proteins. This report highlighted the clinical utility of exome sequencing and extended the knowledge of mutation spectrum in AS patients.
Dharmadhikari, Avinash V.; Kang, Sung-Hae L.; Szafranski, Przemyslaw; Person, Richard E.; Sampath, Srirangan; Prakash, Siddharth K.; Bader, Patricia I.; Phillips, John A.; Hannig, Vickie; Williams, Misti; Vinson, Sherry S.; Wilfong, Angus A.; Reimschisel, Tyler E.; Craigen, William J.; Patel, Ankita; Bi, Weimin; Lupski, James R.; Belmont, John; Cheung, Sau Wai; Stankiewicz, Pawel
2012-01-01
We have identified a rare small (∼450 kb unique sequence) recurrent deletion in a previously linked attention-deficit hyperactivity disorder (ADHD) locus at 2q21.1 in five unrelated families with developmental delay (DD)/intellectual disability (ID), ADHD, epilepsy and other neurobehavioral abnormalities from 17 035 samples referred for clinical chromosomal microarray analysis. Additionally, a DECIPHER (http://decipher.sanger.ac.uk) patient 2311 was found to have the same deletion and presented with aggressive behavior. The deletion was not found in either six control groups consisting of 13 999 healthy individuals or in the DGV database. We have also identified reciprocal duplications in five unrelated families with autism, developmental delay (DD), seizures and ADHD. This genomic region is flanked by large, complex low-copy repeats (LCRs) with directly oriented subunits of ∼109 kb in size that have 97.7% DNA sequence identity. We sequenced the deletion breakpoints within the directly oriented paralogous subunits of the flanking LCR clusters, demonstrating non-allelic homologous recombination as a mechanism of formation. The rearranged segment harbors five genes: GPR148, FAM123C, ARHGEF4, FAM168B and PLEKHB2. Expression of ARHGEF4 (Rho guanine nucleotide exchange factor 4) is restricted to the brain and may regulate the actin cytoskeletal network, cell morphology and migration, and neuronal function. GPR148 encodes a G-protein-coupled receptor protein expressed in the brain and testes. We suggest that small rare recurrent deletion of 2q21.1 is pathogenic for DD/ID, ADHD, epilepsy and other neurobehavioral abnormalities and, because of its small size, low frequency and more severe phenotype might have been missed in other previous genome-wide screening studies using single-nucleotide polymorphism analyses. PMID:22543972
Next Generation Sequencing Technologies: The Doorway to the Unexplored Genomics of Non-Model Plants
Unamba, Chibuikem I. N.; Nag, Akshay; Sharma, Ram K.
2015-01-01
Non-model plants i.e., the species which have one or all of the characters such as long life cycle, difficulty to grow in the laboratory or poor fecundity, have been schemed out of sequencing projects earlier, due to high running cost of Sanger sequencing. Consequently, the information about their genomics and key biological processes are inadequate. However, the advent of fast and cost effective next generation sequencing (NGS) platforms in the recent past has enabled the unearthing of certain characteristic gene structures unique to these species. It has also aided in gaining insight about mechanisms underlying processes of gene expression and secondary metabolism as well as facilitated development of genomic resources for diversity characterization, evolutionary analysis and marker assisted breeding even without prior availability of genomic sequence information. In this review we explore how different Next Gen Sequencing platforms, as well as recent advances in NGS based high throughput genotyping technologies are rewarding efforts on de-novo whole genome/transcriptome sequencing, development of genome wide sequence based markers resources for improvement of non-model crops that are less costly than phenotyping. PMID:26734016
Szopa, Magdalena; Ludwig-Galezowska, Agnieszka H; Radkowski, Piotr; Skupien, Jan; Machlowska, Julita; Klupa, Tomasz; Wolkow, Pawel; Borowiec, Maciej; Mlynarski, Wojciech; Malecki, Maciej T
2016-02-01
Until now only a few families with early onset autosomal diabetes due to the NEUROD1 gene mutations have been identified. Moreover, only some of them meet strict MODY (maturity-onset diabetes of the young) criteria. Next-generation sequencing (NGS) provides an opportunity to detect more pathogenic mutations in this gene. Here, we evaluated the segregation of the Arg103Pro mutation in the NEUROD1 gene in a pedigree in which it was detected, and described the clinical characteristics of the mutation carriers. We included 156 diabetic probands of MODY families, among them 52 patients earlier tested for GCK-MODY and/or HNF1A-MODY by Sanger sequencing with negative results. Genetic testing was performed by targeted NGS sequencing using a panel of 28 monogenic diabetes genes. As detected by NGS, one patient had the missense Arg103Pro (CGC/CCC) mutation in the gene NEUROD1 changing the amino-acid structure of the DNA binding domain of this transcription factor. We confirmed this sequence difference by Sanger sequencing. This family had previously been tested with negative results for HNF1A gene mutations. 17 additional members of this family were invited for further testing. We confirmed the presence of the mutation in 11 subjects. Seven adult mutation carriers (all but one) from three generations had been already diagnosed with diabetes. There were 3 individuals with the Arg103Pro mutation diagnosed before the age of 30 years in the family. The range of age of the four unaffected mutation carriers (3 minors and 1 adult) was 3-48 years. Interestingly, one mutation carrier had a history of transient neonatal hypoglycemia, of which the clinical course resembled episodes typical for HNF4A-MODY. We report a family with autosomal dominant diabetes related to a new NEUROD1 mutation, one of very few meeting MODY criteria. The use of the NGS method will facilitate identification of more families with rare forms of MODY. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Le, Thuy; Chiarella, Jennifer; Simen, Birgitte B; Hanczaruk, Bozena; Egholm, Michael; Landry, Marie L; Dieckhaus, Kevin; Rosen, Marc I; Kozal, Michael J
2009-06-29
It is largely unknown how frequently low-abundance HIV drug-resistant variants at levels under limit of detection of conventional genotyping (<20% of quasi-species) are present in antiretroviral-experienced persons experiencing virologic failure. Further, the clinical implications of low-abundance drug-resistant variants at time of virologic failure are unknown. Plasma samples from 22 antiretroviral-experienced subjects collected at time of virologic failure (viral load 1380 to 304,000 copies/mL) were obtained from a specimen bank (from 2004-2007). The prevalence and profile of drug-resistant mutations were determined using Sanger sequencing and ultra-deep pyrosequencing. Genotypes were interpreted using Stanford HIV database algorithm. Antiretroviral treatment histories were obtained by chart review and correlated with drug-resistant mutations. Low-abundance drug-resistant mutations were detected in all 22 subjects by deep sequencing and only in 3 subjects by Sanger sequencing. In total they accounted for 90 of 247 mutations (36%) detected by deep sequencing; the majority of these (95%) were not detected by standard genotyping. A mean of 4 additional mutations per subject were detected by deep sequencing (p<0.0001, 95%CI: 2.85-5.53). The additional low-abundance drug-resistant mutations increased a subject's genotypic resistance to one or more antiretrovirals in 17 of 22 subjects (77%). When correlated with subjects' antiretroviral treatment histories, the additional low-abundance drug-resistant mutations correlated with the failing antiretroviral drugs in 21% subjects and correlated with historical antiretroviral use in 79% subjects (OR, 13.73; 95% CI, 2.5-74.3, p = 0.0016). Low-abundance HIV drug-resistant mutations in antiretroviral-experienced subjects at time of virologic failure can increase a subject's overall burden of resistance, yet commonly go unrecognized by conventional genotyping. The majority of unrecognized resistant mutations correlate with historical antiretroviral use. Ultra-deep sequencing can provide important historical resistance information for clinicians when planning subsequent antiretroviral regimens for highly treatment-experienced patients, particularly when their prior treatment histories and longitudinal genotypes are not available.
Le, Thuy; Chiarella, Jennifer; Simen, Birgitte B.; Hanczaruk, Bozena; Egholm, Michael; Landry, Marie L.; Dieckhaus, Kevin; Rosen, Marc I.; Kozal, Michael J.
2009-01-01
Background It is largely unknown how frequently low-abundance HIV drug-resistant variants at levels under limit of detection of conventional genotyping (<20% of quasi-species) are present in antiretroviral-experienced persons experiencing virologic failure. Further, the clinical implications of low-abundance drug-resistant variants at time of virologic failure are unknown. Methodology/Principal Findings Plasma samples from 22 antiretroviral-experienced subjects collected at time of virologic failure (viral load 1380 to 304,000 copies/mL) were obtained from a specimen bank (from 2004–2007). The prevalence and profile of drug-resistant mutations were determined using Sanger sequencing and ultra-deep pyrosequencing. Genotypes were interpreted using Stanford HIV database algorithm. Antiretroviral treatment histories were obtained by chart review and correlated with drug-resistant mutations. Low-abundance drug-resistant mutations were detected in all 22 subjects by deep sequencing and only in 3 subjects by Sanger sequencing. In total they accounted for 90 of 247 mutations (36%) detected by deep sequencing; the majority of these (95%) were not detected by standard genotyping. A mean of 4 additional mutations per subject were detected by deep sequencing (p<0.0001, 95%CI: 2.85–5.53). The additional low-abundance drug-resistant mutations increased a subject's genotypic resistance to one or more antiretrovirals in 17 of 22 subjects (77%). When correlated with subjects' antiretroviral treatment histories, the additional low-abundance drug-resistant mutations correlated with the failing antiretroviral drugs in 21% subjects and correlated with historical antiretroviral use in 79% subjects (OR, 13.73; 95% CI, 2.5–74.3, p = 0.0016). Conclusions/Significance Low-abundance HIV drug-resistant mutations in antiretroviral-experienced subjects at time of virologic failure can increase a subject's overall burden of resistance, yet commonly go unrecognized by conventional genotyping. The majority of unrecognized resistant mutations correlate with historical antiretroviral use. Ultra-deep sequencing can provide important historical resistance information for clinicians when planning subsequent antiretroviral regimens for highly treatment-experienced patients, particularly when their prior treatment histories and longitudinal genotypes are not available. PMID:19562031
Mutant KRAS promotes malignant pleural effusion formation
Αgalioti, Theodora; Giannou, Anastasios D.; Krontira, Anthi C.; Kanellakis, Nikolaos I.; Kati, Danai; Vreka, Malamati; Pepe, Mario; Spella, Μagda; Lilis, Ioannis; Zazara, Dimitra E.; Nikolouli, Eirini; Spiropoulou, Nikolitsa; Papadakis, Andreas; Papadia, Konstantina; Voulgaridis, Apostolos; Harokopos, Vaggelis; Stamou, Panagiota; Meiners, Silke; Eickelberg, Oliver; Snyder, Linda A.; Antimisiaris, Sophia G.; Kardamakis, Dimitrios; Psallidas, Ioannis; Μarazioti, Antonia; Stathopoulos, Georgios T.
2017-01-01
Malignant pleural effusion (MPE) is the lethal consequence of various human cancers metastatic to the pleural cavity. However, the mechanisms responsible for the development of MPE are still obscure. Here we show that mutant KRAS is important for MPE induction in mice. Pleural disseminated, mutant KRAS bearing tumour cells upregulate and systemically release chemokine ligand 2 (CCL2) into the bloodstream to mobilize myeloid cells from the host bone marrow to the pleural space via the spleen. These cells promote MPE formation, as indicated by splenectomy and splenocyte restoration experiments. In addition, KRAS mutations are frequently detected in human MPE and cell lines isolated thereof, but are often lost during automated analyses, as indicated by manual versus automated examination of Sanger sequencing traces. Finally, the novel KRAS inhibitor deltarasin and a monoclonal antibody directed against CCL2 are equally effective against an experimental mouse model of MPE, a result that holds promise for future efficient therapies against the human condition. PMID:28508873
van der Klift, Heleen M; Tops, Carli M; Hes, Frederik J; Devilee, Peter; Wijnen, Juul T
2012-07-01
Heterozygous germline mutations in the mismatch repair gene PMS2 predispose carriers for Lynch syndrome, an autosomal dominant predisposition to cancer. Here, we present a LINE-1-mediated retrotranspositional insertion in PMS2 as a novel mutation type for Lynch syndrome. This insertion, detected with Southern blot analysis in the genomic DNA of the patient, is characterized as a 2.2 kb long 5' truncated SVA_F element. The insertion is not detectable by current diagnostic testing limited to MLPA and direct Sanger sequencing on genomic DNA. The molecular nature of this insertion could only be resolved in RNA from cultured lymphocytes in which nonsense-mediated RNA decay was inhibited. Our report illustrates the technical problems encountered in the detection of this mutation type. Especially large heterozygous insertions will remain unnoticed because of preferential amplification of the smaller wild-type allele in genomic DNA, and are probably underreported in the mutation spectra of autosomal dominant disorders. © 2012 Wiley Periodicals, Inc.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
DOE Office of Scientific and Technical Information (OSTI.GOV)
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted.more » PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.« less
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.; ...
2017-07-18
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted.more » PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.« less
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Richard A.; Brown, Steven D.
2017-01-01
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. PMID:28769883
Capillaries for use in a multiplexed capillary electrophoresis system
Yeung, Edward S.; Chang, Huan-Tsang; Fung, Eliza N.
1997-12-09
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification ("base calling") is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations.
Capillaries for use in a multiplexed capillary electrophoresis system
Yeung, E.S.; Chang, H.T.; Fung, E.N.
1997-12-09
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification (``base calling``) is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations. 19 figs.
Dual phylogenetic origins of Nigerian lions (Panthera leo).
Tende, Talatu; Bensch, Staffan; Ottosson, Ulf; Hansson, Bengt
2014-07-01
Lion fecal DNA extracts from four individuals each from Yankari Game Reserve and Kainji-Lake National Park (central northeast and west Nigeria, respectively) were Sanger-sequenced for the mitochondrial cytochrome b gene. The sequences were aligned against 61 lion reference sequences from other parts of Africa and India. The sequence data were analyzed further for the construction of phylogenetic trees using the maximum-likelihood approach to depict phylogenetic patterns of distribution among sequences. Our results show that Nigerian lions grouped together with lions from West and Central Africa. At the smaller geographical scale, lions from Kainji-Lake National Park in western Nigeria grouped with lions from Benin (located west of Nigeria), whereas lions from Yankari Game Reserve in central northeastern Nigeria grouped with the lion populations in Cameroon (located east of Nigeria). The finding that the two remaining lion populations in Nigeria have different phylogenetic origins is an important aspect to consider in future decisions regarding management and conservation of rapidly shrinking lion populations in West Africa.
Dual phylogenetic origins of Nigerian lions (Panthera leo)
Tende, Talatu; Bensch, Staffan; Ottosson, Ulf; Hansson, Bengt
2014-01-01
Lion fecal DNA extracts from four individuals each from Yankari Game Reserve and Kainji-Lake National Park (central northeast and west Nigeria, respectively) were Sanger-sequenced for the mitochondrial cytochrome b gene. The sequences were aligned against 61 lion reference sequences from other parts of Africa and India. The sequence data were analyzed further for the construction of phylogenetic trees using the maximum-likelihood approach to depict phylogenetic patterns of distribution among sequences. Our results show that Nigerian lions grouped together with lions from West and Central Africa. At the smaller geographical scale, lions from Kainji-Lake National Park in western Nigeria grouped with lions from Benin (located west of Nigeria), whereas lions from Yankari Game Reserve in central northeastern Nigeria grouped with the lion populations in Cameroon (located east of Nigeria). The finding that the two remaining lion populations in Nigeria have different phylogenetic origins is an important aspect to consider in future decisions regarding management and conservation of rapidly shrinking lion populations in West Africa. PMID:25077018
Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers.
Varshney, Rajeev K; Chen, Wenbin; Li, Yupeng; Bharti, Arvind K; Saxena, Rachit K; Schlueter, Jessica A; Donoghue, Mark T A; Azam, Sarwar; Fan, Guangyi; Whaley, Adam M; Farmer, Andrew D; Sheridan, Jaime; Iwata, Aiko; Tuteja, Reetu; Penmetsa, R Varma; Wu, Wei; Upadhyaya, Hari D; Yang, Shiaw-Pyng; Shah, Trushar; Saxena, K B; Michael, Todd; McCombie, W Richard; Yang, Bicheng; Zhang, Gengyun; Yang, Huanming; Wang, Jun; Spillane, Charles; Cook, Douglas R; May, Gregory D; Xu, Xun; Jackson, Scott A
2011-11-06
Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sanger-based bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds representing 72.7% (605.78 Mb) of the 833.07 Mb pigeonpea genome. Genome analysis predicted 48,680 genes for pigeonpea and also showed the potential role that certain gene families, for example, drought tolerance-related genes, have played throughout the domestication of pigeonpea and the evolution of its ancestors. Although we found a few segmental duplication events, we did not observe the recent genome-wide duplication events observed in soybean. This reference genome sequence will facilitate the identification of the genetic basis of agronomically important traits, and accelerate the development of improved pigeonpea varieties that could improve food security in many developing countries.
Smith, Miriam J; Beetz, Christian; Williams, Simon G; Bhaskar, Sanjeev S; O'Sullivan, James; Anderson, Beverley; Daly, Sarah B; Urquhart, Jill E; Bholah, Zaynab; Oudit, Deemesh; Cheesman, Edmund; Kelsey, Anna; McCabe, Martin G; Newman, William G; Evans, D Gareth R
2014-12-20
Heterozygous germline PTCH1 mutations are causative of Gorlin syndrome (naevoid basal cell carcinoma), but detection rates > 70% have rarely been reported. We aimed to define the causative mutations in individuals with Gorlin syndrome without PTCH1 mutations. We undertook exome sequencing on lymphocyte DNA from four unrelated individuals from families with Gorlin syndrome with no PTCH1 mutations found by Sanger sequencing, multiplex ligation-dependent probe amplification (MLPA), or RNA analysis. A germline heterozygous nonsense mutation in SUFU was identified in one of four exomes. Sanger sequencing of SUFU in 23 additional PTCH1-negative Gorlin syndrome families identified a SUFU mutation in a second family. Copy-number analysis of SUFU by MLPA revealed a large heterozygous deletion in a third family. All three SUFU-positive families fulfilled diagnostic criteria for Gorlin syndrome, although none had odontogenic jaw keratocysts. Each SUFU-positive family included a single case of medulloblastoma, whereas only two (1.7%) of 115 individuals with Gorlin syndrome and a PTCH1 mutation developed medulloblastoma. We demonstrate convincing evidence that SUFU mutations can cause classical Gorlin syndrome. Our study redefines the risk of medulloblastoma in Gorlin syndrome, dependent on the underlying causative gene. Previous reports have found a 5% risk of medulloblastoma in Gorlin syndrome. We found a < 2% risk in PTCH1 mutation-positive individuals, with a risk up to 20× higher in SUFU mutation-positive individuals. Our data suggest childhood brain magnetic resonance imaging surveillance is justified in SUFU-related, but not PTCH1-related, Gorlin syndrome. © 2014 by American Society of Clinical Oncology.
Rahman, Arfatur; Sahrin, Mahfuza; Afrin, Sadia; Earley, Keith; Ahmed, Shahriar; Rahman, S M Mazidur; Banu, Sayera
2016-01-01
GeneXpert MTB/RIF (Xpert) and Genotype MTBDRplus (DRplus) are two World Health Organization (WHO) endorsed probe based molecular drug susceptibility testing (DST) methods for rapid diagnosis of drug resistant tuberculosis. Both methods target the same 81 bp Rifampicin Resistance Determining Region (RRDR) of bacterial RNA polymerase β subunit (rpoB) for detection of Rifampicin (RIF) resistance associated mutations using DNA probes. So there is a correspondence of the probes of each other and expected similarity of probe binding. We analyzed 92 sputum specimens by Xpert, DRplus and LJ proportion method (LJ-DST). We compared molecular DSTs with gold standard LJ-DST. We wanted to see the agreement level of two molecular methods for detection of RIF resistance associated mutations. The 81bp RRDR region of rpoB gene of discrepant cases between the two molecular methods was sequenced by Sanger sequencing. The agreement of Xpert and DRplus with LJ-DST for detection of RIF susceptibility was found to be 93.5% and 92.4%, respectively. We also found 92.4% overall agreement of two molecular methods for the detection of RIF susceptibility. A total of 84 out of 92 samples (91.3%) had agreement on the molecular locus of RRDR mutation by DRplus and Xpert. Sanger sequencing of 81bp RRDR revealed that Xpert probes detected seven of eight discrepant cases correctly and DRplus was erroneous in all the eight cases. Although the overall concordance with LJ-DST was similar for both Xpert and DRplus assay, Xpert demonstrated more accuracy in the detection of RIF susceptibility for discrepant isolates compared with DRplus. This observation would be helpful for the improvement of probe based detection of drug resistance associated mutations especially rpoB mutation in M. tuberculosis.
Rafaelsen, Silje; Johansson, Stefan; Ræder, Helge; Bjerknes, Robert
2015-01-01
Objective Hereditary hypophosphatemias (HH) are rare monogenic conditions characterized by decreased renal tubular phosphate reabsorption. The aim of this study was to explore the prevalence, genotypes, phenotypic spectrum, treatment response, and complications of treatment in the Norwegian population of children with HH. Design Retrospective national cohort study. Methods Sanger sequencing and multiplex ligand-dependent probe amplification analysis of PHEX and Sanger sequencing of FGF23, DMP1, ENPP1KL, and FAM20C were performed to assess genotype in patients with HH with or without rickets in all pediatric hospital departments across Norway. Patients with hypercalcuria were screened for SLC34A3 mutations. In one family, exome sequencing was performed. Information from the patients' medical records was collected for the evaluation of phenotype. Results Twety-eight patients with HH (18 females and ten males) from 19 different families were identified. X-linked dominant hypophosphatemic rickets (XLHR) was confirmed in 21 children from 13 families. The total number of inhabitants in Norway aged 18 or below by 1st January 2010 was 1 109 156, giving an XLHR prevalence of ∼1 in 60 000 Norwegian children. FAM20C mutations were found in two brothers and SLC34A3 mutations in one patient. In XLHR, growth was compromised in spite of treatment with oral phosphate and active vitamin D compounds, with males tending to be more affected than females. Nephrocalcinosis tended to be slightly more common in patients starting treatment before 1 year of age, and was associated with higher average treatment doses of phosphate. However, none of these differences reached statistical significance. Conclusions We present the first national cohort of HH in children. The prevalence of XLHR seems to be lower in Norwegian children than reported earlier. PMID:26543054
Li, Qian; Cao, Xi; Qiu, Hai-Yan; Lu, Jing; Gao, Rui; Liu, Chao; Yuan, Ming-Xia; Yang, Guang-Ran; Yang, Jin-Kui
2016-08-22
To establish a three-step programmed method to find gene mutations related to maturity onset diabetes of the young (MODY). Target region capture and next-generation sequencing (NGS) were performed using customized oligonucleotide probes designed to capture suspected genes for MODY in 11 probands with clinically diagnosed MODY. The suspected associations of certain genes with MODY were then confirmed by Sanger sequencing in the probands and their family members. Finally, to validate variants of one of the genes of interest (glucokinase, GCK) as pathogenic mutations, protein function editing by the variant genes was assessed. In the target region capture and NGS phase, a total of nine variants of seven genes (GCK, WFS1, SLC19A2, SH2B1, SERPINB4, RFX6, and GATA6) were identified in eight probands. Two heterozygous GCK mutations located on the same allele (p.Leu77Arg and p.Val101Met) were identified in a MODY family. Sanger sequencing was used to confirm the variants identified by NGS to be present in probands and their diabetic family members, but not in non-diabetic family members. Finally, enzyme kinetic and thermal stability analyses revealed that the p.Leu77Arg mutation or the p.Leu77Arg mutation in combination with the p.Val101Met mutation inactivates GCK function and stability, while mutation of p.Val101Met alone does not. The p.Leu77Arg but not p.Val101Met GCK mutation is therefore considered a pathogenic mutation associated with MODY. Genetic screening coupled with gene-editing protein function testing is an effective and reliable method by which causative gene mutations of MODY can be identified. Copyright © 2016 Elsevier B.V. All rights reserved.
Lv, Weigang; Wei, Xianda; Guo, Ruolan; Liu, Qin; Zheng, Yu; Chang, Jiazhen; Bai, Ting; Li, Haoxian; Zhang, Jianguang; Song, Zhuo; Cram, David S; Liang, Desheng; Wu, Lingqian
2015-01-01
Noninvasive prenatal testing (NIPT) for monogenic diseases by use of PCR-based strategies requires precise quantification of mutant fetal alleles circulating in the maternal plasma. The study describes the development and validation of a novel assay termed circulating single-molecule amplification and resequencing technology (cSMART) for counting single allelic molecules in plasma. Here we demonstrate the suitability of cSMART for NIPT, with Wilson Disease (WD) as proof of concept. We used Sanger and whole-exome sequencing to identify familial ATP7B (ATPase, Cu(++) transporting, β polypeptide) gene mutations. For cSMART, single molecules were tagged with unique barcodes and circularized, and alleles were targeted and replicated by inverse PCR. The unique single allelic molecules were identified by sequencing and counted, and the percentage of mutant alleles in the original maternal plasma sample was used to determine fetal genotypes. Four families with WD pedigrees consented to the study. Using Sanger and whole-exome sequencing, we mapped the pathogenic ATP7B mutations in each pedigree and confirmed the proband's original diagnosis of WD. After validation of cSMART with defined plasma models mimicking fetal inheritance of paternal, maternal, or both parental mutant alleles, we retrospectively showed in second pregnancies that the fetal genotypes assigned by invasive testing and NIPT were concordant. We developed a reliable and accurate NIPT assay that correctly diagnosed the fetal genotypes in 4 pregnancies at risk for WD. This novel technology has potential as a universal strategy for NIPT of other monogenic disorders, since it requires only knowledge of the parental pathogenic mutations. © 2014 American Association for Clinical Chemistry.
Stam, Remco; Scheikl, Daniela; Tellier, Aurélien
2016-01-01
Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. PMID:27189991
Hajibabaei, Mehrdad; Shokralla, Shadi; Zhou, Xin; Singer, Gregory A. C.; Baird, Donald J.
2011-01-01
Timely and accurate biodiversity analysis poses an ongoing challenge for the success of biomonitoring programs. Morphology-based identification of bioindicator taxa is time consuming, and rarely supports species-level resolution especially for immature life stages. Much work has been done in the past decade to develop alternative approaches for biodiversity analysis using DNA sequence-based approaches such as molecular phylogenetics and DNA barcoding. On-going assembly of DNA barcode reference libraries will provide the basis for a DNA-based identification system. The use of recently introduced next-generation sequencing (NGS) approaches in biodiversity science has the potential to further extend the application of DNA information for routine biomonitoring applications to an unprecedented scale. Here we demonstrate the feasibility of using 454 massively parallel pyrosequencing for species-level analysis of freshwater benthic macroinvertebrate taxa commonly used for biomonitoring. We designed our experiments in order to directly compare morphology-based, Sanger sequencing DNA barcoding, and next-generation environmental barcoding approaches. Our results show the ability of 454 pyrosequencing of mini-barcodes to accurately identify all species with more than 1% abundance in the pooled mixture. Although the approach failed to identify 6 rare species in the mixture, the presence of sequences from 9 species that were not represented by individuals in the mixture provides evidence that DNA based analysis may yet provide a valuable approach in finding rare species in bulk environmental samples. We further demonstrate the application of the environmental barcoding approach by comparing benthic macroinvertebrates from an urban region to those obtained from a conservation area. Although considerable effort will be required to robustly optimize NGS tools to identify species from bulk environmental samples, our results indicate the potential of an environmental barcoding approach for biomonitoring programs. PMID:21533287
Exome capture sequencing identifies a novel mutation in BBS4
Wang, Hui; Chen, Xianfeng; Dudinsky, Lynn; Patenia, Claire; Chen, Yiyun; Li, Yumei; Wei, Yue; Abboud, Emad B.; Al-Rajhi, Ali A.; Lewis, Richard Alan; Lupski, James R.; Mardon, Graeme; Gibbs, Richard A.; Perkins, Brian D.
2011-01-01
Purpose Leber congenital amaurosis (LCA) is one of the most severe eye dystrophies characterized by severe vision loss at an early stage and accounts for approximately 5% of all retinal dystrophies. The purpose of this study was to identify a novel LCA disease allele or gene and to develop an approach combining genetic mapping with whole exome sequencing. Methods Three patients from King Khaled Eye Specialist Hospital (KKESH205) underwent whole genome single nucleotide polymorphism genotyping, and a single candidate region was identified. Taking advantage of next-generation high-throughput DNA sequencing technologies, whole exome capture sequencing was performed on patient KKESH205#7. Sanger direct sequencing was used during the validation step. The zebrafish model was used to examine the function of the mutant allele. Results A novel missense mutation in Bardet-Biedl syndrome 4 protein (BBS4) was identified in a consanguineous family from Saudi Arabia. This missense mutation in the fifth exon (c.253G>C;p.E85Q) of BBS4 is likely a disease-causing mutation as it segregates with the disease. The mutation is not found in the single nucleotide polymorphism (SNP) database, the 1000 Genomes Project, or matching normal controls. Functional analysis of this mutation in zebrafish indicates that the G253C allele is pathogenic. Coinjection of the G253C allele cannot rescue the mislocalization of rhodopsin in the retina when BBS4 is knocked down by morpholino injection. Immunofluorescence analysis in cell culture shows that this missense mutation in BBS4 does not cause obvious defects in protein expression or pericentriolar localization. Conclusions This mutation likely mainly reduces or abolishes BBS4 function in the retina. Further studies of this allele will provide important insights concerning the pleiotropic nature of BBS4 function. PMID:22219648
Milosevic Feenstra, Jelena D.; Nivarthi, Harini; Gisslinger, Heinz; Leroy, Emilie; Rumi, Elisa; Chachoua, Ilyas; Bagienski, Klaudia; Kubesova, Blanka; Pietra, Daniela; Gisslinger, Bettina; Milanesi, Chiara; Jäger, Roland; Chen, Doris; Berg, Tiina; Schalling, Martin; Schuster, Michael; Bock, Christoph; Constantinescu, Stefan N.; Cazzola, Mario
2016-01-01
Essential thrombocythemia (ET) and primary myelofibrosis (PMF) are chronic diseases characterized by clonal hematopoiesis and hyperproliferation of terminally differentiated myeloid cells. The disease is driven by somatic mutations in exon 9 of CALR or exon 10 of MPL or JAK2-V617F in >90% of the cases, whereas the remaining cases are termed “triple negative.” We aimed to identify the disease-causing mutations in the triple-negative cases of ET and PMF by applying whole-exome sequencing (WES) on paired tumor and control samples from 8 patients. We found evidence of clonal hematopoiesis in 5 of 8 studied cases based on clonality analysis and presence of somatic genetic aberrations. WES identified somatic mutations in 3 of 8 cases. We did not detect any novel recurrent somatic mutations. In 3 patients with clonal hematopoiesis analyzed by WES, we identified a somatic MPL-S204P, a germline MPL-V285E mutation, and a germline JAK2-G571S variant. We performed Sanger sequencing of the entire coding region of MPL in 62, and of JAK2 in 49 additional triple-negative cases of ET or PMF. New somatic (T119I, S204F, E230G, Y591D) and 1 germline (R321W) MPL mutation were detected. All of the identified MPL mutations were gain-of-function when analyzed in functional assays. JAK2 variants were identified in 5 of 57 triple-negative cases analyzed by WES and Sanger sequencing combined. We could demonstrate that JAK2-V625F and JAK2-F556V are gain-of-function mutations. Our results suggest that triple-negative cases of ET and PMF do not represent a homogenous disease entity. Cases with polyclonal hematopoiesis might represent hereditary disorders. PMID:26423830
Milosevic Feenstra, Jelena D; Nivarthi, Harini; Gisslinger, Heinz; Leroy, Emilie; Rumi, Elisa; Chachoua, Ilyas; Bagienski, Klaudia; Kubesova, Blanka; Pietra, Daniela; Gisslinger, Bettina; Milanesi, Chiara; Jäger, Roland; Chen, Doris; Berg, Tiina; Schalling, Martin; Schuster, Michael; Bock, Christoph; Constantinescu, Stefan N; Cazzola, Mario; Kralovics, Robert
2016-01-21
Essential thrombocythemia (ET) and primary myelofibrosis (PMF) are chronic diseases characterized by clonal hematopoiesis and hyperproliferation of terminally differentiated myeloid cells. The disease is driven by somatic mutations in exon 9 of CALR or exon 10 of MPL or JAK2-V617F in >90% of the cases, whereas the remaining cases are termed "triple negative." We aimed to identify the disease-causing mutations in the triple-negative cases of ET and PMF by applying whole-exome sequencing (WES) on paired tumor and control samples from 8 patients. We found evidence of clonal hematopoiesis in 5 of 8 studied cases based on clonality analysis and presence of somatic genetic aberrations. WES identified somatic mutations in 3 of 8 cases. We did not detect any novel recurrent somatic mutations. In 3 patients with clonal hematopoiesis analyzed by WES, we identified a somatic MPL-S204P, a germline MPL-V285E mutation, and a germline JAK2-G571S variant. We performed Sanger sequencing of the entire coding region of MPL in 62, and of JAK2 in 49 additional triple-negative cases of ET or PMF. New somatic (T119I, S204F, E230G, Y591D) and 1 germline (R321W) MPL mutation were detected. All of the identified MPL mutations were gain-of-function when analyzed in functional assays. JAK2 variants were identified in 5 of 57 triple-negative cases analyzed by WES and Sanger sequencing combined. We could demonstrate that JAK2-V625F and JAK2-F556V are gain-of-function mutations. Our results suggest that triple-negative cases of ET and PMF do not represent a homogenous disease entity. Cases with polyclonal hematopoiesis might represent hereditary disorders. © 2016 by The American Society of Hematology.
Timmermans, M J T N; Dodsworth, S; Culverwell, C L; Bocak, L; Ahrens, D; Littlewood, D T J; Pons, J; Vogler, A P
2010-11-01
Mitochondrial genome sequences are important markers for phylogenetics but taxon sampling remains sporadic because of the great effort and cost required to acquire full-length sequences. Here, we demonstrate a simple, cost-effective way to sequence the full complement of protein coding mitochondrial genes from pooled samples using the 454/Roche platform. Multiplexing was achieved without the need for expensive indexing tags ('barcodes'). The method was trialled with a set of long-range polymerase chain reaction (PCR) fragments from 30 species of Coleoptera (beetles) sequenced in a 1/16th sector of a sequencing plate. Long contigs were produced from the pooled sequences with sequencing depths ranging from ∼10 to 100× per contig. Species identity of individual contigs was established via three 'bait' sequences matching disparate parts of the mitochondrial genome obtained by conventional PCR and Sanger sequencing. This proved that assembly of contigs from the sequencing pool was correct. Our study produced sequences for 21 nearly complete and seven partial sets of protein coding mitochondrial genes. Combined with existing sequences for 25 taxa, an improved estimate of basal relationships in Coleoptera was obtained. The procedure could be employed routinely for mitochondrial genome sequencing at the species level, to provide improved species 'barcodes' that currently use the cox1 gene only.
Sample, Paul J.; Gaston, Kirk W.; Alfonzo, Juan D.; Limbach, Patrick A.
2015-01-01
Ribosomal ribonucleic acid (RNA), transfer RNA and other biological or synthetic RNA polymers can contain nucleotides that have been modified by the addition of chemical groups. Traditional Sanger sequencing methods cannot establish the chemical nature and sequence of these modified-nucleotide containing oligomers. Mass spectrometry (MS) has become the conventional approach for determining the nucleotide composition, modification status and sequence of modified RNAs. Modified RNAs are analyzed by MS using collision-induced dissociation tandem mass spectrometry (CID MS/MS), which produces a complex dataset of oligomeric fragments that must be interpreted to identify and place modified nucleosides within the RNA sequence. Here we report the development of RoboOligo, an interactive software program for the robust analysis of data generated by CID MS/MS of RNA oligomers. There are three main functions of RoboOligo: (i) automated de novo sequencing via the local search paradigm. (ii) Manual sequencing with real-time spectrum labeling and cumulative intensity scoring. (iii) A hybrid approach, coined ‘variable sequencing’, which combines the user intuition of manual sequencing with the high-throughput sampling of automated de novo sequencing. PMID:25820423
Mutation screening of Chinese Treacher Collins syndrome patients identified novel TCOF1 mutations.
Chen, Ying; Guo, Luo; Li, Chen-Long; Shan, Jing; Xu, Hai-Song; Li, Jie-Ying; Sun, Shan; Hao, Shao-Juan; Jin, Lei; Chai, Gang; Zhang, Tian-Yu
2018-04-01
Treacher Collins syndrome (TCS) (OMIM 154500) is a rare congenital craniofacial disorder with an autosomal dominant manner of inheritance in most cases. To date, three pathogenic genes (TCOF1, POLR1D and POLR1C) have been identified. In this study, we conducted mutational analysis on Chinese TCS patients to reveal a mutational spectrum of known causative genes and show phenotype-genotype data to provide more information for gene counselling and future studies on the pathogenesis of TCS. Twenty-two TCS patients were recruited from two tertiary referral centres, and Sanger sequencing for the coding exons and exon-intron boundaries of TCOF1, POLR1D and POLR1C was performed. For patients without small variants, further copy number variations (CNVs) analysis was conducted using high-density SNP array platforms. The Sanger sequencing overall mutation detection rate was as high as 86.3% (19/22) for our cohort. Fifteen TCOF1 pathogenic variants, including ten novel mutations, were identified in nineteen patients. No causative mutations in POLR1D and POLR1C genes and no CNVs mutations were detected. A suspected autosomal dominant inheritance case that implies germinal mosaicism was described. Our study confirmed that TCOF1 was the main disease-causing gene for the Chinese TCS population and revealed its mutation spectrum. We also addressed the need for more studies of mosaicism in TCS cases, which could explain the mechanism of autosomal dominant inheritance in TCS cases and benefit the prevention of TCS.
[Chromosomal large fragment deletion induced by CRISPR/Cas9 gene editing system].
Cheng, L H; Liu, Y; Niu, T
2017-05-14
Objective: Using CRISPR-Cas9 gene editing technology to achieve a number of genes co-deletion on the same chromosome. Methods: CRISPR-Cas9 lentiviral plasmid that could induce deletion of Aloxe3-Alox12b-Alox8 cluster genes located on mouse 11B3 chromosome was constructed via molecular clone. HEK293T cells were transfected to package lentivirus of CRISPR or Cas9 cDNA, then mouse NIH3T3 cells were infected by lentivirus and genomic DNA of these cells was extracted. The deleted fragment was amplified by PCR, TA clone, Sanger sequencing and other techniques were used to confirm the deletion of Aloxe3-Alox12b-Alox8 cluster genes. Results: The CRISPR-Cas9 lentiviral plasmid, which could induce deletion of Aloxe3-Alox12b-Alox8 cluster genes, was successfully constructed. Deletion of target chromosome fragment (Aloxe3-Alox12b-Alox8 cluster genes) was verified by PCR. The deletion of Aloxe3-Alox12b-Alox8 cluster genes was affirmed by TA clone, Sanger sequencing, and the breakpoint junctions of the CRISPR-Cas9 system mediate cutting events were accurately recombined, insertion mutation did not occur between two cleavage sites at all. Conclusion: Large fragment deletion of Aloxe3-Alox12b-Alox8 cluster genes located on mouse chromosome 11B3 was successfully induced by CRISPR-Cas9 gene editing system.
Tan, Ling; Hu, Yerong; Tao, Yongguang; Wang, Bin; Xiao, Jun; Tang, Zhenjie; Lu, Ting
2018-01-01
Background To identify whether RET is a potential target for NSCLC treatment, we examined the status of the RET gene in 631 early and mid stage NSCLC cases from south central China. Methods RET expression was identified by Western blot. RET‐positive expression samples were verified by immunohistochemistry. RET gene mutation, copy number variation, and rearrangement were analyzed by DNA Sanger sequencing, TaqMan copy number assays, and reverse transcription‐PCR. ALK and ROS1 expression levels were tested by Western blot and EGFR mutation using Sanger sequencing. Results The RET‐positive rate was 2.5% (16/631). RET‐positive expression was related to poorer tumor differentiation (P < 0.05). In the 16 RET‐positive samples, only two samples of moderately and poorly differentiated lung adenocarcinomas displayed RET rearrangement, both in RET‐KIF5B fusion partners. Neither ALK nor ROS1 translocation was found. The EGFR mutation rate in RET‐positive samples was significantly lower than in RET‐negative samples (P < 0.05). Conclusion RET‐positive expression in early and mid stage NSCLC cases from south central China is relatively low and is related to poorer tumor differentiation. RET gene alterations (copy number gain and rearrangement) exist in all RET‐positive samples. RET‐positive expression is a relatively independent factor in NSCLC patients, which indicates that the RET gene may be a novel target site for personalized treatment of NSCLC. PMID:29473341
Bu, Rong; Siraj, Abdul K; Al-Obaisi, Khadija A S; Beg, Shaham; Al Hazmi, Mohsen; Ajarim, Dahish; Tulbah, Asma; Al-Dayel, Fouad; Al-Kuraya, Khawla S
2016-09-01
Ethnic differences of breast cancer genomics have prompted us to investigate the spectra of BRCA1 and BRCA2 mutations in different populations. The prevalence and effect of BRCA 1 and BRCA 2 mutations in Middle Eastern population is not fully explored. To characterize the prevalence of BRCA mutations in Middle Eastern breast cancer patients, BRCA mutation screening was performed in 818 unselected breast cancer patients using Capture and/or Sanger sequencing. 19 short tandem repeat (STR) markers were used for founder mutation analysis. In our study, nine different types of deleterious mutation were identified in 28 (3.4%) cases, 25 (89.3%) cases in BRCA 1 and 3 (10.7%) cases in BRCA 2. Seven recurrent mutations identified accounted for 92.9% (26/28) of all the mutant cases. Haplotype analysis was performed to confirm c.1140 dupG and c.4136_4137delCT mutations as novel putative founder mutation, accounting for 46.4% (13/28) of all BRCA mutant cases and 1.6% (13/818) of all the breast cancer cases, respectively. Moreover, BRCA 1 mutation was significantly associated with BRCA 1 protein expression loss (p = 0.0005). Our finding revealed that a substantial number of BRCA mutations were identified in clinically high risk breast cancer from Middle East region. Identification of the mutation spectrum, prevalence and founder effect in Middle Eastern population facilitates genetic counseling, risk assessment and development of cost-effective screening strategy. © 2016 UICC.
Novel variants in PAX6 gene caused congenital aniridia in two Chinese families.
Zhang, R; Linpeng, S; Wei, X; Li, H; Huang, Y; Guo, J; Wu, Q; Liang, D; Wu, L
2017-06-01
PurposeTo reveal the underlying genetic defect in two four-generation Chinese families with aniridia and explore the pathologic mechanism.MethodsFull ophthalmic examinations were performed in two families with aniridia. The PAX6 gene was directly sequenced in patients of two families, and the detected variants were screened in unaffected family members and two hundred unrelated healthy controls. Real-time quantitative PCR was used to explore pathologic mechanisms of the two variants.ResultsAniridia, cataract, and oscillatory nystagmus were observed in patients of the two families. In addition, we observed corneal opacity and microphthalmus in family 1, and strabismus, left ectopia lentis, microphthalmus, and microcornea in family 2. Sanger sequencing detected a novel 1-bp duplication (c.50dupA) in family 1 and a novel 2-bp splice site deletion (c.765+1_765+2delGT) in family 2. Sequencing of cDNA indicated skipping of exon 9 caused by the splice site deletion, being predicted to cause a premature stop codon, as well as the duplication. The PAX6 mRNA significantly lower in patients with aniridia than in unaffected family members in both families, suggesting that the duplication and splice site deletion caused nonsense-mediated mRNA decay.ConclusionsOur study identified two novel PAX6 variants in two families with aniridia and revealed the pathogenicity of the variants; this would expand the variant spectrum of PAX6 and help us better understand the molecular basis of aniridia, thus facilitating genetic counseling.
The genome sequence of ectromelia virus Naval and Cornell isolates from outbreaks in North America.
Mavian, Carla; López-Bueno, Alberto; Bryant, Neil A; Seeger, Kathy; Quail, Michael A; Harris, David; Barrell, Bart; Alcami, Antonio
2014-08-01
Ectromelia virus (ECTV) is the causative agent of mousepox, a disease of laboratory mouse colonies and an excellent model for human smallpox. We report the genome sequence of two isolates from outbreaks in laboratory mouse colonies in the USA in 1995 and 1999: ECTV-Naval and ECTV-Cornell, respectively. The genome of ECTV-Naval and ECTV-Cornell was sequenced by the 454-Roche technology. The ECTV-Naval genome was also sequenced by the Sanger and Illumina technologies in order to evaluate these technologies for poxvirus genome sequencing. Genomic comparisons revealed that ECTV-Naval and ECTV-Cornell correspond to the same virus isolated from independent outbreaks. Both ECTV-Naval and ECTV-Cornell are extremely virulent in susceptible BALB/c mice, similar to ECTV-Moscow. This is consistent with the ECTV-Naval genome sharing 98.2% DNA sequence identity with that of ECTV-Moscow, and indicates that the genetic differences with ECTV-Moscow do not affect the virulence of ECTV-Naval in the mousepox model of footpad infection. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Identification of rare paired box 3 variant in strabismus by whole exome sequencing
Gong, Hui-Min; Wang, Jing; Xu, Jing; Zhou, Zhan-Yu; Li, Jing-Wen; Chen, Shu-Fang
2017-01-01
AIM To identify the potentially pathogenic gene variants that contributes to the etiology of strabismus. METHODS A Chinese pedigree with strabismus was collected and the exomes of two affected individuals were sequenced using the next-generation sequencing technology. The resulting variants from exome sequencing were filtered by subsequent bioinformatics methods and the candidate mutation was verified as heterozygous in the affected proposita and her mother by sanger sequencing. RESULTS Whole exome sequencing and filtering identified a nonsynonymous mutation c.434G-T transition in paired box 3 (PAX3) in the two affected individuals, which were predicted to be deleterious by more than 4 bioinformatics programs. This altered amino acid residue was located in the conserved PAX domain of PAX3. This gene encodes a member of the PAX family of transcription factors, which play critical roles during fetal development. Mutations in PAX3 were associated with Waardenburg syndrome with strabismus. CONCLUSION Our results report that the c.434G-T mutation (p.R145L) in PAX3 may contribute to strabismus, expanding our understanding of the causally relevant genes for this disorder. PMID:28861346
Identification of rare paired box 3 variant in strabismus by whole exome sequencing.
Gong, Hui-Min; Wang, Jing; Xu, Jing; Zhou, Zhan-Yu; Li, Jing-Wen; Chen, Shu-Fang
2017-01-01
To identify the potentially pathogenic gene variants that contributes to the etiology of strabismus. A Chinese pedigree with strabismus was collected and the exomes of two affected individuals were sequenced using the next-generation sequencing technology. The resulting variants from exome sequencing were filtered by subsequent bioinformatics methods and the candidate mutation was verified as heterozygous in the affected proposita and her mother by sanger sequencing. Whole exome sequencing and filtering identified a nonsynonymous mutation c.434G-T transition in paired box 3 (PAX3) in the two affected individuals, which were predicted to be deleterious by more than 4 bioinformatics programs. This altered amino acid residue was located in the conserved PAX domain of PAX3. This gene encodes a member of the PAX family of transcription factors, which play critical roles during fetal development. Mutations in PAX3 were associated with Waardenburg syndrome with strabismus. Our results report that the c.434G-T mutation (p.R145L) in PAX3 may contribute to strabismus, expanding our understanding of the causally relevant genes for this disorder.
Guo, Bingfu; Guo, Yong; Hong, Huilong; Qiu, Li-Juan
2016-01-01
Molecular characterization of sequence flanking exogenous fragment insertion is essential for safety assessment and labeling of genetically modified organism (GMO). In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans GE-J16 and ZH10-6 based on whole genome sequencing (WGS) method. More than 22.4 Gb sequence data (∼21 × coverage) for each line was generated on Illumina HiSeq 2500 platform. The junction reads mapped to boundaries of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with soybean reference genome and sequence of transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated in positions of Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of genomic insertion sites of G2-EPSPS and GAT transgenes will facilitate the utilization of their glyphosate-tolerant traits in soybean breeding program. These results also demonstrated that WGS was a cost-effective and rapid method for identifying sites of T-DNA insertions and flanking sequences in soybean.
EMPOP-quality mtDNA control region sequences from Kashmiri of Azad Jammu & Kashmir, Pakistan.
Rakha, Allah; Peng, Min-Sheng; Bi, Rui; Song, Jiao-Jiao; Salahudin, Zeenat; Adan, Atif; Israr, Muhammad; Yao, Yong-Gang
2016-11-01
The mitochondrial DNA (mtDNA) control region (nucleotide position 16024-576) sequences were generated through Sanger sequencing method for 317 self-identified Kashmiris from all districts of Azad Jammu & Kashmir Pakistan. The population sample set showed a total of 251 haplotypes, with a relatively high haplotype diversity (0.9977) and a low random match probability (0.54%). The containing matrilineal lineages belonging to three different phylogeographic origins of Western Eurasian (48.9%), South Asian (47.0%) and East Asian (4.1%). The present study was compared to previous data from Pakistan and other worldwide populations (Central Asia, Western Asia, and East & Southeast Asia). The dataset is made available through EMPOP under accession number EMP00679 and will serve as an mtDNA reference database in forensic casework in Pakistan. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Exome sequencing identifies SUCO mutations in mesial temporal lobe epilepsy.
Sha, Zhiqiang; Sha, Longze; Li, Wenting; Dou, Wanchen; Shen, Yan; Wu, Liwen; Xu, Qi
2015-03-30
Mesial temporal lobe epilepsy (mTLE) is the main type and most common medically intractable form of epilepsy. Severity of disease-based stratified samples may help identify new disease-associated mutant genes. We analyzed mRNA expression profiles from patient hippocampal tissue. Three of the seven patients had severe mTLE with generalized-onset convulsions and consciousness loss that occurred over many years. We found that compared with other groups, patients with severe mTLE were classified into a distinct group. Whole-exome sequencing and Sanger sequencing validation in all seven patients identified three novel SUN domain-containing ossification factor (SUCO) mutations in severely affected patients. Furthermore, SUCO knock down significantly reduced dendritic length in vitro. Our results indicate that mTLE defects may affect neuronal development, and suggest that neurons have abnormal development due to lack of SUCO, which may be a generalized-onset epilepsy-related gene. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Characterization of Canna yellow mottle virus in a New Host, Alpinia purpurata, in Hawaii.
Zhang, Jingxin; Dey, Kishore K; Lin, Birun; Borth, Wayne B; Melzer, Michael J; Sether, Diane; Wang, Yanan; Wang, I-Chin; Shen, Huifang; Pu, Xiaoming; Sun, Dayuan; Hu, John S
2017-06-01
Canna yellow mottle virus (CaYMV) is an important badnavirus infecting Canna spp. worldwide. This is the first report of CaYMV in flowering ginger (Alpinia purpurata) in Hawaii, where it is associated with yellow mottling and necrosis of leaves, vein streaking, and stunted plants. We have sequenced CaYMV in A. purpurata (CaYMV-Ap) using a combination of next-generation sequencing and traditional Sanger sequencing techniques. The complete genome of CaYMV-Ap was 7,120 bp with an organization typical of other Badnavirus species. Our results indicated that CaYMV-Ap was present in the episomal form in infected flowering ginger. We determined that this virus disease is prevalent in Hawaii and could potentially have significant economic impact on the marketing of A. purpurata as cut flowers. There is a potential concern that the host range of CaYMV-Ap may expand to include other important tropical plants.
Zilinskas, Julius; Lančinskas, Algirdas; Guarracino, Mario Rosario
2014-01-01
In this paper we propose some mathematical models to plan a Next Generation Sequencing experiment to detect rare mutations in pools of patients. A mathematical optimization problem is formulated for optimal pooling, with respect to minimization of the experiment cost. Then, two different strategies to replicate patients in pools are proposed, which have the advantage to decrease the overall costs. Finally, a multi-objective optimization formulation is proposed, where the trade-off between the probability to detect a mutation and overall costs is taken into account. The proposed solutions are devised in pursuance of the following advantages: (i) the solution guarantees mutations are detectable in the experimental setting, and (ii) the cost of the NGS experiment and its biological validation using Sanger sequencing is minimized. Simulations show replicating pools can decrease overall experimental cost, thus making pooling an interesting option.
Dusi, Sabrina; Valletta, Lorella; Haack, Tobias B.; Tsuchiya, Yugo; Venco, Paola; Pasqualato, Sebastiano; Goffrini, Paola; Tigano, Marco; Demchenko, Nikita; Wieland, Thomas; Schwarzmayr, Thomas; Strom, Tim M.; Invernizzi, Federica; Garavaglia, Barbara; Gregory, Allison; Sanford, Lynn; Hamada, Jeffrey; Bettencourt, Conceição; Houlden, Henry; Chiapparini, Luisa; Zorzi, Giovanna; Kurian, Manju A.; Nardocci, Nardo; Prokisch, Holger; Hayflick, Susan; Gout, Ivan; Tiranti, Valeria
2014-01-01
Neurodegeneration with brain iron accumulation (NBIA) comprises a clinically and genetically heterogeneous group of disorders with progressive extrapyramidal signs and neurological deterioration, characterized by iron accumulation in the basal ganglia. Exome sequencing revealed the presence of recessive missense mutations in COASY, encoding coenzyme A (CoA) synthase in one NBIA-affected subject. A second unrelated individual carrying mutations in COASY was identified by Sanger sequence analysis. CoA synthase is a bifunctional enzyme catalyzing the final steps of CoA biosynthesis by coupling phosphopantetheine with ATP to form dephospho-CoA and its subsequent phosphorylation to generate CoA. We demonstrate alterations in RNA and protein expression levels of CoA synthase, as well as CoA amount, in fibroblasts derived from the two clinical cases and in yeast. This is the second inborn error of coenzyme A biosynthesis to be implicated in NBIA. PMID:24360804
2017-01-01
Amplicon (targeted) sequencing by massively parallel sequencing (PCR-MPS) is a potential method for use in forensic DNA analyses. In this application, PCR-MPS may supplement or replace other instrumental analysis methods such as capillary electrophoresis and Sanger sequencing for STR and mitochondrial DNA typing, respectively. PCR-MPS also may enable the expansion of forensic DNA analysis methods to include new marker systems such as single nucleotide polymorphisms (SNPs) and insertion/deletions (indels) that currently are assayable using various instrumental analysis methods including microarray and quantitative PCR. Acceptance of PCR-MPS as a forensic method will depend in part upon developing protocols and criteria that define the limitations of a method, including a defensible analytical threshold or method detection limit. This paper describes an approach to establish objective analytical thresholds suitable for multiplexed PCR-MPS methods. A definition is proposed for PCR-MPS method background noise, and an analytical threshold based on background noise is described. PMID:28542338
mtDNA sequence diversity of Hazara ethnic group from Pakistan.
Rakha, Allah; Fatima; Peng, Min-Sheng; Adan, Atif; Bi, Rui; Yasmin, Memona; Yao, Yong-Gang
2017-09-01
The present study was undertaken to investigate mitochondrial DNA (mtDNA) control region sequences of Hazaras from Pakistan, so as to generate mtDNA reference database for forensic casework in Pakistan and to analyze phylogenetic relationship of this particular ethnic group with geographically proximal populations. Complete mtDNA control region (nt 16024-576) sequences were generated through Sanger Sequencing for 319 Hazara individuals from Quetta, Baluchistan. The population sample set showed a total of 189 distinct haplotypes, belonging mainly to West Eurasian (51.72%), East & Southeast Asian (29.78%) and South Asian (18.50%) haplogroups. Compared with other populations from Pakistan, the Hazara population had a relatively high haplotype diversity (0.9945) and a lower random match probability (0.0085). The dataset has been incorporated into EMPOP database under accession number EMP00680. The data herein comprises the largest, and likely most thoroughly examined, control region mtDNA dataset from Hazaras of Pakistan. Copyright © 2017 Elsevier B.V. All rights reserved.
Young, Brian; King, Jonathan L; Budowle, Bruce; Armogida, Luigi
2017-01-01
Amplicon (targeted) sequencing by massively parallel sequencing (PCR-MPS) is a potential method for use in forensic DNA analyses. In this application, PCR-MPS may supplement or replace other instrumental analysis methods such as capillary electrophoresis and Sanger sequencing for STR and mitochondrial DNA typing, respectively. PCR-MPS also may enable the expansion of forensic DNA analysis methods to include new marker systems such as single nucleotide polymorphisms (SNPs) and insertion/deletions (indels) that currently are assayable using various instrumental analysis methods including microarray and quantitative PCR. Acceptance of PCR-MPS as a forensic method will depend in part upon developing protocols and criteria that define the limitations of a method, including a defensible analytical threshold or method detection limit. This paper describes an approach to establish objective analytical thresholds suitable for multiplexed PCR-MPS methods. A definition is proposed for PCR-MPS method background noise, and an analytical threshold based on background noise is described.
Kappel, Kristina; Haase, Ilka; Käppel, Christine; Sotelo, Carmen G; Schröder, Ute
2017-11-01
Conventional Sanger sequencing of PCR products is the gold standard for species authentication of seafood products. However, this method is inappropriate for the analysis of products that might contain mixtures of species, such as tinned tuna. The purpose of this study was to test whether next-generation sequencing (NGS) can be a solution for the authentication of mixed products. Nine tuna samples containing mixtures of up to four species were prepared and subjected to an NGS approach targeting two short cytochrome b gene (cytb) fragments on the Illumina MiSeq platform. Sequence recovery was precise and admixtures of as low as 1% could be identified, depending on the species composition of the mixtures. Duplicate samples as well as two individual NGS runs produced very similar results. A first test of three commercial tinned tuna samples indicated the presence of different species in the same tin, although this is forbidden by EU law. Copyright © 2017 Elsevier Ltd. All rights reserved.
Alkaptonuria and Pompe disease in one patient: metabolic and molecular analysis.
Zouheir Habbal, Mohammad; Bou Assi, Tarek; Mansour, Hicham
2013-04-29
Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (C.670C>T and C.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual.
Alkaptonuria and pompe disease in one patient: metabolic and molecular analysis
Habbal, Mohammad Zouheir; Bou Assi, Tarek; Mansour, Hicham
2013-01-01
Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (C.670C>T and C.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual. PMID:23632174
Antosik, Karolina; Gnyś, Piotr; Jarosz-Chobot, Przemysława; Myśliwiec, Małgorzata; Szadkowska, Agnieszka; Małecki, Maciej; Młynarski, Wojciech; Borowiec, Maciej
2017-01-01
Monogenic diabetes is a rare disease caused by single gene mutations. Maturity onset diabetes of the young (MODY) is one of the major forms of monogenic diabetes recognised in the paediatric population. To date, 13 genes have been related to MODY development. The aim of the study was to analyse the sequence of the BCL2-associated agonist of cell death (BAD) gene in patients with clinical suspicion of GCK-MODY, but who were negative for glucokinase (GCK) gene mutations. A group of 122 diabetic patients were recruited from the "Polish Registry for Paediatric and Adolescent Diabetes - nationwide genetic screening for monogenic diabetes" project. The molecular testing was performed by Sanger sequencing. A total of 10 sequence variants of the BAD gene were identified in 122 analysed diabetic patients. Among the analysed patients suspected of MODY, one possible pathogenic variant was identified in one patient; however, further confirmation is required for a certain identification.
Liu, Gary W; Livesay, Brynn R; Kacherovsky, Nataly A; Cieslewicz, Maryelise; Lutz, Emi; Waalkes, Adam; Jensen, Michael C; Salipante, Stephen J; Pun, Suzie H
2015-08-19
Peptide ligands are used to increase the specificity of drug carriers to their target cells and to facilitate intracellular delivery. One method to identify such peptide ligands, phage display, enables high-throughput screening of peptide libraries for ligands binding to therapeutic targets of interest. However, conventional methods for identifying target binders in a library by Sanger sequencing are low-throughput, labor-intensive, and provide a limited perspective (<0.01%) of the complete sequence space. Moreover, the small sample space can be dominated by nonspecific, preferentially amplifying "parasitic sequences" and plastic-binding sequences, which may lead to the identification of false positives or exclude the identification of target-binding sequences. To overcome these challenges, we employed next-generation Illumina sequencing to couple high-throughput screening and high-throughput sequencing, enabling more comprehensive access to the phage display library sequence space. In this work, we define the hallmarks of binding sequences in next-generation sequencing data, and develop a method that identifies several target-binding phage clones for murine, alternatively activated M2 macrophages with a high (100%) success rate: sequences and binding motifs were reproducibly present across biological replicates; binding motifs were identified across multiple unique sequences; and an unselected, amplified library accurately filtered out parasitic sequences. In addition, we validate the Multiple Em for Motif Elicitation tool as an efficient and principled means of discovering binding sequences.
Shah, Kushani; Thomas, Shelby; Stein, Arnold
2013-01-01
In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C Sanger sequencing reactions. They prepare and run the gels, perform Southern blots (which require only 10 min), and detect sequencing ladders using a colorimetric detection system. Students enlarge their sequencing ladders from digital images of their small nylon membranes, and read the sequence manually. They compare their reads with the actual DNA sequence using BLAST2. After mastering the DNA sequencing system, students prepare their own DNA from a cheek swab, polymerase chain reaction-amplify a region of their DNA that encompasses a SNP of interest, and perform sequencing to determine their genotype at the SNP position. A family pedigree can also be constructed. The SNP chosen by the instructor was rs17822931, which is in the ABCC11 gene and is the determinant of human earwax type. Genotypes at the rs178229931 site vary in different ethnic populations. © 2013 by The International Union of Biochemistry and Molecular Biology.
DSAP: deep-sequencing small RNA analysis pipeline.
Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus
2010-07-01
DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
Multiplexed microsatellite recovery using massively parallel sequencing
Jennings, T.N.; Knaus, B.J.; Mullins, T.D.; Haig, S.M.; Cronn, R.C.
2011-01-01
Conservation and management of natural populations requires accurate and inexpensive genotyping methods. Traditional microsatellite, or simple sequence repeat (SSR), marker analysis remains a popular genotyping method because of the comparatively low cost of marker development, ease of analysis and high power of genotype discrimination. With the availability of massively parallel sequencing (MPS), it is now possible to sequence microsatellite-enriched genomic libraries in multiplex pools. To test this approach, we prepared seven microsatellite-enriched, barcoded genomic libraries from diverse taxa (two conifer trees, five birds) and sequenced these on one lane of the Illumina Genome Analyzer using paired-end 80-bp reads. In this experiment, we screened 6.1 million sequences and identified 356958 unique microreads that contained di- or trinucleotide microsatellites. Examination of four species shows that our conversion rate from raw sequences to polymorphic markers compares favourably to Sanger- and 454-based methods. The advantage of multiplexed MPS is that the staggering capacity of modern microread sequencing is spread across many libraries; this reduces sample preparation and sequencing costs to less than $400 (USD) per species. This price is sufficiently low that microsatellite libraries could be prepared and sequenced for all 1373 organisms listed as 'threatened' and 'endangered' in the United States for under $0.5M (USD).
Shirts, Brian H; Salipante, Stephen J; Casadei, Silvia; Ryan, Shawnia; Martin, Judith; Jacobson, Angela; Vlaskin, Tatyana; Koehler, Karen; Livingston, Robert J; King, Mary-Claire; Walsh, Tom; Pritchard, Colin C
2014-10-01
Single-exon inversions have rarely been described in clinical syndromes and are challenging to detect using Sanger sequencing. We report the case of a 40-year-old woman with adenomatous colon polyps too numerous to count and who had a complex inversion spanning the entire exon 10 in APC (the gene encoding for adenomatous polyposis coli), causing exon skipping and resulting in a frameshift and premature protein truncation. In this study, we employed complete APC gene sequencing using high-coverage next-generation sequencing by ColoSeq, analysis with BreakDancer and SLOPE software, and confirmatory transcript analysis. ColoSeq identified a complex small genomic rearrangement consisting of an inversion that results in translational skipping of exon 10 in the APC gene. This mutation would not have been detected by traditional sequencing or gene-dosage methods. We report a case of adenomatous polyposis resulting from a complex single-exon inversion. Our report highlights the benefits of large-scale sequencing methods that capture intronic sequences with high enough depth of coverage-as well as the use of informatics tools-to enable detection of small pathogenic structural rearrangements.
Detection of Emerging Vaccine-Related Polioviruses by Deep Sequencing.
Sahoo, Malaya K; Holubar, Marisa; Huang, ChunHong; Mohamed-Hadley, Alisha; Liu, Yuanyuan; Waggoner, Jesse J; Troy, Stephanie B; Garcia-Garcia, Lourdes; Ferreyra-Reyes, Leticia; Maldonado, Yvonne; Pinsky, Benjamin A
2017-07-01
Oral poliovirus vaccine can mutate to regain neurovirulence. To date, evaluation of these mutations has been performed primarily on culture-enriched isolates by using conventional Sanger sequencing. We therefore developed a culture-independent, deep-sequencing method targeting the 5' untranslated region (UTR) and P1 genomic region to characterize vaccine-related poliovirus variants. Error analysis of the deep-sequencing method demonstrated reliable detection of poliovirus mutations at levels of <1%, depending on read depth. Sequencing of viral nucleic acids from the stool of vaccinated, asymptomatic children and their close contacts collected during a prospective cohort study in Veracruz, Mexico, revealed no vaccine-derived polioviruses. This was expected given that the longest duration between sequenced sample collection and the end of the most recent national immunization week was 66 days. However, we identified many low-level variants (<5%) distributed across the 5' UTR and P1 genomic region in all three Sabin serotypes, as well as vaccine-related viruses with multiple canonical mutations associated with phenotypic reversion present at high levels (>90%). These results suggest that monitoring emerging vaccine-related poliovirus variants by deep sequencing may aid in the poliovirus endgame and efforts to ensure global polio eradication. Copyright © 2017 Sahoo et al.
Molecular Characterization of Transgenic Events Using Next Generation Sequencing Approach.
Guttikonda, Satish K; Marri, Pradeep; Mammadov, Jafar; Ye, Liang; Soe, Khaing; Richey, Kimberly; Cruse, James; Zhuang, Meibao; Gao, Zhifang; Evans, Clive; Rounsley, Steve; Kumpatla, Siva P
2016-01-01
Demand for the commercial use of genetically modified (GM) crops has been increasing in light of the projected growth of world population to nine billion by 2050. A prerequisite of paramount importance for regulatory submissions is the rigorous safety assessment of GM crops. One of the components of safety assessment is molecular characterization at DNA level which helps to determine the copy number, integrity and stability of a transgene; characterize the integration site within a host genome; and confirm the absence of vector DNA. Historically, molecular characterization has been carried out using Southern blot analysis coupled with Sanger sequencing. While this is a robust approach to characterize the transgenic crops, it is both time- and resource-consuming. The emergence of next-generation sequencing (NGS) technologies has provided highly sensitive and cost- and labor-effective alternative for molecular characterization compared to traditional Southern blot analysis. Herein, we have demonstrated the successful application of both whole genome sequencing and target capture sequencing approaches for the characterization of single and stacked transgenic events and compared the results and inferences with traditional method with respect to key criteria required for regulatory submissions.
A multiple-alignment based primer design algorithm for genetically highly variable DNA targets
2013-01-01
Background Primer design for highly variable DNA sequences is difficult, and experimental success requires attention to many interacting constraints. The advent of next-generation sequencing methods allows the investigation of rare variants otherwise hidden deep in large populations, but requires attention to population diversity and primer localization in relatively conserved regions, in addition to recognized constraints typically considered in primer design. Results Design constraints include degenerate sites to maximize population coverage, matching of melting temperatures, optimizing de novo sequence length, finding optimal bio-barcodes to allow efficient downstream analyses, and minimizing risk of dimerization. To facilitate primer design addressing these and other constraints, we created a novel computer program (PrimerDesign) that automates this complex procedure. We show its powers and limitations and give examples of successful designs for the analysis of HIV-1 populations. Conclusions PrimerDesign is useful for researchers who want to design DNA primers and probes for analyzing highly variable DNA populations. It can be used to design primers for PCR, RT-PCR, Sanger sequencing, next-generation sequencing, and other experimental protocols targeting highly variable DNA samples. PMID:23965160
Stam, Remco; Scheikl, Daniela; Tellier, Aurélien
2016-06-02
Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons
Haas, Brian J.; Gevers, Dirk; Earl, Ashlee M.; Feldgarden, Mike; Ward, Doyle V.; Giannoukos, Georgia; Ciulla, Dawn; Tabbaa, Diana; Highlander, Sarah K.; Sodergren, Erica; Methé, Barbara; DeSantis, Todd Z.; Petrosino, Joseph F.; Knight, Rob; Birren, Bruce W.
2011-01-01
Bacterial diversity among environmental samples is commonly assessed with PCR-amplified 16S rRNA gene (16S) sequences. Perceived diversity, however, can be influenced by sample preparation, primer selection, and formation of chimeric 16S amplification products. Chimeras are hybrid products between multiple parent sequences that can be falsely interpreted as novel organisms, thus inflating apparent diversity. We developed a new chimera detection tool called Chimera Slayer (CS). CS detects chimeras with greater sensitivity than previous methods, performs well on short sequences such as those produced by the 454 Life Sciences (Roche) Genome Sequencer, and can scale to large data sets. By benchmarking CS performance against sequences derived from a controlled DNA mixture of known organisms and a simulated chimera set, we provide insights into the factors that affect chimera formation such as sequence abundance, the extent of similarity between 16S genes, and PCR conditions. Chimeras were found to reproducibly form among independent amplifications and contributed to false perceptions of sample diversity and the false identification of novel taxa, with less-abundant species exhibiting chimera rates exceeding 70%. Shotgun metagenomic sequences of our mock community appear to be devoid of 16S chimeras, supporting a role for shotgun metagenomics in validating novel organisms discovered in targeted sequence surveys. PMID:21212162
Rowczenio, Dorota M; Gomes, Sónia Melo; Aróstegui, Juan I; Mensa-Vilaro, Anna; Omoyinmi, Ebun; Trojer, Hadija; Baginska, Anna; Baroja-Mazo, Alberto; Pelegrin, Pablo; Savic, Sinisa; Lane, Thirusha; Williams, Rene; Brogan, Paul; Lachmann, Helen J; Hawkins, Philip N
2017-01-01
Cryopyrin-associated periodic syndrome (CAPS) is caused by gain-of-function NLRP3 mutations. Recently, somatic NLRP3 mosaicism has been reported in some CAPS patients who were previously classified as "mutation-negative." We describe here the clinical and laboratory findings in eight British adult patients who presented with symptoms typical of CAPS other than an onset in mid-late adulthood. All patients underwent comprehensive clinical and laboratory investigations, including analysis of the NLRP3 gene using Sanger and amplicon-based deep sequencing (ADS) along with measurements of extracellular apoptosis-associated speck-like protein with CARD domain (ASC) aggregates. The clinical phenotype in all subjects was consistent with mid-spectrum CAPS, except a median age at disease onset of 50 years. Sanger sequencing of NLRP3 was non-diagnostic but ADS detected a somatic NLRP3 mutation in each case. In one patient, DNA isolated from blood demonstrated an increase in the mutant allele from 5 to 45% over 12 years. ASC aggregates in patients' serum measured during active disease were significantly higher than healthy controls. This series represents 8% of CAPS patients diagnosed in a single center, suggesting that acquired NLRP3 mutations may not be an uncommon cause of the syndrome and should be sought in all patients with late-onset symptoms otherwise compatible with CAPS. Steadily worsening CAPS symptoms in one patient were associated with clonal expansion of the mutant allele predominantly affecting myeloid cells. Two patients developed AA amyloidosis, which previously has only been reported in CAPS in association with life-long germline NLRP3 mutations.
A novel homozygous mutation in the FSHR gene is causative for primary ovarian insufficiency.
Liu, Hongli; Xu, Xiaofei; Han, Ting; Yan, Lei; Cheng, Lei; Qin, Yingying; Liu, Wen; Zhao, Shidou; Chen, Zi-Jiang
2017-12-01
To identify the potential FSHR mutation in a Chinese woman with primary ovarian insufficiency (POI). Genetic and functional studies. University-based reproductive medicine center. A POI patient, her family members, and another 192 control women with regular menstruation. Ovarian biopsy was performed in the patient. Sanger sequencing was carried out for the patient, her sister, and parents. The novel variant identified was further confirmed with the use of control subjects. Sanger sequencing and genotype analysis to identify the potential variant of the FSHR gene; hematoxylin and eosin staining of the ovarian section to observe the follicular development; Western blotting and immunofluorescence to detect FSH receptor (FSHR) expression; and cyclic adenosine monophosphate (cAMP) assay to monitor FSH-induced signaling. Histologic examination of the ovaries in the patient revealed follicular development up to the early antral stage. Mutational screening and genotype analysis of the FSHR gene identified a novel homozygous mutation c.175C>T (p.R59X) in exon 2, which was inherited in the autosomal recessive mode from her heterozygous parents but was absent in her sister and the 192 control women. Functional studies demonstrated that in vitro the nonsense mutation caused the loss of full-length FSHR expression and that p.R59X mutant showed no response to FSH stimulation in the cAMP level. The mutation p.R59X in FSHR is causative for POI by means of arresting folliculogenesis. Copyright © 2017 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Shiina, Takashi; Yamada, Yukiho; Aarnink, Alice; Suzuki, Shingo; Masuya, Anri; Ito, Sayaka; Ido, Daisuke; Yamanaka, Hisashi; Iwatani, Chizuru; Tsuchiya, Hideaki; Ishigaki, Hirohito; Itoh, Yasushi; Ogasawara, Kazumasa; Kulski, Jerzy K; Blancher, Antoine
2015-10-01
Although the low polymorphism of the major histocompatibility complex (MHC) transplantation genes in the Filipino cynomolgus macaque (Macaca fascicularis) is expected to have important implications in the selection and breeding of animals for medical research, detailed polymorphism information is still lacking for many of the duplicated class I genes. To better elucidate the degree and types of MHC polymorphisms and haplotypes in the Filipino macaque population, we genotyped 127 unrelated animals by the Sanger sequencing method and high-resolution pyrosequencing and identified 112 different alleles, 28 at cynomolgus macaque MHC (Mafa)-A, 54 at Mafa-B, 12 at Mafa-I, 11 at Mafa-E, and seven at Mafa-F alleles, of which 56 were newly described. Of them, the newly discovered Mafa-A8*01:01 lineage allele had low nucleotide similarities (<86%) with primate MHC class I genes, and it was also conserved in the Vietnamese and Indonesian populations. In addition, haplotype estimations revealed 17 Mafa-A, 23 Mafa-B, and 12 Mafa-E haplotypes integrated with 84 Mafa-class I haplotypes and Mafa-F alleles. Of these, the two Mafa-class I haplotypes, F/A/E/B-Hp1 and F/A/E/B-Hp2, had the highest haplotype frequencies at 10.6 and 10.2%, respectively. This suggests that large scale genetic screening of the Filipino macaque population would identify these and other high-frequency Mafa-class I haplotypes that could be used as MHC control animals for the benefit of biomedical research.
Marshall, Charla; Sturk-Andreaggi, Kimberly; Daniels-Higginbotham, Jennifer; Oliver, Robert Sean; Barritt-Ross, Suzanne; McMahon, Timothy P
2017-11-01
Next-generation ancient DNA technologies have the potential to assist in the analysis of degraded DNA extracted from forensic specimens. Mitochondrial genome (mitogenome) sequencing, specifically, may be of benefit to samples that fail to yield forensically relevant genetic information using conventional PCR-based techniques. This report summarizes the Armed Forces Medical Examiner System's Armed Forces DNA Identification Laboratory's (AFMES-AFDIL) performance evaluation of a Next-Generation Sequencing protocol for degraded and chemically treated past accounting samples. The procedure involves hybridization capture for targeted enrichment of mitochondrial DNA, massively parallel sequencing using Illumina chemistry, and an automated bioinformatic pipeline for forensic mtDNA profile generation. A total of 22 non-probative samples and associated controls were processed in the present study, spanning a range of DNA quantity and quality. Data were generated from over 100 DNA libraries by ten DNA analysts over the course of five months. The results show that the mitogenome sequencing procedure is reliable and robust, sensitive to low template (one ng control DNA) as well as degraded DNA, and specific to the analysis of the human mitogenome. Haplotypes were overall concordant between NGS replicates and with previously generated Sanger control region data. Due to the inherent risk for contamination when working with low-template, degraded DNA, a contamination assessment was performed. The consumables were shown to be void of human DNA contaminants and suitable for forensic use. Reagent blanks and negative controls were analyzed to determine the background signal of the procedure. This background signal was then used to set analytical and reporting thresholds, which were designated at 4.0X (limit of detection) and 10.0X (limit of quantiation) average coverage across the mitogenome, respectively. Nearly all human samples exceeded the reporting threshold, although coverage was reduced in chemically treated samples resulting in a ∼58% passing rate for these poor-quality samples. A concordance assessment demonstrated the reliability of the NGS data when compared to known Sanger profiles. One case sample was shown to be mixed with a co-processed sample and two reagent blanks indicated the presence of DNA above the analytical threshold. This contamination was attributed to sequencing crosstalk from simultaneously sequenced high-quality samples to include the positive control. Overall this study demonstrated that hybridization capture and Illumina sequencing provide a viable method for mitogenome sequencing of degraded and chemically treated skeletal DNA samples, yet may require alternative measures of quality control. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
White, K Makay; Matthews, Melinda K; Hughes, Rachel C; Sommer, Andrew J; Griffitts, Joel S; Newell, Peter D; Chaston, John M
2018-03-28
A metagenome wide association (MGWA) study of bacterial host association determinants in Drosophila predicted that LPS biosynthesis genes are significantly associated with host colonization. We were unable to create site-directed mutants for each of the predicted genes in Acetobacter , so we created an arrayed transposon insertion library using Acetobacter fabarum DsW_054 isolated from Drosophila Creation of the A. fabarum DsW_054 gene knock-out library was performed by combinatorial mapping and Illumina sequencing of random transposon insertion mutants. Transposon insertion locations for 6,418 mutants were successfully mapped, including hits within 63% of annotated genes in the A. fabarum DsW_054 genome. For 45/45 members of the library, insertion sites were verified by arbitrary PCR and Sanger sequencing. Mutants with insertions in four different LPS biosynthesis genes were selected from the library to validate the MGWA predictions. Insertion mutations in two genes biosynthetically upstream of Lipid-A formation, lpxC and lpxB , show significant differences in host association, whereas mutations in two genes encoding LPS biosynthesis functions downstream of Lipid-A biosynthesis had no effect. These results suggest an impact of bacterial cell surface molecules on the bacterial capacity for host association. Also, the transposon insertion mutant library will be a useful resource for ongoing research on the genetic basis for Acetobacter traits. Copyright © 2018 White et al.
Identifying mutations in Tunisian families with retinal dystrophy.
Habibi, Imen; Chebil, Ahmed; Falfoul, Yosra; Allaman-Pillet, Nathalie; Kort, Fedra; Schorderet, Daniel F; El Matri, Leila
2016-11-22
Retinal dystrophies (RD) are a rare genetic disorder with high genetic heterogeneity. This study aimed at identifying disease-causing variants in fifteen consanguineous Tunisian families. Full ophthalmic examination was performed. Index patients were subjected to IROme analysis or whole exome sequencing followed by homozygosity mapping. All detected variations were confirmed by direct Sanger sequencing. Mutation analysis in our patients revealed two compound heterozygous mutations p.(R91W);(V172D) in RPE65, and five novel homozygous mutations: p.R765C in CNGB1, p.H337R in PDE6B, splice site variant c.1129-2A > G and c.678_681delGAAG in FAM161A and c.1133 + 3_1133 + 6delAAGT in CERKL. The latter mutation impacts pre-mRNA splicing of CERKL. The other changes detected were six previously reported mutations in CNGB3 (p.R203*), ABCA4 (p.W782*), NR2E3 (p.R311Q), RPE65 (p.H182Y), PROM1 (c.1354dupT) and EYS (c.5928-2A > G). Segregation analysis in each family showed that all affected individuals were homozygotes and unaffected individuals were either heterozygote carriers or homozygous wild type allele. These results confirm the involvement of a large number of genes in RD in the Tunisian population.
Utturkar, Sagar M.; Klingeman, Dawn Marie; Land, Miriam L.; ...
2014-06-14
Our motivation with this work was to assess the potential of different types of sequence data combined with de novo and hybrid assembly approaches to improve existing draft genome sequences. Our results show Illumina, 454 and PacBio sequencing technologies were used to generate de novo and hybrid genome assemblies for four different bacteria, which were assessed for quality using summary statistics (e.g. number of contigs, N50) and in silico evaluation tools. Differences in predictions of multiple copies of rDNA operons for each respective bacterium were evaluated by PCR and Sanger sequencing, and then the validated results were applied as anmore » additional criterion to rank assemblies. In general, assemblies using longer PacBio reads were better able to resolve repetitive regions. In this study, the combination of Illumina and PacBio sequence data assembled through the ALLPATHS-LG algorithm gave the best summary statistics and most accurate rDNA operon number predictions. This study will aid others looking to improve existing draft genome assemblies. As to availability and implementation–all assembly tools except CLC Genomics Workbench are freely available under GNU General Public License.« less
Fonseca, Dora Janeth; Patiño, Liliana Catherine; Suárez, Yohjana Carolina; de Jesús Rodríguez, Asid; Mateus, Heidi Eliana; Jiménez, Karen Marcela; Ortega-Recalde, Oscar; Díaz-Yamal, Ivonne; Laissue, Paul
2015-07-01
To identify new molecular actors involved in nonsyndromic premature ovarian failure (POF) etiology. This is a retrospective case-control cohort study. University research group and IVF medical center. Twelve women affected by nonsyndromic POF. The control group included 176 women whose menopause had occurred after age 50 and had no antecedents regarding gynecological disease. A further 345 women from the same ethnic origin (general population group) were also recruited to assess allele frequency for potentially deleterious sequence variants. Next generation sequencing (NGS), Sanger sequencing, and bioinformatics analysis. The complete coding regions of 70 candidate genes were massively sequenced, via NGS, in POF patients. Bioinformatics and genetics were used to confirm NGS results and to identify potential sequence variants related to the disease pathogenesis. We have identified mutations in two novel genes, ADAMTS19 and BMPR2, that are potentially related to POF origin. LHCGR mutations, which might have contributed to the phenotype, were also detected. We thus recommend NGS as a powerful tool for identifying new molecular actors in POF and for future diagnostic/prognostic purposes. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.
VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C
2015-11-26
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Novel mutations of ABCB6 associated with autosomal dominant dyschromatosis universalis hereditaria.
Cui, Ying-Xia; Xia, Xin-Yi; Zhou, Yang; Gao, Lin; Shang, Xue-Jun; Ni, Tong; Wang, Wei-Ping; Fan, Xiao-Buo; Yin, Hong-Lin; Jiang, Shao-Jun; Yao, Bing; Hu, Yu-An; Wang, Gang; Li, Xiao-Jun
2013-01-01
Dyschromatosis universalis hereditaria (DUH) is a rare heterogeneous pigmentary genodermatosis, which was first described in 1933. The genetic cause has recently been discovered by the discovery of mutations in ABCB6. Here we investigated a Chinese family with typical features of autosomal dominant DUH and 3 unrelated patients with sporadic DUH. Skin tissues were obtained from the proband, of this family and the 3 sporadic patients. Histopathological examination and immunohistochemical analysis of ABCB6 were performed. Peripheral blood DNA samples were obtained from 21 affected, 14 unaffected, 11 spouses in the family and the 3 sporadic patients. A genome-wide linkage scan for the family was carried out to localize the causative gene. Exome sequencing was performed from 3 affected and 1 unaffected in the family. Sanger sequencing of ABCB6 was further used to identify the causative gene for all samples obtained from available family members, the 3 sporadic patients and a panel of 455 ethnically-matched normal Chinese individuals. Histopathological analysis showed melanocytes in normal control's skin tissue and the hyperpigmented area contained more melanized, mature melanosomes than those within the hypopigmented areas. Empty immature melanosomes were found in the hypopigmented melanocytes. Parametric multipoint linkage analysis produced a HLOD score of 4.68, with markers on chromosome 2q35-q37.2. A missense mutation (c.1663 C>A, p.Gln555Lys) in ABCB6 was identified in this family by exome and Sanger sequencing. The mutation perfectly cosegregated with the skin phenotype. An additional mutation (g.776 delC, c.459 delC) in ABCB6 was found in an unrelated sporadic patient. No mutation in ABCB6 was discovered in the other two sporadic patients. Neither of the two mutations was present in the 455 controls. Melanocytes showed positive immunoreactivity to ABCB6. Our data add new variants to the repertoire of ABCB6 mutations with DUH.
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.
Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D
2013-02-28
Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation
2013-01-01
Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change. PMID:23445355
A new sensitive PCR assay for one-step detection of 12 IDH1/2 mutations in glioma.
Catteau, Aurélie; Girardi, Hélène; Monville, Florence; Poggionovo, Cécile; Carpentier, Sabrina; Frayssinet, Véronique; Voss, Jesse; Jenkins, Robert; Boisselier, Blandine; Mokhtari, Karima; Sanson, Marc; Peyro-Saint-Paul, Hélène; Giannini, Caterina
2014-06-02
Mutations in isocitrate dehydrogenase genes IDH1 or IDH2 are frequent in glioma, and IDH mutation status is a strong diagnostic and prognostic marker. Current IDH mutation screening is performed with an immunohistochemistry (IHC) assay specific for IDH1 R132H, the most common mutation. Sequencing is recommended as a second-step test for IHC-negative or -equivocal cases. We developed and validated a new real-time quantitative polymerase chain reaction (PCR) assay for single-step detection of IDH1 R132H and 11 rare IDH1/2 mutations in formalin-fixed paraffin-embedded (FFPE) glioma samples. Performance of the IDH1/2 PCR assay was compared to IHC and Sanger sequencing. The IDH1/2 PCR assay combines PCR clamping for detection of 7 IDH1 and 5 IDH2 mutations, and Amplification Refractory Mutation System technology for specific identification of the 3 most common mutations (IDH1 R132H, IDH1 R132C, IDH2 R172K). Analytical sensitivity of the PCR assay for mutation detection was <5% for 11/12 mutations (mean: 3.3%), and sensitivity for mutation identification was very high (0.8% for IDH1 R132H; 1.2% for IDH1 R132C; 0.6% for IDH2 R172K). Assay performance was further validated on 171 clinical glioma FFPE samples; of these, 147 samples met the selection criteria and 146 DNA samples were successfully extracted. IDH1/2 status was successfully obtained in 91% of cases. All but one positive IDH1 R132H-IHC cases were concordantly detected by PCR and 3 were not detected by sequencing. Among the IHC-negative cases (n = 72), PCR detected 12 additional rare mutations (10 IDH1, 2 IDH2). All mutations detected by sequencing (n = 67) were concordantly detected by PCR and 5/66 sequencing-negative cases were PCR-positive (overall concordance: 96%). Analysis of synthetic samples representative of the 11 rare IDH1/2 mutations detected by the assay produced 100% correct results. The new IDH1/2 PCR assay has a high technical success rate and is more sensitive than Sanger sequencing. Positive concordance was 98% with IHC for IDH1 R132H detection and 100% with sequencing. The PCR assay can reliably be performed on FFPE samples and has a faster turnaround time than current IDH mutation detection algorithms. The assay should facilitate implementation of a comprehensive IDH1/2 testing protocol in routine clinical practice.
Etard, Christelle; Joshi, Swarnima; Stegmaier, Johannes; Mikut, Ralf; Strähle, Uwe
2017-12-01
A bottleneck in CRISPR/Cas9 genome editing is variable efficiencies of in silico-designed gRNAs. We evaluated the sensitivity of the TIDE method (Tracking of Indels by DEcomposition) introduced by Brinkman et al. in 2014 for assessing the cutting efficiencies of gRNAs in zebrafish. We show that this simple method, which involves bulk polymerase chain reaction amplification and Sanger sequencing, is highly effective in tracking well-performing gRNAs in pools of genomic DNA derived from injected embryos. The method is equally effective for tracing INDELs in heterozygotes.
Laghari, Muhammad Younis; Lashari, Punhal; Xu, Peng; Zhao, Zixia; Jiang, Li; Narejo, Naeem Tariq; Xin, Baoping; Sun, Xiaowen; Zhang, Yan
2016-01-01
Complete mitochondrial genome of fresh water giant catfish, Wallago attu, was isolated by LA PCR (TakaRa LAtaq, Dalian, China); and sequenced by Sanger's method to obtain the complete mitochondrial genome. The complete mitogenome was 15,639 bp in length and contains 13 typical vertebrate protein-coding genes, 2 rRNA and 22 tRNA genes. The whole genome base composition was estimated to be 31.17% A, 28.15% C, 15.55% G and 25.12% T. The complete mitochondrial genome of catfish, W. attu, provides the fundamental tools for genetic breeding.
Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences
Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.
2012-01-01
ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136
Novel Secondary Somatic Mutations in Ewing's Sarcoma and Desmoplastic Small Round Cell Tumors
Janku, Filip; Ludwig, Joseph A.; Naing, Aung; Benjamin, Robert S.; Brown, Robert E.; Anderson, Pete; Kurzrock, Razelle
2014-01-01
Background Ewing's sarcoma (ES) and desmoplastic small round cell tumors (DSRCT) are small round blue cell tumors driven by an N-terminal containing EWS translocation. Very few somatic mutations have been reported in ES, and none have been identified in DSRCT. The aim of this study is to explore potential actionable mutations in ES and DSRCT. Methodology Twenty eight patients with ES or DSRCT had tumor tissue available that could be analyzed by one of the following methods: 1) Next-generation exome sequencing platform; 2) Multiplex PCR/Mass Spectroscopy; 3) Polymerase chain reaction (PCR)-based single- gene mutation screening; 4) Sanger sequencing; 5) Morphoproteomics. Principal Findings Novel somatic mutations were identified in four out of 18 patients with advanced ES and two of 10 patients with advanced DSRCT (six out of 28 (21.4%));KRAS (n = 1), PTPRD (n = 1), GRB10 (n = 2), MET (n = 2) and PIK3CA (n = 1). One patient with both PTPRD and GRB10 mutations and one with a GRB10 mutation achieved a complete remission (CR) on an Insulin like growth factor 1 receptor (IGF1R) inhibitor based treatment. One patient, who achieved a partial remission (PR) with IGF1R inhibitor treatment, but later developed resistance, demonstrated a KRAS mutation in the post-treatment resistant tumor, but not in the pre-treatment tumor suggesting that the RAF/RAS/MEK pathway was activated with progression. Conclusions We have reported several different mutations in advanced ES and DSRCT that have direct implications for molecularly-directed targeted therapy. Our technology agnostic approach provides an initial mutational roadmap used in the path towards individualized combination therapy. PMID:25119929
Zahedi, Alireza; Monis, Paul; Gofton, Alexander W; Oskam, Charlotte L; Ball, Andrew; Bath, Andrew; Bartkow, Michael; Robertson, Ian; Ryan, Una
2018-05-01
As part of long-term monitoring of Cryptosporidium in water catchments serving Western Australia, New South Wales (Sydney) and Queensland, Australia, we characterised Cryptosporidium in a total of 5774 faecal samples from 17 known host species and 7 unknown bird samples, in 11 water catchment areas over a period of 30 months (July 2013 to December 2015). All samples were initially screened for Cryptosporidium spp. at the 18S rRNA locus using a quantitative PCR (qPCR). Positives samples were then typed by sequence analysis of an 825 bp fragment of the 18S gene and subtyped at the glycoprotein 60 (gp60) locus (832 bp). The overall prevalence of Cryptosporidium across the various hosts sampled was 18.3% (1054/5774; 95% CI, 17.3-19.3). Of these, 873 samples produced clean Sanger sequencing chromatograms, and the remaining 181 samples, which initially produced chromatograms suggesting the presence of multiple different sequences, were re-analysed by Next- Generation Sequencing (NGS) to resolve the presence of Cryptosporidium and the species composition of potential mixed infections. The overall prevalence of confirmed mixed infection was 1.7% (98/5774), and in the remaining 83 samples, NGS only detected one species of Cryptosporidium. Of the 17 Cryptosporidium species and four genotypes detected (Sanger sequencing combined with NGS), 13 are capable of infecting humans; C. parvum, C. hominis, C. ubiquitum, C. cuniculus, C. meleagridis, C. canis, C. felis, C. muris, C. suis, C. scrofarum, C. bovis, C. erinacei and C. fayeri. Oocyst numbers per gram of faeces (g -1 ) were also determined using qPCR, with medians varying from 6021-61,064 across the three states. The significant findings were the detection of C. hominis in cattle and kangaroo faeces and the high prevalence of C. parvum in cattle. In addition, two novel C. fayeri subtypes (IVaA11G3T1 and IVgA10G1T1R1) and one novel C. meleagridis subtype (IIIeA18G2R1) were identified. This is also the first report of C. erinacei in Australia. Future work to monitor the prevalence of Cryptosporidium species and subtypes in animals in these catchments is warranted. Copyright © 2018 Elsevier Ltd. All rights reserved.
Genetic Mapping and Exome Sequencing Identify Variants Associated with Five Novel Diseases
Puffenberger, Erik G.; Jinks, Robert N.; Sougnez, Carrie; Cibulskis, Kristian; Willert, Rebecca A.; Achilly, Nathan P.; Cassidy, Ryan P.; Fiorentini, Christopher J.; Heiken, Kory F.; Lawrence, Johnny J.; Mahoney, Molly H.; Miller, Christopher J.; Nair, Devika T.; Politi, Kristin A.; Worcester, Kimberly N.; Setton, Roni A.; DiPiazza, Rosa; Sherman, Eric A.; Eastman, James T.; Francklyn, Christopher; Robey-Bond, Susan; Rider, Nicholas L.; Gabriel, Stacey; Morton, D. Holmes; Strauss, Kevin A.
2012-01-01
The Clinic for Special Children (CSC) has integrated biochemical and molecular methods into a rural pediatric practice serving Old Order Amish and Mennonite (Plain) children. Among the Plain people, we have used single nucleotide polymorphism (SNP) microarrays to genetically map recessive disorders to large autozygous haplotype blocks (mean = 4.4 Mb) that contain many genes (mean = 79). For some, uninformative mapping or large gene lists preclude disease-gene identification by Sanger sequencing. Seven such conditions were selected for exome sequencing at the Broad Institute; all had been previously mapped at the CSC using low density SNP microarrays coupled with autozygosity and linkage analyses. Using between 1 and 5 patient samples per disorder, we identified sequence variants in the known disease-causing genes SLC6A3 and FLVCR1, and present evidence to strongly support the pathogenicity of variants identified in TUBGCP6, BRAT1, SNIP1, CRADD, and HARS. Our results reveal the power of coupling new genotyping technologies to population-specific genetic knowledge and robust clinical data. PMID:22279524
Huang, Xiaoyan; Tian, Mao; Li, Jiankang; Cui, Ling; Li, Min; Zhang, Jianguo
2017-11-01
Norrie disease (ND) is a rare X-linked genetic disorder, the main symptoms of which are congenital blindness and white pupils. It has been reported that ND is caused by mutations in the NDP gene. Although many mutations in NDP have been reported, the genetic cause for many patients remains unknown. In this study, the aim is to investigate the genetic defect in a five-generation family with typical symptoms of ND. To identify the causative gene, next-generation sequencing based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members using Sanger sequencing. We identified a novel missense variant (c.314C>A) located within the NDP gene. The mutation cosegregated within all affected individuals in the family and was not found in unaffected members. By happenstance, in this family, we also detected a known pathogenic variant of retinitis pigmentosa in a healthy individual. c.314C>A mutation of NDP gene is a novel mutation and broadens the genetic spectrum of ND.
Rivière, Jean-Baptiste; Mirzaa, Ghayda M.; O’Roak, Brian J.; Beddaoui, Margaret; Alcantara, Diana; Conway, Robert L.; St-Onge, Judith; Schwartzentruber, Jeremy A.; Gripp, Karen W.; Nikkel, Sarah M.; Worthylake, Thea; Sullivan, Christopher T.; Ward, Thomas R.; Butler, Hailly E.; Kramer, Nancy A.; Albrecht, Beate; Armour, Christine M.; Armstrong, Linlea; Caluseriu, Oana; Cytrynbaum, Cheryl; Drolet, Beth A.; Innes, A. Micheil; Lauzon, Julie L.; Lin, Angela E.; Mancini, Grazia M. S.; Meschino, Wendy S.; Reggin, James D.; Saggar, Anand K.; Lerman-Sagie, Tally; Uyanik, Gökhan; Weksberg, Rosanna; Zirn, Birgit; Beaulieu, Chandree L.; Majewski, Jacek; Bulman, Dennis E.; O’Driscoll, Mark; Shendure, Jay; Graham, John M.; Boycott, Kym M.; Dobyns, William B.
2012-01-01
Megalencephaly-capillary malformation (MCAP) and megalencephaly-polymicrogyria-polydactyly-hydrocephalus (MPPH) syndromes are sporadic overgrowth disorders associated with markedly enlarged brain size and other recognizable features1-5. We performed exome sequencing in three families with MCAP or MPPH and confirmed our initial observations in exomes from 7 MCAP and 174 control individuals, as well as in 40 additional megalencephaly subjects using a combination of Sanger sequencing, restriction-enzyme assays, and targeted deep sequencing. We identified de novo germline or postzygotic mutations in three core components of the phosphatidylinositol-3-kinase (PI3K)/AKT pathway. These include two mutations of AKT3, one recurrent mutation of PIK3R2 in 11 unrelated MPPH families, and 15 mostly postzygotic mutations of PIK3CA in 23 MCAP and one MPPH patients. Our data highlight the central role of PI3K/AKT signaling in vascular, limb and brain development, and emphasize the power of massively parallel sequencing in a challenging context of phenotypic and genetic heterogeneity combined with postzygotic mosaicism. PMID:22729224
Yuan, Lamei; Wu, Song; Xu, Hongbo; Xiao, Jingjing; Yang, Zhijian; Xia, Hong; Liu, An; Hu, Pengzhi; Lu, Anjie; Chen, Yulan; Xu, Fengping; Deng, Hao
2015-01-01
Familial hypophosphatemic rickets (HR), the most common inherited form of rickets, is a group of inherited renal phosphate wasting disorders characterized by growth retardation, rickets with bone deformities, osteomalacia, poor dental development, and hypophosphatemia. The purpose of this study was to identify the genetic defect responsible for familial HR in a four-generation Chinese Han pedigree by exome sequencing and Sanger sequencing. Clinical features include skeletal deformities, teeth abnormalities, hearing impairments and variable serum phosphate level in patients of this family. A novel deletion mutation, c.1553delT (p.F518Sfs*4), was identified in the X-linked phosphate regulating endopeptidase homolog gene (PHEX). The mutation is predicted to result in prematurely truncated and loss-of-function PHEX protein. Our data suggest that exome sequencing is a powerful tool to discover mutation(s) in HR, a disorder with genetic and clinical heterogeneity. The findings may also provide new insights into the cause and diagnosis of HR, and have implications for genetic counseling and clinical management.
Sheveleva, Anna; Kudryavtseva, Anna; Speranskaya, Anna; Belenikin, Maxim; Melnikova, Natalia; Chirkov, Sergei
2013-10-01
The near-complete (99.7 %) genome sequence of a novel Russian Plum pox virus (PPV) isolate Pk, belonging to the strain Winona (W), has been determined by 454 pyrosequencing with the exception of the thirty-one 5'-terminal nucleotides. This region was amplified using 5'RACE kit and sequenced by the Sanger method. Genomic RNA released from immunocaptured PPV particles was employed for generation of cDNA library using TransPlex Whole transcriptome amplification kit (WTA2, Sigma-Aldrich). The entire Pk genome has identity level of 92.8-94.5 % when compared to the complete nucleotide sequences of other PPV-W isolates (W3174, LV-141pl, LV-145bt, and UKR 44189), confirming a high degree of variability within the PPV-W strain. The isolates Pk and LV-141pl are most closely related. The Pk has been found in a wild plum (Prunus domestica) in a new region of Russia indicating widespread dissemination of the PPV-W strain in the European part of the former USSR.
Next-generation sequencing reveals a novel NDP gene mutation in a Chinese family with Norrie disease
Huang, Xiaoyan; Tian, Mao; Li, Jiankang; Cui, Ling; Li, Min; Zhang, Jianguo
2017-01-01
Purpose: Norrie disease (ND) is a rare X-linked genetic disorder, the main symptoms of which are congenital blindness and white pupils. It has been reported that ND is caused by mutations in the NDP gene. Although many mutations in NDP have been reported, the genetic cause for many patients remains unknown. In this study, the aim is to investigate the genetic defect in a five-generation family with typical symptoms of ND. Methods: To identify the causative gene, next-generation sequencing based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members using Sanger sequencing. Results: We identified a novel missense variant (c.314C>A) located within the NDP gene. The mutation cosegregated within all affected individuals in the family and was not found in unaffected members. By happenstance, in this family, we also detected a known pathogenic variant of retinitis pigmentosa in a healthy individual. Conclusion: c.314C>A mutation of NDP gene is a novel mutation and broadens the genetic spectrum of ND. PMID:29133643
Rivière, Jean-Baptiste; Mirzaa, Ghayda M; O'Roak, Brian J; Beddaoui, Margaret; Alcantara, Diana; Conway, Robert L; St-Onge, Judith; Schwartzentruber, Jeremy A; Gripp, Karen W; Nikkel, Sarah M; Worthylake, Thea; Sullivan, Christopher T; Ward, Thomas R; Butler, Hailly E; Kramer, Nancy A; Albrecht, Beate; Armour, Christine M; Armstrong, Linlea; Caluseriu, Oana; Cytrynbaum, Cheryl; Drolet, Beth A; Innes, A Micheil; Lauzon, Julie L; Lin, Angela E; Mancini, Grazia M S; Meschino, Wendy S; Reggin, James D; Saggar, Anand K; Lerman-Sagie, Tally; Uyanik, Gökhan; Weksberg, Rosanna; Zirn, Birgit; Beaulieu, Chandree L; Majewski, Jacek; Bulman, Dennis E; O'Driscoll, Mark; Shendure, Jay; Graham, John M; Boycott, Kym M; Dobyns, William B
2012-06-24
Megalencephaly-capillary malformation (MCAP) and megalencephaly-polymicrogyria-polydactyly-hydrocephalus (MPPH) syndromes are sporadic overgrowth disorders associated with markedly enlarged brain size and other recognizable features. We performed exome sequencing in 3 families with MCAP or MPPH, and our initial observations were confirmed in exomes from 7 individuals with MCAP and 174 control individuals, as well as in 40 additional subjects with megalencephaly, using a combination of Sanger sequencing, restriction enzyme assays and targeted deep sequencing. We identified de novo germline or postzygotic mutations in three core components of the phosphatidylinositol 3-kinase (PI3K)-AKT pathway. These include 2 mutations in AKT3, 1 recurrent mutation in PIK3R2 in 11 unrelated families with MPPH and 15 mostly postzygotic mutations in PIK3CA in 23 individuals with MCAP and 1 with MPPH. Our data highlight the central role of PI3K-AKT signaling in vascular, limb and brain development and emphasize the power of massively parallel sequencing in a challenging context of phenotypic and genetic heterogeneity combined with postzygotic mosaicism.
Russell, Jessica S; Caly, Leon; Kostecki, Renata; McGuinness, Sarah L; Carter, Glen; Bulach, Dieter; Seemann, Torsten; Stinear, Tim P; Baird, Rob; Catton, Mike; Druce, Julian
2018-06-11
Murray Valley Encephalitis virus (MVEV) is a mosquito-borne Flavivirus. Clinical presentation is rare but severe, with a case fatality rate of 15⁻30%. Here we report a case of MVEV from the cerebrospinal fluid (CSF) of a patient in the Northern Territory in Australia. Initial diagnosis was performed using both MVEV-specific real-time, and Pan- Flavivirus conventional, Polymerase Chain Reaction (PCR), with confirmation by Sanger sequencing. Subsequent isolation, the first from CSF, was conducted in Vero cells and the observed cytopathic effect was confirmed by increasing viral titre in the real-time PCR. Isolation allowed for full genome sequencing using the Scriptseq V2 RNASeq library preparation kit. A consensus genome for VIDRL-MVE was generated and phylogenetic analysis identified it as Genotype 2. This is the first reported isolation, and full genome sequencing of MVEV from CSF. It is also the first time Genotype 2 has been identified in humans. As such, this case has significant implications for public health surveillance, epidemiology, and the understanding of MVEV evolution.
Carbone, Ignazio; White, James B; Miadlikowska, Jolanta; Arnold, A Elizabeth; Miller, Mark A; Kauff, Frank; U'Ren, Jana M; May, Georgiana; Lutzoni, François
2017-04-15
High-quality phylogenetic placement of sequence data has the potential to greatly accelerate studies of the diversity, systematics, ecology and functional biology of diverse groups. We developed the Tree-Based Alignment Selector (T-BAS) toolkit to allow evolutionary placement and visualization of diverse DNA sequences representing unknown taxa within a robust phylogenetic context, and to permit the downloading of highly curated, single- and multi-locus alignments for specific clades. In its initial form, T-BAS v1.0 uses a core phylogeny of 979 taxa (including 23 outgroup taxa, as well as 61 orders, 175 families and 496 genera) representing all 13 classes of largest subphylum of Fungi-Pezizomycotina (Ascomycota)-based on sequence alignments for six loci (nr5.8S, nrLSU, nrSSU, mtSSU, RPB1, RPB2 ). T-BAS v1.0 has three main uses: (i) Users may download alignments and voucher tables for members of the Pezizomycotina directly from the reference tree, facilitating systematics studies of focal clades. (ii) Users may upload sequence files with reads representing unknown taxa and place these on the phylogeny using either BLAST or phylogeny-based approaches, and then use the displayed tree to select reference taxa to include when downloading alignments. The placement of unknowns can be performed for large numbers of Sanger sequences obtained from fungal cultures and for alignable, short reads of environmental amplicons. (iii) User-customizable metadata can be visualized on the tree. T-BAS Version 1.0 is available online at http://tbas.hpc.ncsu.edu . Registration is required to access the CIPRES Science Gateway and NSF XSEDE's large computational resources. icarbon@ncsu.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Analysis of quality raw data of second generation sequencers with Quality Assessment Software.
Ramos, Rommel Tj; Carneiro, Adriana R; Baumbach, Jan; Azevedo, Vasco; Schneider, Maria Pc; Silva, Artur
2011-04-18
Second generation technologies have advantages over Sanger; however, they have resulted in new challenges for the genome construction process, especially because of the small size of the reads, despite the high degree of coverage. Independent of the program chosen for the construction process, DNA sequences are superimposed, based on identity, to extend the reads, generating contigs; mismatches indicate a lack of homology and are not included. This process improves our confidence in the sequences that are generated. We developed Quality Assessment Software, with which one can review graphs showing the distribution of quality values from the sequencing reads. This software allow us to adopt more stringent quality standards for sequence data, based on quality-graph analysis and estimated coverage after applying the quality filter, providing acceptable sequence coverage for genome construction from short reads. Quality filtering is a fundamental step in the process of constructing genomes, as it reduces the frequency of incorrect alignments that are caused by measuring errors, which can occur during the construction process due to the size of the reads, provoking misassemblies. Application of quality filters to sequence data, using the software Quality Assessment, along with graphing analyses, provided greater precision in the definition of cutoff parameters, which increased the accuracy of genome construction.
Next generation sequencing (NGS): a golden tool in forensic toolkit.
Aly, S M; Sabri, D M
The DNA analysis is a cornerstone in contemporary forensic sciences. DNA sequencing technologies are powerful tools that enrich molecular sciences in the past based on Sanger sequencing and continue to glowing these sciences based on Next generation sequencing (NGS). Next generation sequencing has excellent potential to flourish and increase the molecular applications in forensic sciences by jumping over the pitfalls of the conventional method of sequencing. The main advantages of NGS compared to conventional method that it utilizes simultaneously a large number of genetic markers with high-resolution of genetic data. These advantages will help in solving several challenges such as mixture analysis and dealing with minute degraded samples. Based on these new technologies, many markers could be examined to get important biological data such as age, geographical origins, tissue type determination, external visible traits and monozygotic twins identification. It also could get data related to microbes, insects, plants and soil which are of great medico-legal importance. Despite the dozens of forensic research involving NGS, there are requirements before using this technology routinely in forensic cases. Thus, there is a great need to more studies that address robustness of these techniques. Therefore, this work highlights the applications of forensic sciences in the era of massively parallel sequencing.
NASA Astrophysics Data System (ADS)
Zhao, Cui; Zhang, Xiaojun; Liu, Chengzhang; Huan, Pin; Li, Fuhua; Xiang, Jianhai; Huang, Chao
2012-05-01
Little is known about the genome of Pacific white shrimp ( Litopenaeus vannamei). To address this, we conducted BAC (bacterial artificial chromosome) end sequencing of L. vannamei. We selected and sequenced 7 812 BAC clones from the BAC library LvHE from the two ends of the inserts by Sanger sequencing. After trimming and quality filtering, 11 279 BAC end sequences (BESs) including 4 609 pairedends BESs were obtained. The total length of the BESs was 4 340 753 bp, representing 0.18% of the L. vannamei haploid genome. The lengths of the BESs ranged from 100 bp to 660 bp with an average length of 385 bp. Analysis of the BESs indicated that the L. vannamei genome is AT-rich and that the primary repeats patterns were simple sequence repeats (SSRs) and low complexity sequences. Dinucleotide and hexanucleotide repeats were the most common SSR types in the BESs. The most abundant transposable element was gypsy, which may contribute to the generation of the large genome size of L. vannamei. We successfully annotated 4 519 BESs by BLAST searching, including genes involved in immunity and sex determination. Our results provide an important resource for functional gene studies, map construction and integration, and complete genome assembly for this species.
Illumina Production Sequencing at the DOE Joint Genome Institute - Workflow and Optimizations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tarver, Angela; Fern, Alison; Diego, Matthew San
2010-06-18
The U.S. Department of Energy (DOE) Joint Genome Institute?s (JGI) Production Sequencing group is committed to the generation of high-quality genomic DNA sequence to support the DOE mission areas of renewable energy generation, global carbon management, and environmental characterization and clean-up. Within the JGI?s Production Sequencing group, the Illumina Genome Analyzer pipeline has been established as one of three sequencing platforms, along with Roche/454 and ABI/Sanger. Optimization of the Illumina pipeline has been ongoing with the aim of continual process improvement of the laboratory workflow. These process improvement projects are being led by the JGI?s Process Optimization, Sequencing Technologies, Instrumentation&more » Engineering, and the New Technology Production groups. Primary focus has been on improving the procedural ergonomics and the technicians? operating environment, reducing manually intensive technician operations with different tools, reducing associated production costs, and improving the overall process and generated sequence quality. The U.S. DOE JGI was established in 1997 in Walnut Creek, CA, to unite the expertise and resources of five national laboratories? Lawrence Berkeley, Lawrence Livermore, Los Alamos, Oak Ridge, and Pacific Northwest ? along with HudsonAlpha Institute for Biotechnology. JGI is operated by the University of California for the U.S. DOE.« less
Scannell, Devin R.; Zill, Oliver A.; Rokas, Antonis; Payen, Celia; Dunham, Maitreya J.; Eisen, Michael B.; Rine, Jasper; Johnston, Mark; Hittinger, Chris Todd
2011-01-01
High-quality, well-annotated genome sequences and standardized laboratory strains fuel experimental and evolutionary research. We present improved genome sequences of three species of Saccharomyces sensu stricto yeasts: S. bayanus var. uvarum (CBS 7001), S. kudriavzevii (IFO 1802T and ZP 591), and S. mikatae (IFO 1815T), and describe their comparison to the genomes of S. cerevisiae and S. paradoxus. The new sequences, derived by assembling millions of short DNA sequence reads together with previously published Sanger shotgun reads, have vastly greater long-range continuity and far fewer gaps than the previously available genome sequences. New gene predictions defined a set of 5261 protein-coding orthologs across the five most commonly studied Saccharomyces yeasts, enabling a re-examination of the tempo and mode of yeast gene evolution and improved inferences of species-specific gains and losses. To facilitate experimental investigations, we generated genetically marked, stable haploid strains for all three of these Saccharomyces species. These nearly complete genome sequences and the collection of genetically marked strains provide a valuable toolset for comparative studies of gene function, metabolism, and evolution, and render Saccharomyces sensu stricto the most experimentally tractable model genus. These resources are freely available and accessible through www.SaccharomycesSensuStricto.org. PMID:22384314
Zheng, Yang; Cai, Jing; Li, JianWen; Li, Bo; Lin, Runmao; Tian, Feng; Wang, XiaoLing; Wang, Jun
2010-01-01
A 10-fold BAC library for giant panda was constructed and nine BACs were selected to generate finish sequences. These BACs could be used as a validation resource for the de novo assembly accuracy of the whole genome shotgun sequencing reads of giant panda newly generated by the Illumina GA sequencing technology. Complete sanger sequencing, assembly, annotation and comparative analysis were carried out on the selected BACs of a joint length 878 kb. Homologue search and de novo prediction methods were used to annotate genes and repeats. Twelve protein coding genes were predicted, seven of which could be functionally annotated. The seven genes have an average gene size of about 41 kb, an average coding size of about 1.2 kb and an average exon number of 6 per gene. Besides, seven tRNA genes were found. About 27 percent of the BAC sequence is composed of repeats. A phylogenetic tree was constructed using neighbor-join algorithm across five species, including giant panda, human, dog, cat and mouse, which reconfirms dog as the most related species to giant panda. Our results provide detailed sequence and structure information for new genes and repeats of giant panda, which will be helpful for further studies on the giant panda.
Characterization of a new apple luteovirus identified by high-throughput sequencing.
Liu, Huawei; Wu, Liping; Nikolaeva, Ekaterina; Peter, Kari; Liu, Zongrang; Mollov, Dimitre; Cao, Mengji; Li, Ruhui
2018-05-15
'Rapid Apple Decline' (RAD) is a newly emerging problem of young, dwarf apple trees in the Northeastern USA. The affected trees show trunk necrosis, cracking and canker before collapse in summer. In this study, we discovered and characterized a new luteovirus from apple trees in RAD-affected orchards using high-throughput sequencing (HTS) technology and subsequent Sanger sequencing. Illumina NextSeq sequencing was applied to total RNAs prepared from three diseased apple trees. Sequence reads were de novo assembled, and contigs were annotated by BLASTx. RT-PCR and 5'/3' RACE sequencing were used to obtain the complete genome of a new virus. RT-PCR was used to detect the virus. Three common apple viruses and a new luteovirus were identified from the diseased trees by HTS and RT-PCR. Sequence analyses of the complete genome of the new virus show that it is a new species of the genus Luteovirus in the family Luteoviridae. The virus is graft transmissible and detected by RT-PCR in apple trees in a couple of orchards. A new luteovirus and/or three known viruses were found to be associated with RAD. Molecular characterization of the new luteovirus provides important information for further investigation of its distribution and etiological role.
Machine Learned Replacement of N-Labels for Basecalled Sequences in DNA Barcoding.
Ma, Eddie Y T; Ratnasingham, Sujeevan; Kremer, Stefan C
2018-01-01
This study presents a machine learning method that increases the number of identified bases in Sanger Sequencing. The system post-processes a KB basecalled chromatogram. It selects a recoverable subset of N-labels in the KB-called chromatogram to replace with basecalls (A,C,G,T). An N-label correction is defined given an additional read of the same sequence, and a human finished sequence. Corrections are added to the dataset when an alignment determines the additional read and human agree on the identity of the N-label. KB must also rate the replacement with quality value of in the additional read. Corrections are only available during system training. Developing the system, nearly 850,000 N-labels are obtained from Barcode of Life Datasystems, the premier database of genetic markers called DNA Barcodes. Increasing the number of correct bases improves reference sequence reliability, increases sequence identification accuracy, and assures analysis correctness. Keeping with barcoding standards, our system maintains an error rate of percent. Our system only applies corrections when it estimates low rate of error. Tested on this data, our automation selects and recovers: 79 percent of N-labels from COI (animal barcode); 80 percent from matK and rbcL (plant barcodes); and 58 percent from non-protein-coding sequences (across eukaryotes).
Azab, Marwa Mohamed; Fayyad, Dalia Mukhtar
2018-01-01
The use of high throughput next generation technologies has allowed more comprehensive analysis than traditional Sanger sequencing. The specific aim of this study was to investigate the microbial diversity of primary endodontic infections using Illumina MiSeq sequencing platform in Egyptian patients. Samples were collected from 19 patients in Suez Canal University Hospital (Endodontic Department) using sterile # 15K file and paper points. DNA was extracted using Mo Bio power soil DNA isolation extraction kit followed by PCR amplification and agarose gel electrophoresis. The microbiome was characterized on the basis of the V3 and V4 hypervariable region of the 16S rRNA gene by using paired-end sequencing on Illumina MiSeq device. MOTHUR software was used in sequence filtration and analysis of sequenced data. A total of 1858 operational taxonomic units at 97% similarity were assigned to 26 phyla, 245 families, and 705 genera. Four main phyla Firmicutes, Bacteroidetes, Proteobacteria, and Synergistetes were predominant in all samples. At genus level, Prevotella, Bacillus, Porphyromonas, Streptococcus, and Bacteroides were the most abundant. Illumina MiSeq platform sequencing can be used to investigate oral microbiome composition of endodontic infections. Elucidating the ecology of endodontic infections is a necessary step in developing effective intracanal antimicrobials. PMID:29849646
Apert Syndrome: Molecularly Confirmed C.758C>G (P.Pro253Arg) in FGFR2
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cha Gon, Lee, E-mail: leechagon@eulji.ac.kr
A 5-day-old girl was referred to our clinic for evaluation of congenital malformations. She was identified with a pathogenic mutation c.758C>G (p.Pro253Arg) in FGFR2 gene using targeted exome sequencing. The de novo mutation was confirmed with Sanger sequencing in the patient and her parents. She showed occipital plagiocephaly with frontal bossing (Figure A and B). Skull frontal and lateral radiography revealed fusion of most of the sutures except coronal suture, with convolutional markings (Figure D and E). She had complete cleft palate (Figure C). Her fused bilateral hands showed type II syndactyly with complete syndactyly between the ring and themore » little fingers (Figure F1-F3). Both toes were simple syndactyly with side-to-side fusion of skin (Figure G1-)« less
Ahmad, Farooq; Nasir, Abdul; Thiele, Holger; Umair, Muhammad; Borck, Guntram; Ahmad, Wasim
2018-02-12
Ectodermal dysplasia syndactyly syndrome 1 (EDSS1) is a rare form of ectodermal dysplasia including anomalies of hair, nails, and teeth along with bilateral cutaneous syndactyly of hands and feet. In the present report, we performed a clinical and genetic characterization of a consanguineous Pakistani family with four individuals affected by EDSS1. We performed exome sequencing using DNA of one affected individual. Exome data analysis identified a novel homozygous missense variant (c.242T>C; p.(Leu81Pro)) in NECTIN4 (PVRL4). Sanger sequencing validated this variant and confirmed its cosegregation with the disease phenotype in the family members. Thus, our report adds a novel variant to the NECTIN4 mutation spectrum and contributes to the NECTIN4-related clinical characterization. © 2018 John Wiley & Sons Ltd/University College London.
Nephrocalcinosis (Enamel Renal Syndrome) Caused by Autosomal Recessive FAM20A Mutations
Jaureguiberry, Graciana; De la Dure-Molla, Muriel; Parry, David; Quentric, Mickael; Himmerkus, Nina; Koike, Toshiyasu; Poulter, James; Klootwijk, Enriko; Robinette, Steven L.; Howie, Alexander J.; Patel, Vaksha; Figueres, Marie-Lucile; Stanescu, Horia C.; Issler, Naomi; Nicholson, Jeremy K.; Bockenhauer, Detlef; Laing, Christopher; Walsh, Stephen B.; McCredie, David A.; Povey, Sue; Asselin, Audrey; Picard, Arnaud; Coulomb, Aurore; Medlar, Alan J.; Bailleul-Forestier, Isabelle; Verloes, Alain; Le Caignec, Cedric; Roussey, Gwenaelle; Guiol, Julien; Isidor, Bertrand; Logan, Clare; Shore, Roger; Johnson, Colin; Inglehearn, Christopher; Al-Bahlani, Suhaila; Schmittbuhl, Matthieu; Clauss, François; Huckert, Mathilde; Laugel, Virginie; Ginglinger, Emmanuelle; Pajarola, Sandra; Spartà, Giuseppina; Bartholdi, Deborah; Rauch, Anita; Addor, Marie-Claude; Yamaguti, Paulo M.; Safatle, Heloisa P.; Acevedo, Ana Carolina; Martelli-Júnior, Hercílio; dos Santos Netos, Pedro E.; Coletta, Ricardo D.; Gruessel, Sandra; Sandmann, Carolin; Ruehmann, Denise; Langman, Craig B.; Scheinman, Steven J.; Ozdemir-Ozenen, Didem; Hart, Thomas C.; Hart, P. Suzanne; Neugebauer, Ute; Schlatter, Eberhard; Houillier, Pascal; Gahl, William A.; Vikkula, Miikka; Bloch-Zupan, Agnès; Bleich, Markus; Kitagawa, Hiroshi; Unwin, Robert J.; Mighell, Alan; Berdal, Ariane; Kleta, Robert
2013-01-01
Background/Aims Calcium homeostasis requires regulated cellular and interstitial systems interacting to modulate the activity and movement of this ion. Disruption of these systems in the kidney results in nephrocalcinosis and nephrolithiasis, important medical problems whose pathogenesis is incompletely understood. Methods We investigated 25 patients from 16 families with unexplained nephrocalcinosis and characteristic dental defects (amelogenesis imperfecta, gingival hyperplasia, impaired tooth eruption). To identify the causative gene, we performed genome-wide linkage analysis, exome capture, next-generation sequencing, and Sanger sequencing. Results All patients had bi-allelic FAM20A mutations segregating with the disease; 20 different mutations were identified. Conclusions This au-tosomal recessive disorder, also known as enamel renal syndrome, of FAM20A causes nephrocalcinosis and amelogenesis imperfecta. We speculate that all individuals with biallelic FAM20A mutations will eventually show nephrocalcinosis. PMID:23434854
Biallelic Mutations in NBAS Cause Recurrent Acute Liver Failure with Onset in Infancy.
Haack, Tobias B; Staufner, Christian; Köpke, Marlies G; Straub, Beate K; Kölker, Stefan; Thiel, Christian; Freisinger, Peter; Baric, Ivo; McKiernan, Patrick J; Dikow, Nicola; Harting, Inga; Beisse, Flemming; Burgard, Peter; Kotzaeridou, Urania; Kühr, Joachim; Himbert, Urban; Taylor, Robert W; Distelmaier, Felix; Vockley, Jerry; Ghaloul-Gonzalez, Lina; Zschocke, Johannes; Kremer, Laura S; Graf, Elisabeth; Schwarzmayr, Thomas; Bader, Daniel M; Gagneur, Julien; Wieland, Thomas; Terrile, Caterina; Strom, Tim M; Meitinger, Thomas; Hoffmann, Georg F; Prokisch, Holger
2015-07-02
Acute liver failure (ALF) in infancy and childhood is a life-threatening emergency. Few conditions are known to cause recurrent acute liver failure (RALF), and in about 50% of cases, the underlying molecular cause remains unresolved. Exome sequencing in five unrelated individuals with fever-dependent RALF revealed biallelic mutations in NBAS. Subsequent Sanger sequencing of NBAS in 15 additional unrelated individuals with RALF or ALF identified compound heterozygous mutations in an additional six individuals from five families. Immunoblot analysis of mutant fibroblasts showed reduced protein levels of NBAS and its proposed interaction partner p31, both involved in retrograde transport between endoplasmic reticulum and Golgi. We recommend NBAS analysis in individuals with acute infantile liver failure, especially if triggered by fever. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Qi, Guoyan; Liu, Peng; Gu, Shanshan; Yang, Hongxia; Dong, Huimin; Xue, Yinping
2018-04-01
Our study is aimed to explore the underlying genetic basis of myasthenia gravis. We collected a Chinese pedigree with myasthenia gravis, and whole-exome sequencing was performed on the two affected siblings and their parents. The candidate pathogenic gene was identified by bioinformatics filtering, which was further verified by Sanger sequencing. The homozygous mutation c.G40A (p.V14M) in interferon gamma receptor 1was identified. Moreover, the mutation was also detected in 3 cases of 44 sporadic myasthenia gravis patients. The p.V14M substitution in interferon gamma receptor 1 may affect the signal peptide function and the translocation on cell membrane, which could disrupt the binding of the ligand of interferon gamma and antibody production, contributing to myasthenia gravis susceptibility. We discovered that a rare variant c.G40A in interferon gamma receptor 1 potentially contributes to the myasthenia gravis pathogenesis. Further functional studies are needed to confirm the effect of the interferon gamma receptor 1 on the myasthenia gravis phenotype.
Li, Juan; Ding, Yu; Chang, Guoying; Cheng, Qing; Li, Xin; Wang, Jian; Wang, Xiumin; Shen, Yiping
2017-02-10
To identify the genetic cause for a 11-year-old Chinese boy with Meier-Gorlin syndrome (MGS). Chromosomal microarray analysis (CMA) was used to detect potential variations, while whole exome sequencing (WES) was used to identify sequence variants. Sanger sequencing was used to confirm the suspected variants. The boy has featured short stature, microtia, small patella, slender body build, craniofacial anomalies, and small testes with normal gonadotropin. A complete uniparental disomy of chromosome 16 was revealed by CMA. WES has identified a novel homozygous mutation c.67A>G (p.Lys23Glu) in ORC6 gene mapped to chromosome 16. As predicted by Alamut functional software, the mutation may affect the function of structural domain of the ORC6 protein. The patient is probably the first diagnosed MGS case in China, who carried a novel homozygous mutation of the ORC6 gene and uniparental disomy of chromosome 16. The effect of this novel mutation on the growth and development needs to be further investigated.
Application of a mitochondrial DNA control region frequency database for UK domestic cats.
Ottolini, Barbara; Lall, Gurdeep Matharu; Sacchini, Federico; Jobling, Mark A; Wetton, Jon H
2017-03-01
DNA variation in 402bp of the mitochondrial control region flanked by repeat sequences RS2 and RS3 was evaluated by Sanger sequencing in 152 English domestic cats, in order to determine the significance of matching DNA sequences between hairs found with a victim's body and the suspect's pet cat. Whilst 95% of English cats possessed one of the twelve globally widespread mitotypes, four new variants were observed, the most common of which (2% frequency) was shared with the evidential samples. No significant difference in mitotype frequency was seen between 32 individuals from the locality of the crime and 120 additional cats from the rest of England, suggesting a lack of local population structure. However, significant differences were observed in comparison with frequencies in other countries, including the closely neighbouring Netherlands, highlighting the importance of appropriate genetic databases when determining the evidential significance of mitochondrial DNA evidence. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Reddy, Hemakumar M; Hamed, Sherifa A; Lek, Monkol; Mitsuhashi, Satomi; Estrella, Elicia; Jones, Michael D; Mahoney, Lane J; Duncan, Anna R; Cho, Kyung-Ah; Macarthur, Daniel G; Kunkel, Louis M; Kang, Peter B
2016-10-01
The genetic causes of limb-girdle muscular dystrophy (LGMD) have been studied in numerous countries, but such investigations have been limited in Egypt. A cohort of 30 families with suspected LGMD from Assiut, Egypt, was studied using immunohistochemistry, homozygosity mapping, Sanger sequencing, and whole exome sequencing. Six families were confirmed to have pathogenic mutations, 4 in SGCA and 2 in DMD. Of these, 3 families harbored a single nonsense mutation in SGCA, suggesting that this may be a common mutation in Assiut, Egypt, originating from a founder effect. The Assiut region in Egypt appears to share at least several of the common LGMD genes found in other parts of the world. It is notable that 4 of the 6 mutations were ascertained by means of whole exome sequencing, even though it was the last approach adopted. This illustrates the power of this technique for identifying causative mutations for muscular dystrophies. Muscle Nerve 54: 690-695, 2016. © 2016 Wiley Periodicals, Inc.
[Analysis of MAT1A gene mutations in a child affected with simple hypermethioninemia].
Sun, Yun; Ma, Dingyuan; Wang, Yanyun; Yang, Bin; Jiang, Tao
2017-02-10
To detect potential mutations of MAT1A gene in a child suspected with simple hypermethioninemia by MS/MS neonatal screening. Clinical data of the child was collected. Genomic DNA was extracted by a standard method and subjected to targeted sequencing using an Ion Ampliseq TM Inherited Disease Panel. Detected mutations were verified by Sanger sequencing. The child showed no clinical features except evaluated methionine. A novel compound mutation of the MAT1A gene, i.e., c.345delA and c.529C>T, was identified in the child. His father and mother were found to be heterozygous for the c.345delA mutation and c.529C>T mutation, respectively. The compound mutation c.345delA and c.529C>T of the MAT1A gene probably underlie the disease in the child. The semi-conductor sequencing has provided an important means for the diagnosis of hereditary diseases.
[Mutation analysis for a pedigree affected with keratitis-ichthyosis-deafness syndrome].
Li, Lulu; Li, Yuan; Lin, Wei; Zhao, Xiuli
2017-10-10
To identify mutation of GJB2 gene and provide genetic counseling for a family affected with keratitis-ichthyosis-deafness (KID) syndrome. Genomic DNA was extracted from peripheral blood samples with a standard phenol-chloroform method. PCR and Sanger sequencing were used to analyze potential mutation in the proband. Suspected mutation was verified with a PCR-high-resolution melting (PCR-HRM) method. T-clone sequencing was applied to determine the parental origin of the mutation. A heterozygous mutation, c.148G>A (p.Asp50Asn), which is located in the exon 1 of the GJB2 gene, was found in the proband. The results was confirmed by HRM analysis. Cloning sequencing suggested that the mutation was derived from the father's germline. The hot-spot mutation c.148G>A (p.Asp50Asn) in the GJB2 gene probably underlies the KID syndrome in this Chinese family. A PCR-HRM method has been established to rapidly detect common mutations associated with this disease.
Dusi, Sabrina; Valletta, Lorella; Haack, Tobias B; Tsuchiya, Yugo; Venco, Paola; Pasqualato, Sebastiano; Goffrini, Paola; Tigano, Marco; Demchenko, Nikita; Wieland, Thomas; Schwarzmayr, Thomas; Strom, Tim M; Invernizzi, Federica; Garavaglia, Barbara; Gregory, Allison; Sanford, Lynn; Hamada, Jeffrey; Bettencourt, Conceição; Houlden, Henry; Chiapparini, Luisa; Zorzi, Giovanna; Kurian, Manju A; Nardocci, Nardo; Prokisch, Holger; Hayflick, Susan; Gout, Ivan; Tiranti, Valeria
2014-01-02
Neurodegeneration with brain iron accumulation (NBIA) comprises a clinically and genetically heterogeneous group of disorders with progressive extrapyramidal signs and neurological deterioration, characterized by iron accumulation in the basal ganglia. Exome sequencing revealed the presence of recessive missense mutations in COASY, encoding coenzyme A (CoA) synthase in one NBIA-affected subject. A second unrelated individual carrying mutations in COASY was identified by Sanger sequence analysis. CoA synthase is a bifunctional enzyme catalyzing the final steps of CoA biosynthesis by coupling phosphopantetheine with ATP to form dephospho-CoA and its subsequent phosphorylation to generate CoA. We demonstrate alterations in RNA and protein expression levels of CoA synthase, as well as CoA amount, in fibroblasts derived from the two clinical cases and in yeast. This is the second inborn error of coenzyme A biosynthesis to be implicated in NBIA. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Rapid and Accurate Sequencing of Enterovirus Genomes Using MinION Nanopore Sequencer.
Wang, Ji; Ke, Yue Hua; Zhang, Yong; Huang, Ke Qiang; Wang, Lei; Shen, Xin Xin; Dong, Xiao Ping; Xu, Wen Bo; Ma, Xue Jun
2017-10-01
Knowledge of an enterovirus genome sequence is very important in epidemiological investigation to identify transmission patterns and ascertain the extent of an outbreak. The MinION sequencer is increasingly used to sequence various viral pathogens in many clinical situations because of its long reads, portability, real-time accessibility of sequenced data, and very low initial costs. However, information is lacking on MinION sequencing of enterovirus genomes. In this proof-of-concept study using Enterovirus 71 (EV71) and Coxsackievirus A16 (CA16) strains as examples, we established an amplicon-based whole genome sequencing method using MinION. We explored the accuracy, minimum sequencing time, discrimination and high-throughput sequencing ability of MinION, and compared its performance with Sanger sequencing. Within the first minute (min) of sequencing, the accuracy of MinION was 98.5% for the single EV71 strain and 94.12%-97.33% for 10 genetically-related CA16 strains. In as little as 14 min, 99% identity was reached for the single EV71 strain, and in 17 min (on average), 99% identity was achieved for 10 CA16 strains in a single run. MinION is suitable for whole genome sequencing of enteroviruses with sufficient accuracy and fine discrimination and has the potential as a fast, reliable and convenient method for routine use. Copyright © 2017 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
2011-01-01
Background Rust fungi are biotrophic basidiomycete plant pathogens that cause major diseases on plants and trees world-wide, affecting agriculture and forestry. Their biotrophic nature precludes many established molecular genetic manipulations and lines of research. The generation of genomic resources for these microbes is leading to novel insights into biology such as interactions with the hosts and guiding directions for breakthrough research in plant pathology. Results To support gene discovery and gene model verification in the genome of the wheat leaf rust fungus, Puccinia triticina (Pt), we have generated Expressed Sequence Tags (ESTs) by sampling several life cycle stages. We focused on several spore stages and isolated haustorial structures from infected wheat, generating 17,684 ESTs. We produced sequences from both the sexual (pycniospores, aeciospores and teliospores) and asexual (germinated urediniospores) stages of the life cycle. From pycniospores and aeciospores, produced by infecting the alternate host, meadow rue (Thalictrum speciosissimum), 4,869 and 1,292 reads were generated, respectively. We generated 3,703 ESTs from teliospores produced on the senescent primary wheat host. Finally, we generated 6,817 reads from haustoria isolated from infected wheat as well as 1,003 sequences from germinated urediniospores. Along with 25,558 previously generated ESTs, we compiled a database of 13,328 non-redundant sequences (4,506 singlets and 8,822 contigs). Fungal genes were predicted using the EST version of the self-training GeneMarkS algorithm. To refine the EST database, we compared EST sequences by BLASTN to a set of 454 pyrosequencing-generated contigs and Sanger BAC-end sequences derived both from the Pt genome, and to ESTs and genome reads from wheat. A collection of 6,308 fungal genes was identified and compared to sequences of the cereal rusts, Puccinia graminis f. sp. tritici (Pgt) and stripe rust, P. striiformis f. sp. tritici (Pst), and poplar leaf rust Melampsora species, and the corn smut fungus, Ustilago maydis (Um). While extensive homologies were found, many genes appeared novel and species-specific; over 40% of genes did not match any known sequence in existing databases. Focusing on spore stages, direct comparison to Um identified potential functional homologs, possibly allowing heterologous functional analysis in that model fungus. Many potentially secreted protein genes were identified by similarity searches against genes and proteins of Pgt and Melampsora spp., revealing apparent orthologs. Conclusions The current set of Pt unigenes contributes to gene discovery in this major cereal pathogen and will be invaluable for gene model verification in the genome sequence. PMID:21435244
Jang, Mi Ae; Lee, Chang Woo; Kim, Jin Kyung; Ki, Chang Seok
2015-11-01
Cornelia de Lange syndrome (CdLS) is a clinically and genetically heterogeneous congenital anomaly. Mutations in the NIPBL gene account for a half of the affected individuals. We describe a family with CdLS carrying a novel pathogenic variant of the SMC1A gene identified by exome sequencing. The proband was a 3-yr-old boy presenting with a developmental delay. He had distinctive facial features without major structural anomalies and tested negative for the NIPBL gene. His younger sister, mother, and maternal grandmother presented with mild mental retardation. By exome sequencing of the proband, a novel SMC1A variant, c.3178G>A, was identified, which was expected to cause an amino acid substitution (p.Glu1060Lys) in the highly conserved coiled-coil domain of the SMC1A protein. Sanger sequencing confirmed that the three female relatives with mental retardation also carry this variant. Our results reveal that SMC1A gene defects are associated with milder phenotypes of CdLS. Furthermore, we showed that exome sequencing could be a useful tool to identify pathogenic variants in patients with CdLS.
Implementation of Cloud based next generation sequencing data analysis in a clinical laboratory.
Onsongo, Getiria; Erdmann, Jesse; Spears, Michael D; Chilton, John; Beckman, Kenneth B; Hauge, Adam; Yohe, Sophia; Schomaker, Matthew; Bower, Matthew; Silverstein, Kevin A T; Thyagarajan, Bharat
2014-05-23
The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software.
Rama, Mélanie; Duflos, Claire; Melki, Isabelle; Bessis, Didier; Bonhomme, Axelle; Martin, Hélène; Doummar, Diane; Valence, Stéphanie; Rodriguez, Diana; Carme, Emilie; Genevieve, David; Heimdal, Ketil; Insalaco, Antonella; Franck, Nathalie; Queyrel-Moranne, Viviane; Tieulie, Nathalie; London, Jonathan; Uettwiller, Florence; Georgin-Lavialle, Sophie; Belot, Alexandre; Koné-Paut, Isabelle; Hentgen, Véronique; Boursier, Guilaine; Touitou, Isabelle; Sarrabay, Guillaume
2018-04-23
Deficiency of adenosine deaminase 2 (DADA2) is a recently described autoinflammatory disorder. Genetic analysis is required to confirm the diagnosis. We aimed to describe the identifying symptoms and genotypes of patients referred to our reference centres and to improve the indications for genetic testing. DNA from 66 patients with clinically suspected DADA2 were sequenced by Sanger or next-generation sequencing. Detailed epidemiological, clinical and biological features were collected by use of a questionnaire and were compared between patients with and without genetic confirmation of DADA2. We identified 13 patients (19.6%) carrying recessively inherited mutations in ADA2 that were predicted to be deleterious. Eight patients were compound heterozygous for mutations. Seven mutations were novel (4 missense variants, 2 predicted to affect mRNA splicing and 1 frameshift). The mean age of the 13 patients with genetic confirmation was 12.7 years at disease onset and 20.8 years at diagnosis. Phenotypic manifestations included fever (85%), vasculitis (85%) and neurological disorders (54%). Features best associated with a confirmatory genotype included fever with neurologic or cutaneous attacks (odds ratio [OR] 10.71, p = 0.003 and OR 10.9, p < 0.001), fever alone (OR 8.1, p = 0.01), and elevated C-reactive protein (CRP) level with neurologic involvement (OR 6.63, p = 0.017). Our proposed decision tree may help improve obtaining genetic confirmation of DADA2 in the context of autoinflammatory symptoms. Prerequisites for quick and low-cost Sanger analysis include one typical cutaneous or neurological sign, one marker of inflammation (fever or elevated CRP level), and recurrent or chronic attacks in adults.
ACSS2 gene variant associated with cleft lip and palate in two independent Hispanic populations.
Dodhia, Sonam; Celis, Katrina; Aylward, Alana; Cai, Yi; Fontana, Maria E; Trespalacios, Alberto; Hoffman, David C; Alfonso, Henry Ostos; Eisig, Sidney B; Su, Gloria H; Chung, Wendy K; Haddad, Joseph
2017-10-01
A candidate variant (p.Val496Ala) of the ACSS2 gene (T > C missense, rs59088485 variant at chr20: bp37 33509608) was previously found to consistently segregate with nonsyndromic cleft lip and/or palate (NSCLP) in three Honduran families. Objectives of this study were 1) to investigate the frequency of this ACSS2 variant in Honduran unrelated NSCLP patients and unrelated unaffected controls and 2) to investigate the frequency of this variant in Colombian unrelated affected NSCLP patients and unrelated unaffected controls. Case-control studies. Sanger sequencing of 99 unrelated Honduran NSCLP patients and 215 unrelated unaffected controls for the p.Val496Ala ACSS2 variant was used to determine the carrier frequency in NSCLP patients and controls. Sanger sequencing of 230 unrelated Colombian NSCLP patients and 146 unrelated unaffected controls for the p.Val496Ala ACSS2 variant was used to determine the carrier frequency in NSCLP patients and controls. In the Honduran population, the odds ratio of having NSCLP among carriers of the p.Val496Ala ACSS2 variant was 4.0 (P = .03), with a carrier frequency of seven of 99 (7.1%) in unrelated affected and four of 215 (1.9%) in unrelated unaffected individuals. In the Colombian population, the odds ratio of having NSCLP among carriers of the p.Val496Ala ACSS2 variant was 2.6 (P = .04), with a carrier frequency of 23 of 230 (10.0%) in unrelated affected and six of 146 (4.1%) in unrelated unaffected individuals. These findings support the role of ACSS2 in NSCLP in two independent Hispanic populations from Honduras and Colombia. NA Laryngoscope, 127:E336-E339, 2017. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
Audo, Isabelle; Bujakowska, Kinga; Mohand-Saïd, Saddek; Tronche, Sophie; Lancelot, Marie-Elise; Antonio, Aline; Germain, Aurore; Lonjou, Christine; Carpentier, Wassila; Sahel, José-Alain; Bhattacharya, Shomi; Zeitz, Christina
2011-01-01
To identify the genetic defect of a consanguineous Portuguese family with rod-cone dystrophy and varying degrees of decreased audition. A detailed ophthalmic and auditory examination was performed on a Portuguese patient with severe autosomal recessive rod-cone dystrophy. Known genetic defects were excluded by performing autosomal recessive retinitis pigmentosa (arRP) genotyping microarray analysis and by Sanger sequencing of the coding exons and flanking intronic regions of eyes shut homolog-drosophila (EYS) and chromosome 2 open reading frame 71 (C2orf71). Subsequently, genome-wide homozygosity mapping was performed in DNA samples from available family members using a 700K single nucleotide polymorphism (SNP) microarray. Candidate genes present in the significantly large homozygous regions were screened for mutations using Sanger sequencing. The largest homozygous region (~11 Mb) in the affected family members was mapped to chromosome 9, which harbors deafness, autosomal recessive 31 (DFNB31; a gene previously associated with Usher syndrome). Mutation analysis of DFNB31 in the index patient identified a novel one-base-pair deletion (c.737delC), which is predicted to lead to a truncated protein (p.Pro246HisfsX13) and co-segregated with the disease in the family. Ophthalmic examination of the index patient and the affected siblings showed severe rod-cone dystrophy. Pure tone audiometry revealed a moderate hearing loss in the index patient, whereas the affected siblings were reported with more profound and early onset hearing impairment. We report a novel truncating mutation in DFNB31 associated with severe rod-cone dystrophy and varying degrees of hearing impairment in a consanguineous family of Portuguese origin. This is the second report of DFNB31 implication in Usher type 2.
Rowczenio, Dorota M.; Gomes, Sónia Melo; Aróstegui, Juan I.; Mensa-Vilaro, Anna; Omoyinmi, Ebun; Trojer, Hadija; Baginska, Anna; Baroja-Mazo, Alberto; Pelegrin, Pablo; Savic, Sinisa; Lane, Thirusha; Williams, Rene; Brogan, Paul; Lachmann, Helen J.; Hawkins, Philip N.
2017-01-01
Cryopyrin-associated periodic syndrome (CAPS) is caused by gain-of-function NLRP3 mutations. Recently, somatic NLRP3 mosaicism has been reported in some CAPS patients who were previously classified as “mutation-negative.” We describe here the clinical and laboratory findings in eight British adult patients who presented with symptoms typical of CAPS other than an onset in mid-late adulthood. All patients underwent comprehensive clinical and laboratory investigations, including analysis of the NLRP3 gene using Sanger and amplicon-based deep sequencing (ADS) along with measurements of extracellular apoptosis-associated speck-like protein with CARD domain (ASC) aggregates. The clinical phenotype in all subjects was consistent with mid-spectrum CAPS, except a median age at disease onset of 50 years. Sanger sequencing of NLRP3 was non-diagnostic but ADS detected a somatic NLRP3 mutation in each case. In one patient, DNA isolated from blood demonstrated an increase in the mutant allele from 5 to 45% over 12 years. ASC aggregates in patients’ serum measured during active disease were significantly higher than healthy controls. This series represents 8% of CAPS patients diagnosed in a single center, suggesting that acquired NLRP3 mutations may not be an uncommon cause of the syndrome and should be sought in all patients with late-onset symptoms otherwise compatible with CAPS. Steadily worsening CAPS symptoms in one patient were associated with clonal expansion of the mutant allele predominantly affecting myeloid cells. Two patients developed AA amyloidosis, which previously has only been reported in CAPS in association with life-long germline NLRP3 mutations. PMID:29163488
A novel COLD-PCR/FMCA assay enhances the detection of low-abundance IDH1 mutations in gliomas.
Pang, Brendan; Durso, Mary B; Hamilton, Ronald L; Nikiforova, Marina N
2013-03-01
Point mutations in isocitrate dehydrogenase 1 (IDH1) have been identified in many gliomas. The detection of IDH1 mutations becomes challenging on suboptimal glioma biopsies when a limited number of tumor cells is available for analysis. Coamplification at lower denaturing-polymerase chain reaction (COLD-PCR) is a PCR technique that deliberately lowers the denaturing cycle temperature to selectively favor amplification of mutant alleles, allowing for the sensitive detection of low-abundance mutations. We developed a novel COLD-PCR assay on the LightCycler platform (Roche, Applied Science, Indianapolis, IN), using post-PCR fluorescent melting curve analysis (FMCA) for the detection of mutant IDH1 with a detection limit of 1%. Thirty-five WHO grade I to IV gliomas and 9 non-neoplastic brain and spinal cord biopsies were analyzed with this technique and the results were compared with the conventional real-time PCR and the Sanger sequencing analysis. COLD-PCR/FMCA was able to detect the most common IDH1 R132H mutation and rare mutation types including R132H, R132C, R132L, R132S, and R132G mutations. Twenty-five glioma cases were positive for IDH1 mutations by COLD-PCR/FMCA, and 23 gliomas were positive by the conventional real-time PCR and Sanger sequencing. A pilocytic astrocytoma (PA I) and a glioblastoma multiforme (GBM IV) showed low-abundance IDH1 mutations detected by COLD-PCR/FMCA. The remaining 10 glioma and 9 non-neoplastic samples were negative by all the 3 methods. In summary, we report a novel COLD-PCR/FMCA method that provides rapid and sensitive detection of IDH1 mutations in formalin-fixed paraffin-embedded tissue and can be used in the clinical setting to assess the small brain biopsies.
Xp11.2 translocation renal cell carcinoma with PSF-TFE3 rearrangement.
Zhong, Minghao; Weisman, Paul; Zhu, Bing; Brassesco, Maria; Yang, Youfeng; Linehan, W Marston; Merino, Maria J; Zhang, David; Rohan, Stephen; Cai, Dongming; Yang, Ximing
2013-06-01
Xp11.2 translocation renal cell carcinoma (Xp11.2 RCC) is a subtype of RCC characterized by translocations involving a breakpoint at the TFE3 gene (Xp11.2). Moderate to strong nuclear TFE3 immunoreactivity has been recognized as a specific diagnostic marker for this type of tumor. However, exclusive cytoplasmic localization of a TFE3 fusion protein was reported in UOK 145 cells, a cell line derived from an Xp11.2 RCC harboring the PSF-TFE3 translocation. If reproducible using immunohistochemistry (IHC), this finding would have important implications for pathologists in the diagnosis of Xp11.2 RCC, calling into question the specificity of nuclear immunoreactivity for TFE3 in these tumors. The purpose of this study was to determine whether the above-noted cytoplasmic localization of the TFE3 fusion protein could be reproduced using IHC. UOK 145 cells and fresh frozen tissue from 2 clinical cases of Xp11.2 RCC found to harbor the PSF-TFE3 gene rearrangement (by cytogenetic testing) were collected. All samples were subjected to histopathologic evaluation by board-certified pathologists, TFE3 IHC, reverse transcription polymerase chain reaction, and Sanger sequencing analysis. A strong nuclear TFE3 immunoreactivity was demonstrated in all samples including the UOK 145 cell line. No cytoplasmic immunoreactivity was seen. Reverse transcription polymerase chain reaction and Sanger sequencing confirmed the previously reported PSF-TFE3 gene fusion between exon 9 of PSF and exon 6 of TFE3 in the UOK 145 cell line and in one of 2 clinical cases of Xp11.2 RCC. A novel PSF-TFE3 gene fusion between exon 9 of PSF and exon 5 of TFE3 was detected in the second clinical case of Xp11.2 RCC.
Minichromosome maintenance complex component 8 mutations cause primary ovarian insufficiency.
Dou, Xiaoyun; Guo, Ting; Li, Guangyu; Zhou, LiGuang; Qin, Yingying; Chen, Zi-Jiang
2016-11-01
To investigate whether mutations in the minichromosome maintenance complex component 8 (MCM8) gene were present in 192 patients with sporadic primary ovarian insufficiency (POI). Retrospective case-control cohort study. University-based reproductive medicine center. A total of 192 patients with sporadic POI and 312 control women with regular menstruation (192 age-matched women and 120 women >45 years old). Sanger sequencing was performed in patients with sporadic POI, and potentially pathogenic variants were confirmed in matched controls. DNA damage was induced by mitomycinC (MMC) treatment, and DNA repair capacity was evaluated by histone H2AX phosphorylation level. Sanger sequencing for MCM8 was performed in 192 patients with sporadic POI, and functional experiments were performed to explore the deleterious effects of mutations identified. Two novel missense variants in MCM8, c. A950T (p. H317L), and c. A1802G (p. H601R), were identified in two patients with POI but absent in 312 controls (the upper 90% confidence limit for the proportion 2/192 is 2.24%). The HeLa cells overexpressing mutant p. H317L and p. H601R showed higher sensitivity to MMC compared with wild type. Furthermore, mutant p. H317L showed decreased repair capacity after MMC treatment with much more histone H2AX phosphorylation remaining after 2 hours of recovery. Our result suggests novel mutations p. H317L and p. H601R in the MCM8 gene are potentially causative for POI by dysfunctional DNA repair. Copyright © 2016 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
BRAFV600E mutation in the diagnosis of unicystic ameloblastoma.
Pereira, Núbia Braga; Pereira, Karuza Maria Alves; Coura, Bruna Pizziolo; Diniz, Marina Gonçalves; de Castro, Wagner Henriques; Gomes, Carolina Cavalieri; Gomez, Ricardo Santiago
2016-11-01
Unicystic ameloblastoma, an odontogenic neoplasm, presents clinical and radiographic similarities with dentigerous and radicular cysts, non-neoplastic lesions. It is not always possible to reach a final diagnosis with the incisional biopsy, leading to inappropriate treatment. The BRAFV600E activating mutation has been reported in a high proportion of ameloblastomas. The purpose of the study was to assess the utility of the detection of the BRAFV600E mutation in the differential diagnosis of unicystic ameloblastoma with dentigerous and radicular cysts. Twenty-six archival samples were included, comprising eight unicystic ameloblastomas (UAs), nine dentigerous and nine radicular cysts. The mutation was assessed in all samples by anti-BRAFV600E (clone VE1) immunohistochemistry (IHC) and by TaqMan mutation detection qPCR assay. Sanger sequencing was further carried out when samples showed conflicting results in the IHC and qPCR. Although all UAs (8/8) showed positive uniform BRAFV600E staining along the epithelial lining length, the mutation was not confirmed by qPCR and Sanger sequencing in three samples. Positive staining for the BRAFV600E protein was observed in one dentigerous cyst, but it was not confirmed by the molecular methods. Furthermore, 2/9 dentigerous cysts and 2/9 radicular cysts showed non-specific immunostaining of the epithelium or plasma cells. None of the dentigerous or radicular cysts cases presented the BRAFV600E mutation in the qPCR assay. The BRAFV600E antibody (clone VE1) IHC may show non-specific staining, but molecular assays may be useful for the diagnosis of unicystic ameloblastoma, in conjunction with clinical, radiological and histopathological features. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Reijnders, Margot R F; Janowski, Robert; Alvi, Mohsan; Self, Jay E; van Essen, Ton J; Vreeburg, Maaike; Rouhl, Rob P W; Stevens, Servi J C; Stegmann, Alexander P A; Schieving, Jolanda; Pfundt, Rolph; van Dijk, Katinke; Smeets, Eric; Stumpel, Connie T R M; Bok, Levinus A; Cobben, Jan Maarten; Engelen, Marc; Mansour, Sahar; Whiteford, Margo; Chandler, Kate E; Douzgou, Sofia; Cooper, Nicola S; Tan, Ene-Choo; Foo, Roger; Lai, Angeline H M; Rankin, Julia; Green, Andrew; Lönnqvist, Tuula; Isohanni, Pirjo; Williams, Shelley; Ruhoy, Ilene; Carvalho, Karen S; Dowling, James J; Lev, Dorit L; Sterbova, Katalin; Lassuthova, Petra; Neupauerová, Jana; Waugh, Jeff L; Keros, Sotirios; Clayton-Smith, Jill; Smithson, Sarah F; Brunner, Han G; van Hoeckel, Ceciel; Anderson, Mel; Clowes, Virginia E; Siu, Victoria Mok; DDD study, The; Selber, Paulo; Leventer, Richard J; Nellaker, Christoffer; Niessing, Dierk; Hunt, David; Baralle, Diana
2018-01-01
Background De novo mutations in PURA have recently been described to cause PURA syndrome, a neurodevelopmental disorder characterised by severe intellectual disability (ID), epilepsy, feeding difficulties and neonatal hypotonia. Objectives To delineate the clinical spectrum of PURA syndrome and study genotype-phenotype correlations. Methods Diagnostic or research-based exome or Sanger sequencing was performed in individuals with ID. We systematically collected clinical and mutation data on newly ascertained PURA syndrome individuals, evaluated data of previously reported individuals and performed a computational analysis of photographs. We classified mutations based on predicted effect using 3D in silico models of crystal structures of Drosophila-derived Pur-alpha homologues. Finally, we explored genotype-phenotype correlations by analysis of both recurrent mutations as well as mutation classes. Results We report mutations in PURA (purine-rich element binding protein A) in 32 individuals, the largest cohort described so far. Evaluation of clinical data, including 22 previously published cases, revealed that all have moderate to severe ID and neonatal-onset symptoms, including hypotonia (96%), respiratory problems (57%), feeding difficulties (77%), exaggerated startle response (44%), hypersomnolence (66%) and hypothermia (35%). Epilepsy (54%) and gastrointestinal (69%), ophthalmological (51%) and endocrine problems (42%) were observed frequently. Computational analysis of facial photographs showed subtle facial dysmorphism. No strong genotype-phenotype correlation was identified by subgrouping mutations into functional classes. Conclusion We delineate the clinical spectrum of PURA syndrome with the identification of 32 additional individuals. The identification of one individual through targeted Sanger sequencing points towards the clinical recognisability of the syndrome. Genotype-phenotype analysis showed no significant correlation between mutation classes and disease severity. PMID:29097605
Kumar, Anupam; Pathak, Pankaj; Purkait, Suvendu; Faruq, Mohammed; Jha, Prerana; Mallick, Supriya; Suri, Vaishali; Sharma, Mehar C; Suri, Ashish; Sarkar, Chitra
2015-03-01
Pediatric oligodendrogliomas (pODGs) are rare central nervous system tumors, and comparatively little is known about their molecular pathogenesis. Co-deletion of 1p/19q; and IDH1, CIC, and FUBP1 mutations, which are molecular signatures of adult oligodendrogliomas, are extremely rare in pODGs. In this report, two pODGs, one each of grade II and grade III, were evaluated using clinical, radiological, histopathologic, and follow-up methods. IDH1, TP53, CIC, H3F3A, and BRAF-V600 E mutations were analyzed by Sanger sequencing and immunohistochemical methods, and 1p/19q co-deletion was analyzed by fluorescence in situ hybridization. PDGFRA amplification, BRAF gain, intragenic duplication of FGFR-TKD, and KIAA1549-BRAF fusion (validated by Sanger sequencing) were analyzed by real-time reverse transcription PCR. Notably, both cases showed the oncogenic KIAA1549_Ex15-BRAF_Ex9 fusion transcript. Further, immunohistochemical analysis showed activation of the MAPK/ERK pathway in both of these cases. However, neither 1p/19q co-deletion; IDH1, TP53, CIC, H3F3A, nor BRAF-V600 E mutation; PDGFRA amplification; BRAF gain; nor duplication of FGFR-TKD was identified. Overall, this study highlights that pODGs can harbor the KIAA1549-BRAF fusion with aberrant MAPK/ERK signaling, and there exists an option of targeting these pathways in such patients. These results indicate that pODGs with the KIAA1549-BRAF fusion may represent a subset of this rare tumor that shares molecular and genetic features of pilocytic astrocytomas. These findings will increase our understanding of pODGs and may have clinical implications. Copyright © 2015 Elsevier Inc. All rights reserved.
Bybee, Seth M; Bracken-Grissom, Heather; Haynes, Benjamin D; Hermansen, Russell A; Byers, Robert L; Clement, Mark J; Udall, Joshua A; Wilcox, Edward R; Crandall, Keith A
2011-01-01
Next-gen sequencing technologies have revolutionized data collection in genetic studies and advanced genome biology to novel frontiers. However, to date, next-gen technologies have been used principally for whole genome sequencing and transcriptome sequencing. Yet many questions in population genetics and systematics rely on sequencing specific genes of known function or diversity levels. Here, we describe a targeted amplicon sequencing (TAS) approach capitalizing on next-gen capacity to sequence large numbers of targeted gene regions from a large number of samples. Our TAS approach is easily scalable, simple in execution, neither time-nor labor-intensive, relatively inexpensive, and can be applied to a broad diversity of organisms and/or genes. Our TAS approach includes a bioinformatic application, BarcodeCrucher, to take raw next-gen sequence reads and perform quality control checks and convert the data into FASTA format organized by gene and sample, ready for phylogenetic analyses. We demonstrate our approach by sequencing targeted genes of known phylogenetic utility to estimate a phylogeny for the Pancrustacea. We generated data from 44 taxa using 68 different 10-bp multiplexing identifiers. The overall quality of data produced was robust and was informative for phylogeny estimation. The potential for this method to produce copious amounts of data from a single 454 plate (e.g., 325 taxa for 24 loci) significantly reduces sequencing expenses incurred from traditional Sanger sequencing. We further discuss the advantages and disadvantages of this method, while offering suggestions to enhance the approach.
Bybee, Seth M.; Bracken-Grissom, Heather; Haynes, Benjamin D.; Hermansen, Russell A.; Byers, Robert L.; Clement, Mark J.; Udall, Joshua A.; Wilcox, Edward R.; Crandall, Keith A.
2011-01-01
Next-gen sequencing technologies have revolutionized data collection in genetic studies and advanced genome biology to novel frontiers. However, to date, next-gen technologies have been used principally for whole genome sequencing and transcriptome sequencing. Yet many questions in population genetics and systematics rely on sequencing specific genes of known function or diversity levels. Here, we describe a targeted amplicon sequencing (TAS) approach capitalizing on next-gen capacity to sequence large numbers of targeted gene regions from a large number of samples. Our TAS approach is easily scalable, simple in execution, neither time-nor labor-intensive, relatively inexpensive, and can be applied to a broad diversity of organisms and/or genes. Our TAS approach includes a bioinformatic application, BarcodeCrucher, to take raw next-gen sequence reads and perform quality control checks and convert the data into FASTA format organized by gene and sample, ready for phylogenetic analyses. We demonstrate our approach by sequencing targeted genes of known phylogenetic utility to estimate a phylogeny for the Pancrustacea. We generated data from 44 taxa using 68 different 10-bp multiplexing identifiers. The overall quality of data produced was robust and was informative for phylogeny estimation. The potential for this method to produce copious amounts of data from a single 454 plate (e.g., 325 taxa for 24 loci) significantly reduces sequencing expenses incurred from traditional Sanger sequencing. We further discuss the advantages and disadvantages of this method, while offering suggestions to enhance the approach. PMID:22002916
Navigating the tip of the genomic iceberg: Next-generation sequencing for plant systematics.
Straub, Shannon C K; Parks, Matthew; Weitemier, Kevin; Fishbein, Mark; Cronn, Richard C; Liston, Aaron
2012-02-01
Just as Sanger sequencing did more than 20 years ago, next-generation sequencing (NGS) is poised to revolutionize plant systematics. By combining multiplexing approaches with NGS throughput, systematists may no longer need to choose between more taxa or more characters. Here we describe a genome skimming (shallow sequencing) approach for plant systematics. Through simulations, we evaluated optimal sequencing depth and performance of single-end and paired-end short read sequences for assembly of nuclear ribosomal DNA (rDNA) and plastomes and addressed the effect of divergence on reference-guided plastome assembly. We also used simulations to identify potential phylogenetic markers from low-copy nuclear loci at different sequencing depths. We demonstrated the utility of genome skimming through phylogenetic analysis of the Sonoran Desert clade (SDC) of Asclepias (Apocynaceae). Paired-end reads performed better than single-end reads. Minimum sequencing depths for high quality rDNA and plastome assemblies were 40× and 30×, respectively. Divergence from the reference significantly affected plastome assembly, but relatively similar references are available for most seed plants. Deeper rDNA sequencing is necessary to characterize intragenomic polymorphism. The low-copy fraction of the nuclear genome was readily surveyed, even at low sequencing depths. Nearly 160000 bp of sequence from three organelles provided evidence of phylogenetic incongruence in the SDC. Adoption of NGS will facilitate progress in plant systematics, as whole plastome and rDNA cistrons, partial mitochondrial genomes, and low-copy nuclear markers can now be efficiently obtained for molecular phylogenetics studies.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum
DOE Office of Scientific and Technical Information (OSTI.GOV)
VanBuren, Robert; Bryant, Doug; Edger, Patrick P.
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum
VanBuren, Robert; Bryant, Doug; Edger, Patrick P.; ...
2015-11-11
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less
Analysis of CHRNA7 rare variants in autism spectrum disorder susceptibility.
Bacchelli, Elena; Battaglia, Agatino; Cameli, Cinzia; Lomartire, Silvia; Tancredi, Raffaella; Thomson, Susanne; Sutcliffe, James S; Maestrini, Elena
2015-04-01
Chromosome 15q13.3 recurrent microdeletions are causally associated with a wide range of phenotypes, including autism spectrum disorder (ASD), seizures, intellectual disability, and other psychiatric conditions. Whether the reciprocal microduplication is pathogenic is less certain. CHRNA7, encoding for the alpha7 subunit of the neuronal nicotinic acetylcholine receptor, is considered the likely culprit gene in mediating neurological phenotypes in 15q13.3 deletion cases. To assess if CHRNA7 rare variants confer risk to ASD, we performed copy number variant analysis and Sanger sequencing of the CHRNA7 coding sequence in a sample of 135 ASD cases. Sequence variation in this gene remains largely unexplored, given the existence of a fusion gene, CHRFAM7A, which includes a nearly identical partial duplication of CHRNA7. Hence, attempts to sequence coding exons must distinguish between CHRNA7 and CHRFAM7A, making next-generation sequencing approaches unreliable for this purpose. A CHRNA7 microduplication was detected in a patient with autism and moderate cognitive impairment; while no rare damaging variants were identified in the coding region, we detected rare variants in the promoter region, previously described to functionally reduce transcription. This study represents the first sequence variant analysis of CHRNA7 in a sample of idiopathic autism. © 2015 Wiley Periodicals, Inc.
Impact of Next Generation Sequencing Techniques in Food Microbiology
Mayo, Baltasar; Rachid, Caio T. C. C; Alegría, Ángel; Leite, Analy M. O; Peixoto, Raquel S; Delgado, Susana
2014-01-01
Understanding the Maxam-Gilbert and Sanger sequencing as the first generation, in recent years there has been an explosion of newly-developed sequencing strategies, which are usually referred to as next generation sequencing (NGS) techniques. NGS techniques have high-throughputs and produce thousands or even millions of sequences at the same time. These sequences allow for the accurate identification of microbial taxa, including uncultivable organisms and those present in small numbers. In specific applications, NGS provides a complete inventory of all microbial operons and genes present or being expressed under different study conditions. NGS techniques are revolutionizing the field of microbial ecology and have recently been used to examine several food ecosystems. After a short introduction to the most common NGS systems and platforms, this review addresses how NGS techniques have been employed in the study of food microbiota and food fermentations, and discusses their limits and perspectives. The most important findings are reviewed, including those made in the study of the microbiota of milk, fermented dairy products, and plant-, meat- and fish-derived fermented foods. The knowledge that can be gained on microbial diversity, population structure and population dynamics via the use of these technologies could be vital in improving the monitoring and manipulation of foods and fermented food products. They should also improve their safety. PMID:25132799
A massive parallel sequencing workflow for diagnostic genetic testing of mismatch repair genes
Hansen, Maren F; Neckmann, Ulrike; Lavik, Liss A S; Vold, Trine; Gilde, Bodil; Toft, Ragnhild K; Sjursen, Wenche
2014-01-01
The purpose of this study was to develop a massive parallel sequencing (MPS) workflow for diagnostic analysis of mismatch repair (MMR) genes using the GS Junior system (Roche). A pathogenic variant in one of four MMR genes, (MLH1, PMS2, MSH6, and MSH2), is the cause of Lynch Syndrome (LS), which mainly predispose to colorectal cancer. We used an amplicon-based sequencing method allowing specific and preferential amplification of the MMR genes including PMS2, of which several pseudogenes exist. The amplicons were pooled at different ratios to obtain coverage uniformity and maximize the throughput of a single-GS Junior run. In total, 60 previously identified and distinct variants (substitutions and indels), were sequenced by MPS and successfully detected. The heterozygote detection range was from 19% to 63% and dependent on sequence context and coverage. We were able to distinguish between false-positive and true-positive calls in homopolymeric regions by cross-sample comparison and evaluation of flow signal distributions. In addition, we filtered variants according to a predefined status, which facilitated variant annotation. Our study shows that implementation of MPS in routine diagnostics of LS can accelerate sample throughput and reduce costs without compromising sensitivity, compared to Sanger sequencing. PMID:24689082
Clinical and molecular characterization of KCNT1-related severe early-onset epilepsy
Nair, Umesh; Malhotra, Sony; Meyer, Esther; Trump, Natalie; Gazina, Elena V.; Papandreou, Apostolos; Ngoh, Adeline; Ackermann, Sally; Ambegaonkar, Gautam; Appleton, Richard; Desurkar, Archana; Eltze, Christin; Kneen, Rachel; Kumar, Ajith V.; Lascelles, Karine; Montgomery, Tara; Ramesh, Venkateswaran; Samanta, Rajib; Scott, Richard H.; Tan, Jeen; Whitehouse, William; Poduri, Annapurna; Scheffer, Ingrid E.; Chong, W.K. “Kling”; Cross, J. Helen; Topf, Maya; Petrou, Steven
2018-01-01
Objective To characterize the phenotypic spectrum, molecular genetic findings, and functional consequences of pathogenic variants in early-onset KCNT1 epilepsy. Methods We identified a cohort of 31 patients with epilepsy of infancy with migrating focal seizures (EIMFS) and screened for variants in KCNT1 using direct Sanger sequencing, a multiple-gene next-generation sequencing panel, and whole-exome sequencing. Additional patients with non-EIMFS early-onset epilepsy in whom we identified KCNT1 variants on local diagnostic multiple gene panel testing were also included. When possible, we performed homology modeling to predict the putative effects of variants on protein structure and function. We undertook electrophysiologic assessment of mutant KCNT1 channels in a xenopus oocyte model system. Results We identified pathogenic variants in KCNT1 in 12 patients, 4 of which are novel. Most variants occurred de novo. Ten patients had a clinical diagnosis of EIMFS, and the other 2 presented with early-onset severe nocturnal frontal lobe seizures. Three patients had a trial of quinidine with good clinical response in 1 patient. Computational modeling analysis implicates abnormal pore function (F346L) and impaired tetramer formation (F502V) as putative disease mechanisms. All evaluated KCNT1 variants resulted in marked gain of function with significantly increased channel amplitude and variable blockade by quinidine. Conclusions Gain-of-function KCNT1 pathogenic variants cause a spectrum of severe focal epilepsies with onset in early infancy. Currently, genotype-phenotype correlations are unclear, although clinical outcome is poor for the majority of cases. Further elucidation of disease mechanisms may facilitate the development of targeted treatments, much needed for this pharmacoresistant genetic epilepsy. PMID:29196579
Identification, variation and transcription of pneumococcal repeat sequences
2011-01-01
Background Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. Results Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. Conclusions BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/. PMID:21333003
Razmara, Ehsan; Bitarafan, Fatemeh; Esmaeilzadeh-Gharehdaghi, Elika; Almadani, Navid; Garshasbi, Masoud
2018-03-01
Targeted next-generation sequencing (NGS) provides a consequential opportunity to elucidate genetic factors in known diseases, particularly in profoundly heterogeneous disorders such as non-syndromic hearing loss (NSHL). Hearing impairments could be classified into syndromic and non-syndromic types. This study intended to assess the significance of mutations in these genes to the autosomal recessive/dominant non-syndromic genetic load among Iranian families. Two families were involved in this research and two patients were examined by targeted next-generation sequencing. Here we report two novel mutations in the MYO7A and EYA1 genes in two patients detected by targeted NGS. They were confirmed by Sanger sequencing and quantitative real-time PCR techniques. In this investigation, we identified a novel mutation in MYO7A , c.3751G>C, p.A1251P, along with another previously identified mutation (c.1708C>T) in one of the cases. This mutation is located in the MYTH4 protein domain which is a pivotal domain for the myosin function. Another finding in this research was a novel de-novo deletion which deletes the entire EYA1 coding region (EX1-18 DEL). Mutations in EYA1 gene have been found in branchiootorenal (BOR) syndrome. Interestingly the patient with EYA1 deletion did not show any other additional clinical implications apart from HL. This finding might argue for the sole involvement of EYA1 function in the mechanism of hearing. This investigation exhibited that the novel mutations in MYO7A , c.3751G>C, p.A1251P, and EYA1 , EX1-18 DEL, were associated with NSHL. Our research increased the mutation spectrum of hearing loss in the Iranian population.
Diagnosis of genetic predisposition for lactose intolerance by high resolution melting analysis.
Delacour, Hervé; Leduc, Amandine; Louçano-Perdriat, Andréa; Plantamura, Julie; Ceppa, Franck
2017-02-01
Lactose, the principle sugar in milk, is a disaccharide hydrolyzed by intestinal lactase into glucose and galactose, which are absorbed directly by diffusion in the intestine. The decline of lactase expression (or hypolactasia) in intestinal microvilli after weaning is a normal phenomenon in mammals known as lactase deficiency. It is observed in nearly 75% of the world population and is an inherited autosomal recessive trait with incomplete penetrance. It is caused by SNPs in a regulatory element for lactase gene. In Indo-European, lactase deficiency is associated with rs4982235 SNP (or -13910C>T). The aim of this study is to describe a method based on high resolution melting for rapidly detecting genetic predisposition to lactose intolerance. Analytical performance of the assay was assessed by evaluating within and betwwen-run precision and by comparing the results (n = 50 patients) obtained with the HRM assay to those obtained with the gold standard (Sanger sequencing of the region of interest). In silico prediction of HRM curves was performed to evaluate the potential impact of the other SNPs described within the PCR product on the HRM analytical performances. The assay has good performance (CV <0.2% during the between-run study). A perfect agreement with the gold standard method was observed. The presence of other polymorphisms within the amplified sequence is detected, the misclassification risk is low. This assay can be used for rapidly diagnosing genetic predisposition to lactose intolerance.
Application of COLD-PCR for improved detection of KRAS mutations in clinical samples.
Zuo, Zhuang; Chen, Su S; Chandra, Pranil K; Galbincea, John M; Soape, Matthew; Doan, Steven; Barkoh, Bedia A; Koeppen, Hartmut; Medeiros, L Jeffrey; Luthra, Rajyalakshmi
2009-08-01
KRAS mutations have been detected in approximately 30% of all human tumors, and have been shown to predict response to some targeted therapies. The most common KRAS mutation-detection strategy consists of conventional PCR and direct sequencing. This approach has a 10-20% detection sensitivity depending on whether pyrosequencing or Sanger sequencing is used. To improve detection sensitivity, we compared our conventional method with the recently described co-amplification-at-lower denaturation-temperature PCR (COLD-PCR) method, which selectively amplifies minority alleles. In COLD-PCR, the critical denaturation temperature is lowered to 80 degrees C (vs 94 degrees C in conventional PCR). The sensitivity of COLD-PCR was determined by assessing serial dilutions. Fifty clinical samples were used, including 20 fresh bone-marrow aspirate specimens and the formalin-fixed paraffin-embedded (FFPE) tissue of 30 solid tumors. Implementation of COLD-PCR was straightforward and required no additional cost for reagents or instruments. The method was specific and reproducible. COLD-PCR successfully detected mutations in all samples that were positive by conventional PCR, and enhanced the mutant-to-wild-type ratio by >4.74-fold, increasing the mutation detection sensitivity to 1.5%. The enhancement of mutation detection by COLD-PCR inversely correlated with the tumor-cell percentage in a sample. In conclusion, we validated the utility and superior sensitivity of COLD-PCR for detecting KRAS mutations in a variety of hematopoietic and solid tumors using either fresh or fixed, paraffin-embedded tissue.
Generation and validation of homozygous fluorescent knock-in cells using CRISPR-Cas9 genome editing.
Koch, Birgit; Nijmeijer, Bianca; Kueblbeck, Moritz; Cai, Yin; Walther, Nike; Ellenberg, Jan
2018-06-01
Gene tagging with fluorescent proteins is essential for investigations of the dynamic properties of cellular proteins. CRISPR-Cas9 technology is a powerful tool for inserting fluorescent markers into all alleles of the gene of interest (GOI) and allows functionality and physiological expression of the fusion protein. It is essential to evaluate such genome-edited cell lines carefully in order to preclude off-target effects caused by (i) incorrect insertion of the fluorescent protein, (ii) perturbation of the fusion protein by the fluorescent proteins or (iii) nonspecific genomic DNA damage by CRISPR-Cas9. In this protocol, we provide a step-by-step description of our systematic pipeline to generate and validate homozygous fluorescent knock-in cell lines.We have used the paired Cas9D10A nickase approach to efficiently insert tags into specific genomic loci via homology-directed repair (HDR) with minimal off-target effects. It is time-consuming and costly to perform whole-genome sequencing of each cell clone to check for spontaneous genetic variations occurring in mammalian cell lines. Therefore, we have developed an efficient validation pipeline of the generated cell lines consisting of junction PCR, Southern blotting analysis, Sanger sequencing, microscopy, western blotting analysis and live-cell imaging for cell-cycle dynamics. This protocol takes between 6 and 9 weeks. With this protocol, up to 70% of the targeted genes can be tagged homozygously with fluorescent proteins, thus resulting in physiological levels and phenotypically functional expression of the fusion proteins.
Thomas, Anna C.; Williams, Hywel; Setó-Salvia, Núria; Bacchelli, Chiara; Jenkins, Dagan; O’Sullivan, Mary; Mengrelis, Konstantinos; Ishida, Miho; Ocaka, Louise; Chanudet, Estelle; James, Chela; Lescai, Francesco; Anderson, Glenn; Morrogh, Deborah; Ryten, Mina; Duncan, Andrew J.; Pai, Yun Jin; Saraiva, Jorge M.; Ramos, Fabiana; Farren, Bernadette; Saunders, Dawn; Vernay, Bertrand; Gissen, Paul; Straatmaan-Iwanowska, Anna; Baas, Frank; Wood, Nicholas W.; Hersheson, Joshua; Houlden, Henry; Hurst, Jane; Scott, Richard; Bitner-Glindzicz, Maria; Moore, Gudrun E.; Sousa, Sérgio B.; Stanier, Philip
2014-01-01
Intellectual disability and cerebellar atrophy occur together in a large number of genetic conditions and are frequently associated with microcephaly and/or epilepsy. Here we report the identification of causal mutations in Sorting Nexin 14 (SNX14) found in seven affected individuals from three unrelated consanguineous families who presented with recessively inherited moderate-severe intellectual disability, cerebellar ataxia, early-onset cerebellar atrophy, sensorineural hearing loss, and the distinctive association of progressively coarsening facial features, relative macrocephaly, and the absence of seizures. We used homozygosity mapping and whole-exome sequencing to identify a homozygous nonsense mutation and an in-frame multiexon deletion in two families. A homozygous splice site mutation was identified by Sanger sequencing of SNX14 in a third family, selected purely by phenotypic similarity. This discovery confirms that these characteristic features represent a distinct and recognizable syndrome. SNX14 encodes a cellular protein containing Phox (PX) and regulator of G protein signaling (RGS) domains. Weighted gene coexpression network analysis predicts that SNX14 is highly coexpressed with genes involved in cellular protein metabolism and vesicle-mediated transport. All three mutations either directly affected the PX domain or diminished SNX14 levels, implicating a loss of normal cellular function. This manifested as increased cytoplasmic vacuolation as observed in cultured fibroblasts. Our findings indicate an essential role for SNX14 in neural development and function, particularly in development and maturation of the cerebellum. PMID:25439728
Thomas, Anna C; Williams, Hywel; Setó-Salvia, Núria; Bacchelli, Chiara; Jenkins, Dagan; O'Sullivan, Mary; Mengrelis, Konstantinos; Ishida, Miho; Ocaka, Louise; Chanudet, Estelle; James, Chela; Lescai, Francesco; Anderson, Glenn; Morrogh, Deborah; Ryten, Mina; Duncan, Andrew J; Pai, Yun Jin; Saraiva, Jorge M; Ramos, Fabiana; Farren, Bernadette; Saunders, Dawn; Vernay, Bertrand; Gissen, Paul; Straatmaan-Iwanowska, Anna; Baas, Frank; Wood, Nicholas W; Hersheson, Joshua; Houlden, Henry; Hurst, Jane; Scott, Richard; Bitner-Glindzicz, Maria; Moore, Gudrun E; Sousa, Sérgio B; Stanier, Philip
2014-11-06
Intellectual disability and cerebellar atrophy occur together in a large number of genetic conditions and are frequently associated with microcephaly and/or epilepsy. Here we report the identification of causal mutations in Sorting Nexin 14 (SNX14) found in seven affected individuals from three unrelated consanguineous families who presented with recessively inherited moderate-severe intellectual disability, cerebellar ataxia, early-onset cerebellar atrophy, sensorineural hearing loss, and the distinctive association of progressively coarsening facial features, relative macrocephaly, and the absence of seizures. We used homozygosity mapping and whole-exome sequencing to identify a homozygous nonsense mutation and an in-frame multiexon deletion in two families. A homozygous splice site mutation was identified by Sanger sequencing of SNX14 in a third family, selected purely by phenotypic similarity. This discovery confirms that these characteristic features represent a distinct and recognizable syndrome. SNX14 encodes a cellular protein containing Phox (PX) and regulator of G protein signaling (RGS) domains. Weighted gene coexpression network analysis predicts that SNX14 is highly coexpressed with genes involved in cellular protein metabolism and vesicle-mediated transport. All three mutations either directly affected the PX domain or diminished SNX14 levels, implicating a loss of normal cellular function. This manifested as increased cytoplasmic vacuolation as observed in cultured fibroblasts. Our findings indicate an essential role for SNX14 in neural development and function, particularly in development and maturation of the cerebellum. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Methylation-sensitive enrichment of minor DNA alleles using a double-strand DNA-specific nuclease.
Liu, Yibin; Song, Chen; Ladas, Ioannis; Fitarelli-Kiehl, Mariana; Makrigiorgos, G Mike
2017-04-07
Aberrant methylation changes, often present in a minor allelic fraction in clinical samples such as plasma-circulating DNA (cfDNA), are potentially powerful prognostic and predictive biomarkers in human disease including cancer. We report on a novel, highly-multiplexed approach to facilitate analysis of clinically useful methylation changes in minor DNA populations. Methylation Specific Nuclease-assisted Minor-allele Enrichment (MS-NaME) employs a double-strand-specific DNA nuclease (DSN) to remove excess DNA with normal methylation patterns. The technique utilizes oligonucleotide-probes that direct DSN activity to multiple targets in bisulfite-treated DNA, simultaneously. Oligonucleotide probes targeting unmethylated sequences generate local double stranded regions resulting to digestion of unmethylated targets, and leaving methylated targets intact; and vice versa. Subsequent amplification of the targeted regions results in enrichment of the targeted methylated or unmethylated minority-epigenetic-alleles. We validate MS-NaME by demonstrating enrichment of RARb2, ATM, MGMT and GSTP1 promoters in multiplexed MS-NaME reactions (177-plex) using dilutions of methylated/unmethylated DNA and in DNA from clinical lung cancer samples and matched normal tissue. MS-NaME is a highly scalable single-step approach performed at the genomic DNA level in solution that combines with most downstream detection technologies including Sanger sequencing, methylation-sensitive-high-resolution melting (MS-HRM) and methylation-specific-Taqman-based-digital-PCR (digital Methylight) to boost detection of low-level aberrant methylation-changes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
A family-level Tree of Life for bivalves based on a Sanger-sequencing approach.
Combosch, David J; Collins, Timothy M; Glover, Emily A; Graf, Daniel L; Harper, Elizabeth M; Healy, John M; Kawauchi, Gisele Y; Lemer, Sarah; McIntyre, Erin; Strong, Ellen E; Taylor, John D; Zardus, John D; Mikkelsen, Paula M; Giribet, Gonzalo; Bieler, Rüdiger
2017-02-01
The systematics of the molluscan class Bivalvia are explored using a 5-gene Sanger-based approach including the largest taxon sampling to date, encompassing 219 ingroup species spanning 93 (or 82%) of the 113 currently accepted bivalve families. This study was designed to populate the bivalve Tree of Life at the family level and to place many genera into a clear phylogenetic context, but also pointing to several major clades where taxonomic work is sorely needed. Despite not recovering monophyly of Bivalvia or Protobranchia-as in most previous Sanger-based approaches to bivalve phylogeny-our study provides increased resolution in many higher-level clades, and supports the monophyly of Autobranchia, Pteriomorphia, Heteroconchia, Palaeoheterodonta, Heterodonta, Archiheterodonta, Euheterodonta, Anomalodesmata, Imparidentia, and Neoheterodontei, in addition to many other lower clades. However, deep nodes within some of these clades, especially Pteriomorphia and Imparidentia, could not be resolved with confidence. In addition, many families are not supported, and several are supported as non-monophyletic, including Malletiidae, Nuculanidae, Yoldiidae, Malleidae, Pteriidae, Arcidae, Propeamussiidae, Iridinidae, Carditidae, Myochamidae, Lyonsiidae, Pandoridae, Montacutidae, Galeommatidae, Tellinidae, Semelidae, Psammobiidae, Donacidae, Mactridae, and Cyrenidae; Veneridae is paraphyletic with respect to Chamidae, although this result appears to be an artifact. The denser sampling however allowed testing specific placement of species, showing, for example, that the unusual Australian Plebidonax deltoides is not a member of Donacidae and instead nests within Psammobiidae, suggesting that major revision of Tellinoidea may be required. We also showed that Cleidothaerus is sister group to the cementing member of Myochamidae, suggesting that Cleidothaeridae may not be a valid family and that cementation in Cleidothaerus and Myochama may have had a single origin. These results highlight the need for an integrative approach including as many genera as possible, and that the monophyly and relationships of many families require detailed reassessment. NGS approaches may be able to resolve the most recalcitrant nodes in the near future. Copyright © 2016 Elsevier Inc. All rights reserved.
Inversion-mediated gene fusions involving NAB2-STAT6 in an unusual malignant meningioma.
Gao, F; Ling, C; Shi, L; Commins, D; Zada, G; Mack, W J; Wang, K
2013-08-20
Meningiomas are the most common primary intracranial tumours, with ∼3% meeting current histopathologic criteria for malignancy. In this study, we explored the transcriptome of meningiomas using RNA-Seq. Inversion-mediated fusions between two adjacent genes, NAB2 and STAT6, were detected in one malignant tumour, creating two novel in-frame transcripts that were validated by RT-PCR and Sanger sequencing. Gene fusions of NAB2-STAT6 were recently implicated in the pathogenesis of solitary fibrous tumours; our study suggested that similar fusions may also have a role in a malignant meningioma with unusual histopathologic features.
Yang, Tsun-Po; Beazley, Claude; Montgomery, Stephen B; Dimas, Antigone S; Gutierrez-Arcelus, Maria; Stranger, Barbara E; Deloukas, Panos; Dermitzakis, Emmanouil T
2010-10-01
Genevar (GENe Expression VARiation) is a database and Java tool designed to integrate multiple datasets, and provides analysis and visualization of associations between sequence variation and gene expression. Genevar allows researchers to investigate expression quantitative trait loci (eQTL) associations within a gene locus of interest in real time. The database and application can be installed on a standard computer in database mode and, in addition, on a server to share discoveries among affiliations or the broader community over the Internet via web services protocols. http://www.sanger.ac.uk/resources/software/genevar.
Complete mitochondrial genome of the Freshwater Catfish Rita rita (Siluriformes, Bagridae).
Lashari, Punhal; Laghari, Muhammad Younis; Xu, Peng; Zhao, Zixia; Jiang, Li; Narejo, Naeem Tariq; Deng, Yulin; Sun, Xiaowen; Zhang, Yan
2015-01-01
The complete mitochondrial genome of Catfish, Rita rita, was isolated by LA PCR (TakaRa LAtaq, Dalian, China); and sequenced by Sanger's method to obtain the complete mitochondrial genome, which is listed Critically Endangered and Red Listed species. The complete mitogenome was 16,449 bp in length and contains 13 typical vertebrate protein-coding genes, 2 rRNA and 22 tRNA genes. The whole genome base composition was estimated to be 33.40% A, 27.43% C, 14.26% G and 24.89% T. The complete mitochondrial genome of catfish, Rita rita provides the basis for genetic breeding and conservation studies.
A novel dysfunctional germline P53 mutation identified in a family with Li-Fraumeni syndrome.
Ji, Min; Wang, Lin; Shao, Yuguo; Cao, Wei; Xu, Ting; Chen, Shujie; Wang, Zhiwei; He, Qi; Yang, Kuo
2018-01-01
Li-Fraumeni Syndrome (LFS), which is a rare dominantly inherited cancer predisposition syndrome, is associated with germline P53 mutations. Mutations of the tumor suppressor protein P53 are associated with more than 50% of human cancers; however, almost 30% of P53 mutations occur rarely and this has raised questions about their significance. It therefore appeared of particular interest that we identified a novel mutation in a patient suffering from breast cancer and fulfilling the diagnostic criteria of LFS. In this study, a patient with remarkable family history developed breast cancer and was diagnosed with LFS. By performing next-generation sequencing on the patient and subsequent verification by Sanger sequencing among other family members, a new germ-line P53 replication error, a trinucleotide repeat mutation in the coding region, was identified in two generations of this Li-Fraumeni family.
Hegazi, Moustafa Abdelaal; Manou, Sommen; Sakr, Hazem; Camp, Guy Van
2017-01-01
Inherited Palmoplantar Keratodermas are rare disorders of genodermatosis that are conventionally regarded as autosomal dominant in inheritance with extensive clinical and genetic heterogeneity. This is the first report of a unique autosomal recessive Inherited Palmoplantar keratoderma - sensorineural hearing loss syndrome which has not been reported before in 3 siblings of a large consanguineous family. The patients presented unique clinical features that were different from other known Inherited Palmoplantar Keratodermas - hearing loss syndromes. Mutations in GJB2 or GJB6 and the mitochondrial A7445G mutation, known to be the major causes of diverse Inherited Palmoplantar Keratodermas -hearing loss syndromes were not detected by Sanger sequencing. Moreover, the pathogenic mutation could not be identified using whole exome sequencing. Other known Inherited Palmoplantar keratoderma syndromes were excluded based on both clinical criteria and genetic analysis. PMID:29267478
Absence of mutations in HCRT, HCRTR1 and HCRTR2 in patients with ROHHAD.
Barclay, Sarah F; Rand, Casey M; Gray, Paul A; Gibson, William T; Wilson, Richard J A; Berry-Kravis, Elizabeth M; Ize-Ludlow, Diego; Bech-Hansen, N Torben; Weese-Mayer, Debra E
2016-01-15
Rapid-onset obesity with hypothalamic dysfunction, hypoventilation, and autonomic dysregulation (ROHHAD) is a rare pediatric disease of unknown cause. Here, in response to a recent case report describing a ROHHAD patient who suffered from secondary narcolepsy confirmed by an absence of hypocretin-1 in the cerebrospinal fluid, we consider whether the ROHHAD phenotype is owing to one or more mutations in genes specific to hypocretin protein signalling. DNA samples from 16 ROHHAD patients were analyzed using a combination of next-generation and Sanger sequencing to identify exonic sequence variations in three genes: HCRT, HCRTR1, and HCRTR2. No rare or novel mutations were identified in the exons of HCRT, HCRTR1, or HCRTR2 genes in a set of 16 ROHHAD patients. ROHHAD is highly unlikely to be caused by mutations in the exons of the genes for hypocretin and its two receptors. Copyright © 2015 Elsevier B.V. All rights reserved.
A novel homozygous HOXB1 mutation in a Turkish family with hereditary congenital facial paresis.
Sahin, Yavuz; Güngör, Olcay; Ayaz, Akif; Güngör, Gülay; Sahin, Bedia; Yaykasli, Kursad; Ceylaner, Serdar
2017-02-01
Hereditary congenital facial paresis (HCFP) is characterized by isolated dysfunction of the facial nerve (CN VII) due to congenital cranial dysinnervation disorders. HCFP has genetic heterogeneity and HOXB1 is the first identified gene. We report the clinical, radiologic and molecular investigations of three patients admitted for HCFP in a large consanguineous Turkish family. High-throughput sequencing and Sanger sequencing of all patients revealed a novel homozygous mutation p.Arg230Trp (c.688C>T) within the HOXB1 gene. The report of the mutation brings the total number of HOXB1 mutations identified in HCFP to four. The results of this study emphasize that in individuals with congenital facial palsy accompanied by hearing loss and dysmorphic facial features, HOXB1 mutation causing HCFP should be kept in mind. Copyright © 2016 The Japanese Society of Child Neurology. Published by Elsevier B.V. All rights reserved.
Novel heterozygous NOTCH3 pathogenic variant found in two Chinese patients with CADASIL.
Li, Shufeng; Chen, Yifan; Shan, Haitao; Ma, Fang; Shi, Minke; Xue, Jun
2017-12-01
NOTCH3 mutations have been described to cause cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL). Here, we report 2 CADASIL patients from a Chinese family. Whole genome sequencing was performed on the two CADASIL patients. The novel variant c.128G>C in exon 2 of NOTCH3 was identified and confirmed through PCR-Sanger sequencing (Human Genome Variation Society nomenclature: HGVS: NOTCH3 c.128G>C; p.Cys43Ser). The heterozygous NOTCH3 variant cause a cysteine to serine substitution at codon 43. According to the variant interpretation guideline of American College of Medical Genetics and Genomics (ACMG), this variant was classified as "pathogenic". Other variants in HTRA1, COL4A1 and COL4A2 were also found, they were classified as "benign". Copyright © 2017 Elsevier Ltd. All rights reserved.
Anazawa, Takashi; Yamazaki, Motohiro
2017-12-05
Although multi-point, multi-color fluorescence-detection systems are widely used in various sciences, they would find wider applications if they are miniaturized. Accordingly, an ultra-small, four-emission-point and four-color fluorescence-detection system was developed. Its size (space between emission points and a detection plane) is 15 × 10 × 12 mm, which is three-orders-of-magnitude smaller than that of a conventional system. Fluorescence from four emission points with an interval of 1 mm on the same plane was respectively collimated by four lenses and split into four color fluxes by four dichroic mirrors. Then, a total of sixteen parallel color fluxes were directly input into an image sensor and simultaneously detected. The emission-point plane and the detection plane (the image-sensor surface) were parallel and separated by a distance of only 12 mm. The developed system was applied to four-capillary array electrophoresis and successfully achieved Sanger DNA sequencing. Moreover, compared with a conventional system, the developed system had equivalent high fluorescence-detection sensitivity (lower detection limit of 17 pM dROX) and 1.6-orders-of-magnitude higher dynamic range (4.3 orders of magnitude).
Fredlake, Christopher P; Hert, Daniel G; Kan, Cheuk-Wai; Chiesl, Thomas N; Root, Brian E; Forster, Ryan E; Barron, Annelise E
2008-01-15
To realize the immense potential of large-scale genomic sequencing after the completion of the second human genome (Venter's), the costs for the complete sequencing of additional genomes must be dramatically reduced. Among the technologies being developed to reduce sequencing costs, microchip electrophoresis is the only new technology ready to produce the long reads most suitable for the de novo sequencing and assembly of large and complex genomes. Compared with the current paradigm of capillary electrophoresis, microchip systems promise to reduce sequencing costs dramatically by increasing throughput, reducing reagent consumption, and integrating the many steps of the sequencing pipeline onto a single platform. Although capillary-based systems require approximately 70 min to deliver approximately 650 bases of contiguous sequence, we report sequencing up to 600 bases in just 6.5 min by microchip electrophoresis with a unique polymer matrix/adsorbed polymer wall coating combination. This represents a two-thirds reduction in sequencing time over any previously published chip sequencing result, with comparable read length and sequence quality. We hypothesize that these ultrafast long reads on chips can be achieved because the combined polymer system engenders a recently discovered "hybrid" mechanism of DNA electromigration, in which DNA molecules alternate rapidly between repeating through the intact polymer network and disrupting network entanglements to drag polymers through the solution, similar to dsDNA dynamics we observe in single-molecule DNA imaging studies. Most importantly, these results reveal the surprisingly powerful ability of microchip electrophoresis to provide ultrafast Sanger sequencing, which will translate to increased system throughput and reduced costs.
Fredlake, Christopher P.; Hert, Daniel G.; Kan, Cheuk-Wai; Chiesl, Thomas N.; Root, Brian E.; Forster, Ryan E.; Barron, Annelise E.
2008-01-01
To realize the immense potential of large-scale genomic sequencing after the completion of the second human genome (Venter's), the costs for the complete sequencing of additional genomes must be dramatically reduced. Among the technologies being developed to reduce sequencing costs, microchip electrophoresis is the only new technology ready to produce the long reads most suitable for the de novo sequencing and assembly of large and complex genomes. Compared with the current paradigm of capillary electrophoresis, microchip systems promise to reduce sequencing costs dramatically by increasing throughput, reducing reagent consumption, and integrating the many steps of the sequencing pipeline onto a single platform. Although capillary-based systems require ≈70 min to deliver ≈650 bases of contiguous sequence, we report sequencing up to 600 bases in just 6.5 min by microchip electrophoresis with a unique polymer matrix/adsorbed polymer wall coating combination. This represents a two-thirds reduction in sequencing time over any previously published chip sequencing result, with comparable read length and sequence quality. We hypothesize that these ultrafast long reads on chips can be achieved because the combined polymer system engenders a recently discovered “hybrid” mechanism of DNA electromigration, in which DNA molecules alternate rapidly between reptating through the intact polymer network and disrupting network entanglements to drag polymers through the solution, similar to dsDNA dynamics we observe in single-molecule DNA imaging studies. Most importantly, these results reveal the surprisingly powerful ability of microchip electrophoresis to provide ultrafast Sanger sequencing, which will translate to increased system throughput and reduced costs. PMID:18184818
Quasispecies Analyses of the HIV-1 Near-full-length Genome With Illumina MiSeq
Ode, Hirotaka; Matsuda, Masakazu; Matsuoka, Kazuhiro; Hachiya, Atsuko; Hattori, Junko; Kito, Yumiko; Yokomaku, Yoshiyuki; Iwatani, Yasumasa; Sugiura, Wataru
2015-01-01
Human immunodeficiency virus type-1 (HIV-1) exhibits high between-host genetic diversity and within-host heterogeneity, recognized as quasispecies. Because HIV-1 quasispecies fluctuate in terms of multiple factors, such as antiretroviral exposure and host immunity, analyzing the HIV-1 genome is critical for selecting effective antiretroviral therapy and understanding within-host viral coevolution mechanisms. Here, to obtain HIV-1 genome sequence information that includes minority variants, we sought to develop a method for evaluating quasispecies throughout the HIV-1 near-full-length genome using the Illumina MiSeq benchtop deep sequencer. To ensure the reliability of minority mutation detection, we applied an analysis method of sequence read mapping onto a consensus sequence derived from de novo assembly followed by iterative mapping and subsequent unique error correction. Deep sequencing analyses of aHIV-1 clone showed that the analysis method reduced erroneous base prevalence below 1% in each sequence position and discarded only < 1% of all collected nucleotides, maximizing the usage of the collected genome sequences. Further, we designed primer sets to amplify the HIV-1 near-full-length genome from clinical plasma samples. Deep sequencing of 92 samples in combination with the primer sets and our analysis method provided sufficient coverage to identify >1%-frequency sequences throughout the genome. When we evaluated sequences of pol genes from 18 treatment-naïve patients' samples, the deep sequencing results were in agreement with Sanger sequencing and identified numerous additional minority mutations. The results suggest that our deep sequencing method would be suitable for identifying within-host viral population dynamics throughout the genome. PMID:26617593
High-throughput sequence alignment using Graphics Processing Units
Schatz, Michael C; Trapnell, Cole; Delcher, Arthur L; Varshney, Amitabh
2007-01-01
Background The recent availability of new, less expensive high-throughput DNA sequencing technologies has yielded a dramatic increase in the volume of sequence data that must be analyzed. These data are being generated for several purposes, including genotyping, genome resequencing, metagenomics, and de novo genome assembly projects. Sequence alignment programs such as MUMmer have proven essential for analysis of these data, but researchers will need ever faster, high-throughput alignment tools running on inexpensive hardware to keep up with new sequence technologies. Results This paper describes MUMmerGPU, an open-source high-throughput parallel pairwise local sequence alignment program that runs on commodity Graphics Processing Units (GPUs) in common workstations. MUMmerGPU uses the new Compute Unified Device Architecture (CUDA) from nVidia to align multiple query sequences against a single reference sequence stored as a suffix tree. By processing the queries in parallel on the highly parallel graphics card, MUMmerGPU achieves more than a 10-fold speedup over a serial CPU version of the sequence alignment kernel, and outperforms the exact alignment component of MUMmer on a high end CPU by 3.5-fold in total application time when aligning reads from recent sequencing projects using Solexa/Illumina, 454, and Sanger sequencing technologies. Conclusion MUMmerGPU is a low cost, ultra-fast sequence alignment program designed to handle the increasing volume of data produced by new, high-throughput sequencing technologies. MUMmerGPU demonstrates that even memory-intensive applications can run significantly faster on the relatively low-cost GPU than on the CPU. PMID:18070356
The role of the physician: Eugene Sanger and a standard of care at the Elmira prison camp.
Waggoner, Jesse
2008-01-01
The conduct of American military physicians in prisoner of war (POW) camps has been called into question by the abuse scandals at Abu Ghraib and Guantánamo Bay. This essay explores the experiences of the first U.S. military physicians to confront POW patients in large numbers-events that occurred during the American Civil War. While POWs received sub-standard care in camps north and south, the war also saw the issuance of the first document to outline the rights of POWs. This ambivalence toward the proper care and treatment of the POW is evident in the career of Dr. Eugene Sanger, the first Union surgeon at the prison camp in Elmira, New York. Sanger demonstrated both concern about the sanitary condition of the camp and pride in the deaths of POWs as furthering the overall war aims. His cruelty attracted some censure, but Sanger never faced disciplinary action. He was honorably discharged and went on to become the Surgeon General of his home state. This article places his actions at Elmira in the context of medical ethics, Army orders, and Northern opinion in 1864, and it will argue that the lack of Federal response to Eugene Sanger's poor record while serving at the prison set a precedent for inferior medical care of POWs by American military physicians.
DOE Joint Genome Institute 2008 Progress Report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gilbert, David
2009-03-12
While initially a virtual institute, the driving force behind the creation of the DOE Joint Genome Institute in Walnut Creek, California in the Fall of 1999 was the Department of Energy's commitment to sequencing the human genome. With the publication in 2004 of a trio of manuscripts describing the finished 'DOE Human Chromosomes', the Institute successfully completed its human genome mission. In the time between the creation of the Department of Energy Joint Genome Institute (DOE JGI) and completion of the Human Genome Project, sequencing and its role in biology spread to fields extending far beyond what could be imaginedmore » when the Human Genome Project first began. Accordingly, the targets of the DOE JGI's sequencing activities changed, moving from a single human genome to the genomes of large numbers of microbes, plants, and other organisms, and the community of users of DOE JGI data similarly expanded and diversified. Transitioning into operating as a user facility, the DOE JGI modeled itself after other DOE user facilities, such as synchrotron light sources and supercomputer facilities, empowering the science of large numbers of investigators working in areas of relevance to energy and the environment. The JGI's approach to being a user facility is based on the concept that by focusing state-of-the-art sequencing and analysis capabilities on the best peer-reviewed ideas drawn from a broad community of scientists, the DOE JGI will effectively encourage creative approaches to DOE mission areas and produce important science. This clearly has occurred, only partially reflected in the fact that the DOE JGI has played a major role in more than 45 papers published in just the past three years alone in Nature and Science. The involvement of a large and engaged community of users working on important problems has helped maximize the impact of JGI science. A seismic technological change is presently underway at the JGI. The Sanger capillary-based sequencing process that dominated how sequencing was done in the last decade is being replaced by a variety of new processes and sequencing instruments. The JGI, with an increasing number of next-generation sequencers, whose throughput is 100- to 1,000-fold greater than the Sanger capillary-based sequencers, is increasingly focused in new directions on projects of scale and complexity not previously attempted. These new directions for the JGI come, in part, from the 2008 National Research Council report on the goals of the National Plant Genome Initiative as well as the 2007 National Research Council report on the New Science of Metagenomics. Both reports outline a crucial need for systematic large-scale surveys of the plant and microbial components of the biosphere as well as an increasing need for large-scale analysis capabilities to meet the challenge of converting sequence data into knowledge. The JGI is extensively discussed in both reports as vital to progress in these fields of major national interest. JGI's future plan for plants and microbes includes a systematic approach for investigation of these organisms at a scale requiring the special capabilities of the JGI to generate, manage, and analyze the datasets. JGI will generate and provide not only community access to these plant and microbial datasets, but also the tools for analyzing them. These activities will produce essential knowledge that will be needed if we are to be able to respond to the world's energy and environmental challenges. As the JGI Plant and Microbial programs advance, the JGI as a user facility is also evolving. The Institute has been highly successful in bending its technical and analytical skills to help users solve large complex problems of major importance, and that effort will continue unabated. The JGI will increasingly move from a central focus on 'one-off' user projects coming from small user communities to much larger scale projects driven by systematic and problem-focused approaches to selection of sequencing targets. Entire communities of scientists working in a particular field, such as feedstock improvement or biomass degradation, will be users of this information. Despite this new emphasis, an investigator-initiated user program will remain. This program in the future will replace small projects that increasingly can be accomplished without the involvement of JGI, with imaginative large-scale 'Grand Challenge' projects of foundational relevance to energy and the environment that require a new scale of sequencing and analysis capabilities. Close interactions with the DOE Bioenergy Research Centers, and with other DOE institutions that may follow, will also play a major role in shaping aspects of how the JGI operates as a user facility. Based on increased availability of high-throughput sequencing, the JGI will increasingly provide to users, in addition to DNA sequencing, an array of both pre- and post-sequencing value-added capabilities to accelerate their science.« less
Oliveira, Jorge; Negrão, Luís; Fineza, Isabel; Taipa, Ricardo; Melo-Pires, Manuel; Fortuna, Ana Maria; Gonçalves, Ana Rita; Froufe, Hugo; Egas, Conceição; Santos, Rosário; Sousa, Mário
2015-06-01
Muscular dystrophies (MDs) are a group of hereditary muscle disorders that include two particularly heterogeneous subgroups: limb-girdle MD and congenital MD, linked to 52 different genes (seven common to both subgroups). Massive parallel sequencing technology may avoid the usual stepwise gene-by-gene analysis. We report the whole-exome sequencing (WES) analysis of a patient with childhood-onset progressive MD, also presenting mental retardation and dilated cardiomyopathy. Conventional sequencing had excluded eight candidate genes. WES of the trio (patient and parents) was performed using the ion proton sequencing system. Data analysis resorted to filtering steps using the GEMINI software revealed a novel silent variant in the choline kinase beta (CHKB) gene. Inspection of sequence alignments ultimately identified the causal variant (CHKB:c.1031+3G>C). This splice site mutation was confirmed using Sanger sequencing and its effect was further evaluated with gene expression analysis. On reassessment of the muscle biopsy, typical abnormal mitochondrial oxidative changes were observed. Mutations in CHKB have been shown to cause phosphatidylcholine deficiency in myofibers, causing a rare form of CMD (only 21 patients reported). Notwithstanding interpretative difficulties that need to be overcome before the integration of WES in the diagnostic workflow, this work corroborates its utility in solving cases from highly heterogeneous groups of diseases, in which conventional diagnostic approaches fail to provide a definitive diagnosis.
Gürtler, Nicolas; Röthlisberger, Benno; Ludin, Katja; Schlegel, Christoph; Lalwani, Anil K
2017-07-01
Identification of the causative mutation using next-generation sequencing in autosomal-dominant hereditary hearing impairment, as mutation analysis in hereditary hearing impairment by classic genetic methods, is hindered by the high heterogeneity of the disease. Two Swiss families with autosomal-dominant hereditary hearing impairment. Amplified DNA libraries for next-generation sequencing were constructed from extracted genomic DNA, derived from peripheral blood, and enriched by a custom-made sequence capture library. Validated, pooled libraries were sequenced on an Illumina MiSeq instrument, 300 cycles and paired-end sequencing. Technical data analysis was performed with SeqMonk, variant analysis with GeneTalk or VariantStudio. The detection of mutations in genes related to hearing loss by next-generation sequencing was subsequently confirmed using specific polymerase-chain-reaction and Sanger sequencing. Mutation detection in hearing-loss-related genes. The first family harbored the mutation c.5383+5delGTGA in the TECTA-gene. In the second family, a novel mutation c.2614-2625delCATGGCGCCGTG in the WFS1-gene and a second mutation TCOF1-c.1028G>A were identified. Next-generation sequencing successfully identified the causative mutation in families with autosomal-dominant hereditary hearing impairment. The results helped to clarify the pathogenic role of a known mutation and led to the detection of a novel one. NGS represents a feasible approach with great potential future in the diagnostics of hereditary hearing impairment, even in smaller labs.
Vega, Ana I; Medrano, Celia; Navarrete, Rosa; Desviat, Lourdes R; Merinero, Begoña; Rodríguez-Pombo, Pilar; Vitoria, Isidro; Ugarte, Magdalena; Pérez-Cerdá, Celia; Pérez, Belen
2016-10-01
Glycogen storage disease (GSD) is an umbrella term for a group of genetic disorders that involve the abnormal metabolism of glycogen; to date, 23 types of GSD have been identified. The nonspecific clinical presentation of GSD and the lack of specific biomarkers mean that Sanger sequencing is now widely relied on for making a diagnosis. However, this gene-by-gene sequencing technique is both laborious and costly, which is a consequence of the number of genes to be sequenced and the large size of some genes. This work reports the use of massive parallel sequencing to diagnose patients at our laboratory in Spain using either a customized gene panel (targeted exome sequencing) or the Illumina Clinical-Exome TruSight One Gene Panel (clinical exome sequencing (CES)). Sequence variants were matched against biochemical and clinical hallmarks. Pathogenic mutations were detected in 23 patients. Twenty-two mutations were recognized (mostly loss-of-function mutations), including 11 that were novel in GSD-associated genes. In addition, CES detected five patients with mutations in ALDOB, LIPA, NKX2-5, CPT2, or ANO5. Although these genes are not involved in GSD, they are associated with overlapping phenotypic characteristics such as hepatic, muscular, and cardiac dysfunction. These results show that next-generation sequencing, in combination with the detection of biochemical and clinical hallmarks, provides an accurate, high-throughput means of making genetic diagnoses of GSD and related diseases.Genet Med 18 10, 1037-1043.
2011-01-01
Background Jatropha curcas L. is an important non-edible oilseed crop with promising future in biodiesel production. However, factors like oil yield, oil composition, toxic compounds in oil cake, pests and diseases limit its commercial potential. Well established genetic engineering methods using cloned genes could be used to address these limitations. Earlier, 10,983 unigenes from Sanger sequencing of ESTs, and 3,484 unique assembled transcripts from 454 pyrosequencing of uncloned cDNAs were reported. In order to expedite the process of gene discovery, we have undertaken 454 pyrosequencing of normalized cDNAs prepared from roots, mature leaves, flowers, developing seeds, and embryos of J. curcas. Results From 383,918 raw reads, we obtained 381,957 quality-filtered and trimmed reads that are suitable for the assembly of transcript sequences. De novo contig assembly of these reads generated 17,457 assembled transcripts (contigs) and 54,002 singletons. Average length of the assembled transcripts was 916 bp. About 30% of the transcripts were longer than 1000 bases, and the size of the longest transcript was 7,173 bases. BLASTX analysis revealed that 2,589 of these transcripts are full-length. The assembled transcripts were validated by RT-PCR analysis of 28 transcripts. The results showed that the transcripts were correctly assembled and represent actively expressed genes. KEGG pathway mapping showed that 2,320 transcripts are related to major biochemical pathways including the oil biosynthesis pathway. Overall, the current study reports 14,327 new assembled transcripts which included 2589 full-length transcripts and 27 transcripts that are directly involved in oil biosynthesis. Conclusion The large number of transcripts reported in the current study together with existing ESTs and transcript sequences will serve as an invaluable genetic resource for crop improvement in jatropha. Sequence information of those genes that are involved in oil biosynthesis could be used for metabolic engineering of jatropha to increase oil content, and to modify oil composition. PMID:21492485
Natarajan, Purushothaman; Parani, Madasamy
2011-04-15
Jatropha curcas L. is an important non-edible oilseed crop with promising future in biodiesel production. However, factors like oil yield, oil composition, toxic compounds in oil cake, pests and diseases limit its commercial potential. Well established genetic engineering methods using cloned genes could be used to address these limitations. Earlier, 10,983 unigenes from Sanger sequencing of ESTs, and 3,484 unique assembled transcripts from 454 pyrosequencing of uncloned cDNAs were reported. In order to expedite the process of gene discovery, we have undertaken 454 pyrosequencing of normalized cDNAs prepared from roots, mature leaves, flowers, developing seeds, and embryos of J. curcas. From 383,918 raw reads, we obtained 381,957 quality-filtered and trimmed reads that are suitable for the assembly of transcript sequences. De novo contig assembly of these reads generated 17,457 assembled transcripts (contigs) and 54,002 singletons. Average length of the assembled transcripts was 916 bp. About 30% of the transcripts were longer than 1000 bases, and the size of the longest transcript was 7,173 bases. BLASTX analysis revealed that 2,589 of these transcripts are full-length. The assembled transcripts were validated by RT-PCR analysis of 28 transcripts. The results showed that the transcripts were correctly assembled and represent actively expressed genes. KEGG pathway mapping showed that 2,320 transcripts are related to major biochemical pathways including the oil biosynthesis pathway. Overall, the current study reports 14,327 new assembled transcripts which included 2589 full-length transcripts and 27 transcripts that are directly involved in oil biosynthesis. The large number of transcripts reported in the current study together with existing ESTs and transcript sequences will serve as an invaluable genetic resource for crop improvement in jatropha. Sequence information of those genes that are involved in oil biosynthesis could be used for metabolic engineering of jatropha to increase oil content, and to modify oil composition.
Bergfors, Assar; Leenheer, Daniël; Bergqvist, Anders; Ameur, Adam; Lennerstrand, Johan
2016-02-01
Development of Hepatitis C virus (HCV) resistance against direct-acting antivirals (DAAs), including NS5A inhibitors, is an obstacle to successful treatment of HCV when DAAs are used in sub-optimal combinations. Furthermore, it has been shown that baseline (pre-existing) resistance against DAAs is present in treatment naïve-patients and this will potentially complicate future treatment strategies in different HCV genotypes (GTs). Thus the aim was to detect low levels of NS5A resistant associated variants (RAVs) in a limited sample set of treatment-naïve patients of HCV GT1a and 3a, since such polymorphisms can display in vitro resistance as high as 60000 fold. Ultra-deep single molecule real time (SMRT) sequencing with the Pacific Biosciences (PacBio) RSII instrument was used to detect these RAVs. The SMRT sequencing was conducted on ten samples; three of them positive with Sanger sequencing (GT1a Q30H and Y93N, and GT3a Y93H), five GT1a samples, and two GT3a non-positive samples. The same methods were applied to the HCV GT1a H77-plasmid in a dilution series, in order to determine the error rates of replication, which in turn was used to determine the limit of detection (LOD), as defined by mean + 3SD, of minority variants down to 0.24%. We found important baseline NS5A RAVs at levels between 0.24 and 0.5%, which could potentially have clinical relevance. This new method with low level detection of baseline RAVs could be useful in predicting the most cost-efficient combination of DAA treatment, and reduce the treatment duration for an HCV infected individual. Copyright © 2015 Elsevier B.V. All rights reserved.
Large-scale deletions of the ABCA1 gene in patients with hypoalphalipoproteinemia.
Dron, Jacqueline S; Wang, Jian; Berberich, Amanda J; Iacocca, Michael A; Cao, Henian; Yang, Ping; Knoll, Joan; Tremblay, Karine; Brisson, Diane; Netzer, Christian; Gouni-Berthold, Ioanna; Gaudet, Daniel; Hegele, Robert A
2018-06-04
Copy-number variations (CNVs) have been studied in the context of familial hypercholesterolemia but have not yet been evaluated in patients with extremes of high-density lipoprotein (HDL) cholesterol levels. We evaluated targeted next-generation sequencing data from patients with very low HDL cholesterol (i.e. hypoalphalipoproteinemia) using the VarSeq-CNV caller algorithm to screen for CNVs disrupting the ABCA1, LCAT or APOA1 genes. In four individuals, we found three unique deletions in ABCA1: a heterozygous deletion of exon 4, a heterozygous deletion spanning exons 8 to 31, and a heterozygous deletion of the entire ABCA1 gene. Breakpoints were identified using Sanger sequencing, and the full-gene deletion was also confirmed using exome sequencing and the Affymetrix CytoScanTM HD Array. Before now, large-scale deletions in candidate HDL genes have not been associated with hypoalphalipoproteinemia; our findings indicate that CNVs in ABCA1 may be a previously unappreciated genetic determinant of low HDL cholesterol levels. By coupling bioinformatic analyses with next-generation sequencing data, we can successfully assess the spectrum of genetic determinants of many dyslipidemias, now including hypoalphalipoproteinemia. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.
Implementation of Quality Management in Core Service Laboratories
Creavalle, T.; Haque, K.; Raley, C.; Subleski, M.; Smith, M.W.; Hicks, B.
2010-01-01
CF-28 The Genetics and Genomics group of the Advanced Technology Program of SAIC-Frederick exists to bring innovative genomic expertise, tools and analysis to NCI and the scientific community. The Sequencing Facility (SF) provides next generation short read (Illumina) sequencing capacity to investigators using a streamlined production approach. The Laboratory of Molecular Technology (LMT) offers a wide range of genomics core services including microarray expression analysis, miRNA analysis, array comparative genome hybridization, long read (Roche) next generation sequencing, quantitative real time PCR, transgenic genotyping, Sanger sequencing, and clinical mutation detection services to investigators from across the NIH. As the technology supporting this genomic research becomes more complex, the need for basic quality processes within all aspects of the core service groups becomes critical. The Quality Management group works alongside members of these labs to establish or improve processes supporting operations control (equipment, reagent and materials management), process improvement (reengineering/optimization, automation, acceptance criteria for new technologies and tech transfer), and quality assurance and customer support (controlled documentation/SOPs, training, service deficiencies and continual improvement efforts). Implementation and expansion of quality programs within unregulated environments demonstrates SAIC-Frederick's dedication to providing the highest quality products and services to the NIH community.
Ulusal, SD; Gürkan, H; Atlı, E; Özal, SA; Çiftdemir, M; Tozkır, H; Karal, Y; Güçlü, H; Eker, D; Görker, I
2017-01-01
Abstract Neurofibromatosis Type I (NF1) is a multi systemic autosomal dominant neurocutaneous disorder predisposing patients to have benign and/or malignant lesions predominantly of the skin, nervous system and bone. Loss of function mutations or deletions of the NF1 gene is responsible for NF1 disease. Involvement of various pathogenic variants, the size of the gene and presence of pseudogenes makes it difficult to analyze. We aimed to report the results of 2 years of multiplex ligation-dependent probe amplification (MLPA) and next generation sequencing (NGS) for genetic diagnosis of NF1 applied at our genetic diagnosis center. The MLPA, semiconductor sequencing and Sanger sequencing were performed in genomic DNA samples from 24 unrelated patients and their affected family members referred to our center suspected of having NF1. In total, three novel and 12 known pathogenic variants and a whole gene deletion were determined. We suggest that next generation sequencing is a practical tool for genetic analysis of NF1. Deletion/duplication analysis with MLPA may also be helpful for patients clinically diagnosed to carry NF1 but do not have a detectable mutation in NGS. PMID:28924536
Bintz, Brittania J; Dixon, Groves B; Wilson, Mark R
2014-07-01
Next-generation sequencing technologies enable the identification of minor mitochondrial DNA variants with higher sensitivity than Sanger methods, allowing for enhanced identification of minor variants. In this study, mixtures of human mtDNA control region amplicons were subjected to pyrosequencing to determine the detection threshold of the Roche GS Junior(®) instrument (Roche Applied Science, Indianapolis, IN). In addition to expected variants, a set of reproducible variants was consistently found in reads from one particular amplicon. A BLASTn search of the variant sequence revealed identity to a segment of a 611-bp nuclear insertion of the mitochondrial control region (NumtS) spanning the primer-binding sites of this amplicon (Nature 1995;378:489). Primers (Hum Genet 2012;131:757; Hum Biol 1996;68:847) flanking the insertion were used to confirm the presence or absence of the NumtS in buccal DNA extracts from twenty donors. These results further our understanding of human mtDNA variation and are expected to have a positive impact on the interpretation of mtDNA profiles using deep-sequencing methods in casework. © 2014 American Academy of Forensic Sciences.
Margam, Venu M.; Coates, Brad S.; Bayles, Darrell O.; Hellmich, Richard L.; Agunbiade, Tolulope; Seufferheld, Manfredo J.; Sun, Weilin; Kroemer, Jeremy A.; Ba, Malick N.; Binso-Dabire, Clementine L.; Baoua, Ibrahim; Ishiyaku, Mohammad F.; Covas, Fernando G.; Srinivasan, Ramasamy; Armstrong, Joel; Murdock, Larry L.; Pittendrigh, Barry R.
2011-01-01
The legume pod borer, Maruca vitrata (Lepidoptera: Crambidae), is an insect pest species of crops grown by subsistence farmers in tropical regions of Africa. We present the de novo assembly of 3729 contigs from 454- and Sanger-derived sequencing reads for midgut, salivary, and whole adult tissues of this non-model species. Functional annotation predicted that 1320 M. vitrata protein coding genes are present, of which 631 have orthologs within the Bombyx mori gene model. A homology-based analysis assigned M. vitrata genes into a group of paralogs, but these were subsequently partitioned into putative orthologs following phylogenetic analyses. Following sequence quality filtering, a total of 1542 putative single nucleotide polymorphisms (SNPs) were predicted within M. vitrata contig assemblies. Seventy one of 1078 designed molecular genetic markers were used to screen M. vitrata samples from five collection sites in West Africa. Population substructure may be present with significant implications in the insect resistance management recommendations pertaining to the release of biological control agents or transgenic cowpea that express Bacillus thuringiensis crystal toxins. Mutation data derived from transcriptome sequencing is an expeditious and economical source for genetic markers that allow evaluation of ecological differentiation. PMID:21754987
Miyatake, Satoko; Koshimizu, Eriko; Hayashi, Yukiko K; Miya, Kazushi; Shiina, Masaaki; Nakashima, Mitsuko; Tsurusaki, Yoshinori; Miyake, Noriko; Saitsu, Hirotomo; Ogata, Kazuhiro; Nishino, Ichizo; Matsumoto, Naomichi
2014-07-01
When an expected mutation in a particular disease-causing gene is not identified in a suspected carrier, it is usually assumed to be due to germline mosaicism. We report here very-low-grade somatic mosaicism in ACTA1 in an unaffected mother of two siblings affected with a neonatal form of nemaline myopathy. The mosaicism was detected by deep resequencing using a next-generation sequencer. We identified a novel heterozygous mutation in ACTA1, c.448A>G (p.Thr150Ala), in the affected siblings. Three-dimensional structural modeling suggested that this mutation may affect polymerization and/or actin's interactions with other proteins. In this family, we expected autosomal dominant inheritance with either parent demonstrating germline or somatic mosaicism. Sanger sequencing identified no mutation. However, further deep resequencing of this mutation on a next-generation sequencer identified very-low-grade somatic mosaicism in the mother: 0.4%, 1.1%, and 8.3% in the saliva, blood leukocytes, and nails, respectively. Our study demonstrates the possibility of very-low-grade somatic mosaicism in suspected carriers, rather than germline mosaicism. Copyright © 2014 Elsevier B.V. All rights reserved.
Rapid phylogenetic dissection of prokaryotic community structure in tidal flat using pyrosequencing.
Kim, Bong-Soo; Kim, Byung Kwon; Lee, Jae-Hak; Kim, Myungjin; Lim, Young Woon; Chun, Jongsik
2008-08-01
Dissection of prokaryotic community structure is prerequisite to understand their ecological roles. Various methods are available for such a purpose which amplification and sequencing of 16S rRNA genes gained its popularity. However, conventional methods based on Sanger sequencing technique require cloning process prior to sequencing, and are expensive and labor-intensive. We investigated prokaryotic community structure in tidal flat sediments, Korea, using pyrosequencing and a subsequent automated bioinformatic pipeline for the rapid and accurate taxonomic assignment of each amplicon. The combination of pyrosequencing and bioinformatic analysis showed that bacterial and archaeal communities were more diverse than previously reported in clone library studies. Pyrosequencing analysis revealed 21 bacterial divisions and 37 candidate divisions. Proteobacteria was the most abundant division in the bacterial community, of which Gamma-and Delta-Proteobacteria were the most abundant. Similarly, 4 archaeal divisions were found in tidal flat sediments. Euryarchaeota was the most abundant division in the archaeal sequences, which were further divided into 8 classes and 11 unclassified euryarchaeota groups. The system developed here provides a simple, in-depth and automated way of dissecting a prokaryotic community structure without extensive pretreatment such as cloning.
A new compound heterozygous CFTR mutation in a Chinese family with cystic fibrosis.
Xie, Yingjun; Huang, Xueqiong; Liang, Yujian; Xu, Lingling; Pei, Yuxin; Cheng, Yucai; Zhang, Lidan; Tang, Wen
2017-11-01
Cystic fibrosis (CF) is the most common autosomal recessive disease among Caucasians but is rarer in the Chinese population, because mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene. To elucidate the causative role of a novel compound heterozygous mutation of CF. In this study, clinical samples were obtained from two siblings with recurrent airway infections, clubbed fingers, salt-sweat and failure to gain weight in a non-consanguineous Chinese family. Next-generation sequencing was performed on the 27 coding exons of CFTR in both children, with confirmation by Sanger sequencing. Next-generation sequencing showed the same compound heterozygous CFTR mutation (c.865A>T p.Arg289X and c.3651_3652insAAAT p.Tyr1219X) in both children. As this mutation is consistent with the clinical manifestations of CF and no other mutations were detected after scanning the gene sequence, we suggest that the CF phenotype is caused by compound heterozygosity for c.865A>T and c.3651_3652insAAAT. As c865A>T is not currently listed in the "Cystic Fibrosis Mutation Database", this information about CF in a Chinese population is of interest. © 2015 John Wiley & Sons Ltd.
Afouda, Leonard; Kone, Daouda; Zinsou, Valerien; Dossou, Laurence; Kenyon, Lawrence; Winter, Stephan; Knierim, Dennis
2017-06-01
Surveys were conducted in 2014 and 2015 in Southern and Northern Benin, respectively, to identify the viruses infecting peppers (Capsicum spp.). The samples were screened by ELISA for cucumber mosaic virus (CMV), pepper veinal mottle virus (PVMV), potato virus Y (PVY) and tomato yellow leaf curl virus (TYLCV). A generic reverse transcription PCR (RT-PCR) was used to test for the presence of poleroviruses. ELISA tests confirmed the prevalence of all viruses, while the RT-PCR detected pepper vein yellows virus (PeVYV) which is reported for the first time in Benin. A further, divergent polerovirus isolate was detected from a single pepper sample originating from southern Benin. Screening of samples collected from solanaceous plants during virus surveys in Mali (conducted in 2009) also detected this divergent polerovirus isolate in two samples from African eggplants. The complete genome sequence was obtained from the Mali isolate using transcriptome sequencing and by conventional Sanger sequencing of overlapping RT-PCR products. Based on the sequence characteristics of this isolate we propose a new polerovirus species, African eggplant yellowing virus (AeYV).