Williams, Emma L; Bagg, Eleanor A L; Mueller, Michael; Vandrovcova, Jana; Aitman, Timothy J; Rumsby, Gill
2015-01-01
Definitive diagnosis of primary hyperoxaluria (PH) currently utilizes sequential Sanger sequencing of the AGXT, GRPHR, and HOGA1 genes but efficacy is unproven. This analysis is time-consuming, relatively expensive, and delays in diagnosis and inappropriate treatment can occur if not pursued early in the diagnostic work-up. We reviewed testing outcomes of Sanger sequencing in 200 consecutive patient samples referred for analysis. In addition, the Illumina Truseq custom amplicon system was evaluated for paralleled next-generation sequencing (NGS) of AGXT,GRHPR, and HOGA1 in 90 known PH patients. AGXT sequencing was requested in all patients, permitting a diagnosis of PH1 in 50%. All remaining patients underwent targeted exon sequencing of GRHPR and HOGA1 with 8% diagnosed with PH2 and 8% with PH3. Complete sequencing of both GRHPR and HOGA1 was not requested in 25% of patients referred leaving their diagnosis in doubt. NGS analysis showed 98% agreement with Sanger sequencing and both approaches had 100% diagnostic specificity. Diagnostic sensitivity of Sanger sequencing was 98% and for NGS it was 97%. NGS has comparable diagnostic performance to Sanger sequencing for the diagnosis of PH and, if implemented, would screen for all forms of PH simultaneously ensuring prompt diagnosis at decreased cost. PMID:25629080
Lee, Sung Hak; Chung, Arthur Minwoo; Lee, Ahwon; Oh, Woo Jin; Choi, Yeong Jin; Lee, Youn-Soo; Jung, Eun Sun
2017-01-01
Mutations in the KRAS gene have been identified in approximately 50% of colorectal cancers (CRCs). KRAS mutations are well established biomarkers in anti-epidermal growth factor receptor therapy. Therefore, assessment of KRAS mutations is needed in CRC patients to ensure appropriate treatment. We compared the analytical performance of the cobas test to Sanger sequencing in 264 CRC cases. In addition, discordant specimens were evaluated by 454 pyrosequencing. KRAS mutations for codons 12/13 were detected in 43.2% of cases (114/264) by Sanger sequencing. Of 257 evaluable specimens for comparison, KRAS mutations were detected in 112 cases (43.6%) by Sanger sequencing and 118 cases (45.9%) by the cobas test. Concordance between the cobas test and Sanger sequencing for each lot was 93.8% positive percent agreement (PPA) and 91.0% negative percent agreement (NPA) for codons 12/13. Results from the cobas test and Sanger sequencing were discordant for 20 cases (7.8%). Twenty discrepant cases were subsequently subjected to 454 pyrosequencing. After comprehensive analysis of the results from combined Sanger sequencing-454 pyrosequencing and the cobas test, PPA was 97.5% and NPA was 100%. The cobas test is an accurate and sensitive test for detecting KRAS -activating mutations and has analytical power equivalent to Sanger sequencing. Prescreening using the cobas test with subsequent application of Sanger sequencing is the best strategy for routine detection of KRAS mutations in CRC.
Detection of Rare Mutations in EGFR-ARMS-PCR-Negative Lung Adenocarcinoma by Sanger Sequencing.
Liang, Chaoyue; Wu, Zhuolin; Gan, Xiaohong; Liu, Yuanbin; You, You; Liu, Chenxian; Zhou, Chengzhi; Liang, Ying; Mo, Haiyun; Chen, Allen M; Zhang, Jiexia
2018-01-01
This study aimed to identify potential epidermal growth factor receptor (EGFR) gene mutations in non-small cell lung cancer that went undetected by amplification refractory mutation system-Scorpion real-time PCR (ARMS-PCR). A total of 200 specimens were obtained from the First Affiliated Hospital of Guangzhou Medical University from August 2014 to August 2015. In total, 100 ARMS-negative and 100 ARMS-positive specimens were evaluated for EGFR gene mutations by Sanger sequencing. The methodology and sensitivity of each method and the outcomes of EGFR-tyrosine kinase inhibitor (TKI) therapy were analyzed. Among the 100 ARMS-PCR-positive samples, 90 were positive by Sanger sequencing, while 10 cases were considered negative, because the mutation abundance was less than 10%. Among the 100 negative cases, three were positive for a rare EGFR mutation by Sanger sequencing. In the curative effect analysis of EGFR-TKIs, the progression-free survival (PFS) analysis based on ARMS and Sanger sequencing results showed no difference. However, the PFS of patients with a high abundance of EGFR mutation was 12.4 months [95% confidence interval (CI), 11.6-12.4 months], which was significantly higher than that of patients with a low abundance of mutations detected by Sanger sequencing (95% CI, 10.7-11.3 months) (p<0.001). The ARMS method demonstrated higher sensitivity than Sanger sequencing, but was prone to missing mutations due to primer design. Sanger sequencing was able to detect rare EGFR mutations and deemed applicable for confirming EGFR status. A clinical trial evaluating the efficacy of EGFR-TKIs in patients with rare EGFR mutations is needed. © Copyright: Yonsei University College of Medicine 2018
Pandey, Ram Vinay; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas
2016-01-01
Traditional Sanger sequencing as well as Next-Generation Sequencing have been used for the identification of disease causing mutations in human molecular research. The majority of currently available tools are developed for research and explorative purposes and often do not provide a complete, efficient, one-stop solution. As the focus of currently developed tools is mainly on NGS data analysis, no integrative solution for the analysis of Sanger data is provided and consequently a one-stop solution to analyze reads from both sequencing platforms is not available. We have therefore developed a new pipeline called MutAid to analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and analysis of Sanger and NGS data under a single platform. It is capable of analyzing reads from multiple patients in a single run to create a list of potential disease causing base substitutions as well as insertions and deletions. MutAid has been developed for expert and non-expert users and supports four sequencing platforms including Sanger, Illumina, 454 and Ion Torrent. Furthermore, for NGS data analysis, five read mappers including BWA, TMAP, Bowtie, Bowtie2 and GSNAP and four variant callers including GATK-HaplotypeCaller, SAMTOOLS, Freebayes and VarScan2 pipelines are supported. MutAid is freely available at https://sourceforge.net/projects/mutaid.
Pandey, Ram Vinay; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas
2016-01-01
Traditional Sanger sequencing as well as Next-Generation Sequencing have been used for the identification of disease causing mutations in human molecular research. The majority of currently available tools are developed for research and explorative purposes and often do not provide a complete, efficient, one-stop solution. As the focus of currently developed tools is mainly on NGS data analysis, no integrative solution for the analysis of Sanger data is provided and consequently a one-stop solution to analyze reads from both sequencing platforms is not available. We have therefore developed a new pipeline called MutAid to analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and analysis of Sanger and NGS data under a single platform. It is capable of analyzing reads from multiple patients in a single run to create a list of potential disease causing base substitutions as well as insertions and deletions. MutAid has been developed for expert and non-expert users and supports four sequencing platforms including Sanger, Illumina, 454 and Ion Torrent. Furthermore, for NGS data analysis, five read mappers including BWA, TMAP, Bowtie, Bowtie2 and GSNAP and four variant callers including GATK-HaplotypeCaller, SAMTOOLS, Freebayes and VarScan2 pipelines are supported. MutAid is freely available at https://sourceforge.net/projects/mutaid. PMID:26840129
Kiesler, Kevin M; Coble, Michael D; Hall, Thomas A; Vallone, Peter M
2014-01-01
A set of 711 samples from four U.S. population groups was analyzed using a novel mass spectrometry based method for mitochondrial DNA (mtDNA) base composition profiling. Comparison of the mass spectrometry results with Sanger sequencing derived data yielded a concordance rate of 99.97%. Length heteroplasmy was identified in 46% of samples and point heteroplasmy was observed in 6.6% of samples in the combined mass spectral and Sanger data set. Using discrimination capacity as a metric, Sanger sequencing of the full control region had the highest discriminatory power, followed by the mass spectrometry base composition method, which was more discriminating than Sanger sequencing of just the hypervariable regions. This trend is in agreement with the number of nucleotides covered by each of the three assays. Published by Elsevier Ireland Ltd.
[Molecular and prenatal diagnosis of a family with Fanconi anemia by next generation sequencing].
Gong, Zhuwen; Yu, Yongguo; Zhang, Qigang; Gu, Xuefan
2015-04-01
To provide prenatal diagnosis for a pregnant woman who had given birth to a child with Fanconi anemia with combined next-generation sequencing (NGS) and Sanger sequencing. For the affected child, potential mutations of the FANCA gene were analyzed with NGS. Suspected mutation was verified with Sanger sequencing. For prenatal diagnosis, genomic DNA was extracted from cultured fetal amniotic fluid cells and subjected to analysis of the same mutations. A low-frequency frameshifting mutation c.989_995del7 (p.H330LfsX2, inherited from his father) and a truncating mutation c.3971C>T (p.P1324L, inherited from his mother) have been identified in the affected child and considered to be pathogenic. The two mutations were subsequently verified by Sanger sequencing. Upon prenatal diagnosis, the fetus was found to carry two mutations. The combined next-generation sequencing and Sanger sequencing can reduce the time for diagnosis and identify subtypes of Fanconi anemia and the mutational sites, which has enabled reliable prenatal diagnosis of this disease.
Tzou, Philip L; Ariyaratne, Pramila; Varghese, Vici; Lee, Charlie; Rakhmanaliev, Elian; Villy, Carolin; Yee, Meiqi; Tan, Kevin; Michel, Gerd; Pinsky, Benjamin A; Shafer, Robert W
2018-06-01
The ability of next-generation sequencing (NGS) technologies to detect low frequency HIV-1 drug resistance mutations (DRMs) not detected by dideoxynucleotide Sanger sequencing has potential advantages for improved patient outcomes. We compared the performance of an in vitro diagnostic (IVD) NGS assay, the Sentosa SQ HIV genotyping assay for HIV-1 genotypic resistance testing, with Sanger sequencing on 138 protease/reverse transcriptase (RT) and 39 integrase sequences. The NGS assay used a 5% threshold for reporting low-frequency variants. The level of complete plus partial nucleotide sequence concordance between Sanger sequencing and NGS was 99.9%. Among the 138 protease/RT sequences, a mean of 6.4 DRMs was identified by both Sanger and NGS, a mean of 0.5 DRM was detected by NGS alone, and a mean of 0.1 DRM was detected by Sanger sequencing alone. Among the 39 integrase sequences, a mean of 1.6 DRMs was detected by both Sanger sequencing and NGS and a mean of 0.15 DRM was detected by NGS alone. Compared with Sanger sequencing, NGS estimated higher levels of resistance to one or more antiretroviral drugs for 18.2% of protease/RT sequences and 5.1% of integrase sequences. There was little evidence for technical artifacts in the NGS sequences, but the G-to-A hypermutation was detected in three samples. In conclusion, the IVD NGS assay evaluated in this study was highly concordant with Sanger sequencing. At the 5% threshold for reporting minority variants, NGS appeared to attain a modestly increased sensitivity for detecting low-frequency DRMs without compromising sequence accuracy. Copyright © 2018 American Society for Microbiology.
ERIC Educational Resources Information Center
Mottishaw, Jeffery D.; Erck, Adam R.; Kramer, Jordan H.; Sun, Haoran; Koppang, Miles
2015-01-01
Frederick Sanger's early work on protein sequencing through the use of colorimetric labeling combined with liquid chromatography involves an important nucleophilic aromatic substitution (S[subscript N]Ar) reaction in which the N-terminus of a protein is tagged with Sanger's reagent. Understanding the inherent differences between this S[subscript…
Altimari, Annalisa; de Biase, Dario; De Maglio, Giovanna; Gruppioni, Elisa; Capizzi, Elisa; Degiovanni, Alessio; D’Errico, Antonia; Pession, Annalisa; Pizzolitto, Stefano; Fiorentino, Michelangelo; Tallini, Giovanni
2013-01-01
Detection of KRAS mutations in archival pathology samples is critical for therapeutic appropriateness of anti-EGFR monoclonal antibodies in colorectal cancer. We compared the sensitivity, specificity, and accuracy of Sanger sequencing, ARMS-Scorpion (TheraScreen®) real-time polymerase chain reaction (PCR), pyrosequencing, chip array hybridization, and 454 next-generation sequencing to assess KRAS codon 12 and 13 mutations in 60 nonconsecutive selected cases of colorectal cancer. Twenty of the 60 cases were detected as wild-type KRAS by all methods with 100% specificity. Among the 40 mutated cases, 13 were discrepant with at least one method. The sensitivity was 85%, 90%, 93%, and 92%, and the accuracy was 90%, 93%, 95%, and 95% for Sanger sequencing, TheraScreen real-time PCR, pyrosequencing, and chip array hybridization, respectively. The main limitation of Sanger sequencing was its low analytical sensitivity, whereas TheraScreen real-time PCR, pyrosequencing, and chip array hybridization showed higher sensitivity but suffered from the limitations of predesigned assays. Concordance between the methods was k = 0.79 for Sanger sequencing and k > 0.85 for the other techniques. Tumor cell enrichment correlated significantly with the abundance of KRAS-mutated deoxyribonucleic acid (DNA), evaluated as ΔCt for TheraScreen real-time PCR (P = 0.03), percentage of mutation for pyrosequencing (P = 0.001), ratio for chip array hybridization (P = 0.003), and percentage of mutation for 454 next-generation sequencing (P = 0.004). Also, 454 next-generation sequencing showed the best cross correlation for quantification of mutation abundance compared with all the other methods (P < 0.001). Our comparison showed the superiority of next-generation sequencing over the other techniques in terms of sensitivity and specificity. Next-generation sequencing will replace Sanger sequencing as the reference technique for diagnostic detection of KRAS mutation in archival tumor tissues. PMID:23950653
Experience of targeted Usher exome sequencing as a clinical test
Besnard, Thomas; García-García, Gema; Baux, David; Vaché, Christel; Faugère, Valérie; Larrieu, Lise; Léonard, Susana; Millan, Jose M; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise
2014-01-01
We show that massively parallel targeted sequencing of 19 genes provides a new and reliable strategy for molecular diagnosis of Usher syndrome (USH) and nonsyndromic deafness, particularly appropriate for these disorders characterized by a high clinical and genetic heterogeneity and a complex structure of several of the genes involved. A series of 71 patients including Usher patients previously screened by Sanger sequencing plus newly referred patients was studied. Ninety-eight percent of the variants previously identified by Sanger sequencing were found by next-generation sequencing (NGS). NGS proved to be efficient as it offers analysis of all relevant genes which is laborious to reach with Sanger sequencing. Among the 13 newly referred Usher patients, both mutations in the same gene were identified in 77% of cases (10 patients) and one candidate pathogenic variant in two additional patients. This work can be considered as pilot for implementing NGS for genetically heterogeneous diseases in clinical service. PMID:24498627
Singh, Aditya; Bhatia, Prateek
2016-12-01
Sanger sequencing platforms, such as applied biosystems instruments, generate chromatogram files. Generally, for 1 region of a sequence, we use both forward and reverse primers to sequence that area, in that way, we have 2 sequences that need to be aligned and a consensus generated before mutation detection studies. This work is cumbersome and takes time, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. In full capabilities of Automated Mutation Analysis Pipeline (ASAP), it is able to read "*.ab1" chromatogram files through command line interface, convert it to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All files are created and can be used for further analysis. ASAP is available as Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
Paul, Fiona; Otte, Jürgen; Schmitt, Imke; Dal Grande, Francesco
2018-06-05
The implementation of HTS (high-throughput sequencing) approaches is rapidly changing our understanding of the lichen symbiosis, by uncovering high bacterial and fungal diversity, which is often host-specific. Recently, HTS methods revealed the presence of multiple photobionts inside a single thallus in several lichen species. This differs from Sanger technology, which typically yields a single, unambiguous algal sequence per individual. Here we compared HTS and Sanger methods for estimating the diversity of green algal symbionts within lichen thalli using 240 lichen individuals belonging to two species of lichen-forming fungi. According to HTS data, Sanger technology consistently yielded the most abundant photobiont sequence in the sample. However, if the second most abundant photobiont exceeded 30% of the total HTS reads in a sample, Sanger sequencing generally failed. Our results suggest that most lichen individuals in the two analyzed species, Lasallia hispanica and L. pustulata, indeed contain a single, predominant green algal photobiont. We conclude that Sanger sequencing is a valid approach to detect the dominant photobionts in lichen individuals and populations. We discuss which research areas in lichen ecology and evolution will continue to benefit from Sanger sequencing, and which areas will profit from HTS approaches to assessing symbiont diversity.
Comparison of the Equine Reference Sequence with Its Sanger Source Data and New Illumina Reads
Rebolledo-Mendez, Jovan; Hestand, Matthew S.; Coleman, Stephen J.; Zeng, Zheng; Orlando, Ludovic; MacLeod, James N.; Kalbfleisch, Ted
2015-01-01
The reference assembly for the domestic horse, EquCab2, published in 2009, was built using approximately 30 million Sanger reads from a Thoroughbred mare named Twilight. Contiguity in the assembly was facilitated using nearly 315 thousand BAC end sequences from Twilight’s half brother Bravo. Since then, it has served as the foundation for many genome-wide analyses that include not only the modern horse, but ancient horses and other equid species as well. As data mapped to this reference has accumulated, consistent variation between mapped datasets and the reference, in terms of regions with no read coverage, single nucleotide variants, and small insertions/deletions have become apparent. In many cases, it is not clear whether these differences are the result of true sequence variation between the research subjects’ and Twilight’s genome or due to errors in the reference. EquCab2 is regarded as “The Twilight Assembly.” The objective of this study was to identify inconsistencies between the EquCab2 assembly and the source Twilight Sanger data used to build it. To that end, the original Sanger and BAC end reads have been mapped back to this equine reference and assessed with the addition of approximately 40X coverage of new Illumina Paired-End sequence data. The resulting mapped datasets identify those regions with low Sanger read coverage, as well as variation in genomic content that is not consistent with either the original Twilight Sanger data or the new genomic sequence data generated from Twilight on the Illumina platform. As the haploid EquCab2 reference assembly was created using Sanger reads derived largely from a single individual, the vast majority of variation detected in a mapped dataset comprised of those same Sanger reads should be heterozygous. In contrast, homozygous variations would represent either errors in the reference or contributions from Bravo's BAC end sequences. Our analysis identifies 720,843 homozygous discrepancies between new, high throughput genomic sequence data generated for Twilight and the EquCab2 reference assembly. Most of these represent errors in the assembly, while approximately 10,000 are demonstrated to be contributions from another horse. Other results are presented that include the binary alignment map file of the mapped Sanger reads, a list of variants identified as discrepancies between the source data and resulting reference, and a BED annotation file that lists the regions of the genome whose consensus was likely derived from low coverage alignments. PMID:26107638
[Genetic analysis of two children patients affected with CHARGE syndrome].
Li, Guoqiang; Li, Niu; Xu, Yufei; Li, Juan; Ding, Yu; Shen, Yiping; Wang, Xiumin; Wang, Jian
2018-04-10
To analyze two Chinese pediatric patients with multiple malformations and growth and development delay. Both patients were subjected to targeted gene sequencing, and the results were analyzed with Ingenuity Variant Analysis software. Suspected pathogenic variations were verified by Sanger sequencing. High-throughput sequencing showed that both patients have carried heterozygous variants of the CHD7 gene. Patient 1 carried a nonsense mutation in exon 36 (c.7957C>T, p.Arg2653*), while patient 2 carried a nonsense mutation of exon 2 (c.718C>T, p.Gln240*). Sanger sequencing confirmed the above mutations in both patients, while their parents were of wild-type for the corresponding sites, indicating that the two mutations have happened de novo. Two patients were diagnosed with CHARGE syndrome by high-throughput sequencing.
[Two novel pathogenic mutations of GAN gene identified in a patient with giant axonal neuropathy].
Wang, Juan; Ma, Qingwen; Cai, Qin; Liu, Yanna; Wang, Wei; Ren, Zhaorui
2016-06-01
To explore the disease-causing mutations in a patient suspected for giant axonal neuropathy(GAN). Target sequence capture sequencing was used to screen potential mutations in genomic DNA extracted from peripheral blood sample of the patient. Sanger sequencing was applied to confirm the detected mutation. The mutation was verified among 400 GAN alleles from 200 healthy individuals by Sanger sequencing. The function of the mutations was predicted by bioinformatics analysis. The patient was identified as a compound heterozygote carrying two novel pathogenic GAN mutations, i.e., c.778G>T (p.Glu260Ter) and c.277G>A (p.Gly93Arg). Sanger sequencing confirmed that the c.778G>T (p.Glu260Ter) mutation was inherited from his father, while c.277G>A (p.Gly93Arg) was inherited from his mother. The same mutations was not found in the 200 healthy individuals. Bioinformatics analysis predicted that the two mutations probably caused functional abnormality of gigaxonin. Two novel GAN mutations were detected in a patient with GAN. Both mutations are pathogenic and can cause abnormalities of gigaxonin structure and function, leading to pathogenesis of GAN. The results may also offer valuable information for similar diseases.
Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc'h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine
2017-01-01
Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus's but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies.
Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc’h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine
2017-01-01
Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus’s but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies. PMID:28362878
Revollo, Javier; Wang, Yiying; McKinzie, Page; Dad, Azra; Pearce, Mason; Heflich, Robert H; Dobrovolsky, Vasily N
2017-12-01
We used Sanger sequencing and next generation sequencing (NGS) for analysis of mutations in the endogenous X-linked Pig-a gene of clonally expanded L5178YTk +/- cells. The clones developed from single cells that were sorted on a flow cytometer based upon the expression pattern of the GPI-anchored marker, CD90, on their surface. CD90-deficient and CD90-proficient cells were sorted from untreated cultures and CD90-deficient cells were sorted from cultures treated with benzo[a]pyrene (B[a]P). Pig-a mutations were identified in all clones developed from CD90-deficient cells; no Pig-a mutations were found in clones of CD90-proficient cells. The spectrum of B[a]P-induced Pig-a mutations was dominated by basepair substitutions, small insertions and deletions at G:C, or at sequences rich in G:C content. We observed high concordance between Pig-a mutations determined by Sanger sequencing and by NGS, but NGS was able to identify mutations in samples that were difficult to analyze by Sanger sequencing (e.g., mixtures of two mutant clones). Overall, the NGS method is a cost and labor efficient high throughput approach for analysis of a large number of mutant clones. Published by Elsevier B.V.
van Huet, Ramon A. C.; Pierrache, Laurence H.M.; Meester-Smoor, Magda A.; Klaver, Caroline C.W.; van den Born, L. Ingeborgh; Hoyng, Carel B.; de Wijs, Ilse J.; Collin, Rob W. J.; Hoefsloot, Lies H.
2015-01-01
Purpose To determine the efficacy of multiple versions of a commercially available arrayed primer extension (APEX) microarray chip for autosomal recessive retinitis pigmentosa (arRP). Methods We included 250 probands suspected of arRP who were genetically analyzed with the APEX microarray between January 2008 and November 2013. The mode of inheritance had to be autosomal recessive according to the pedigree (including isolated cases). If the microarray identified a heterozygous mutation, we performed Sanger sequencing of exons and exon–intron boundaries of that specific gene. The efficacy of this microarray chip with the additional Sanger sequencing approach was determined by the percentage of patients that received a molecular diagnosis. We also collected data from genetic tests other than the APEX analysis for arRP to provide a detailed description of the molecular diagnoses in our study cohort. Results The APEX microarray chip for arRP identified the molecular diagnosis in 21 (8.5%) of the patients in our cohort. Additional Sanger sequencing yielded a second mutation in 17 patients (6.8%), thereby establishing the molecular diagnosis. In total, 38 patients (15.2%) received a molecular diagnosis after analysis using the microarray and additional Sanger sequencing approach. Further genetic analyses after a negative result of the arRP microarray (n = 107) resulted in a molecular diagnosis of arRP (n = 23), autosomal dominant RP (n = 5), X-linked RP (n = 2), and choroideremia (n = 1). Conclusions The efficacy of the commercially available APEX microarray chips for arRP appears to be low, most likely caused by the limitations of this technique and the genetic and allelic heterogeneity of RP. Diagnostic yields up to 40% have been reported for next-generation sequencing (NGS) techniques that, as expected, thereby outperform targeted APEX analysis. PMID:25999674
STINGRAY: system for integrated genomic resources and analysis.
Wagner, Glauber; Jardim, Rodrigo; Tschoeke, Diogo A; Loureiro, Daniel R; Ocaña, Kary A C S; Ribeiro, Antonio C B; Emmel, Vanessa E; Probst, Christian M; Pitaluga, André N; Grisard, Edmundo C; Cavalcanti, Maria C; Campos, Maria L M; Mattoso, Marta; Dávila, Alberto M R
2014-03-07
The STINGRAY system has been conceived to ease the tasks of integrating, analyzing, annotating and presenting genomic and expression data from Sanger and Next Generation Sequencing (NGS) platforms. STINGRAY includes: (a) a complete and integrated workflow (more than 20 bioinformatics tools) ranging from functional annotation to phylogeny; (b) a MySQL database schema, suitable for data integration and user access control; and (c) a user-friendly graphical web-based interface that makes the system intuitive, facilitating the tasks of data analysis and annotation. STINGRAY showed to be an easy to use and complete system for analyzing sequencing data. While both Sanger and NGS platforms are supported, the system could be faster using Sanger data, since the large NGS datasets could potentially slow down the MySQL database usage. STINGRAY is available at http://stingray.biowebdb.org and the open source code at http://sourceforge.net/projects/stingray-biowebdb/.
STINGRAY: system for integrated genomic resources and analysis
2014-01-01
Background The STINGRAY system has been conceived to ease the tasks of integrating, analyzing, annotating and presenting genomic and expression data from Sanger and Next Generation Sequencing (NGS) platforms. Findings STINGRAY includes: (a) a complete and integrated workflow (more than 20 bioinformatics tools) ranging from functional annotation to phylogeny; (b) a MySQL database schema, suitable for data integration and user access control; and (c) a user-friendly graphical web-based interface that makes the system intuitive, facilitating the tasks of data analysis and annotation. Conclusion STINGRAY showed to be an easy to use and complete system for analyzing sequencing data. While both Sanger and NGS platforms are supported, the system could be faster using Sanger data, since the large NGS datasets could potentially slow down the MySQL database usage. STINGRAY is available at http://stingray.biowebdb.org and the open source code at http://sourceforge.net/projects/stingray-biowebdb/. PMID:24606808
Simpson, Jared
2018-01-24
Wellcome Trust Sanger Institute's Jared Simpson on Memory efficient sequence analysis using compressed data structures at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.
Richter, Anna; Grieu, Fabienne; Carrello, Amerigo; Amanuel, Benhur; Namdarian, Kateh; Rynska, Aleksandra; Lucas, Amanda; Michael, Victoria; Bell, Anthony; Fox, Stephen B.; Hewitt, Chelsee A.; Do, Hongdo; McArthur, Grant A.; Wong, Stephen Q.; Dobrovic, Alexander; Iacopetta, Barry
2013-01-01
Melanoma patients with BRAF mutations respond to treatment with vemurafenib, thus creating a need for accurate testing of BRAF mutation status. We carried out a blinded study to evaluate various BRAF mutation testing methodologies in the clinical setting. Formalin-fixed, paraffin-embedded melanoma samples were macrodissected before screening for mutations using Sanger sequencing, single-strand conformation analysis (SSCA), high resolution melting analysis (HRM) and competitive allele-specific TaqMan® PCR (CAST-PCR). Concordance of 100% was observed between the Sanger sequencing, SSCA and HRM techniques. CAST-PCR gave rapid and accurate results for the common V600E and V600K mutations, however additional assays are required to detect rarer BRAF mutation types found in 3–4% of melanomas. HRM and SSCA followed by Sanger sequencing are effective two-step strategies for the detection of BRAF mutations in the clinical setting. CAST-PCR was useful for samples with low tumour purity and may also be a cost-effective and robust method for routine diagnostics. PMID:23584600
Mu, Wenbo; Lu, Hsiao-Mei; Chen, Jefferey; Li, Shuwei; Elliott, Aaron M
2016-11-01
Next-generation sequencing (NGS) has rapidly replaced Sanger sequencing as the method of choice for diagnostic gene-panel testing. For hereditary-cancer testing, the technical sensitivity and specificity of the assay are paramount as clinicians use results to make important clinical management and treatment decisions. There is significant debate within the diagnostics community regarding the necessity of confirming NGS variant calls by Sanger sequencing, considering that numerous laboratories report having 100% specificity from the NGS data alone. Here we report our results from 20,000 hereditary-cancer NGS panels spanning 47 genes, in which all 7845 nonpolymorphic variants were Sanger- sequenced. Of these, 98.7% were concordant between NGS and Sanger sequencing and 1.3% were identified as NGS false-positives, located mainly in complex genomic regions (A/T-rich regions, G/C-rich regions, homopolymer stretches, and pseudogene regions). Simulating a false-positive rate of zero by adjusting the variant-calling quality-score thresholds decreased the sensitivity of the assay from 100% to 97.8%, resulting in the missed detection of 176 Sanger-confirmed variants, the majority in complex genomic regions (n = 114) and mosaic mutations (n = 7). The data illustrate the importance of setting quality thresholds for panel testing only after thousands of samples have been processed and the necessity of Sanger confirmation of NGS variants to maintain the highest possible sensitivity. Copyright © 2016 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Tracking B-Cell Repertoires and Clonal Histories in Normal and Malignant Lymphocytes.
Weston-Bell, Nicola J; Cowan, Graeme; Sahota, Surinder S
2017-01-01
Methods for tracking B-cell repertoires and clonal history in normal and malignant B-cells based on immunoglobulin variable region (IGV) gene analysis have developed rapidly with the advent of massive parallel next-generation sequencing (mpNGS) protocols. mpNGS permits a depth of analysis of IGV genes not hitherto feasible, and presents challenges of bioinformatics analysis, which can be readily met by current pipelines. This strategy offers a potential resolution of B-cell usage at a depth that may capture fully the natural state, in a given biological setting. Conventional methods based on RT-PCR amplification and Sanger sequencing are also available where mpNGS is not accessible. Each method offers distinct advantages. Conventional methods for IGV gene sequencing are readily adaptable to most laboratories and provide an ease of analysis to capture salient features of B-cell use. This chapter describes two methods in detail for analysis of IGV genes, mpNGS and conventional RT-PCR with Sanger sequencing.
Zopf, Agnes; Raim, Roman; Danzer, Martin; Niklas, Norbert; Spilka, Rita; Pröll, Johannes; Gabriel, Christian; Nechansky, Andreas; Roucka, Markus
2015-03-01
The detection of KRAS mutations in codons 12 and 13 is critical for anti-EGFR therapy strategies; however, only those methodologies with high sensitivity, specificity, and accuracy as well as the best cost and turnaround balance are suitable for routine daily testing. Here we compared the performance of compact sequencing using the novel hybcell technology with 454 next-generation sequencing (454-NGS), Sanger sequencing, and pyrosequencing, using an evaluation panel of 35 specimens. A total of 32 mutations and 10 wild-type cases were reported using 454-NGS as the reference method. Specificity ranged from 100% for Sanger sequencing to 80% for pyrosequencing. Sanger sequencing and hybcell-based compact sequencing achieved a sensitivity of 96%, whereas pyrosequencing had a sensitivity of 88%. Accuracy was 97% for Sanger sequencing, 85% for pyrosequencing, and 94% for hybcell-based compact sequencing. Quantitative results were obtained for 454-NGS and hybcell-based compact sequencing data, resulting in a significant correlation (r = 0.914). Whereas pyrosequencing and Sanger sequencing were not able to detect multiple mutated cell clones within one tumor specimen, 454-NGS and the hybcell-based compact sequencing detected multiple mutations in two specimens. Our comparison shows that the hybcell-based compact sequencing is a valuable alternative to state-of-the-art methodologies used for detection of clinically relevant point mutations.
Hanriot, Lucie; Keime, Céline; Gay, Nadine; Faure, Claudine; Dossat, Carole; Wincker, Patrick; Scoté-Blachon, Céline; Peyron, Christelle; Gandrillon, Olivier
2008-01-01
Background "Open" transcriptome analysis methods allow to study gene expression without a priori knowledge of the transcript sequences. As of now, SAGE (Serial Analysis of Gene Expression), LongSAGE and MPSS (Massively Parallel Signature Sequencing) are the mostly used methods for "open" transcriptome analysis. Both LongSAGE and MPSS rely on the isolation of 21 pb tag sequences from each transcript. In contrast to LongSAGE, the high throughput sequencing method used in MPSS enables the rapid sequencing of very large libraries containing several millions of tags, allowing deep transcriptome analysis. However, a bias in the complexity of the transcriptome representation obtained by MPSS was recently uncovered. Results In order to make a deep analysis of mouse hypothalamus transcriptome avoiding the limitation introduced by MPSS, we combined LongSAGE with the Solexa sequencing technology and obtained a library of more than 11 millions of tags. We then compared it to a LongSAGE library of mouse hypothalamus sequenced with the Sanger method. Conclusion We found that Solexa sequencing technology combined with LongSAGE is perfectly suited for deep transcriptome analysis. In contrast to MPSS, it gives a complex representation of transcriptome as reliable as a LongSAGE library sequenced by the Sanger method. PMID:18796152
Meng, Lanlan; Du, Juan; Li, Wen; Lu, Guangxiu; Tan, Yueqiu
2017-08-10
To determine the molecular etiology for a Chinese pedigree affected with epidermolysis bullosa simplex (EBS). Target region sequencing using a hereditary epidermolysis bullosa capture array combined with Sanger sequencing and bioinformatics analysis were used. Mutation taster, PolyPhen-2, Provean, and SIFT software and NCBI online were employed to assess the pathogenicity and conservation of detected mutations. One hundred healthy unrelated individuals were used as controls. Target region sequencing showed that the proband has carried a unreported heterozygous c.1234A>G (p.Ile412Val) mutation of the KRT14 gene, which was confirmed by Sanger sequencing in other 8 affected individuals but not among healthy members of the pedigree. Bioinformatics analysis indicated that the mutation is highly pathogenic. Remarkably, 3 members of the family (2 affected and 1 unaffected) have carried a heterozygous c.1237G>A (p.Ala413Thr) mutation of the KRT14 gene, which was collected in Human Gene Mutation Database (HGMD). Bioinformatics analysis indicated that the mutation may not be pathogenic. Both mutations were not detected among the 100 healthy controls. The novel c.1234A>G(p.Ile412Val) mutation of the KRT14 gene is probably responsible for the disease, while c.1237G>A (p.Ala413Thr) mutation of KRT14 gene may be a polymorphism. Compared with Sanger sequencing, target region capture sequencing is more efficient and can significantly reduce the cost of genetic testing for EBS.
Martinuzzi, Claudia; Pastorino, Lorenza; Andreotti, Virginia; Garuti, Anna; Minuto, Michele; Fiocca, Roberto; Bianchi-Scarrà, Giovanna; Ghiorzo, Paola; Grillo, Federica; Mastracci, Luca
2016-09-01
The optimal method for BRAF mutation detection remains to be determined despite advances in molecular detection techniques. The aim of this study was to compare, against classical Sanger sequencing, the diagnostic performance of two of the most recently developed, highly sensitive methods: BRAF V600E immunohistochemistry (IHC) and peptide nucleic-acid (PNA)-clamp qPCR. BRAF exon 15 mutations were searched in formalin-fixed paraffin-embedded tissues from 86 papillary thyroid carcinoma using the three methods. The limits of detection of Sanger sequencing in borderline or discordant cases were quantified by next generation sequencing. BRAF mutations were found in 74.4 % of cases by PNA, in 71 % of cases by IHC, and in 64 % of cases by Sanger sequencing. Complete concordance for the three methods was observed in 80 % of samples. Better concordance was observed with the combination of two methods, particularly PNA and IHC (59/64) (92 %), while the combination of PNA and Sanger was concordant in 55 cases (86 %). Sensitivity of the three methods was 99 % for PNA, 94.2 % for IHC, and 89.5 % for Sanger. Our data show that IHC could be used as a cost-effective, first-line method for BRAF V600E detection in daily practice, followed by PNA analysis in negative or uninterpretable cases, as the most efficient method. PNA-clamp quantitative PCR is highly sensitive and complementary to IHC as it also recognizes other mutations besides V600E and it is suitable for diagnostic purposes.
Hwang, Sang Mee; Lee, Ki Chan; Lee, Min Seob; Park, Kyoung Un
2018-01-01
Transition to next generation sequencing (NGS) for BRCA1 / BRCA2 analysis in clinical laboratories is ongoing but different platforms and/or data analysis pipelines give different results resulting in difficulties in implementation. We have evaluated the Ion Personal Genome Machine (PGM) Platforms (Ion PGM, Ion PGM Dx, Thermo Fisher Scientific) for the analysis of BRCA1 /2. The results of Ion PGM with OTG-snpcaller, a pipeline based on Torrent mapping alignment program and Genome Analysis Toolkit, from 75 clinical samples and 14 reference DNA samples were compared with Sanger sequencing for BRCA1 / BRCA2 . Ten clinical samples and 14 reference DNA samples were additionally sequenced by Ion PGM Dx with Torrent Suite. Fifty types of variants including 18 pathogenic or variants of unknown significance were identified from 75 clinical samples and known variants of the reference samples were confirmed by Sanger sequencing and/or NGS. One false-negative results were present for Ion PGM/OTG-snpcaller for an indel variant misidentified as a single nucleotide variant. However, eight discordant results were present for Ion PGM Dx/Torrent Suite with both false-positive and -negative results. A 40-bp deletion, a 4-bp deletion and a 1-bp deletion variant was not called and a false-positive deletion was identified. Four other variants were misidentified as another variant. Ion PGM/OTG-snpcaller showed acceptable performance with good concordance with Sanger sequencing. However, Ion PGM Dx/Torrent Suite showed many discrepant results not suitable for use in a clinical laboratory, requiring further optimization of the data analysis for calling variants.
Next generation sequencing as a useful tool in the diagnostics of mosaicism in Alport syndrome.
Beicht, Sonja; Strobl-Wildemann, Gertrud; Rath, Sabine; Wachter, Oliver; Alberer, Martin; Kaminsky, Elke; Weber, Lutz T; Hinrichsen, Tanja; Klein, Hanns-Georg; Hoefele, Julia
2013-09-10
Alport syndrome (ATS) is a progressive hereditary nephropathy characterized by hematuria and/or proteinuria with structural defects of the glomerular basement membrane. It can be associated with extrarenal manifestations (high-tone sensorineural hearing loss and ocular abnormalities). Somatic mutations in COL4A5 (X-linked), COL4A3 and COL4A4 genes (both autosomal recessive and autosomal dominant) cause Alport syndrome. Somatic mosaicism in Alport patients is very rare. The reason for this may be due to the difficulty of detection. We report the case of a boy and his mother who presented with Alport syndrome. Mutational analysis showed the novel hemizygote pathogenic mutation c.2396-1G>A (IVS29-1G>A) at the splice acceptor site of the intron 29 exon 30 boundary of the COL4A5 gene in the boy. The mutation in the mother would not have been detected by Sanger sequencing without the knowledge of the mutational analysis result of her son. Further investigation of the mother using next generation sequencing showed somatic mosaicism and implied potential germ cell mosaicism. The mutation in the mother has most likely occurred during early embryogenesis. Analysis of tissue of different embryonic origin in the mother confirmed mosaicism in both mesoderm and ectoderm. Low grade mosaicism is very difficult to detect by Sanger sequencing. Next generation sequencing is increasingly used in the diagnostics and might improve the detection of mosaicism. In the case of definite clinical symptoms of ATS and missing detection of a mutation by Sanger sequencing, mutational analysis should be performed by next generation sequencing. Copyright © 2013 Elsevier B.V. All rights reserved.
Ammann, Sandra; Lehmberg, Kai; Zur Stadt, Udo; Klemann, Christian; Bode, Sebastian F N; Speckmann, Carsten; Janka, Gritta; Wustrau, Katharina; Rakhmanov, Mirzokhid; Fuchs, Ilka; Hennies, Hans C; Ehl, Stephan
2017-11-01
We report our experience in using flow cytometry-based immunological screening prospectively as a decision tool for the use of genetic studies in the diagnostic approach to patients with hemophagocytic lymphohistiocytosis (HLH). We restricted genetic analysis largely to patients with abnormal immunological screening, but included whole exome sequencing (WES) for those with normal findings upon Sanger sequencing. Among 290 children with suspected HLH analyzed between 2010 and 2014 (including 17 affected, but asymptomatic siblings), 87/162 patients with "full" HLH and 79/111 patients with "incomplete/atypical" HLH had normal immunological screening results. In 10 patients, degranulation could not be tested. Among the 166 patients with normal screening, genetic analysis was not performed in 107 (all with uneventful follow-up), while 154 single gene tests by Sanger sequencing in the remaining 59 patients only identified a single atypical CHS patient. Flow cytometry correctly predicted all 29 patients with FHL-2, XLP1 or 2. Among 85 patients with defective NK degranulation (including 13 asymptomatic siblings), 70 were Sanger sequenced resulting in a genetic diagnosis in 55 (79%). Eight patients underwent WES, revealing mutations in two known and one unknown cytotoxicity genes and one metabolic disease. FHL3 was the most frequent genetic diagnosis. Immunological screening provided an excellent decision tool for the need and depth of genetic analysis of HLH patients and provided functionally relevant information for rapid patient classification, contributing to a significant reduction in the time from diagnosis to transplantation in recent years.
Paparini, Andrea; Gofton, Alexander; Yang, Rongchang; White, Nicole; Bunce, Michael; Ryan, Una M
2015-01-01
Cryptosporidium is an important enteric pathogen that infects a wide range of humans and animals. Rapid and reliable detection and characterisation methods are essential for understanding the transmission dynamics of the parasite. Sanger sequencing, and high-throughput sequencing (HTS) on an Ion Torrent platform, were compared with each other for their sensitivity and accuracy in detecting and characterising 25 Cryptosporidium-positive human and animal faecal samples. Ion Torrent reads (n = 123,857) were obtained at both 18S rRNA and actin loci for 21 of the 25 samples. Of these, one isolate at the actin locus (Cattle 05) and three at the 18S rRNA locus (HTS 10, HTS 11 and HTS 12), suffered PCR drop-out (i.e. PCR failures) when using fusion-tagged PCR. Sanger sequences were obtained for both loci for 23 of the 25 samples and showed good agreement with Ion Torrent-based genotyping. Two samples both from pythons (SK 02 and SK 05) produced mixed 18S and actin chromatograms by Sanger sequencing but were clearly identified by Ion Torrent sequencing as C. muris. One isolate (SK 03) was typed as C. muris by Sanger sequencing but was identified as a mixed C. muris and C. tyzzeri infection by HTS. 18S rRNA Type B sequences were identified in 4/6 C. parvum isolates when deep sequenced but were undetected in Sanger sequencing. Sanger was cheaper than Ion Torrent when sequencing a small numbers of samples, but when larger numbers of samples are considered (n = 60), the costs were comparative. Fusion-tagged amplicon based approaches are a powerful way of approaching mixtures, the only draw-back being the loss of PCR efficiency on low-template samples when using primers coupled to MID tags and adaptors. Taken together these data show that HTS has excellent potential for revealing the "true" composition of species/types in a Cryptosporidium infection, but that HTS workflows need to be carefully developed to ensure sensitivity, accuracy and contamination are controlled. Copyright © 2015 Elsevier Inc. All rights reserved.
Next Generation Sequencing at the University of Chicago Genomics Core
DOE Office of Scientific and Technical Information (OSTI.GOV)
Faber, Pieter
2013-04-24
The University of Chicago Genomics Core provides University of Chicago investigators (and external clients) access to State-of-the-Art genomics capabilities: next generation sequencing, Sanger sequencing / genotyping and micro-arrays (gene expression, genotyping, and methylation). The current presentation will highlight our capabilities in the area of ultra-high throughput sequencing analysis.
Buttitta, Fiamma; Felicioni, Lara; Del Grammastro, Maela; Filice, Giampaolo; Di Lorito, Alessia; Malatesta, Sara; Viola, Patrizia; Centi, Irene; D'Antuono, Tommaso; Zappacosta, Roberta; Rosini, Sandra; Cuccurullo, Franco; Marchetti, Antonio
2013-02-01
The therapeutic choice for patients with lung adenocarcinoma depends on the presence of EGF receptor (EGFR) mutations. In many cases, only cytologic samples are available for molecular diagnosis. Bronchoalveolar lavage (BAL) and pleural fluid, which represent a considerable proportion of cytologic specimens, cannot always be used for molecular testing because of low rate of tumor cells. We tested the feasibility of EGFR mutation analysis on BAL and pleural fluid samples by next-generation sequencing (NGS), an innovative and extremely sensitive platform. The study was devised to extend the EGFR test to those patients who could not get it due to the paucity of biologic material. A series of 830 lung cytology specimens was used to select 48 samples (BAL and pleural fluid) from patients with EGFR mutations in resected tumors. These samples included 36 cases with 0.3% to 9% of neoplastic cells (series A) and 12 cases without evidence of tumor (series B). All samples were analyzed by Sanger sequencing and NGS on 454 Roche platform. A mean of 21,130 ± 2,370 sequences per sample were obtained by NGS. In series A, EGFR mutations were detected in 16% of cases by Sanger sequencing and in 81% of cases by NGS. Seventy-seven percent of cases found to be negative by Sanger sequencing showed mutations by NGS. In series B, all samples were negative for EGFR mutation by Sanger sequencing whereas 42% of them were positive by NGS. The very sensitive EGFR-NGS assay may open up to the possibility of specific treatments for patients otherwise doomed to re-biopsies or nontargeted therapies.
Neupauerová, Jana; Grečmalová, Dagmar; Seeman, Pavel; Laššuthová, Petra
2016-05-01
We describe a patient with early onset severe axonal Charcot-Marie-Tooth disease (CMT2) with dominant inheritance, in whom Sanger sequencing failed to detect a mutation in the mitofusin 2 (MFN2) gene because of a single nucleotide polymorphism (rs2236057) under the PCR primer sequence. The severe early onset phenotype and the family history with severely affected mother (died after delivery) was very suggestive of CMT2A and this suspicion was finally confirmed by a MFN2 mutation. The mutation p.His361Tyr was later detected in the patient by massively parallel sequencing with a gene panel for hereditary neuropathies. According to this information, new primers for amplification and sequencing were designed which bind away from the polymorphic sites of the patient's DNA. Sanger sequencing with these new primers then confirmed the heterozygous mutation in the MFN2 gene in this patient. This case report shows that massively parallel sequencing may in some rare cases be more sensitive than Sanger sequencing and highlights the importance of accurate primer design which requires special attention. © 2016 John Wiley & Sons Ltd/University College London.
Pitfalls in genetic testing: the story of missed SCN1A mutations.
Djémié, Tania; Weckhuysen, Sarah; von Spiczak, Sarah; Carvill, Gemma L; Jaehn, Johanna; Anttonen, Anna-Kaisa; Brilstra, Eva; Caglayan, Hande S; de Kovel, Carolien G; Depienne, Christel; Gaily, Eija; Gennaro, Elena; Giraldez, Beatriz G; Gormley, Padhraig; Guerrero-López, Rosa; Guerrini, Renzo; Hämäläinen, Eija; Hartmann, Corinna; Hernandez-Hernandez, Laura; Hjalgrim, Helle; Koeleman, Bobby P C; Leguern, Eric; Lehesjoki, Anna-Elina; Lemke, Johannes R; Leu, Costin; Marini, Carla; McMahon, Jacinta M; Mei, Davide; Møller, Rikke S; Muhle, Hiltrud; Myers, Candace T; Nava, Caroline; Serratosa, Jose M; Sisodiya, Sanjay M; Stephani, Ulrich; Striano, Pasquale; van Kempen, Marjan J A; Verbeek, Nienke E; Usluer, Sunay; Zara, Federico; Palotie, Aarno; Mefford, Heather C; Scheffer, Ingrid E; De Jonghe, Peter; Helbig, Ingo; Suls, Arvid
2016-07-01
Sanger sequencing, still the standard technique for genetic testing in most diagnostic laboratories and until recently widely used in research, is gradually being complemented by next-generation sequencing (NGS). No single mutation detection technique is however perfect in identifying all mutations. Therefore, we wondered to what extent inconsistencies between Sanger sequencing and NGS affect the molecular diagnosis of patients. Since mutations in SCN1A, the major gene implicated in epilepsy, are found in the majority of Dravet syndrome (DS) patients, we focused on missed SCN1A mutations. We sent out a survey to 16 genetic centers performing SCN1A testing. We collected data on 28 mutations initially missed using Sanger sequencing. All patients were falsely reported as SCN1A mutation-negative, both due to technical limitations and human errors. We illustrate the pitfalls of Sanger sequencing and most importantly provide evidence that SCN1A mutations are an even more frequent cause of DS than already anticipated.
Chin, Ephrem L H; da Silva, Cristina; Hegde, Madhuri
2013-02-19
Detecting mutations in disease genes by full gene sequence analysis is common in clinical diagnostic laboratories. Sanger dideoxy terminator sequencing allows for rapid development and implementation of sequencing assays in the clinical laboratory, but it has limited throughput, and due to cost constraints, only allows analysis of one or at most a few genes in a patient. Next-generation sequencing (NGS), on the other hand, has evolved rapidly, although to date it has mainly been used for large-scale genome sequencing projects and is beginning to be used in the clinical diagnostic testing. One advantage of NGS is that many genes can be analyzed easily at the same time, allowing for mutation detection when there are many possible causative genes for a specific phenotype. In addition, regions of a gene typically not tested for mutations, like deep intronic and promoter mutations, can also be detected. Here we use 20 previously characterized Sanger-sequenced positive controls in disease-causing genes to demonstrate the utility of NGS in a clinical setting using standard PCR based amplification to assess the analytical sensitivity and specificity of the technology for detecting all previously characterized changes (mutations and benign SNPs). The positive controls chosen for validation range from simple substitution mutations to complex deletion and insertion mutations occurring in autosomal dominant and recessive disorders. The NGS data was 100% concordant with the Sanger sequencing data identifying all 119 previously identified changes in the 20 samples. We have demonstrated that NGS technology is ready to be deployed in clinical laboratories. However, NGS and associated technologies are evolving, and clinical laboratories will need to invest significantly in staff and infrastructure to build the necessary foundation for success.
Lim, Hassol; Park, Young-Mi; Lee, Jong-Keuk; Taek Lim, Hyun
2016-10-01
To present an efficient and successful application of a single-exome sequencing study in a family clinically diagnosed with X-linked retinitis pigmentosa. Exome sequencing study based on clinical examination data. An 8-year-old proband and his family. The proband and his family members underwent comprehensive ophthalmologic examinations. Exome sequencing was undertaken in the proband using Agilent SureSelect Human All Exon Kit and Illumina HiSeq 2000 platform. Bioinformatic analysis used Illumina pipeline with Burrows-Wheeler Aligner-Genome Analysis Toolkit (BWA-GATK), followed by ANNOVAR to perform variant functional annotation. All variants passing filter criteria were validated by Sanger sequencing to confirm familial segregation. Analysis of exome sequence data identified a novel frameshift mutation in RP2 gene resulting in a premature stop codon (c.665delC, p.Pro222fsTer237). Sanger sequencing revealed this mutation co-segregated with the disease phenotype in the child's family. We identified a novel causative mutation in RP2 from a single proband's exome sequence data analysis. This study highlights the effectiveness of the whole-exome sequencing in the genetic diagnosis of X-linked retinitis pigmentosa, over the conventional sequencing methods. Even using a single exome, exome sequencing technology would be able to pinpoint pathogenic variant(s) for X-linked retinitis pigmentosa, when properly applied with aid of adequate variant filtering strategy. Copyright © 2016 Canadian Ophthalmological Society. Published by Elsevier Inc. All rights reserved.
2018-01-01
New, as yet undiscovered aptamers for Protein A were identified by applying next generation sequencing (NGS) to a previously selected aptamer pool. This pool was obtained in a classical SELEX (Systematic Evolution of Ligands by EXponential enrichment) experiment using the FluMag-SELEX procedure followed by cloning and Sanger sequencing. PA#2/8 was identified as the only Protein A-binding aptamer from the Sanger sequence pool, and was shown to be able to bind intact cells of Staphylococcus aureus. In this study, we show the extension of the SELEX results by re-sequencing of the same aptamer pool using a medium throughput NGS approach and data analysis. Both data pools were compared. They confirm the selection of a highly complex and heterogeneous oligonucleotide pool and show consistently a high content of orphans as well as a similar relative frequency of certain sequence groups. But in contrast to the Sanger data pool, the NGS pool was clearly dominated by one sequence group containing the known Protein A-binding aptamer PA#2/8 as the most frequent sequence in this group. In addition, we found two new sequence groups in the NGS pool represented by PA-C10 and PA-C8, respectively, which also have high specificity for Protein A. Comparative affinity studies reveal differences between the aptamers and confirm that PA#2/8 remains the most potent sequence within the selected aptamer pool reaching affinities in the low nanomolar range of KD = 20 ± 1 nM. PMID:29495282
Stoltenburg, Regina; Strehlitz, Beate
2018-02-24
New, as yet undiscovered aptamers for Protein A were identified by applying next generation sequencing (NGS) to a previously selected aptamer pool. This pool was obtained in a classical SELEX (Systematic Evolution of Ligands by EXponential enrichment) experiment using the FluMag-SELEX procedure followed by cloning and Sanger sequencing. PA#2/8 was identified as the only Protein A-binding aptamer from the Sanger sequence pool, and was shown to be able to bind intact cells of Staphylococcus aureus . In this study, we show the extension of the SELEX results by re-sequencing of the same aptamer pool using a medium throughput NGS approach and data analysis. Both data pools were compared. They confirm the selection of a highly complex and heterogeneous oligonucleotide pool and show consistently a high content of orphans as well as a similar relative frequency of certain sequence groups. But in contrast to the Sanger data pool, the NGS pool was clearly dominated by one sequence group containing the known Protein A-binding aptamer PA#2/8 as the most frequent sequence in this group. In addition, we found two new sequence groups in the NGS pool represented by PA-C10 and PA-C8, respectively, which also have high specificity for Protein A. Comparative affinity studies reveal differences between the aptamers and confirm that PA#2/8 remains the most potent sequence within the selected aptamer pool reaching affinities in the low nanomolar range of K D = 20 ± 1 nM.
Liu, Yong; Cao, Yu; Li, Yaxiong; Lei, Dongyun; Li, Lin; Hou, Zong Liu; Han, Shen; Meng, Mingyao; Shi, Jianlin; Zhang, Yayong; Wang, Yi; Niu, Zhaoyi; Xie, Yanhua; Xiao, Benshan; Wang, Yuanfei; Li, Xiao; Yang, Lirong
2018-01-01
Background Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. Material/Methods Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. Results From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (P<0.05); variants in FOXL2 and MYH6 were found in patients with isolated, sporadic ASD (P<5×10−4). Conclusions This was the first study that demonstrated variants in FOXL2 and HYDIN associated with sporadic ASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations. PMID:29505555
Liu, Yong; Cao, Yu; Li, Yaxiong; Lei, Dongyun; Li, Lin; Hou, Zong Liu; Han, Shen; Meng, Mingyao; Shi, Jianlin; Zhang, Yayong; Wang, Yi; Niu, Zhaoyi; Xie, Yanhua; Xiao, Benshan; Wang, Yuanfei; Li, Xiao; Yang, Lirong; Wang, Wenju; Jiang, Lihong
2018-03-05
BACKGROUND Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. MATERIAL AND METHODS Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. RESULTS From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (P<0.05); variants in FOXL2 and MYH6 were found in patients with isolated, sporadic ASD (P<5×10^-4). CONCLUSIONS This was the first study that demonstrated variants in FOXL2 and HYDIN associated with sporadic ASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations.
LipidSeq: a next-generation clinical resequencing panel for monogenic dyslipidemias.
Johansen, Christopher T; Dubé, Joseph B; Loyzer, Melissa N; MacDonald, Austin; Carter, David E; McIntyre, Adam D; Cao, Henian; Wang, Jian; Robinson, John F; Hegele, Robert A
2014-04-01
We report the design of a targeted resequencing panel for monogenic dyslipidemias, LipidSeq, for the purpose of replacing Sanger sequencing in the clinical detection of dyslipidemia-causing variants. We also evaluate the performance of the LipidSeq approach versus Sanger sequencing in 84 patients with a range of phenotypes including extreme blood lipid concentrations as well as additional dyslipidemias and related metabolic disorders. The panel performs well, with high concordance (95.2%) in samples with known mutations based on Sanger sequencing and a high detection rate (57.9%) of mutations likely to be causative for disease in samples not previously sequenced. Clinical implementation of LipidSeq has the potential to aid in the molecular diagnosis of patients with monogenic dyslipidemias with a high degree of speed and accuracy and at lower cost than either Sanger sequencing or whole exome sequencing. Furthermore, LipidSeq will help to provide a more focused picture of monogenic and polygenic contributors that underlie dyslipidemia while excluding the discovery of incidental pathogenic clinically actionable variants in nonmetabolism-related genes, such as oncogenes, that would otherwise be identified by a whole exome approach, thus minimizing potential ethical issues.
LipidSeq: a next-generation clinical resequencing panel for monogenic dyslipidemias[S
Johansen, Christopher T.; Dubé, Joseph B.; Loyzer, Melissa N.; MacDonald, Austin; Carter, David E.; McIntyre, Adam D.; Cao, Henian; Wang, Jian; Robinson, John F.; Hegele, Robert A.
2014-01-01
We report the design of a targeted resequencing panel for monogenic dyslipidemias, LipidSeq, for the purpose of replacing Sanger sequencing in the clinical detection of dyslipidemia-causing variants. We also evaluate the performance of the LipidSeq approach versus Sanger sequencing in 84 patients with a range of phenotypes including extreme blood lipid concentrations as well as additional dyslipidemias and related metabolic disorders. The panel performs well, with high concordance (95.2%) in samples with known mutations based on Sanger sequencing and a high detection rate (57.9%) of mutations likely to be causative for disease in samples not previously sequenced. Clinical implementation of LipidSeq has the potential to aid in the molecular diagnosis of patients with monogenic dyslipidemias with a high degree of speed and accuracy and at lower cost than either Sanger sequencing or whole exome sequencing. Furthermore, LipidSeq will help to provide a more focused picture of monogenic and polygenic contributors that underlie dyslipidemia while excluding the discovery of incidental pathogenic clinically actionable variants in nonmetabolism-related genes, such as oncogenes, that would otherwise be identified by a whole exome approach, thus minimizing potential ethical issues. PMID:24503134
Finishing Using Next Generation Technologies
Van Tonder, Andries
2018-01-16
Andries van Tonder of Wellcome Trust Sanger Institute discusses a pipeline for finishing genomes to the gold standard on June 3, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.
Sukalo, Maja; Schäflein, Eva; Schanze, Ina; Everman, David B; Rezaei, Nima; Argente, Jesús; Lorda-Sanchez, Isabel; Deshpande, Charu; Takahashi, Tsutomu; Kleger, Alexander; Zenker, Martin
2017-11-01
Johanson-Blizzard syndrome (JBS, MIM #243800) is a very rare autosomal recessive disorder characterized by exocrine pancreatic insufficiency, nasal wing hypoplasia, hypodontia, and other abnormalities. JBS is caused by mutations of the UBR1 gene (MIM *605981), encoding a ubiquitin ligase of the N-end rule pathway. Molecular findings in a total of 65 unrelated patients with a clinical diagnosis of JBS who were previously screened for UBR1 mutations by Sanger sequencing were reviewed and cases lacking a disease-causing UBR1 mutation on either one or both alleles were included in this study. In order to discover mutations that are not detectable by Sanger sequencing, we designed a probe set for multiplex ligation-dependent probe amplification (MLPA) analysis of the UBR1 gene and analyzed the copy number status of all 47 UBR1 exons. Our previous studies using Sanger sequencing could detect mutations in 93.1% of 130 disease-associated UBR1 alleles. Six patients with a highly suggestive clinical diagnosis of JBS and unsolved genotype were included in this study. MLPA analysis detected six alleles harboring exon deletions/duplications, thereby raising the mutation detection rate in the entire cohort to 97.7% (127/130 alleles). We conclude that single or multi-exon deletions or duplications account for a substantial proportion of JBS-associated UBR1 mutations. © 2017 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.
Mou, Yi; Athar, Muhammad Ammar; Wu, Yuzhen; Xu, Ye; Wu, Jianhua; Xu, Zhenxing; Hayder, Zulfiqar; Khan, Saeed; Idrees, Muhammad; Nasir, Muhammad Israr; Liao, Yiqun; Li, Qingge
2016-11-01
Detection of anti-hepatitis B virus (HBV) drug resistance mutations is critical for therapeutic decisions for chronic hepatitis B virus infection. We describe a real-time PCR-based assay using multicolor melting curve analysis (MMCA) that could accurately detect 24 HBV nucleotide mutations at 10 amino acid positions in the reverse transcriptase region of the HBV polymerase gene. The two-reaction assay had a limit of detection of 5 copies per reaction and could detect a minor mutant population (5% of the total population) with the reverse transcriptase M204V amino acid mutation in the presence of the major wild-type population when the overall concentration was 10 4 copies/μl. The assay could be finished within 3 h, and the cost of materials for each sample was less than $10. Clinical validation studies using three groups of samples from both nucleos(t)ide analog-treated and -untreated patients showed that the results for 99.3% (840/846) of the samples and 99.9% (8,454/8,460) of the amino acids were concordant with those of Sanger sequencing of the PCR amplicon from the HBV reverse transcriptase region (PCR Sanger sequencing). HBV DNA in six samples with mixed infections consisting of minor mutant subpopulations was undetected by the PCR Sanger sequencing method but was detected by MMCA, and the results were confirmed by coamplification at a lower denaturation temperature-PCR Sanger sequencing. Among the treated patients, 48.6% (103/212) harbored viruses that displayed lamivudine monoresistance, adefovir monoresistance, entecavir resistance, or lamivudine and adefovir resistance. Among the untreated patients, the Chinese group had more mutation-containing samples than did the Pakistani group (3.3% versus 0.56%). Because of its accuracy, rapidness, wide-range coverage, and cost-effectiveness, the real-time PCR assay could be a robust tool for the detection if anti-HBV drug resistance mutations in resource-limited countries. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Deng, Yi-Mo; Spirason, Natalie; Iannello, Pina; Jelley, Lauren; Lau, Hilda; Barr, Ian G
2015-07-01
Full genome sequencing of influenza A viruses (IAV), including those that arise from annual influenza epidemics, is undertaken to determine if reassorting has occurred or if other pathogenic traits are present. Traditionally IAV sequencing has been biased toward the major surface glycoproteins haemagglutinin and neuraminidase, while the internal genes are often ignored. Despite the development of next generation sequencing (NGS), many laboratories are still reliant on conventional Sanger sequencing to sequence IAV. To develop a minimal and robust set of primers for Sanger sequencing of the full genome of IAV currently circulating in humans. A set of 13 primer pairs was designed that enabled amplification of the six internal genes of multiple human IAV subtypes including the recent avian influenza A(H7N9) virus from China. Specific primers were designed to amplify the HA and NA genes of each IAV subtype of interest. Each of the primers also incorporated a binding site at its 5'-end for either a forward or reverse M13 primer, such that only two M13 primers were required for all subsequent sequencing reactions. This minimal set of primers was suitable for sequencing the six internal genes of all currently circulating human seasonal influenza A subtypes as well as the avian A(H7N9) viruses that have infected humans in China. This streamlined Sanger sequencing protocol could be used to generate full genome sequence data more rapidly and easily than existing influenza genome sequencing protocols. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Fernández-Caballero Rico, Jose Ángel; Chueca Porcuna, Natalia; Álvarez Estévez, Marta; Mosquera Gutiérrez, María Del Mar; Marcos Maeso, María Ángeles; García, Federico
2018-02-01
To show how to generate a consensus sequence from the information of massive parallel sequences data obtained from routine HIV anti-retroviral resistance studies, and that may be suitable for molecular epidemiology studies. Paired Sanger (Trugene-Siemens) and next-generation sequencing (NGS) (454 GSJunior-Roche) HIV RT and protease sequences from 62 patients were studied. NGS consensus sequences were generated using Mesquite, using 10%, 15%, and 20% thresholds. Molecular evolutionary genetics analysis (MEGA) was used for phylogenetic studies. At a 10% threshold, NGS-Sanger sequences from 17/62 patients were phylogenetically related, with a median bootstrap-value of 88% (IQR83.5-95.5). Association increased to 36/62 sequences, median bootstrap 94% (IQR85.5-98)], using a 15% threshold. Maximum association was at the 20% threshold, with 61/62 sequences associated, and a median bootstrap value of 99% (IQR98-100). A safe method is presented to generate consensus sequences from HIV-NGS data at 20% threshold, which will prove useful for molecular epidemiological studies. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
Analysis of Chromatin Organisation
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2011-01-01
Terms to be familiar with before you start to solve the test: chromatin, nucleases, sucrose density gradient centrifugation, melting point, gel electrophoresis, ethidium bromide, autoradiography, Southern blotting, Northern blotting, Sanger sequencing, restriction endonucleases, exonucleases, linker DNA, chloroform extraction, nucleosomes,…
BRAF mutation testing in solid tumors: a methodological comparison.
Weyant, Grace W; Wisotzkey, Jeffrey D; Benko, Floyd A; Donaldson, Keri J
2014-09-01
Solid tumor genotyping has become standard of care for the characterization of proto-oncogene mutational status, which has traditionally been accomplished with Sanger sequencing. However, companion diagnostic assays and comparable laboratory-developed tests are becoming increasingly popular, such as the cobas 4800 BRAF V600 Mutation Test and the INFINITI KRAS-BRAF assay, respectively. This study evaluates and validates the analytical performance of the INFINITI KRAS-BRAF assay and compares concordance of BRAF status with two reference assays, the cobas test and Sanger sequencing. DNA extraction from FFPE tissue specimens was performed followed by multiplex PCR amplification and fluorescent label incorporation using allele-specific primer extension. Hybridization to a microarray, signal detection, and analysis were then performed. The limits of detection were determined by testing dilutions of mutant BRAF alleles within wild-type background DNA, and accuracy was calculated based on these results. The INFINITI KRAS-BRAF assay produced 100% concordance with the cobas test and Sanger sequencing and had sensitivity equivalent to the cobas assay. The INFINITI assay is repeatable with at least 95% accuracy in the detection of mutant and wild-type BRAF alleles. These results confirm that the INFINITI KRAS-BRAF assay is comparable to traditional sequencing and the Food and Drug Administration-approved companion diagnostic assay for the detection of BRAF mutations. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Wu, Tonghua; Yin, Biao; Zhu, Yuanchang; Li, Guangui; Ye, Lijun; Liang, Desheng; Zeng, Yong
2017-12-01
To investigate the etiology of X-linked hypohidrotic ectodermal dysplasia (XLHED) in a family with an inversion of the X chromosome [inv(X)(p21q13)] and to achieve a healthy birth following preimplantation genetic diagnosis (PGD). Next generation sequencing (NGS) and Sanger sequencing analysis were carried out to define the inversion breakpoint. Multiple displacement amplification, amplification of breakpoint junction fragments, Sanger sequencing of exon 1 of ED1, haplotyping of informative short tandem repeat markers and gender determination were performed for PGD. NGS data of the proband sample revealed that the size of the possible inverted fragment was over 42Mb, spanning from position 26, 814, 206 to position 69, 231, 915 on the X chromosome. The breakpoints were confirmed by Sanger sequencing. A total of 5 blastocyst embryos underwent trophectoderm biopsy. Two embryos were diagnosed as carriers and three were unaffected. Two unaffected blastocysts were transferred and a singleton pregnancy was achieved. Following confirmation by prenatal diagnosis, a healthy baby was delivered. This is the first report of an XLHED family with inv(X). ED1 is disrupted by the X chromosome inversion in this XLHED family and embryos with the X chromosomal abnormality can be accurately identified by means of PGD. Copyright © 2017. Published by Elsevier B.V.
Tu, Bin; Masaberg, Carly; Hou, Lihua; Behm, Daniel; Brescia, Peter; Cha, Nuri; Kariyawasam, Kanthi; Lee, Jar How; Nong, Thoa; Sells, John; Tausch, Paul; Yang, Ruyan; Ng, Jennifer; Hurley, Carolyn Katovich
2017-02-01
Sanger-based DNA sequencing of exons 2+3 of HLA class I alleles from a heterozygote frequently results in two or more alternative genotypes. This study was undertaken to reduce the time and effort required to produce a single high resolution HLA genotype. Samples were typed in parallel by Sanger sequencing and oligonucleotide probe hybridization. This workflow, together with optimization of analysis software, was tested and refined during the typing of over 42,000 volunteers for an unrelated hematopoietic progenitor cell donor registry. Next generation DNA sequencing (NGS) was applied to over 1000 of these samples to identify the alleles present within the G group designations. Single genotypes at G level resolution were obtained for over 95% of the loci without additional assays. The vast majority of alleles identified (>99%) were the primary allele giving the G groups their name. Only 0.7% of the alleles identified encoded protein variants that were not detected by a focus on the antigen recognition domain (ARD)-encoding exons. Our combined method routinely provides biologically relevant typing resolution at the level of the ARD. It can be applied to both single samples or to large volume typing supporting either bone marrow or solid organ transplantation using technologies currently available in many HLA laboratories. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Inferring Short-Range Linkage Information from Sequencing Chromatograms
Beggel, Bastian; Neumann-Fraune, Maria; Kaiser, Rolf; Verheyen, Jens; Lengauer, Thomas
2013-01-01
Direct Sanger sequencing of viral genome populations yields multiple ambiguous sequence positions. It is not straightforward to derive linkage information from sequencing chromatograms, which in turn hampers the correct interpretation of the sequence data. We present a method for determining the variants existing in a viral quasispecies in the case of two nearby ambiguous sequence positions by exploiting the effect of sequence context-dependent incorporation of dideoxynucleotides. The computational model was trained on data from sequencing chromatograms of clonal variants and was evaluated on two test sets of in vitro mixtures. The approach achieved high accuracies in identifying the mixture components of 97.4% on a test set in which the positions to be analyzed are only one base apart from each other, and of 84.5% on a test set in which the ambiguous positions are separated by three bases. In silico experiments suggest two major limitations of our approach in terms of accuracy. First, due to a basic limitation of Sanger sequencing, it is not possible to reliably detect minor variants with a relative frequency of no more than 10%. Second, the model cannot distinguish between mixtures of two or four clonal variants, if one of two sets of linear constraints is fulfilled. Furthermore, the approach requires repetitive sequencing of all variants that might be present in the mixture to be analyzed. Nevertheless, the effectiveness of our method on the two in vitro test sets shows that short-range linkage information of two ambiguous sequence positions can be inferred from Sanger sequencing chromatograms without any further assumptions on the mixture composition. Additionally, our model provides new insights into the established and widely used Sanger sequencing technology. The source code of our method is made available at http://bioinf.mpi-inf.mpg.de/publications/beggel/linkageinformation.zip. PMID:24376502
Rapid DNA Sequencing by Direct Nanoscale Reading of Nucleotide Bases on Individual DNA Chains
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, James Weifu; Meller, Amit
2007-01-01
Since the independent invention of DNA sequencing by Sanger and by Gilbert 30 years ago, it has grown from a small scale technique capable of reading several kilobase-pair of sequence per day into today's multibillion dollar industry. This growth has spurred the development of new sequencing technologies that do not involve either electrophoresis or Sanger sequencing chemistries. Sequencing by Synthesis (SBS) involves multiple parallel micro-sequencing addition events occurring on a surface, where data from each round is detected by imaging. New High Throughput Technologies for DNA Sequencing and Genomics is the second volume in the Perspectives in Bioanalysis series, whichmore » looks at the electroanalytical chemistry of nucleic acids and proteins, development of electrochemical sensors and their application in biomedicine and in the new fields of genomics and proteomics. The authors have expertly formatted the information for a wide variety of readers, including new developments that will inspire students and young scientists to create new tools for science and medicine in the 21st century. Reviews of complementary developments in Sanger and SBS sequencing chemistries, capillary electrophoresis and microdevice integration, MS sequencing and applications set the framework for the book.« less
Zhang, Xun; Wang, Yuehua; Gao, Ning; Wang, Jinfen
2014-02-01
To compare the application values of real-time quantitative PCR-Sanger sequencing and TaqMan probe method in the detection of KRAS and BRAF mutations, and to correlate KRAS/BRAF mutations with the clinicopathological characteristics in colorectal carcinomas. Genomic DNA of the tumor cells was extracted from formalin fixed paraffin embedded (FFPE) tissue samples of 344 colorectal carcinomas by microdissection. Real-time quantitative PCR-Sanger sequencing and TaqMan probe method were performed to detect the KRAS/BRAF mutations. The frequency and types of KRAS/BRAF mutations, clinicopathological characteristics and survival time were analyzed. KRAS mutations were detected in 39.8% (137/344) and 38.7% (133/344) of 344 colorectal carcinomas by using real-time quantitative PCR-Sanger sequencing and TaqMan probe method, respectively. BRAF mutation was detected in 4.7% (16/344) and 4.1% (14/344), respectively. There was no significant correlation between the two methods. The frequency of the KRAS mutation in female was higher than that in male (P < 0.05). The frequency of the BRAF mutation in colon was higher than that in rectum. The frequency of the BRAF mutation in stage III-IV cases was higher than that in stageI-II cases. The frequency of the BRAF mutation in signet ring cell carcinoma was higher than that in mucinous carcinoma and nonspecific adenocarcinoma had the lowest mutation rate. The frequency of the BRAF mutation in grade III cases was higher than that in grade II cases (P < 0.05). The overall concordance for the two methods of KRAS/BRAF mutation detection was 98.8% (kappa = 0.976). There was statistic significance between BRAF and KRAS mutations for the survival time of colorectal carcinomas (P = 0.039). There were no statistic significance between BRAF mutation type and BRAF/KRAS wild type (P = 0.058). (1) Compared with real-time quantitative PCR-Sanger sequencing, TaqMan probe method is better with regard to handling time, efficiency, repeatability, cost and equipment. (2) The frequency of the KRAS mutation is correlated with gender. BRAF mutation is correlated with primary tumor site, TNM stage, histological types and histological grades.(3) BRAF gene mutation is an independent prognostic marker for colorectal carcinomas.
Yu, Hai-Jing; Deng, Hua; Ma, Jian; Huang, Shu-Jun; Yang, Jian-Min; Huang, Yan-Fen; Mu, Xiao-Ping; Zhang, Liang; Wang, Qi
2016-12-01
Granulomatous mastitis (GM) is a chronic inflammatory breast lesion. Its etiology remains incompletely defined. Although mounting evidence suggests the involvement of Corynebacterium in GM, there has been no systematic study of GM bacteriology using -omics technology. The bacterial diversity and relative abundances in breast abscesses from 19 women with GM were investigated using 16S rDNA metagenomic sequencing and Sanger sequencing. A quantitative PCR (qPCR) assay was also developed to identify Corynebacterium kroppenstedtii. A bioinformatic analysis revealed that Corynebacterium was present in the 19 GM patients, with abundances ranging from 1.1% to 58.9%. Of note, Corynebacterium was the most abundant taxon in seven patients (more than a third of the subjects). The predominance of Corynebacterium kroppenstedtii infection (11 of 19 patients, 57.9%) was confirmed with Sanger sequencing and the qPCR assay. This study profiled the microbiota of patients with GM and indicated an important role for Corynebacterium, and in particular C. kroppenstedtii, in the pathogenesis of this disease. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Next-generation sequencing for genetic testing of familial colorectal cancer syndromes.
Simbolo, Michele; Mafficini, Andrea; Agostini, Marco; Pedrazzani, Corrado; Bedin, Chiara; Urso, Emanuele D; Nitti, Donato; Turri, Giona; Scardoni, Maria; Fassan, Matteo; Scarpa, Aldo
2015-01-01
Genetic screening in families with high risk to develop colorectal cancer (CRC) prevents incurable disease and permits personalized therapeutic and follow-up strategies. The advancement of next-generation sequencing (NGS) technologies has revolutionized the throughput of DNA sequencing. A series of 16 probands for either familial adenomatous polyposis (FAP; 8 cases) or hereditary nonpolyposis colorectal cancer (HNPCC; 8 cases) were investigated for intragenic mutations in five CRC familial syndromes-associated genes (APC, MUTYH, MLH1, MSH2, MSH6) applying both a custom multigene Ion AmpliSeq NGS panel and conventional Sanger sequencing. Fourteen pathogenic variants were detected in 13/16 FAP/HNPCC probands (81.3 %); one FAP proband presented two co-existing pathogenic variants, one in APC and one in MUTYH. Thirteen of these 14 pathogenic variants were detected by both NGS and Sanger, while one MSH2 mutation (L280FfsX3) was identified only by Sanger sequencing. This is due to a limitation of the NGS approach in resolving sequences close or within homopolymeric stretches of DNA. To evaluate the performance of our NGS custom panel we assessed its capability to resolve the DNA sequences corresponding to 2225 pathogenic variants reported in the COSMIC database for APC, MUTYH, MLH1, MSH2, MSH6. Our NGS custom panel resolves the sequences where 2108 (94.7 %) of these variants occur. The remaining 117 mutations reside inside or in close proximity to homopolymer stretches; of these 27 (1.2 %) are imprecisely identified by the software but can be resolved by visual inspection of the region, while the remaining 90 variants (4.0 %) are blind spots. In summary, our custom panel would miss 4 % (90/2225) of pathogenic variants that would need a small set of Sanger sequencing reactions to be solved. The multiplex NGS approach has the advantage of analyzing multiple genes in multiple samples simultaneously, requiring only a reduced number of Sanger sequences to resolve homopolymeric DNA regions not adequately assessed by NGS. The implementation of NGS approaches in routine diagnostics of familial CRC is cost-effective and significantly reduces diagnostic turnaround times.
Sequencing Centers Panel at SFAF
Schilkey, Faye; Ali, Johar; Grafham, Darren; Muzny, Donna; Fulton, Bob; Fitzgerald, Mike; Hostetler, Jessica; Daum, Chris
2018-02-13
From left to right: Faye Schilkey of NCGR, Johar Ali of OICR, Darren Grafham of Wellcome Trust Sanger Institute, Donna Muzny of the Baylor College of Medicine, Bob Fulton of Washington University, Mike Fitzgerald of the Broad Institute, Jessica Hostetler of the J. Craig Venter Institute and Chris Daum of the DOE Joint Genome Institute discuss sequencing technologies, applications and pipelines on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.
Sequencing Centers Panel at SFAF
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schilkey, Faye; Ali, Johar; Grafham, Darren
From left to right: Faye Schilkey of NCGR, Johar Ali of OICR, Darren Grafham of Wellcome Trust Sanger Institute, Donna Muzny of the Baylor College of Medicine, Bob Fulton of Washington University, Mike Fitzgerald of the Broad Institute, Jessica Hostetler of the J. Craig Venter Institute and Chris Daum of the DOE Joint Genome Institute discuss sequencing technologies, applications and pipelines on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.
Parson, Walther; Strobl, Christina; Huber, Gabriela; Zimmermann, Bettina; Gomes, Sibylle M.; Souto, Luis; Fendt, Liane; Delport, Rhena; Langit, Reina; Wootton, Sharon; Lagacé, Robert; Irwin, Jodi
2013-01-01
Insights into the human mitochondrial phylogeny have been primarily achieved by sequencing full mitochondrial genomes (mtGenomes). In forensic genetics (partial) mtGenome information can be used to assign haplotypes to their phylogenetic backgrounds, which may, in turn, have characteristic geographic distributions that would offer useful information in a forensic case. In addition and perhaps even more relevant in the forensic context, haplogroup-specific patterns of mutations form the basis for quality control of mtDNA sequences. The current method for establishing (partial) mtDNA haplotypes is Sanger-type sequencing (STS), which is laborious, time-consuming, and expensive. With the emergence of Next Generation Sequencing (NGS) technologies, the body of available mtDNA data can potentially be extended much more quickly and cost-efficiently. Customized chemistries, laboratory workflows and data analysis packages could support the community and increase the utility of mtDNA analysis in forensics. We have evaluated the performance of mtGenome sequencing using the Personal Genome Machine (PGM) and compared the resulting haplotypes directly with conventional Sanger-type sequencing. A total of 64 mtGenomes (>1 million bases) were established that yielded high concordance with the corresponding STS haplotypes (<0.02% differences). About two-thirds of the differences were observed in or around homopolymeric sequence stretches. In addition, the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes. Further development of alignment software would be desirable to facilitate the application of NGS in mtDNA forensic genetics. PMID:23948325
Vavrova, Eva; Kantorova, Barbara; Vonkova, Barbara; Kabathova, Jitka; Skuhrova-Francova, Hana; Diviskova, Eva; Letocha, Ondrej; Kotaskova, Jana; Brychtova, Yvona; Doubek, Michael; Mayer, Jiri; Pospisilova, Sarka
2017-09-01
The hotspot c.7541_7542delCT NOTCH1 mutation has been proven to have a negative clinical impact in chronic lymphocytic leukemia (CLL). However, an optimal method for its detection has not yet been specified. The aim of our study was to examine the presence of the NOTCH1 mutation in CLL using three commonly used molecular methods. Sanger sequencing, fragment analysis and allele-specific PCR were compared in the detection of the c.7541_7542delCT NOTCH1 mutation in 201 CLL patients. In 7 patients with inconclusive mutational analysis results, the presence of the NOTCH1 mutation was also confirmed using ultra-deep next generation sequencing. The NOTCH1 mutation was detected in 15% (30/201) of examined patients. Only fragment analysis was able to identify all 30 NOTCH1-mutated patients. Sanger sequencing and allele-specific PCR showed a lower detection efficiency, determining 93% (28/30) and 80% (24/30) of the present NOTCH1 mutations, respectively. Considering these three most commonly used methodologies for c.7541_7542delCT NOTCH1 mutation screening in CLL, we defined fragment analysis as the most suitable approach for detecting the hotspot NOTCH1 mutation. Copyright © 2017 Elsevier Ltd. All rights reserved.
SNP discovery in the bovine milk transcriptome using RNA-Seq technology.
Cánovas, Angela; Rincon, Gonzalo; Islas-Trejo, Alma; Wickramasinghe, Saumya; Medrano, Juan F
2010-12-01
High-throughput sequencing of RNA (RNA-Seq) was developed primarily to analyze global gene expression in different tissues. However, it also is an efficient way to discover coding SNPs. The objective of this study was to perform a SNP discovery analysis in the milk transcriptome using RNA-Seq. Seven milk samples from Holstein cows were analyzed by sequencing cDNAs using the Illumina Genome Analyzer system. We detected 19,175 genes expressed in milk samples corresponding to approximately 70% of the total number of genes analyzed. The SNP detection analysis revealed 100,734 SNPs in Holstein samples, and a large number of those corresponded to differences between the Holstein breed and the Hereford bovine genome assembly Btau4.0. The number of polymorphic SNPs within Holstein cows was 33,045. The accuracy of RNA-Seq SNP discovery was tested by comparing SNPs detected in a set of 42 candidate genes expressed in milk that had been resequenced earlier using Sanger sequencing technology. Seventy of 86 SNPs were detected using both RNA-Seq and Sanger sequencing technologies. The KASPar Genotyping System was used to validate unique SNPs found by RNA-Seq but not observed by Sanger technology. Our results confirm that analyzing the transcriptome using RNA-Seq technology is an efficient and cost-effective method to identify SNPs in transcribed regions. This study creates guidelines to maximize the accuracy of SNP discovery and prevention of false-positive SNP detection, and provides more than 33,000 SNPs located in coding regions of genes expressed during lactation that can be used to develop genotyping platforms to perform marker-trait association studies in Holstein cattle.
Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima
2014-01-01
Background Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Methods Whole exome sequencing followed by expanded familial validation by Sanger sequencing. Results We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Conclusion Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes. PMID:25211151
Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima
2014-01-01
Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Whole exome sequencing followed by expanded familial validation by Sanger sequencing. We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes.
Riman, Sarah; Kiesler, Kevin M; Borsuk, Lisa A; Vallone, Peter M
2017-07-01
Standard Reference Materials SRM 2392 and 2392-I are intended to provide quality control when amplifying and sequencing human mitochondrial genome sequences. The National Institute of Standards and Technology (NIST) offers these SRMs to laboratories performing DNA-based forensic human identification, molecular diagnosis of mitochondrial diseases, mutation detection, evolutionary anthropology, and genetic genealogy. The entire mtGenome (∼16569bp) of SRM 2392 and 2392-I have previously been characterized at NIST by Sanger sequencing. Herein, we used the sensitivity, specificity, and accuracy offered by next generation sequencing (NGS) to: (1) re-sequence the certified values of the SRM 2392 and 2392-I; (2) confirm Sanger data with a high coverage new sequencing technology; (3) detect lower level heteroplasmies (<20%); and thus (4) support mitochondrial sequencing communities in the adoption of NGS methods. To obtain a consensus sequence for the SRMs as well as identify and control any bias, sequencing was performed using two NGS platforms and data was analyzed using different bioinformatics pipelines. Our results confirm five low level heteroplasmy sites that were not previously observed with Sanger sequencing: three sites in the GM09947A template in SRM 2392 and two sites in the HL-60 template in SRM 2392-I. Copyright © 2017 Elsevier B.V. All rights reserved.
Transcriptome assembly and digital gene expression atlas of the rainbow trout
USDA-ARS?s Scientific Manuscript database
Background: Transcriptome analysis is a preferred method for gene discovery, marker development and gene expression profiling in non-model organisms. Previously, we sequenced a transcriptome reference using Sanger-based and 454-pyrosequencing, however, a transcriptome assembly is still incomplete an...
Ten Broek, Roel W; Bekers, Elise M; de Leng, Wendy W J; Strengman, Eric; Tops, Bastiaan B J; Kutzner, Heinz; Leeuwis, Jan Willem; van Gorp, Joost M; Creytens, David H; Mentzel, Thomas; van Diest, Paul J; Eijkelenboom, Astrid; Flucke, Uta
2017-12-01
Spindle cell hemangioma (SCH) is a distinct vascular soft-tissue lesion characterized by cavernous blood vessels and a spindle cell component mainly occurring in the distal extremities of young adults. The majority of cases harbor heterozygous mutations in IDH1/2 sporadically or rarely in association with Maffucci syndrome. However, based on mosaicism and accordingly a low percentage of lesional cells harboring a mutant allele, detection can be challenging. We tested 19 sporadic SCHs by Sanger sequencing, multiplex ligation-dependent probe amplification (MLPA), conventional next generation sequencing (NGS), and NGS using a single molecule molecular inversion probes (smMIP)-based library preparation to compare their diagnostic value. Out of 10 cases tested by Sanger sequencing and 2 analyzed using MLPA, 4 and 1, respectively, revealed a mutation in IDH1 (p.R132C). The 7 remaining negative cases and additional 6 cases were investigated using smMIP/NGS, showing hot spot mutations in IDH1 (p.R132C) (8 cases) and IDH2 (3 cases; twice p.R172S and once p.R172G, respectively). One case was negative. Owing to insufficient DNA quality and insufficient coverage, 2 cases were excluded. In total, in 16 out of 17 cases successfully tested, an IDH1/2 mutation was found. Given that IDH1/2 mutations were absent in 161 other vascular lesions tested by smMIP/NGS, the mutation can be considered as highly specific for SCH. © 2017 Wiley Periodicals, Inc.
van den Akker, Jeroen; Mishne, Gilad; Zimmer, Anjali D; Zhou, Alicia Y
2018-04-17
Next generation sequencing (NGS) has become a common technology for clinical genetic tests. The quality of NGS calls varies widely and is influenced by features like reference sequence characteristics, read depth, and mapping accuracy. With recent advances in NGS technology and software tools, the majority of variants called using NGS alone are in fact accurate and reliable. However, a small subset of difficult-to-call variants that still do require orthogonal confirmation exist. For this reason, many clinical laboratories confirm NGS results using orthogonal technologies such as Sanger sequencing. Here, we report the development of a deterministic machine-learning-based model to differentiate between these two types of variant calls: those that do not require confirmation using an orthogonal technology (high confidence), and those that require additional quality testing (low confidence). This approach allows reliable NGS-based calling in a clinical setting by identifying the few important variant calls that require orthogonal confirmation. We developed and tested the model using a set of 7179 variants identified by a targeted NGS panel and re-tested by Sanger sequencing. The model incorporated several signals of sequence characteristics and call quality to determine if a variant was identified at high or low confidence. The model was tuned to eliminate false positives, defined as variants that were called by NGS but not confirmed by Sanger sequencing. The model achieved very high accuracy: 99.4% (95% confidence interval: +/- 0.03%). It categorized 92.2% (6622/7179) of the variants as high confidence, and 100% of these were confirmed to be present by Sanger sequencing. Among the variants that were categorized as low confidence, defined as NGS calls of low quality that are likely to be artifacts, 92.1% (513/557) were found to be not present by Sanger sequencing. This work shows that NGS data contains sufficient characteristics for a machine-learning-based model to differentiate low from high confidence variants. Additionally, it reveals the importance of incorporating site-specific features as well as variant call features in such a model.
Development of allele-specific multiplex PCR to determine the length of poly-T in intron 8 of CFTR
Prada, Anne E.
2014-01-01
Cystic fibrosis transmembrane conductance regulator (CFTR) gene mutation analysis has been implemented for Cystic Fibrosis (CF) carrier screening, and molecular diagnosis of CF and congenital bilateral absence of the vas deferens (CBAVD). Although poly-T allele analysis in intron 8 of CFTR is required when a patient is positive for R117H, it is not recommended for routine carrier screening. Therefore, commercial kits for CFTR mutation analysis were designed either to mask the poly-T allele results, unless a patient is R117H positive, or to have the poly-T analysis as a standalone reflex test using the same commercial platform. There are other standalone assays developed to detect poly-T alleles, such as heteroduplex analysis, High Resolution Melting (HRM) curve analysis, allele-specific PCR (AS-PCR) and Sanger sequencing. In this report, we developed a simple and easy-to-implement multiplex AS-PCR assay using unlabeled standard length primers, which can be used as a reflex or standalone test for CFTR poly-T track analysis. Out of 115 human gDNA samples tested, results from our new AS-PCR matched to the previous known poly-T results or results from Sanger sequencing. PMID:25071991
Bai, Y; Liu, N; Kong, X D; Yan, J; Qin, Z B; Wang, B
2016-12-07
Objective: To analyze the mutations of PAX3 gene in two Waardenburg syndrome type Ⅰ (WS1) pedigrees and make prenatal diagnosis for the high-risk 18-week-old fetus. Methods: PAX3 gene was first analyzed by Sanger sequencing and multiplex ligation-dependent probe amplification(MLPA) for detecting pathogenic mutation of the probands of the two pedigrees. The mutations were confirmed by MLPA and Sanger in parents and unrelated healthy individuals.Prenatal genetic diagnosis for the high-risk fetus was performed by amniotic fluid cell after genotyping. Results: A heterozygous PAX3 gene gross deletion (E7 deletion) was identified in all patients from WS1-01 family, and not found in 20 healthy individuals.Prenatal diagnosis in WS1-01 family indicated that the fetus was normal. Molecular studies identified a novel deletion mutation c. 1385_1386delCT within the PAX3 gene in all affected WS1-02 family members, but in none of the unaffected relatives and 200 healthy individuals. Conclusions: PAX3 gene mutation is etiological for two WS1 families. Sanger sequencing plus MLPA is effective and accurate for making gene diagnosis and prenatal diagnosis.
NASA Astrophysics Data System (ADS)
Govindarajan, A.; Pineda, J.; Purcell, M.; Tradd, K.; Packard, G.; Girard, A.; Dennett, M.; Breier, J. A., Jr.
2016-02-01
We present a new method to estimate the distribution of invertebrate larvae relative to environmental variables such as temperature, salinity, and circulation. A large volume in situ filtering system developed for discrete biogeochemical sampling in the deep-sea (the Suspended Particulate Rosette "SUPR" multisampler) was mounted to the autonomous underwater vehicle REMUS 600 for coastal larval and environmental sampling. We describe the results of SUPR-REMUS deployments conducted in Buzzards Bay, Massachusetts (2014) and west of Martha's Vineyard, Massachusetts (2015). We collected discrete samples cross-shore and from surface, middle, and bottom layers of the water column. Samples were preserved for DNA analysis. Our Buzzards Bay deployment targeted barnacle larvae, which are abundant in late winter and early spring. For these samples, we used morphological analysis and DNA barcodes generated by Sanger sequencing to obtain stage and species-specific cross-shore and vertical distributions. We targeted bivalve larvae in our 2015 deployments, and genetic analysis of larvae from these samples is underway. For these samples, we are comparing species barcode data derived from traditional Sanger sequencing of individuals to those obtained from next generation sequencing (NGS) of bulk plankton samples. Our results demonstrate the utility of autonomous sampling combined with DNA barcoding for studying larval distributions and transport dynamics.
[Rapid detection of hot spot mutations of FGFR3 gene with PCR-high resolution melting assay].
Li, Shan; Wang, Han; Su, Hua; Gao, Jinsong; Zhao, Xiuli
2017-08-10
To identify the causative mutations in five individuals affected with dyschondroplasia and develop an efficient procedure for detecting hot spot mutations of the FGFR3 gene. Genomic DNA was extracted from peripheral blood samples with a standard phenol/chloroform method. PCR-Sanger sequencing was used to analyze the causative mutations in the five probands. PCR-high resolution melting (HRM) was developed to detect the identified mutations. A c.1138G>A mutation in exon 8 was found in 4 probands, while a c.1620C>G mutation was found in exon 11 of proband 5 whom had a mild phenotype. All patients were successfully distinguished from healthy controls with the PCR-HRM method. The results of HRM analysis were highly consistent with that of Sanger sequencing. The Gly380Arg and Asn540Lys are hot spot mutations of the FGFR3 gene among patients with ACH/HCH. PCR-HRM analysis is more efficient for detecting hot spot mutations of the FGFR3 gene.
Mikkelsen, Martin; Frank-Hansen, Rune; Hansen, Anders J; Morling, Niels
2014-09-01
of sequencing of whole mitochondrial genome, HV1 and HV2 DNA with the second generation system (SGS) Roche 454 GS Junior were compared with results of Sanger sequencing and SNP typing with SNaPshot single base extension detected with MALDI-TOF and capillary electrophoresis. We investigated the performance of the software analysis of the data, reproducibility, ability to sequence homopolymeric regions, detection of mixtures and heteroplasmy as well as the implications of the depth of coverage. We found full reproducibility between samples sequenced twice with SGS. We found close to full concordance between the mtDNA sequences of 26 samples obtained with (1) the 454 SGS method using a depth of coverage above 100 and (2) Sanger sequencing and SNP typing. The discrepancies were primarily observed in homopolymeric regions. The 454 SGS method was able to sequence 95% of the reads correctly in homopolymers up to 4 bases, and up to 6 bases could be sequenced with similar success if the results were carefully, visually inspected. The 454 technology was able to detect mixtures or heteroplasmy of approximately 10%. We detected previously unreported heteroplasmy in the GM9947A component of the NIST human mitochondrial DNA SRM-2392 standard reference material. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Huang, Hui; Chen, Yanhua; Chen, Huishuang; Ma, Yuanyuan; Chiang, Pei-Wen; Zhong, Jing; Liu, Xuyang; Asan; Wu, Jing; Su, Yan; Li, Xin; Deng, Jianlian; Huang, Yingping; Zhang, Xinxin; Li, Yang; Fan, Ning; Wang, Ying; Tang, Lihui; Shen, Jinting; Chen, Meiyan; Zhang, Xiuqing; Te, Deng; Banerjee, Santasree; Liu, Hui; Qi, Ming; Yi, Xin
2018-01-01
Inherited eye diseases are major causes of vision loss in both children and adults. Inherited eye diseases are characterized by clinical variability and pronounced genetic heterogeneity. Genetic testing may provide an accurate diagnosis for ophthalmic genetic disorders and allow gene therapy for specific diseases. A targeted gene capture panel was designed to capture exons of 283 inherited eye disease genes including 58 known causative retinitis pigmentosa (RP) genes. 180 samples were tested with this panel, 68 were previously tested by Sanger sequencing. Systematic evaluation of our method and comprehensive molecular diagnosis were carried on 99 RP patients. 96.85% targeted regions were covered by at least 20 folds, the accuracy of variants detection was 99.994%. In 4 of the 68 samples previously tested by Sanger sequencing, mutations of other diseases not consisting with the clinical diagnosis were detected by next-generation sequencing (NGS) not Sanger. Among the 99 RP patients, 64 (64.6%) were detected with pathogenic mutations, while in 3 patients, it was inconsistent between molecular diagnosis and their initial clinical diagnosis. After revisiting, one patient's clinical diagnosis was reclassified. In addition, 3 patients were found carrying large deletions. We have systematically evaluated our method and compared it with Sanger sequencing, and have identified a large number of novel mutations in a cohort of 99 RP patients. The results showed a sufficient accuracy of our method and suggested the importance of molecular diagnosis in clinical diagnosis.
Ma, Yuanyuan; Chiang, Pei-Wen; Zhong, Jing; Liu, Xuyang; Asan; Wu, Jing; Su, Yan; Li, Xin; Deng, Jianlian; Huang, Yingping; Zhang, Xinxin; Li, Yang; Fan, Ning; Wang, Ying; Tang, Lihui; Shen, Jinting; Chen, Meiyan; Zhang, Xiuqing; Te, Deng; Banerjee, Santasree; Liu, Hui; Qi, Ming; Yi, Xin
2018-01-01
Background Inherited eye diseases are major causes of vision loss in both children and adults. Inherited eye diseases are characterized by clinical variability and pronounced genetic heterogeneity. Genetic testing may provide an accurate diagnosis for ophthalmic genetic disorders and allow gene therapy for specific diseases. Methods A targeted gene capture panel was designed to capture exons of 283 inherited eye disease genes including 58 known causative retinitis pigmentosa (RP) genes. 180 samples were tested with this panel, 68 were previously tested by Sanger sequencing. Systematic evaluation of our method and comprehensive molecular diagnosis were carried on 99 RP patients. Results 96.85% targeted regions were covered by at least 20 folds, the accuracy of variants detection was 99.994%. In 4 of the 68 samples previously tested by Sanger sequencing, mutations of other diseases not consisting with the clinical diagnosis were detected by next-generation sequencing (NGS) not Sanger. Among the 99 RP patients, 64 (64.6%) were detected with pathogenic mutations, while in 3 patients, it was inconsistent between molecular diagnosis and their initial clinical diagnosis. After revisiting, one patient’s clinical diagnosis was reclassified. In addition, 3 patients were found carrying large deletions. Conclusions We have systematically evaluated our method and compared it with Sanger sequencing, and have identified a large number of novel mutations in a cohort of 99 RP patients. The results showed a sufficient accuracy of our method and suggested the importance of molecular diagnosis in clinical diagnosis. PMID:29641573
Mutation analysis of seven known glaucoma-associated genes in Chinese patients with glaucoma.
Huang, Xiaobo; Li, Miaoling; Guo, Xiangming; Li, Shiqiang; Xiao, Xueshan; Jia, Xiaoyun; Liu, Xing; Zhang, Qingjiong
2014-05-13
To evaluate mutations in the MYOC, WDR36, OPTN, OPA1, NTF4, CYP1B1, and LTBP2 genes in a cohort of Chinese patients with primary glaucoma. Genomic DNA was prepared from 683 unrelated patients, including 50 with primary congenital glaucoma, 104 with juvenile open-angle glaucoma (JOAG), 186 with primary open-angle glaucoma (POAG), and 343 with primary angle-closure glaucoma (PACG). Mutations in the seven genes in 257 patients (36 with JOAG, 89 with POAG, and 132 with PACG) were initially analyzed by exome sequencing and then confirmed by Sanger sequencing. In addition, Sanger sequencing was used to detect MYOC mutations in the remaining 426 patients. Exome sequencing identified 19 mutations (6 in MYOC, 9 in WDR36, 3 in OPA1, and 1 in OPTN) in 20 of 257 patients, including 4 patients with JOAG, 8 patients with POAG, and 8 patients with PACG. No mutation was detected in the other three genes. In addition, Sanger sequencing detected additional MYOC mutations in 5 of the remaining 426 patients, including 3 patients with JOAG and 2 patients with POAG. Twenty-two mutations in MYOC, WDR36, OPA1, and OPTN were detected in 25 of the 683 patients with primary glaucoma, including nine MYOC mutations in 11 patients, nine WDR36 mutations in 11 patients, three OPA1 mutations in 3 patients, and one OPTN mutation in a patient who also carried a MYOC mutation. Eight mutations in MYOC, WDR36, and OPA1 in 8 of the 343 PACG patients are of uncertain significance and need to be analyzed further. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.
Understanding the mechanism of resistance breaking on tomato by Tomato mottle mosaic virus
USDA-ARS?s Scientific Manuscript database
Tomato mottle mosaic virus (ToMMV) has broadened it’s distribution around the world. In our previous work, we observed a partial resistance breaking by ToMMV on tomato. To understand the mechanism of this resistance breaking, we carried out comparative analysis through Sanger sequencing, genotyping ...
Shu, Hai-Rong; Bi, Huai; Pan, Yang-Chun; Xu, Hang-Yu; Song, Jian-Xin; Hu, Jie
2015-09-16
Usher syndrome (USH) is an autosomal recessive disorder characterized by hearing impairment and vision dysfunction due to retinitis pigmentosa. Phenotypic and genetic heterogeneities of this disease make it impractical to obtain a genetic diagnosis by conventional Sanger sequencing. In this study, we applied a next-generation sequencing approach to detect genetic abnormalities in patients with USH. Two unrelated Chinese families were recruited, consisting of two USH afflicted patients and four unaffected relatives. We selected 199 genes related to inherited retinal diseases as targets for deep exome sequencing. Through systematic data analysis using an established bioinformatics pipeline, all variants that passed filter criteria were validated by Sanger sequencing and co-segregation analysis. A homozygous frameshift mutation (c.4382delA, p.T1462Lfs*2) was revealed in exon20 of gene USH2A in the F1 family. Two compound heterozygous mutations, IVS47 + 1G > A and c.13156A > T (p.I4386F), located in intron 48 and exon 63 respectively, of USH2A, were identified as causative mutations for the F2 family. Of note, the missense mutation c.13156A > T has not been reported so far. In conclusion, targeted exome sequencing precisely and rapidly identified the genetic defects in two Chinese USH families and this technique can be applied as a routine examination for these disorders with significant clinical and genetic heterogeneity.
Valenzuela-González, Fabiola; Martínez-Porchas, Marcel; Villalpando-Canchola, Enrique; Vargas-Albores, Francisco
2016-03-01
Ultrafast-metagenomic sequence classification using exact alignments (Kraken) is a novel approach to classify 16S rDNA sequences. The classifier is based on mapping short sequences to the lowest ancestor and performing alignments to form subtrees with specific weights in each taxon node. This study aimed to evaluate the classification performance of Kraken with long 16S rDNA random environmental sequences produced by cloning and then Sanger sequenced. A total of 480 clones were isolated and expanded, and 264 of these clones formed contigs (1352 ± 153 bp). The same sequences were analyzed using the Ribosomal Database Project (RDP) classifier. Deeper classification performance was achieved by Kraken than by the RDP: 73% of the contigs were classified up to the species or variety levels, whereas 67% of these contigs were classified no further than the genus level by the RDP. The results also demonstrated that unassembled sequences analyzed by Kraken provide similar or inclusively deeper information. Moreover, sequences that did not form contigs, which are usually discarded by other programs, provided meaningful information when analyzed by Kraken. Finally, it appears that the assembly step for Sanger sequences can be eliminated when using Kraken. Kraken cumulates the information of both sequence senses, providing additional elements for the classification. In conclusion, the results demonstrate that Kraken is an excellent choice for use in the taxonomic assignment of sequences obtained by Sanger sequencing or based on third generation sequencing, of which the main goal is to generate larger sequences. Copyright © 2016 Elsevier B.V. All rights reserved.
"First generation" automated DNA sequencing technology.
Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M
2011-10-01
Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kumar, Dibyendu; Buhay, Christian; Van Tonder, Andries
From left to right: Dibyendu Kumar of the University of Florida, Christian Buhay of Baylor College of Medicine, Andries van Tonder of Wellcome Sanger Trust Institute, Anna Montmayeur of the Broad Institute and Karen Davenport of Los Alamos National Laboratory at the Finishing forum on June 3, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.
Zheng, Yu; Wang, Hai-Lin; Li, Jian-Kang; Xu, Li; Tellier, Laurent; Li, Xiao-Lin; Huang, Xiao-Yan; Li, Wei; Niu, Tong-Tong; Yang, Huan-Ming; Zhang, Jian-Guo; Liu, Dong-Ning
2018-01-01
To study the genes responsible for retinitis pigmentosa. A total of 15 Chinese families with retinitis pigmentosa, containing 94 sporadically afflicted cases, were recruited. The targeted sequences were captured using the Target_Eye_365_V3 chip and sequenced using the BGISEQ-500 sequencer, according to the manufacturer's instructions. Data were aligned to UCSC Genome Browser build hg19, using the Burroughs Wheeler Aligner MEM algorithm. Local realignment was performed with the Genome Analysis Toolkit (GATK v.3.3.0) IndelRealigner, and variants were called with the Genome Analysis Toolkit Haplotypecaller, without any use of imputation. Variants were filtered against a panel derived from 1000 Genomes Project, 1000G_ASN, ESP6500, ExAC and dbSNP138. In all members of Family ONE and Family TWO with available DNA samples, the genetic variant was validated using Sanger sequencing. A novel, pathogenic variant of retinitis pigmentosa, c.357_358delAA (p.Ser119SerfsX5) was identified in PRPF31 in 2 of 15 autosomal-dominant retinitis pigmentosa (ADRP) families, as well as in one, sporadic case. Sanger sequencing was performed upon probands, as well as upon other family members. This novel, pathogenic genotype co-segregated with retinitis pigmentosa phenotype in these two families. ADRP is a subtype of retinitis pigmentosa, defined by its genotype, which accounts for 20%-40% of the retinitis pigmentosa patients. Our study thus expands the spectrum of PRPF31 mutations known to occur in ADRP, and provides further demonstration of the applicability of the BGISEQ500 sequencer for genomics research.
Tumiotto, Camille; Riviere, Lionel; Bellecave, Pantxika; Recordon-Pinson, Patricia; Vilain-Parce, Alice; Guidicelli, Gwenda-Line; Fleury, Hervé
2017-01-01
One of the strategies for curing viral HIV-1 is a therapeutic vaccine involving the stimulation of cytotoxic CD8-positive T cells (CTL) that are Human Leucocyte Antigen (HLA)-restricted. The lack of efficiency of previous vaccination strategies may have been due to the immunogenic peptides used, which could be different from a patient's virus epitopes and lead to a poor CTL response. To counteract this lack of specificity, conserved epitopes must be targeted. One alternative is to gather as many data as possible from a large number of patients on their HIV-1 proviral archived epitope variants, taking into account their genetic background to select the best presented CTL epitopes. In order to process big data generated by Next-Generation Sequencing (NGS) of the DNA of HIV-infected patients, we have developed a software package called TutuGenetics. This tool combines an alignment derived either from Sanger or NGS files, HLA typing, target gene and a CTL epitope list as input files. It allows automatic translation after correction of the alignment obtained between the HxB2 reference and the reads, followed by automatic calculation of the MHC IC50 value for each epitope variant and the HLA allele of the patient by using NetMHCpan 3.0, resulting in a csv file as output result. We validated this new tool by comparing Sanger and NGS (454, Roche) sequences obtained from the proviral DNA of patients at success of ART included in the Provir Latitude 45 study and showed a 90% correlation between the quantitative results of NGS and Sanger. This automated analysis combined with complementary samples should yield more data regarding the archived CTL epitopes according to the patients' HLA alleles and will be useful for screening epitopes that in theory are presented efficiently to the HLA groove, thus constituting promising immunogenic peptides for a therapeutic vaccine.
Zhang, Xinxin; Ma, Dehua; Zou, Wei; Ding, Yibing; Zhu, Chengchu; Min, Haiyan; Zhang, Bin; Wang, Wei; Chen, Baofu; Ye, Minhua; Cai, Minghui; Pan, Yanqing; Cao, Lei; Wan, Yueming; Jin, Yu; Gao, Qian; Yi, Long
2016-05-27
Primary spontaneous pneumothorax (PSP) or pulmonary cysts is one of the manifestations of Birt-Hogg-Dube syndrome (BHDS) that is caused by heterozygous mutations in FLCN gene. Most of the mutations are SNVs and small indels, and there are also approximately 10 % large intragenic deletions and duplications of the mutations. These molecular findings are generally obtained by disparate methods including Sanger sequencing and Multiple Ligation-dependent Probe Amplification in the clinical laboratory. In addition, as a genetically heterogeneous disorder, PSP may be caused by mutations in multiple genes include FBN1, COL3A1, CBS, SERPINA1 and TSC1/TSC2 genes. For differential diagnosis, these genes should also be screened which makes the diagnostic procedure more time-consuming and labor-intensive. Forty PSP patients were divided into 2 groups. Nineteen patients with different pathogenic mutations of FLCN previously identified by conventional Sanger sequencing and MLPA were included in test group, 21 random PSP patients without any genetic screening were included in blinded sample group. 7 PSP genes including FLCN, FBN1, COL3A1, CBS, SERPINA1 and TSC1/TSC2 were designed and enriched by Haloplex system, sequenced on a Miseq platform and analyzed in the 40 patients to evaluate the performance of the targeted-NGS method. We demonstrated that the full spectrum of genes associated with pneumothorax including FLCN gene mutations can be identified simultaneously in multiplexed sequence data. Noteworthy, by our in-house copy number analysis of the sequence data, we could not only detect intragenic deletions, but also determine approximate deletion junctions simultaneously. NGS based Haloplex target enrichment technology is proved to be a rapid and cost-effective screening strategy for the comprehensive molecular diagnosis of BHDS in PSP patients, as it can replace Sanger sequencing and MLPA by simultaneously detecting exonic and intronic SNVs, small indels, large intragenic deletions and determining deletion junctions in PSP-related genes.
Anderson, Steven; Bloom, Kenneth J; Vallera, Dino U; Rueschoff, Josef; Meldrum, Cliff; Schilling, Robert; Kovach, Barbara; Lee, Ju Ruey-Jiuan; Ochoa, Pam; Langland, Rachel; Halait, Harkanwal; Lawrence, H Jeffrey; Dugan, Michael C
2012-11-01
A polymerase chain reaction-based companion diagnostic (cobas 4800 BRAF V600 Mutation Test) was recently approved by the US Food and Drug Administration to select patients with BRAF-mutant metastatic melanoma for treatment with the BRAF inhibitor vemurafenib. (1) To compare the analytic performance of the cobas test to Sanger sequencing by using screening specimens from phase II and phase III trials of vemurafenib, and (2) to assess the reproducibility of the cobas test at different testing sites. Specimens from 477 patients were used to determine positive and negative percent agreements between the cobas test and Sanger sequencing for detecting V600E (1799T>A) mutations. Specimens were evaluated with a massively parallel pyrosequencing method (454) to resolve discordances between polymerase chain reaction and Sanger results. Reproducibility of the cobas test was assessed at 3 sites by using 3 reagent lots and an 8-member panel of melanoma samples. A valid cobas result was obtained for all eligible patients. Sanger sequencing had a failure rate of 9.2% (44 of 477). For the remaining 433 specimens, positive percent agreement was 96.4% (215 of 223) and negative percent agreement, 80% (168 of 210). Among 42 cobas mutation-positive/Sanger V600E-negative specimens, 17 were V600E positive and 24 were V600K positive by 454. The cobas test detected 70% of V600K mutations. In the reproducibility study, a correct interpretation was made for 100% of wild-type specimens and specimens with greater than 5% mutant alleles; V600E mutations were detected in 90% of specimens with less than 5% mutant alleles. The cobas test (1) had a lower assay failure rate than that of Sanger, (2) was more sensitive in detecting V600E mutations, (3) detected most V600K mutations, and (4) was highly reproducible.
Sapientia: accelerating rare disease diagnosis and treatment.
Furness, Mike
2016-09-01
Congenica (Cambridge, UK) is a world leading developer of genome-based discovery and diagnostic technologies. The UK company is a spin-out from the Wellcome Trust Sanger Institute (Cambridge, UK) and was founded by scientists and clinicians at the leading edge of genomic analysis. Congenica's Sapientia™ technology platform allows whole-genome sequence analysis to be easily interpreted and presented within a clinically actionable diagnostic report. It is based on pioneering research from Wellcome Trust Sanger Institute, National Health Service clinicians and regional genetic testing laboratories and validated by Genomics England Ltd (London, UK). Sapientia used for medical diagnosis in hospitals including Great Ormond Street Hospital (London, UK), Manchester Centre for Genomic Medicine (Manchester, UK), Birmingham Women's Hospital (Birmingham, UK) and for new drug development by pharmaceutical companies. This profile follows the journey from proof of concept to clinical diagnosis.
Xu, Yan; Guan, Liping; Xiao, Xueshan; Zhang, Jianguo; Li, Shiqiang; Jiang, Hui; Jia, Xiaoyun; Yang, Jianhua; Guo, Xiangming; Yin, Ye; Wang, Jun; Zhang, Qingjiong
2015-01-01
Mutations in 60 known genes were previously identified by exome sequencing in 79 of 157 families with retinitis pigmentosa (RP). This study analyzed variants in 129 genes associated with other forms of hereditary retinal dystrophy in the same cohort. Apart from the 73 genes previously analyzed, a further 129 genes responsible for other forms of hereditary retinal dystrophy were selected based on RetNet. Variants in the 129 genes determined by whole exome sequencing were selected and filtered by bioinformatics analysis. Candidate variants were confirmed by Sanger sequencing and validated by analysis of available family members and controls. A total of 90 candidate variants were present in the 129 genes. Sanger sequencing confirmed 83 of the 90 variants. Analysis of family members and controls excluded 76 of these 83 variants. The remaining seven variants were considered to be potential pathogenic mutations; these were c.899A>G, c.1814C>G, and c.2107C>T in BBS2; c.1073C>T and c.1669C>T in INPP5E; and c.3582C>G and c.5704-5C>G in CACNA1F. Six of these seven mutations were novel. The mutations were detected in five unrelated patients without a family history, including three patients with homozygous or compound heterozygous mutations in BBS2 and INPP5E, and two patients with hemizygous mutations in CACNA1F. None of the patients had mutations in the genes associated with autosome dominant retinal dystrophy. Only a small portion of patients with RP, about 3% (5/157), had causative mutations in the 129 genes associated with other forms of hereditary retinal dystrophy.
Genome Sequencing and Assembly by Long Reads in Plants
Li, Changsheng; Lin, Feng; An, Dong; Huang, Ruidong
2017-01-01
Plant genomes generated by Sanger and Next Generation Sequencing (NGS) have provided insight into species diversity and evolution. However, Sanger sequencing is limited in its applications due to high cost, labor intensity, and low throughput, while NGS reads are too short to resolve abundant repeats and polyploidy, leading to incomplete or ambiguous assemblies. The advent and improvement of long-read sequencing by Third Generation Sequencing (TGS) methods such as PacBio and Nanopore have shown promise in producing high-quality assemblies for complex genomes. Here, we review the development of sequencing, introducing the application as well as considerations of experimental design in TGS of plant genomes. We also introduce recent revolutionary scaffolding technologies including BioNano, Hi-C, and 10× Genomics. We expect that the informative guidance for genome sequencing and assembly by long reads will benefit the initiation of scientists’ projects. PMID:29283420
Development of a Web Tool for Escherichia coli Subtyping Based on fimH Alleles.
Roer, Louise; Tchesnokova, Veronika; Allesøe, Rosa; Muradova, Mariya; Chattopadhyay, Sujay; Ahrenfeldt, Johanne; Thomsen, Martin C F; Lund, Ole; Hansen, Frank; Hammerum, Anette M; Sokurenko, Evgeni; Hasman, Henrik
2017-08-01
The aim of this study was to construct a valid publicly available method for in silico fimH subtyping of Escherichia coli particularly suitable for differentiation of fine-resolution subgroups within clonal groups defined by standard multilocus sequence typing (MLST). FimTyper was constructed as a FASTA database containing all currently known fimH alleles. The software source code is publicly available at https://bitbucket.org/genomicepidemiology/fimtyper, the database is freely available at https://bitbucket.org/genomicepidemiology/fimtyper_db, and a service implementing the software is available at https://cge.cbs.dtu.dk/services/FimTyper FimTyper was validated on three data sets: one containing Sanger sequences of fimH alleles of 42 E. coli isolates generated prior to the current study (data set 1), one containing whole-genome sequence (WGS) data of 243 third-generation-cephalosporin-resistant E. coli isolates (data set 2), and one containing a randomly chosen subset of 40 E. coli isolates from data set 2 that were subjected to conventional fimH subtyping (data set 3). The combination of the three data sets enabled an evaluation and comparison of FimTyper on both Sanger sequences and WGS data. FimTyper correctly predicted all 42 fimH subtypes from the Sanger sequences from data set 1 and successfully analyzed all 243 draft genomes from data set 2. FimTyper subtyping of the Sanger sequences and WGS data from data set 3 were in complete agreement. Additionally, fimH subtyping was evaluated on a phylogenetic network of 122 sequence type 131 (ST131) E. coli isolates. There was perfect concordance between the typology and fimH -based subclones within ST131, with accurate identification of the pandemic multidrug-resistant clonal subgroup ST131- H 30. FimTyper provides a standardized tool, as a rapid alternative to conventional fimH subtyping, highly suitable for surveillance and outbreak detection. Copyright © 2017 American Society for Microbiology.
Detection of MPL mutations by a novel allele-specific PCR-based strategy.
Furtado, Larissa V; Weigelin, Helmut C; Elenitoba-Johnson, Kojo S J; Betz, Bryan L
2013-11-01
MPL mutation testing is recommended in patients with suspected primary myelofibrosis or essential thrombocythemia who lack the JAK2 V617F mutation. MPL mutations can occur at allelic levels below 15%, which may escape detection by commonly used mutation screening methods such as Sanger sequencing. We developed a novel multiplexed allele-specific PCR assay capable of detecting most recurrent MPL exon 10 mutations associated with primary myelofibrosis and essential thrombocythemia (W515L, W515K, W515A, and S505N) down to a sensitivity of 2.5% mutant allele. Test results were reviewed from 15 reference cases and 1380 consecutive specimens referred to our laboratory for testing. Assay performance was compared to Sanger sequencing across a series of 58 specimens with MPL mutations. Positive cases consisted of 45 with W515L, 6 with S505N, 5 with W515K, 1 with W515A, and 1 with both W515L and S505N. Seven cases had mutations below 5% that were undetected by Sanger sequencing. Ten additional cases had mutation levels between 5% and 15% that were not consistently detected by sequencing. All results were easily interpreted in the allele-specific test. This assay offers a sensitive and reliable solution for MPL mutation testing. Sanger sequencing appears insufficiently sensitive for robust MPL mutation detection. Our data also suggest the relative frequency of S505N mutations may be underestimated, highlighting the necessity for inclusion of this mutation in MPL test platforms. Copyright © 2013 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Paparini, Andrea; Yang, Rongchang; Chen, Linda; Tong, Kaising; Gibson-Kueh, Susan; Lymbery, Alan; Ryan, Una M
2017-11-01
Currently, the systematics, biology and epidemiology of piscine Cryptosporidium species are poorly understood. Here, we compared Sanger ‒ and next-generation ‒ sequencing (NGS), of piscine Cryptosporidium, at the 18S rRNA and actin genes. The hosts comprised 11 ornamental fish species, spanning four orders and eight families. The objectives were: to (i) confirm the rich genetic diversity of the parasite and the high frequency of mixed infections; and (ii) explore the potential of NGS in the presence of complex genetic mixtures. By Sanger sequencing, four main genotypes were obtained at the actin locus, while for the 18S locus, seven genotypes were identified. At both loci, NGS revealed frequent mixed infections, consisting of one highly dominant variant plus substantially rarer genotypes. Both sequencing methods detected novel Cryptosporidium genotypes at both loci, including a novel and highly abundant actin genotype that was identified by both Sanger sequencing and NGS. Importantly, this genotype accounted for 68·9% of all NGS reads from all samples (249 585/362 372). The present study confirms that aquarium fish can harbour a large and unexplored Cryptosporidium genetic diversity. Although commonly used in molecular parasitology studies, nested PCR prevents quantitative comparisons and thwarts the advantages of NGS, when this latter approach is used to investigate multiple infections.
French, Deborah; Smith, Andrew; Powers, Martin P; Wu, Alan H B
2011-08-17
Binding of a ligand to the epidermal growth factor receptor (EGFR) stimulates various intracellular signaling pathways resulting in cell cycle progression, proliferation, angiogenesis and apoptosis inhibition. KRAS is involved in signaling pathways including RAF/MAPK and PI3K and mutations in this gene result in constitutive activation of these pathways, independent of EGFR activation. Seven mutations in codons 12 and 13 of KRAS comprise around 95% of the observed human mutations, rendering monoclonal antibodies against EGFR (e.g. cetuximab and panitumumab) useless in treatment of colorectal cancer. KRAS mutation testing by two different methodologies was compared; Sanger sequencing and AutoGenomics INFINITI® assay, on DNA extracted from colorectal cancers. Out of 29 colorectal tumor samples tested, 28 were concordant between the two methodologies for the KRAS mutations that were detected in both assays with the INFINITI® assay detecting a mutation in one sample that was indeterminate by Sanger sequencing and a third methodology; single nucleotide primer extension. This study indicates the utility of the AutoGenomics INFINITI® methodology in a clinical laboratory setting where technical expertise or access to equipment for DNA sequencing does not exist. Copyright © 2011 Elsevier B.V. All rights reserved.
Hong, Nan; Chen, Yan-hua; Xie, Chen; Xu, Bai-sheng; Huang, Hui; Li, Xin; Yang, Yue-qing; Huang, Ying-ping; Deng, Jian-lian; Qi, Ming; Gu, Yang-shun
2014-08-01
Nance-Horan syndrome (NHS) is a rare X-linked disorder characterized by congenital nuclear cataracts, dental anomalies, and craniofacial dysmorphisms. Mental retardation was present in about 30% of the reported cases. The purpose of this study was to investigate the genetic and clinical features of NHS in a Chinese family. Whole exome sequencing analysis was performed on DNA from an affected male to scan for candidate mutations on the X-chromosome. Sanger sequencing was used to verify these candidate mutations in the whole family. Clinical and ophthalmological examinations were performed on all members of the family. A combination of exome sequencing and Sanger sequencing revealed a nonsense mutation c.322G>T (E108X) in exon 1 of NHS gene, co-segregating with the disease in the family. The nonsense mutation led to the conversion of glutamic acid to a stop codon (E108X), resulting in truncation of the NHS protein. Multiple sequence alignments showed that codon 108, where the mutation (c.322G>T) occurred, was located within a phylogenetically conserved region. The clinical features in all affected males and female carriers are described in detail. We report a nonsense mutation c.322G>T (E108X) in a Chinese family with NHS. Our findings broaden the spectrum of NHS mutations and provide molecular insight into future NHS clinical genetic diagnosis.
Sanger sequencing as a first-line approach for molecular diagnosis of Andersen-Tawil syndrome.
Totomoch-Serra, Armando; Marquez, Manlio F; Cervantes-Barragán, David E
2017-01-01
In 1977, Frederick Sanger developed a new method for DNA sequencing based on the chain termination method, now known as the Sanger sequencing method (SSM). Recently, massive parallel sequencing, better known as next-generation sequencing (NGS), is replacing the SSM for detecting mutations in cardiovascular diseases with a genetic background. The present opinion article wants to remark that "targeted" SSM is still effective as a first-line approach for the molecular diagnosis of some specific conditions, as is the case for Andersen-Tawil syndrome (ATS). ATS is described as a rare multisystemic autosomal dominant channelopathy syndrome caused mainly by a heterozygous mutation in the KCNJ2 gene . KCJN2 has particular characteristics that make it attractive for "directed" SSM. KCNJ2 has a sequence of 17,510 base pairs (bp), and a short coding region with two exons (exon 1=166 bp and exon 2=5220 bp), half of the mutations are located in the C-terminal cytosolic domain, a mutational hotspot has been described in residue Arg218, and this gene explains the phenotype in 60% of ATS cases that fulfill all the clinical criteria of the disease. In order to increase the diagnosis of ATS we urge cardiologists to search for facial and muscular abnormalities in subjects with frequent ventricular arrhythmias (especially bigeminy) and prominent U waves on the electrocardiogram.
Sanger sequencing as a first-line approach for molecular diagnosis of Andersen-Tawil syndrome
Totomoch-Serra, Armando; Marquez, Manlio F.; Cervantes-Barragán, David E.
2017-01-01
In 1977, Frederick Sanger developed a new method for DNA sequencing based on the chain termination method, now known as the Sanger sequencing method (SSM). Recently, massive parallel sequencing, better known as next-generation sequencing (NGS), is replacing the SSM for detecting mutations in cardiovascular diseases with a genetic background. The present opinion article wants to remark that “targeted” SSM is still effective as a first-line approach for the molecular diagnosis of some specific conditions, as is the case for Andersen-Tawil syndrome (ATS). ATS is described as a rare multisystemic autosomal dominant channelopathy syndrome caused mainly by a heterozygous mutation in the KCNJ2 gene . KCJN2 has particular characteristics that make it attractive for “directed” SSM. KCNJ2 has a sequence of 17,510 base pairs (bp), and a short coding region with two exons (exon 1=166 bp and exon 2=5220 bp), half of the mutations are located in the C-terminal cytosolic domain, a mutational hotspot has been described in residue Arg218, and this gene explains the phenotype in 60% of ATS cases that fulfill all the clinical criteria of the disease. In order to increase the diagnosis of ATS we urge cardiologists to search for facial and muscular abnormalities in subjects with frequent ventricular arrhythmias (especially bigeminy) and prominent U waves on the electrocardiogram. PMID:29093808
Low-Cost, High-Throughput Sequencing of DNA Assemblies Using a Highly Multiplexed Nextera Process.
Shapland, Elaine B; Holmes, Victor; Reeves, Christopher D; Sorokin, Elena; Durot, Maxime; Platt, Darren; Allen, Christopher; Dean, Jed; Serber, Zach; Newman, Jack; Chandran, Sunil
2015-07-17
In recent years, next-generation sequencing (NGS) technology has greatly reduced the cost of sequencing whole genomes, whereas the cost of sequence verification of plasmids via Sanger sequencing has remained high. Consequently, industrial-scale strain engineers either limit the number of designs or take short cuts in quality control. Here, we show that over 4000 plasmids can be completely sequenced in one Illumina MiSeq run for less than $3 each (15× coverage), which is a 20-fold reduction over using Sanger sequencing (2× coverage). We reduced the volume of the Nextera tagmentation reaction by 100-fold and developed an automated workflow to prepare thousands of samples for sequencing. We also developed software to track the samples and associated sequence data and to rapidly identify correctly assembled constructs having the fewest defects. As DNA synthesis and assembly become a centralized commodity, this NGS quality control (QC) process will be essential to groups operating high-throughput pipelines for DNA construction.
Zheng, Yu; Wang, Hai-Lin; Li, Jian-Kang; Xu, Li; Tellier, Laurent; Li, Xiao-Lin; Huang, Xiao-Yan; Li, Wei; Niu, Tong-Tong; Yang, Huan-Ming; Zhang, Jian-Guo; Liu, Dong-Ning
2018-01-01
AIM To study the genes responsible for retinitis pigmentosa. METHODS A total of 15 Chinese families with retinitis pigmentosa, containing 94 sporadically afflicted cases, were recruited. The targeted sequences were captured using the Target_Eye_365_V3 chip and sequenced using the BGISEQ-500 sequencer, according to the manufacturer's instructions. Data were aligned to UCSC Genome Browser build hg19, using the Burroughs Wheeler Aligner MEM algorithm. Local realignment was performed with the Genome Analysis Toolkit (GATK v.3.3.0) IndelRealigner, and variants were called with the Genome Analysis Toolkit Haplotypecaller, without any use of imputation. Variants were filtered against a panel derived from 1000 Genomes Project, 1000G_ASN, ESP6500, ExAC and dbSNP138. In all members of Family ONE and Family TWO with available DNA samples, the genetic variant was validated using Sanger sequencing. RESULTS A novel, pathogenic variant of retinitis pigmentosa, c.357_358delAA (p.Ser119SerfsX5) was identified in PRPF31 in 2 of 15 autosomal-dominant retinitis pigmentosa (ADRP) families, as well as in one, sporadic case. Sanger sequencing was performed upon probands, as well as upon other family members. This novel, pathogenic genotype co-segregated with retinitis pigmentosa phenotype in these two families. CONCLUSION ADRP is a subtype of retinitis pigmentosa, defined by its genotype, which accounts for 20%-40% of the retinitis pigmentosa patients. Our study thus expands the spectrum of PRPF31 mutations known to occur in ADRP, and provides further demonstration of the applicability of the BGISEQ500 sequencer for genomics research. PMID:29375987
A high-throughput Sanger strategy for human mitochondrial genome sequencing
2013-01-01
Background A population reference database of complete human mitochondrial genome (mtGenome) sequences is needed to enable the use of mitochondrial DNA (mtDNA) coding region data in forensic casework applications. However, the development of entire mtGenome haplotypes to forensic data quality standards is difficult and laborious. A Sanger-based amplification and sequencing strategy that is designed for automated processing, yet routinely produces high quality sequences, is needed to facilitate high-volume production of these mtGenome data sets. Results We developed a robust 8-amplicon Sanger sequencing strategy that regularly produces complete, forensic-quality mtGenome haplotypes in the first pass of data generation. The protocol works equally well on samples representing diverse mtDNA haplogroups and DNA input quantities ranging from 50 pg to 1 ng, and can be applied to specimens of varying DNA quality. The complete workflow was specifically designed for implementation on robotic instrumentation, which increases throughput and reduces both the opportunities for error inherent to manual processing and the cost of generating full mtGenome sequences. Conclusions The described strategy will assist efforts to generate complete mtGenome haplotypes which meet the highest data quality expectations for forensic genetic and other applications. Additionally, high-quality data produced using this protocol can be used to assess mtDNA data developed using newer technologies and chemistries. Further, the amplification strategy can be used to enrich for mtDNA as a first step in sample preparation for targeted next-generation sequencing. PMID:24341507
Huang, Li; Xiao, Xueshan; Li, Shiqiang; Jia, Xiaoyun; Wang, Panfeng; Sun, Wenmin; Xu, Yan; Xin, Wei; Guo, Xiangming; Zhang, Qingjiong
2016-05-01
Cone-rod dystrophy (CORD) is a common form of inherited retinal degeneration. Previously, we have conducted serial mutational analysis in probands with CORD either by Sanger sequencing or whole exome sequencing (WES). In the current study, variants in all genes from RetNet were selected from the whole exome sequencing data of 108 CORD probands (including 61 probands reported here for the first time) and were analyzed by multistep bioinformatics analysis, followed by Sanger sequencing and segregation validation. Data from the previous studies and new data from this study (163 probands in total) were summarized to provide an overview of the molecular genetics of CORD. The following potentially pathogenic mutations were identified in 93 of the 163 (57.1%) probands: CNGA3 (32.5%), ABCA4 (3.8%), ALMS1 (3.1%), GUCY2D (3.1%), CACNA1F (2.5%), CRX (1.8%), PDE6C (1.8%), CNGB3 (1.8%), GUCA1A (1.2%), UNC119 (0.6%), RPGRIP1 (1.2%), RDH12 (0.6%), KCNV2 (0.6%), C21orf2 (0.6%), CEP290 (0.6%), USH2A (0.6%) and SNRNP200 (0.6%). The 17 genes with mutations included 12 known CORD genes and five genes (ALMS1, RDH12, CEP290, USH2A, and SNRNP200) associated with other forms of retinal degeneration. Mutations in CNGA3 is most common in this cohort. This is a systematic molecular genetic analysis of Chinese patients with CORD. Copyright © 2016 Elsevier Ltd. All rights reserved.
Identification of a novel MYO7A mutation in Usher syndrome type 1.
Cheng, Ling; Yu, Hongsong; Jiang, Yan; He, Juan; Pu, Sisi; Li, Xin; Zhang, Li
2018-01-05
Usher syndrome (USH) is an autosomal recessive disease characterized by deafness and retinitis pigmentosa. In view of the high phenotypic and genetic heterogeneity in USH, performing genetic screening with traditional methods is impractical. In the present study, we carried out targeted next-generation sequencing (NGS) to uncover the underlying gene in an USH family (2 USH patients and 15 unaffected relatives). One hundred and thirty-five genes associated with inherited retinal degeneration were selected for deep exome sequencing. Subsequently, variant analysis, Sanger validation and segregation tests were utilized to identify the disease-causing mutations in this family. All affected individuals had a classic USH type I (USH1) phenotype which included deafness, vestibular dysfunction and retinitis pigmentosa. Targeted NGS and Sanger sequencing validation suggested that USH1 patients carried an unreported splice site mutation, c.5168+1G>A, as a compound heterozygous mutation with c.6070C>T (p.R2024X) in the MYO7A gene. A functional study revealed decreased expression of the MYO7A gene in the individuals carrying heterozygous mutations. In conclusion, targeted next-generation sequencing provided a comprehensive and efficient diagnosis for USH1. This study revealed the genetic defects in the MYO7A gene and expanded the spectrum of clinical phenotypes associated with USH1 mutations.
A comprehensive characterization of rare mitochondrial DNA variants in neuroblastoma.
Calabrese, Francesco Maria; Clima, Rosanna; Pignataro, Piero; Lasorsa, Vito Alessandro; Hogarty, Michael D; Castellano, Aurora; Conte, Massimo; Tonini, Gian Paolo; Iolascon, Achille; Gasparre, Giuseppe; Capasso, Mario
2016-08-02
Neuroblastoma, a tumor of the developing sympathetic nervous system, is a common childhood neoplasm that is often lethal. Mitochondrial DNA (mtDNA) mutations have been found in most tumors including neuroblastoma. We extracted mtDNA data from a cohort of neuroblastoma samples that had undergone Whole Exome Sequencing (WES) and also used snap-frozen samples in which mtDNA was entirely sequenced by Sanger technology. We next undertook the challenge of determining those mutations that are relevant to, or arisen during tumor development. The bioinformatics pipeline used to extract mitochondrial variants from matched tumor/blood samples was enriched by a set of filters inclusive of heteroplasmic fraction, nucleotide variability, and in silico prediction of pathogenicity. Our in silico multistep workflow applied both on WES and Sanger-sequenced neuroblastoma samples, allowed us to identify a limited burden of somatic and germline mitochondrial mutations with a potential pathogenic impact. The few singleton germline and somatic mitochondrial mutations emerged, according to our in silico analysis, do not appear to impact on the development of neuroblastoma. Our findings are consistent with the hypothesis that most mitochondrial somatic mutations can be considered as 'passengers' and consequently have no discernible effect in this type of cancer.
Optimization of conditions to sequence long cDNAs from viruses
USDA-ARS?s Scientific Manuscript database
Fourth generation sequencing with the Minion nanopore sequencer provides opportunity to obtain deep coverage and long read for single molecules. This will benefit studies on RNA viruses. In the past, Sanger, Illumina, and Ion Torrent sequencing have been utilized to study RNA viruses. Both technique...
Pena, Loren D M; Jiang, Yong-Hui; Schoch, Kelly; Spillmann, Rebecca C; Walley, Nicole; Stong, Nicholas; Rapisardo Horn, Sarah; Sullivan, Jennifer A; McConkie-Rosell, Allyn; Kansagra, Sujay; Smith, Edward C; El-Dairi, Mays; Bellet, Jane; Keels, Martha Ann; Jasien, Joan; Kranz, Peter G; Noel, Richard; Nagaraj, Shashi K; Lark, Robert K; Wechsler, Daniel S G; Del Gaudio, Daniela; Leung, Marco L; Hendon, Laura G; Parker, Collette C; Jones, Kelly L; Goldstein, David B; Shashi, Vandana
2018-04-01
PurposeTo describe examples of missed pathogenic variants on whole-exome sequencing (WES) and the importance of deep phenotyping for further diagnostic testing.MethodsGuided by phenotypic information, three children with negative WES underwent targeted single-gene testing.ResultsIndividual 1 had a clinical diagnosis consistent with infantile systemic hyalinosis, although WES and a next-generation sequencing (NGS)-based ANTXR2 test were negative. Sanger sequencing of ANTXR2 revealed a homozygous single base pair insertion, previously missed by the WES variant caller software. Individual 2 had neurodevelopmental regression and cerebellar atrophy, with no diagnosis on WES. New clinical findings prompted Sanger sequencing and copy number testing of PLA2G6. A novel homozygous deletion of the noncoding exon 1 (not included in the WES capture kit) was detected, with extension into the promoter, confirming the clinical suspicion of infantile neuroaxonal dystrophy. Individual 3 had progressive ataxia, spasticity, and magnetic resonance image changes of vanishing white matter leukoencephalopathy. An NGS leukodystrophy gene panel and WES showed a heterozygous pathogenic variant in EIF2B5; no deletions/duplications were detected. Sanger sequencing of EIF2B5 showed a frameshift indel, probably missed owing to failure of alignment.ConclusionThese cases illustrate potential pitfalls of WES/NGS testing and the importance of phenotype-guided molecular testing in yielding diagnoses.
MPRAnator: a web-based tool for the design of massively parallel reporter assay experiments
Georgakopoulos-Soares, Ilias; Jain, Naman; Gray, Jesse M; Hemberg, Martin
2017-01-01
Motivation: With the rapid advances in DNA synthesis and sequencing technologies and the continuing decline in the associated costs, high-throughput experiments can be performed to investigate the regulatory role of thousands of oligonucleotide sequences simultaneously. Nevertheless, designing high-throughput reporter assay experiments such as massively parallel reporter assays (MPRAs) and similar methods remains challenging. Results: We introduce MPRAnator, a set of tools that facilitate rapid design of MPRA experiments. With MPRA Motif design, a set of variables provides fine control of how motifs are placed into sequences, thereby allowing the investigation of the rules that govern transcription factor (TF) occupancy. MPRA single-nucleotide polymorphism design can be used to systematically examine the functional effects of single or combinations of single-nucleotide polymorphisms at regulatory sequences. Finally, the Transmutation tool allows for the design of negative controls by permitting scrambling, reversing, complementing or introducing multiple random mutations in the input sequences or motifs. Availability and implementation: MPRAnator tool set is implemented in Python, Perl and Javascript and is freely available at www.genomegeek.com and www.sanger.ac.uk/science/tools/mpranator. The source code is available on www.github.com/hemberg-lab/MPRAnator/ under the MIT license. The REST API allows programmatic access to MPRAnator using simple URLs. Contact: igs@sanger.ac.uk or mh26@sanger.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27605100
MPRAnator: a web-based tool for the design of massively parallel reporter assay experiments.
Georgakopoulos-Soares, Ilias; Jain, Naman; Gray, Jesse M; Hemberg, Martin
2017-01-01
With the rapid advances in DNA synthesis and sequencing technologies and the continuing decline in the associated costs, high-throughput experiments can be performed to investigate the regulatory role of thousands of oligonucleotide sequences simultaneously. Nevertheless, designing high-throughput reporter assay experiments such as massively parallel reporter assays (MPRAs) and similar methods remains challenging. We introduce MPRAnator, a set of tools that facilitate rapid design of MPRA experiments. With MPRA Motif design, a set of variables provides fine control of how motifs are placed into sequences, thereby allowing the investigation of the rules that govern transcription factor (TF) occupancy. MPRA single-nucleotide polymorphism design can be used to systematically examine the functional effects of single or combinations of single-nucleotide polymorphisms at regulatory sequences. Finally, the Transmutation tool allows for the design of negative controls by permitting scrambling, reversing, complementing or introducing multiple random mutations in the input sequences or motifs. MPRAnator tool set is implemented in Python, Perl and Javascript and is freely available at www.genomegeek.com and www.sanger.ac.uk/science/tools/mpranator The source code is available on www.github.com/hemberg-lab/MPRAnator/ under the MIT license. The REST API allows programmatic access to MPRAnator using simple URLs. igs@sanger.ac.uk or mh26@sanger.ac.ukSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Association of genetic variants of GRIN2B with autism.
Pan, Yongcheng; Chen, Jingjing; Guo, Hui; Ou, Jianjun; Peng, Yu; Liu, Qiong; Shen, Yidong; Shi, Lijuan; Liu, Yalan; Xiong, Zhimin; Zhu, Tengfei; Luo, Sanchuan; Hu, Zhengmao; Zhao, Jingping; Xia, Kun
2015-02-06
Autism (MIM 209850) is a complex neurodevelopmental disorder characterized by social communication impairments and restricted repetitive behaviors. It has a high heritability, although much remains unclear. To evaluate genetic variants of GRIN2B in autism etiology, we performed a system association study of common and rare variants of GRIN2B and autism in cohorts from a Chinese population, involving a total sample of 1,945 subjects. Meta-analysis of a triad family cohort and a case-control cohort identified significant associations of multiple common variants and autism risk (Pmin = 1.73 × 10(-4)). Significantly, the haplotype involved with the top common variants also showed significant association (P = 1.78 × 10(-6)). Sanger sequencing of 275 probands from a triad cohort identified several variants in coding regions, including four common variants and seven rare variants. Two of the common coding variants were located in the autism-related linkage disequilibrium (LD) block, and both were significantly associated with autism (P < 9 × 10(-3)) using an independent control cohort. Burden analysis and case-only analysis of rare coding variants identified by Sanger sequencing did not find this association. Our study for the first time reveals that common variants and related haplotypes of GRIN2B are associated with autism risk.
Next-generation sequencing: the future of molecular genetics in poultry production and food safety.
Diaz-Sanchez, S; Hanning, I; Pendleton, Sean; D'Souza, Doris
2013-02-01
The era of molecular biology and automation of the Sanger chain-terminator sequencing method has led to discovery and advances in diagnostics and biotechnology. The Sanger methodology dominated research for over 2 decades, leading to significant accomplishments and technological improvements in DNA sequencing. Next-generation high-throughput sequencing (HT-NGS) technologies were developed subsequently to overcome the limitations of this first generation technology that include higher speed, less labor, and lowered cost. Various platforms developed include sequencing-by-synthesis 454 Life Sciences, Illumina (Solexa) sequencing, SOLiD sequencing (among others), and the Ion Torrent semiconductor sequencing technologies that use different detection principles. As technology advances, progress made toward third generation sequencing technologies are being reported, which include Nanopore Sequencing and real-time monitoring of PCR activity through fluorescent resonant energy transfer. The advantages of these technologies include scalability, simplicity, with increasing DNA polymerase performance and yields, being less error prone, and even more economically feasible with the eventual goal of obtaining real-time results. These technologies can be directly applied to improve poultry production and enhance food safety. For example, sequence-based (determination of the gut microbial community, genes for metabolic pathways, or presence of plasmids) and function-based (screening for function such as antibiotic resistance, or vitamin production) metagenomic analysis can be carried out. Gut microbialflora/communities of poultry can be sequenced to determine the changes that affect health and disease along with efficacy of methods to control pathogenic growth. Thus, the purpose of this review is to provide an overview of the principles of these current technologies and their potential application to improve poultry production and food safety as well as public health.
Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database.
Carver, Tim; Berriman, Matthew; Tivey, Adrian; Patel, Chinmay; Böhme, Ulrike; Barrell, Barclay G; Parkhill, Julian; Rajandream, Marie-Adèle
2008-12-01
Artemis and Artemis Comparison Tool (ACT) have become mainstream tools for viewing and annotating sequence data, particularly for microbial genomes. Since its first release, Artemis has been continuously developed and supported with additional functionality for editing and analysing sequences based on feedback from an active user community of laboratory biologists and professional annotators. Nevertheless, its utility has been somewhat restricted by its limitation to reading and writing from flat files. Therefore, a new version of Artemis has been developed, which reads from and writes to a relational database schema, and allows users to annotate more complex, often large and fragmented, genome sequences. Artemis and ACT have now been extended to read and write directly to the Generic Model Organism Database (GMOD, http://www.gmod.org) Chado relational database schema. In addition, a Gene Builder tool has been developed to provide structured forms and tables to edit coordinates of gene models and edit functional annotation, based on standard ontologies, controlled vocabularies and free text. Artemis and ACT are freely available (under a GPL licence) for download (for MacOSX, UNIX and Windows) at the Wellcome Trust Sanger Institute web sites: http://www.sanger.ac.uk/Software/Artemis/ http://www.sanger.ac.uk/Software/ACT/
Sun, Shumei; Zhou, Hao; Zhou, Bin; Hu, Ziyou; Hou, Jinlin; Sun, Jian
2012-05-01
To evaluate the sensitivity and specificity of nested PCR combined with pyrosequencing in the detection of HBV drug-resistance gene. RtM204I (ATT) mutant and rtM204 (ATG) nonmutant plasmids mixed at different ratios were detected for mutations using nested-PCR combined with pyrosequencing, and the results were compared with those by conventional PCR pyrosequencing to analyze the linearity and consistency of the two methods. Clinical specimens with different viral loads were examined for drug-resistant mutations using nested PCR pyrosequencing and nested PCR combined with dideoxy sequencing (Sanger) for comparison of the detection sensitivity and specificity. The fitting curves demonstrated good linearity of both conventional PCR pyrosequencing and nested PCR pyrosequencing (R(2)>0.99, P<0.05). Nested PCR showed a better consistency with the predicted value than conventional PCR, and was superior to conventional PCR for detection of samples containing 90% mutant plasmid. In the detection of clinical specimens, Sanger sequencing had a significantly lower sensitivity than nested PCR pyrosequencing (92% vs 100%, P<0.01). The detection sensitivity of Sanger sequencing varied with the viral loads, especially in samples with low viral copies (HBV DNA ≤3log10 copies/ml), where the sensitivity was 78%, significantly lower than that of pyrosequencing (100%, P<0.01). Neither of the two methods yielded positive results for the negative control samples, suggesting their good specificity. Compared with nested PCR and Sanger sequencing method, nested PCR pyrosequencing has a higher sensitivity especially in clinical specimens with low viral copies, which can be important for early detection of HBV mutant strains and hence more effective clinical management.
USDA-ARS?s Scientific Manuscript database
The current pig reference genome sequence (Sscrofa10.2) was established using Sanger sequencing and following the clone-by-clone hierarchical shotgun sequencing approach used in the public human genome project. However, as sequence coverage was low (4-6x) the resulting assembly was only of draft qua...
The Dynamics of DNA Sequencing.
ERIC Educational Resources Information Center
Morvillo, Nancy
1997-01-01
Describes a paper-and-pencil activity that helps students understand DNA sequencing and expands student understanding of DNA structure, replication, and gel electrophoresis. Appropriate for advanced biology students who are familiar with the Sanger method. (DDR)
Hong, Nan; Chen, Yan-hua; Xie, Chen; Xu, Bai-sheng; Huang, Hui; Li, Xin; Yang, Yue-qing; Huang, Ying-ping; Deng, Jian-lian; Qi, Ming; Gu, Yang-shun
2014-01-01
Objective: Nance-Horan syndrome (NHS) is a rare X-linked disorder characterized by congenital nuclear cataracts, dental anomalies, and craniofacial dysmorphisms. Mental retardation was present in about 30% of the reported cases. The purpose of this study was to investigate the genetic and clinical features of NHS in a Chinese family. Methods: Whole exome sequencing analysis was performed on DNA from an affected male to scan for candidate mutations on the X-chromosome. Sanger sequencing was used to verify these candidate mutations in the whole family. Clinical and ophthalmological examinations were performed on all members of the family. Results: A combination of exome sequencing and Sanger sequencing revealed a nonsense mutation c.322G>T (E108X) in exon 1 of NHS gene, co-segregating with the disease in the family. The nonsense mutation led to the conversion of glutamic acid to a stop codon (E108X), resulting in truncation of the NHS protein. Multiple sequence alignments showed that codon 108, where the mutation (c.322G>T) occurred, was located within a phylogenetically conserved region. The clinical features in all affected males and female carriers are described in detail. Conclusions: We report a nonsense mutation c.322G>T (E108X) in a Chinese family with NHS. Our findings broaden the spectrum of NHS mutations and provide molecular insight into future NHS clinical genetic diagnosis. PMID:25091991
Hong, Yoonki; Kim, Woo Jin; Bang, Chi Young; Lee, Jae Cheol; Oh, Yeon-Mok
2016-04-01
Lung cancer is the most common cause of cancer related death. Alterations in gene sequence, structure, and expression have an important role in the pathogenesis of lung cancer. Fusion genes and alternative splicing of cancer-related genes have the potential to be oncogenic. In the current study, we performed RNA-sequencing (RNA-seq) to investigate potential fusion genes and alternative splicing in non-small cell lung cancer. RNA was isolated from lung tissues obtained from 86 subjects with lung cancer. The RNA samples from lung cancer and normal tissues were processed with RNA-seq using the HiSeq 2000 system. Fusion genes were evaluated using Defuse and ChimeraScan. Candidate fusion transcripts were validated by Sanger sequencing. Alternative splicing was analyzed using multivariate analysis of transcript sequencing and validated using quantitative real time polymerase chain reaction. RNA-seq data identified oncogenic fusion genes EML4-ALK and SLC34A2-ROS1 in three of 86 normal-cancer paired samples. Nine distinct fusion transcripts were selected using DeFuse and ChimeraScan; of which, four fusion transcripts were validated by Sanger sequencing. In 33 squamous cell carcinoma, 29 tumor specific skipped exon events and six mutually exclusive exon events were identified. ITGB4 and PYCR1 were top genes that showed significant tumor specific splice variants. In conclusion, RNA-seq data identified novel potential fusion transcripts and splice variants. Further evaluation of their functional significance in the pathogenesis of lung cancer is required.
Oh, Hye-Seon; Kwon, Hyemi; Park, Suyeon; Kim, Mijin; Jeon, Min Ji; Kim, Tae Yong; Shong, Young Kee; Kim, Won Bae; Choi, Jene
2018-01-01
Background The BRAFV600E mutation is the most common genetic alteration identified in papillary thyroid carcinoma (PTC). Because of its costs effectiveness and sensitivity, direct Sanger sequencing has several limitations. The aim of this study was to evaluate the efficiency of immunohistochemistry (IHC) as an alternative method to detect the BRAFV600E mutation in preoperative and postoperative tissue samples. Methods We evaluated 71 patients who underwent thyroid surgery with the result of direct sequencing of the BRAFV600E mutation. IHC staining of the BRAFV600E mutation was performed in 49 preoperative and 23 postoperative thyroid specimens. Results Sixty-two patients (87.3%) had PTC, and of these, BRAFV600E was confirmed by direct sequencing in 57 patients (91.9%). In 23 postoperative tissue samples, the BRAFV600E mutation was detected in 16 samples (70%) by direct sequencing and 18 samples (78%) by IHC. In 24 fine needle aspiration (FNA) samples, BRAFV600E was detected in 18 samples (75%) by direct sequencing and 16 samples (67%) by IHC. In 25 core needle biopsy (CNB) samples, the BRAFV600E mutation was detected in 15 samples (60%) by direct sequencing and 16 samples (64%) by IHC. The sensitivity and specificity of IHC for detecting the BRAFV600E mutation were 77.8% and 66.7% in FNA samples and 99.3% and 80.0% in CNB samples. Conclusion IHC could be an alternative method to direct Sanger sequencing for BRAFV600E mutation detection both in postoperative and preoperative samples. However, application of IHC to detect the BRAFV600E mutation in FNA samples is of limited value compared with direct sequencing. PMID:29388401
Xu, Yan; Xiao, Xueshan; Li, Shiqiang; Jia, Xiaoyun; Xin, Wei; Wang, Panfeng; Sun, Wenmin; Huang, Li; Guo, Xiangming; Zhang, Qingjiong
2016-08-01
Leber congenital amaurosis (LCA) is the most severe form of inherited retinal dystrophy. We have previously performed a mutational analysis of the known LCA-associated genes in probands with LCA by both Sanger and whole exome sequencing. In this study, whole exome sequencing was carried out on 66 new probabds with LCA. In conjunction with these data, the present study provides a comprehensive analysis of the spectrum and frequency of all known genes associated with retinal dystrophy in a total of 159 Chinese probands with LCA. The known genes responsible for all forms hereditary retinal dystrophy were included based on information from RetNet. The candidate variants were filtered by bioinformatics analysis and confirmed by Sanger sequencing. Potentially causative mutations were further validated in available family members. Overall, a total of 118 putative pathogenic mutations from 23 genes were identified in 56.6% (90/159) of probands. These mutations were harbored in 13 LCA-associated genes and in ten genes related to other forms of retinal dystrophy. The most frequently mutated gene in probands with LCA was GUCY2D (10.7%, 17/159). A series of mutational analyses suggests that all known genes associated with retinal dystrophy account for 56.6% of Chinese patients with LCA. A comprehensive molecular genetic analysis of Chinese patients with LCA provides an overview of the spectrum and frequency of ethno-specific mutations of all known genes, as well as indications about other unknown genes in the remaining probands who lacked identified mutations. Copyright © 2016 Elsevier Ltd. All rights reserved.
PAVE: program for assembling and viewing ESTs.
Soderlund, Carol; Johnson, Eric; Bomhoff, Matthew; Descour, Anne
2009-08-26
New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information, hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. The PAVE (Program for Assembling and Viewing ESTs) assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly, however it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.
BMPR1B mutation causes Pierre Robin sequence
Yao, Xu; Zhang, Rong; Yang, Hui; Zhao, Rui; Guo, Jihong; Jin, Ke; Mei, Haibo; Luo, Yongqi; Zhao, Liu; Tu, Ming; Zhu, Yimin
2017-01-01
Background We investigated a large family with Pierre Robin sequence (PRS). Aim of the study This study aims to determine the genetic cause of PRS. Results The reciprocal translocation t(4;6)(q22;p21) was identified to be segregated with PRS in a three-generation family. Whole-genome sequencing and Sanger sequencing successfully detected breakpoints in the intragenic regions of BMRP1B and GRM4. We hypothesized that PRS in this family was caused by (i) haploinsufficiency for BMPR1B or (ii) a gain of function mechanism mediated by the BMPR1B-GRM4 fusion gene. In an unrelated family, we identified another BMPR1B-splicing mutation that co-segregated with PRS. Conclusion We detected two BMPR1B mutations in two unrelated PRS families, suggesting that BMPR1B disruption is probably a cause of human PRS. Methods GTG banding, comparative genomic hybridization, whole-genome sequencing, and Sanger sequencing were performed to identify the gene causing PRS. PMID:28418932
Lim, Eileen C P; Brett, Maggie; Lai, Angeline H M; Lee, Siew-Peng; Tan, Ee-Shien; Jamuar, Saumya S; Ng, Ivy S L; Tan, Ene-Choo
2015-12-14
Next-generation sequencing (NGS) has revolutionized genetic research and offers enormous potential for clinical application. Sequencing the exome has the advantage of casting the net wide for all known coding regions while targeted gene panel sequencing provides enhanced sequencing depths and can be designed to avoid incidental findings in adult-onset conditions. A HaloPlex panel consisting of 180 genes within commonly altered chromosomal regions is available for use on both the Ion Personal Genome Machine (PGM) and MiSeq platforms to screen for causative mutations in these genes. We used this Haloplex ICCG panel for targeted sequencing of 15 patients with clinical presentations indicative of an abnormality in one of the 180 genes. Sequencing runs were done using the Ion 318 Chips on the Ion Torrent PGM. Variants were filtered for known polymorphisms and analysis was done to identify possible disease-causing variants before validation by Sanger sequencing. When possible, segregation of variants with phenotype in family members was performed to ascertain the pathogenicity of the variant. More than 97% of the target bases were covered at >20×. There was an average of 9.6 novel variants per patient. Pathogenic mutations were identified in five genes for six patients, with two novel variants. There were another five likely pathogenic variants, some of which were unreported novel variants. In a cohort of 15 patients, we were able to identify a likely genetic etiology in six patients (40%). Another five patients had candidate variants for which further evaluation and segregation analysis are ongoing. Our results indicate that the HaloPlex ICCG panel is useful as a rapid, high-throughput and cost-effective screening tool for 170 of the 180 genes. There is low coverage for some regions in several genes which might have to be supplemented by Sanger sequencing. However, comparing the cost, ease of analysis, and shorter turnaround time, it is a good alternative to exome sequencing for patients whose features are suggestive of a genetic etiology involving one of the genes in the panel.
Madeira, Joao Lo; Nishi, Mirian Y; Nakaguma, Marilena; Benedetti, Anna F; Biscotto, Isabela Peixoto; Fernandes, Thamiris; Pequeno, Thiago; Figueiredo, Thalita; Franca, Marcela M; Correa, Fernanda A; Otto, Aline P; Abrão, Milena; Miras, Mirta B; Santos, Silvana; Jorge, Alexander Al; Costalonga, Everlayny F; Mendonca, Berenice B; Arnhold, Ivo Jp; Carvalho, Luciani R
2017-12-01
Mutations in PROP1, HESX1 and LHX3 are associated with combined pituitary hormone deficiency (CPHD) and orthotopic posterior pituitary lobe (OPP). To identify mutations in PROP1, HESX1 and LHX3 in a large cohort of patients with CPHD and OPP (35 Brazilian, two Argentinian). We studied 23 index patients with CPHD and OPP (six familial and 17 sporadic) as well as 14 relatives. PROP1 was sequenced by the Sanger method in all except one sporadic case studied using a candidate gene panel. Multiplex ligation-dependent probe amplification (MLPA) was applied to one familial case in whom PROP1 failed to amplify by PCR. In the 13 patients without PROP1 mutations, HESX1 and LHX3 were sequenced by the Sanger method. We identified PROP1 mutations in 10 index cases. Three mutations were novel: one affecting the initiation codon (c.1A>G) and two affecting splicing sites, c.109+1G>A and c.342+1G>C. The known mutations, c.150delA (p.Arg53Aspfs*112), c.218G>A (p.Arg73His), c.263T>C (p.Phe88Ser) and c.301_302delAG (p.Leu102Cysfs*8), were also detected. MLPA confirmed complete PROP1 deletion in one family. We did not identify HESX1 and LHX3 mutations by Sanger. PROP1 mutations are a prevalent cause of congenital CPHD with OPP, and therefore, PROP1 sequencing must be the first step of molecular investigation in patients with CPHD and OPP, especially in populations with a high frequency of PROP1 mutations. In the absence of mutations, massively parallel sequencing is a promising approach. The high prevalence and diversity of PROP1 mutations is associated with the ethnic background of this cohort. © 2017 John Wiley & Sons Ltd.
USDA-ARS?s Scientific Manuscript database
The complete nucleotide sequence of a recently discovered Florida (FL) isolate of Hibiscus infecting Cilevirus (HiCV) was determined by Sanger sequencing. The movement- and coat- protein gene sequences of the HiCV-FL isolate are more divergent than other genes of the previously sequenced HiCV-HA (Ha...
Statistical method to compare massive parallel sequencing pipelines.
Elsensohn, M H; Leblay, N; Dimassi, S; Campan-Fournier, A; Labalme, A; Roucher-Boulez, F; Sanlaville, D; Lesca, G; Bardel, C; Roy, P
2017-03-01
Today, sequencing is frequently carried out by Massive Parallel Sequencing (MPS) that cuts drastically sequencing time and expenses. Nevertheless, Sanger sequencing remains the main validation method to confirm the presence of variants. The analysis of MPS data involves the development of several bioinformatic tools, academic or commercial. We present here a statistical method to compare MPS pipelines and test it in a comparison between an academic (BWA-GATK) and a commercial pipeline (TMAP-NextGENe®), with and without reference to a gold standard (here, Sanger sequencing), on a panel of 41 genes in 43 epileptic patients. This method used the number of variants to fit log-linear models for pairwise agreements between pipelines. To assess the heterogeneity of the margins and the odds ratios of agreement, four log-linear models were used: a full model, a homogeneous-margin model, a model with single odds ratio for all patients, and a model with single intercept. Then a log-linear mixed model was fitted considering the biological variability as a random effect. Among the 390,339 base-pairs sequenced, TMAP-NextGENe® and BWA-GATK found, on average, 2253.49 and 1857.14 variants (single nucleotide variants and indels), respectively. Against the gold standard, the pipelines had similar sensitivities (63.47% vs. 63.42%) and close but significantly different specificities (99.57% vs. 99.65%; p < 0.001). Same-trend results were obtained when only single nucleotide variants were considered (99.98% specificity and 76.81% sensitivity for both pipelines). The method allows thus pipeline comparison and selection. It is generalizable to all types of MPS data and all pipelines.
Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuo, Alan; Grigoriev, Igor
2009-04-17
Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentousmore » ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.« less
Problem-Solving Test: Pyrosequencing
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2013-01-01
Terms to be familiar with before you start to solve the test: Maxam-Gilbert sequencing, Sanger sequencing, gel electrophoresis, DNA synthesis reaction, polymerase chain reaction, template, primer, DNA polymerase, deoxyribonucleoside triphosphates, orthophosphate, pyrophosphate, nucleoside monophosphates, luminescence, acid anhydride bond,…
Jimenez, Nelson Lopez; Flannick, Jason; Yahyavi, Mani; Li, Jiang; Bardakjian, Tanya; Tonkin, Leath; Schneider, Adele; Sherr, Elliott H; Slavotinek, Anne M
2011-12-28
Anophthalmia/microphthalmia (A/M) is caused by mutations in several different transcription factors, but mutations in each causative gene are relatively rare, emphasizing the need for a testing approach that screens multiple genes simultaneously. We used next-generation sequencing to screen 15 A/M patients for mutations in 9 pathogenic genes to evaluate this technology for screening in A/M. We used a pooled sequencing design, together with custom single nucleotide polymorphism (SNP) calling software. We verified predicted sequence alterations using Sanger sequencing. We verified three mutations - c.542delC in SOX2, resulting in p.Pro181Argfs*22, p.Glu105X in OTX2 and p.Cys240X in FOXE3. We found several novel sequence alterations and SNPs that were likely to be non-pathogenic - p.Glu42Lys in CRYBA4, p.Val201Met in FOXE3 and p.Asp291Asn in VSX2. Our analysis methodology gave one false positive result comprising a mutation in PAX6 (c.1268A > T, predicting p.X423LeuextX*15) that was not verified by Sanger sequencing. We also failed to detect one 20 base pair (bp) deletion and one 3 bp duplication in SOX2. Our results demonstrated the power of next-generation sequencing with pooled sample groups for the rapid screening of candidate genes for A/M as we were correctly able to identify disease-causing mutations. However, next-generation sequencing was less useful for small, intragenic deletions and duplications. We did not find mutations in 10/15 patients and conclude that there is a need for further gene discovery in A/M.
2011-01-01
Background Anophthalmia/microphthalmia (A/M) is caused by mutations in several different transcription factors, but mutations in each causative gene are relatively rare, emphasizing the need for a testing approach that screens multiple genes simultaneously. We used next-generation sequencing to screen 15 A/M patients for mutations in 9 pathogenic genes to evaluate this technology for screening in A/M. Methods We used a pooled sequencing design, together with custom single nucleotide polymorphism (SNP) calling software. We verified predicted sequence alterations using Sanger sequencing. Results We verified three mutations - c.542delC in SOX2, resulting in p.Pro181Argfs*22, p.Glu105X in OTX2 and p.Cys240X in FOXE3. We found several novel sequence alterations and SNPs that were likely to be non-pathogenic - p.Glu42Lys in CRYBA4, p.Val201Met in FOXE3 and p.Asp291Asn in VSX2. Our analysis methodology gave one false positive result comprising a mutation in PAX6 (c.1268A > T, predicting p.X423LeuextX*15) that was not verified by Sanger sequencing. We also failed to detect one 20 base pair (bp) deletion and one 3 bp duplication in SOX2. Conclusions Our results demonstrated the power of next-generation sequencing with pooled sample groups for the rapid screening of candidate genes for A/M as we were correctly able to identify disease-causing mutations. However, next-generation sequencing was less useful for small, intragenic deletions and duplications. We did not find mutations in 10/15 patients and conclude that there is a need for further gene discovery in A/M. PMID:22204637
Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting
NASA Astrophysics Data System (ADS)
Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.
1997-05-01
Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate laser mass spectrometry can possible be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic disease can be a very efficient step to reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.
da Fonseca, Allex Jardim; Galvão, Renata Silva; Miranda, Angelica Espinosa; Ferreira, Luiz Carlos de Lima; Chen, Zigui
2016-05-01
To compare the diagnostic performance for HPV infection using three laboratorial techniques. Ninty-five cervicovaginal samples were randomly selected; each was tested for HPV DNA and genotypes using 3 methods in parallel: Multiplex-PCR, the Nested PCR followed by Sanger sequencing, and the Next_Gen Sequencing (NGS) with two assays (NGS-A1, NGS-A2). The study was approved by the Brazilian National IRB (CONEP protocol 16,800). The prevalence of HPV by the NGS assays was higher than that using the Multiplex-PCR (64.2% vs. 45.2%, respectively; P = 0.001) and the Nested-PCR (64.2% vs. 49.5%, respectively; P = 0.003). NGS also showed better performance in detecting high-risk HPV (HR-HPV) and HPV16. There was a weak interobservers agreement between the results of Multiplex-PCR and Nested-PCR in relation to NGS for the diagnosis of HPV infection, and a moderate correlation for HR-HPV detection. Both NGS assays showed a strong correlation for detection of HPVs (k = 0.86), HR-HPVs (k = 0.91), HPV16 (k = 0.92) and HPV18 (k = 0.91). NGS is more sensitive than the traditional Sanger sequencing and the Multiplex PCR to genotype HPVs, with promising ability to detect multiple infections, and may have the potential to establish an alternative method for the diagnosis and genotyping of HPV. © 2015 Wiley Periodicals, Inc.
Oh, Yejin; Song, Ik-Chan; Kim, Jimyung; Kwon, Gye Cheol; Koo, Sun Hoe; Kim, Seon Young
2018-05-01
We developed a pyrosequencing-based method for the quantification of CALR mutations and compared the results using Sanger sequencing, fragment length analysis (FLA), digital-droplet PCR (ddPCR), and next-generation sequencing (NGS). Method validation studies were performed using cloned plasmid controls. Samples from 24 patients with myeloproliferative neoplasms were evaluated. Among the 24 patients, 15 had CALR mutations (7 type 1, 2 type 2, and 6 other mutations). The type 1 or type 2 mutation-positive results from pyrosequencing exhibited 100% concordance with the Sanger sequencing results. One novel CALR mutation was not detected by pyrosequencing. The CALR mutation allele burdens measured by pyrosequencing were slightly lower than those measured by FLA but slightly higher than the results obtained using ddPCR. Pyrosequencing exhibited high correlations with both methods. The mutation allele burdens estimated by NGS were significantly lower than those measured by pyrosequencing. An increased CALR mutation allele burden was associated with overt primary myelofibrosis. Patients with >70% mutation allele burdens in myeloid cells had a significantly longer time from diagnosis (P = 0.007), more bone marrow fibrosis (P = 0.010), and lower hemoglobin (P = 0.007). Pyrosequencing was a useful rapid sequencing method to determine the burden of CALR mutations. Copyright © 2018 Elsevier B.V. All rights reserved.
Baurand, Amandine; Falcon-Eicher, Sylvie; Laurent, Gabriel; Villain, Elisabeth; Bonnet, Caroline; Thauvin-Robinet, Christel; Jacquot, Caroline; Eicher, Jean-Christophe; Gourraud, Jean-Baptiste; Schmitt, Sébastien; Bézieau, Stéphane; Giraud, Mathilde; Dumont, Solenne; Kuentz, Paul; Probst, Vincent; Burguet, Antoine; Kyndt, Florence; Faivre, Laurence
2017-02-01
Autosomal dominant genetic diseases can occur de novo and in the form of somatic mosaicism, which can give rise to a less severe phenotype, and make diagnosis more difficult given the sensitivity limits of the methods used. We report the case of female child with a history of surgery for syndactyly of the hands and feet, who was admitted at 6 years of age to a pediatric intensive care unit following cardiac arrest. The electrocardiogram (ECG) showed a long QT interval that on occasions reached 500 ms. Despite the absence of facial dysmorphism and the presence of normal psychomotor development, a diagnosis of Timothy syndrome was made given the association of syndactyly and the ECG features. Sanger sequencing of the CACNA1C gene, followed by sequencing of the genes KCNQ1, KCNH2, KCNE1, KCNE2, were negative. The subsequent analysis of a panel of genes responsible for hereditary cardiac rhythm disorders using Haloplex technology revealed a recurrent mosaic p.Gly406Arg missense mutation of the CACNA1C gene in 18% of the cells. This mosaicism can explain the negative Sanger analysis and the less complete phenotype in this patient. Given the other cases in the literature, mosaic mutations in Timothy syndrome appear more common than previously thought. This case demonstrates the importance of using next-generation sequencing to identify mosaic mutations when the clinical picture supports a specific mutation that is not identified using conventional testing. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Birt-Hogg-Dubé syndrome in two Chinese families with mutations in the FLCN gene.
Hou, Xiaocan; Zhou, Yuan; Peng, Yun; Qiu, Rong; Xia, Kun; Tang, Beisha; Zhuang, Wei; Jiang, Hong
2018-01-22
Birt-Hogg-Dubé syndrome is an autosomal dominant hereditary condition caused by mutations in the folliculin-encoding gene FLCN (NM_144997). It is associated with skin lesions such as fibrofolliculoma, acrochordon and trichodiscoma; pulmonary lesions including spontaneous pneumothorax and pulmonary cysts and renal cancer. Genomic DNA was extracted from peripheral venous blood samples of the propositi and their family members. Genetic analysis was performed by whole exome sequencing and Sanger sequencing aiming at corresponding exons in FLCN gene to explore the genetic mutations of these two families. In this study, we performed genetic analysis by whole exome sequencing and Sanger sequencing aiming at corresponding exons in FLCN gene to explore the genetic mutations in two Chinese families. Patients from family 1 mostly suffered from pneumothorax and pulmonary cysts, several of whom also mentioned skin lesions or kidney lesions. While in family 2, only thoracic lesions were found in the patients, without any other clinical manifestations. Two FLCN mutations have been identified: One is an insertion mutation (c.1579_1580insA/p.R527Xfs on exon 14) previously reported in three Asian families (one mainland family and two Taiwanese families); while the other is a firstly reviewed mutation in Asian population (c.649C > T / p.Gln217X on exon 7) that ever been detected in a French family. Overall, The detection of these two mutations expands the spectrum of FLCN mutations and will provide insight into genetic diagnosis and counseling of Birt-Hogg-Dubé syndrome.
Pena, Loren DM; Jiang, Yong-Hui; Schoch, Kelly; Spillmann, Rebecca C.; Walley, Nicole; Stong, Nicholas; Horn, Sarah Rapisardo; Sullivan, Jennifer A.; McConkie-Rosell, Allyn; Kansagra, Sujay; Smith, Edward C.; El-Dairi, Mays; Bellet, Jane; Ann Keels, Martha; Jasien, Joan; Kranz, Peter G.; Noel, Richard; Nagaraj, Shashi K.; Lark, Robert K.; Wechsler, Daniel SG; del Gaudio, Daniela; Leung, Marco L.; Hendon, Laura G.; Parker, Collette C.; Jones, Kelly L.; Goldstein, David B.; Shashi, Vandana
2017-01-01
Purpose To describe examples of missed pathogenic variants on whole exome sequencing (WES) and the importance of deep phenotyping for further diagnostic testing. Methods Guided by phenotypic information, three children with negative WES underwent targeted single gene testing. Results Individual 1 had a clinical diagnosis consistent with infantile systemic hyalinosis, although WES and an NGS-based ANTXR2 test were negative. Sanger sequencing of ANTXR2 revealed a homozygous single base pair insertion, previously missed by the WES variant caller software. Individual 2 had neurodevelopmental regression and cerebellar atrophy, with no diagnosis on WES. New clinical findings prompted Sanger sequencing and copy number testing of PLA2G6. A novel homozygous deletion of the non-coding exon 1 (not included in the WES capture kit) was detected, with extension into the promoter, confirming the clinical suspicion of infantile neuroaxonal dystrophy. Individual 3 had progressive ataxia, spasticity and MRI changes of vanishing white matter leukoencephalopathy. An NGS leukodystrophy gene panel and WES showed a heterozygous pathogenic variant in EIF2B5; no deletions/duplications were detected. Sanger sequencing of EIF2B5 showed a frameshift indel, likely missed due to failure of alignment. Conclusions These cases illustrate potential pitfalls of WES/NGS testing, and the importance of phenotype-guided molecular testing in yielding diagnoses. PMID:28914269
First report of Cocksfoot mottle virus infecting wheat (Triticum aestivum) in Ohio
USDA-ARS?s Scientific Manuscript database
Cocksfoot mottle virus (CfMV) was discovered in Ohio wheat during a 2016 field survey utilizing RNA-Seq to identify virus-like sequences. Virus sequences were confirmed by reverse transcriptase-polymerase chain reaction (RT-PCR) and Sanger sequencing, and CfMV was transmitted to orchardgrass and pas...
Gille, Johan J. P.; Floor, Karijn; Kerkhoven, Lianne; Ameziane, Najim; Joenje, Hans; de Winter, Johan P.
2012-01-01
Fanconi anemia (FA) is a rare inherited disease characterized by developmental defects, short stature, bone marrow failure, and a high risk of malignancies. FA is heterogeneous: 15 genetic subtypes have been distinguished so far. A clinical diagnosis of FA needs to be confirmed by testing cells for sensitivity to cross-linking agents in a chromosomal breakage test. As a second step, DNA testing can be employed to elucidate the genetic subtype of the patient and to identify the familial mutations. This knowledge allows preimplantation genetic diagnosis (PGD) and enables prenatal DNA testing in future pregnancies. Although simultaneous testing of all FA genes by next generation sequencing will be possible in the near future, this technique will not be available immediately for all laboratories. In addition, in populations with strong founder mutations, a limited test using Sanger sequencing and MLPA will be a cost-effective alternative. We describe a strategy and optimized conditions for the screening of FANCA, FANCB, FANCC, FANCE, FANCF, and FANCG and present the results obtained in a cohort of 54 patients referred to our diagnostic service since 2008. In addition, the follow up with respect to genetic counseling and carrier screening in the families is discussed. PMID:22778927
A comprehensive characterization of rare mitochondrial DNA variants in neuroblastoma
Pignataro, Piero; Lasorsa, Vito Alessandro; Hogarty, Michael D.; Castellano, Aurora; Conte, Massimo; Tonini, Gian Paolo; Iolascon, Achille; Gasparre, Giuseppe; Capasso, Mario
2016-01-01
Background Neuroblastoma, a tumor of the developing sympathetic nervous system, is a common childhood neoplasm that is often lethal. Mitochondrial DNA (mtDNA) mutations have been found in most tumors including neuroblastoma. We extracted mtDNA data from a cohort of neuroblastoma samples that had undergone Whole Exome Sequencing (WES) and also used snap-frozen samples in which mtDNA was entirely sequenced by Sanger technology. We next undertook the challenge of determining those mutations that are relevant to, or arisen during tumor development. The bioinformatics pipeline used to extract mitochondrial variants from matched tumor/blood samples was enriched by a set of filters inclusive of heteroplasmic fraction, nucleotide variability, and in silico prediction of pathogenicity. Results Our in silico multistep workflow applied both on WES and Sanger-sequenced neuroblastoma samples, allowed us to identify a limited burden of somatic and germline mitochondrial mutations with a potential pathogenic impact. Conclusions The few singleton germline and somatic mitochondrial mutations emerged, according to our in silico analysis, do not appear to impact on the development of neuroblastoma. Our findings are consistent with the hypothesis that most mitochondrial somatic mutations can be considered as ‘passengers’ and consequently have no discernible effect in this type of cancer. PMID:27351283
Shen, Tao; Guan, Liping; Li, Shiqiang; Zhang, Jianguo; Xiao, Xueshan; Jiang, Hui; Yang, Jianhua; Guo, Xiangming; Wang, Jun; Zhang, Qingjiong
2015-03-01
The genetic defects underlying approximately half of all retinitis pigmentosa (RP) cases are unknown. A number of genes responsible for Leber congenital amaurosis (LCA) may also cause RP when they are mutated. Our previous study revealed that variants in the most frequently mutated nine exons accounted for approximately half of the mutations detected in a cohort of patients with LCA. The aim of the present study was to detect mutations in LCA-associated genes in patients with RP using two different strategies. Sanger sequencing was used to screen mutations in the nine exons in 293 patients with RP and exome sequencing was used to detect variants in 12 LCA-associated genes in 157 of the 293 patients with RP and then to validate the variants by Sanger sequencing. Potential pathogenic mutations were identified in four patients with early onset RP, including homozygous CRB1 mutations in two patients, compound heterozygous CRB1 mutations in one patient and compound heterozygous CEP290 mutations in one patient. The present study indicated that mutations in CEP290 may also be associated with RP but not with LCA. With the exception of CEP290, the remaining 11 genes known to be associated with LCA but not with RP are unlikely to be a common cause of RP.
Kariminejad, Ariana; Ajeawung, Norbert Fonya; Bozorgmehr, Bita; Dionne-Laporte, Alexandre; Molidperee, Sirinart; Najafi, Kimia; Gibbs, Richard A; Lee, Brendan H; Hennekam, Raoul C; Campeau, Philippe M
2017-04-01
Kaufman oculo-cerebro-facial syndrome (KOS) is caused by recessive UBE3B mutations and presents with microcephaly, ocular abnormalities, distinctive facial morphology, low cholesterol levels and intellectual disability. We describe a child with microcephaly, brachycephaly, hearing loss, ptosis, blepharophimosis, hypertelorism, cleft palate, multiple renal cysts, absent nails, small or absent terminal phalanges, absent speech and intellectual disability. Syndromes that were initially considered include DOORS syndrome, Coffin-Siris syndrome and Dubowitz syndrome. Clinical investigations coupled with karyotype analysis, array-comparative genomic hybridization, exome and Sanger sequencing were performed to characterize the condition in this child. Sanger sequencing was negative for the DOORS syndrome gene TBC1D24 but exome sequencing identified a homozygous deletion in UBE3B (NM_183415:c.3139_3141del, p.1047_1047del) located within the terminal portion of the HECT domain. This finding coupled with the presence of characteristic features such as brachycephaly, ptosis, blepharophimosis, hypertelorism, short palpebral fissures, cleft palate and developmental delay allowed us to make a diagnosis of KOS. In conclusion, our findings highlight the importance of considering KOS as a differential diagnosis for patients under evaluation for DOORS syndrome and expand the phenotype of KOS to include small or absent terminal phalanges, nails, and the presence of hallux varus and multicystic dysplastic kidneys.
Thomas, Vincent; Mazard, Blandine; Garcia, Caroline; Lacan, Philippe; Gagnieu, Marie-Claude; Joly, Philippe
2013-09-23
Minucci et al. have proposed in 2010 a rapid, simple and cost-effective HRM method on the LightCycler 480® apparatus (Roche) for the determination of the 6/6, 6/7 and 7/7 genotypes of the (TA)n UGT1A1 promoter polymorphism. However, they have not studied the n=5 and n=8 alleles which can be quite frequent in sickle-cell disease patients. The aim of our study was to test this HRM protocol to all the 10 possible (TA)n UGT1A1 genotypes (i.e. 5/5, 5/6, 5/7, 5/8, 6/6, 6/7, 6/8, 7/7, 7/8 and 8/8) by using our SCD cohort of patients. All genotypes could be unambiguously identified except 6/7 and 6/8 which give a similar HRM profile. For those two genotypes, the differentiation necessitates either a direct Sanger sequencing or a second PCR protocol followed by a 3% agarose gel migration. For the (TA)n UGT1A1 promoter genotyping of African patients, each lab has to wonder what is the best way between (i) direct Sanger sequencing of all patients and (ii) HRM protocol for all patients followed by a complementary analysis to differentiate the 6/7 and 6/8 genotypes. © 2013. Published by Elsevier B.V. All rights reserved.
Lu, Chaoxia; Wu, Wei; Xiao, Jifang; Meng, Yan; Zhang, Shuyang; Zhang, Xue
2013-06-01
To detect pathogenic mutations in Marfan syndrome (MFS) using an Ion Torrent Personal Genome Machine (PGM) and to validate the result of targeted next-generation semiconductor sequencing for the diagnosis of genetic disorders. Peripheral blood samples were collected from three MFS patients and a normal control with informed consent. Genomic DNA was isolated by standard method and then subjected to targeted sequencing using an Ion Ampliseq(TM) Inherited Disease Panel. Three multiplex PCR reactions were carried out to amplify the coding exons of 328 genes including FBN1, TGFBR1 and TGFBR2. DNA fragments from different samples were ligated with barcoded sequencing adaptors. Template preparation and emulsion PCR, and Ion Sphere Particles enrichment were carried out using an Ion One Touch system. The ion sphere particles were sequenced on a 318 chip using the PGM platform. Data from the PGM runs were processed using an Ion Torrent Suite 3.2 software to generate sequence reads. After sequence alignment and extraction of SNPs and indels, all the variants were filtered against dbSNP137. DNA sequences were visualized with an Integrated Genomics Viewer. The most likely disease-causing variants were analyzed by Sanger sequencing. The PGM sequencing has yielded an output of 855.80 Mb, with a > 100 × median sequencing depth and a coverage of > 98% for the targeted regions in all the four samples. After data analysis and database filtering, one known missense mutation (p.E1811K) and two novel premature termination mutations (p.E2264X and p.L871FfsX23) in the FBN1 gene were identified in the three MFS patients. All mutations were verified by conventional Sanger sequencing. Pathogenic FBN1 mutations have been identified in all patients with MFS, indicating that the targeted next-generation sequencing on the PGM sequencers can be applied for accurate and high-throughput testing of genetic disorders.
Das Bhowmik, Aneek; Gupta, Neerja; Dalal, Ashwin; Kabra, Madhulika
In the present study we report on genetic analysis in a patient with developmental delay, truncal obesity and vision problem, to find the causative mutation. Whole exome sequencing was performed on genomic DNA extracted from whole blood of the patient which revealed a homozygous nonsense variant (c.2816T>A) in exon 8 of ALMS1 gene that results in a stop codon and premature truncation at codon 939 (p.L939Ter) of the protein. The mutation was confirmed by Sanger sequencing. Exome sequencing was helpful in establishing diagnosis of Alstrom syndrome in this patient. This case highlights the utility of exome sequencing in clinical practice. Copyright © 2016 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.
Snelling, Timothy J; Genç, Buğra; McKain, Nest; Watson, Mick; Waters, Sinéad M; Creevey, Christopher J; Wallace, R John
2014-01-01
Ruminal archaeomes of two mature sheep grazing in the Scottish uplands were analysed by different sequencing and analysis methods in order to compare the apparent archaeal communities. All methods revealed that the majority of methanogens belonged to the Methanobacteriales order containing the Methanobrevibacter, Methanosphaera and Methanobacteria genera. Sanger sequenced 1.3 kb 16S rRNA gene amplicons identified the main species of Methanobrevibacter present to be a SGMT Clade member Mbb. millerae (≥ 91% of OTUs); Methanosphaera comprised the remainder of the OTUs. The primers did not amplify ruminal Thermoplasmatales-related 16S rRNA genes. Illumina sequenced V6-V8 16S rRNA gene amplicons identified similar Methanobrevibacter spp. and Methanosphaera clades and also identified the Thermoplasmatales-related order as 13% of total archaea. Unusually, both methods concluded that Mbb. ruminantium and relatives from the same clade (RO) were almost absent. Sequences mapping to rumen 16S rRNA and mcrA gene references were extracted from Illumina metagenome data. Mapping of the metagenome data to 16S rRNA gene references produced taxonomic identification to Order level including 2-3% Thermoplasmatales, but was unable to discriminate to species level. Mapping of the metagenome data to mcrA gene references resolved 69% to unclassified Methanobacteriales. Only 30% of sequences were assigned to species level clades: of the sequences assigned to Methanobrevibacter, most mapped to SGMT (16%) and RO (10%) clades. The Sanger 16S amplicon and Illumina metagenome mcrA analyses showed similar species richness (Chao1 Index 19-35), while Illumina metagenome and amplicon 16S rRNA analysis gave lower richness estimates (10-18). The values of the Shannon Index were low in all methods, indicating low richness and uneven species distribution. Thus, although much information may be extracted from the other methods, Illumina amplicon sequencing of the V6-V8 16S rRNA gene would be the method of choice for studying rumen archaeal communities.
2017-01-01
Amplicon (targeted) sequencing by massively parallel sequencing (PCR-MPS) is a potential method for use in forensic DNA analyses. In this application, PCR-MPS may supplement or replace other instrumental analysis methods such as capillary electrophoresis and Sanger sequencing for STR and mitochondrial DNA typing, respectively. PCR-MPS also may enable the expansion of forensic DNA analysis methods to include new marker systems such as single nucleotide polymorphisms (SNPs) and insertion/deletions (indels) that currently are assayable using various instrumental analysis methods including microarray and quantitative PCR. Acceptance of PCR-MPS as a forensic method will depend in part upon developing protocols and criteria that define the limitations of a method, including a defensible analytical threshold or method detection limit. This paper describes an approach to establish objective analytical thresholds suitable for multiplexed PCR-MPS methods. A definition is proposed for PCR-MPS method background noise, and an analytical threshold based on background noise is described. PMID:28542338
Young, Brian; King, Jonathan L; Budowle, Bruce; Armogida, Luigi
2017-01-01
Amplicon (targeted) sequencing by massively parallel sequencing (PCR-MPS) is a potential method for use in forensic DNA analyses. In this application, PCR-MPS may supplement or replace other instrumental analysis methods such as capillary electrophoresis and Sanger sequencing for STR and mitochondrial DNA typing, respectively. PCR-MPS also may enable the expansion of forensic DNA analysis methods to include new marker systems such as single nucleotide polymorphisms (SNPs) and insertion/deletions (indels) that currently are assayable using various instrumental analysis methods including microarray and quantitative PCR. Acceptance of PCR-MPS as a forensic method will depend in part upon developing protocols and criteria that define the limitations of a method, including a defensible analytical threshold or method detection limit. This paper describes an approach to establish objective analytical thresholds suitable for multiplexed PCR-MPS methods. A definition is proposed for PCR-MPS method background noise, and an analytical threshold based on background noise is described.
USDA-ARS?s Scientific Manuscript database
Nine different regions totaling 9.7 Mb of the 4.02 Gb Aegilops tauschii genome were sequenced using the Sanger sequencing technology and compared with orthologous Brachypodium distachyon, Oryza sativa (rice) and Sorghum bicolor (sorghum) genomic sequences. The ancestral gene content in these regio...
Dr. Sanger's Apprentice: A Computer-Aided Instruction to Protein Sequencing.
ERIC Educational Resources Information Center
Schmidt, Thomas G.; Place, Allen R.
1985-01-01
Modeled after the program "Mastermind," this program teaches students the art of protein sequencing. The program (written in Turbo Pascal for the IBM PC, requiring 128K, a graphics adapter, and an 8070 mathematics coprocessor) generates a polypeptide whose sequence and length can be user-defined (for practice) or computer-generated (for…
The complete mitochondrial genome of the stonefly Dinocras cephalotes (Plecoptera, Perlidae).
Elbrecht, Vasco; Poettker, Lisa; John, Uwe; Leese, Florian
2015-06-01
The complete mitochondrial genome of the perlid stonefly Dinocras cephalotes (Curtis, 1827) was sequenced using a combined 454 and Sanger sequencing approach using the known sequence of Pteronarcys princeps Banks, 1907 (Pteronarcyidae), to identify homologous 454 reads. The genome is 15,666 bp in length and includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and a control region. Gene order resembles that of basal arthropods. The base composition of the genome is A (33.5%), T (29.0%), C (24.4%) and G (13.1%). This is the second published mitogenome for the order Plecoptera and will be useful in future phylogenetic analysis.
Li, Zhoufang; Liu, Guangjie; Tong, Yin; Zhang, Meng; Xu, Ying; Qin, Li; Wang, Zhanhui; Chen, Xiaoping; He, Jiankui
2015-01-01
Profiling immune repertoires by high throughput sequencing enhances our understanding of immune system complexity and immune-related diseases in humans. Previously, cloning and Sanger sequencing identified limited numbers of T cell receptor (TCR) nucleotide sequences in rhesus monkeys, thus their full immune repertoire is unknown. We applied multiplex PCR and Illumina high throughput sequencing to study the TCRβ of rhesus monkeys. We identified 1.26 million TCRβ sequences corresponding to 643,570 unique TCRβ sequences and 270,557 unique complementarity-determining region 3 (CDR3) gene sequences. Precise measurements of CDR3 length distribution, CDR3 amino acid distribution, length distribution of N nucleotide of junctional region, and TCRV and TCRJ gene usage preferences were performed. A comprehensive profile of rhesus monkey immune repertoire might aid human infectious disease studies using rhesus monkeys. PMID:25961410
Diagnostic Applications of Next Generation Sequencing in Immunogenetics and Molecular Oncology
Grumbt, Barbara; Eck, Sebastian H.; Hinrichsen, Tanja; Hirv, Kaimo
2013-01-01
Summary With the introduction of the next generation sequencing (NGS) technologies, remarkable new diagnostic applications have been established in daily routine. Implementation of NGS is challenging in clinical diagnostics, but definite advantages and new diagnostic possibilities make the switch to the technology inevitable. In addition to the higher sequencing capacity, clonal sequencing of single molecules, multiplexing of samples, higher diagnostic sensitivity, workflow miniaturization, and cost benefits are some of the valuable features of the technology. After the recent advances, NGS emerged as a proven alternative for classical Sanger sequencing in the typing of human leukocyte antigens (HLA). By virtue of the clonal amplification of single DNA molecules ambiguous typing results can be avoided. Simultaneously, a higher sample throughput can be achieved by tagging of DNA molecules with multiplex identifiers and pooling of PCR products before sequencing. In our experience, up to 380 samples can be typed for HLA-A, -B, and -DRB1 in high-resolution during every sequencing run. In molecular oncology, NGS shows a markedly increased sensitivity in comparison to the conventional Sanger sequencing and is developing to the standard diagnostic tool in detection of somatic mutations in cancer cells with great impact on personalized treatment of patients. PMID:23922545
Albitar, Adam; Ma, Wanlong; DeDios, Ivan; Estella, Jeffrey; Ahn, Inhye; Farooqui, Mohammed; Wiestner, Adrian; Albitar, Maher
2017-03-14
Patients with chronic lymphocytic leukemia (CLL) that develop resistance to Bruton tyrosine kinase (BTK) inhibitors are typically positive for mutations in BTK or phospholipase c gamma 2 (PLCγ2). We developed a high sensitivity (HS) assay utilizing wild-type blocking polymerase chain reaction achieved via bridged and locked nucleic acids. We used this high sensitivity assay in combination with Sanger sequencing and next generation sequencing (NGS) and tested cellular DNA and cell-free DNA (cfDNA) from patients with CLL treated with the BTK inhibitor, ibrutinib. We also tested ibrutinib-naïve patients with CLL. HS testing achieved 100x greater sensitivity than Sanger. HS Sanger sequencing was capable of detecting < 1 mutant allele in background of 1000 wild-type alleles (1:1000). Similar sensitivity was achieved with HS NGS. No BTK or PLCγ2 mutations were detected in any of the 44 ibrutinib-naïve CLL patients. We demonstrate that without the HS testing 56% of positive samples would have been missed for BTK and 85% of PLCγ2 would have been missed. With the use of HS, we were able to detect multiple mutant clones in the same sample in 37.5% of patients; most would have been missed without HS testing. We also demonstrate that with HS sequencing, plasma cfDNA is more reliable than cellular DNA in detecting mutations. Our studies indicate that wild-type blocking and HS sequencing is necessary for proper and early detection of BTK or PLCγ2 mutations in monitoring patients treated with BTK inhibitors. Furthermore, cfDNA from plasma is very reliable sample-type for testing.
Janecek, Elisabeth; Streichan, Sabine; Strube, Christina
2012-10-18
Rickettsioses are caused by pathogenic species of the genus Rickettsia and play an important role as emerging diseases. The bacteria are transmitted to mammal hosts including humans by arthropod vectors. Since detection, especially in tick vectors, is usually based on PCR with genus-specific primers to include different occurring Rickettsia species, subsequent species identification is mainly achieved by Sanger sequencing. In the present study a real-time pyrosequencing approach was established with the objective to differentiate between species occurring in German Ixodes ticks, which are R. helvetica, R. monacensis, R. massiliae, and R. felis. Tick material from a quantitative real-time PCR (qPCR) based study on Rickettsia-infections in I. ricinus allowed direct comparison of both sequencing techniques, Sanger and real-time pyrosequencing. A sequence stretch of rickettsial citrate synthase (gltA) gene was identified to contain divergent single nucleotide polymorphism (SNP) sites suitable for Rickettsia species differentiation. Positive control plasmids inserting the respective target sequence of each Rickettsia species of interest were constructed for initial establishment of the real-time pyrosequencing approach using Qiagen's PSQ 96MA Pyrosequencing System operating in a 96-well format. The approach included an initial amplification reaction followed by the actual pyrosequencing, which is traceable by pyrograms in real-time. Afterwards, real-time pyrosequencing was applied to 263 Ixodes tick samples already detected Rickettsia-positive in previous qPCR experiments. Establishment of real-time pyrosequencing using positive control plasmids resulted in accurate detection of all SNPs in all included Rickettsia species. The method was then applied to 263 Rickettsia-positive Ixodes ricinus samples, of which 153 (58.2%) could be identified for their species (151 R. helvetica and 2 R. monacensis) by previous custom Sanger sequencing. Real-time pyrosequencing identified all Sanger-determined ticks as well as 35 previously undifferentiated ticks resulting in a total number of 188 (71.5%) identified samples. Pyrosequencing sensitivity was found to be strongly dependent on gltA copy numbers in the reaction setup. Whereas less than 101 copies in the initial amplification reaction resulted in identification of 15.1% of the samples only, the percentage increased to 54.2% at 101-102 copies, to 95.6% at >102-103 copies and reached 100% samples identified for their Rickettsia species if more than 103 copies were present in the template. The established real-time pyrosequencing approach represents a reliable method for detection and differentiation of Rickettsia spp. present in I. ricinus diagnostic material and prevalence studies. Furthermore, the method proved to be faster, more cost-effective as well as more sensitive than custom Sanger sequencing with simultaneous high specificity.
New single-copy nuclear genes for scale insect systematics
USDA-ARS?s Scientific Manuscript database
Despite the advent of next-generation sequencing, the polymerase chain reaction (PCR) and Sanger sequencing remain useful tools for molecular identification and systematics. To date, molecular systematics of scale insects has been constrained by the paucity of loci that researchers have been able to...
Nanopore Kinetic Proofreading of DNA Sequences
NASA Astrophysics Data System (ADS)
Ling, Xinsheng Sean
The concept of DNA sequencing using the time dependence of the nanopore ionic current was proposed in 1996 by Kasianowicz, Brandin, Branton, and Deamer (KBBD). The KBBD concept has generated tremendous amount interests in recent decade. In this talk, I will review the current understanding of the DNA ``translocation'' dynamics and how it can be described by Schrodinger's 1915 paper on first-passage-time distribution function. Schrodinger's distribution function can be used to give a rigorous criterion for achieving nanopore DNA sequencing which turns out to be identical to that of gel electrophoresis used by Sanger in the first-generation Sanger method. A nanopore DNA sequencing technology also requires discrimination of bases with high accuracies. I will describe a solid-state nanopore sandwich structure that can function as a proofreading device capable of discriminating between correct and incorrect hybridization probes with an accuracy rivaling that of high-fidelity DNA polymerases. The latest results from Nanjing will be presented. This work is supported by China 1000-Talent Program at Southeast University, Nanjing, China.
Haemonchus contortus: Genome Structure, Organization and Comparative Genomics.
Laing, R; Martinelli, A; Tracey, A; Holroyd, N; Gilleard, J S; Cotton, J A
2016-01-01
One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. Copyright © 2016 Elsevier Ltd. All rights reserved.
Mutations in SURF1 are important genetic causes of Leigh syndrome in Slovak patients.
Danis, Daniel; Brennerova, Katarina; Skopkova, Martina; Kurdiova, Timea; Ukropec, Jozef; Stanik, Juraj; Kolnikova, Miriam; Gasperikova, Daniela
2018-04-01
Leigh syndrome is a progressive early onset neurodegenerative disease typically presenting with psychomotor regression, signs of brainstem and/or basal ganglia disease, lactic acidosis, and characteristic magnetic resonance imaging findings. At molecular level, deficiency of respiratory complexes and/or pyruvate dehydrogenase complex is usually observed. Nuclear gene SURF1 encodes an assembly factor for cytochrome c-oxidase complex of the respiratory chain and autosomal recessive mutations in SURF1 are one of the most frequent causes of cytochrome c-oxidase-related Leigh syndrome cases. Here, we aimed to elucidate the genetic basis of Leigh syndrome in three Slovak families. Three probands presenting with Leigh syndrome were selected for DNA analysis. The first proband, presenting with atypical LS onset without abnormal basal ganglia magnetic resonance imaging findings, was analyzed with whole exome sequencing. In the two remaining probands, SURF1 was screened by Sanger sequencing. Four different heterozygous mutations were identified in SURF1: c.312_321delinsAT:p.(Pro104Profs*1), c.588+1G>A, c.823_833+7del:p. (?) and c.845_846del:p.(Ser282Cysfs*9). All the mutations are predicted to have a loss-of-function effect. We identified disease-causing mutations in all three probands, which points to the important role of SURF1 gene in etiology of Leigh syndrome in Slovakia. Our data showed that patients with atypical Leigh syndrome phenotype without lesions in basal ganglia may benefit from the whole exome sequencing method. In the case of probands presenting the typical phenotype, Sanger sequencing of the SURF1 gene seems to be an effective method of DNA analysis.
Aziz, Nazneen; Zhao, Qin; Bry, Lynn; Driscoll, Denise K; Funke, Birgit; Gibson, Jane S; Grody, Wayne W; Hegde, Madhuri R; Hoeltge, Gerald A; Leonard, Debra G B; Merker, Jason D; Nagarajan, Rakesh; Palicki, Linda A; Robetorye, Ryan S; Schrijver, Iris; Weck, Karen E; Voelkerding, Karl V
2015-04-01
The higher throughput and lower per-base cost of next-generation sequencing (NGS) as compared to Sanger sequencing has led to its rapid adoption in clinical testing. The number of laboratories offering NGS-based tests has also grown considerably in the past few years, despite the fact that specific Clinical Laboratory Improvement Amendments of 1988/College of American Pathologists (CAP) laboratory standards had not yet been developed to regulate this technology. To develop a checklist for clinical testing using NGS technology that sets standards for the analytic wet bench process and for bioinformatics or "dry bench" analyses. As NGS-based clinical tests are new to diagnostic testing and are of much greater complexity than traditional Sanger sequencing-based tests, there is an urgent need to develop new regulatory standards for laboratories offering these tests. To develop the necessary regulatory framework for NGS and to facilitate appropriate adoption of this technology for clinical testing, CAP formed a committee in 2011, the NGS Work Group, to deliberate upon the contents to be included in the checklist. Results . -A total of 18 laboratory accreditation checklist requirements for the analytic wet bench process and bioinformatics analysis processes have been included within CAP's molecular pathology checklist (MOL). This report describes the important issues considered by the CAP committee during the development of the new checklist requirements, which address documentation, validation, quality assurance, confirmatory testing, exception logs, monitoring of upgrades, variant interpretation and reporting, incidental findings, data storage, version traceability, and data transfer confidentiality.
De Novo Paternal FBN1 Mutation Detected in Embryos Before Implantation.
Wang, Shuling; Niu, Ziru; Wang, Hui; Ma, Minyue; Zhang, Wei; Fang Wang, Shu; Wang, Jun; Yan, Hong; Liu, Yifan; Duan, Na; Zhang, Xiandong; Yao, Yuanqing
2017-06-26
BACKGROUND Marfan syndrome (MFS) is an autosomal dominant disease caused by mutations in the Fibrillin (FBN)1 gene and characterized by disorders in the cardiovascular, skeletal, and visual systems. The diversity of mutations and phenotypic heterogeneity of MFS make prenatal molecular diagnoses difficult. In this study, we used pre-implantation genetic diagnosis (PGD) to identify the pathogenic mutation in a male patient with MFS and to determine whether his offspring would be free of the disease. MATERIAL AND METHODS The history and pedigree of the proband were analyzed. Mutation analysis was performed on the couple and immediate family members. The couple chose IVF treatment and 4 blastocysts were biopsied. PGD was carried out by targeted high-throughput sequencing of the FBN1 gene in the embryos, along with single-nucleotide polymorphism haplotyping. Sanger sequencing was used to confirm the causative mutation. RESULTS c.2647T>C (p.Trp883Arg) was identified as the de novo likely pathogenic mutation in the proband. Whole-genome amplification and sequencing of the 3 embryos revealed that they did not carry the mutation, and 1 blastocyst was transferred back to the uterus. The amniocentesis test result analyzed by Sanger sequencing confirmed the PGD. A premature but healthy infant free of heart malformations was born. CONCLUSIONS The de novo mutation c.2647T>C (p.Trp883Arg) in FBN1 was identified in a Chinese patient with MFS. Embryos without the mutation were identified by PGD and resulted in a successful pregnancy.
Liang, Chanjuan; van Dijk, Jeroen P; Scholtens, Ingrid M J; Staats, Martijn; Prins, Theo W; Voorhuijzen, Marleen M; da Silva, Andrea M; Arisi, Ana Carolina Maisonnave; den Dunnen, Johan T; Kok, Esther J
2014-04-01
The growing number of biotech crops with novel genetic elements increasingly complicates the detection of genetically modified organisms (GMOs) in food and feed samples using conventional screening methods. Unauthorized GMOs (UGMOs) in food and feed are currently identified through combining GMO element screening with sequencing the DNA flanking these elements. In this study, a specific and sensitive qPCR assay was developed for vip3A element detection based on the vip3Aa20 coding sequences of the recently marketed MIR162 maize and COT102 cotton. Furthermore, SiteFinding-PCR in combination with Sanger, Illumina or Pacific BioSciences (PacBio) sequencing was performed targeting the flanking DNA of the vip3Aa20 element in MIR162. De novo assembly and Basic Local Alignment Search Tool searches were used to mimic UGMO identification. PacBio data resulted in relatively long contigs in the upstream (1,326 nucleotides (nt); 95 % identity) and downstream (1,135 nt; 92 % identity) regions, whereas Illumina data resulted in two smaller contigs of 858 and 1,038 nt with higher sequence identity (>99 % identity). Both approaches outperformed Sanger sequencing, underlining the potential for next-generation sequencing in UGMO identification.
Urabe, N; Ishii, Y; Hyodo, Y; Aoki, K; Yoshizawa, S; Saga, T; Murayama, S Y; Sakai, K; Homma, S; Tateda, K
2016-04-01
Between 18 November and 3 December 2011, five renal transplant patients at the Department of Nephrology, Toho University Omori Medical Centre, Tokyo, were diagnosed with Pneumocystis pneumonia (PCP). We used molecular epidemiologic methods to determine whether the patients were infected with the same strain of Pneumocystis jirovecii. DNA extracted from the residual bronchoalveolar lavage fluid from the five outbreak cases and from another 20 cases of PCP between 2007 and 2014 were used for multilocus sequence typing to compare the genetic similarity of the P. jirovecii. DNA base sequencing by the Sanger method showed some regions where two bases overlapped and could not be defined. A next-generation sequencer was used to analyse the types and ratios of these overlapping bases. DNA base sequences of P. jirovecii in the bronchoalveolar lavage fluid from four of the five PCP patients in the 2011 outbreak and from another two renal transplant patients who developed PCP in 2013 were highly homologous. The Sanger method revealed 14 genomic regions where two differing DNA bases overlapped and could not be identified. Analyses of the overlapping bases by a next-generation sequencer revealed that the differing types of base were present in almost identical ratios. There is a strong possibility that the PCP outbreak at the Toho University Omori Medical Centre was caused by the same strain of P. jirovecii. Two different types of base present in some regions may be due to P. jirovecii's being a diploid species. Copyright © 2015 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Sutton, Lesley-Ann; Ljungström, Viktor; Mansouri, Larry; Young, Emma; Cortese, Diego; Navrkalova, Veronika; Malcikova, Jitka; Muggen, Alice F; Trbusek, Martin; Panagiotidis, Panagiotis; Davi, Frederic; Belessi, Chrysoula; Langerak, Anton W; Ghia, Paolo; Pospisilova, Sarka; Stamatopoulos, Kostas; Rosenquist, Richard
2015-03-01
Next-generation sequencing has revealed novel recurrent mutations in chronic lymphocytic leukemia, particularly in patients with aggressive disease. Here, we explored targeted re-sequencing as a novel strategy to assess the mutation status of genes with prognostic potential. To this end, we utilized HaloPlex targeted enrichment technology and designed a panel including nine genes: ATM, BIRC3, MYD88, NOTCH1, SF3B1 and TP53, which have been linked to the prognosis of chronic lymphocytic leukemia, and KLHL6, POT1 and XPO1, which are less characterized but were found to be recurrently mutated in various sequencing studies. A total of 188 chronic lymphocytic leukemia patients with poor prognostic features (unmutated IGHV, n=137; IGHV3-21 subset #2, n=51) were sequenced on the HiSeq 2000 and data were analyzed using well-established bioinformatics tools. Using a conservative cutoff of 10% for the mutant allele, we found that 114/180 (63%) patients carried at least one mutation, with mutations in ATM, BIRC3, NOTCH1, SF3B1 and TP53 accounting for 149/177 (84%) of all mutations. We selected 155 mutations for Sanger validation (variant allele frequency, 10-99%) and 93% (144/155) of mutations were confirmed; notably, all 11 discordant variants had a variant allele frequency between 11-27%, hence at the detection limit of conventional Sanger sequencing. Technical precision was assessed by repeating the entire HaloPlex procedure for 63 patients; concordance was found for 77/82 (94%) mutations. In summary, this study demonstrates that targeted next-generation sequencing is an accurate and reproducible technique potentially suitable for routine screening, eventually as a stand-alone test without the need for confirmation by Sanger sequencing. Copyright© Ferrata Storti Foundation.
The quest for rare variants: pooled multiplexed next generation sequencing in plants.
Marroni, Fabio; Pinosio, Sara; Morgante, Michele
2012-01-01
Next generation sequencing (NGS) instruments produce an unprecedented amount of sequence data at contained costs. This gives researchers the possibility of designing studies with adequate power to identify rare variants at a fraction of the economic and labor resources required by individual Sanger sequencing. As of today, few research groups working in plant sciences have exploited this potentiality, showing that pooled NGS provides results in excellent agreement with those obtained by individual Sanger sequencing. The aim of this review is to convey to the reader the general ideas underlying the use of pooled NGS for the identification of rare variants. To facilitate a thorough understanding of the possibilities of the method, we will explain in detail the possible experimental and analytical approaches and discuss their advantages and disadvantages. We will show that information on allele frequency obtained by pooled NGS can be used to accurately compute basic population genetics indexes such as allele frequency, nucleotide diversity, and Tajima's D. Finally, we will discuss applications and future perspectives of the multiplexed NGS approach.
Ashrafi, Hamid; Hill, Theresa; Stoffel, Kevin; Kozik, Alexander; Yao, Jiqiang; Chin-Wo, Sebastian Reyes; Van Deynze, Allen
2012-10-30
Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80-120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project.
Fantin, Yuri S.; Neverov, Alexey D.; Favorov, Alexander V.; Alvarez-Figueroa, Maria V.; Braslavskaya, Svetlana I.; Gordukova, Maria A.; Karandashova, Inga V.; Kuleshov, Konstantin V.; Myznikova, Anna I.; Polishchuk, Maya S.; Reshetov, Denis A.; Voiciehovskaya, Yana A.; Mironov, Andrei A.; Chulanov, Vladimir P.
2013-01-01
Sanger sequencing is a common method of reading DNA sequences. It is less expensive than high-throughput methods, and it is appropriate for numerous applications including molecular diagnostics. However, sequencing mixtures of similar DNA of pathogens with this method is challenging. This is important because most clinical samples contain such mixtures, rather than pure single strains. The traditional solution is to sequence selected clones of PCR products, a complicated, time-consuming, and expensive procedure. Here, we propose the base-calling with vocabulary (BCV) method that computationally deciphers Sanger chromatograms obtained from mixed DNA samples. The inputs to the BCV algorithm are a chromatogram and a dictionary of sequences that are similar to those we expect to obtain. We apply the base-calling function on a test dataset of chromatograms without ambiguous positions, as well as one with 3–14% sequence degeneracy. Furthermore, we use BCV to assemble a consensus sequence for an HIV genome fragment in a sample containing a mixture of viral DNA variants and to determine the positions of the indels. Finally, we detect drug-resistant Mycobacterium tuberculosis strains carrying frameshift mutations mixed with wild-type bacteria in the pncA gene, and roughly characterize bacterial communities in clinical samples by direct 16S rRNA sequencing. PMID:23382983
Fonseca, Luiz Henrique M; Lohmann, Lúcia G
2018-06-01
Combining high-throughput sequencing data with amplicon sequences allows the reconstruction of robust phylogenies based on comprehensive sampling of characters and taxa. Here, we combine Next Generation Sequencing (NGS) and Sanger sequencing data to infer the phylogeny of the "Adenocalymma-Neojobertia" clade (Bignonieae, Bignoniaceae), a diverse lineage of Neotropical plants, using Maximum Likelihood and Bayesian approaches. We used NGS to obtain complete or nearly-complete plastomes of members of this clade, leading to a final dataset with 54 individuals, representing 44 members of ingroup and 10 outgroups. In addition, we obtained Sanger sequences of two plastid markers (ndhF and rpl32-trnL) for 44 individuals (43 ingroup and 1 outgroup) and the nuclear PepC for 64 individuals (63 ingroup and 1 outgroup). Our final dataset includes 87 individuals of members of the "Adenocalymma-Neojobertia" clade, representing 66 species (ca. 90% of the diversity), plus 11 outgroups. Plastid and nuclear datasets recovered congruent topologies and were combined. The combined analysis recovered a monophyletic "Adenocalymma-Neojobertia" clade and a paraphyletic Adenocalymma that also contained a monophyletic Neojobertia plus Pleonotoma albiflora. Relationships are strongly supported in all analyses, with most lineages within the "Adenocalymma-Neojobertia" clade receiving maximum posterior probabilities. Ancestral character state reconstructions using Bayesian approaches identified six morphological synapomorphies of clades namely, prophyll type, petiole and petiolule articulation, tendril ramification, inflorescence ramification, calyx shape, and fruit wings. Other characters such as habit, calyx cupular trichomes, corolla color, and corolla shape evolved multiple times. These characters are putatively related with the clade diversification and can be further explored in diversification studies. Copyright © 2018 Elsevier Inc. All rights reserved.
A novel PTCH1 mutation underlies non-syndromic cleft lip and/or palate in a Han Chinese family.
Zhao, Huaxiang; Zhong, Wenjie; Leng, Chuntao; Zhang, Jieni; Zhang, Mengqi; Huang, Wenbin; Zhang, Yunfan; Li, Weiran; Jia, Peizeng; Lin, Jiuxiang; Maimaitili, Gulibaha; Chen, Feng
2018-06-16
Cleft lip and/or palate (CL/P) is the most common craniofacial congenital disease, and it has a complex aetiology. This study aimed to identify the causative gene mutation of a Han Chinese family with CL/P. Whole exome sequencing was conducted on the proband and her mother, who exhibited the same phenotype. A Mendelian dominant inheritance model, allele frequency, mutation regions, functional prediction and literature review were used to screen and filter the variants. The candidate was validated by Sanger sequencing. Conservation analysis and homology modelling were conducted. A heterozygous missense mutation c.1175C>T in the PTCH1 gene predicting p.Ala392Val was identified. This variant has not been reported and was predicted to be deleterious. Sanger sequencing verified the variant and the dominant inheritance model in the family. The missense alteration affects an amino acid that is evolutionarily conserved in the first extracellular loop of the PTCH1 protein. The local structure of the mutant protein was significantly altered according to homology modelling. Our findings suggest that c.1175C>T in PTCH1 (NM_000264) may be the causative mutation of this pedigree. Our results add to the evidence that PTCH1 variants play a role in the pathogenesis of orofacial clefts. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Berger, C; Berger, B; Parson, W
2012-01-01
In recent years, evidence from domestic dogs has increasingly been analyzed by forensic DNA testing. Especially, canine hairs have proved most suitable and practical due to the high rate of hair transfer occurring between dogs and humans. Starting with the description of a contamination-free sample handling procedure, we give a detailed workflow for sequencing hypervariable segments (HVS) of the mtDNA control region from canine evidence. After the hair material is lysed and the DNA extracted by Phenol/Chloroform, the amplification and sequencing strategy comprises the HVS I and II of the canine control region and is optimized for DNA of medium-to-low quality and quantity. The sequencing procedure is based on the Sanger Big-dye deoxy-terminator method and the separation of the sequencing reaction products is performed on a conventional multicolor fluorescence detection capillary electrophoresis platform. Finally, software-aided base calling and sequence interpretation are addressed exemplarily.
Bu, Rong; Siraj, Abdul K; Al-Obaisi, Khadija A S; Beg, Shaham; Al Hazmi, Mohsen; Ajarim, Dahish; Tulbah, Asma; Al-Dayel, Fouad; Al-Kuraya, Khawla S
2016-09-01
Ethnic differences of breast cancer genomics have prompted us to investigate the spectra of BRCA1 and BRCA2 mutations in different populations. The prevalence and effect of BRCA 1 and BRCA 2 mutations in Middle Eastern population is not fully explored. To characterize the prevalence of BRCA mutations in Middle Eastern breast cancer patients, BRCA mutation screening was performed in 818 unselected breast cancer patients using Capture and/or Sanger sequencing. 19 short tandem repeat (STR) markers were used for founder mutation analysis. In our study, nine different types of deleterious mutation were identified in 28 (3.4%) cases, 25 (89.3%) cases in BRCA 1 and 3 (10.7%) cases in BRCA 2. Seven recurrent mutations identified accounted for 92.9% (26/28) of all the mutant cases. Haplotype analysis was performed to confirm c.1140 dupG and c.4136_4137delCT mutations as novel putative founder mutation, accounting for 46.4% (13/28) of all BRCA mutant cases and 1.6% (13/818) of all the breast cancer cases, respectively. Moreover, BRCA 1 mutation was significantly associated with BRCA 1 protein expression loss (p = 0.0005). Our finding revealed that a substantial number of BRCA mutations were identified in clinically high risk breast cancer from Middle East region. Identification of the mutation spectrum, prevalence and founder effect in Middle Eastern population facilitates genetic counseling, risk assessment and development of cost-effective screening strategy. © 2016 UICC.
Nanopore Technology: A Simple, Inexpensive, Futuristic Technology for DNA Sequencing.
Gupta, P D
2016-10-01
In health care, importance of DNA sequencing has been fully established. Sanger's Capillary Electrophoresis DNA sequencing methodology is time consuming, cumbersome, hence become more expensive. Lately, because of its versatility DNA sequencing became house hold name, and therefore, there is an urgent need of simple, fast, inexpensive, DNA sequencing technology. In the beginning of this century efforts were made, and Nanopore DNA sequencing technology was developed; still it is infancy, nevertheless, it is the futuristic technology.
Kotásková, Iva; Mališová, Barbora; Obručová, Hana; Holá, Veronika; Peroutková, Tereza; Růžička, Filip; Freiberger, Tomáš
2017-01-01
Complex samples are a challenge for sequencing-based broad-range diagnostics. We analysed 19 urinary catheter, ureteral Double-J catheter, and urine samples using 3 methodological approaches. Out of the total 84 operational taxonomic units, 37, 61, and 88% were identified by culture, PCR-DGGE-SS (PCR denaturing gradient gel electrophoresis followed by Sanger sequencing), and PCR-DGGE-RM (PCR- DGGE combined with software chromatogram separation by RipSeq Mixed tool), respectively. The latter approach was shown to be an efficient tool to complement culture in complex sample assessment. © 2017 S. Karger AG, Basel.
Transcriptome assembly, gene annotation and tissue gene expression atlas of the rainbow trout
USDA-ARS?s Scientific Manuscript database
Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complimented by transcriptome information that will enhance genome assembly and annotation. Previously, we reported a transcriptome reference sequence using a 19X coverage of Sanger and 454-pyrosequencing dat...
Whole-exome sequencing identifies USH2A mutations in a pseudo-dominant Usher syndrome family.
Zheng, Sui-Lian; Zhang, Hong-Liang; Lin, Zhen-Lang; Kang, Qian-Yan
2015-10-01
Usher syndrome (USH) is an autosomal recessive (AR) multi-sensory degenerative disorder leading to deaf-blindness. USH is clinically subdivided into three subclasses, and 10 genes have been identified thus far. Clinical and genetic heterogeneities in USH make a precise diagnosis difficult. A dominant‑like USH family in successive generations was identified, and the present study aimed to determine the genetic predisposition of this family. Whole‑exome sequencing was performed in two affected patients and an unaffected relative. Systematic data were analyzed by bioinformatic analysis to remove the candidate mutations via step‑wise filtering. Direct Sanger sequencing and co‑segregation analysis were performed in the pedigree. One novel and two known mutations in the USH2A gene were identified, and were further confirmed by direct sequencing and co‑segregation analysis. The affected mother carried compound mutations in the USH2A gene, while the unaffected father carried a heterozygous mutation. The present study demonstrates that whole‑exome sequencing is a robust approach for the molecular diagnosis of disorders with high levels of genetic heterogeneity.
Unlocking Short Read Sequencing for Metagenomics
Rodrigue, Sébastien; Materna, Arne C.; Timberlake, Sonia C.; ...
2010-07-28
We describe an experimental and computational pipeline yielding millions of reads that can exceed 200 bp with quality scores approaching that of traditional Sanger sequencing. The method combines an automatable gel-less library construction step with paired-end sequencing on a short-read instrument. With appropriately sized library inserts, mate-pair sequences can overlap, and we describe the SHERA software package that joins them to form a longer composite read.
Abdelrahman, Tamer; Hughes, Joseph; Main, Janice; McLauchlan, John; Thursz, Mark; Thomson, Emma
2015-01-01
High rates of sexually transmitted infection and reinfection with hepatitis C virus (HCV) have recently been reported in human immunodeficiency virus (HIV)-infected men who have sex with men and reinfection has also been described in monoinfected injecting drug users. The diagnosis of reinfection has traditionally been based on direct Sanger sequencing of samples pre- and posttreatment, but not on more sensitive deep sequencing techniques. We studied viral quasispecies dynamics in patients who failed standard of care therapy in a high-risk HIV-infected cohort of patients with early HCV infection to determine whether treatment failure was associated with reinfection or recrudescence of preexisting infection. Paired sequences (pre- and posttreatment) were analyzed. The HCV E2 hypervariable region-1 was amplified using nested reverse-transcription polymerase chain reaction (RT-PCR) with indexed genotype-specific primers and the same products were sequenced using both Sanger and 454 pyrosequencing approaches. Of 99 HIV-infected patients with acute HCV treated with 24-48 weeks of pegylated interferon alpha and ribavirin, 15 failed to achieve a sustained virological response (six relapsed, six had a null response, and three had a partial response). Using direct sequencing, 10/15 patients (66%) had evidence of a previously undetected strain posttreatment; in many studies, this is interpreted as reinfection. However, pyrosequencing revealed that 15/15 (100%) of patients had evidence of persisting infection; 6/15 (40%) patients had evidence of a previously undetected variant present in the posttreatment sample in addition to a variant that was detected at baseline. This could represent superinfection or a limitation of the sensitivity of pyrosequencing. In this high-risk group, the emergence of new viral strains following treatment failure is most commonly associated with emerging dominance of preexisting minority variants rather than reinfection. Superinfection may occur in this cohort but reinfection is overestimated by Sanger sequencing. © 2014 The Authors. Hepatology published by Wiley on behalf of the American Association for the Study of Liver Diseases.
Xu, Peiwen; Zou, Yang; Li, Jie; Huang, Sexin; Gao, Ming; Kang, Ranran; Xie, Hongqiang; Wang, Lijuan; Yan, Junhao; Gao, Yuan
2018-04-10
To assess the value of droplet digital PCR (ddPCR) for non-invasive prenatal diagnosis of single gene disease in two families. Paternal mutation in cell-free DNA derived from the maternal blood and amniotic fluid DNA was detected by ddPCR. Suspected mutation in the amniotic fluid DNA was verified with Sanger sequencing. The result of ddPCR and Sanger sequencing indicated that the fetuses have carried pathogenic mutations from the paternal side in both families. Droplet digital PCR can accurately detect paternal mutation carried by the fetus, and it is sensitive and reliable for analyzing trace samples. This method may be applied for the diagnosis of single gene diseases caused by paternal mutation using peripheral blood sample derived from the mother.
Wang, Na; Wang, Chuan; Chen, Xuechao; Sheng, Donglai; Fu, Xi’an; See, Kelvin; Foo, Jia Nee; Low, Huiqi; Liany, Herty; Irwan, Ishak Darryl; Liu, Jian; Yang, Baoqi; Chen, Mingfei; Yu, Yongxiang; Yu, Gongqi; Niu, Guiye; You, Jiabao; Zhou, Yan; Ma, Shanshan; Wang, Ting; Yan, Xiaoxiao; Goh, Boon Kee; Common, John E. A.; Lane, Birgitte E.; Sun, Yonghu; Zhou, Guizhi; Lu, Xianmei; Wang, Zhenhua; Tian, Hongqing; Cao, Yuanhua; Chen, Shumin; Liu, Qiji; Liu, Jianjun; Zhang, Furen
2014-01-01
Background As a genetic disorder of abnormal pigmentation, the molecular basis of dyschromatosis universalis hereditaria (DUH) had remained unclear until recently when ABCB6 was reported as a causative gene of DUH. Methodology We performed genome-wide linkage scan using Illumina Human 660W-Quad BeadChip and exome sequencing analyses using Agilent SureSelect Human All Exon Kits in a multiplex Chinese DUH family to identify the pathogenic mutations and verified the candidate mutations using Sanger sequencing. Quantitative RT-PCR and Immunohistochemistry was performed to verify the expression of the pathogenic gene, Zebrafish was also used to confirm the functional role of ABCB6 in melanocytes and pigmentation. Results Genome-wide linkage (assuming autosomal dominant inheritance mode) and exome sequencing analyses identified ABCB6 as the disease candidate gene by discovering a coding mutation (c.1358C>T; p.Ala453Val) that co-segregates with the disease phenotype. Further mutation analysis of ABCB6 in four other DUH families and two sporadic cases by Sanger sequencing confirmed the mutation (c.1358C>T; p.Ala453Val) and discovered a second, co-segregating coding mutation (c.964A>C; p.Ser322Lys) in one of the four families. Both mutations were heterozygous in DUH patients and not present in the 1000 Genome Project and dbSNP database as well as 1,516 unrelated Chinese healthy controls. Expression analysis in human skin and mutagenesis interrogation in zebrafish confirmed the functional role of ABCB6 in melanocytes and pigmentation. Given the involvement of ABCB6 mutations in coloboma, we performed ophthalmological examination of the DUH carriers of ABCB6 mutations and found ocular abnormalities in them. Conclusion Our study has advanced our understanding of DUH pathogenesis and revealed the shared pathological mechanism between pigmentary DUH and ocular coloboma. PMID:24498303
Xin, Min; Zhang, Peipei; Liu, Wenwen; Ren, Yingdang; Cao, Mengji; Wang, Xifeng
2017-10-01
The complete nucleotide sequence of a novel positive single-stranded (+ss) RNA virus, tentatively named watermelon virus A (WVA), was determined using a combination of three methods: RNA sequencing, small RNA sequencing, and Sanger sequencing. The full genome of WVA is comprised of 8,372 nucleotides (nt), excluding the poly (A) tail, and contains four open reading frames (ORFs). The largest ORF, ORF1 encodes a putative replication-associated polyprotein (RP) with three conserved domains. ORF2 and ORF4 encode a movement protein (MP) and coat protein (CP), respectively. The putative product encoded by ORF3, of an estimated molecular mass of 25 kDa, has no significant similarity with other proteins. Identity and phylogenetic analysis indicate that WVA is a new virus, closely related to members of the family Betaflexiviridae. However, the final taxonomic allocation of WVA within the family is yet to be determined.
Bai, D Y; Zhang, H P; Zhong, S; Suo, W H; Gao, D H; Ding, Y; Tu, J H
2016-12-23
Objective: To investigate the clinical application value of combined detection of ALK fusion gene and c-ros oncogene 1 receptor tyrosine kinase (ROS1) fusion gene in non-small cell lung cancer (NSCLC) using real-time fluorescent PCR. Methods: A kit for combined detection of ALK fusion gene and ROS1 fusion gene based on fluorescent PCR was used to simultaneously detect the two fusion genes in 302 cases of NSCLC specimens. The results were validated through Sanger sequencing. The consistency of the two detection methods was analyzed. Results: All 302 cases of NSCLC specimens were successfully analyzed through fluorescent PCR (302/302). 12 cases (4.0%) were found to contain ALK fusion gene, including 3 cases with ALK-M1, 3 with ALK-M2, 3 with ALK-M3, 1 with ALK-M4, and 2 with ALK-M6 fusion gene.12 cases (4.0%) were found to contain ROS1 fusion gene, including 1 case with ROS1-M7, 8 cases with ROS1-M8, 1 case with ROS1-M12, 1 case with ROS1-M14, and 1 case with double-positive ROS1-M3 and ROS1-M8 fusion genes. The total detection rate of ALK fusion gene and ROS1 fusion gene was 7.9% (24/302) and 278 cases showed to be negative for ALK fusion gene and ROS1 fusion gene. The successful detection rates for Sanger DNA sequencing were also 100%. The positive, negative and total coincidence rates obtained by real-time fluorescent PCR and by Sanger DNA sequencing were all 100%. Conclusions: The results of Sanger DNA sequencing demonstrate that the real-time fluorescent PCR assay is equally effective in detecting ALK and ROS1 fusion genes in NSCLC tissues. Furthermore, real-time fluorescent PCR assay can be used to detect trace ALK and ROS1 fusion gene simultaneously in tiny samples, and can save time and avoid repeated sampling. It is worthy of recommendation as a rapid and reliable detection technique.
The tomato genome sequence provides insight into fleshy fruit evolution
USDA-ARS?s Scientific Manuscript database
The genome of the inbred tomato cultivar ‘Heinz 1706’ was sequenced and assembled using a combination of Sanger and “next generation” technologies. The predicted genome size is ~900 Mb, consistent with prior estimates, of which 760 Mb were assembled in 91 scaffolds aligned to the 12 tomato chromosom...
USDA-ARS?s Scientific Manuscript database
Marek’s disease virus (MDV-1) is a cell-associated alphaherpesvirus that induces rapid-onset T-cell lymphomas in poultry. The genomes of 6 strains have been sequenced using both Sanger didoxy sequencing and 454 Life Science pyrosequencing. These genomes largely represent cell culture adapted strains...
Ma, Yalin; Xiao, Yun; Zhang, Fengguo; Han, Yuechen; Li, Jianfeng; Xu, Lei; Bai, Xiaohui; Wang, Haibo
2016-04-01
Mutations in MYO7A gene have been reported to be associated with Usher Syndrome type 1B (USH1B) and nonsyndromic hearing loss (DFNB2, DFNA11). Most mutations in MYO7A gene caused USH1B, whereas only a few reported mutations led to DFNB2 and DFNA11. The current study was designed to investigate the mutations among a Chinese family with autosomal recessive hearing loss. In this study, we present the clinical, genetic and molecular characteristics of a Chinese family. Targeted capture of 127 known deafness genes and next-generation sequencing were employed to study the genetic causes of two siblings in the Chinese family. Sanger sequencing was employed to examine those variant mutations in the members of this family and other ethnicity-matched controls. We identified the novel compound heterozygous mutant alleles of MYO7A gene: a novel missense mutation c.3671C>A (p.A1224D) and a reported insert mutation c.390_391insC (p.P131PfsX9). Variants were further confirmed by Sanger sequencing. These two compound heterozygous variants were co-segregated with autosomal recessive hearing loss phenotype. The gene mutation analysis and protein sequence alignment further supported that the novel compound heterozygous mutations were pathogenic. The novel compound heterozygous mutations (c.3671C>A and c.390_391insC) in MYO7A gene identified in this study were responsible for the autosomal recessive sensorineural hearing loss of this Chinese family. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Gao, M L; Zhong, X M; Ma, X; Ning, H J; Zhu, D; Zou, J Z
2016-06-02
To make genetic diagnosis of Alagille syndrome (ALGS) patients using target gene sequence capture and next generation sequencing technology. Target gene sequence capture and next generation sequencing were used to detect ALGS gene of 4 patients. They were hospitalized at the Affiliated Hospital, Capital Institute of Pediatrics between January 2014 and December 2015, referred to clinical diagnosis of ALGS typical and atypical respectively in 2 cases. Blood samples were collected from patients and their parents and genomic DNA was extracted from lymphocytes. Target gene sequence capture and next generation sequencing was detected. Sanger sequencing was used to confirm the results of the patients and their parents. Cholestasis, heart defects, inverted triangular face and butterfly vertebrae were presented as main clinical features in 4 male patients. The first hospital visiting ages ranged from 3 months and 14 days to 3 years and 1 month. The age of onset ranged from 3 days to 42 days (median 23 days). According to the clinical diagnostic criteria of ALGS, patient 1 and patient 2 were considered as typical ALGS. The other 2 patients were considered as atypical ALGS. Four Jagged 1(JAG1) pathogenic mutations were detected. Three different missense mutations were detected in patient 1 to patient 3 with ALGS(c.839C>T(p.W280X), c. 703G>A(p.R235X), c. 1720C>T(p.V574M)). The JAG1 mutation of patient 3 was first reported. Patient 4 had one novel insertion mutation (c.1779_1780insA(p.Ile594AsnfsTer23)). Parental analysis verified that the JAG1 missense mutation of 3 patients were de novo. The results of sanger sequencing was consistent with the results of the next generation sequencing. Target gene sequence capture combined with next generation sequencing can detect two pathogenic genes in ALGS and test genes of other related diseases in infantile cholestatic diseases simultaneously and presents a high throughput, high efficiency and low cost. It may provide molecular diagnosis and treatment for clinicians with good clinical application prospects.
Onuț-Brännström, Ioana; Benjamin, Mitchell; Scofield, Douglas G; Heiðmarsson, Starri; Andersson, Martin G I; Lindström, Eva S; Johannesson, Hanna
2018-03-13
In this study, we explored the diversity of green algal symbionts (photobionts) in sympatric populations of the cosmopolitan lichen-forming fungi Thamnolia and Cetraria. We sequenced with both Sanger and Ion Torrent High-Throughput Sequencing technologies the photobiont ITS-region of 30 lichen thalli from two islands: Iceland and Öland. While Sanger recovered just one photobiont genotype from each thallus, the Ion Torrent data recovered 10-18 OTUs for each pool of 5 lichen thalli, suggesting that individual lichens can contain heterogeneous photobiont populations. Both methods showed evidence for photobiont sharing between Thamnolia and Cetraria on Iceland. In contrast, our data suggest that on Öland the two mycobionts associate with distinct photobiont communities, with few shared OTUs revealed by Ion Torrent sequencing. Furthermore, by comparing our sequences with public data, we identified closely related photobionts from geographically distant localities. Taken together, we suggest that the photobiont composition in Thamnolia and Cetraria results from both photobiont-mycobiont codispersal and local acquisition during mycobiont establishment and/or lichen growth. We hypothesize that this is a successful strategy for lichens to be flexible in the use of the most adapted photobiont for the environment.
Mandelker, Diana; Schmidt, Ryan J; Ankala, Arunkanth; McDonald Gibson, Kristin; Bowser, Mark; Sharma, Himanshu; Duffy, Elizabeth; Hegde, Madhuri; Santani, Avni; Lebo, Matthew; Funke, Birgit
2016-12-01
Next-generation sequencing (NGS) is now routinely used to interrogate large sets of genes in a diagnostic setting. Regions of high sequence homology continue to be a major challenge for short-read technologies and can lead to false-positive and false-negative diagnostic errors. At the scale of whole-exome sequencing (WES), laboratories may be limited in their knowledge of genes and regions that pose technical hurdles due to high homology. We have created an exome-wide resource that catalogs highly homologous regions that is tailored toward diagnostic applications. This resource was developed using a mappability-based approach tailored to current Sanger and NGS protocols. Gene-level and exon-level lists delineate regions that are difficult or impossible to analyze via standard NGS. These regions are ranked by degree of affectedness, annotated for medical relevance, and classified by the type of homology (within-gene, different functional gene, known pseudogene, uncharacterized noncoding region). Additionally, we provide a list of exons that cannot be analyzed by short-amplicon Sanger sequencing. This resource can help guide clinical test design, supplemental assay implementation, and results interpretation in the context of high homology.Genet Med 18 12, 1282-1289.
The utility of Next Generation Sequencing for molecular diagnostics in Rett syndrome.
Vidal, Silvia; Brandi, Núria; Pacheco, Paola; Gerotina, Edgar; Blasco, Laura; Trotta, Jean-Rémi; Derdak, Sophia; Del Mar O'Callaghan, Maria; Garcia-Cazorla, Àngels; Pineda, Mercè; Armstrong, Judith
2017-09-25
Rett syndrome (RTT) is an early-onset neurodevelopmental disorder that almost exclusively affects girls and is totally disabling. Three genes have been identified that cause RTT: MECP2, CDKL5 and FOXG1. However, the etiology of some of RTT patients still remains unknown. Recently, next generation sequencing (NGS) has promoted genetic diagnoses because of the quickness and affordability of the method. To evaluate the usefulness of NGS in genetic diagnosis, we present the genetic study of RTT-like patients using different techniques based on this technology. We studied 1577 patients with RTT-like clinical diagnoses and reviewed patients who were previously studied and thought to have RTT genes by Sanger sequencing. Genetically, 477 of 1577 patients with a RTT-like suspicion have been diagnosed. Positive results were found in 30% by Sanger sequencing, 23% with a custom panel, 24% with a commercial panel and 32% with whole exome sequencing. A genetic study using NGS allows the study of a larger number of genes associated with RTT-like symptoms simultaneously, providing genetic study of a wider group of patients as well as significantly reducing the response time and cost of the study.
[Analysis of SOX10 gene mutation in a family affected with Waardenburg syndrome type II].
Zheng, Lei; Yan, Yousheng; Chen, Xue; Zhang, Chuan; Zhang, Qinghua; Feng, Xuan; Hao, Shen
2018-02-10
OBJECTIVE To detect potential mutation of SOX10 gene in a pedigree affected with Warrdenburg syndrome type II. METHODS Genomic DNA was extracted from peripheral blood samples of the proband and his family members. Exons and flanking sequences of MITF, PAX3, SOX10, SNAI2, END3 and ENDRB genes were analyzed by chip capturing and high throughput sequencing. Suspected mutations were verified with Sanger sequencing. RESULTS A c.127C>T (p.R43X) mutation of the SOX10 gene was detected in the proband, for which both parents showed a wild-type genotype. CONCLUSION The c.127C>T (p.R43X) mutation of SOX10 gene probably underlies the ocular symptoms and hearing loss of the proband.
Detection of a divergent variant of grapevine virus F by next-generation sequencing.
Molenaar, Nicholas; Burger, Johan T; Maree, Hans J
2015-08-01
The complete genome sequence of a South African isolate of grapevine virus F (GVF) is presented. It was first detected by metagenomic next-generation sequencing of field samples and validated through direct Sanger sequencing. The genome sequence of GVF isolate V5 consists of 7539 nucleotides and contains a poly(A) tail. It has a typical vitivirus genome arrangement that comprises five open reading frames (ORFs), which share only 88.96 % nucleotide sequence identity with the existing complete GVF genome sequence (JX105428).
Zhong, Shan; Zhang, Hai-ping; Zheng, Jie; Bai, Dong-yu; Fu, Li; Chen, Pei-qiong
2013-04-01
To investigate the frequency of EML4-ALK fusion gene in non-small-cell lung cancer (NSCLC) patients, and its correlation with clinicopathologic features. Real-time PCR was used to detect the presence of EML4-ALK fusion gene in 268 cases of NSCLCs using paraffin-embedded tissue samples(among which 164 samples were re-validated by Sanger sequencing). Related clinicopathological correlation was analyzed. EML4-ALK fusion gene was found in 4.1% (11/268) of the cases. One hundred and sixty four samples were verified by Sanger sequencing, and the overall coincidence of the results of two methods (Sanger sequencing and Real-time PCR) was 100%. Female patients (5.9%, 5/85), ≤ 60 years of age (4.3%, 6/140), non-smokers (6.8%, 8/118) and adenocarcinomas (7.6%, 10/132) had a higher mutation rate than that in male patients (3.3%, 6/183), > 60 years of age (4.0%, 5/124), smokers (1.6%, 2/132) and squamous cell carcinomas (1.3%, 1/79), although no statistical significance in age (P = 0.918), gender (P = 0.503), smoking history (P = 0.092) and histological type (P = 0.094). Chinese NSCLC patients have a 4.1% detection rate of EML4-ALK fusion gene in the tumor tissues. Female, non-smoker and adenocarcinoma histological subtype tend to be associated with a higher rate of EML4-ALK gene fusion.
2012-01-01
Background Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Results Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80–120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Conclusions Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project. PMID:23110314
Identification of a novel vitivirus from grapevines in New Zealand.
Blouin, Arnaud G; Keenan, Sandi; Napier, Kathryn R; Barrero, Roberto A; MacDiarmid, Robin M
2018-01-01
We report a sequence of a novel vitivirus from Vitis vinifera obtained using two high-throughput sequencing (HTS) strategies on RNA. The initial discovery from small-RNA sequencing was confirmed by HTS of the total RNA and Sanger sequencing. The new virus has a genome structure similar to the one reported for other vitiviruses, with five open reading frames (ORFs) coding for the conserved domains described for members of that genus. Phylogenetic analysis of the complete genome sequence confirmed its affiliation to the genus Vitivirus, with the closest described viruses being grapevine virus E (GVE) and Agave tequilana leaf virus (ATLV). However, the virus we report is distinct and shares only 51% amino acid sequence identity with GVE in the replicase polyprotein and 66.8% amino acid sequence identity with ATLV in the coat protein. This is well below the threshold determined by the ICTV for species demarcation, and we propose that this virus represents a new species. It is provisionally named "grapevine virus G".
Genetic and clinical features of cryopyrin-associated periodic syndromes in Turkish children.
Eroglu, Fehime Kara; Kasapcopur, Ozgür; Beşbaş, Nesrin; Ozaltin, Fatih; Bilginer, Yelda; Barut, Kenan; Mensa-Vilaro, Anna; Nakagawa, Kenji; Heike, Toshio; Nishikomori, Ryuta; Arostegui, Juan; Ozen, Seza
2016-01-01
The aim of this study was to present the genetic and clinical data of the largest cohort of Turkish cryopyrin-associated periodic syndromes (CAPS) patients. This is a two-centre descriptive study of Turkish children with clinical diagnosis of CAPS. NLRP3 analyses were performed by Sanger sequencing and by massively parallel sequencing. ASC dependent NF-κB activation and transfection-induced THP-1 cell death assays determined the functional consequences of the detected variants. Disease activity and response to anti interleukin 1 (anti-IL-1) treatment was also assessed. Heterozygous germline NLRP3 mutation was detected in 8 of 14 enrolled patients (57.1%). Two novel somatic mutations Y560H and G307D were found which induced both THP-1 cell death and ASC dependent NF-kB activation. With anti-IL-1 treatment the disease activity was improved in all patients except one. Except two patients with macrophage activation syndrome (MAS) attack, there were no serious adverse events requiring hospitalisation. CAPS should be considered in all patients with typical symptoms even if Sanger-based genetic analysis is negative, since a considerable number of patients have mosaicism. Treatment should be patient-tailored and MAS should be considered as a rare complication.
Smith, Miriam J; Beetz, Christian; Williams, Simon G; Bhaskar, Sanjeev S; O'Sullivan, James; Anderson, Beverley; Daly, Sarah B; Urquhart, Jill E; Bholah, Zaynab; Oudit, Deemesh; Cheesman, Edmund; Kelsey, Anna; McCabe, Martin G; Newman, William G; Evans, D Gareth R
2014-12-20
Heterozygous germline PTCH1 mutations are causative of Gorlin syndrome (naevoid basal cell carcinoma), but detection rates > 70% have rarely been reported. We aimed to define the causative mutations in individuals with Gorlin syndrome without PTCH1 mutations. We undertook exome sequencing on lymphocyte DNA from four unrelated individuals from families with Gorlin syndrome with no PTCH1 mutations found by Sanger sequencing, multiplex ligation-dependent probe amplification (MLPA), or RNA analysis. A germline heterozygous nonsense mutation in SUFU was identified in one of four exomes. Sanger sequencing of SUFU in 23 additional PTCH1-negative Gorlin syndrome families identified a SUFU mutation in a second family. Copy-number analysis of SUFU by MLPA revealed a large heterozygous deletion in a third family. All three SUFU-positive families fulfilled diagnostic criteria for Gorlin syndrome, although none had odontogenic jaw keratocysts. Each SUFU-positive family included a single case of medulloblastoma, whereas only two (1.7%) of 115 individuals with Gorlin syndrome and a PTCH1 mutation developed medulloblastoma. We demonstrate convincing evidence that SUFU mutations can cause classical Gorlin syndrome. Our study redefines the risk of medulloblastoma in Gorlin syndrome, dependent on the underlying causative gene. Previous reports have found a 5% risk of medulloblastoma in Gorlin syndrome. We found a < 2% risk in PTCH1 mutation-positive individuals, with a risk up to 20× higher in SUFU mutation-positive individuals. Our data suggest childhood brain magnetic resonance imaging surveillance is justified in SUFU-related, but not PTCH1-related, Gorlin syndrome. © 2014 by American Society of Clinical Oncology.
Fong, Wai-Ying; Ho, Chi-Chun; Poon, Wing-Tat
2017-05-12
Thiopurine intolerance and treatment-related toxicity, such as fatal myelosuppression, is related to non-function genetic variants encoding thiopurine S-methyltransferase (TPMT) and Nudix hydrolase 15 (NUDT15). Genetic testing of the common variants NUDT15:NM_018283.2:c.415C>T (Arg139Cys, dbSNP rs116855232 T allele) and TPMT: NM_000367.4:c.719A>G (TPMT*3C, dbSNP rs1142345 G allele) in East Asians including Chinese can potentially prevent treatment-related complications. Two complementary genotyping approaches, real-time PCR-high resolution melt (PCR-HRM) and PCR-restriction fragment length morphism (PCR-RFLP) analysis were evaluated using conventional PCR and Sanger sequencing genotyping as the gold standard. Sixty patient samples were tested, revealing seven patients (11.7%) heterozygous for NUDT15 c.415C>T, one patient homozygous for the variant and one patient heterozygous for the TPMT*3C non-function allele. No patient was found to harbor both variants. In total, nine out of 60 (15%) patients tested had genotypic evidence of thiopurine intolerance, which may require dosage adjustment or alternative medication should they be started on azathioprine, mercaptopurine or thioguanine. The two newly developed assays were more efficient and showed complete concordance (60/60, 100%) compared to the Sanger sequencing results. Accurate and cost-effective genotyping assays by real-time PCR-HRM and PCR-RFLP for NUDT15 c.415C>T and TPMT*3C were successfully developed. Further studies may establish their roles in genotype-informed clinical decision-making in the prevention of morbidity and mortality due to thiopurine intolerance.
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].
Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y
2017-08-01
To analyze and detect the whole genome sequence of human mitochondrial DNA (mtDNA) by Ion Torrent PGM™ platform and to study the differences of mtDNA sequence in different tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costicartilage, nail, skeletal muscle and oral epithelium. Amplification of whole genome sequence of mtDNA was performed by 4 pairs of primer. Libraries were constructed with Ion Shear™ Plus Reagents kit and Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed using Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions on HVⅠ region. The whole genome sequence of mtDNA from all samples were amplified successfully. Six unrelated individuals belonged to 6 different haplotypes. Different tissues in one individual had heteroplasmy difference. The heteroplasmy positions and the mutation positions on HVⅠ region were verified by Sanger sequencing. After a consistency check by the Kappa method, it was found that the results of mtDNA sequence had a high consistency in different tissues. The testing method used in present study for sequencing the whole genome sequence of human mtDNA can detect the heteroplasmy difference in different tissues, which have good consistency. The results provide guidance for the further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine
Steele-Stallard, Heather B; Le Quesne Stabej, Polona; Lenassi, Eva; Luxon, Linda M; Claustres, Mireille; Roux, Anne-Francoise; Webster, Andrew R; Bitner-Glindzicz, Maria
2013-08-08
Usher Syndrome is the leading cause of inherited deaf-blindness. It is divided into three subtypes, of which the most common is Usher type 2, and the USH2A gene accounts for 75-80% of cases. Despite recent sequencing strategies, in our cohort a significant proportion of individuals with Usher type 2 have just one heterozygous disease-causing mutation in USH2A, or no convincing disease-causing mutations across nine Usher genes. The purpose of this study was to improve the molecular diagnosis in these families by screening USH2A for duplications, heterozygous deletions and a common pathogenic deep intronic variant USH2A: c.7595-2144A>G. Forty-nine Usher type 2 or atypical Usher families who had missing mutations (mono-allelic USH2A or no mutations following Sanger sequencing of nine Usher genes) were screened for duplications/deletions using the USH2A SALSA MLPA reagent kit (MRC-Holland). Identification of USH2A: c.7595-2144A>G was achieved by Sanger sequencing. Mutations were confirmed by a combination of reverse transcription PCR using RNA extracted from nasal epithelial cells or fibroblasts, and by array comparative genomic hybridisation with sequencing across the genomic breakpoints. Eight mutations were identified in 23 Usher type 2 families (35%) with one previously identified heterozygous disease-causing mutation in USH2A. These consisted of five heterozygous deletions, one duplication, and two heterozygous instances of the pathogenic variant USH2A: c.7595-2144A>G. No variants were found in the 15 Usher type 2 families with no previously identified disease-causing mutations. In 11 atypical families, none of whom had any previously identified convincing disease-causing mutations, the mutation USH2A: c.7595-2144A>G was identified in a heterozygous state in one family. All five deletions and the heterozygous duplication we report here are novel. This is the first time that a duplication in USH2A has been reported as a cause of Usher syndrome. We found that 8 of 23 (35%) of 'missing' mutations in Usher type 2 probands with only a single heterozygous USH2A mutation detected with Sanger sequencing could be attributed to deletions, duplications or a pathogenic deep intronic variant. Future mutation detection strategies and genetic counselling will need to take into account the prevalence of these types of mutations in order to provide a more comprehensive diagnostic service.
Chen, Zhao; Moran, Kimberly; Richards-Yutz, Jennifer; Toorens, Erik; Gerhart, Daniel; Ganguly, Tapan; Shields, Carol L; Ganguly, Arupa
2014-03-01
Sporadic retinoblastoma (RB) is caused by de novo mutations in the RB1 gene. Often, these mutations are present as mosaic mutations that cannot be detected by Sanger sequencing. Next-generation deep sequencing allows unambiguous detection of the mosaic mutations in lymphocyte DNA. Deep sequencing of the RB1 gene on lymphocyte DNA from 20 bilateral and 70 unilateral RB cases was performed, where Sanger sequencing excluded the presence of mutations. The individual exons of the RB1 gene from each sample were amplified, pooled, ligated to barcoded adapters, and sequenced using semiconductor sequencing on an Ion Torrent Personal Genome Machine. Six low-level mosaic mutations were identified in bilateral RB and four in unilateral RB cases. The incidence of low-level mosaic mutation was estimated to be 30% and 6%, respectively, in sporadic bilateral and unilateral RB cases, previously classified as mutation negative. The frequency of point mutations detectable in lymphocyte DNA increased from 96% to 97% for bilateral RB and from 13% to 18% for unilateral RB. The use of deep sequencing technology increased the sensitivity of the detection of low-level germline mosaic mutations in the RB1 gene. This finding has significant implications for improved clinical diagnosis, genetic counseling, surveillance, and management of RB. © 2013 WILEY PERIODICALS, INC.
A multiplex primer design algorithm for target amplification of continuous genomic regions.
Ozturk, Ahmet Rasit; Can, Tolga
2017-06-19
Targeted Next Generation Sequencing (NGS) assays are cost-efficient and reliable alternatives to Sanger sequencing. For sequencing of very large set of genes, the target enrichment approach is suitable. However, for smaller genomic regions, the target amplification method is more efficient than both the target enrichment method and Sanger sequencing. The major difficulty of the target amplification method is the preparation of amplicons, regarding required time, equipment, and labor. Multiplex PCR (MPCR) is a good solution for the mentioned problems. We propose a novel method to design MPCR primers for a continuous genomic region, following the best practices of clinically reliable PCR design processes. On an experimental setup with 48 different combinations of factors, we have shown that multiple parameters might effect finding the first feasible solution. Increasing the length of the initial primer candidate selection sequence gives better results whereas waiting for a longer time to find the first feasible solution does not have a significant impact. We generated MPCR primer designs for the HBB whole gene, MEFV coding regions, and human exons between 2000 bp to 2100 bp-long. Our benchmarking experiments show that the proposed MPCR approach is able produce reliable NGS assay primers for a given sequence in a reasonable amount of time.
Low Diversity in the Mitogenome of Sperm Whales Revealed by Next-Generation Sequencing
Alexander, Alana; Steel, Debbie; Slikas, Beth; Hoekzema, Kendra; Carraher, Colm; Parks, Matthew; Cronn, Richard; Baker, C. Scott
2013-01-01
Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20 mitogenomes from 17 sperm whales representative of worldwide diversity using Next Generation Sequencing (NGS) technologies (Illumina GAIIx, Roche 454 GS Junior). Resequencing of three individuals with both NGS platforms and partial Sanger sequencing showed low discrepancy rates (454-Illumina: 0.0071%; Sanger-Illumina: 0.0034%; and Sanger-454: 0.0023%) confirming suitability of both NGS platforms for investigating low mitogenomic diversity. Using the 17 sperm whale mitogenomes in a phylogenetic reconstruction with 41 other species, including 11 new dolphin mitogenomes, we tested two hypotheses for the low CR diversity. First, the hypothesis that CR-specific constraints have reduced diversity solely in the CR was rejected as diversity was low throughout the mitogenome, not just in the CR (overall diversity π = 0.096%; protein-coding 3rd codon = 0.22%; CR = 0.35%), and CR phylogenetic signal was congruent with protein-coding regions. Second, the hypothesis that slow substitution rates reduced diversity throughout the sperm whale mitogenome was rejected as sperm whales had significantly higher rates of CR evolution and no evidence of slow coding region evolution relative to other cetaceans. The estimated time to most recent common ancestor for sperm whale mitogenomes was 72,800 to 137,400 years ago (95% highest probability density interval), consistent with previous hypotheses of a bottleneck or selective sweep as likely causes of low mitogenome diversity. PMID:23254394
Low diversity in the mitogenome of sperm whales revealed by next-generation sequencing.
Alexander, Alana; Steel, Debbie; Slikas, Beth; Hoekzema, Kendra; Carraher, Colm; Parks, Matthew; Cronn, Richard; Baker, C Scott
2013-01-01
Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20 mitogenomes from 17 sperm whales representative of worldwide diversity using Next Generation Sequencing (NGS) technologies (Illumina GAIIx, Roche 454 GS Junior). Resequencing of three individuals with both NGS platforms and partial Sanger sequencing showed low discrepancy rates (454-Illumina: 0.0071%; Sanger-Illumina: 0.0034%; and Sanger-454: 0.0023%) confirming suitability of both NGS platforms for investigating low mitogenomic diversity. Using the 17 sperm whale mitogenomes in a phylogenetic reconstruction with 41 other species, including 11 new dolphin mitogenomes, we tested two hypotheses for the low CR diversity. First, the hypothesis that CR-specific constraints have reduced diversity solely in the CR was rejected as diversity was low throughout the mitogenome, not just in the CR (overall diversity π = 0.096%; protein-coding 3rd codon = 0.22%; CR = 0.35%), and CR phylogenetic signal was congruent with protein-coding regions. Second, the hypothesis that slow substitution rates reduced diversity throughout the sperm whale mitogenome was rejected as sperm whales had significantly higher rates of CR evolution and no evidence of slow coding region evolution relative to other cetaceans. The estimated time to most recent common ancestor for sperm whale mitogenomes was 72,800 to 137,400 years ago (95% highest probability density interval), consistent with previous hypotheses of a bottleneck or selective sweep as likely causes of low mitogenome diversity.
Park, Ji Hye
2018-01-01
Estimation of postmortem interval (PMI) is paramount in modern forensic investigation. After the disappearance of the early postmortem phenomena conventionally used to estimate PMI, entomologic evidence provides important indicators for PMI estimation. The age of the oldest fly larvae or pupae can be estimated to pinpoint the time of oviposition, which is considered the minimum PMI (PMImin). The development rate of insects is usually temperature dependent and species specific. Therefore, species identification is mandatory for PMImin estimation using entomological evidence. The classical morphological identification method cannot be applied when specimens are damaged or have not yet matured. To overcome this limitation, some investigators employ molecular identification using mitochondrial cytochrome c oxidase subunit I (COI) nucleotide sequences. The molecular identification method commonly uses Sanger's nucleotide sequencing and molecular phylogeny, which are complex and time consuming and constitute another obstacle for forensic investigators. In this study, instead of using conventional Sanger's nucleotide sequencing, single-nucleotide polymorphisms (SNPs) in the COI gene region, which are unique between fly species, were selected and targeted for single-base extension (SBE) technology. These SNPs were genotyped using a SNaPshot® kit. Eleven Calliphoridae and seven Sarcophagidae species were covered. To validate this genotyping, fly DNA samples (103 adults, 84 larvae, and 4 pupae) previously confirmed by DNA barcoding were used. This method worked quickly with minimal DNA, providing a potential alternative to conventional DNA barcoding. Consisting of only a few simple electropherogram peaks, the results were more straightforward compared with those of the conventional DNA barcoding produced by Sanger's nucleotide sequencing. PMID:29682531
USDA-ARS?s Scientific Manuscript database
In a collaboration with Purdue University researchers, we sequenced a 143,606 base pair Rhipicephalus microplus BAC library clone that contained the coding region for acetylcholinesterase 1 (AChE1). Sequencing was by Sanger protocols and the final assembly resulted in 15 contigs of varying length, e...
Mutation screening of Chinese Treacher Collins syndrome patients identified novel TCOF1 mutations.
Chen, Ying; Guo, Luo; Li, Chen-Long; Shan, Jing; Xu, Hai-Song; Li, Jie-Ying; Sun, Shan; Hao, Shao-Juan; Jin, Lei; Chai, Gang; Zhang, Tian-Yu
2018-04-01
Treacher Collins syndrome (TCS) (OMIM 154500) is a rare congenital craniofacial disorder with an autosomal dominant manner of inheritance in most cases. To date, three pathogenic genes (TCOF1, POLR1D and POLR1C) have been identified. In this study, we conducted mutational analysis on Chinese TCS patients to reveal a mutational spectrum of known causative genes and show phenotype-genotype data to provide more information for gene counselling and future studies on the pathogenesis of TCS. Twenty-two TCS patients were recruited from two tertiary referral centres, and Sanger sequencing for the coding exons and exon-intron boundaries of TCOF1, POLR1D and POLR1C was performed. For patients without small variants, further copy number variations (CNVs) analysis was conducted using high-density SNP array platforms. The Sanger sequencing overall mutation detection rate was as high as 86.3% (19/22) for our cohort. Fifteen TCOF1 pathogenic variants, including ten novel mutations, were identified in nineteen patients. No causative mutations in POLR1D and POLR1C genes and no CNVs mutations were detected. A suspected autosomal dominant inheritance case that implies germinal mosaicism was described. Our study confirmed that TCOF1 was the main disease-causing gene for the Chinese TCS population and revealed its mutation spectrum. We also addressed the need for more studies of mosaicism in TCS cases, which could explain the mechanism of autosomal dominant inheritance in TCS cases and benefit the prevention of TCS.
Molecular Characterization of Transgenic Events Using Next Generation Sequencing Approach.
Guttikonda, Satish K; Marri, Pradeep; Mammadov, Jafar; Ye, Liang; Soe, Khaing; Richey, Kimberly; Cruse, James; Zhuang, Meibao; Gao, Zhifang; Evans, Clive; Rounsley, Steve; Kumpatla, Siva P
2016-01-01
Demand for the commercial use of genetically modified (GM) crops has been increasing in light of the projected growth of world population to nine billion by 2050. A prerequisite of paramount importance for regulatory submissions is the rigorous safety assessment of GM crops. One of the components of safety assessment is molecular characterization at DNA level which helps to determine the copy number, integrity and stability of a transgene; characterize the integration site within a host genome; and confirm the absence of vector DNA. Historically, molecular characterization has been carried out using Southern blot analysis coupled with Sanger sequencing. While this is a robust approach to characterize the transgenic crops, it is both time- and resource-consuming. The emergence of next-generation sequencing (NGS) technologies has provided highly sensitive and cost- and labor-effective alternative for molecular characterization compared to traditional Southern blot analysis. Herein, we have demonstrated the successful application of both whole genome sequencing and target capture sequencing approaches for the characterization of single and stacked transgenic events and compared the results and inferences with traditional method with respect to key criteria required for regulatory submissions.
Shirts, Brian H; Salipante, Stephen J; Casadei, Silvia; Ryan, Shawnia; Martin, Judith; Jacobson, Angela; Vlaskin, Tatyana; Koehler, Karen; Livingston, Robert J; King, Mary-Claire; Walsh, Tom; Pritchard, Colin C
2014-10-01
Single-exon inversions have rarely been described in clinical syndromes and are challenging to detect using Sanger sequencing. We report the case of a 40-year-old woman with adenomatous colon polyps too numerous to count and who had a complex inversion spanning the entire exon 10 in APC (the gene encoding for adenomatous polyposis coli), causing exon skipping and resulting in a frameshift and premature protein truncation. In this study, we employed complete APC gene sequencing using high-coverage next-generation sequencing by ColoSeq, analysis with BreakDancer and SLOPE software, and confirmatory transcript analysis. ColoSeq identified a complex small genomic rearrangement consisting of an inversion that results in translational skipping of exon 10 in the APC gene. This mutation would not have been detected by traditional sequencing or gene-dosage methods. We report a case of adenomatous polyposis resulting from a complex single-exon inversion. Our report highlights the benefits of large-scale sequencing methods that capture intronic sequences with high enough depth of coverage-as well as the use of informatics tools-to enable detection of small pathogenic structural rearrangements.
Tsai, Meng-Che; Yu, Hui-Wen; Liu, Tsunglin; Chou, Yen-Yin; Chiou, Yuan-Yow; Chen, Peng-Chieh
2018-01-01
Alström syndrome (AS) is a rare autosomal recessive disorder that shares clinical features with other ciliopathy-related diseases. Genetic mutation analysis is often required in making differential diagnosis but usually costly in time and effort using conventional Sanger sequencing. Herein we describe a Taiwanese patient presenting cone-rod dystrophy and early-onset obesity that progressed to diabetes mellitus with marked insulin resistance during adolescence. Whole exome sequencing of the patient's genomic DNA identified a novel frameshift mutation in exons 15 (c.10290_10291delTA, p.Lys3431Serfs * 10) and a rare mutation in 16 (c.10823_10824delAG, p.Arg3609Alafs * 6) of ALMS1 gene. The compound heterozygous mutations were predicted to render truncated proteins. This report highlighted the clinical utility of exome sequencing and extended the knowledge of mutation spectrum in AS patients.
Azab, Marwa Mohamed; Fayyad, Dalia Mukhtar
2018-01-01
The use of high throughput next generation technologies has allowed more comprehensive analysis than traditional Sanger sequencing. The specific aim of this study was to investigate the microbial diversity of primary endodontic infections using Illumina MiSeq sequencing platform in Egyptian patients. Samples were collected from 19 patients in Suez Canal University Hospital (Endodontic Department) using sterile # 15K file and paper points. DNA was extracted using Mo Bio power soil DNA isolation extraction kit followed by PCR amplification and agarose gel electrophoresis. The microbiome was characterized on the basis of the V3 and V4 hypervariable region of the 16S rRNA gene by using paired-end sequencing on Illumina MiSeq device. MOTHUR software was used in sequence filtration and analysis of sequenced data. A total of 1858 operational taxonomic units at 97% similarity were assigned to 26 phyla, 245 families, and 705 genera. Four main phyla Firmicutes, Bacteroidetes, Proteobacteria, and Synergistetes were predominant in all samples. At genus level, Prevotella, Bacillus, Porphyromonas, Streptococcus, and Bacteroides were the most abundant. Illumina MiSeq platform sequencing can be used to investigate oral microbiome composition of endodontic infections. Elucidating the ecology of endodontic infections is a necessary step in developing effective intracanal antimicrobials. PMID:29849646
Implementation of Cloud based next generation sequencing data analysis in a clinical laboratory.
Onsongo, Getiria; Erdmann, Jesse; Spears, Michael D; Chilton, John; Beckman, Kenneth B; Hauge, Adam; Yohe, Sophia; Schomaker, Matthew; Bower, Matthew; Silverstein, Kevin A T; Thyagarajan, Bharat
2014-05-23
The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software.
Reijnders, Margot R F; Janowski, Robert; Alvi, Mohsan; Self, Jay E; van Essen, Ton J; Vreeburg, Maaike; Rouhl, Rob P W; Stevens, Servi J C; Stegmann, Alexander P A; Schieving, Jolanda; Pfundt, Rolph; van Dijk, Katinke; Smeets, Eric; Stumpel, Connie T R M; Bok, Levinus A; Cobben, Jan Maarten; Engelen, Marc; Mansour, Sahar; Whiteford, Margo; Chandler, Kate E; Douzgou, Sofia; Cooper, Nicola S; Tan, Ene-Choo; Foo, Roger; Lai, Angeline H M; Rankin, Julia; Green, Andrew; Lönnqvist, Tuula; Isohanni, Pirjo; Williams, Shelley; Ruhoy, Ilene; Carvalho, Karen S; Dowling, James J; Lev, Dorit L; Sterbova, Katalin; Lassuthova, Petra; Neupauerová, Jana; Waugh, Jeff L; Keros, Sotirios; Clayton-Smith, Jill; Smithson, Sarah F; Brunner, Han G; van Hoeckel, Ceciel; Anderson, Mel; Clowes, Virginia E; Siu, Victoria Mok; DDD study, The; Selber, Paulo; Leventer, Richard J; Nellaker, Christoffer; Niessing, Dierk; Hunt, David; Baralle, Diana
2018-01-01
Background De novo mutations in PURA have recently been described to cause PURA syndrome, a neurodevelopmental disorder characterised by severe intellectual disability (ID), epilepsy, feeding difficulties and neonatal hypotonia. Objectives To delineate the clinical spectrum of PURA syndrome and study genotype-phenotype correlations. Methods Diagnostic or research-based exome or Sanger sequencing was performed in individuals with ID. We systematically collected clinical and mutation data on newly ascertained PURA syndrome individuals, evaluated data of previously reported individuals and performed a computational analysis of photographs. We classified mutations based on predicted effect using 3D in silico models of crystal structures of Drosophila-derived Pur-alpha homologues. Finally, we explored genotype-phenotype correlations by analysis of both recurrent mutations as well as mutation classes. Results We report mutations in PURA (purine-rich element binding protein A) in 32 individuals, the largest cohort described so far. Evaluation of clinical data, including 22 previously published cases, revealed that all have moderate to severe ID and neonatal-onset symptoms, including hypotonia (96%), respiratory problems (57%), feeding difficulties (77%), exaggerated startle response (44%), hypersomnolence (66%) and hypothermia (35%). Epilepsy (54%) and gastrointestinal (69%), ophthalmological (51%) and endocrine problems (42%) were observed frequently. Computational analysis of facial photographs showed subtle facial dysmorphism. No strong genotype-phenotype correlation was identified by subgrouping mutations into functional classes. Conclusion We delineate the clinical spectrum of PURA syndrome with the identification of 32 additional individuals. The identification of one individual through targeted Sanger sequencing points towards the clinical recognisability of the syndrome. Genotype-phenotype analysis showed no significant correlation between mutation classes and disease severity. PMID:29097605
DSAP: deep-sequencing small RNA analysis pipeline.
Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus
2010-07-01
DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
Multiplexed microsatellite recovery using massively parallel sequencing
Jennings, T.N.; Knaus, B.J.; Mullins, T.D.; Haig, S.M.; Cronn, R.C.
2011-01-01
Conservation and management of natural populations requires accurate and inexpensive genotyping methods. Traditional microsatellite, or simple sequence repeat (SSR), marker analysis remains a popular genotyping method because of the comparatively low cost of marker development, ease of analysis and high power of genotype discrimination. With the availability of massively parallel sequencing (MPS), it is now possible to sequence microsatellite-enriched genomic libraries in multiplex pools. To test this approach, we prepared seven microsatellite-enriched, barcoded genomic libraries from diverse taxa (two conifer trees, five birds) and sequenced these on one lane of the Illumina Genome Analyzer using paired-end 80-bp reads. In this experiment, we screened 6.1 million sequences and identified 356958 unique microreads that contained di- or trinucleotide microsatellites. Examination of four species shows that our conversion rate from raw sequences to polymorphic markers compares favourably to Sanger- and 454-based methods. The advantage of multiplexed MPS is that the staggering capacity of modern microread sequencing is spread across many libraries; this reduces sample preparation and sequencing costs to less than $400 (USD) per species. This price is sufficiently low that microsatellite libraries could be prepared and sequenced for all 1373 organisms listed as 'threatened' and 'endangered' in the United States for under $0.5M (USD).
Goossens, Dirk; Moens, Lotte N; Nelis, Eva; Lenaerts, An-Sofie; Glassee, Wim; Kalbe, Andreas; Frey, Bruno; Kopal, Guido; De Jonghe, Peter; De Rijk, Peter; Del-Favero, Jurgen
2009-03-01
We evaluated multiplex PCR amplification as a front-end for high-throughput sequencing, to widen the applicability of massive parallel sequencers for the detailed analysis of complex genomes. Using multiplex PCR reactions, we sequenced the complete coding regions of seven genes implicated in peripheral neuropathies in 40 individuals on a GS-FLX genome sequencer (Roche). The resulting dataset showed highly specific and uniform amplification. Comparison of the GS-FLX sequencing data with the dataset generated by Sanger sequencing confirmed the detection of all variants present and proved the sensitivity of the method for mutation detection. In addition, we showed that we could exploit the multiplexed PCR amplicons to determine individual copy number variation (CNV), increasing the spectrum of detected variations to both genetic and genomic variants. We conclude that our straightforward procedure substantially expands the applicability of the massive parallel sequencers for sequencing projects of a moderate number of amplicons (50-500) with typical applications in resequencing exons in positional or functional candidate regions and molecular genetic diagnostics. 2008 Wiley-Liss, Inc.
Cecconi, Massimiliano; Parodi, Maria I.; Formisano, Francesco; Spirito, Paolo; Autore, Camillo; Musumeci, Maria B.; Favale, Stefano; Forleo, Cinzia; Rapezzi, Claudio; Biagini, Elena; Davì, Sabrina; Canepa, Elisabetta; Pennese, Loredana; Castagnetta, Mauro; Degiorgio, Dario; Coviello, Domenico A.
2016-01-01
Hypertrophic cardiomyopathy (HCM) is mainly associated with myosin, heavy chain 7 (MYH7) and myosin binding protein C, cardiac (MYBPC3) mutations. In order to better explain the clinical and genetic heterogeneity in HCM patients, in this study, we implemented a target-next generation sequencing (NGS) assay. An Ion AmpliSeq™ Custom Panel for the enrichment of 19 genes, of which 9 of these did not encode thick/intermediate and thin myofilament (TTm) proteins and, among them, 3 responsible of HCM phenocopy, was created. Ninety-two DNA samples were analyzed by the Ion Personal Genome Machine: 73 DNA samples (training set), previously genotyped in some of the genes by Sanger sequencing, were used to optimize the NGS strategy, whereas 19 DNA samples (discovery set) allowed the evaluation of NGS performance. In the training set, we identified 72 out of 73 expected mutations and 15 additional mutations: the molecular diagnosis was achieved in one patient with a previously wild-type status and the pre-excitation syndrome was explained in another. In the discovery set, we identified 20 mutations, 5 of which were in genes encoding non-TTm proteins, increasing the diagnostic yield by approximately 20%: a single mutation in genes encoding non-TTm proteins was identified in 2 out of 3 borderline HCM patients, whereas co-occuring mutations in genes encoding TTm and galactosidase alpha (GLA) altered proteins were characterized in a male with HCM and multiorgan dysfunction. Our combined targeted NGS-Sanger sequencing-based strategy allowed the molecular diagnosis of HCM with greater efficiency than using the conventional (Sanger) sequencing alone. Mutant alleles encoding non-TTm proteins may aid in the complete understanding of the genetic and phenotypic heterogeneity of HCM: co-occuring mutations of genes encoding TTm and non-TTm proteins could explain the wide variability of the HCM phenotype, whereas mutations in genes encoding only the non-TTm proteins are identifiable in patients with a milder HCM status. PMID:27600940
Biallelic Mutations in NBAS Cause Recurrent Acute Liver Failure with Onset in Infancy.
Haack, Tobias B; Staufner, Christian; Köpke, Marlies G; Straub, Beate K; Kölker, Stefan; Thiel, Christian; Freisinger, Peter; Baric, Ivo; McKiernan, Patrick J; Dikow, Nicola; Harting, Inga; Beisse, Flemming; Burgard, Peter; Kotzaeridou, Urania; Kühr, Joachim; Himbert, Urban; Taylor, Robert W; Distelmaier, Felix; Vockley, Jerry; Ghaloul-Gonzalez, Lina; Zschocke, Johannes; Kremer, Laura S; Graf, Elisabeth; Schwarzmayr, Thomas; Bader, Daniel M; Gagneur, Julien; Wieland, Thomas; Terrile, Caterina; Strom, Tim M; Meitinger, Thomas; Hoffmann, Georg F; Prokisch, Holger
2015-07-02
Acute liver failure (ALF) in infancy and childhood is a life-threatening emergency. Few conditions are known to cause recurrent acute liver failure (RALF), and in about 50% of cases, the underlying molecular cause remains unresolved. Exome sequencing in five unrelated individuals with fever-dependent RALF revealed biallelic mutations in NBAS. Subsequent Sanger sequencing of NBAS in 15 additional unrelated individuals with RALF or ALF identified compound heterozygous mutations in an additional six individuals from five families. Immunoblot analysis of mutant fibroblasts showed reduced protein levels of NBAS and its proposed interaction partner p31, both involved in retrograde transport between endoplasmic reticulum and Golgi. We recommend NBAS analysis in individuals with acute infantile liver failure, especially if triggered by fever. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Mitochondrial sequence analysis for forensic identification using pyrosequencing technology.
Andréasson, H; Asp, A; Alderborn, A; Gyllensten, U; Allen, M
2002-01-01
Over recent years, requests for mtDNA analysis in the field of forensic medicine have notably increased, and the results of such analyses have proved to be very useful in forensic cases where nuclear DNA analysis cannot be performed. Traditionally, mtDNA has been analyzed by DNA sequencing of the two hypervariable regions, HVI and HVII, in the D-loop. DNA sequence analysis using the conventional Sanger sequencing is very robust but time consuming and labor intensive. By contrast, mtDNA analysis based on the pyrosequencing technology provides fast and accurate results from the human mtDNA present in many types of evidence materials in forensic casework. The assay has been developed to determine polymorphic sites in the mitochondrial D-loop as well as the coding region to further increase the discrimination power of mtDNA analysis. The pyrosequencing technology for analysis of mtDNA polymorphisms has been tested with regard to sensitivity, reproducibility, and success rate when applied to control samples and actual casework materials. The results show that the method is very accurate and sensitive; the results are easily interpreted and provide a high success rate on casework samples. The panel of pyrosequencing reactions for the mtDNA polymorphisms were chosen to result in an optimal discrimination power in relation to the number of bases determined.
Oliveira, Jorge; Negrão, Luís; Fineza, Isabel; Taipa, Ricardo; Melo-Pires, Manuel; Fortuna, Ana Maria; Gonçalves, Ana Rita; Froufe, Hugo; Egas, Conceição; Santos, Rosário; Sousa, Mário
2015-06-01
Muscular dystrophies (MDs) are a group of hereditary muscle disorders that include two particularly heterogeneous subgroups: limb-girdle MD and congenital MD, linked to 52 different genes (seven common to both subgroups). Massive parallel sequencing technology may avoid the usual stepwise gene-by-gene analysis. We report the whole-exome sequencing (WES) analysis of a patient with childhood-onset progressive MD, also presenting mental retardation and dilated cardiomyopathy. Conventional sequencing had excluded eight candidate genes. WES of the trio (patient and parents) was performed using the ion proton sequencing system. Data analysis resorted to filtering steps using the GEMINI software revealed a novel silent variant in the choline kinase beta (CHKB) gene. Inspection of sequence alignments ultimately identified the causal variant (CHKB:c.1031+3G>C). This splice site mutation was confirmed using Sanger sequencing and its effect was further evaluated with gene expression analysis. On reassessment of the muscle biopsy, typical abnormal mitochondrial oxidative changes were observed. Mutations in CHKB have been shown to cause phosphatidylcholine deficiency in myofibers, causing a rare form of CMD (only 21 patients reported). Notwithstanding interpretative difficulties that need to be overcome before the integration of WES in the diagnostic workflow, this work corroborates its utility in solving cases from highly heterogeneous groups of diseases, in which conventional diagnostic approaches fail to provide a definitive diagnosis.
Rapid RHD Zygosity Determination Using Digital PCR.
Sillence, Kelly A; Halawani, Amr J; Tounsi, Wajnat A; Clarke, Kirsty A; Kiernan, Michele; Madgett, Tracey E; Avent, Neil D
2017-08-01
Paternal zygosity testing is used for determining homo- or hemizygosity of RHD in pregnancies that are at a risk of hemolytic disease of the fetus and newborn. At present, this is achieved by using real-time PCR or the Rhesus box PCR, which can be difficult to interpret and unreliable, particularly for black African populations. DNA samples extracted from 53 blood donors were analyzed using 2 multiplex reactions for RHD -specific targets against a reference ( AGO1 ) 2 to determine gene dosage by digital PCR. Results were compared with serological data, and the correct genotype for 2 discordant results was determined by long-range PCR (LR-PCR), next-generation sequencing, and conventional Sanger sequencing. The results showed clear and reliable determination of RHD zygosity using digital PCR and revealed that 4 samples did not match the serologically predicted genotype. Sanger sequencing and long-range PCR followed by next-generation sequencing revealed that the correct genotypes for samples 729M and 351D, which were serologically typed as R 1 R 2 (DCe/DcE), were R 2 r' (DcE/dCe) for 729M and R 1 r″ (DCe/dcE), R 0 r y (Dce/dCE), or R Z r (DCE/dce) for 351D, in concordance with the digital PCR data. Digital PCR provides a highly accurate method to rapidly define blood group zygosity and has clinical application in the analysis of Rh phenotyped or genotyped samples. The vast majority of current blood group genotyping platforms are not designed to define zygosity, and thus, this technique may be used to define paternal RH zygosity in pregnancies that are at a risk of hemolytic disease of the fetus and newborn and can distinguish between homo- and hemizygous RHD -positive individuals. © 2017 American Association for Clinical Chemistry.
Karolak, Justyna A; Gambin, Tomasz; Pitarque, Jose A; Molinari, Andrea; Jhangiani, Shalini; Stankiewicz, Pawel; Lupski, James R; Gajecka, Marzena
2017-01-01
Keratoconus (KTCN) is a protrusion and thinning of the cornea, resulting in impairment of visual function. The extreme genetic heterogeneity makes it difficult to discover factors unambiguously influencing the KTCN phenotype. In this study, we used whole-exome sequencing (WES) and Sanger sequencing to reduce the number of candidate genes at the 5q31.1–q35.3 locus and to prioritize other potentially relevant variants in an Ecuadorian family with KTCN. We applied WES in two affected KTCN individuals from the Ecuadorian family that showed a suggestive linkage between the KTCN phenotype and the 5q31.1–q35.3 locus. Putative variants identified by WES were further evaluated in this family using Sanger sequencing. Exome capture discovered a total of 173 rare (minor allele frequency <0.001 in control population) nonsynonymous variants in both affected individuals. Among them, 16 SNVs were selected for further evaluation. Segregation analysis revealed that variants c.475T>G in SKP1, c.671G>A in PROB1, and c.527G>A in IL17B in the 5q31.1–q35.3 linkage region, and c.850G>A in HKDC1 in the 10q22 locus completely segregated with the phenotype in the studied KTCN family. We demonstrate that a combination of various techniques significantly narrowed the studied genomic region and reduced the list of the putative exonic variants. Moreover, since this locus overlapped two other chromosomal regions previously recognized in distinct KTCN studies, our findings suggest that this 5q31.1–q35.3 locus might be linked with KTCN. PMID:27703147
Inzaule, Seth C; Hamers, Ralph L; Paredes, Roger; Yang, Chunfu; Schuurman, Rob; Rinke de Wit, Tobias F
2017-01-01
Global scale-up of antiretroviral treatment has dramatically changed the prospects of HIV/AIDS disease, rendering life-long chronic care and treatment a reality for millions of HIV-infected patients. Affordable technologies to monitor antiretroviral treatment are needed to ensure long-term durability of limited available drug regimens. HIV drug resistance tests can complement existing strategies in optimizing clinical decision-making for patients with treatment failure, in addition to facilitating population-based surveillance of HIV drug resistance. This review assesses the current landscape of HIV drug resistance technologies and discusses the strengths and limitations of existing assays available for expanding testing in resource-limited settings. These include sequencing-based assays (Sanger sequencing assays and nextgeneration sequencing), point mutation assays, and genotype-free data-based prediction systems. Sanger assays are currently considered the gold standard genotyping technology, though only available at a limited number of resource-limited setting reference and regional laboratories, but high capital and test costs have limited their wide expansion. Point mutation assays present opportunities for simplified laboratory assays, but HIV genetic variability, extensive codon redundancy at or near the mutation target sites with limited multiplexing capability have restricted their utility. Next-generation sequencing, despite high costs, may have potential to reduce the testing cost significantly through multiplexing in high-throughput facilities, although the level of bioinformatics expertise required for data analysis is currently still complex and expensive and lacks standardization. Web-based genotype-free prediction systems may provide enhanced antiretroviral treatment decision-making without the need for laboratory testing, but require further clinical field evaluation and implementation scientific research in resource-limited settings.
Next-Generation Sequencing of Aquatic Oligochaetes: Comparison of Experimental Communities
Vivien, Régis; Lejzerowicz, Franck; Pawlowski, Jan
2016-01-01
Aquatic oligochaetes are a common group of freshwater benthic invertebrates known to be very sensitive to environmental changes and currently used as bioindicators in some countries. However, more extensive application of oligochaetes for assessing the ecological quality of sediments in watercourses and lakes would require overcoming the difficulties related to morphology-based identification of oligochaetes species. This study tested the Next-Generation Sequencing (NGS) of a standard cytochrome c oxydase I (COI) barcode as a tool for the rapid assessment of oligochaete diversity in environmental samples, based on mixed specimen samples. To know the composition of each sample we Sanger sequenced every specimen present in these samples. Our study showed that a large majority of OTUs (Operational Taxonomic Unit) could be detected by NGS analyses. We also observed congruence between the NGS and specimen abundance data for several but not all OTUs. Because the differences in sequence abundance data were consistent across samples, we exploited these variations to empirically design correction factors. We showed that such factors increased the congruence between the values of oligochaetes-based indices inferred from the NGS and the Sanger-sequenced specimen data. The validation of these correction factors by further experimental studies will be needed for the adaptation and use of NGS technology in biomonitoring studies based on oligochaete communities. PMID:26866802
Novel mutations of ABCB6 associated with autosomal dominant dyschromatosis universalis hereditaria.
Cui, Ying-Xia; Xia, Xin-Yi; Zhou, Yang; Gao, Lin; Shang, Xue-Jun; Ni, Tong; Wang, Wei-Ping; Fan, Xiao-Buo; Yin, Hong-Lin; Jiang, Shao-Jun; Yao, Bing; Hu, Yu-An; Wang, Gang; Li, Xiao-Jun
2013-01-01
Dyschromatosis universalis hereditaria (DUH) is a rare heterogeneous pigmentary genodermatosis, which was first described in 1933. The genetic cause has recently been discovered by the discovery of mutations in ABCB6. Here we investigated a Chinese family with typical features of autosomal dominant DUH and 3 unrelated patients with sporadic DUH. Skin tissues were obtained from the proband, of this family and the 3 sporadic patients. Histopathological examination and immunohistochemical analysis of ABCB6 were performed. Peripheral blood DNA samples were obtained from 21 affected, 14 unaffected, 11 spouses in the family and the 3 sporadic patients. A genome-wide linkage scan for the family was carried out to localize the causative gene. Exome sequencing was performed from 3 affected and 1 unaffected in the family. Sanger sequencing of ABCB6 was further used to identify the causative gene for all samples obtained from available family members, the 3 sporadic patients and a panel of 455 ethnically-matched normal Chinese individuals. Histopathological analysis showed melanocytes in normal control's skin tissue and the hyperpigmented area contained more melanized, mature melanosomes than those within the hypopigmented areas. Empty immature melanosomes were found in the hypopigmented melanocytes. Parametric multipoint linkage analysis produced a HLOD score of 4.68, with markers on chromosome 2q35-q37.2. A missense mutation (c.1663 C>A, p.Gln555Lys) in ABCB6 was identified in this family by exome and Sanger sequencing. The mutation perfectly cosegregated with the skin phenotype. An additional mutation (g.776 delC, c.459 delC) in ABCB6 was found in an unrelated sporadic patient. No mutation in ABCB6 was discovered in the other two sporadic patients. Neither of the two mutations was present in the 455 controls. Melanocytes showed positive immunoreactivity to ABCB6. Our data add new variants to the repertoire of ABCB6 mutations with DUH.
Next Generation Sequencing Technologies: The Doorway to the Unexplored Genomics of Non-Model Plants
Unamba, Chibuikem I. N.; Nag, Akshay; Sharma, Ram K.
2015-01-01
Non-model plants i.e., the species which have one or all of the characters such as long life cycle, difficulty to grow in the laboratory or poor fecundity, have been schemed out of sequencing projects earlier, due to high running cost of Sanger sequencing. Consequently, the information about their genomics and key biological processes are inadequate. However, the advent of fast and cost effective next generation sequencing (NGS) platforms in the recent past has enabled the unearthing of certain characteristic gene structures unique to these species. It has also aided in gaining insight about mechanisms underlying processes of gene expression and secondary metabolism as well as facilitated development of genomic resources for diversity characterization, evolutionary analysis and marker assisted breeding even without prior availability of genomic sequence information. In this review we explore how different Next Gen Sequencing platforms, as well as recent advances in NGS based high throughput genotyping technologies are rewarding efforts on de-novo whole genome/transcriptome sequencing, development of genome wide sequence based markers resources for improvement of non-model crops that are less costly than phenotyping. PMID:26734016
Analysis of CHRNA7 rare variants in autism spectrum disorder susceptibility.
Bacchelli, Elena; Battaglia, Agatino; Cameli, Cinzia; Lomartire, Silvia; Tancredi, Raffaella; Thomson, Susanne; Sutcliffe, James S; Maestrini, Elena
2015-04-01
Chromosome 15q13.3 recurrent microdeletions are causally associated with a wide range of phenotypes, including autism spectrum disorder (ASD), seizures, intellectual disability, and other psychiatric conditions. Whether the reciprocal microduplication is pathogenic is less certain. CHRNA7, encoding for the alpha7 subunit of the neuronal nicotinic acetylcholine receptor, is considered the likely culprit gene in mediating neurological phenotypes in 15q13.3 deletion cases. To assess if CHRNA7 rare variants confer risk to ASD, we performed copy number variant analysis and Sanger sequencing of the CHRNA7 coding sequence in a sample of 135 ASD cases. Sequence variation in this gene remains largely unexplored, given the existence of a fusion gene, CHRFAM7A, which includes a nearly identical partial duplication of CHRNA7. Hence, attempts to sequence coding exons must distinguish between CHRNA7 and CHRFAM7A, making next-generation sequencing approaches unreliable for this purpose. A CHRNA7 microduplication was detected in a patient with autism and moderate cognitive impairment; while no rare damaging variants were identified in the coding region, we detected rare variants in the promoter region, previously described to functionally reduce transcription. This study represents the first sequence variant analysis of CHRNA7 in a sample of idiopathic autism. © 2015 Wiley Periodicals, Inc.
Yang, Tsun-Po; Beazley, Claude; Montgomery, Stephen B; Dimas, Antigone S; Gutierrez-Arcelus, Maria; Stranger, Barbara E; Deloukas, Panos; Dermitzakis, Emmanouil T
2010-10-01
Genevar (GENe Expression VARiation) is a database and Java tool designed to integrate multiple datasets, and provides analysis and visualization of associations between sequence variation and gene expression. Genevar allows researchers to investigate expression quantitative trait loci (eQTL) associations within a gene locus of interest in real time. The database and application can be installed on a standard computer in database mode and, in addition, on a server to share discoveries among affiliations or the broader community over the Internet via web services protocols. http://www.sanger.ac.uk/resources/software/genevar.
[Mutation analysis for a pedigree affected with keratitis-ichthyosis-deafness syndrome].
Li, Lulu; Li, Yuan; Lin, Wei; Zhao, Xiuli
2017-10-10
To identify mutation of GJB2 gene and provide genetic counseling for a family affected with keratitis-ichthyosis-deafness (KID) syndrome. Genomic DNA was extracted from peripheral blood samples with a standard phenol-chloroform method. PCR and Sanger sequencing were used to analyze potential mutation in the proband. Suspected mutation was verified with a PCR-high-resolution melting (PCR-HRM) method. T-clone sequencing was applied to determine the parental origin of the mutation. A heterozygous mutation, c.148G>A (p.Asp50Asn), which is located in the exon 1 of the GJB2 gene, was found in the proband. The results was confirmed by HRM analysis. Cloning sequencing suggested that the mutation was derived from the father's germline. The hot-spot mutation c.148G>A (p.Asp50Asn) in the GJB2 gene probably underlies the KID syndrome in this Chinese family. A PCR-HRM method has been established to rapidly detect common mutations associated with this disease.
Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers.
Varshney, Rajeev K; Chen, Wenbin; Li, Yupeng; Bharti, Arvind K; Saxena, Rachit K; Schlueter, Jessica A; Donoghue, Mark T A; Azam, Sarwar; Fan, Guangyi; Whaley, Adam M; Farmer, Andrew D; Sheridan, Jaime; Iwata, Aiko; Tuteja, Reetu; Penmetsa, R Varma; Wu, Wei; Upadhyaya, Hari D; Yang, Shiaw-Pyng; Shah, Trushar; Saxena, K B; Michael, Todd; McCombie, W Richard; Yang, Bicheng; Zhang, Gengyun; Yang, Huanming; Wang, Jun; Spillane, Charles; Cook, Douglas R; May, Gregory D; Xu, Xun; Jackson, Scott A
2011-11-06
Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sanger-based bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds representing 72.7% (605.78 Mb) of the 833.07 Mb pigeonpea genome. Genome analysis predicted 48,680 genes for pigeonpea and also showed the potential role that certain gene families, for example, drought tolerance-related genes, have played throughout the domestication of pigeonpea and the evolution of its ancestors. Although we found a few segmental duplication events, we did not observe the recent genome-wide duplication events observed in soybean. This reference genome sequence will facilitate the identification of the genetic basis of agronomically important traits, and accelerate the development of improved pigeonpea varieties that could improve food security in many developing countries.
Clinical and genetic characteristics of chinese patients with Birt-Hogg-Dubé syndrome.
Liu, Yaping; Xu, Zhiyan; Feng, Ruie; Zhan, Yongzhong; Wang, Jun; Li, Guozhen; Li, Xue; Zhang, Weihong; Hu, Xiaowen; Tian, Xinlun; Xu, Kai-Feng; Zhang, Xue
2017-05-30
Birt-Hogg-Dubé syndrome (BHD) is an autosomal dominant disorder, the main manifestations of which are fibrofolliculomas, renal tumors, pulmonary cysts and recurrent pneumothorax. The known causative gene for BHD syndrome is the folliculin (FLCN) gene on chromosome 17p11.2. Studies of the FLCN mutation for BHD syndrome are less prevalent in Chinese populations than in Caucasian populations. Our study aims to investigate the genotype spectrum in a group of Chinese patients with BHD. We enrolled 51 patients with symptoms highly suggestive of BHD from January 2014 to February 2017. The FLCN gene was examined using PCR and Sanger sequencing in every patient, for those whose Sanger sequencing showed negative mutation results, multiplex ligation-dependent probe amplification (MLPA) testing was conducted to detect any losses of large segments. Among the 51 patients, 27 had FLCN germline mutations. In total, 20 mutations were identified: 14 were novel mutations, including 3 splice acceptor site mutations, 2 different deletions, 6 nonsense mutations, 1 missense mutation, 1 small insertion, and 1 deletion of the whole exon 8. We found a similar genotype spectrum but different mutant loci in Chinese patients with BHD compared with European and American patients, thus providing stronger evidence for the clinical molecular diagnosis of BHD in China. It suggests that mutation analysis of the FLCN gene should be systematically conducted in patients with cystic lung diseases.
A novel COLD-PCR/FMCA assay enhances the detection of low-abundance IDH1 mutations in gliomas.
Pang, Brendan; Durso, Mary B; Hamilton, Ronald L; Nikiforova, Marina N
2013-03-01
Point mutations in isocitrate dehydrogenase 1 (IDH1) have been identified in many gliomas. The detection of IDH1 mutations becomes challenging on suboptimal glioma biopsies when a limited number of tumor cells is available for analysis. Coamplification at lower denaturing-polymerase chain reaction (COLD-PCR) is a PCR technique that deliberately lowers the denaturing cycle temperature to selectively favor amplification of mutant alleles, allowing for the sensitive detection of low-abundance mutations. We developed a novel COLD-PCR assay on the LightCycler platform (Roche, Applied Science, Indianapolis, IN), using post-PCR fluorescent melting curve analysis (FMCA) for the detection of mutant IDH1 with a detection limit of 1%. Thirty-five WHO grade I to IV gliomas and 9 non-neoplastic brain and spinal cord biopsies were analyzed with this technique and the results were compared with the conventional real-time PCR and the Sanger sequencing analysis. COLD-PCR/FMCA was able to detect the most common IDH1 R132H mutation and rare mutation types including R132H, R132C, R132L, R132S, and R132G mutations. Twenty-five glioma cases were positive for IDH1 mutations by COLD-PCR/FMCA, and 23 gliomas were positive by the conventional real-time PCR and Sanger sequencing. A pilocytic astrocytoma (PA I) and a glioblastoma multiforme (GBM IV) showed low-abundance IDH1 mutations detected by COLD-PCR/FMCA. The remaining 10 glioma and 9 non-neoplastic samples were negative by all the 3 methods. In summary, we report a novel COLD-PCR/FMCA method that provides rapid and sensitive detection of IDH1 mutations in formalin-fixed paraffin-embedded tissue and can be used in the clinical setting to assess the small brain biopsies.
Gullapalli, Rama R; Desai, Ketaki V; Santana-Santos, Lucas; Kant, Jeffrey A; Becich, Michael J
2012-01-01
The Human Genome Project (HGP) provided the initial draft of mankind's DNA sequence in 2001. The HGP was produced by 23 collaborating laboratories using Sanger sequencing of mapped regions as well as shotgun sequencing techniques in a process that occupied 13 years at a cost of ~$3 billion. Today, Next Generation Sequencing (NGS) techniques represent the next phase in the evolution of DNA sequencing technology at dramatically reduced cost compared to traditional Sanger sequencing. A single laboratory today can sequence the entire human genome in a few days for a few thousand dollars in reagents and staff time. Routine whole exome or even whole genome sequencing of clinical patients is well within the realm of affordability for many academic institutions across the country. This paper reviews current sequencing technology methods and upcoming advancements in sequencing technology as well as challenges associated with data generation, data manipulation and data storage. Implementation of routine NGS data in cancer genomics is discussed along with potential pitfalls in the interpretation of the NGS data. The overarching importance of bioinformatics in the clinical implementation of NGS is emphasized.[7] We also review the issue of physician education which also is an important consideration for the successful implementation of NGS in the clinical workplace. NGS technologies represent a golden opportunity for the next generation of pathologists to be at the leading edge of the personalized medicine approaches coming our way. Often under-emphasized issues of data access and control as well as potential ethical implications of whole genome NGS sequencing are also discussed. Despite some challenges, it's hard not to be optimistic about the future of personalized genome sequencing and its potential impact on patient care and the advancement of knowledge of human biology and disease in the near future.
Gullapalli, Rama R.; Desai, Ketaki V.; Santana-Santos, Lucas; Kant, Jeffrey A.; Becich, Michael J.
2012-01-01
The Human Genome Project (HGP) provided the initial draft of mankind's DNA sequence in 2001. The HGP was produced by 23 collaborating laboratories using Sanger sequencing of mapped regions as well as shotgun sequencing techniques in a process that occupied 13 years at a cost of ~$3 billion. Today, Next Generation Sequencing (NGS) techniques represent the next phase in the evolution of DNA sequencing technology at dramatically reduced cost compared to traditional Sanger sequencing. A single laboratory today can sequence the entire human genome in a few days for a few thousand dollars in reagents and staff time. Routine whole exome or even whole genome sequencing of clinical patients is well within the realm of affordability for many academic institutions across the country. This paper reviews current sequencing technology methods and upcoming advancements in sequencing technology as well as challenges associated with data generation, data manipulation and data storage. Implementation of routine NGS data in cancer genomics is discussed along with potential pitfalls in the interpretation of the NGS data. The overarching importance of bioinformatics in the clinical implementation of NGS is emphasized.[7] We also review the issue of physician education which also is an important consideration for the successful implementation of NGS in the clinical workplace. NGS technologies represent a golden opportunity for the next generation of pathologists to be at the leading edge of the personalized medicine approaches coming our way. Often under-emphasized issues of data access and control as well as potential ethical implications of whole genome NGS sequencing are also discussed. Despite some challenges, it's hard not to be optimistic about the future of personalized genome sequencing and its potential impact on patient care and the advancement of knowledge of human biology and disease in the near future. PMID:23248761
Anasagasti, Ander; Barandika, Olatz; Irigoyen, Cristina; Benitez, Bruno A; Cooper, Breanna; Cruchaga, Carlos; López de Munain, Adolfo; Ruiz-Ederra, Javier
2013-11-01
Retinitis Pigmentosa (RP) involves a group of genetically determined retinal diseases caused by a large number of mutations that result in rod photoreceptor cell death followed by gradual death of cone cells. Most cases of RP are monogenic, with more than 80 associated genes identified so far. The high number of genes and variants involved in RP, among other factors, is making the molecular characterization of RP a real challenge for many patients. Although HRM has been used for the analysis of isolated variants or single RP genes, as far as we are concerned, this is the first study that uses HRM analysis for a high-throughput screening of several RP genes. Our main goal was to test the suitability of HRM analysis as a genetic screening technique in RP, and to compare its performance with two of the most widely used NGS platforms, Illumina and PGM-Ion Torrent technologies. RP patients (n = 96) were clinically diagnosed at the Ophthalmology Department of Donostia University Hospital, Spain. We analyzed a total of 16 RP genes that meet the following inclusion criteria: 1) size: genes with transcripts of less than 4 kb; 2) number of exons: genes with up to 22 exons; and 3) prevalence: genes reported to account for, at least, 0.4% of total RP cases worldwide. For comparison purposes, RHO gene was also sequenced with Illumina (GAII; Illumina), Ion semiconductor technologies (PGM; Life Technologies) and Sanger sequencing (ABI 3130xl platform; Applied Biosystems). Detected variants were confirmed in all cases by Sanger sequencing and tested for co-segregation in the family of affected probands. We identified a total of 65 genetic variants, 15 of which (23%) were novel, in 49 out of 96 patients. Among them, 14 (4 novel) are probable disease-causing genetic variants in 7 RP genes, affecting 15 patients. Our HRM analysis-based study, proved to be a cost-effective and rapid method that provides an accurate identification of genetic RP variants. This approach is effective for medium sized (<4 kb transcript) RP genes, which constitute over 80% of the total of known RP genes.
Anasagasti, Ander; Barandika, Olatz; Irigoyen, Cristina; Benitez, Bruno A; Cooper, Breanna; Cruchaga, Carlos; López de Munain, Adolfo; Ruiz-Ederra, Javier
2013-10-24
Retinitis Pigmentosa (RP) involves a group of genetically determined retinal diseases caused by a large number of mutations that result in rod photoreceptor cell death followed by gradual death of cone cells. Most cases of RP are monogenic, with more than 80 associated genes identified so far. The high number of genes and variants involved in RP, among other factors, is making the molecular characterization of RP a real challenge for many patients. Although HRM has been used for the analysis of isolated variants or single RP genes, as far as we are concerned, this is the first study that uses HRM analysis for a high-throughput screening of several RP genes. Our main goal was to test the suitability of HRM analysis as a genetic screening technique in RP, and to compare its performance with two of the most widely used NGS platforms, Illumina and PGM-Ion Torrent technologies. RP patients (n=96) were clinically diagnosed at the Ophthalmology Department of Donostia University Hospital, Spain. We analyzed a total of 16 RP genes that meet the following inclusion criteria: 1) size: genes with transcripts of less than 4 kb; 2) number of exons: genes with up to 22 exons; and 3) prevalence: genes reported to account for, at least, 0.4 % of total RP cases worldwide. For comparison purposes, RHO gene was also sequenced with Illumina (GAII; Illumina), Ion semiconductor technologies (PGM; Life Technologies) and Sanger sequencing (ABI 3130xl platform; Applied Biosystems). Detected variants were confirmed in all cases by Sanger sequencing and tested for co-segregation in the family of affected probands. We identified a total of 65 genetic variants, 15 of which (23%) were novel, in 49 out of 96 patients. Among them, 14 (4 novel) are probable disease-causing genetic variants in 7 RP genes, affecting 15 patients. Our HRM analysis-based study, proved to be a cost-effective and rapid method that provides an accurate identification of genetic RP variants. This approach is effective for medium sized (<4 kb transcript) RP genes, which constitute over 80% of the total of known RP genes. © 2013 Published by Elsevier Ltd.
Implementation of Quality Management in Core Service Laboratories
Creavalle, T.; Haque, K.; Raley, C.; Subleski, M.; Smith, M.W.; Hicks, B.
2010-01-01
CF-28 The Genetics and Genomics group of the Advanced Technology Program of SAIC-Frederick exists to bring innovative genomic expertise, tools and analysis to NCI and the scientific community. The Sequencing Facility (SF) provides next generation short read (Illumina) sequencing capacity to investigators using a streamlined production approach. The Laboratory of Molecular Technology (LMT) offers a wide range of genomics core services including microarray expression analysis, miRNA analysis, array comparative genome hybridization, long read (Roche) next generation sequencing, quantitative real time PCR, transgenic genotyping, Sanger sequencing, and clinical mutation detection services to investigators from across the NIH. As the technology supporting this genomic research becomes more complex, the need for basic quality processes within all aspects of the core service groups becomes critical. The Quality Management group works alongside members of these labs to establish or improve processes supporting operations control (equipment, reagent and materials management), process improvement (reengineering/optimization, automation, acceptance criteria for new technologies and tech transfer), and quality assurance and customer support (controlled documentation/SOPs, training, service deficiencies and continual improvement efforts). Implementation and expansion of quality programs within unregulated environments demonstrates SAIC-Frederick's dedication to providing the highest quality products and services to the NIH community.
A novel homozygous mutation in the FSHR gene is causative for primary ovarian insufficiency.
Liu, Hongli; Xu, Xiaofei; Han, Ting; Yan, Lei; Cheng, Lei; Qin, Yingying; Liu, Wen; Zhao, Shidou; Chen, Zi-Jiang
2017-12-01
To identify the potential FSHR mutation in a Chinese woman with primary ovarian insufficiency (POI). Genetic and functional studies. University-based reproductive medicine center. A POI patient, her family members, and another 192 control women with regular menstruation. Ovarian biopsy was performed in the patient. Sanger sequencing was carried out for the patient, her sister, and parents. The novel variant identified was further confirmed with the use of control subjects. Sanger sequencing and genotype analysis to identify the potential variant of the FSHR gene; hematoxylin and eosin staining of the ovarian section to observe the follicular development; Western blotting and immunofluorescence to detect FSH receptor (FSHR) expression; and cyclic adenosine monophosphate (cAMP) assay to monitor FSH-induced signaling. Histologic examination of the ovaries in the patient revealed follicular development up to the early antral stage. Mutational screening and genotype analysis of the FSHR gene identified a novel homozygous mutation c.175C>T (p.R59X) in exon 2, which was inherited in the autosomal recessive mode from her heterozygous parents but was absent in her sister and the 192 control women. Functional studies demonstrated that in vitro the nonsense mutation caused the loss of full-length FSHR expression and that p.R59X mutant showed no response to FSH stimulation in the cAMP level. The mutation p.R59X in FSHR is causative for POI by means of arresting folliculogenesis. Copyright © 2017 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
A vertebrate case study of the quality of assemblies derived from next-generation sequences
2011-01-01
The unparalleled efficiency of next-generation sequencing (NGS) has prompted widespread adoption, but significant problems remain in the use of NGS data for whole genome assembly. We explore the advantages and disadvantages of chicken genome assemblies generated using a variety of sequencing and assembly methodologies. NGS assemblies are equivalent in some ways to a Sanger-based assembly yet deficient in others. Nonetheless, these assemblies are sufficient for the identification of the majority of genes and can reveal novel sequences when compared to existing assembly references. PMID:21453517
Ribas, Laia; Pardo, Belén G; Fernández, Carlos; Alvarez-Diós, José Antonio; Gómez-Tato, Antonio; Quiroga, María Isabel; Planas, Josep V; Sitjà-Bobadilla, Ariadna; Martínez, Paulino; Piferrer, Francesc
2013-03-15
Genomic resources for plant and animal species that are under exploitation primarily for human consumption are increasingly important, among other things, for understanding physiological processes and for establishing adequate genetic selection programs. Current available techniques for high-throughput sequencing have been implemented in a number of species, including fish, to obtain a proper description of the transcriptome. The objective of this study was to generate a comprehensive transcriptomic database in turbot, a highly priced farmed fish species in Europe, with potential expansion to other areas of the world, for which there are unsolved production bottlenecks, to understand better reproductive- and immune-related functions. This information is essential to implement marker assisted selection programs useful for the turbot industry. Expressed sequence tags were generated by Sanger sequencing of cDNA libraries from different immune-related tissues after several parasitic challenges. The resulting database ("Turbot 2 database") was enlarged with sequences generated from a 454 sequencing run of brain-hypophysis-gonadal axis-derived RNA obtained from turbot at different development stages. The assembly of Sanger and 454 sequences generated 52,427 consensus sequences ("Turbot 3 database"), of which 23,661 were successfully annotated. A total of 1,410 sequences were confirmed to be related to reproduction and key genes involved in sex differentiation and maturation were identified for the first time in turbot (AR, AMH, SRY-related genes, CYP19A, ZPGs, STAR FSHR, etc.). Similarly, 2,241 sequences were related to the immune system and several novel key immune genes were identified (BCL, TRAF, NCK, CD28 and TOLLIP, among others). The number of genes of many relevant reproduction- and immune-related pathways present in the database was 50-90% of the total gene count of each pathway. In addition, 1,237 microsatellites and 7,362 single nucleotide polymorphisms (SNPs) were also compiled. Further, 2,976 putative natural antisense transcripts (NATs) including microRNAs were also identified. The combined sequencing strategies employed here significantly increased the turbot genomic resources available, including 34,400 novel sequences. The generated database contains a larger number of genes relevant for reproduction- and immune-associated studies, with an excellent coverage of most genes present in many relevant physiological pathways. This database also allowed the identification of many microsatellites and SNP markers that will be very useful for population and genome screening and a valuable aid in marker assisted selection programs.
2013-01-01
Background Genomic resources for plant and animal species that are under exploitation primarily for human consumption are increasingly important, among other things, for understanding physiological processes and for establishing adequate genetic selection programs. Current available techniques for high-throughput sequencing have been implemented in a number of species, including fish, to obtain a proper description of the transcriptome. The objective of this study was to generate a comprehensive transcriptomic database in turbot, a highly priced farmed fish species in Europe, with potential expansion to other areas of the world, for which there are unsolved production bottlenecks, to understand better reproductive- and immune-related functions. This information is essential to implement marker assisted selection programs useful for the turbot industry. Results Expressed sequence tags were generated by Sanger sequencing of cDNA libraries from different immune-related tissues after several parasitic challenges. The resulting database (“Turbot 2 database”) was enlarged with sequences generated from a 454 sequencing run of brain-hypophysis-gonadal axis-derived RNA obtained from turbot at different development stages. The assembly of Sanger and 454 sequences generated 52,427 consensus sequences (“Turbot 3 database”), of which 23,661 were successfully annotated. A total of 1,410 sequences were confirmed to be related to reproduction and key genes involved in sex differentiation and maturation were identified for the first time in turbot (AR, AMH, SRY-related genes, CYP19A, ZPGs, STAR FSHR, etc.). Similarly, 2,241 sequences were related to the immune system and several novel key immune genes were identified (BCL, TRAF, NCK, CD28 and TOLLIP, among others). The number of genes of many relevant reproduction- and immune-related pathways present in the database was 50–90% of the total gene count of each pathway. In addition, 1,237 microsatellites and 7,362 single nucleotide polymorphisms (SNPs) were also compiled. Further, 2,976 putative natural antisense transcripts (NATs) including microRNAs were also identified. Conclusions The combined sequencing strategies employed here significantly increased the turbot genomic resources available, including 34,400 novel sequences. The generated database contains a larger number of genes relevant for reproduction- and immune-associated studies, with an excellent coverage of most genes present in many relevant physiological pathways. This database also allowed the identification of many microsatellites and SNP markers that will be very useful for population and genome screening and a valuable aid in marker assisted selection programs. PMID:23497389
Analysis of quality raw data of second generation sequencers with Quality Assessment Software.
Ramos, Rommel Tj; Carneiro, Adriana R; Baumbach, Jan; Azevedo, Vasco; Schneider, Maria Pc; Silva, Artur
2011-04-18
Second generation technologies have advantages over Sanger; however, they have resulted in new challenges for the genome construction process, especially because of the small size of the reads, despite the high degree of coverage. Independent of the program chosen for the construction process, DNA sequences are superimposed, based on identity, to extend the reads, generating contigs; mismatches indicate a lack of homology and are not included. This process improves our confidence in the sequences that are generated. We developed Quality Assessment Software, with which one can review graphs showing the distribution of quality values from the sequencing reads. This software allow us to adopt more stringent quality standards for sequence data, based on quality-graph analysis and estimated coverage after applying the quality filter, providing acceptable sequence coverage for genome construction from short reads. Quality filtering is a fundamental step in the process of constructing genomes, as it reduces the frequency of incorrect alignments that are caused by measuring errors, which can occur during the construction process due to the size of the reads, provoking misassemblies. Application of quality filters to sequence data, using the software Quality Assessment, along with graphing analyses, provided greater precision in the definition of cutoff parameters, which increased the accuracy of genome construction.
Next generation sequencing (NGS): a golden tool in forensic toolkit.
Aly, S M; Sabri, D M
The DNA analysis is a cornerstone in contemporary forensic sciences. DNA sequencing technologies are powerful tools that enrich molecular sciences in the past based on Sanger sequencing and continue to glowing these sciences based on Next generation sequencing (NGS). Next generation sequencing has excellent potential to flourish and increase the molecular applications in forensic sciences by jumping over the pitfalls of the conventional method of sequencing. The main advantages of NGS compared to conventional method that it utilizes simultaneously a large number of genetic markers with high-resolution of genetic data. These advantages will help in solving several challenges such as mixture analysis and dealing with minute degraded samples. Based on these new technologies, many markers could be examined to get important biological data such as age, geographical origins, tissue type determination, external visible traits and monozygotic twins identification. It also could get data related to microbes, insects, plants and soil which are of great medico-legal importance. Despite the dozens of forensic research involving NGS, there are requirements before using this technology routinely in forensic cases. Thus, there is a great need to more studies that address robustness of these techniques. Therefore, this work highlights the applications of forensic sciences in the era of massively parallel sequencing.
Zheng, Yang; Cai, Jing; Li, JianWen; Li, Bo; Lin, Runmao; Tian, Feng; Wang, XiaoLing; Wang, Jun
2010-01-01
A 10-fold BAC library for giant panda was constructed and nine BACs were selected to generate finish sequences. These BACs could be used as a validation resource for the de novo assembly accuracy of the whole genome shotgun sequencing reads of giant panda newly generated by the Illumina GA sequencing technology. Complete sanger sequencing, assembly, annotation and comparative analysis were carried out on the selected BACs of a joint length 878 kb. Homologue search and de novo prediction methods were used to annotate genes and repeats. Twelve protein coding genes were predicted, seven of which could be functionally annotated. The seven genes have an average gene size of about 41 kb, an average coding size of about 1.2 kb and an average exon number of 6 per gene. Besides, seven tRNA genes were found. About 27 percent of the BAC sequence is composed of repeats. A phylogenetic tree was constructed using neighbor-join algorithm across five species, including giant panda, human, dog, cat and mouse, which reconfirms dog as the most related species to giant panda. Our results provide detailed sequence and structure information for new genes and repeats of giant panda, which will be helpful for further studies on the giant panda.
Song, Ju Yeon; Jeong, Haeyoung; Yu, Dong Su; Fischbach, Michael A.; Park, Hong-Seog; Kim, Jae Jong; Seo, Jeong-Sun; Jensen, Susan E.; Oh, Tae Kwang; Lee, Kye Joon; Kim, Jihyun F.
2010-01-01
Streptomyces clavuligerus is an important industrial strain that produces a number of antibiotics, including clavulanic acid and cephamycin C. A high-quality draft genome sequence of the S. clavuligerus NRRL 3585 strain was produced by employing a hybrid approach that involved Sanger sequencing, Roche/454 pyrosequencing, optical mapping, and partial finishing. Its genome, comprising four linear replicons, one chromosome, and four plasmids, carries numerous sets of genes involved in the biosynthesis of secondary metabolites, including a variety of antibiotics. PMID:20889745
Dusi, Sabrina; Valletta, Lorella; Haack, Tobias B.; Tsuchiya, Yugo; Venco, Paola; Pasqualato, Sebastiano; Goffrini, Paola; Tigano, Marco; Demchenko, Nikita; Wieland, Thomas; Schwarzmayr, Thomas; Strom, Tim M.; Invernizzi, Federica; Garavaglia, Barbara; Gregory, Allison; Sanford, Lynn; Hamada, Jeffrey; Bettencourt, Conceição; Houlden, Henry; Chiapparini, Luisa; Zorzi, Giovanna; Kurian, Manju A.; Nardocci, Nardo; Prokisch, Holger; Hayflick, Susan; Gout, Ivan; Tiranti, Valeria
2014-01-01
Neurodegeneration with brain iron accumulation (NBIA) comprises a clinically and genetically heterogeneous group of disorders with progressive extrapyramidal signs and neurological deterioration, characterized by iron accumulation in the basal ganglia. Exome sequencing revealed the presence of recessive missense mutations in COASY, encoding coenzyme A (CoA) synthase in one NBIA-affected subject. A second unrelated individual carrying mutations in COASY was identified by Sanger sequence analysis. CoA synthase is a bifunctional enzyme catalyzing the final steps of CoA biosynthesis by coupling phosphopantetheine with ATP to form dephospho-CoA and its subsequent phosphorylation to generate CoA. We demonstrate alterations in RNA and protein expression levels of CoA synthase, as well as CoA amount, in fibroblasts derived from the two clinical cases and in yeast. This is the second inborn error of coenzyme A biosynthesis to be implicated in NBIA. PMID:24360804
Quasispecies Analyses of the HIV-1 Near-full-length Genome With Illumina MiSeq
Ode, Hirotaka; Matsuda, Masakazu; Matsuoka, Kazuhiro; Hachiya, Atsuko; Hattori, Junko; Kito, Yumiko; Yokomaku, Yoshiyuki; Iwatani, Yasumasa; Sugiura, Wataru
2015-01-01
Human immunodeficiency virus type-1 (HIV-1) exhibits high between-host genetic diversity and within-host heterogeneity, recognized as quasispecies. Because HIV-1 quasispecies fluctuate in terms of multiple factors, such as antiretroviral exposure and host immunity, analyzing the HIV-1 genome is critical for selecting effective antiretroviral therapy and understanding within-host viral coevolution mechanisms. Here, to obtain HIV-1 genome sequence information that includes minority variants, we sought to develop a method for evaluating quasispecies throughout the HIV-1 near-full-length genome using the Illumina MiSeq benchtop deep sequencer. To ensure the reliability of minority mutation detection, we applied an analysis method of sequence read mapping onto a consensus sequence derived from de novo assembly followed by iterative mapping and subsequent unique error correction. Deep sequencing analyses of aHIV-1 clone showed that the analysis method reduced erroneous base prevalence below 1% in each sequence position and discarded only < 1% of all collected nucleotides, maximizing the usage of the collected genome sequences. Further, we designed primer sets to amplify the HIV-1 near-full-length genome from clinical plasma samples. Deep sequencing of 92 samples in combination with the primer sets and our analysis method provided sufficient coverage to identify >1%-frequency sequences throughout the genome. When we evaluated sequences of pol genes from 18 treatment-naïve patients' samples, the deep sequencing results were in agreement with Sanger sequencing and identified numerous additional minority mutations. The results suggest that our deep sequencing method would be suitable for identifying within-host viral population dynamics throughout the genome. PMID:26617593
A novel COL4A3 mutation causes autosomal-recessive Alport syndrome in a large Turkish family.
Uzak, Asli Subasioglu; Tokgoz, Bulent; Dundar, Munis; Tekin, Mustafa
2013-03-01
Alport syndrome (AS) is a genetically heterogeneous disorder that is characterized by hematuria, progressive renal failure typically resulting in end-stage renal disease, sensorineural hearing loss, and variable ocular abnormalities. Only 15% of cases with AS are autosomal recessive and are caused by mutations in the COL4A3 or COL4A4 genes, encoding type IV collagen. Clinical data in a large consanguineous family with four affected members were reviewed, and genomic DNA was extracted. For mapping, 15 microsatellite markers flanking COL4A3, COL4A4, and COL4A5 in 16 family members were typed. For mutation screening, all coding exons of COL4A3 were polymerase chain reaction- amplified and Sanger-sequenced from genomic DNA. The disease locus was mapped to chromosome 2q36.3, where COL4A3 and COL4A4 reside. Sanger sequencing revealed a novel mis-sense mutation (c.2T>C; p.M1T) in exon 1 of COL4A3. The identified nucleotide change was not found in 100 healthy ethnicity-matched controls via Sanger sequencing. We present a large consanguineous Turkish family with AS that was found to have a COL4A3 mutation as the cause of the disease. Although the relationship between the various genotypes and phenotypes in AS has not been fully elucidated, detailed clinical and molecular analyses are helpful for providing data to be used in genetic counseling. It is important to identify new mutations to clarify their clinical importance, to assess the prognosis of the disease, and to avoid renal biopsy for final diagnosis.
Gürtler, Nicolas; Röthlisberger, Benno; Ludin, Katja; Schlegel, Christoph; Lalwani, Anil K
2017-07-01
Identification of the causative mutation using next-generation sequencing in autosomal-dominant hereditary hearing impairment, as mutation analysis in hereditary hearing impairment by classic genetic methods, is hindered by the high heterogeneity of the disease. Two Swiss families with autosomal-dominant hereditary hearing impairment. Amplified DNA libraries for next-generation sequencing were constructed from extracted genomic DNA, derived from peripheral blood, and enriched by a custom-made sequence capture library. Validated, pooled libraries were sequenced on an Illumina MiSeq instrument, 300 cycles and paired-end sequencing. Technical data analysis was performed with SeqMonk, variant analysis with GeneTalk or VariantStudio. The detection of mutations in genes related to hearing loss by next-generation sequencing was subsequently confirmed using specific polymerase-chain-reaction and Sanger sequencing. Mutation detection in hearing-loss-related genes. The first family harbored the mutation c.5383+5delGTGA in the TECTA-gene. In the second family, a novel mutation c.2614-2625delCATGGCGCCGTG in the WFS1-gene and a second mutation TCOF1-c.1028G>A were identified. Next-generation sequencing successfully identified the causative mutation in families with autosomal-dominant hereditary hearing impairment. The results helped to clarify the pathogenic role of a known mutation and led to the detection of a novel one. NGS represents a feasible approach with great potential future in the diagnostics of hereditary hearing impairment, even in smaller labs.
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-01-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield. PMID:25333064
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-09-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield.
Telele, Nigus Fikrie; Kalu, Amare Worku; Gebre-Selassie, Solomon; Fekade, Daniel; Abdurahman, Samir; Marrone, Gaetano; Neogi, Ujjwal; Tegbaru, Belete; Sönnerborg, Anders
2018-05-15
Baseline plasma samples of 490 randomly selected antiretroviral therapy (ART) naïve patients from seven hospitals participating in the first nationwide Ethiopian HIV-1 cohort were analysed for surveillance drug resistance mutations (sDRM) by population based Sanger sequencing (PBSS). Also next generation sequencing (NGS) was used in a subset of 109 baseline samples of patients. Treatment outcome after 6- and 12-months was assessed by on-treatment (OT) and intention-to-treat (ITT) analyses. Transmitted drug resistance (TDR) was detected in 3.9% (18/461) of successfully sequenced samples by PBSS. However, NGS detected sDRM more often (24%; 26/109) than PBSS (6%; 7/109) (p = 0.0001) and major integrase strand transfer inhibitors (INSTI) DRMs were also found in minor viral variants from five patients. Patients with sDRM had more frequent treatment failure in both OT and ITT analyses. The high rate of TDR by NGS and the identification of preexisting INSTI DRMs in minor wild-type HIV-1 subtype C viral variants infected Ethiopian patients underscores the importance of TDR surveillance in low- and middle-income countries and shows added value of high-throughput NGS in such studies.
Severe infantile leigh syndrome associated with a rare mitochondrial ND6 mutation, m.14487T>C.
Tarnopolsky, Mark; Meaney, Brandon; Robinson, Brian; Sheldon, Katherine; Boles, Richard G
2013-08-01
We describe a case of severe infantile-onset complex I deficiency in association with an apparent de novo near-homoplasmic mutation (m.14487T>C) in the mitochondrial ND6 gene, which was previously associated with Leigh syndrome and other neurological disorders. The mutation was near-homoplasmic in muscle by NextGen sequencing (99.4% mutant), homoplasmic in muscle by Sanger sequencing, and it was associated with a severe complex I deficiency in both muscle and fibroblasts. This supports previous data regarding Leigh syndrome being on the severe end of a phenotypic spectrum including progressive myoclonic epilepsy, childhood-onset dystonia, bilateral striatal necrosis, and optic atrophy, depending on the proportion of mutant heteroplasmy. While the mother in all previously reported cases was heteroplasmic, the mother and brother of this case were homoplasmic for the wild-type, m.14487T. Importantly, the current data demonstrate the potential for cases of mutations that were previously reported to be homoplasmic by Sanger sequencing to be less homoplasmic by NextGen sequencing. This case underscores the importance of considering mitochondrial DNA mutations in families with a negative family history, even in offspring of those who have tested negative for a specific mtDNA mutation. Copyright © 2013 Wiley Periodicals, Inc.
Sample, Paul J.; Gaston, Kirk W.; Alfonzo, Juan D.; Limbach, Patrick A.
2015-01-01
Ribosomal ribonucleic acid (RNA), transfer RNA and other biological or synthetic RNA polymers can contain nucleotides that have been modified by the addition of chemical groups. Traditional Sanger sequencing methods cannot establish the chemical nature and sequence of these modified-nucleotide containing oligomers. Mass spectrometry (MS) has become the conventional approach for determining the nucleotide composition, modification status and sequence of modified RNAs. Modified RNAs are analyzed by MS using collision-induced dissociation tandem mass spectrometry (CID MS/MS), which produces a complex dataset of oligomeric fragments that must be interpreted to identify and place modified nucleosides within the RNA sequence. Here we report the development of RoboOligo, an interactive software program for the robust analysis of data generated by CID MS/MS of RNA oligomers. There are three main functions of RoboOligo: (i) automated de novo sequencing via the local search paradigm. (ii) Manual sequencing with real-time spectrum labeling and cumulative intensity scoring. (iii) A hybrid approach, coined ‘variable sequencing’, which combines the user intuition of manual sequencing with the high-throughput sampling of automated de novo sequencing. PMID:25820423
Single-cell genomic sequencing using Multiple Displacement Amplification.
Lasken, Roger S
2007-10-01
Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
Kappel, Kristina; Haase, Ilka; Käppel, Christine; Sotelo, Carmen G; Schröder, Ute
2017-11-01
Conventional Sanger sequencing of PCR products is the gold standard for species authentication of seafood products. However, this method is inappropriate for the analysis of products that might contain mixtures of species, such as tinned tuna. The purpose of this study was to test whether next-generation sequencing (NGS) can be a solution for the authentication of mixed products. Nine tuna samples containing mixtures of up to four species were prepared and subjected to an NGS approach targeting two short cytochrome b gene (cytb) fragments on the Illumina MiSeq platform. Sequence recovery was precise and admixtures of as low as 1% could be identified, depending on the species composition of the mixtures. Duplicate samples as well as two individual NGS runs produced very similar results. A first test of three commercial tinned tuna samples indicated the presence of different species in the same tin, although this is forbidden by EU law. Copyright © 2017 Elsevier Ltd. All rights reserved.
Alkaptonuria and Pompe disease in one patient: metabolic and molecular analysis.
Zouheir Habbal, Mohammad; Bou Assi, Tarek; Mansour, Hicham
2013-04-29
Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (C.670C>T and C.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual.
Alkaptonuria and pompe disease in one patient: metabolic and molecular analysis
Habbal, Mohammad Zouheir; Bou Assi, Tarek; Mansour, Hicham
2013-01-01
Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (C.670C>T and C.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual. PMID:23632174
Antosik, Karolina; Gnyś, Piotr; Jarosz-Chobot, Przemysława; Myśliwiec, Małgorzata; Szadkowska, Agnieszka; Małecki, Maciej; Młynarski, Wojciech; Borowiec, Maciej
2017-01-01
Monogenic diabetes is a rare disease caused by single gene mutations. Maturity onset diabetes of the young (MODY) is one of the major forms of monogenic diabetes recognised in the paediatric population. To date, 13 genes have been related to MODY development. The aim of the study was to analyse the sequence of the BCL2-associated agonist of cell death (BAD) gene in patients with clinical suspicion of GCK-MODY, but who were negative for glucokinase (GCK) gene mutations. A group of 122 diabetic patients were recruited from the "Polish Registry for Paediatric and Adolescent Diabetes - nationwide genetic screening for monogenic diabetes" project. The molecular testing was performed by Sanger sequencing. A total of 10 sequence variants of the BAD gene were identified in 122 analysed diabetic patients. Among the analysed patients suspected of MODY, one possible pathogenic variant was identified in one patient; however, further confirmation is required for a certain identification.
Characterization of the rainbow trout transcriptome using Sanger and 454-Pyrosequencing approaches
USDA-ARS?s Scientific Manuscript database
BACKGROUND: Rainbow trout is an important fish species for aquaculture and a model species for research investigations associated with carcinogenesis, comparative immunology, toxicology and the evolutionary biology. However, to date there is no genome reference sequence to facilitate the development...
Characterization of the rainbow trout transcriptome using Sanger and 454-pyrosequencing approaches
USDA-ARS?s Scientific Manuscript database
Background: Rainbow trout is an important fish for aquaculture and recreational fisheries and serves as a model species for research investigations associated with carcinogenesis, comparative immunology, toxicology and the evolutionary biology. However, to date there is no genome reference sequence...
Mutational Signature Mark Cancer’s Smoking Gun
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alexandrov, Ludmil
A broad computational study of cancer genome sequences by Los Alamos National Laboratory with the UK’s Wellcome Trust Sanger Institute and other collaborators identifies telltale mutational signatures associated with smoking tobacco. The research demonstrates, for the first time, that smoking increases cancer risk by causing somatic mutations in tissues directly and indirectly exposed to tobacco smoke. The international study was published in the November 4 issue of Science. The analysis shows that tobacco smoking causes mutations leading to cancer by multiple distinct mechanisms, including by damaging DNA in organs and by speeding up a mutational cellular clock.
Mutational Signature Mark Cancerâs Smoking Gun
Alexandrov, Ludmil
2018-06-13
A broad computational study of cancer genome sequences by Los Alamos National Laboratory with the UKâs Wellcome Trust Sanger Institute and other collaborators identifies telltale mutational signatures associated with smoking tobacco. The research demonstrates, for the first time, that smoking increases cancer risk by causing somatic mutations in tissues directly and indirectly exposed to tobacco smoke. The international study was published in the November 4 issue of Science. The analysis shows that tobacco smoking causes mutations leading to cancer by multiple distinct mechanisms, including by damaging DNA in organs and by speeding up a mutational cellular clock.
Yang, Tsun-Po; Beazley, Claude; Montgomery, Stephen B.; Dimas, Antigone S.; Gutierrez-Arcelus, Maria; Stranger, Barbara E.; Deloukas, Panos; Dermitzakis, Emmanouil T.
2010-01-01
Summary: Genevar (GENe Expression VARiation) is a database and Java tool designed to integrate multiple datasets, and provides analysis and visualization of associations between sequence variation and gene expression. Genevar allows researchers to investigate expression quantitative trait loci (eQTL) associations within a gene locus of interest in real time. The database and application can be installed on a standard computer in database mode and, in addition, on a server to share discoveries among affiliations or the broader community over the Internet via web services protocols. Availability: http://www.sanger.ac.uk/resources/software/genevar Contact: emmanouil.dermitzakis@unige.ch PMID:20702402
High-Throughput Next-Generation Sequencing of Polioviruses
Montmayeur, Anna M.; Schmidt, Alexander; Zhao, Kun; Magaña, Laura; Iber, Jane; Castro, Christina J.; Chen, Qi; Henderson, Elizabeth; Ramos, Edward; Shaw, Jing; Tatusov, Roman L.; Dybdahl-Sissoko, Naomi; Endegue-Zanga, Marie Claire; Adeniji, Johnson A.; Oberste, M. Steven; Burns, Cara C.
2016-01-01
ABSTRACT The poliovirus (PV) is currently targeted for worldwide eradication and containment. Sanger-based sequencing of the viral protein 1 (VP1) capsid region is currently the standard method for PV surveillance. However, the whole-genome sequence is sometimes needed for higher resolution global surveillance. In this study, we optimized whole-genome sequencing protocols for poliovirus isolates and FTA cards using next-generation sequencing (NGS), aiming for high sequence coverage, efficiency, and throughput. We found that DNase treatment of poliovirus RNA followed by random reverse transcription (RT), amplification, and the use of the Nextera XT DNA library preparation kit produced significantly better results than other preparations. The average viral reads per total reads, a measurement of efficiency, was as high as 84.2% ± 15.6%. PV genomes covering >99 to 100% of the reference length were obtained and validated with Sanger sequencing. A total of 52 PV genomes were generated, multiplexing as many as 64 samples in a single Illumina MiSeq run. This high-throughput, sequence-independent NGS approach facilitated the detection of a diverse range of PVs, especially for those in vaccine-derived polioviruses (VDPV), circulating VDPV, or immunodeficiency-related VDPV. In contrast to results from previous studies on other viruses, our results showed that filtration and nuclease treatment did not discernibly increase the sequencing efficiency of PV isolates. However, DNase treatment after nucleic acid extraction to remove host DNA significantly improved the sequencing results. This NGS method has been successfully implemented to generate PV genomes for molecular epidemiology of the most recent PV isolates. Additionally, the ability to obtain full PV genomes from FTA cards will aid in facilitating global poliovirus surveillance. PMID:27927929
Henderson, James B.; Sellas, Anna B.; Fuchs, Jérôme; Bowie, Rauri C.K.; Dumbacher, John P.
2017-01-01
We report here the successful assembly of the complete mitochondrial genomes of the northern spotted owl (Strix occidentalis caurina) and the barred owl (S. varia). We utilized sequence data from two sequencing methodologies, Illumina paired-end sequence data with insert lengths ranging from approximately 250 nucleotides (nt) to 9,600 nt and read lengths from 100–375 nt and Sanger-derived sequences. We employed multiple assemblers and alignment methods to generate the final assemblies. The circular genomes of S. o. caurina and S. varia are comprised of 19,948 nt and 18,975 nt, respectively. Both code for two rRNAs, twenty-two tRNAs, and thirteen polypeptides. They both have duplicated control region sequences with complex repeat structures. We were not able to assemble the control regions solely using Illumina paired-end sequence data. By fully spanning the control regions, Sanger-derived sequences enabled accurate and complete assembly of these mitochondrial genomes. These are the first complete mitochondrial genome sequences of owls (Aves: Strigiformes) possessing duplicated control regions. We searched the nuclear genome of S. o. caurina for copies of mitochondrial genes and found at least nine separate stretches of nuclear copies of gene sequences originating in the mitochondrial genome (Numts). The Numts ranged from 226–19,522 nt in length and included copies of all mitochondrial genes except tRNAPro, ND6, and tRNAGlu. Strix occidentalis caurina and S. varia exhibited an average of 10.74% (8.68% uncorrected p-distance) divergence across the non-tRNA mitochondrial genes. PMID:29038757
Correa, Fernanda A; França, Marcela M; Fang, Qing; Ma, Qianyi; Bachega, Tania A; Rodrigues, Andresa; Ozel, Bilge A; Li, Jun Z; Mendonca, Berenice B; Jorge, Alexander A L; Carvalho, Luciani R; Camper, Sally A; Arnhold, Ivo J P
2017-12-01
Isolated growth hormone deficiency (IGHD) is the most common pituitary hormone deficiency and, clinically, patients have delayed bone age. High sequence similarity between CYP21A2 gene and CYP21A1P pseudogene poses difficulties for exome sequencing interpretation. A 7.5 year-old boy born to second-degree cousins presented with severe short stature (height SDS -3.7) and bone age of 6 years. Clonidine and combined pituitary stimulation tests revealed GH deficiency. Pituitary MRI was normal. The patient was successfully treated with rGH. Surprisingly, at 10.8 years, his bone age had advanced to 13 years, but physical exam, LH and testosterone levels remained prepubertal. An ACTH stimulation test disclosed a non-classic congenital adrenal hyperplasia due to 21-hydroxylase deficiency explaining the bone age advancement and, therefore, treatment with cortisone acetate was added. The genetic diagnosis of a homozygous mutation in GHRHR (p.Leu144His), a homozygous CYP21A2 mutation (p.Val282Leu) and CYP21A1P pseudogene duplication was established by Sanger sequencing, MLPA and whole-exome sequencing. We report the unusual clinical presentation of a patient born to consanguineous parents with two recessive endocrine diseases: non-classic congenital adrenal hyperplasia modifying the classical GH deficiency phenotype. We used a method of paired read mapping aided by neighbouring mis-matches to overcome the challenges of exome-sequencing in the presence of a pseudogene.
Dubey, Anuja; Farmer, Andrew; Schlueter, Jessica; Cannon, Steven B; Abernathy, Brian; Tuteja, Reetu; Woodward, Jimmy; Shah, Trushar; Mulasmanovic, Benjamin; Kudapa, Himabindu; Raju, Nikku L; Gothalwal, Ragini; Pande, Suresh; Xiao, Yongli; Town, Chris D; Singh, Nagendra K; May, Gregory D; Jackson, Scott; Varshney, Rajeev K
2011-06-01
This study reports generation of large-scale genomic resources for pigeonpea, a so-called 'orphan crop species' of the semi-arid tropic regions. FLX/454 sequencing carried out on a normalized cDNA pool prepared from 31 tissues produced 494 353 short transcript reads (STRs). Cluster analysis of these STRs, together with 10 817 Sanger ESTs, resulted in a pigeonpea trancriptome assembly (CcTA) comprising of 127 754 tentative unique sequences (TUSs). Functional analysis of these TUSs highlights several active pathways and processes in the sampled tissues. Comparison of the CcTA with the soybean genome showed similarity to 10 857 and 16 367 soybean gene models (depending on alignment methods). Additionally, Illumina 1G sequencing was performed on Fusarium wilt (FW)- and sterility mosaic disease (SMD)-challenged root tissues of 10 resistant and susceptible genotypes. More than 160 million sequence tags were used to identify FW- and SMD-responsive genes. Sequence analysis of CcTA and the Illumina tags identified a large new set of markers for use in genetics and breeding, including 8137 simple sequence repeats, 12 141 single-nucleotide polymorphisms and 5845 intron-spanning regions. Genomic resources developed in this study should be useful for basic and applied research, not only for pigeonpea improvement but also for other related, agronomically important legumes.
Bai, Ying; Chen, Yibing; Kong, Xiangdong
2018-02-02
It has been reported that mutations in arginine vasopressin type 2 receptor (AVPR2) cause congenital X-linked nephrogenic diabetes insipidus (NDI). However, only a few cases of AVPR2 deletion have been documented in China. An NDI pedigree was included in this study, including the proband and his mother. All NDI patients had polyuria, polydipsia, and growth retardation. PCR mapping, long range PCR and sanger sequencing were used to identify genetic causes of NDI. A novel 22,110 bp deletion comprising AVPR2 and ARH4GAP4 genes was identified by PCR mapping, long range PCR and sanger sequencing. The deletion happened perhaps due to the 4-bp homologous sequence (TTTT) at the junctions of both 5' and 3' breakpoints. The gross deletion co-segregates with NDI. After analyzing available data of putative clinical signs of AVPR2 and ARH4GAP4 deletion, we reconsider the potential role of AVPR2 deletion in short stature. We identified a novel 22.1-kb deletion leading to X-linked NDI in a Chinese pedigree, which would increase the current knowledge in AVPR2 mutation.
Tahir, Muhammad N; Lockhart, Ben; Grinstead, Samuel; Mollov, Dimitre
2017-04-01
Bermuda grass samples were examined by transmission electron microscopy and 28-30 nm spherical virus particles were observed. Total RNA from these plants was subjected to high-throughput sequencing (HTS). The nearly full genome sequence of a panicovirus was identified from one HTS scaffold. Sanger sequencing was used to confirm the HTS results and complete the genome sequence of 4404 nt. This virus was provisionally named Bermuda grass latent virus (BGLV). Its predicted open reading frames follow the typical arrangement of the genus Panicovirus. Based on sequence comparisons and phylogenetic analyses BGLV differs from other viruses and therefore taxonomically it is a new member of the genus Panicovirus, family Tombusviridae.
Unlocking hidden genomic sequence
Keith, Jonathan M.; Cochran, Duncan A. E.; Lala, Gita H.; Adams, Peter; Bryant, Darryn; Mitchelson, Keith R.
2004-01-01
Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs. PMID:14973330
The Genome of the Cucumber, Cucumis Sativus L
USDA-ARS?s Scientific Manuscript database
Cucumber is an economically important crop as well as a model system for sex determination studies and plant vascular biology. Here we report the draft genome sequence of Cucumis sativus var. sativus L., assembled using a novel combination of traditional Sanger and next-generation Illumina GA sequen...
Rafaelsen, Silje; Johansson, Stefan; Ræder, Helge; Bjerknes, Robert
2015-01-01
Objective Hereditary hypophosphatemias (HH) are rare monogenic conditions characterized by decreased renal tubular phosphate reabsorption. The aim of this study was to explore the prevalence, genotypes, phenotypic spectrum, treatment response, and complications of treatment in the Norwegian population of children with HH. Design Retrospective national cohort study. Methods Sanger sequencing and multiplex ligand-dependent probe amplification analysis of PHEX and Sanger sequencing of FGF23, DMP1, ENPP1KL, and FAM20C were performed to assess genotype in patients with HH with or without rickets in all pediatric hospital departments across Norway. Patients with hypercalcuria were screened for SLC34A3 mutations. In one family, exome sequencing was performed. Information from the patients' medical records was collected for the evaluation of phenotype. Results Twety-eight patients with HH (18 females and ten males) from 19 different families were identified. X-linked dominant hypophosphatemic rickets (XLHR) was confirmed in 21 children from 13 families. The total number of inhabitants in Norway aged 18 or below by 1st January 2010 was 1 109 156, giving an XLHR prevalence of ∼1 in 60 000 Norwegian children. FAM20C mutations were found in two brothers and SLC34A3 mutations in one patient. In XLHR, growth was compromised in spite of treatment with oral phosphate and active vitamin D compounds, with males tending to be more affected than females. Nephrocalcinosis tended to be slightly more common in patients starting treatment before 1 year of age, and was associated with higher average treatment doses of phosphate. However, none of these differences reached statistical significance. Conclusions We present the first national cohort of HH in children. The prevalence of XLHR seems to be lower in Norwegian children than reported earlier. PMID:26543054
Guo, Bingfu; Guo, Yong; Hong, Huilong; Qiu, Li-Juan
2016-01-01
Molecular characterization of sequence flanking exogenous fragment insertion is essential for safety assessment and labeling of genetically modified organism (GMO). In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans GE-J16 and ZH10-6 based on whole genome sequencing (WGS) method. More than 22.4 Gb sequence data (∼21 × coverage) for each line was generated on Illumina HiSeq 2500 platform. The junction reads mapped to boundaries of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with soybean reference genome and sequence of transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated in positions of Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of genomic insertion sites of G2-EPSPS and GAT transgenes will facilitate the utilization of their glyphosate-tolerant traits in soybean breeding program. These results also demonstrated that WGS was a cost-effective and rapid method for identifying sites of T-DNA insertions and flanking sequences in soybean.
2017-05-01
TERMS Ovarian cancer, drug resistance, rucaparib, phase 2, DNA repair, homologous recombination, nonhomologous end-joining (NHEJ), poly(ADP-ribose...tissues from AA patients with OC. This should add 50 AA OC patients. We are also requesting anonymized DNA from AA OC patients who participated on...extracts DNA and creates library pretps for DNA sequencing. He performs Sanger sequencing validations. Funding Support: Has there been a change
A multiple-alignment based primer design algorithm for genetically highly variable DNA targets
2013-01-01
Background Primer design for highly variable DNA sequences is difficult, and experimental success requires attention to many interacting constraints. The advent of next-generation sequencing methods allows the investigation of rare variants otherwise hidden deep in large populations, but requires attention to population diversity and primer localization in relatively conserved regions, in addition to recognized constraints typically considered in primer design. Results Design constraints include degenerate sites to maximize population coverage, matching of melting temperatures, optimizing de novo sequence length, finding optimal bio-barcodes to allow efficient downstream analyses, and minimizing risk of dimerization. To facilitate primer design addressing these and other constraints, we created a novel computer program (PrimerDesign) that automates this complex procedure. We show its powers and limitations and give examples of successful designs for the analysis of HIV-1 populations. Conclusions PrimerDesign is useful for researchers who want to design DNA primers and probes for analyzing highly variable DNA populations. It can be used to design primers for PCR, RT-PCR, Sanger sequencing, next-generation sequencing, and other experimental protocols targeting highly variable DNA samples. PMID:23965160
The Mouse Genomes Project: a repository of inbred laboratory mouse strain genomes.
Adams, David J; Doran, Anthony G; Lilue, Jingtao; Keane, Thomas M
2015-10-01
The Mouse Genomes Project was initiated in 2009 with the goal of using next-generation sequencing technologies to catalogue molecular variation in the common laboratory mouse strains, and a selected set of wild-derived inbred strains. The initial sequencing and survey of sequence variation in 17 inbred strains was completed in 2011 and included comprehensive catalogue of single nucleotide polymorphisms, short insertion/deletions, larger structural variants including their fine scale architecture and landscape of transposable element variation, and genomic sites subject to post-transcriptional alteration of RNA. From this beginning, the resource has expanded significantly to include 36 fully sequenced inbred laboratory mouse strains, a refined and updated data processing pipeline, and new variation querying and data visualisation tools which are available on the project's website ( http://www.sanger.ac.uk/resources/mouse/genomes/ ). The focus of the project is now the completion of de novo assembled chromosome sequences and strain-specific gene structures for the core strains. We discuss how the assembled chromosomes will power comparative analysis, data access tools and future directions of mouse genetics.
2010-01-01
Background The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the Quercus family. These oaks are often a source of raw material for biomass wood and fiber. Pedunculate and sessile oaks, are among the most important deciduous forest tree species in Europe. Despite their ecological and economical importance, very few genomic resources have yet been generated for these species. Here, we describe the development of an EST catalogue that will support ecosystem genomics studies, where geneticists, ecophysiologists, molecular biologists and ecologists join their efforts for understanding, monitoring and predicting functional genetic diversity. Results We generated 145,827 sequence reads from 20 cDNA libraries using the Sanger method. Unexploitable chromatograms and quality checking lead us to eliminate 19,941 sequences. Finally a total of 125,925 ESTs were retained from 111,361 cDNA clones. Pyrosequencing was also conducted for 14 libraries, generating 1,948,579 reads, from which 370,566 sequences (19.0%) were eliminated, resulting in 1,578,192 sequences. Following clustering and assembly using TGICL pipeline, 1,704,117 EST sequences collapsed into 69,154 tentative contigs and 153,517 singletons, providing 222,671 non-redundant sequences (including alternative transcripts). We also assembled the sequences using MIRA and PartiGene software and compared the three unigene sets. Gene ontology annotation was then assigned to 29,303 unigene elements. Blast search against the SWISS-PROT database revealed putative homologs for 32,810 (14.7%) unigene elements, but more extensive search with Pfam, Refseq_protein, Refseq_RNA and eight gene indices revealed homology for 67.4% of them. The EST catalogue was examined for putative homologs of candidate genes involved in bud phenology, cuticle formation, phenylpropanoids biosynthesis and cell wall formation. Our results suggest a good coverage of genes involved in these traits. Comparative orthologous sequences (COS) with other plant gene models were identified and allow to unravel the oak paleo-history. Simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 52,834 SSRs and 36,411 SNPs. All of these are available through the Oak Contig Browser http://genotoul-contigbrowser.toulouse.inra.fr:9092/Quercus_robur/index.html. Conclusions This genomic resource provides a unique tool to discover genes of interest, study the oak transcriptome, and develop new markers to investigate functional diversity in natural populations. PMID:21092232
Rama, Mélanie; Duflos, Claire; Melki, Isabelle; Bessis, Didier; Bonhomme, Axelle; Martin, Hélène; Doummar, Diane; Valence, Stéphanie; Rodriguez, Diana; Carme, Emilie; Genevieve, David; Heimdal, Ketil; Insalaco, Antonella; Franck, Nathalie; Queyrel-Moranne, Viviane; Tieulie, Nathalie; London, Jonathan; Uettwiller, Florence; Georgin-Lavialle, Sophie; Belot, Alexandre; Koné-Paut, Isabelle; Hentgen, Véronique; Boursier, Guilaine; Touitou, Isabelle; Sarrabay, Guillaume
2018-04-23
Deficiency of adenosine deaminase 2 (DADA2) is a recently described autoinflammatory disorder. Genetic analysis is required to confirm the diagnosis. We aimed to describe the identifying symptoms and genotypes of patients referred to our reference centres and to improve the indications for genetic testing. DNA from 66 patients with clinically suspected DADA2 were sequenced by Sanger or next-generation sequencing. Detailed epidemiological, clinical and biological features were collected by use of a questionnaire and were compared between patients with and without genetic confirmation of DADA2. We identified 13 patients (19.6%) carrying recessively inherited mutations in ADA2 that were predicted to be deleterious. Eight patients were compound heterozygous for mutations. Seven mutations were novel (4 missense variants, 2 predicted to affect mRNA splicing and 1 frameshift). The mean age of the 13 patients with genetic confirmation was 12.7 years at disease onset and 20.8 years at diagnosis. Phenotypic manifestations included fever (85%), vasculitis (85%) and neurological disorders (54%). Features best associated with a confirmatory genotype included fever with neurologic or cutaneous attacks (odds ratio [OR] 10.71, p = 0.003 and OR 10.9, p < 0.001), fever alone (OR 8.1, p = 0.01), and elevated C-reactive protein (CRP) level with neurologic involvement (OR 6.63, p = 0.017). Our proposed decision tree may help improve obtaining genetic confirmation of DADA2 in the context of autoinflammatory symptoms. Prerequisites for quick and low-cost Sanger analysis include one typical cutaneous or neurological sign, one marker of inflammation (fever or elevated CRP level), and recurrent or chronic attacks in adults.
Audo, Isabelle; Bujakowska, Kinga; Mohand-Saïd, Saddek; Tronche, Sophie; Lancelot, Marie-Elise; Antonio, Aline; Germain, Aurore; Lonjou, Christine; Carpentier, Wassila; Sahel, José-Alain; Bhattacharya, Shomi; Zeitz, Christina
2011-01-01
To identify the genetic defect of a consanguineous Portuguese family with rod-cone dystrophy and varying degrees of decreased audition. A detailed ophthalmic and auditory examination was performed on a Portuguese patient with severe autosomal recessive rod-cone dystrophy. Known genetic defects were excluded by performing autosomal recessive retinitis pigmentosa (arRP) genotyping microarray analysis and by Sanger sequencing of the coding exons and flanking intronic regions of eyes shut homolog-drosophila (EYS) and chromosome 2 open reading frame 71 (C2orf71). Subsequently, genome-wide homozygosity mapping was performed in DNA samples from available family members using a 700K single nucleotide polymorphism (SNP) microarray. Candidate genes present in the significantly large homozygous regions were screened for mutations using Sanger sequencing. The largest homozygous region (~11 Mb) in the affected family members was mapped to chromosome 9, which harbors deafness, autosomal recessive 31 (DFNB31; a gene previously associated with Usher syndrome). Mutation analysis of DFNB31 in the index patient identified a novel one-base-pair deletion (c.737delC), which is predicted to lead to a truncated protein (p.Pro246HisfsX13) and co-segregated with the disease in the family. Ophthalmic examination of the index patient and the affected siblings showed severe rod-cone dystrophy. Pure tone audiometry revealed a moderate hearing loss in the index patient, whereas the affected siblings were reported with more profound and early onset hearing impairment. We report a novel truncating mutation in DFNB31 associated with severe rod-cone dystrophy and varying degrees of hearing impairment in a consanguineous family of Portuguese origin. This is the second report of DFNB31 implication in Usher type 2.
Ulusal, SD; Gürkan, H; Atlı, E; Özal, SA; Çiftdemir, M; Tozkır, H; Karal, Y; Güçlü, H; Eker, D; Görker, I
2017-01-01
Abstract Neurofibromatosis Type I (NF1) is a multi systemic autosomal dominant neurocutaneous disorder predisposing patients to have benign and/or malignant lesions predominantly of the skin, nervous system and bone. Loss of function mutations or deletions of the NF1 gene is responsible for NF1 disease. Involvement of various pathogenic variants, the size of the gene and presence of pseudogenes makes it difficult to analyze. We aimed to report the results of 2 years of multiplex ligation-dependent probe amplification (MLPA) and next generation sequencing (NGS) for genetic diagnosis of NF1 applied at our genetic diagnosis center. The MLPA, semiconductor sequencing and Sanger sequencing were performed in genomic DNA samples from 24 unrelated patients and their affected family members referred to our center suspected of having NF1. In total, three novel and 12 known pathogenic variants and a whole gene deletion were determined. We suggest that next generation sequencing is a practical tool for genetic analysis of NF1. Deletion/duplication analysis with MLPA may also be helpful for patients clinically diagnosed to carry NF1 but do not have a detectable mutation in NGS. PMID:28924536
Rapid phylogenetic dissection of prokaryotic community structure in tidal flat using pyrosequencing.
Kim, Bong-Soo; Kim, Byung Kwon; Lee, Jae-Hak; Kim, Myungjin; Lim, Young Woon; Chun, Jongsik
2008-08-01
Dissection of prokaryotic community structure is prerequisite to understand their ecological roles. Various methods are available for such a purpose which amplification and sequencing of 16S rRNA genes gained its popularity. However, conventional methods based on Sanger sequencing technique require cloning process prior to sequencing, and are expensive and labor-intensive. We investigated prokaryotic community structure in tidal flat sediments, Korea, using pyrosequencing and a subsequent automated bioinformatic pipeline for the rapid and accurate taxonomic assignment of each amplicon. The combination of pyrosequencing and bioinformatic analysis showed that bacterial and archaeal communities were more diverse than previously reported in clone library studies. Pyrosequencing analysis revealed 21 bacterial divisions and 37 candidate divisions. Proteobacteria was the most abundant division in the bacterial community, of which Gamma-and Delta-Proteobacteria were the most abundant. Similarly, 4 archaeal divisions were found in tidal flat sediments. Euryarchaeota was the most abundant division in the archaeal sequences, which were further divided into 8 classes and 11 unclassified euryarchaeota groups. The system developed here provides a simple, in-depth and automated way of dissecting a prokaryotic community structure without extensive pretreatment such as cloning.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lapidus, Alla L.
From the date its role in heredity was discovered, DNA has been generating interest among scientists from different fields of knowledge: physicists have studied the three dimensional structure of the DNA molecule, biologists tried to decode the secrets of life hidden within these long molecules, and technologists invent and improve methods of DNA analysis. The analysis of the nucleotide sequence of DNA occupies a special place among the methods developed. Thanks to the variety of sequencing technologies available, the process of decoding the sequence of genomic DNA (or whole genome sequencing) has become robust and inexpensive. Meanwhile the assembly ofmore » whole genome sequences remains a challenging task. In addition to the need to assemble millions of DNA fragments of different length (from 35 bp (Solexa) to 800 bp (Sanger)), great interest in analysis of microbial communities (metagenomes) of different complexities raises new problems and pushes some new requirements for sequence assembly tools to the forefront. The genome assembly process can be divided into two steps: draft assembly and assembly improvement (finishing). Despite the fact that automatically performed assembly (or draft assembly) is capable of covering up to 98% of the genome, in most cases, it still contains incorrectly assembled reads. The error rate of the consensus sequence produced at this stage is about 1/2000 bp. A finished genome represents the genome assembly of much higher accuracy (with no gaps or incorrectly assembled areas) and quality ({approx}1 error/10,000 bp), validated through a number of computer and laboratory experiments.« less
Detection of Emerging Vaccine-Related Polioviruses by Deep Sequencing.
Sahoo, Malaya K; Holubar, Marisa; Huang, ChunHong; Mohamed-Hadley, Alisha; Liu, Yuanyuan; Waggoner, Jesse J; Troy, Stephanie B; Garcia-Garcia, Lourdes; Ferreyra-Reyes, Leticia; Maldonado, Yvonne; Pinsky, Benjamin A
2017-07-01
Oral poliovirus vaccine can mutate to regain neurovirulence. To date, evaluation of these mutations has been performed primarily on culture-enriched isolates by using conventional Sanger sequencing. We therefore developed a culture-independent, deep-sequencing method targeting the 5' untranslated region (UTR) and P1 genomic region to characterize vaccine-related poliovirus variants. Error analysis of the deep-sequencing method demonstrated reliable detection of poliovirus mutations at levels of <1%, depending on read depth. Sequencing of viral nucleic acids from the stool of vaccinated, asymptomatic children and their close contacts collected during a prospective cohort study in Veracruz, Mexico, revealed no vaccine-derived polioviruses. This was expected given that the longest duration between sequenced sample collection and the end of the most recent national immunization week was 66 days. However, we identified many low-level variants (<5%) distributed across the 5' UTR and P1 genomic region in all three Sabin serotypes, as well as vaccine-related viruses with multiple canonical mutations associated with phenotypic reversion present at high levels (>90%). These results suggest that monitoring emerging vaccine-related poliovirus variants by deep sequencing may aid in the poliovirus endgame and efforts to ensure global polio eradication. Copyright © 2017 Sahoo et al.
Ferret, Yann; Caillault, Aurélie; Sebda, Shéhérazade; Duez, Marc; Grardel, Nathalie; Duployez, Nicolas; Villenet, Céline; Figeac, Martin; Preudhomme, Claude; Salson, Mikaël; Giraud, Mathieu
2016-05-01
High-throughput sequencing (HTS) is considered a technical revolution that has improved our knowledge of lymphoid and autoimmune diseases, changing our approach to leukaemia both at diagnosis and during follow-up. As part of an immunoglobulin/T cell receptor-based minimal residual disease (MRD) assessment of acute lymphoblastic leukaemia patients, we assessed the performance and feasibility of the replacement of the first steps of the approach based on DNA isolation and Sanger sequencing, using a HTS protocol combined with bioinformatics analysis and visualization using the Vidjil software. We prospectively analysed the diagnostic and relapse samples of 34 paediatric patients, thus identifying 125 leukaemic clones with recombinations on multiple loci (TRG, TRD, IGH and IGK), including Dd2/Dd3 and Intron/KDE rearrangements. Sequencing failures were halved (14% vs. 34%, P = 0.0007), enabling more patients to be monitored. Furthermore, more markers per patient could be monitored, reducing the probability of false negative MRD results. The whole analysis, from sample receipt to clinical validation, was shorter than our current diagnostic protocol, with equal resources. V(D)J recombination was successfully assigned by the software, even for unusual recombinations. This study emphasizes the progress that HTS with adapted bioinformatics tools can bring to the diagnosis of leukaemia patients. © 2016 John Wiley & Sons Ltd.
MALINA: a web service for visual analytics of human gut microbiota whole-genome metagenomic reads.
Tyakht, Alexander V; Popenko, Anna S; Belenikin, Maxim S; Altukhov, Ilya A; Pavlenko, Alexander V; Kostryukova, Elena S; Selezneva, Oksana V; Larin, Andrei K; Karpova, Irina Y; Alexeev, Dmitry G
2012-12-07
MALINA is a web service for bioinformatic analysis of whole-genome metagenomic data obtained from human gut microbiota sequencing. As input data, it accepts metagenomic reads of various sequencing technologies, including long reads (such as Sanger and 454 sequencing) and next-generation (including SOLiD and Illumina). It is the first metagenomic web service that is capable of processing SOLiD color-space reads, to authors' knowledge. The web service allows phylogenetic and functional profiling of metagenomic samples using coverage depth resulting from the alignment of the reads to the catalogue of reference sequences which are built into the pipeline and contain prevalent microbial genomes and genes of human gut microbiota. The obtained metagenomic composition vectors are processed by the statistical analysis and visualization module containing methods for clustering, dimension reduction and group comparison. Additionally, the MALINA database includes vectors of bacterial and functional composition for human gut microbiota samples from a large number of existing studies allowing their comparative analysis together with user samples, namely datasets from Russian Metagenome project, MetaHIT and Human Microbiome Project (downloaded from http://hmpdacc.org). MALINA is made freely available on the web at http://malina.metagenome.ru. The website is implemented in JavaScript (using Ext JS), Microsoft .NET Framework, MS SQL, Python, with all major browsers supported.
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.
Militello, Kevin T; Lazatin, Justine C
2017-05-01
Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017. © 2016 The International Union of Biochemistry and Molecular Biology.
Milosevic Feenstra, Jelena D.; Nivarthi, Harini; Gisslinger, Heinz; Leroy, Emilie; Rumi, Elisa; Chachoua, Ilyas; Bagienski, Klaudia; Kubesova, Blanka; Pietra, Daniela; Gisslinger, Bettina; Milanesi, Chiara; Jäger, Roland; Chen, Doris; Berg, Tiina; Schalling, Martin; Schuster, Michael; Bock, Christoph; Constantinescu, Stefan N.; Cazzola, Mario
2016-01-01
Essential thrombocythemia (ET) and primary myelofibrosis (PMF) are chronic diseases characterized by clonal hematopoiesis and hyperproliferation of terminally differentiated myeloid cells. The disease is driven by somatic mutations in exon 9 of CALR or exon 10 of MPL or JAK2-V617F in >90% of the cases, whereas the remaining cases are termed “triple negative.” We aimed to identify the disease-causing mutations in the triple-negative cases of ET and PMF by applying whole-exome sequencing (WES) on paired tumor and control samples from 8 patients. We found evidence of clonal hematopoiesis in 5 of 8 studied cases based on clonality analysis and presence of somatic genetic aberrations. WES identified somatic mutations in 3 of 8 cases. We did not detect any novel recurrent somatic mutations. In 3 patients with clonal hematopoiesis analyzed by WES, we identified a somatic MPL-S204P, a germline MPL-V285E mutation, and a germline JAK2-G571S variant. We performed Sanger sequencing of the entire coding region of MPL in 62, and of JAK2 in 49 additional triple-negative cases of ET or PMF. New somatic (T119I, S204F, E230G, Y591D) and 1 germline (R321W) MPL mutation were detected. All of the identified MPL mutations were gain-of-function when analyzed in functional assays. JAK2 variants were identified in 5 of 57 triple-negative cases analyzed by WES and Sanger sequencing combined. We could demonstrate that JAK2-V625F and JAK2-F556V are gain-of-function mutations. Our results suggest that triple-negative cases of ET and PMF do not represent a homogenous disease entity. Cases with polyclonal hematopoiesis might represent hereditary disorders. PMID:26423830
Milosevic Feenstra, Jelena D; Nivarthi, Harini; Gisslinger, Heinz; Leroy, Emilie; Rumi, Elisa; Chachoua, Ilyas; Bagienski, Klaudia; Kubesova, Blanka; Pietra, Daniela; Gisslinger, Bettina; Milanesi, Chiara; Jäger, Roland; Chen, Doris; Berg, Tiina; Schalling, Martin; Schuster, Michael; Bock, Christoph; Constantinescu, Stefan N; Cazzola, Mario; Kralovics, Robert
2016-01-21
Essential thrombocythemia (ET) and primary myelofibrosis (PMF) are chronic diseases characterized by clonal hematopoiesis and hyperproliferation of terminally differentiated myeloid cells. The disease is driven by somatic mutations in exon 9 of CALR or exon 10 of MPL or JAK2-V617F in >90% of the cases, whereas the remaining cases are termed "triple negative." We aimed to identify the disease-causing mutations in the triple-negative cases of ET and PMF by applying whole-exome sequencing (WES) on paired tumor and control samples from 8 patients. We found evidence of clonal hematopoiesis in 5 of 8 studied cases based on clonality analysis and presence of somatic genetic aberrations. WES identified somatic mutations in 3 of 8 cases. We did not detect any novel recurrent somatic mutations. In 3 patients with clonal hematopoiesis analyzed by WES, we identified a somatic MPL-S204P, a germline MPL-V285E mutation, and a germline JAK2-G571S variant. We performed Sanger sequencing of the entire coding region of MPL in 62, and of JAK2 in 49 additional triple-negative cases of ET or PMF. New somatic (T119I, S204F, E230G, Y591D) and 1 germline (R321W) MPL mutation were detected. All of the identified MPL mutations were gain-of-function when analyzed in functional assays. JAK2 variants were identified in 5 of 57 triple-negative cases analyzed by WES and Sanger sequencing combined. We could demonstrate that JAK2-V625F and JAK2-F556V are gain-of-function mutations. Our results suggest that triple-negative cases of ET and PMF do not represent a homogenous disease entity. Cases with polyclonal hematopoiesis might represent hereditary disorders. © 2016 by The American Society of Hematology.
Shah, Kushani; Thomas, Shelby; Stein, Arnold
2013-01-01
In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C Sanger sequencing reactions. They prepare and run the gels, perform Southern blots (which require only 10 min), and detect sequencing ladders using a colorimetric detection system. Students enlarge their sequencing ladders from digital images of their small nylon membranes, and read the sequence manually. They compare their reads with the actual DNA sequence using BLAST2. After mastering the DNA sequencing system, students prepare their own DNA from a cheek swab, polymerase chain reaction-amplify a region of their DNA that encompasses a SNP of interest, and perform sequencing to determine their genotype at the SNP position. A family pedigree can also be constructed. The SNP chosen by the instructor was rs17822931, which is in the ABCC11 gene and is the determinant of human earwax type. Genotypes at the rs178229931 site vary in different ethnic populations. © 2013 by The International Union of Biochemistry and Molecular Biology.
Determination of EGFR and KRAS mutational status in Greek non-small-cell lung cancer patients
PAPADOPOULOU, EIRINI; TSOULOS, NIKOLAOS; TSIRIGOTI, ANGELIKI; APESSOS, ANGELA; AGIANNITOPOULOS, KONSTANTINOS; METAXA-MARIATOU, VASILIKI; ZAROGOULIDIS, KONSTANTINOS; ZAROGOULIDIS, PAVLOS; KASARAKIS, DIMITRIOS; KAKOLYRIS, STYLIANOS; DAHABREH, JUBRAIL; VLASTOS, FOTIS; ZOUBLIOS, CHARALAMPOS; RAPTI, AGGELIKI; PAPAGEORGIOU, NIKI GEORGATOU; VELDEKIS, DIMITRIOS; GAGA, MINA; ARAVANTINOS, GERASIMOS; KARAVASILIS, VASILEIOS; KARAGIANNIDIS, NAPOLEON; NASIOULAS, GEORGE
2015-01-01
It has been reported that certain patients with non-small-cell lung cancer (NSCLC) that harbor activating somatic mutations within the tyrosine kinase domain of the epidermal growth factor receptor (EGFR) gene may be effectively treated using targeted therapy. The use of EGFR inhibitors in patient therapy has been demonstrated to improve response and survival rates; therefore, it was suggested that clinical screening for EGFR mutations should be performed for all patients. Numerous clinicopathological factors have been associated with EGFR and Kirsten-rat sarcoma oncogene homolog (KRAS) mutational status including gender, smoking history and histology. In addition, it was reported that EGFR mutation frequency in NSCLC patients was ethnicity-dependent, with an incidence rate of ~30% in Asian populations and ~15% in Caucasian populations. However, limited data has been reported on intra-ethnic differences throughout Europe. The present study aimed to investigate the frequency and spectrum of EGFR mutations in 1,472 Greek NSCLC patients. In addition, KRAS mutation analysis was performed in patients with known smoking history in order to determine the correlation of type and mutation frequency with smoking. High-resolution melting curve (HRM) analysis followed by Sanger sequencing was used to identify mutations in exons 18–21 of the EGFR gene and in exon 2 of the KRAS gene. A sensitive next-generation sequencing (NGS) technology was also employed to classify samples with equivocal results. The use of sensitive mutation detection techniques in a large study population of Greek NSCLC patients in routine diagnostic practice revealed an overall EGFR mutation frequency of 15.83%. This mutation frequency was comparable to that previously reported in other European populations. Of note, there was a 99.8% concordance between the HRM method and Sanger sequencing. NGS was found to be the most sensitive method. In addition, female non-smokers demonstrated a high prevalence of EGFR mutations. Furthermore, KRAS mutation analysis in patients with a known smoking history revealed no difference in mutation frequency according to smoking status; however, a different mutation spectrum was observed. PMID:26622815
Ramos, M D; Trujillano, D; Olivar, R; Sotillo, F; Ossowski, S; Manzanares, J; Costa, J; Gartner, S; Oliva, C; Quintana, E; Gonzalez, M I; Vazquez, C; Estivill, X; Casals, T
2014-07-01
The term cystic fibrosis (CF)-like disease is used to describe patients with a borderline sweat test and suggestive CF clinical features but without two CFTR(cystic fibrosis transmembrane conductance regulator) mutations. We have performed the extensive molecular analysis of four candidate genes (SCNN1A, SCNN1B, SCNN1G and SERPINA1) in a cohort of 10 uncharacterized patients with CF and CF-like disease. We have used whole-exome sequencing to characterize mutations in the CFTR gene and these four candidate genes. CFTR molecular analysis allowed a complete characterization of three of four CF patients. Candidate variants in SCNN1A, SCNN1B, SCNN1G and SERPINA1 in six patients with CF-like phenotypes were confirmed by Sanger sequencing and were further supported by in silico predictive analysis, pedigree studies, sweat test in other family members, and analysis in CF patients and healthy subjects. Our results suggest that CF-like disease probably results from complex genotypes in several genes in an oligogenic form, with rare variants interacting with environmental factors. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Marshall, Charla; Sturk-Andreaggi, Kimberly; Daniels-Higginbotham, Jennifer; Oliver, Robert Sean; Barritt-Ross, Suzanne; McMahon, Timothy P
2017-11-01
Next-generation ancient DNA technologies have the potential to assist in the analysis of degraded DNA extracted from forensic specimens. Mitochondrial genome (mitogenome) sequencing, specifically, may be of benefit to samples that fail to yield forensically relevant genetic information using conventional PCR-based techniques. This report summarizes the Armed Forces Medical Examiner System's Armed Forces DNA Identification Laboratory's (AFMES-AFDIL) performance evaluation of a Next-Generation Sequencing protocol for degraded and chemically treated past accounting samples. The procedure involves hybridization capture for targeted enrichment of mitochondrial DNA, massively parallel sequencing using Illumina chemistry, and an automated bioinformatic pipeline for forensic mtDNA profile generation. A total of 22 non-probative samples and associated controls were processed in the present study, spanning a range of DNA quantity and quality. Data were generated from over 100 DNA libraries by ten DNA analysts over the course of five months. The results show that the mitogenome sequencing procedure is reliable and robust, sensitive to low template (one ng control DNA) as well as degraded DNA, and specific to the analysis of the human mitogenome. Haplotypes were overall concordant between NGS replicates and with previously generated Sanger control region data. Due to the inherent risk for contamination when working with low-template, degraded DNA, a contamination assessment was performed. The consumables were shown to be void of human DNA contaminants and suitable for forensic use. Reagent blanks and negative controls were analyzed to determine the background signal of the procedure. This background signal was then used to set analytical and reporting thresholds, which were designated at 4.0X (limit of detection) and 10.0X (limit of quantiation) average coverage across the mitogenome, respectively. Nearly all human samples exceeded the reporting threshold, although coverage was reduced in chemically treated samples resulting in a ∼58% passing rate for these poor-quality samples. A concordance assessment demonstrated the reliability of the NGS data when compared to known Sanger profiles. One case sample was shown to be mixed with a co-processed sample and two reagent blanks indicated the presence of DNA above the analytical threshold. This contamination was attributed to sequencing crosstalk from simultaneously sequenced high-quality samples to include the positive control. Overall this study demonstrated that hybridization capture and Illumina sequencing provide a viable method for mitogenome sequencing of degraded and chemically treated skeletal DNA samples, yet may require alternative measures of quality control. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Exome Sequencing of 18 Chinese Families with Congenital Cataracts: A New Sight of the NHS Gene
Sun, Wenmin; Xiao, Xueshan; Li, Shiqiang; Guo, Xiangming; Zhang, Qingjiong
2014-01-01
Purpose The aim of this study was to investigate the mutation spectrum and frequency of 34 known genes in 18 Chinese families with congenital cataracts. Methods Genomic DNA and clinical data was collected from 18 families with congenital cataracts. Variations in 34 cataract-associated genes were screened by whole exome sequencing and then validated by Sanger sequencing. Results Eleven candidate variants in seven of the 34 genes were detected by exome sequencing and then confirmed by Sanger sequencing, including two variants predicted to be benign and the other pathogenic mutations. The nine mutations were present in 9 of the 18 (50%) families with congenital cataracts. Of the four families with mutations in the X-linked NHS gene, no other abnormalities were recorded except for cataract, in which a pseudo-dominant inheritance form was suggested, as female carriers also had different forms of cataracts. Conclusion This study expands the mutation spectrum and frequency of genes responsible for congenital cataract. Mutation in NHS is a common cause of nonsyndromic congenital cataract with pseudo-autosomal dominant inheritance. Combined with our previous studies, a genetic basis could be identified in 67.6% of families with congenital cataracts in our case series, in which mutations in genes encoding crystallins, genes encoding connexins, and NHS are responsible for 29.4%, 14.7%, and 11.8% of families, respectively. Our results suggest that mutations in NHS are the common cause of congenital cataract, both syndromic and nonsyndromic. PMID:24968223
Ağladıoğlu, Sebahat Yılmaz; Aycan, Zehra; Çetinkaya, Semra; Baş, Veysel Nijat; Önder, Aşan; Peltek Kendirci, Havva Nur; Doğan, Haldun; Ceylaner, Serdar
2016-04-01
Maturity-onset diabetes of the youth (MODY), is a genetically and clinically heterogeneous group of diseasesand is often misdiagnosed as type 1 or type 2 diabetes. The aim of this study is to investigate both novel and proven mutations of 11 MODY genes in Turkish children by using targeted next generation sequencing. A panel of 11 MODY genes were screened in 43 children with MODY diagnosed by clinical criterias. Studies of index cases was done with MISEQ-ILLUMINA, and family screenings and confirmation studies of mutations was done by Sanger sequencing. We identified 28 (65%) point mutations among 43 patients. Eighteen patients have GCK mutations, four have HNF1A, one has HNF4A, one has HNF1B, two have NEUROD1, one has PDX1 gene variations and one patient has both HNF1A and HNF4A heterozygote mutations. This is the first study including molecular studies of 11 MODY genes in Turkish children. GCK is the most frequent type of MODY in our study population. Very high frequency of novel mutations (42%) in our study population, supports that in heterogenous disorders like MODY sequence analysis provides rapid, cost effective and accurate genetic diagnosis.
Ahmad, Farooq; Nasir, Abdul; Thiele, Holger; Umair, Muhammad; Borck, Guntram; Ahmad, Wasim
2018-02-12
Ectodermal dysplasia syndactyly syndrome 1 (EDSS1) is a rare form of ectodermal dysplasia including anomalies of hair, nails, and teeth along with bilateral cutaneous syndactyly of hands and feet. In the present report, we performed a clinical and genetic characterization of a consanguineous Pakistani family with four individuals affected by EDSS1. We performed exome sequencing using DNA of one affected individual. Exome data analysis identified a novel homozygous missense variant (c.242T>C; p.(Leu81Pro)) in NECTIN4 (PVRL4). Sanger sequencing validated this variant and confirmed its cosegregation with the disease phenotype in the family members. Thus, our report adds a novel variant to the NECTIN4 mutation spectrum and contributes to the NECTIN4-related clinical characterization. © 2018 John Wiley & Sons Ltd/University College London.
Nephrocalcinosis (Enamel Renal Syndrome) Caused by Autosomal Recessive FAM20A Mutations
Jaureguiberry, Graciana; De la Dure-Molla, Muriel; Parry, David; Quentric, Mickael; Himmerkus, Nina; Koike, Toshiyasu; Poulter, James; Klootwijk, Enriko; Robinette, Steven L.; Howie, Alexander J.; Patel, Vaksha; Figueres, Marie-Lucile; Stanescu, Horia C.; Issler, Naomi; Nicholson, Jeremy K.; Bockenhauer, Detlef; Laing, Christopher; Walsh, Stephen B.; McCredie, David A.; Povey, Sue; Asselin, Audrey; Picard, Arnaud; Coulomb, Aurore; Medlar, Alan J.; Bailleul-Forestier, Isabelle; Verloes, Alain; Le Caignec, Cedric; Roussey, Gwenaelle; Guiol, Julien; Isidor, Bertrand; Logan, Clare; Shore, Roger; Johnson, Colin; Inglehearn, Christopher; Al-Bahlani, Suhaila; Schmittbuhl, Matthieu; Clauss, François; Huckert, Mathilde; Laugel, Virginie; Ginglinger, Emmanuelle; Pajarola, Sandra; Spartà, Giuseppina; Bartholdi, Deborah; Rauch, Anita; Addor, Marie-Claude; Yamaguti, Paulo M.; Safatle, Heloisa P.; Acevedo, Ana Carolina; Martelli-Júnior, Hercílio; dos Santos Netos, Pedro E.; Coletta, Ricardo D.; Gruessel, Sandra; Sandmann, Carolin; Ruehmann, Denise; Langman, Craig B.; Scheinman, Steven J.; Ozdemir-Ozenen, Didem; Hart, Thomas C.; Hart, P. Suzanne; Neugebauer, Ute; Schlatter, Eberhard; Houillier, Pascal; Gahl, William A.; Vikkula, Miikka; Bloch-Zupan, Agnès; Bleich, Markus; Kitagawa, Hiroshi; Unwin, Robert J.; Mighell, Alan; Berdal, Ariane; Kleta, Robert
2013-01-01
Background/Aims Calcium homeostasis requires regulated cellular and interstitial systems interacting to modulate the activity and movement of this ion. Disruption of these systems in the kidney results in nephrocalcinosis and nephrolithiasis, important medical problems whose pathogenesis is incompletely understood. Methods We investigated 25 patients from 16 families with unexplained nephrocalcinosis and characteristic dental defects (amelogenesis imperfecta, gingival hyperplasia, impaired tooth eruption). To identify the causative gene, we performed genome-wide linkage analysis, exome capture, next-generation sequencing, and Sanger sequencing. Results All patients had bi-allelic FAM20A mutations segregating with the disease; 20 different mutations were identified. Conclusions This au-tosomal recessive disorder, also known as enamel renal syndrome, of FAM20A causes nephrocalcinosis and amelogenesis imperfecta. We speculate that all individuals with biallelic FAM20A mutations will eventually show nephrocalcinosis. PMID:23434854
Meng, Xiaohong; Li, Qiyou; Guo, Hong; Xu, Haiwei; Li, Shiying; Yin, Zhengqin
2017-01-01
To characterize the clinical and molecular genetic characteristics of a large, multigenerational Chinese family showing different phenotypes. A pedigree consisted of 56 individuals in 5 generations was recruited. Comprehensive ophthalmic examinations were performed in 16 family members affected. Mutation screening of CYP4V2 was performed by Sanger sequencing. Next-generation sequencing (NGS) was performed to capture and sequence all exons of 47 known retinal dystrophy-associated genes in two affected family members who had no mutations in CYP4V2 . The detected variants in NGS were validated by Sanger sequencing in the family members. Two compound heterozygous CYP4V2 mutations (c.802-8_810del17insGC and c.992A>C) were detected in the proband who presented typical clinical features of BCD. One missense mutation (c.1482C>T, p.T494M) in the PRPF3 gene was detected in 9 out of 22 affected family members who manifested classical clinical features of RP. Our results showed that two compound heterozygous CYP4V2 mutations caused BCD, and one missense mutation in PRPF3 was responsible for adRP in this large family. This study suggests that accurate phenotypic diagnosis, molecular diagnosis, and genetic counseling are necessary for patients with hereditary retinal degeneration in some large mutigenerational family.
Hamada, Motoharu; Doisaki, Sayoko; Okuno, Yusuke; Muramatsu, Hideki; Hama, Asahito; Kawashima, Nozomu; Narita, Atsushi; Nishio, Nobuhiro; Yoshida, Kenichi; Kanno, Hitoshi; Manabe, Atsushi; Taga, Takashi; Takahashi, Yoshiyuki; Miyano, Satoru; Ogawa, Seishi; Kojima, Seiji
2018-06-23
Congenital dyserythropoietic anemia (CDA) is a heterogeneous group of rare congenital disorders characterized by ineffective erythropoiesis and dysplastic changes in erythroblasts. Diagnosis of CDA is based primarily on the morphology of bone marrow erythroblasts; however, genetic tests have recently become more important. Here, we performed genetic analysis of 10 Japanese patients who had been diagnosed with CDA based on laboratory findings and morphological characteristics. We examined 10 CDA patients via central review of bone marrow morphology and genetic analysis for congenital bone marrow failure syndromes. Sanger sequencing for CDAN1, SEC23B, and KLF1 was performed for all patients. We performed whole-exome sequencing in patients without mutation in these genes. Three patients carried pathogenic CDAN1 mutations, whereas no SEC23B mutations were identified in our cohort. WES unexpectedly identified gene mutations known to cause congenital hemolytic anemia in two patients: canonical G6PD p.Val394Leu mutation and SPTA1 p.Arg28His mutation. Comprehensive genetic analysis is warranted for more effective diagnosis of patients with suspected CDA.
Abu-Farha, Mohamed; Melhem, Motasem; Abubaker, Jehad; Behbehani, Kazem; Alsmadi, Osama; Elkum, Naser
2016-02-11
ANGPTL8 (betatrophin) has been recently identified as a regulator of lipid metabolism through its interaction with ANGPTL3. A sequence variant in ANGPTL8 has been shown to associate with lower level of Low Density Lipoprotein (LDL) and High Density Lipoprotein (HDL). The objective of this study is to identify sequence variants in ANGPTL8 gene in Arabs and investigate their association with ANGPTL8 plasma level and clinical parameters. A cross sectional study was designed to examine the level of ANGPTL8 in 283 non-diabetic Arabs, and to identify its sequence variants using Sanger sequencing and their association with various clinical parameters. Using Sanger sequencing, we sequenced the full ANGPTL8 gene in 283 Arabs identifying two single nucleotide polymorphisms (SNPs) Rs.892066 and Rs.2278426 in the coding region. Our data shows for the first time that Arabs with the heterozygote form of (c.194C > T Rs.2278426) had higher level of Fasting Blood Glucose (FBG) compared to the CC homozygotes. LDL and HDL level in these subjects did not show significant difference between the two subgroups. Circulation level of ANGPTL8 did not vary between the two forms. No significant changes were observed between the various forms of Rs.892066 variant and FBG, LDL or HDL. Our data shows for the first time that heterozygote form of ANGPTL8 Rs.2278426 variant was associated with higher FBG level in Arabs highlighting the importance of these variants in controlling the function of betatrophin.
Liu, Shanlin; Yang, Chentao; Zhou, Chengran; Zhou, Xin
2017-12-01
Over the past decade, biodiversity researchers have dedicated tremendous efforts to constructing DNA reference barcodes for rapid species registration and identification. Although analytical cost for standard DNA barcoding has been significantly reduced since early 2000, further dramatic reduction in barcoding costs is unlikely because Sanger sequencing is approaching its limits in throughput and chemistry cost. Constraints in barcoding cost not only led to unbalanced barcoding efforts around the globe, but also prevented high-throughput sequencing (HTS)-based taxonomic identification from applying binomial species names, which provide crucial linkages to biological knowledge. We developed an Illumina-based pipeline, HIFI-Barcode, to produce full-length Cytochrome c oxidase subunit I (COI) barcodes from pooled polymerase chain reaction amplicons generated by individual specimens. The new pipeline generated accurate barcode sequences that were comparable to Sanger standards, even for different haplotypes of the same species that were only a few nucleotides different from each other. Additionally, the new pipeline was much more sensitive in recovering amplicons at low quantity. The HIFI-Barcode pipeline successfully recovered barcodes from more than 78% of the polymerase chain reactions that didn't show clear bands on the electrophoresis gel. Moreover, sequencing results based on the single molecular sequencing platform Pacbio confirmed the accuracy of the HIFI-Barcode results. Altogether, the new pipeline can provide an improved solution to produce full-length reference barcodes at about one-tenth of the current cost, enabling construction of comprehensive barcode libraries for local fauna, leading to a feasible direction for DNA barcoding global biomes. © The Authors 2017. Published by Oxford University Press.
Fonseca, Dora Janeth; Patiño, Liliana Catherine; Suárez, Yohjana Carolina; de Jesús Rodríguez, Asid; Mateus, Heidi Eliana; Jiménez, Karen Marcela; Ortega-Recalde, Oscar; Díaz-Yamal, Ivonne; Laissue, Paul
2015-07-01
To identify new molecular actors involved in nonsyndromic premature ovarian failure (POF) etiology. This is a retrospective case-control cohort study. University research group and IVF medical center. Twelve women affected by nonsyndromic POF. The control group included 176 women whose menopause had occurred after age 50 and had no antecedents regarding gynecological disease. A further 345 women from the same ethnic origin (general population group) were also recruited to assess allele frequency for potentially deleterious sequence variants. Next generation sequencing (NGS), Sanger sequencing, and bioinformatics analysis. The complete coding regions of 70 candidate genes were massively sequenced, via NGS, in POF patients. Bioinformatics and genetics were used to confirm NGS results and to identify potential sequence variants related to the disease pathogenesis. We have identified mutations in two novel genes, ADAMTS19 and BMPR2, that are potentially related to POF origin. LHCGR mutations, which might have contributed to the phenotype, were also detected. We thus recommend NGS as a powerful tool for identifying new molecular actors in POF and for future diagnostic/prognostic purposes. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Inskeep, William P.; Jay, Zackary J.; Herrgard, Markus J.; Kozubal, Mark A.; Rusch, Douglas B.; Tringe, Susannah G.; Macur, Richard E.; Jennings, Ryan deM.; Boyd, Eric S.; Spear, John R.; Roberto, Francisco F.
2013-01-01
Geothermal habitats in Yellowstone National Park (YNP) provide an unparalleled opportunity to understand the environmental factors that control the distribution of archaea in thermal habitats. Here we describe, analyze, and synthesize metagenomic and geochemical data collected from seven high-temperature sites that contain microbial communities dominated by archaea relative to bacteria. The specific objectives of the study were to use metagenome sequencing to determine the structure and functional capacity of thermophilic archaeal-dominated microbial communities across a pH range from 2.5 to 6.4 and to discuss specific examples where the metabolic potential correlated with measured environmental parameters and geochemical processes occurring in situ. Random shotgun metagenome sequence (∼40–45 Mb Sanger sequencing per site) was obtained from environmental DNA extracted from high-temperature sediments and/or microbial mats and subjected to numerous phylogenetic and functional analyses. Analysis of individual sequences (e.g., MEGAN and G + C content) and assemblies from each habitat type revealed the presence of dominant archaeal populations in all environments, 10 of whose genomes were largely reconstructed from the sequence data. Analysis of protein family occurrence, particularly of those involved in energy conservation, electron transport, and autotrophic metabolism, revealed significant differences in metabolic strategies across sites consistent with differences in major geochemical attributes (e.g., sulfide, oxygen, pH). These observations provide an ecological basis for understanding the distribution of indigenous archaeal lineages across high-temperature systems of YNP. PMID:23720654
Toward the 1,000 dollars human genome.
Bennett, Simon T; Barnes, Colin; Cox, Anthony; Davies, Lisa; Brown, Clive
2005-06-01
Revolutionary new technologies, capable of transforming the economics of sequencing, are providing an unparalleled opportunity to analyze human genetic variation comprehensively at the whole-genome level within a realistic timeframe and at affordable costs. Current estimates suggest that it would cost somewhere in the region of 30 million US dollars to sequence an entire human genome using Sanger-based sequencing, and on one machine it would take about 60 years. Solexa is widely regarded as a company with the necessary disruptive technology to be the first to achieve the ultimate goal of the so-called 1,000 dollars human genome - the conceptual cost-point needed for routine analysis of individual genomes. Solexa's technology is based on completely novel sequencing chemistry capable of sequencing billions of individual DNA molecules simultaneously, a base at a time, to enable highly accurate, low cost analysis of an entire human genome in a single experiment. When applied over a large enough genomic region, these new approaches to resequencing will enable the simultaneous detection and typing of known, as well as unknown, polymorphisms, and will also offer information about patterns of linkage disequilibrium in the population being studied. Technological progress, leading to the advent of single-molecule-based approaches, is beginning to dramatically drive down costs and increase throughput to unprecedented levels, each being several orders of magnitude better than that which is currently available. A new sequencing paradigm based on single molecules will be faster, cheaper and more sensitive, and will permit routine analysis at the whole-genome level.
Cornforth, Michael N; Anur, Pavana; Wang, Nicholas; Robinson, Erin; Ray, F Andrew; Bedford, Joel S; Loucas, Bradford D; Williams, Eli S; Peto, Myron; Spellman, Paul; Kollipara, Rahul; Kittler, Ralf; Gray, Joe W; Bailey, Susan M
2018-05-11
Chromosome rearrangements are large-scale structural variants that are recognized drivers of oncogenic events in cancers of all types. Cytogenetics allows for their rapid, genome-wide detection, but does not provide gene-level resolution. Massively parallel sequencing (MPS) promises DNA sequence-level characterization of the specific breakpoints involved, but is strongly influenced by bioinformatics filters that affect detection efficiency. We sought to characterize the breakpoint junctions of chromosomal translocations and inversions in the clonal derivatives of human cells exposed to ionizing radiation. Here, we describe the first successful use of DNA paired-end analysis to locate and sequence across the breakpoint junctions of a radiation-induced reciprocal translocation. The analyses employed, with varying degrees of success, several well-known bioinformatics algorithms, a task made difficult by the involvement of repetitive DNA sequences. As for underlying mechanisms, the results of Sanger sequencing suggested that the translocation in question was likely formed via microhomology-mediated non-homologous end joining (mmNHEJ). To our knowledge, this represents the first use of MPS to characterize the breakpoint junctions of a radiation-induced chromosomal translocation in human cells. Curiously, these same approaches were unsuccessful when applied to the analysis of inversions previously identified by directional genomic hybridization (dGH). We conclude that molecular cytogenetics continues to provide critical guidance for structural variant discovery, validation and in "tuning" analysis filters to enable robust breakpoint identification at the base pair level.
A Comprehensive Strategy for Accurate Mutation Detection of the Highly Homologous PMS2.
Li, Jianli; Dai, Hongzheng; Feng, Yanming; Tang, Jia; Chen, Stella; Tian, Xia; Gorman, Elizabeth; Schmitt, Eric S; Hansen, Terah A A; Wang, Jing; Plon, Sharon E; Zhang, Victor Wei; Wong, Lee-Jun C
2015-09-01
Germline mutations in the DNA mismatch repair gene PMS2 underlie the cancer susceptibility syndrome, Lynch syndrome. However, accurate molecular testing of PMS2 is complicated by a large number of highly homologous sequences. To establish a comprehensive approach for mutation detection of PMS2, we have designed a strategy combining targeted capture next-generation sequencing (NGS), multiplex ligation-dependent probe amplification, and long-range PCR followed by NGS to simultaneously detect point mutations and copy number changes of PMS2. Exonic deletions (E2 to E9, E5 to E9, E8, E10, E14, and E1 to E15), duplications (E11 to E12), and a nonsense mutation, p.S22*, were identified. Traditional multiplex ligation-dependent probe amplification and Sanger sequencing approaches cannot differentiate the origin of the exonic deletions in the 3' region when PMS2 and PMS2CL share identical sequences as a result of gene conversion. Our approach allows unambiguous identification of mutations in the active gene with a straightforward long-range-PCR/NGS method. Breakpoint analysis of multiple samples revealed that recurrent exon 14 deletions are mediated by homologous Alu sequences. Our comprehensive approach provides a reliable tool for accurate molecular analysis of genes containing multiple copies of highly homologous sequences and should improve PMS2 molecular analysis for patients with Lynch syndrome. Copyright © 2015 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Copy number analysis reveals a novel multiexon deletion of the COLQ gene in congenital myasthenia.
Wang, Wei; Wu, Yanhong; Wang, Chen; Jiao, Jinsong; Klein, Christopher J
2016-12-01
Congenital myasthenic syndrome (CMS) is genetically and clinically heterogeneous. 1 Despite a considerable number of causal genes discovered, many patients are left without a specific diagnosis after genetic testing. The presumption is that novel genes yet to be discovered will account for the majority of such patients. However, it is also possible that we are neglecting a type of genetic variation: copy number changes (>50 bp) as causal for some of these patients. Next-generation sequencing (NGS) can simultaneously screen all known causal genes 2 and is increasingly being validated to have a potential to identify copy number changes. 3 We present a CMS case who did not receive a genetic diagnosis from previous Sanger sequencing, but through a novel copy number analysis algorithm integrated into our targeted NGS panel, we discovered a novel copy number mutation in the COLQ gene and made a genetic diagnosis. This discovery expands the genotype-phenotype correlation of CMS, leads to improved genetic counsel, and allows for specific pharmacologic treatment. 1 .
A Hypomorphic RAG1 Mutation Resulting in a Phenotype Resembling Common Variable Immunodeficiency
Abolhassani, Hassan; Wang, Ning; Aghamohammadi, Asghar; Rezaei, Nima; Lee, Yu Nee; Frugoni, Francesco; Notrangelo, Luigi D.; Pan-Hammarström, Qiang; Hammarström, Lennart
2014-01-01
Background RAG1 deficiency presents a varied spectrum of combined immunodeficiency, ranging from a T−B−NK+type of disease to a T+B+NK+ phenotype. Objective To assess the genetic background of common variable immunodeficiency (CVID) patients. Methods A patient diagnosed with CVID, who was born in a consanguineous family and thus would be expected to show an autosomal recessive inheritance, was subjected to clinical evaluation, immunological assays, homozygosity gene mapping, exome sequencing, Sanger sequencing and functional analysis. Results The 14-year-old patient, who suffered from liver granuloma, extranodal marginal zone B cell lymphoma and autoimmune neutropenia, is presented with a clinical picture resembling CVID. Genetic analysis of this patient showed a homozygous hypomorphic RAG1 mutation (c.1073 G>A, p.C358Y) with a residual functional capacity of 48% of wild-type protein. Conclusion Our finding broadens the range of disorders associated with RAG1 mutations and may have important therapeutic implications. PMID:24996264
Ovine pedomics: the first study of the ovine foot 16S rRNA-based microbiome
USDA-ARS?s Scientific Manuscript database
We report the first study of the bacterial microbiome of ovine interdigital skin based on 16S rRNA by pyrosequencing and conventional cloning with Sanger-sequencing. Ovine foot rot is an infectious, contagious disease of sheep that causes severe lameness and economic loss from decreased flock produc...
Li, Juan; Ding, Yu; Chang, Guoying; Cheng, Qing; Li, Xin; Wang, Jian; Wang, Xiumin; Shen, Yiping
2017-02-10
To identify the genetic cause for a 11-year-old Chinese boy with Meier-Gorlin syndrome (MGS). Chromosomal microarray analysis (CMA) was used to detect potential variations, while whole exome sequencing (WES) was used to identify sequence variants. Sanger sequencing was used to confirm the suspected variants. The boy has featured short stature, microtia, small patella, slender body build, craniofacial anomalies, and small testes with normal gonadotropin. A complete uniparental disomy of chromosome 16 was revealed by CMA. WES has identified a novel homozygous mutation c.67A>G (p.Lys23Glu) in ORC6 gene mapped to chromosome 16. As predicted by Alamut functional software, the mutation may affect the function of structural domain of the ORC6 protein. The patient is probably the first diagnosed MGS case in China, who carried a novel homozygous mutation of the ORC6 gene and uniparental disomy of chromosome 16. The effect of this novel mutation on the growth and development needs to be further investigated.
[Analysis of MAT1A gene mutations in a child affected with simple hypermethioninemia].
Sun, Yun; Ma, Dingyuan; Wang, Yanyun; Yang, Bin; Jiang, Tao
2017-02-10
To detect potential mutations of MAT1A gene in a child suspected with simple hypermethioninemia by MS/MS neonatal screening. Clinical data of the child was collected. Genomic DNA was extracted by a standard method and subjected to targeted sequencing using an Ion Ampliseq TM Inherited Disease Panel. Detected mutations were verified by Sanger sequencing. The child showed no clinical features except evaluated methionine. A novel compound mutation of the MAT1A gene, i.e., c.345delA and c.529C>T, was identified in the child. His father and mother were found to be heterozygous for the c.345delA mutation and c.529C>T mutation, respectively. The compound mutation c.345delA and c.529C>T of the MAT1A gene probably underlie the disease in the child. The semi-conductor sequencing has provided an important means for the diagnosis of hereditary diseases.
Dusi, Sabrina; Valletta, Lorella; Haack, Tobias B; Tsuchiya, Yugo; Venco, Paola; Pasqualato, Sebastiano; Goffrini, Paola; Tigano, Marco; Demchenko, Nikita; Wieland, Thomas; Schwarzmayr, Thomas; Strom, Tim M; Invernizzi, Federica; Garavaglia, Barbara; Gregory, Allison; Sanford, Lynn; Hamada, Jeffrey; Bettencourt, Conceição; Houlden, Henry; Chiapparini, Luisa; Zorzi, Giovanna; Kurian, Manju A; Nardocci, Nardo; Prokisch, Holger; Hayflick, Susan; Gout, Ivan; Tiranti, Valeria
2014-01-02
Neurodegeneration with brain iron accumulation (NBIA) comprises a clinically and genetically heterogeneous group of disorders with progressive extrapyramidal signs and neurological deterioration, characterized by iron accumulation in the basal ganglia. Exome sequencing revealed the presence of recessive missense mutations in COASY, encoding coenzyme A (CoA) synthase in one NBIA-affected subject. A second unrelated individual carrying mutations in COASY was identified by Sanger sequence analysis. CoA synthase is a bifunctional enzyme catalyzing the final steps of CoA biosynthesis by coupling phosphopantetheine with ATP to form dephospho-CoA and its subsequent phosphorylation to generate CoA. We demonstrate alterations in RNA and protein expression levels of CoA synthase, as well as CoA amount, in fibroblasts derived from the two clinical cases and in yeast. This is the second inborn error of coenzyme A biosynthesis to be implicated in NBIA. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Optimizing the molecular diagnosis of CDKL5 gene-related epileptic encephalopathy in boys.
Mei, Davide; Darra, Francesca; Barba, Carmen; Marini, Carla; Fontana, Elena; Chiti, Laura; Parrini, Elena; Dalla Bernardina, Bernardo; Guerrini, Renzo
2014-11-01
Mutations involving the cyclin-dependent kinase-like 5 (CDKL5) gene cause an early onset epileptic encephalopathy (EE) with severe neurologic impairment and a skewed 12:1 female-to-male ratio. To date, 18 mutations have been described in boys. We analyzed our cohort of boys with early onset EE to assess the diagnostic yield of our molecular approach. We studied 74 boys who presented early onset severe seizures, including infantile spasms and developmental delay, in the setting of EE, using Sanger sequencing, next-generation sequencing (NGS) and multiplex ligation-dependent probe amplification (MLPA). We identified alterations involving CDKL5 in four boys (5.4%) using NGS in one and MLPA in three. Three of four mutations were indicative of somatic mosaicism. CDKL5 gene mutations accounted for 5.4% of boys with early onset EE. Somatic mosaic mutations might be even more represented than germline mutations, probably because their less deleterious effect enhances viability of the male embryo. The molecular approach used for CDKL5 screening remarkably influences the diagnostic yield in boys. Diagnosis is optimized by Sanger sequencing combined with array-based methods or MLPA; alternatively, NGS targeted resequencing designed to also detect copy number alterations, may be performed. Wiley Periodicals, Inc. © 2014 International League Against Epilepsy.
Entamoeba histolytica: construction and applications of subgenomic databases.
Hofer, Margit; Duchêne, Michael
2005-07-01
Knowledge about the influence of environmental stress such as the action of chemotherapeutic agents on gene expression in Entamoeba histolytica is limited. We plan to use oligonucleotide microarray hybridization to approach these questions. As the basis for our array, sequence data from the genome project carried out by the Institute for Genomic Research (TIGR) and the Sanger Institute were used to annotate parts of the parasite genome. Three subgenomic databases containing enzymes, cytoskeleton genes, and stress genes were compiled with the help of the ExPASy proteomics website and the BLAST servers at the two genome project sites. The known sequences from reference species, mostly human and Escherichia coli, were searched against TIGR and Sanger E. histolytica sequence contigs and the homologs were copied into a Microsoft Access database. In a similar way, two additional databases of cytoskeletal genes and stress genes were generated. Metabolic pathways could be assembled from our enzyme database, but sometimes they were incomplete as is the case for the sterol biosynthesis pathway. The raw databases contained a significant number of duplicate entries which were merged to obtain curated non-redundant databases. This procedure revealed that some E. histolytica genes may have several putative functions. Representative examples such as the case of the delta-aminolevulinate synthase/serine palmitoyltransferase are discussed.
Lasigliè, Denise; Mensa-Vilaro, Anna; Ferrera, Denise; Caorsi, Roberta; Penco, Federica; Santamaria, Giuseppe; Di Duca, Marco; Amico, Giulia; Nakagawa, Kenji; Antonini, Francesca; Tommasini, Alberto; Consolini, Rita; Insalaco, Antonella; Cattalini, Marco; Obici, Laura; Gallizzi, Romina; Santarelli, Francesca; Del Zotto, Genny; Severino, Mariasavina; Rubartelli, Anna; Ravazzolo, Roberto; Martini, Alberto; Ceccherini, Isabella; Nishikomori, Ryuta; Gattorno, Marco; Arostegui, Juan I; Borghini, Silvia
2017-11-01
To evaluate the rate of somatic NLRP3 mosaicism in an Italian cohort of mutation-negative patients with cryopyrin-associated periodic syndrome (CAPS). The study enrolled 14 patients with a clinical phenotype consistent with CAPS in whom Sanger sequencing of the NLRP3 gene yielded negative results. Patients' DNA were subjected to amplicon-based NLRP3 deep sequencing. Low-level somatic NLRP3 mosaicism has been detected in 4 patients, 3 affected with chronic infantile neurological cutaneous and articular syndrome and 1 with Muckle-Wells syndrome. Identified nucleotide substitutions encode for 4 different amino acid exchanges, with 2 of them being novel (p.Y563C and p.G564S). In vitro functional studies confirmed the deleterious behavior of the 4 somatic NLRP3 mutations. Among the different neurological manifestations detected, 1 patient displayed mild loss of white matter volume on brain magnetic resonance imaging. The allele frequency of somatic NLRP3 mutations occurs generally under 15%, considered the threshold of detectability using the Sanger method of DNA sequencing. Consequently, routine genetic diagnostic of CAPS should be currently performed by next-generation techniques ensuring high coverage to identify also low-level mosaicism, whose actual frequency is yet unknown and probably underestimated.
Kraková, Lucia; Šoltys, Katarína; Budiš, Jaroslav; Grivalský, Tomáš; Ďuriš, František; Pangallo, Domenico; Szemes, Tomáš
2016-09-01
Different protocols based on Illumina high-throughput DNA sequencing and denaturing gradient gel electrophoresis (DGGE)-cloning were developed and applied for investigating hot spring related samples. The study was focused on three target genes: archaeal and bacterial 16S rRNA and mcrA of methanogenic microflora. Shorter read lengths of the currently most popular technology of sequencing by Illumina do not allow analysis of the complete 16S rRNA region, or of longer gene fragments, as was the case of Sanger sequencing. Here, we demonstrate that there is no need for special indexed or tailed primer sets dedicated to short variable regions of 16S rRNA since the presented approach allows the analysis of complete bacterial 16S rRNA amplicons (V1-V9) and longer archaeal 16S rRNA and mcrA sequences. Sample augmented with transposon is represented by a set of approximately 300 bp long fragments that can be easily sequenced by Illumina MiSeq. Furthermore, a low proportion of chimeric sequences was observed. DGGE-cloning based strategies were performed combining semi-nested PCR, DGGE and clone library construction. Comparing both investigation methods, a certain degree of complementarity was observed confirming that the DGGE-cloning approach is not obsolete. Novel protocols were created for several types of laboratories, utilizing the traditional DGGE technique or using the most modern Illumina sequencing.
Huang, Xiaoyan; Tian, Mao; Li, Jiankang; Cui, Ling; Li, Min; Zhang, Jianguo
2017-11-01
Norrie disease (ND) is a rare X-linked genetic disorder, the main symptoms of which are congenital blindness and white pupils. It has been reported that ND is caused by mutations in the NDP gene. Although many mutations in NDP have been reported, the genetic cause for many patients remains unknown. In this study, the aim is to investigate the genetic defect in a five-generation family with typical symptoms of ND. To identify the causative gene, next-generation sequencing based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members using Sanger sequencing. We identified a novel missense variant (c.314C>A) located within the NDP gene. The mutation cosegregated within all affected individuals in the family and was not found in unaffected members. By happenstance, in this family, we also detected a known pathogenic variant of retinitis pigmentosa in a healthy individual. c.314C>A mutation of NDP gene is a novel mutation and broadens the genetic spectrum of ND.
Next-generation sequencing reveals a novel NDP gene mutation in a Chinese family with Norrie disease
Huang, Xiaoyan; Tian, Mao; Li, Jiankang; Cui, Ling; Li, Min; Zhang, Jianguo
2017-01-01
Purpose: Norrie disease (ND) is a rare X-linked genetic disorder, the main symptoms of which are congenital blindness and white pupils. It has been reported that ND is caused by mutations in the NDP gene. Although many mutations in NDP have been reported, the genetic cause for many patients remains unknown. In this study, the aim is to investigate the genetic defect in a five-generation family with typical symptoms of ND. Methods: To identify the causative gene, next-generation sequencing based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members using Sanger sequencing. Results: We identified a novel missense variant (c.314C>A) located within the NDP gene. The mutation cosegregated within all affected individuals in the family and was not found in unaffected members. By happenstance, in this family, we also detected a known pathogenic variant of retinitis pigmentosa in a healthy individual. Conclusion: c.314C>A mutation of NDP gene is a novel mutation and broadens the genetic spectrum of ND. PMID:29133643
Russell, Jessica S; Caly, Leon; Kostecki, Renata; McGuinness, Sarah L; Carter, Glen; Bulach, Dieter; Seemann, Torsten; Stinear, Tim P; Baird, Rob; Catton, Mike; Druce, Julian
2018-06-11
Murray Valley Encephalitis virus (MVEV) is a mosquito-borne Flavivirus. Clinical presentation is rare but severe, with a case fatality rate of 15⁻30%. Here we report a case of MVEV from the cerebrospinal fluid (CSF) of a patient in the Northern Territory in Australia. Initial diagnosis was performed using both MVEV-specific real-time, and Pan- Flavivirus conventional, Polymerase Chain Reaction (PCR), with confirmation by Sanger sequencing. Subsequent isolation, the first from CSF, was conducted in Vero cells and the observed cytopathic effect was confirmed by increasing viral titre in the real-time PCR. Isolation allowed for full genome sequencing using the Scriptseq V2 RNASeq library preparation kit. A consensus genome for VIDRL-MVE was generated and phylogenetic analysis identified it as Genotype 2. This is the first reported isolation, and full genome sequencing of MVEV from CSF. It is also the first time Genotype 2 has been identified in humans. As such, this case has significant implications for public health surveillance, epidemiology, and the understanding of MVEV evolution.
Data compression for sequencing data
2013-01-01
Post-Sanger sequencing methods produce tons of data, and there is a general agreement that the challenge to store and process them must be addressed with data compression. In this review we first answer the question “why compression” in a quantitative manner. Then we also answer the questions “what” and “how”, by sketching the fundamental compression ideas, describing the main sequencing data types and formats, and comparing the specialized compression algorithms and tools. Finally, we go back to the question “why compression” and give other, perhaps surprising answers, demonstrating the pervasiveness of data compression techniques in computational biology. PMID:24252160
Authentication of Herbal Supplements Using Next-Generation Sequencing
Braukmann, Thomas W. A.; Borisenko, Alex V.; Zakharov, Evgeny V.
2016-01-01
Background DNA-based testing has been gaining acceptance as a tool for authentication of a wide range of food products; however, its applicability for testing of herbal supplements remains contentious. Methods We utilized Sanger and Next-Generation Sequencing (NGS) for taxonomic authentication of fifteen herbal supplements representing three different producers from five medicinal plants: Echinacea purpurea, Valeriana officinalis, Ginkgo biloba, Hypericum perforatum and Trigonella foenum-graecum. Experimental design included three modifications of DNA extraction, two lysate dilutions, Internal Amplification Control, and multiple negative controls to exclude background contamination. Ginkgo supplements were also analyzed using HPLC-MS for the presence of active medicinal components. Results All supplements yielded DNA from multiple species, rendering Sanger sequencing results for rbcL and ITS2 regions either uninterpretable or non-reproducible between the experimental replicates. Overall, DNA from the manufacturer-listed medicinal plants was successfully detected in seven out of eight dry herb form supplements; however, low or poor DNA recovery due to degradation was observed in most plant extracts (none detected by Sanger; three out of seven–by NGS). NGS also revealed a diverse community of fungi, known to be associated with live plant material and/or the fermentation process used in the production of plant extracts. HPLC-MS testing demonstrated that Ginkgo supplements with degraded DNA contained ten key medicinal components. Conclusion Quality control of herbal supplements should utilize a synergetic approach targeting both DNA and bioactive components, especially for standardized extracts with degraded DNA. The NGS workflow developed in this study enables reliable detection of plant and fungal DNA and can be utilized by manufacturers for quality assurance of raw plant materials, contamination control during the production process, and the final product. Interpretation of results should involve an interdisciplinary approach taking into account the processes involved in production of herbal supplements, as well as biocomplexity of plant-plant and plant-fungal biological interactions. PMID:27227830
Authentication of Herbal Supplements Using Next-Generation Sequencing.
Ivanova, Natalia V; Kuzmina, Maria L; Braukmann, Thomas W A; Borisenko, Alex V; Zakharov, Evgeny V
2016-01-01
DNA-based testing has been gaining acceptance as a tool for authentication of a wide range of food products; however, its applicability for testing of herbal supplements remains contentious. We utilized Sanger and Next-Generation Sequencing (NGS) for taxonomic authentication of fifteen herbal supplements representing three different producers from five medicinal plants: Echinacea purpurea, Valeriana officinalis, Ginkgo biloba, Hypericum perforatum and Trigonella foenum-graecum. Experimental design included three modifications of DNA extraction, two lysate dilutions, Internal Amplification Control, and multiple negative controls to exclude background contamination. Ginkgo supplements were also analyzed using HPLC-MS for the presence of active medicinal components. All supplements yielded DNA from multiple species, rendering Sanger sequencing results for rbcL and ITS2 regions either uninterpretable or non-reproducible between the experimental replicates. Overall, DNA from the manufacturer-listed medicinal plants was successfully detected in seven out of eight dry herb form supplements; however, low or poor DNA recovery due to degradation was observed in most plant extracts (none detected by Sanger; three out of seven-by NGS). NGS also revealed a diverse community of fungi, known to be associated with live plant material and/or the fermentation process used in the production of plant extracts. HPLC-MS testing demonstrated that Ginkgo supplements with degraded DNA contained ten key medicinal components. Quality control of herbal supplements should utilize a synergetic approach targeting both DNA and bioactive components, especially for standardized extracts with degraded DNA. The NGS workflow developed in this study enables reliable detection of plant and fungal DNA and can be utilized by manufacturers for quality assurance of raw plant materials, contamination control during the production process, and the final product. Interpretation of results should involve an interdisciplinary approach taking into account the processes involved in production of herbal supplements, as well as biocomplexity of plant-plant and plant-fungal biological interactions.
Mavromatis, Konstantinos; Land, Miriam L; Brettin, Thomas S; Quest, Daniel J; Copeland, Alex; Clum, Alicia; Goodwin, Lynne; Woyke, Tanja; Lapidus, Alla; Klenk, Hans Peter; Cottingham, Robert W; Kyrpides, Nikos C
2012-01-01
The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation. In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases assembly results although on average are of high quality, need to be viewed critically and consider sources of errors in them prior to analysis. These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio).
Chen, Neng; Tranebjærg, Lisbeth; Rendtorff, Nanna Dahl; Schrijver, Iris
2011-01-01
Pendred syndrome and DFNB4 (autosomal recessive nonsyndromic congenital deafness, locus 4) are associated with autosomal recessive congenital sensorineural hearing loss and mutations in the SLC26A4 gene. Extensive allelic heterogeneity, however, necessitates analysis of all exons and splice sites to identify mutations for individual patients. Although Sanger sequencing is the gold standard for mutation detection, screening methods supplemented with targeted sequencing can provide a cost-effective alternative. One such method, denaturing high-performance liquid chromatography, was developed for clinical mutation detection in SLC26A4. However, this method inherently cannot distinguish homozygous changes from wild-type sequences. High-resolution melting (HRM), on the other hand, can detect heterozygous and homozygous changes cost-effectively, without any post-PCR modifications. We developed a closed-tube HRM mutation detection method specific for SLC26A4 that can be used in the clinical diagnostic setting. Twenty-eight primer pairs were designed to cover all 21 SLC26A4 exons and splice junction sequences. Using the resulting amplicons, initial HRM analysis detected all 45 variants previously identified by sequencing. Subsequently, a 384-well plate format was designed for up to three patient samples per run. Blinded HRM testing on these plates of patient samples collected over 1 year in a clinical diagnostic laboratory accurately detected all variants identified by sequencing. In conclusion, HRM with targeted sequencing is a reliable, simple, and cost-effective method for SLC26A4 mutation screening and detection. PMID:21704276
Rowczenio, Dorota M; Gomes, Sónia Melo; Aróstegui, Juan I; Mensa-Vilaro, Anna; Omoyinmi, Ebun; Trojer, Hadija; Baginska, Anna; Baroja-Mazo, Alberto; Pelegrin, Pablo; Savic, Sinisa; Lane, Thirusha; Williams, Rene; Brogan, Paul; Lachmann, Helen J; Hawkins, Philip N
2017-01-01
Cryopyrin-associated periodic syndrome (CAPS) is caused by gain-of-function NLRP3 mutations. Recently, somatic NLRP3 mosaicism has been reported in some CAPS patients who were previously classified as "mutation-negative." We describe here the clinical and laboratory findings in eight British adult patients who presented with symptoms typical of CAPS other than an onset in mid-late adulthood. All patients underwent comprehensive clinical and laboratory investigations, including analysis of the NLRP3 gene using Sanger and amplicon-based deep sequencing (ADS) along with measurements of extracellular apoptosis-associated speck-like protein with CARD domain (ASC) aggregates. The clinical phenotype in all subjects was consistent with mid-spectrum CAPS, except a median age at disease onset of 50 years. Sanger sequencing of NLRP3 was non-diagnostic but ADS detected a somatic NLRP3 mutation in each case. In one patient, DNA isolated from blood demonstrated an increase in the mutant allele from 5 to 45% over 12 years. ASC aggregates in patients' serum measured during active disease were significantly higher than healthy controls. This series represents 8% of CAPS patients diagnosed in a single center, suggesting that acquired NLRP3 mutations may not be an uncommon cause of the syndrome and should be sought in all patients with late-onset symptoms otherwise compatible with CAPS. Steadily worsening CAPS symptoms in one patient were associated with clonal expansion of the mutant allele predominantly affecting myeloid cells. Two patients developed AA amyloidosis, which previously has only been reported in CAPS in association with life-long germline NLRP3 mutations.
Bailey, Sarah F; Scheible, Melissa K; Williams, Christopher; Silva, Deborah S B S; Hoggan, Marina; Eichman, Christopher; Faith, Seth A
2017-11-01
Next-generation Sequencing (NGS) is a rapidly evolving technology with demonstrated benefits for forensic genetic applications, and the strategies to analyze and manage the massive NGS datasets are currently in development. Here, the computing, data storage, connectivity, and security resources of the Cloud were evaluated as a model for forensic laboratory systems that produce NGS data. A complete front-to-end Cloud system was developed to upload, process, and interpret raw NGS data using a web browser dashboard. The system was extensible, demonstrating analysis capabilities of autosomal and Y-STRs from a variety of NGS instrumentation (Illumina MiniSeq and MiSeq, and Oxford Nanopore MinION). NGS data for STRs were concordant with standard reference materials previously characterized with capillary electrophoresis and Sanger sequencing. The computing power of the Cloud was implemented with on-demand auto-scaling to allow multiple file analysis in tandem. The system was designed to store resulting data in a relational database, amenable to downstream sample interpretations and databasing applications following the most recent guidelines in nomenclature for sequenced alleles. Lastly, a multi-layered Cloud security architecture was tested and showed that industry standards for securing data and computing resources were readily applied to the NGS system without disadvantageous effects for bioinformatic analysis, connectivity or data storage/retrieval. The results of this study demonstrate the feasibility of using Cloud-based systems for secured NGS data analysis, storage, databasing, and multi-user distributed connectivity. Copyright © 2017 Elsevier B.V. All rights reserved.
Hegazi, Moustafa Abdelaal; Manou, Sommen; Sakr, Hazem; Camp, Guy Van
2017-01-01
Inherited Palmoplantar Keratodermas are rare disorders of genodermatosis that are conventionally regarded as autosomal dominant in inheritance with extensive clinical and genetic heterogeneity. This is the first report of a unique autosomal recessive Inherited Palmoplantar keratoderma - sensorineural hearing loss syndrome which has not been reported before in 3 siblings of a large consanguineous family. The patients presented unique clinical features that were different from other known Inherited Palmoplantar Keratodermas - hearing loss syndromes. Mutations in GJB2 or GJB6 and the mitochondrial A7445G mutation, known to be the major causes of diverse Inherited Palmoplantar Keratodermas -hearing loss syndromes were not detected by Sanger sequencing. Moreover, the pathogenic mutation could not be identified using whole exome sequencing. Other known Inherited Palmoplantar keratoderma syndromes were excluded based on both clinical criteria and genetic analysis. PMID:29267478
A DNA mini-barcode for land plants.
Little, Damon P
2014-05-01
Small portions of the barcode region - mini-barcodes - may be used in place of full-length barcodes to overcome DNA degradation for samples with poor DNA preservation. 591,491,286 rbcL mini-barcode primer combinations were electronically evaluated for PCR universality, and two novel highly universal sets of priming sites were identified. Novel and published rbcL mini-barcode primers were evaluated for PCR amplification [determined with a validated electronic simulation (n = 2765) and empirically (n = 188)], Sanger sequence quality [determined empirically (n = 188)], and taxonomic discrimination [determined empirically (n = 30,472)]. PCR amplification for all mini-barcodes, as estimated by validated electronic simulation, was successful for 90.2-99.8% of species. Overall Sanger sequence quality for mini-barcodes was very low - the best mini-barcode tested produced sequences of adequate quality (B20 ≥ 0.5) for 74.5% of samples. The majority of mini-barcodes provide correct identifications of families in excess of 70.1% of the time. Discriminatory power noticeably decreased at lower taxonomic levels. At the species level, the discriminatory power of the best mini-barcode was less than 38.2%. For samples believed to contain DNA from only one species, an investigator should attempt to sequence, in decreasing order of utility and probability of success, mini-barcodes F (rbcL1/rbcLB), D (F52/R193) and K (F517/R604). For samples believed to contain DNA from more than one species, an investigator should amplify and sequence mini-barcode D (F52/R193). © 2013 John Wiley & Sons Ltd.
A novel variant of FGFR3 causes proportionate short stature.
Kant, Sarina G; Cervenkova, Iveta; Balek, Lukas; Trantirek, Lukas; Santen, Gijs W E; de Vries, Martine C; van Duyvenvoorde, Hermine A; van der Wielen, Michiel J R; Verkerk, Annemieke J M H; Uitterlinden, André G; Hannema, Sabine E; Wit, Jan M; Oostdijk, Wilma; Krejci, Pavel; Losekoot, Monique
2015-06-01
Mutations of the fibroblast growth factor receptor 3 (FGFR3) cause various forms of short stature, of which the least severe phenotype is hypochondroplasia, mainly characterized by disproportionate short stature. Testing for an FGFR3 mutation is currently not part of routine diagnostic testing in children with short stature without disproportion. A three-generation family A with dominantly transmitted proportionate short stature was studied by whole-exome sequencing to identify the causal gene mutation. Functional studies and protein modeling studies were performed to confirm the pathogenicity of the mutation found in FGFR3. We performed Sanger sequencing in a second family B with dominant proportionate short stature and identified a rare variant in FGFR3. Exome sequencing and/or Sanger sequencing was performed, followed by functional studies using transfection of the mutant FGFR3 into cultured cells; homology modeling was used to construct a three-dimensional model of the two FGFR3 variants. A novel p.M528I mutation in FGFR3 was detected in family A, which segregates with short stature and proved to be activating in vitro. In family B, a rare variant (p.F384L) was found in FGFR3, which did not segregate with short stature and showed normal functionality in vitro compared with WT. Proportionate short stature can be caused by a mutation in FGFR3. Sequencing of this gene can be considered in patients with short stature, especially when there is an autosomal dominant pattern of inheritance. However, functional studies and segregation studies should be performed before concluding that a variant is pathogenic. © 2015 European Society of Endocrinology.
Zhou, Chengran
2017-01-01
Abstract Over the past decade, biodiversity researchers have dedicated tremendous efforts to constructing DNA reference barcodes for rapid species registration and identification. Although analytical cost for standard DNA barcoding has been significantly reduced since early 2000, further dramatic reduction in barcoding costs is unlikely because Sanger sequencing is approaching its limits in throughput and chemistry cost. Constraints in barcoding cost not only led to unbalanced barcoding efforts around the globe, but also prevented high-throughput sequencing (HTS)–based taxonomic identification from applying binomial species names, which provide crucial linkages to biological knowledge. We developed an Illumina-based pipeline, HIFI-Barcode, to produce full-length Cytochrome c oxidase subunit I (COI) barcodes from pooled polymerase chain reaction amplicons generated by individual specimens. The new pipeline generated accurate barcode sequences that were comparable to Sanger standards, even for different haplotypes of the same species that were only a few nucleotides different from each other. Additionally, the new pipeline was much more sensitive in recovering amplicons at low quantity. The HIFI-Barcode pipeline successfully recovered barcodes from more than 78% of the polymerase chain reactions that didn’t show clear bands on the electrophoresis gel. Moreover, sequencing results based on the single molecular sequencing platform Pacbio confirmed the accuracy of the HIFI-Barcode results. Altogether, the new pipeline can provide an improved solution to produce full-length reference barcodes at about one-tenth of the current cost, enabling construction of comprehensive barcode libraries for local fauna, leading to a feasible direction for DNA barcoding global biomes. PMID:29077841
Jurkowska, Monika; Gos, Aleksandra; Ptaszyński, Konrad; Michej, Wanda; Tysarowski, Andrzej; Zub, Renata; Siedlecki, Janusz A; Rutkowski, Piotr
2015-01-01
The study compares detection rates of oncogenic BRAF mutations in a homogenous group of 236 FFPE cutaneous melanoma lymph node metastases, collected in one cancer center. BRAF mutational status was verified by two independent in-house PCR/Sanger sequencing tests, and the Cobas® 4800 BRAF V600 Mutation Test. The best of two sequencing approaches returned results for 230/236 samples. In 140 (60.9%), the mutation in codon 600 of BRAF was found. 91.4% of all mutated cases (128 samples) represented p.V600E. Both Sanger-based tests gave reproducible results although they differed significantly in the percentage of amplifiable samples: 230/236 to 109/143. Cobas generated results in all 236 cases, mutations changing codon V600 were detected in 144 of them (61.0%), including 5 not amplifiable and 5 negative in the standard sequencing. However, 6 cases positive in sequencing turned out to be negative in Cobas. Both tests provided us with the same BRAF V600 mutational status in 219 out of 230 cases with valid results (95.2%). The total BRAF V600 mutation detection rate didn't differ significantly between the two methodological approaches (60.9% vs. 61.0%). Sequencing was a reproducible method of V600 mutation detection and more powerful to detect mutations other than p.V600E, while Cobas test proved to be less susceptible to the poor DNA quality or investigator's bias. The study underlined an important role of pathologists in quality assurance of molecular diagnostics.
de Vries, Tamar I; Monroe, Glen R; van Belzen, Martine J; van der Lans, Christian A; Savelberg, Sanne Mc; Newman, William G; van Haaften, Gijs; Nievelstein, Rutger A; van Haelst, Mieke M
2016-08-01
Rubinstein-Taybi syndrome (RTS, OMIM 180849) and Filippi syndrome (FLPIS, OMIM 272440) are both rare syndromes, with multiple congenital anomalies and intellectual deficit (MCA/ID). We present a patient with intellectual deficit, short stature, bilateral syndactyly of hands and feet, broad thumbs, ocular abnormalities, and dysmorphic facial features. These clinical features suggest both RTS and FLPIS. Initial DNA analysis of DNA isolated from blood did not identify variants to confirm either of these syndrome diagnoses. Whole-exome sequencing identified a homozygous variant in C9orf173, which was novel at the time of analysis. Further Sanger sequencing analysis of FLPIS cases tested negative for CKAP2L variants did not, however, reveal any further variants. Subsequent analysis using DNA isolated from buccal mucosa revealed a mosaic variant in CREBBP. This report highlights the importance of excluding mosaic variants in patients with a strong but atypical clinical presentation of a MCA/ID syndrome if no disease-causing variants can be detected in DNA isolated from blood samples. As the striking syndactyly observed in the present case is typical for FLPIS, we suggest CREBBP analysis in saliva samples for FLPIS syndrome cases in which no causal CKAP2L variant is detected.
NASA Astrophysics Data System (ADS)
Zhao, Cui; Zhang, Xiaojun; Liu, Chengzhang; Huan, Pin; Li, Fuhua; Xiang, Jianhai; Huang, Chao
2012-05-01
Little is known about the genome of Pacific white shrimp ( Litopenaeus vannamei). To address this, we conducted BAC (bacterial artificial chromosome) end sequencing of L. vannamei. We selected and sequenced 7 812 BAC clones from the BAC library LvHE from the two ends of the inserts by Sanger sequencing. After trimming and quality filtering, 11 279 BAC end sequences (BESs) including 4 609 pairedends BESs were obtained. The total length of the BESs was 4 340 753 bp, representing 0.18% of the L. vannamei haploid genome. The lengths of the BESs ranged from 100 bp to 660 bp with an average length of 385 bp. Analysis of the BESs indicated that the L. vannamei genome is AT-rich and that the primary repeats patterns were simple sequence repeats (SSRs) and low complexity sequences. Dinucleotide and hexanucleotide repeats were the most common SSR types in the BESs. The most abundant transposable element was gypsy, which may contribute to the generation of the large genome size of L. vannamei. We successfully annotated 4 519 BESs by BLAST searching, including genes involved in immunity and sex determination. Our results provide an important resource for functional gene studies, map construction and integration, and complete genome assembly for this species.
Machine Learned Replacement of N-Labels for Basecalled Sequences in DNA Barcoding.
Ma, Eddie Y T; Ratnasingham, Sujeevan; Kremer, Stefan C
2018-01-01
This study presents a machine learning method that increases the number of identified bases in Sanger Sequencing. The system post-processes a KB basecalled chromatogram. It selects a recoverable subset of N-labels in the KB-called chromatogram to replace with basecalls (A,C,G,T). An N-label correction is defined given an additional read of the same sequence, and a human finished sequence. Corrections are added to the dataset when an alignment determines the additional read and human agree on the identity of the N-label. KB must also rate the replacement with quality value of in the additional read. Corrections are only available during system training. Developing the system, nearly 850,000 N-labels are obtained from Barcode of Life Datasystems, the premier database of genetic markers called DNA Barcodes. Increasing the number of correct bases improves reference sequence reliability, increases sequence identification accuracy, and assures analysis correctness. Keeping with barcoding standards, our system maintains an error rate of percent. Our system only applies corrections when it estimates low rate of error. Tested on this data, our automation selects and recovers: 79 percent of N-labels from COI (animal barcode); 80 percent from matK and rbcL (plant barcodes); and 58 percent from non-protein-coding sequences (across eukaryotes).
Wang, Xueling; Lin, Xiao-Jiang; Tang, Xiangrong; Chai, Yong-Chuan; Yu, De-Hong; Chen, Dong-Ye; Wu, Hao
2017-11-01
The purpose of this study was to identify the genetic causes of a family presenting with multiple symptoms overlapping Usher syndrome type II (USH2) and Waardenburg syndrome type IV (WS4). Targeted next-generation sequencing including the exon and flanking intron sequences of 79 deafness genes was performed on the proband. Co-segregation of the disease phenotype and the detected variants were confirmed in all family members by PCR amplification and Sanger sequencing. The affected members of this family had two different recessive disorders, USH2 and WS4. By targeted next-generation sequencing, we identified that USH2 was caused by a novel missense mutation, p.V4907D in GPR98; whereas WS4 due to p.V185M in EDNRB. This is the first report of homozygous p.V185M mutation in EDNRB in patient with WS4. This study reported a Chinese family with multiple independent and overlapping phenotypes. In condition, molecular level analysis was efficient to identify the causative variant p.V4907D in GPR98 and p.V185M in EDNRB, also was helpful to confirm the clinical diagnosis of USH2 and WS4. Copyright © 2017 Elsevier B.V. All rights reserved.
Genomics - the new rock and roll?
Dunham, I
2000-10-01
The end of the beginning of the Human Genome Project was announced on 26 June when the working draft or first assembly was announced. Here, Ian Dunham who led the group at the Sanger Centre that produced the first complete sequence of a human chromosome reflects on how it felt to be with the genome project from the beginning.
Gao, Ge; Johnson, Sarah H; Vasmatzis, George; Pauley, Christina E; Tombers, Nicole M; Kasperbauer, Jan L; Smith, David I
2017-01-01
Common fragile sites (CFS) are chromosome regions that are prone to form gaps or breaks in response to DNA replication stress. They are often found as hotspots for sister chromatid exchanges, deletions, and amplifications in different cancers. Many of the CFS regions are found to span genes whose genomic sequence is greater than 1 Mb, some of which have been demonstrated to function as important tumor suppressors. CFS regions are also hotspots for human papillomavirus (HPV) integrations in cervical cancer. We used mate-pair sequencing to examine HPV integration events and chromosomal structural variations in 34 oropharyngeal squamous cell carcinoma (OPSCC). We used endpoint PCR and Sanger sequencing to validate each HPV integration event and found HPV integrations preferentially occurred within CFS regions similar to what is observed in cervical cancer. We also found that many of the chromosomal alterations detected also occurred at or near the cytogenetic location of CFSs. Several large genes were also found to be recurrent targets of rearrangements, independent of HPV integrations, including CSMD1 (2.1Mb), LRP1B (1.9Mb), and LARGE1 (0.7Mb). Sanger sequencing revealed that the nucleotide sequences near to identified junction sites contained repetitive and AT-rich sequences that were shown to have the potential to form stem-loop DNA secondary structures that might stall DNA replication fork progression during replication stress. This could then cause increased instability in these regions which could lead to cancer development in human cells. Our findings suggest that CFSs and some specific large genes appear to play important roles in OPSCC. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Rosenthal, Lisa M; Larsson, Karl-Henrik; Branco, Sara; Chung, Judy A; Glassman, Sydney I; Liao, Hui-Ling; Peay, Kabir G; Smith, Dylan P; Talbot, Jennifer M; Taylor, John W; Vellinga, Else C; Vilgalys, Rytas; Bruns, Thomas D
2017-01-01
The corticioid fungi are commonly encountered, highly diverse, ecologically important, and understudied. We collected specimens in 60 pine and spruce forests across North America to survey corticioid fungal frequency and distribution and to compile an internal transcribed spacer (ITS) database for the group. Sanger sequences from the ITS region of vouchered specimens were compared with sequences on GenBank and UNITE, and with high-throughput sequence data from soil and roots taken at the same sites. Out of 425 high-quality Sanger sequences from vouchered specimens, we recovered 223 distinct operational taxonomic units (OTUs), the majority of which could not be assigned to species by matching to the BLAST database. Corticioid fungi were found to be hyperdiverse, as supported by the observations that nearly two-thirds of our OTUs were represented by single collections and species estimator curves showed steep slopes with no plateaus. We estimate that 14.8-24.7% of our voucher-based OTUs are likely to be ectomycorrhizal (EM). Corticioid fungi recovered from the soil formed a different community assemblage, with EM taxa accounting for 40.5-58.6% of OTUs. We compared basidioma sequences with EM root tips from our data, GenBank, or UNITE, and with this approach, we reiterate existing speculations that Trechispora stellulata is EM. We found that corticioid fungi have a significant distance-decay pattern, adding to the literature supporting fungi as having geographically structured communities. This study provides a first view of the diversity of this important group across North American pine forests, but much of the biology and taxonomy of these diverse, important, and widespread fungi remains unknown.
Wilkinson, Samuel L.; John, Shibu; Walsh, Roddy; Novotny, Tomas; Valaskova, Iveta; Gupta, Manu; Game, Laurence; Barton, Paul J R.; Cook, Stuart A.; Ware, James S.
2013-01-01
Background Molecular genetic testing is recommended for diagnosis of inherited cardiac disease, to guide prognosis and treatment, but access is often limited by cost and availability. Recently introduced high-throughput bench-top DNA sequencing platforms have the potential to overcome these limitations. Methodology/Principal Findings We evaluated two next-generation sequencing (NGS) platforms for molecular diagnostics. The protein-coding regions of six genes associated with inherited arrhythmia syndromes were amplified from 15 human samples using parallelised multiplex PCR (Access Array, Fluidigm), and sequenced on the MiSeq (Illumina) and Ion Torrent PGM (Life Technologies). Overall, 97.9% of the target was sequenced adequately for variant calling on the MiSeq, and 96.8% on the Ion Torrent PGM. Regions missed tended to be of high GC-content, and most were problematic for both platforms. Variant calling was assessed using 107 variants detected using Sanger sequencing: within adequately sequenced regions, variant calling on both platforms was highly accurate (Sensitivity: MiSeq 100%, PGM 99.1%. Positive predictive value: MiSeq 95.9%, PGM 95.5%). At the time of the study the Ion Torrent PGM had a lower capital cost and individual runs were cheaper and faster. The MiSeq had a higher capacity (requiring fewer runs), with reduced hands-on time and simpler laboratory workflows. Both provide significant cost and time savings over conventional methods, even allowing for adjunct Sanger sequencing to validate findings and sequence exons missed by NGS. Conclusions/Significance MiSeq and Ion Torrent PGM both provide accurate variant detection as part of a PCR-based molecular diagnostic workflow, and provide alternative platforms for molecular diagnosis of inherited cardiac conditions. Though there were performance differences at this throughput, platforms differed primarily in terms of cost, scalability, protocol stability and ease of use. Compared with current molecular genetic diagnostic tests for inherited cardiac arrhythmias, these NGS approaches are faster, less expensive, and yet more comprehensive. PMID:23861798
Integrative workflows for metagenomic analysis
Ladoukakis, Efthymios; Kolisis, Fragiskos N.; Chatziioannou, Aristotelis A.
2014-01-01
The rapid evolution of all sequencing technologies, described by the term Next Generation Sequencing (NGS), have revolutionized metagenomic analysis. They constitute a combination of high-throughput analytical protocols, coupled to delicate measuring techniques, in order to potentially discover, properly assemble and map allelic sequences to the correct genomes, achieving particularly high yields for only a fraction of the cost of traditional processes (i.e., Sanger). From a bioinformatic perspective, this boils down to many GB of data being generated from each single sequencing experiment, rendering the management or even the storage, critical bottlenecks with respect to the overall analytical endeavor. The enormous complexity is even more aggravated by the versatility of the processing steps available, represented by the numerous bioinformatic tools that are essential, for each analytical task, in order to fully unveil the genetic content of a metagenomic dataset. These disparate tasks range from simple, nonetheless non-trivial, quality control of raw data to exceptionally complex protein annotation procedures, requesting a high level of expertise for their proper application or the neat implementation of the whole workflow. Furthermore, a bioinformatic analysis of such scale, requires grand computational resources, imposing as the sole realistic solution, the utilization of cloud computing infrastructures. In this review article we discuss different, integrative, bioinformatic solutions available, which address the aforementioned issues, by performing a critical assessment of the available automated pipelines for data management, quality control, and annotation of metagenomic data, embracing various, major sequencing technologies and applications. PMID:25478562
Wang, Bo; Guo, Ruiqi; Zuo, Lei; Shao, Hong; Liu, Ying; Wang, Yu; Ju, Yan; Sun, Chao; Wang, Lifeng; Zhang, Yanmin; Liu, Liwen
2017-08-10
To analyze the phenotype-genotype correlation of MYH7-V878A mutation. Exonic amplification and high-throughput sequencing of 96-cardiovascular disease-related genes were carried out on probands from 210 pedigrees affected with hypertrophic cardiomyopathy (HCM). For the probands, their family members, and 300 healthy volunteers, the identified MYH7-V878A mutation was verified by Sanger sequencing. Information of the HCM patients and their family members, including clinical data, physical examination, echocardiography (UCG), electrocardiography (ECG), and conserved sequence of the mutation among various species were analyzed. A MYH7-V878A mutation was detected in five HCM pedigrees containing 31 family members. Fourteen members have carried the mutation, among whom 11 were diagnosed with HCM, while 3 did not meet the diagnostic criteria. Some of the fourteen members also carried other mutations. Family members not carrying the mutation had normal UCG and ECG. No MYH7-V878A mutation was found among the 300 healthy volunteers. Analysis of sequence conservation showed that the amino acid is located in highly conserved regions among various species. MYH7-V878A is a hot spot among ethnic Han Chinese with a high penetrance. Functional analysis of the conserved sequences suggested that the mutation may cause significant alteration of the function. MYH7-V878A has a significant value for the early diagnosis of HCM.
Trentin, Luca; Bresolin, Silvia; Giarin, Emanuela; Bardini, Michela; Serafin, Valentina; Accordi, Benedetta; Fais, Franco; Tenca, Claudya; De Lorenzo, Paola; Valsecchi, Maria Grazia; Cazzaniga, Giovanni; Kronnie, Geertruy Te; Basso, Giuseppe
2016-10-04
To induce and sustain the leukaemogenic process, MLL-AF4+ leukaemia seems to require very few genetic alterations in addition to the fusion gene itself. Studies of infant and paediatric patients with MLL-AF4+ B cell precursor acute lymphoblastic leukaemia (BCP-ALL) have reported mutations in KRAS and NRAS with incidences ranging from 25 to 50%. Whereas previous studies employed Sanger sequencing, here we used next generation amplicon deep sequencing for in depth evaluation of RAS mutations in 36 paediatric patients at diagnosis of MLL-AF4+ leukaemia. RAS mutations including those in small sub-clones were detected in 63.9% of patients. Furthermore, the mutational analysis of 17 paired samples at diagnosis and relapse revealed complex RAS clone dynamics and showed that the mutated clones present at relapse were almost all originated from clones that were already detectable at diagnosis and survived to the initial therapy. Finally, we showed that mutated patients were indeed characterized by a RAS related signature at both transcriptional and protein levels and that the targeting of the RAS pathway could be of beneficial for treatment of MLL-AF4+ BCP-ALL clones carrying somatic RAS mutations.
Novel FAM20A mutation causes autosomal recessive amelogenesis imperfecta.
Volodarsky, Michael; Zilberman, Uri; Birk, Ohad S
2015-06-01
To relate the peculiar phenotype of amelogenesis imperfecta in a large Bedouin family to the genotype determined by whole genome linkage analysis. Amelogenesis imperfecta (AI) is a broad group of inherited pathologies affecting enamel formation, characterized by variability in phenotypes, causing mutations and modes of inheritance. Autosomal recessive or compound heterozygous mutations in FAM20A, encoding sequence similarity 20, member A, have been shown to cause several AI phenotypes. Five members from a large consanguineous Bedouin family presented with hypoplastic amelogenesis imperfecta with unerupted and resorbed permanent molars. Following Soroka Medical Center IRB approval and informed consent, blood samples were obtained from six affected offspring, five obligatory carriers and two unaffected siblings. Whole genome linkage analysis was performed followed by Sanger sequencing of FAM20A. The sequencing unravelled a novel homozygous deletion mutation in exon 11 (c.1523delC), predicted to insert a premature stop codon (p.Thr508Lysfs*6). We provide an interesting case of novel mutation in this rare disorder, in which the affected kindred is unique in the large number of family members sharing a similar phenotype. Copyright © 2015 Elsevier Ltd. All rights reserved.
Intronic splicing mutations in PTCH1 cause Gorlin syndrome.
Bholah, Zaynab; Smith, Miriam J; Byers, Helen J; Miles, Emma K; Evans, D Gareth; Newman, William G
2014-09-01
Gorlin syndrome is an autosomal dominant disorder characterized by multiple early-onset basal cell carcinoma, odontogenic keratocysts and skeletal abnormalities. It is caused by heterozygous mutations in the tumour suppressor PTCH1. Routine clinical genetic testing, by Sanger sequencing and multiplex ligation-dependent probe amplification (MLPA) to confirm a clinical diagnosis of Gorlin syndrome, identifies a mutation in 60-90 % of cases. We undertook RNA analysis on lymphocytes from ten individuals diagnosed with Gorlin syndrome, but without known PTCH1 mutations by exonic sequencing or MLPA. Two altered PTCH1 transcripts were identified. Genomic DNA sequence analysis identified an intron 7 mutation c.1068-10T>A, which created a strong cryptic splice acceptor site, leading to an intronic insertion of eight bases; this is predicted to create a frameshift p.(His358Alafs*12). Secondly, a deep intronic mutation c.2561-2057A>G caused an inframe insertion of 78 intronic bases in the cDNA transcript, leading to a premature stop codon p.(Gly854fs*3). The mutations are predicted to cause loss of function of PTCH1, consistent with its tumour suppressor function. The findings indicate the importance of RNA analysis to detect intronic mutations in PTCH1 not identified by routine screening techniques.
Wu, Shuang; Nakamoto, Shingo; Kanda, Tatsuo; Jiang, Xia; Nakamura, Masato; Miyamura, Tatsuo; Shirasawa, Hiroshi; Sugiura, Nobuyuki; Takahashi-Nakaguchi, Azusa; Gonoi, Tohru; Yokosuka, Osamu
2014-01-01
Hepatitis A virus (HAV) is a causative agent of acute viral hepatitis for which an effective vaccine has been developed. Here we describe ultra-deep pyrosequences (UDPSs) of HAV 5'-untranslated region (5'UTR) among cases of the same outbreak, which arose from a single source, associated with a revolving sushi bar. We determined the reference sequence from HAV-derived clone from an attendant by the Sanger method. Sixteen UDPSs from this outbreak and one from another sporadic case were compared with this reference. Nucleotide errors yielded a UDPS error rate of < 1%. This study confirmed that nucleotide substitutions of this region are transition mutations in outbreak cases, that insertion was observed only in non-severe cases, and that these nucleotide substitutions were different from those of the sporadic case. Analysis of UDPSs detected low-prevalence HAV variations in 5'UTR, but no specific mutations associated with severity in these outbreak cases. To our surprise, HAV strains in this outbreak conserved HAV IRES sequence even if we performed analysis of UDPSs. UDPS analysis of HAV 5'UTR gave us no association between the disease severity of hepatitis A and HAV 5'UTR substitutions. It might be more interesting to perform ultra-deep sequencing of full length HAV genome in order to reveal possible unknown genomic determinants associated with disease severity. Further studies will be needed. PMID:24396287
A comparative molecular analysis of water-filled limestone sinkholes in north-eastern Mexico.
Sahl, Jason W; Gary, Marcus O; Harris, J Kirk; Spear, John R
2011-01-01
Sistema Zacatón in north-eastern Mexico is host to several deep, water-filled, anoxic, karstic sinkholes (cenotes). These cenotes were explored, mapped, and geochemically and microbiologically sampled by the autonomous underwater vehicle deep phreatic thermal explorer (DEPTHX). The community structure of the filterable fraction of the water column and extensive microbial mats that coat the cenote walls was investigated by comparative analysis of small-subunit (SSU) 16S rRNA gene sequences. Full-length Sanger gene sequence analysis revealed novel microbial diversity that included three putative bacterial candidate phyla and three additional groups that showed high intra-clade distance with poorly characterized bacterial candidate phyla. Limited functional gene sequence analysis in these anoxic environments identified genes associated with methanogenesis, sulfate reduction and anaerobic ammonium oxidation. A directed, barcoded amplicon, multiplex pyrosequencing approach was employed to compare ∼100,000 bacterial SSU gene sequences from water column and wall microbial mat samples from five cenotes in Sistema Zacatón. A new, high-resolution sequence distribution profile (SDP) method identified changes in specific phylogenetic types (phylotypes) in microbial mats at varied depths; Mantel tests showed a correlation of the genetic distances between mat communities in two cenotes and the geographic location of each cenote. Community structure profiles from the water column of three neighbouring cenotes showed distinct variation; statistically significant differences in the concentration of geochemical constituents suggest that the variation observed in microbial communities between neighbouring cenotes are due to geochemical variation. © 2010 Society for Applied Microbiology and Blackwell Publishing Ltd.
Kelm-Nelson, Cynthia A; Stevenson, Sharon A; Ciucci, Michelle R
2016-09-01
Datasets provided in this article represent the Rattus norvegicus primer design and verification used in Pink1 -/- and wildtype Long Evans brain tissue. Accessible tables include relevant information, accession numbers, sequences, temperatures and product length, describing primer design specific to the transcript amplification use. Additionally, results of Sanger sequencing of qPCR reaction products (FASTA aligned sequences) are presented for genes of interest. Results and further interpretation and discussion can be found in the original research article "Atp13a2 expression in the periaqueductal gray is decreased in the Pink1 -/- rat model of Parkinson disease" [1].
Next-Generation Sequencing Platforms
NASA Astrophysics Data System (ADS)
Mardis, Elaine R.
2013-06-01
Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.
Ciliates and the rare biosphere: a review.
Dunthorn, Micah; Stoeck, Thorsten; Clamp, John; Warren, Alan; Mahé, Frédéric
2014-01-01
Here we provide a brief review of the rare biosphere from the perspective of ciliates and other microbial eukaryotes. We trace research on rarity from its lack of much in-depth focus in morphological and Sanger sequencing projects, to its central importance in analyses using high throughput sequencing strategies. The problem that the rare biosphere is potentially comprised of mostly errors is then discussed in the light of asking community-comparative, novel-diversity, and ecosystem-functioning questions. © 2014 The Author(s) Journal of Eukaryotic Microbiology © 2014 International Society of Protistologists.
A massive parallel sequencing workflow for diagnostic genetic testing of mismatch repair genes
Hansen, Maren F; Neckmann, Ulrike; Lavik, Liss A S; Vold, Trine; Gilde, Bodil; Toft, Ragnhild K; Sjursen, Wenche
2014-01-01
The purpose of this study was to develop a massive parallel sequencing (MPS) workflow for diagnostic analysis of mismatch repair (MMR) genes using the GS Junior system (Roche). A pathogenic variant in one of four MMR genes, (MLH1, PMS2, MSH6, and MSH2), is the cause of Lynch Syndrome (LS), which mainly predispose to colorectal cancer. We used an amplicon-based sequencing method allowing specific and preferential amplification of the MMR genes including PMS2, of which several pseudogenes exist. The amplicons were pooled at different ratios to obtain coverage uniformity and maximize the throughput of a single-GS Junior run. In total, 60 previously identified and distinct variants (substitutions and indels), were sequenced by MPS and successfully detected. The heterozygote detection range was from 19% to 63% and dependent on sequence context and coverage. We were able to distinguish between false-positive and true-positive calls in homopolymeric regions by cross-sample comparison and evaluation of flow signal distributions. In addition, we filtered variants according to a predefined status, which facilitated variant annotation. Our study shows that implementation of MPS in routine diagnostics of LS can accelerate sample throughput and reduce costs without compromising sensitivity, compared to Sanger sequencing. PMID:24689082
Kerkhof, Jennifer; Schenkel, Laila C; Reilly, Jack; McRobbie, Sheri; Aref-Eshghi, Erfan; Stuart, Alan; Rupar, C Anthony; Adams, Paul; Hegele, Robert A; Lin, Hanxin; Rodenhiser, David; Knoll, Joan; Ainsworth, Peter J; Sadikovic, Bekim
2017-11-01
Next-generation sequencing (NGS) technology has rapidly replaced Sanger sequencing in the assessment of sequence variations in clinical genetics laboratories. One major limitation of current NGS approaches is the ability to detect copy number variations (CNVs) approximately >50 bp. Because these represent a major mutational burden in many genetic disorders, parallel CNV assessment using alternate supplemental methods, along with the NGS analysis, is normally required, resulting in increased labor, costs, and turnaround times. The objective of this study was to clinically validate a novel CNV detection algorithm using targeted clinical NGS gene panel data. We have applied this approach in a retrospective cohort of 391 samples and a prospective cohort of 2375 samples and found a 100% sensitivity (95% CI, 89%-100%) for 37 unique events and a high degree of specificity to detect CNVs across nine distinct targeted NGS gene panels. This NGS CNV pipeline enables stand-alone first-tier assessment for CNV and sequence variants in a clinical laboratory setting, dispensing with the need for parallel CNV analysis using classic techniques, such as microarray, long-range PCR, or multiplex ligation-dependent probe amplification. This NGS CNV pipeline can also be applied to the assessment of complex genomic regions, including pseudogenic DNA sequences, such as the PMS2CL gene, and to mitochondrial genome heteroplasmy detection. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
1996-02-01
JOICFP President Shidzue Kato was influenced by and friends with Margaret Sanger from their initial meeting in 1920 to Sanger's death in 1966. Bruce Alfred is currently directing and producing a 90-minute documentary film about Sanger and her pioneering work in promoting the development and use of family planning. Once completed in the Spring of 1997, the film will be broadcast nationally in the US on Public Television. It is being produced with support from the National Endowment for the Humanities and several private foundations. Alfred interviewed Kato for three hours to gain insight into the life and legacy of Margaret Sanger. Sanger inspired Kato to make birth control her life's work. Kato spoke about how the prewar, pronatalist Japanese government allowed Sanger to visit Japan in 1922 only on the condition that she not speak about birth control. This official opposition and the subsequent reaction, however, actually fueled interest in Sanger and her message, and caused her ideas to become widely known among the Japanese public. While in Japan, Sanger did manage to discuss family planning, but in English in a closed meeting. Oddly enough, the government honored Sanger after the second World War with the highest award presented to non-Japanese. Kato noted how unfortunate it was that Sanger died in 1966 without witnessing the realization of the UNFPA.
Cefalù, Angelo B; Spina, Rossella; Noto, Davide; Ingrassia, Valeria; Valenti, Vincenza; Giammanco, Antonina; Fayer, Francesca; Misiano, Gabriella; Cocorullo, Gianfranco; Scrimali, Chiara; Palesano, Ornella; Altieri, Grazia I; Ganci, Antonina; Barbagallo, Carlo M; Averna, Maurizio R
Severe hypertriglyceridemia (HTG) may result from mutations in genes affecting the intravascular lipolysis of triglyceride (TG)-rich lipoproteins. The aim of this study was to develop a targeted next-generation sequencing panel for the molecular diagnosis of disorders characterized by severe HTG. We developed a targeted customized panel for next-generation sequencing Ion Torrent Personal Genome Machine to capture the coding exons and intron/exon boundaries of 18 genes affecting the main pathways of TG synthesis and metabolism. We sequenced 11 samples of patients with severe HTG (TG>885 mg/dL-10 mmol/L): 4 positive controls in whom pathogenic mutations had previously been identified by Sanger sequencing and 7 patients in whom the molecular defect was still unknown. The customized panel was accurate, and it allowed to confirm genetic variants previously identified in all positive controls with primary severe HTG. Only 1 patient of 7 with HTG was found to be carrier of a homozygous pathogenic mutation of the third novel mutation of LMF1 gene (c.1380C>G-p.Y460X). The clinical and molecular familial cascade screening allowed the identification of 2 additional affected siblings and 7 heterozygous carriers of the mutation. We showed that our targeted resequencing approach for genetic diagnosis of severe HTG appears to be accurate, less time consuming, and more economical compared with traditional Sanger resequencing. The identification of pathogenic mutations in candidate genes remains challenging and clinical resequencing should mainly intended for patients with strong clinical criteria for monogenic severe HTG. Copyright © 2017 National Lipid Association. Published by Elsevier Inc. All rights reserved.
Gocho, Kiyoko; Akeo, Keiichiro; Itoh, Naoko; Kameya, Shuhei; Hayashi, Takaaki; Katagiri, Satoshi; Gekka, Tamaki; Ohkuma, Yasuhiro; Tsuneoka, Hiroshi; Takahashi, Hiroshi
2016-12-01
To report the clinical features of Japanese patients at Stage 1 and 2 of central areolar choroidal dystrophy (CACD). Five family members had comprehensive ophthalmic examinations including adaptive optics (AO) retinal imaging. Mutation analysis of the PRPH2 gene was performed by Sanger sequencing. The protocol conformed to the tenets of the Declaration of Helsinki and was approved by the institutional review board of The Jikei University School of Medicine. Four family members had a heterozygous PRPH2 mutation, p.R172Q; however, one member with a mutation did not show any ophthalmological abnormalities. Two patients had mild parafoveal retinal dystrophy and a reduction of cone density determined by AO analysis. The results indicate that the parafoveal cone photoreceptors can be affected even at the early stage of CACD. [Ophthalmic Surg Lasers Imaging Retina. 2016;47:1115-1126.]. Copyright 2016, SLACK Incorporated.
Navigating the tip of the genomic iceberg: Next-generation sequencing for plant systematics.
Straub, Shannon C K; Parks, Matthew; Weitemier, Kevin; Fishbein, Mark; Cronn, Richard C; Liston, Aaron
2012-02-01
Just as Sanger sequencing did more than 20 years ago, next-generation sequencing (NGS) is poised to revolutionize plant systematics. By combining multiplexing approaches with NGS throughput, systematists may no longer need to choose between more taxa or more characters. Here we describe a genome skimming (shallow sequencing) approach for plant systematics. Through simulations, we evaluated optimal sequencing depth and performance of single-end and paired-end short read sequences for assembly of nuclear ribosomal DNA (rDNA) and plastomes and addressed the effect of divergence on reference-guided plastome assembly. We also used simulations to identify potential phylogenetic markers from low-copy nuclear loci at different sequencing depths. We demonstrated the utility of genome skimming through phylogenetic analysis of the Sonoran Desert clade (SDC) of Asclepias (Apocynaceae). Paired-end reads performed better than single-end reads. Minimum sequencing depths for high quality rDNA and plastome assemblies were 40× and 30×, respectively. Divergence from the reference significantly affected plastome assembly, but relatively similar references are available for most seed plants. Deeper rDNA sequencing is necessary to characterize intragenomic polymorphism. The low-copy fraction of the nuclear genome was readily surveyed, even at low sequencing depths. Nearly 160000 bp of sequence from three organelles provided evidence of phylogenetic incongruence in the SDC. Adoption of NGS will facilitate progress in plant systematics, as whole plastome and rDNA cistrons, partial mitochondrial genomes, and low-copy nuclear markers can now be efficiently obtained for molecular phylogenetics studies.
Generic Amplicon Deep Sequencing to Determine Ilarvirus Species Diversity in Australian Prunus
Kinoti, Wycliff M.; Constable, Fiona E.; Nancarrow, Narelle; Plummer, Kim M.; Rodoni, Brendan
2017-01-01
The distribution of Ilarvirus species populations amongst 61 Australian Prunus trees was determined by next generation sequencing (NGS) of amplicons generated using a genus-based generic RT-PCR targeting a conserved region of the Ilarvirus RNA2 component that encodes the RNA dependent RNA polymerase (RdRp) gene. Presence of Ilarvirus sequences in each positive sample was further validated by Sanger sequencing of cloned amplicons of regions of each of RNA1, RNA2 and/or RNA3 that were generated by species specific PCRs and by metagenomic NGS. Prunus necrotic ringspot virus (PNRSV) was the most frequently detected Ilarvirus, occurring in 48 of the 61 Ilarvirus-positive trees and Prune dwarf virus (PDV) and Apple mosaic virus (ApMV) were detected in three trees and one tree, respectively. American plum line pattern virus (APLPV) was detected in three trees and represents the first report of APLPV detection in Australia. Two novel and distinct groups of Ilarvirus-like RNA2 amplicon sequences were also identified in several trees by the generic amplicon NGS approach. The high read depth from the amplicon NGS of the generic PCR products allowed the detection of distinct RNA2 RdRp sequence variant populations of PNRSV, PDV, ApMV, APLPV and the two novel Ilarvirus-like sequences. Mixed infections of ilarviruses were also detected in seven Prunus trees. Sanger sequencing of specific RNA1, RNA2, and/or RNA3 genome segments of each virus and total nucleic acid metagenomics NGS confirmed the presence of PNRSV, PDV, ApMV and APLPV detected by RNA2 generic amplicon NGS. However, the two novel groups of Ilarvirus-like RNA2 amplicon sequences detected by the generic amplicon NGS could not be associated to the presence of sequence from RNA1 or RNA3 genome segments or full Ilarvirus genomes, and their origin is unclear. This work highlights the sensitivity of genus-specific amplicon NGS in detection of virus sequences and their distinct populations in multiple samples, and the need for a standardized approach to accurately determine what constitutes an active, viable virus infection after detection by molecular based methods. PMID:28713347
Generic Amplicon Deep Sequencing to Determine Ilarvirus Species Diversity in Australian Prunus.
Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan
2017-01-01
The distribution of Ilarvirus species populations amongst 61 Australian Prunus trees was determined by next generation sequencing (NGS) of amplicons generated using a genus-based generic RT-PCR targeting a conserved region of the Ilarvirus RNA2 component that encodes the RNA dependent RNA polymerase (RdRp) gene. Presence of Ilarvirus sequences in each positive sample was further validated by Sanger sequencing of cloned amplicons of regions of each of RNA1, RNA2 and/or RNA3 that were generated by species specific PCRs and by metagenomic NGS. Prunus necrotic ringspot virus (PNRSV) was the most frequently detected Ilarvirus , occurring in 48 of the 61 Ilarvirus -positive trees and Prune dwarf virus (PDV) and Apple mosaic virus (ApMV) were detected in three trees and one tree, respectively. American plum line pattern virus (APLPV) was detected in three trees and represents the first report of APLPV detection in Australia. Two novel and distinct groups of Ilarvirus -like RNA2 amplicon sequences were also identified in several trees by the generic amplicon NGS approach. The high read depth from the amplicon NGS of the generic PCR products allowed the detection of distinct RNA2 RdRp sequence variant populations of PNRSV, PDV, ApMV, APLPV and the two novel Ilarvirus -like sequences. Mixed infections of ilarviruses were also detected in seven Prunus trees. Sanger sequencing of specific RNA1, RNA2, and/or RNA3 genome segments of each virus and total nucleic acid metagenomics NGS confirmed the presence of PNRSV, PDV, ApMV and APLPV detected by RNA2 generic amplicon NGS. However, the two novel groups of Ilarvirus -like RNA2 amplicon sequences detected by the generic amplicon NGS could not be associated to the presence of sequence from RNA1 or RNA3 genome segments or full Ilarvirus genomes, and their origin is unclear. This work highlights the sensitivity of genus-specific amplicon NGS in detection of virus sequences and their distinct populations in multiple samples, and the need for a standardized approach to accurately determine what constitutes an active, viable virus infection after detection by molecular based methods.
Rowczenio, Dorota M.; Gomes, Sónia Melo; Aróstegui, Juan I.; Mensa-Vilaro, Anna; Omoyinmi, Ebun; Trojer, Hadija; Baginska, Anna; Baroja-Mazo, Alberto; Pelegrin, Pablo; Savic, Sinisa; Lane, Thirusha; Williams, Rene; Brogan, Paul; Lachmann, Helen J.; Hawkins, Philip N.
2017-01-01
Cryopyrin-associated periodic syndrome (CAPS) is caused by gain-of-function NLRP3 mutations. Recently, somatic NLRP3 mosaicism has been reported in some CAPS patients who were previously classified as “mutation-negative.” We describe here the clinical and laboratory findings in eight British adult patients who presented with symptoms typical of CAPS other than an onset in mid-late adulthood. All patients underwent comprehensive clinical and laboratory investigations, including analysis of the NLRP3 gene using Sanger and amplicon-based deep sequencing (ADS) along with measurements of extracellular apoptosis-associated speck-like protein with CARD domain (ASC) aggregates. The clinical phenotype in all subjects was consistent with mid-spectrum CAPS, except a median age at disease onset of 50 years. Sanger sequencing of NLRP3 was non-diagnostic but ADS detected a somatic NLRP3 mutation in each case. In one patient, DNA isolated from blood demonstrated an increase in the mutant allele from 5 to 45% over 12 years. ASC aggregates in patients’ serum measured during active disease were significantly higher than healthy controls. This series represents 8% of CAPS patients diagnosed in a single center, suggesting that acquired NLRP3 mutations may not be an uncommon cause of the syndrome and should be sought in all patients with late-onset symptoms otherwise compatible with CAPS. Steadily worsening CAPS symptoms in one patient were associated with clonal expansion of the mutant allele predominantly affecting myeloid cells. Two patients developed AA amyloidosis, which previously has only been reported in CAPS in association with life-long germline NLRP3 mutations. PMID:29163488
Xp11.2 translocation renal cell carcinoma with PSF-TFE3 rearrangement.
Zhong, Minghao; Weisman, Paul; Zhu, Bing; Brassesco, Maria; Yang, Youfeng; Linehan, W Marston; Merino, Maria J; Zhang, David; Rohan, Stephen; Cai, Dongming; Yang, Ximing
2013-06-01
Xp11.2 translocation renal cell carcinoma (Xp11.2 RCC) is a subtype of RCC characterized by translocations involving a breakpoint at the TFE3 gene (Xp11.2). Moderate to strong nuclear TFE3 immunoreactivity has been recognized as a specific diagnostic marker for this type of tumor. However, exclusive cytoplasmic localization of a TFE3 fusion protein was reported in UOK 145 cells, a cell line derived from an Xp11.2 RCC harboring the PSF-TFE3 translocation. If reproducible using immunohistochemistry (IHC), this finding would have important implications for pathologists in the diagnosis of Xp11.2 RCC, calling into question the specificity of nuclear immunoreactivity for TFE3 in these tumors. The purpose of this study was to determine whether the above-noted cytoplasmic localization of the TFE3 fusion protein could be reproduced using IHC. UOK 145 cells and fresh frozen tissue from 2 clinical cases of Xp11.2 RCC found to harbor the PSF-TFE3 gene rearrangement (by cytogenetic testing) were collected. All samples were subjected to histopathologic evaluation by board-certified pathologists, TFE3 IHC, reverse transcription polymerase chain reaction, and Sanger sequencing analysis. A strong nuclear TFE3 immunoreactivity was demonstrated in all samples including the UOK 145 cell line. No cytoplasmic immunoreactivity was seen. Reverse transcription polymerase chain reaction and Sanger sequencing confirmed the previously reported PSF-TFE3 gene fusion between exon 9 of PSF and exon 6 of TFE3 in the UOK 145 cell line and in one of 2 clinical cases of Xp11.2 RCC. A novel PSF-TFE3 gene fusion between exon 9 of PSF and exon 5 of TFE3 was detected in the second clinical case of Xp11.2 RCC.
Kumar, Anupam; Pathak, Pankaj; Purkait, Suvendu; Faruq, Mohammed; Jha, Prerana; Mallick, Supriya; Suri, Vaishali; Sharma, Mehar C; Suri, Ashish; Sarkar, Chitra
2015-03-01
Pediatric oligodendrogliomas (pODGs) are rare central nervous system tumors, and comparatively little is known about their molecular pathogenesis. Co-deletion of 1p/19q; and IDH1, CIC, and FUBP1 mutations, which are molecular signatures of adult oligodendrogliomas, are extremely rare in pODGs. In this report, two pODGs, one each of grade II and grade III, were evaluated using clinical, radiological, histopathologic, and follow-up methods. IDH1, TP53, CIC, H3F3A, and BRAF-V600 E mutations were analyzed by Sanger sequencing and immunohistochemical methods, and 1p/19q co-deletion was analyzed by fluorescence in situ hybridization. PDGFRA amplification, BRAF gain, intragenic duplication of FGFR-TKD, and KIAA1549-BRAF fusion (validated by Sanger sequencing) were analyzed by real-time reverse transcription PCR. Notably, both cases showed the oncogenic KIAA1549_Ex15-BRAF_Ex9 fusion transcript. Further, immunohistochemical analysis showed activation of the MAPK/ERK pathway in both of these cases. However, neither 1p/19q co-deletion; IDH1, TP53, CIC, H3F3A, nor BRAF-V600 E mutation; PDGFRA amplification; BRAF gain; nor duplication of FGFR-TKD was identified. Overall, this study highlights that pODGs can harbor the KIAA1549-BRAF fusion with aberrant MAPK/ERK signaling, and there exists an option of targeting these pathways in such patients. These results indicate that pODGs with the KIAA1549-BRAF fusion may represent a subset of this rare tumor that shares molecular and genetic features of pilocytic astrocytomas. These findings will increase our understanding of pODGs and may have clinical implications. Copyright © 2015 Elsevier Inc. All rights reserved.
Sahl, Jason W; Fairfield, Nathaniel; Harris, J Kirk; Wettergreen, David; Stone, William C; Spear, John R
2010-03-01
The deep phreatic thermal explorer (DEPTHX) is an autonomous underwater vehicle designed to navigate an unexplored environment, generate high-resolution three-dimensional (3-D) maps, collect biological samples based on an autonomous sampling decision, and return to its origin. In the spring of 2007, DEPTHX was deployed in Zacatón, a deep (approximately 318 m), limestone, phreatic sinkhole (cenote) in northeastern Mexico. As DEPTHX descended, it generated a 3-D map based on the processing of range data from 54 onboard sonars. The vehicle collected water column samples and wall biomat samples throughout the depth profile of the cenote. Post-expedition sample analysis via comparative analysis of 16S rRNA gene sequences revealed a wealth of microbial diversity. Traditional Sanger gene sequencing combined with a barcoded-amplicon pyrosequencing approach revealed novel, phylum-level lineages from the domains Bacteria and Archaea; in addition, several novel subphylum lineages were also identified. Overall, DEPTHX successfully navigated and mapped Zacatón, and collected biological samples based on an autonomous decision, which revealed novel microbial diversity in a previously unexplored environment.
Wang, Shi-Yuan; Zhang, Qi; Zhang, Xiang; Zhao, Pei-Quan
2016-01-01
To make a comprehensive analysis of the potential pathogenic genes related with Leber congenital amaurosis (LCA) in Chinese. LCA subjects and their families were retrospectively collected from 2013 to 2015. Firstly, whole-exome sequencing was performed in patients who had underwent gene mutation screening with nothing found, and then homozygous sites was selected, candidate sites were annotated, and pathogenic analysis was conducted using softwares including Sorting Tolerant from Intolerant (SIFT), Polyphen-2, Mutation assessor, Condel, and Functional Analysis through Hidden Markov Models (FATHMM). Furthermore, Gene Ontology function and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of pathogenic genes were performed followed by co-segregation analysis using Fisher exact Test. Sanger sequencing was used to validate single-nucleotide variations (SNVs). Expanded verification was performed in the rest patients. Totally 51 LCA families with 53 patients and 24 family members were recruited. A total of 104 SNVs (66 LCA-related genes and 15 co-segregated genes) were submitted for expand verification. The frequencies of homozygous mutation of KRT12 and CYP1A1 were simultaneously observed in 3 families. Enrichment analysis showed that the potential pathogenic genes were mainly enriched in functions related to cell adhesion, biological adhesion, retinoid metabolic process, and eye development biological adhesion. Additionally, WFS1 and STAU2 had the highest homozygous frequencies. LCA is a highly heterogeneous disease. Mutations in KRT12, CYP1A1, WFS1, and STAU2 may be involved in the development of LCA.
Zhang, Xiuwen; Unmack, Peter J; Kuchling, Gerald; Wang, Yinan; Georges, Arthur
2017-10-01
Pseudemydura umbrina is one of the most endangered turtle species in the world, and the imperative for its conservation is its distinctive morphology and relict status among the Chelidae. We use Illumina sequencing to obtain the complete mitogenome for resolving its uncertain phylogenetic position. A novel nuclear paralogue confounded the assembly, and resolution of the authentic mitogenome required further Sanger sequencing. The P. umbrina mitogenome is 16,414bp comprising 37 genes organized in a conserved pattern for other vertebrates. The nuclear paralogue is 547bp, 97.8% identity to the corresponding mitochondrial sequence. Particular features of the mitogenome include an nd3 174+1A frameshift, loss of DHC loop in tRNA Ser (AGN), and a light-strand replication initiation site in Wancy region that extends into an adjacent tRNA gene. Phylogenetic analysis showed that P. umbrina is the monotypic sister lineage to the remaining Australasian Chelidae, a lineage probably dating back to the Cretaceous. Copyright © 2017 Elsevier Inc. All rights reserved.
Whole-exome sequencing revealed two novel mutations in Usher syndrome.
Koparir, Asuman; Karatas, Omer Faruk; Atayoglu, Ali Timucin; Yuksel, Bayram; Sagiroglu, Mahmut Samil; Seven, Mehmet; Ulucan, Hakan; Yuksel, Adnan; Ozen, Mustafa
2015-06-01
Usher syndrome is a clinically and genetically heterogeneous autosomal recessive inherited disorder accompanied by hearing loss and retinitis pigmentosa (RP). Since the associated genes are various and quite large, we utilized whole-exome sequencing (WES) as a diagnostic tool to identify the molecular basis of Usher syndrome. DNA from a 12-year-old male diagnosed with Usher syndrome was analyzed by WES. Mutations detected were confirmed by Sanger sequencing. The pathogenicity of these mutations was determined by in silico analysis. A maternally inherited deleterious frameshift mutation, c.14439_14454del in exon 66 and a paternally inherited non-sense c.10830G>A stop-gain SNV in exon 55 of USH2A were found as two novel compound heterozygous mutations. Both of these mutations disrupt the C terminal of USH2A protein. As a result, WES revealed two novel compound heterozygous mutations in a Turkish USH2A patient. This approach gave us an opportunity to have an appropriate diagnosis and provide genetic counseling to the family within a reasonable time. Copyright © 2015 Elsevier B.V. All rights reserved.
Indigenous species barcode database improves the identification of zooplankton
Yang, Jianghua; Zhang, Wanwan; Sun, Jingying; Xie, Yuwei; Zhang, Yimin; Burton, G. Allen; Yu, Hongxia
2017-01-01
Incompleteness and inaccuracy of DNA barcode databases is considered an important hindrance to the use of metabarcoding in biodiversity analysis of zooplankton at the species-level. Species barcoding by Sanger sequencing is inefficient for organisms with small body sizes, such as zooplankton. Here mitochondrial cytochrome c oxidase I (COI) fragment barcodes from 910 freshwater zooplankton specimens (87 morphospecies) were recovered by a high-throughput sequencing platform, Ion Torrent PGM. Intraspecific divergence of most zooplanktons was < 5%, except Branchionus leydign (Rotifer, 14.3%), Trichocerca elongate (Rotifer, 11.5%), Lecane bulla (Rotifer, 15.9%), Synchaeta oblonga (Rotifer, 5.95%) and Schmackeria forbesi (Copepod, 6.5%). Metabarcoding data of 28 environmental samples from Lake Tai were annotated by both an indigenous database and NCBI Genbank database. The indigenous database improved the taxonomic assignment of metabarcoding of zooplankton. Most zooplankton (81%) with barcode sequences in the indigenous database were identified by metabarcoding monitoring. Furthermore, the frequency and distribution of zooplankton were also consistent between metabarcoding and morphology identification. Overall, the indigenous database improved the taxonomic assignment of zooplankton. PMID:28977035
Bacrot, Séverine; Doyard, Mathilde; Huber, Céline; Alibeu, Olivier; Feldhahn, Niklas; Lehalle, Daphné; Lacombe, Didier; Marlin, Sandrine; Nitschke, Patrick; Petit, Florence; Vazquez, Marie-Paule; Munnich, Arnold; Cormier-Daire, Valérie
2015-02-01
Cerebro-costo-mandibular syndrome (CCMS) is a developmental disorder characterized by the association of Pierre Robin sequence and posterior rib defects. Exome sequencing and Sanger sequencing in five unrelated CCMS patients revealed five heterozygous variants in the small nuclear ribonucleoprotein polypeptides B and B1 (SNRPB) gene. This gene includes three transcripts, namely transcripts 1 and 2, encoding components of the core spliceosomal machinery (SmB' and SmB) and transcript 3 undergoing nonsense-mediated mRNA decay. All variants were located in the premature termination codon (PTC)-introducing alternative exon of transcript 3. Quantitative RT-PCR analysis revealed a significant increase in transcript 3 levels in leukocytes of CCMS individuals compared to controls. We conclude that CCMS is due to heterozygous mutations in SNRPB, enhancing inclusion of a SNRPB PTC-introducing alternative exon, and show that this developmental disease is caused by defects in the splicing machinery. Our finding confirms the report of SNRPB mutations in CCMS patients by Lynch et al. (2014) and further extends the clinical and molecular observations. © 2014 WILEY PERIODICALS, INC.
Current and future molecular approaches in the diagnosis of cystic fibrosis.
Bergougnoux, Anne; Taulan-Cadars, Magali; Claustres, Mireille; Raynal, Caroline
2018-05-01
Cystic Fibrosis is among the first diseases to have general population genetic screening tests and one of the most common indications of prenatal and preimplantation genetic diagnosis for single gene disorders. During the past twenty years, thanks to the evolution of diagnostic techniques, our knowledge of CFTR genetics and pathophysiological mechanisms involved in cystic fibrosis has significantly improved. Areas covered: Sanger sequencing and quantitative methods greatly contributed to the identification of more than 2,000 sequence variations reported worldwide in the CFTR gene. We are now entering a new technological age with the generalization of high throughput approaches such as Next Generation Sequencing and Droplet Digital PCR technologies in diagnostics laboratories. These powerful technologies open up new perspectives for scanning the entire CFTR locus, exploring modifier factors that possibly influence the clinical evolution of patients, and for preimplantation and prenatal diagnosis. Expert commentary: Such breakthroughs would, however, require powerful bioinformatics tools and relevant functional tests of variants for analysis and interpretation of the resulting data. Ultimately, an optimal use of all those resources may improve patient care and therapeutic decision-making.
Sequence variants in four genes underlying Bardet-Biedl syndrome in consanguineous families
Ullah, Asmat; Umair, Muhammad; Yousaf, Maryam; Khan, Sher Alam; Nazim-ud-din, Muhammad; Shah, Khadim; Ahmad, Farooq; Azeem, Zahid; Ali, Ghazanfar; Alhaddad, Bader; Rafique, Afzal; Jan, Abid; Haack, Tobias B.; Strom, Tim M.; Meitinger, Thomas; Ghous, Tahseen
2017-01-01
Purpose To investigate the molecular basis of Bardet-Biedl syndrome (BBS) in five consanguineous families of Pakistani origin. Methods Linkage in two families (A and B) was established to BBS7 on chromosome 4q27, in family C to BBS8 on chromosome 14q32.1, and in family D to BBS10 on chromosome 12q21.2. Family E was investigated directly with exome sequence analysis. Results Sanger sequencing revealed two novel mutations and three previously reported mutations in the BBS genes. These mutations include two deletions (c.580_582delGCA, c.1592_1597delTTCCAG) in the BBS7 gene, a missense mutation (p.Gln449His) in the BBS8 gene, a frameshift mutation (c.271_272insT) in the BBS10 gene, and a nonsense mutation (p.Ser40*) in the MKKS (BBS6) gene. Conclusions Two novel mutations and three previously reported variants, identified in the present study, further extend the body of evidence implicating BBS6, BBS7, BBS8, and BBS10 in causing BBS. PMID:28761321
Jang, Mi-Ae; Lee, Taeheon; Lee, Junnam; Cho, Eun-Hae; Ki, Chang-Seok
2015-05-01
Waardenburg syndrome (WS) is a clinically and genetically heterogeneous hereditary auditory pigmentary disorder characterized by congenital sensorineural hearing loss and iris discoloration. Many genes have been linked to WS, including PAX3, MITF, SNAI2, EDNRB, EDN3, and SOX10, and many additional genes have been associated with disorders with phenotypic overlap with WS. To screen all possible genes associated with WS and congenital deafness simultaneously, we performed diagnostic exome sequencing (DES) in a male patient with clinical features consistent with WS. Using DES, we identified a novel missense variant (c.220C>G; p.Arg74Gly) in exon 2 of the PAX3 gene in the patient. Further analysis by Sanger sequencing of the patient and his parents revealed a de novo occurrence of the variant. Our findings show that DES can be a useful tool for the identification of pathogenic gene variants in WS patients and for differentiation between WS and similar disorders. To the best of our knowledge, this is the first report of genetically confirmed WS in Korea.
Zhu, Qihui; Smith, Shavannor M; Ayele, Mulu; Yang, Lixing; Jogi, Ansuya; Chaluvadi, Srinivasa R; Bennetzen, Jeffrey L
2012-11-01
Tef (Eragrostis tef) is a major cereal crop in Ethiopia. Lodging is the primary constraint to increasing productivity in this allotetraploid species, accounting for losses of ∼15-45% in yield each year. As a first step toward identifying semi-dwarf varieties that might have improved lodging resistance, an ∼6× fosmid library was constructed and used to identify both homeologues of the dw3 semi-dwarfing gene of Sorghum bicolor. An EMS mutagenized population, consisting of ∼21,210 tef plants, was planted and leaf materials were collected into 23 superpools. Two dwarfing candidate genes, homeologues of dw3 of sorghum and rht1 of wheat, were sequenced directly from each superpool with 454 technology, and 120 candidate mutations were identified. Out of 10 candidates tested, six independent mutations were validated by Sanger sequencing, including two predicted detrimental mutations in both dw3 homeologues with a potential to improve lodging resistance in tef through further breeding. This study demonstrates that high-throughput sequencing can identify potentially valuable mutations in under-studied plant species like tef and has provided mutant lines that can now be combined and tested in breeding programs for improved lodging resistance.
Margam, Venu M.; Coates, Brad S.; Bayles, Darrell O.; Hellmich, Richard L.; Agunbiade, Tolulope; Seufferheld, Manfredo J.; Sun, Weilin; Kroemer, Jeremy A.; Ba, Malick N.; Binso-Dabire, Clementine L.; Baoua, Ibrahim; Ishiyaku, Mohammad F.; Covas, Fernando G.; Srinivasan, Ramasamy; Armstrong, Joel; Murdock, Larry L.; Pittendrigh, Barry R.
2011-01-01
The legume pod borer, Maruca vitrata (Lepidoptera: Crambidae), is an insect pest species of crops grown by subsistence farmers in tropical regions of Africa. We present the de novo assembly of 3729 contigs from 454- and Sanger-derived sequencing reads for midgut, salivary, and whole adult tissues of this non-model species. Functional annotation predicted that 1320 M. vitrata protein coding genes are present, of which 631 have orthologs within the Bombyx mori gene model. A homology-based analysis assigned M. vitrata genes into a group of paralogs, but these were subsequently partitioned into putative orthologs following phylogenetic analyses. Following sequence quality filtering, a total of 1542 putative single nucleotide polymorphisms (SNPs) were predicted within M. vitrata contig assemblies. Seventy one of 1078 designed molecular genetic markers were used to screen M. vitrata samples from five collection sites in West Africa. Population substructure may be present with significant implications in the insect resistance management recommendations pertaining to the release of biological control agents or transgenic cowpea that express Bacillus thuringiensis crystal toxins. Mutation data derived from transcriptome sequencing is an expeditious and economical source for genetic markers that allow evaluation of ecological differentiation. PMID:21754987
Tatsi, Christina; Gkourogianni, Alexandra; Mohnike, Klaus; DeArment, Diana; Witchel, Selma; Andrade, Anenisia C; Markello, Thomas C; Baron, Jeffrey; Nilsson, Ola; Jee, Youn Hee
2017-08-01
Aggrecan, a proteoglycan, is an important component of cartilage extracellular matrix, including that of the growth plate. Heterozygous mutations in ACAN , the gene encoding aggrecan, cause autosomal dominant short stature, accelerated skeletal maturation, and joint disease. The inheritance pattern and the presence of bone age equal to or greater than chronological age have been consistent features, serving as diagnostic clues. From family 1, a 6-year-old boy presented with short stature [height standard deviation score (SDS), -1.75] and bone age advanced by 3 years. There was no family history of short stature (height SDS: father, -0.76; mother, 0.7). Exome sequencing followed by Sanger sequencing identified a de novo novel heterozygous frameshift mutation in ACAN (c.6404delC: p.A2135Dfs). From family 2, a 12-year-old boy was evaluated for short stature (height SDS, -3.9). His bone age at the time of genetic evaluation was approximately 1 year less than his chronological age. Family history was consistent with an autosomal dominant inheritance of short stature, with several affected members also showing early-onset osteoarthritis. Exome sequencing, confirmed by Sanger sequencing, identified a novel nonsense mutation in ACAN (c.4852C>T: p.Q1618X), which cosegregated with the phenotype. In conclusion, patients with ACAN mutations may present with nonfamilial short stature and with bone age less than chronological age. These findings expand the known phenotypic spectrum of heterozygous ACAN mutations and indicate that this diagnosis should be considered in children without a family history of short stature and in children without accelerated skeletal maturation.
Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling.
Zhang, Guoqiang; Wang, Jianfeng; Yang, Jin; Li, Wenjie; Deng, Yutian; Li, Jing; Huang, Jun; Hu, Songnian; Zhang, Bing
2015-08-05
To promote the clinical application of next-generation sequencing, it is important to obtain accurate and consistent variants of target genomic regions at low cost. Ion Proton, the latest updated semiconductor-based sequencing instrument from Life Technologies, is designed to provide investigators with an inexpensive platform for human whole exome sequencing that achieves a rapid turnaround time. However, few studies have comprehensively compared and evaluated the accuracy of variant calling between Ion Proton and Illumina sequencing platforms such as HiSeq 2000, which is the most popular sequencing platform for the human genome. The Ion Proton sequencer combined with the Ion TargetSeq Exome Enrichment Kit together make up TargetSeq-Proton, whereas SureSelect-Hiseq is based on the Agilent SureSelect Human All Exon v4 Kit and the HiSeq 2000 sequencer. Here, we sequenced exonic DNA from four human blood samples using both TargetSeq-Proton and SureSelect-HiSeq. We then called variants in the exonic regions that overlapped between the two exome capture kits (33.6 Mb). The rates of shared variant loci called by two sequencing platforms were from 68.0 to 75.3% in four samples, whereas the concordance of co-detected variant loci reached 99%. Sanger sequencing validation revealed that the validated rate of concordant single nucleotide polymorphisms (SNPs) (91.5%) was higher than the SNPs specific to TargetSeq-Proton (60.0%) or specific to SureSelect-HiSeq (88.3%). With regard to 1-bp small insertions and deletions (InDels), the Sanger sequencing validated rates of concordant variants (100.0%) and SureSelect-HiSeq-specific (89.6%) were higher than those of TargetSeq-Proton-specific (15.8%). In the sequencing of exonic regions, a combination of using of two sequencing strategies (SureSelect-HiSeq and TargetSeq-Proton) increased the variant calling specificity for concordant variant loci and the sensitivity for variant loci called by any one platform. However, for the sequencing of platform-specific variants, the accuracy of variant calling by HiSeq 2000 was higher than that of Ion Proton, specifically for the InDel detection. Moreover, the variant calling software also influences the detection of SNPs and, specifically, InDels in Ion Proton exome sequencing.
ABACAS: algorithm-based automatic contiguation of assembled sequences
Assefa, Samuel; Keane, Thomas M.; Otto, Thomas D.; Newbold, Chris; Berriman, Matthew
2009-01-01
Summary: Due to the availability of new sequencing technologies, we are now increasingly interested in sequencing closely related strains of existing finished genomes. Recently a number of de novo and mapping-based assemblers have been developed to produce high quality draft genomes from new sequencing technology reads. New tools are necessary to take contigs from a draft assembly through to a fully contiguated genome sequence. ABACAS is intended as a tool to rapidly contiguate (align, order, orientate), visualize and design primers to close gaps on shotgun assembled contigs based on a reference sequence. The input to ABACAS is a set of contigs which will be aligned to the reference genome, ordered and orientated, visualized in the ACT comparative browser, and optimal primer sequences are automatically generated. Availability and Implementation: ABACAS is implemented in Perl and is freely available for download from http://abacas.sourceforge.net Contact: sa4@sanger.ac.uk PMID:19497936
Genome sequencing in microfabricated high-density picolitre reactors.
Margulies, Marcel; Egholm, Michael; Altman, William E; Attiya, Said; Bader, Joel S; Bemben, Lisa A; Berka, Jan; Braverman, Michael S; Chen, Yi-Ju; Chen, Zhoutao; Dewell, Scott B; Du, Lei; Fierro, Joseph M; Gomes, Xavier V; Godwin, Brian C; He, Wen; Helgesen, Scott; Ho, Chun Heen; Ho, Chun He; Irzyk, Gerard P; Jando, Szilveszter C; Alenquer, Maria L I; Jarvie, Thomas P; Jirage, Kshama B; Kim, Jong-Bum; Knight, James R; Lanza, Janna R; Leamon, John H; Lefkowitz, Steven M; Lei, Ming; Li, Jing; Lohman, Kenton L; Lu, Hong; Makhijani, Vinod B; McDade, Keith E; McKenna, Michael P; Myers, Eugene W; Nickerson, Elizabeth; Nobile, John R; Plant, Ramona; Puc, Bernard P; Ronan, Michael T; Roth, George T; Sarkis, Gary J; Simons, Jan Fredrik; Simpson, John W; Srinivasan, Maithreyan; Tartaro, Karrie R; Tomasz, Alexander; Vogt, Kari A; Volkmer, Greg A; Wang, Shally H; Wang, Yong; Weiner, Michael P; Yu, Pengguang; Begley, Richard F; Rothberg, Jonathan M
2005-09-15
The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.
McCourt, Clare M; McArt, Darragh G; Mills, Ken; Catherwood, Mark A; Maxwell, Perry; Waugh, David J; Hamilton, Peter; O'Sullivan, Joe M; Salto-Tellez, Manuel
2013-01-01
Next Generation Sequencing (NGS) has the potential of becoming an important tool in clinical diagnosis and therapeutic decision-making in oncology owing to its enhanced sensitivity in DNA mutation detection, fast-turnaround of samples in comparison to current gold standard methods and the potential to sequence a large number of cancer-driving genes at the one time. We aim to test the diagnostic accuracy of current NGS technology in the analysis of mutations that represent current standard-of-care, and its reliability to generate concomitant information on other key genes in human oncogenesis. Thirteen clinical samples (8 lung adenocarcinomas, 3 colon carcinomas and 2 malignant melanomas) already genotyped for EGFR, KRAS and BRAF mutations by current standard-of-care methods (Sanger Sequencing and q-PCR), were analysed for detection of mutations in the same three genes using two NGS platforms and an additional 43 genes with one of these platforms. The results were analysed using closed platform-specific proprietary bioinformatics software as well as open third party applications. Our results indicate that the existing format of the NGS technology performed well in detecting the clinically relevant mutations stated above but may not be reliable for a broader unsupervised analysis of the wider genome in its current design. Our study represents a diagnostically lead validation of the major strengths and weaknesses of this technology before consideration for diagnostic use.
A novel ATTR L32V mutation causes familial amyloid polyneuropathy in a Bolivian family.
Martínez-Ulloa, Pedro L; Vallejo, Manuela; Corral, Iñigo; García-Barragán, Nuria; Alcazar, Alberto; Martínez-Alonso, Emma; Martínez-Poles, Javier; Pian, Hector; Jiménez-Escrig, Adriano
2017-09-01
We report a new transthyretin (ATTR) gene c.272C>G mutation and variant protein, p.Leu32Val, in a kindred of Bolivian origin with a rapid progressive peripheral neuropathy and cardiomyopathy. Three individuals from a kindred with peripheral nerve and cardiac amyloidosis were examined. Analysis of the TTR gene was performed by Sanger direct sequencing. Neuropathologic examination was obtained on the index patient with mass spectrometry study of the ATTR deposition. Direct DNA sequence analysis of exons 2, 3, and 4 of the TTR gene demonstrated a c.272 C>G mutation in exon 2 (p.L32V). Sural nerve biopsy revealed massive amyloid deposition in the perineurium, endoneurium and vasa nervorum. Mass spectrometric analyses of ATTR immunoprecipitated from nerve biopsy showed the presence of both wild-type and variant proteins. The observed mass results for the wild-type and variant proteins were consistent with the predicted values calculated from the genetic analysis data. The ATTR L32V is associated with a severe course. This has implications for treatment of affected individuals and counseling of family members. © 2017 Peripheral Nerve Society.
A novel mutation in PAX3 associated with Waardenburg syndrome type I in a Chinese family.
Xiao, Yun; Luo, Jianfen; Zhang, Fengguo; Li, Jianfeng; Han, Yuechen; Zhang, Daogong; Wang, Mingming; Ma, Yalin; Xu, Lei; Bai, Xiaohui; Wang, Haibo
2016-01-01
The novel compound heterozygous mutation in PAX3 was the key genetic reason for WS1 in this family, which was useful to the molecular diagnosis of WS1. Screening the pathogenic mutations in a four generation Chinese family with Waardenburg syndrome type I (WS1). WS1 was diagnosed in a 4-year-old boy according to the Waardenburg syndrome Consortium criteria. The detailed family history revealed four affected members in the family. Routine clinical, audiological examination, and ophthalmologic evaluation were performed on four affected and 10 healthy members in this family. The genetic analysis was conducted, including the targeted next-generation sequencing of 127 known deafness genes combined with Sanger sequencing, TA clone and bioinformatic analysis. A novel compound heterozygous mutation c.[169_170insC;172_174delAAG] (p.His57ProfsX55) was identified in PAX3, which was co-segregated with WS1 in the Chinese family. This mutation was absent in the unaffected family members and 200 ethnicity-matched controls. The phylogenetic analysis and three-dimensional (3D) modeling of Pax3 protein further confirmed that the novel compound heterozygous mutation was pathogenic.
A new sensitive PCR assay for one-step detection of 12 IDH1/2 mutations in glioma.
Catteau, Aurélie; Girardi, Hélène; Monville, Florence; Poggionovo, Cécile; Carpentier, Sabrina; Frayssinet, Véronique; Voss, Jesse; Jenkins, Robert; Boisselier, Blandine; Mokhtari, Karima; Sanson, Marc; Peyro-Saint-Paul, Hélène; Giannini, Caterina
2014-06-02
Mutations in isocitrate dehydrogenase genes IDH1 or IDH2 are frequent in glioma, and IDH mutation status is a strong diagnostic and prognostic marker. Current IDH mutation screening is performed with an immunohistochemistry (IHC) assay specific for IDH1 R132H, the most common mutation. Sequencing is recommended as a second-step test for IHC-negative or -equivocal cases. We developed and validated a new real-time quantitative polymerase chain reaction (PCR) assay for single-step detection of IDH1 R132H and 11 rare IDH1/2 mutations in formalin-fixed paraffin-embedded (FFPE) glioma samples. Performance of the IDH1/2 PCR assay was compared to IHC and Sanger sequencing. The IDH1/2 PCR assay combines PCR clamping for detection of 7 IDH1 and 5 IDH2 mutations, and Amplification Refractory Mutation System technology for specific identification of the 3 most common mutations (IDH1 R132H, IDH1 R132C, IDH2 R172K). Analytical sensitivity of the PCR assay for mutation detection was <5% for 11/12 mutations (mean: 3.3%), and sensitivity for mutation identification was very high (0.8% for IDH1 R132H; 1.2% for IDH1 R132C; 0.6% for IDH2 R172K). Assay performance was further validated on 171 clinical glioma FFPE samples; of these, 147 samples met the selection criteria and 146 DNA samples were successfully extracted. IDH1/2 status was successfully obtained in 91% of cases. All but one positive IDH1 R132H-IHC cases were concordantly detected by PCR and 3 were not detected by sequencing. Among the IHC-negative cases (n = 72), PCR detected 12 additional rare mutations (10 IDH1, 2 IDH2). All mutations detected by sequencing (n = 67) were concordantly detected by PCR and 5/66 sequencing-negative cases were PCR-positive (overall concordance: 96%). Analysis of synthetic samples representative of the 11 rare IDH1/2 mutations detected by the assay produced 100% correct results. The new IDH1/2 PCR assay has a high technical success rate and is more sensitive than Sanger sequencing. Positive concordance was 98% with IHC for IDH1 R132H detection and 100% with sequencing. The PCR assay can reliably be performed on FFPE samples and has a faster turnaround time than current IDH mutation detection algorithms. The assay should facilitate implementation of a comprehensive IDH1/2 testing protocol in routine clinical practice.
Exome Sequence Analysis of 14 Families With High Myopia.
Kloss, Bethany A; Tompson, Stuart W; Whisenhunt, Kristina N; Quow, Krystina L; Huang, Samuel J; Pavelec, Derek M; Rosenberg, Thomas; Young, Terri L
2017-04-01
To identify causal gene mutations in 14 families with autosomal dominant (AD) high myopia using exome sequencing. Select individuals from 14 large Caucasian families with high myopia were exome sequenced. Gene variants were filtered to identify potential pathogenic changes. Sanger sequencing was used to confirm variants in original DNA, and to test for disease cosegregation in additional family members. Candidate genes and chromosomal loci previously associated with myopic refractive error and its endophenotypes were comprehensively screened. In 14 high myopia families, we identified 73 rare and 31 novel gene variants as candidates for pathogenicity. In seven of these families, two of the novel and eight of the rare variants were within known myopia loci. A total of 104 heterozygous nonsynonymous rare variants in 104 genes were identified in 10 out of 14 probands. Each variant cosegregated with affection status. No rare variants were identified in genes known to cause myopia or in genes closest to published genome-wide association study association signals for refractive error or its endophenotypes. Whole exome sequencing was performed to determine gene variants implicated in the pathogenesis of AD high myopia. This study provides new genes for consideration in the pathogenesis of high myopia, and may aid in the development of genetic profiling of those at greatest risk for attendant ocular morbidities of this disorder.
Mu, John C.; Tootoonchi Afshar, Pegah; Mohiyuddin, Marghoob; Chen, Xi; Li, Jian; Bani Asadi, Narges; Gerstein, Mark B.; Wong, Wing H.; Lam, Hugo Y. K.
2015-01-01
A high-confidence, comprehensive human variant set is critical in assessing accuracy of sequencing algorithms, which are crucial in precision medicine based on high-throughput sequencing. Although recent works have attempted to provide such a resource, they still do not encompass all major types of variants including structural variants (SVs). Thus, we leveraged the massive high-quality Sanger sequences from the HuRef genome to construct by far the most comprehensive gold set of a single individual, which was cross validated with deep Illumina sequencing, population datasets, and well-established algorithms. It was a necessary effort to completely reanalyze the HuRef genome as its previously published variants were mostly reported five years ago, suffering from compatibility, organization, and accuracy issues that prevent their direct use in benchmarking. Our extensive analysis and validation resulted in a gold set with high specificity and sensitivity. In contrast to the current gold sets of the NA12878 or HS1011 genomes, our gold set is the first that includes small variants, deletion SVs and insertion SVs up to a hundred thousand base-pairs. We demonstrate the utility of our HuRef gold set to benchmark several published SV detection tools. PMID:26412485
Bullich, Gemma; Trujillano, Daniel; Santín, Sheila; Ossowski, Stephan; Mendizábal, Santiago; Fraga, Gloria; Madrid, Álvaro; Ariceta, Gema; Ballarín, José; Torra, Roser; Estivill, Xavier; Ars, Elisabet
2015-09-01
Genetic diagnosis of steroid-resistant nephrotic syndrome (SRNS) using Sanger sequencing is complicated by the high genetic heterogeneity and phenotypic variability of this disease. We aimed to improve the genetic diagnosis of SRNS by simultaneously sequencing 26 glomerular genes using massive parallel sequencing and to study whether mutations in multiple genes increase disease severity. High-throughput mutation analysis was performed in 50 SRNS and/or focal segmental glomerulosclerosis (FSGS) patients, a validation cohort of 25 patients with known pathogenic mutations, and a discovery cohort of 25 uncharacterized patients with probable genetic etiology. In the validation cohort, we identified the 42 previously known pathogenic mutations across NPHS1, NPHS2, WT1, TRPC6, and INF2 genes. In the discovery cohort, disease-causing mutations in SRNS/FSGS genes were found in nine patients. We detected three patients with mutations in an SRNS/FSGS gene and COL4A3. Two of them were familial cases and presented a more severe phenotype than family members with mutation in only one gene. In conclusion, our results show that massive parallel sequencing is feasible and robust for genetic diagnosis of SRNS/FSGS. Our results indicate that patients carrying mutations in an SRNS/FSGS gene and also in COL4A3 gene have increased disease severity.
Rathi, Vivek; Wright, Gavin; Constantin, Diana; Chang, Siok; Pham, Huong; Jones, Kerryn; Palios, Atha; Mclachlan, Sue-Anne; Conron, Matthew; McKelvie, Penny; Williams, Richard
2017-01-01
The advent of massively parallel sequencing has caused a paradigm shift in the ways cancer is treated, as personalised therapy becomes a reality. More and more laboratories are looking to introduce next generation sequencing (NGS) as a tool for mutational analysis, as this technology has many advantages compared to conventional platforms like Sanger sequencing. In Australia all massively parallel sequencing platforms are still considered in-house in vitro diagnostic tools by the National Association of Testing Authorities (NATA) and a comprehensive analytical validation of all assays, and not just mere verification, is a strict requirement before accreditation can be granted for clinical testing on these platforms. Analytical validation of assays on NGS platforms can prove to be extremely challenging for pathology laboratories. Although there are many affordable and easily accessible NGS instruments available, there are no standardised guidelines as yet for clinical validation of NGS assays. We present an accreditation development procedure that was both comprehensive and applicable in a setting of hospital laboratory for NGS services. This approach may also be applied to other NGS applications in service laboratories. Copyright © 2016 Royal College of Pathologists of Australasia. Published by Elsevier B.V. All rights reserved.
Chiu, Elliott S; Hoover, Edward A; VandeWoude, Sue
2018-01-10
Feline leukemia virus (FeLV) was the first feline retrovirus discovered, and is associated with multiple fatal disease syndromes in cats, including lymphoma. The original research conducted on FeLV employed classical virological techniques. As methods have evolved to allow FeLV genetic characterization, investigators have continued to unravel the molecular pathology associated with this fascinating agent. In this review, we discuss how FeLV classification, transmission, and disease-inducing potential have been defined sequentially by viral interference assays, Sanger sequencing, PCR, and next-generation sequencing. In particular, we highlight the influences of endogenous FeLV and host genetics that represent FeLV research opportunities on the near horizon.
d’Avila-Levy, Claudia Masini; Boucinha, Carolina; Kostygov, Alexei; Santos, Helena Lúcia Carneiro; Morelli, Karina Alessandra; Grybchuk-Ieremenko, Anastasiia; Duval, Linda; Votýpka, Jan; Yurchenko, Vyacheslav; Grellier, Philippe; Lukeš, Julius
2015-01-01
The class Kinetoplastea encompasses both free-living and parasitic species from a wide range of hosts. Several representatives of this group are responsible for severe human diseases and for economic losses in agriculture and livestock. While this group encompasses over 30 genera, most of the available information has been derived from the vertebrate pathogenic genera Leishmaniaand Trypanosoma. Recent studies of the previously neglected groups of Kinetoplastea indicated that the actual diversity is much higher than previously thought. This article discusses the known segment of kinetoplastid diversity and how gene-directed Sanger sequencing and next-generation sequencing methods can help to deepen our knowledge of these interesting protists. PMID:26602872
Krunic, Aleksandar L; Stone, Kristina L; Simpson, Michael A; McGrath, John A
2013-01-01
Acral peeling skin syndrome (APSS) is a clinically and genetically heterogeneous disorder. We used whole-exome sequencing to identify the molecular basis of APSS in a consanguineous Jordanian-American pedigree. We identified a homozygous nonsense mutation (p.Lys22X) in the CSTA gene, encoding cystatin A, that was confirmed using Sanger sequencing. Cystatin A is a protease inhibitor found in the cornified cell envelope, and loss-of-function mutations have previously been reported in two cases of exfoliative ichthyosis. Our study expands the molecular pathology of APSS and demonstrates the value of next-generation sequencing in the genetic characterization of inherited skin diseases. © 2013 Wiley Periodicals, Inc.
de Vries, Tamar I; R Monroe, Glen; van Belzen, Martine J; van der Lans, Christian A; Savelberg, Sanne MC; Newman, William G; van Haaften, Gijs; Nievelstein, Rutger A; van Haelst, Mieke M
2016-01-01
Rubinstein–Taybi syndrome (RTS, OMIM 180849) and Filippi syndrome (FLPIS, OMIM 272440) are both rare syndromes, with multiple congenital anomalies and intellectual deficit (MCA/ID). We present a patient with intellectual deficit, short stature, bilateral syndactyly of hands and feet, broad thumbs, ocular abnormalities, and dysmorphic facial features. These clinical features suggest both RTS and FLPIS. Initial DNA analysis of DNA isolated from blood did not identify variants to confirm either of these syndrome diagnoses. Whole-exome sequencing identified a homozygous variant in C9orf173, which was novel at the time of analysis. Further Sanger sequencing analysis of FLPIS cases tested negative for CKAP2L variants did not, however, reveal any further variants. Subsequent analysis using DNA isolated from buccal mucosa revealed a mosaic variant in CREBBP. This report highlights the importance of excluding mosaic variants in patients with a strong but atypical clinical presentation of a MCA/ID syndrome if no disease-causing variants can be detected in DNA isolated from blood samples. As the striking syndactyly observed in the present case is typical for FLPIS, we suggest CREBBP analysis in saliva samples for FLPIS syndrome cases in which no causal CKAP2L variant is detected. PMID:26956253
Zahedi, Alireza; Monis, Paul; Gofton, Alexander W; Oskam, Charlotte L; Ball, Andrew; Bath, Andrew; Bartkow, Michael; Robertson, Ian; Ryan, Una
2018-05-01
As part of long-term monitoring of Cryptosporidium in water catchments serving Western Australia, New South Wales (Sydney) and Queensland, Australia, we characterised Cryptosporidium in a total of 5774 faecal samples from 17 known host species and 7 unknown bird samples, in 11 water catchment areas over a period of 30 months (July 2013 to December 2015). All samples were initially screened for Cryptosporidium spp. at the 18S rRNA locus using a quantitative PCR (qPCR). Positives samples were then typed by sequence analysis of an 825 bp fragment of the 18S gene and subtyped at the glycoprotein 60 (gp60) locus (832 bp). The overall prevalence of Cryptosporidium across the various hosts sampled was 18.3% (1054/5774; 95% CI, 17.3-19.3). Of these, 873 samples produced clean Sanger sequencing chromatograms, and the remaining 181 samples, which initially produced chromatograms suggesting the presence of multiple different sequences, were re-analysed by Next- Generation Sequencing (NGS) to resolve the presence of Cryptosporidium and the species composition of potential mixed infections. The overall prevalence of confirmed mixed infection was 1.7% (98/5774), and in the remaining 83 samples, NGS only detected one species of Cryptosporidium. Of the 17 Cryptosporidium species and four genotypes detected (Sanger sequencing combined with NGS), 13 are capable of infecting humans; C. parvum, C. hominis, C. ubiquitum, C. cuniculus, C. meleagridis, C. canis, C. felis, C. muris, C. suis, C. scrofarum, C. bovis, C. erinacei and C. fayeri. Oocyst numbers per gram of faeces (g -1 ) were also determined using qPCR, with medians varying from 6021-61,064 across the three states. The significant findings were the detection of C. hominis in cattle and kangaroo faeces and the high prevalence of C. parvum in cattle. In addition, two novel C. fayeri subtypes (IVaA11G3T1 and IVgA10G1T1R1) and one novel C. meleagridis subtype (IIIeA18G2R1) were identified. This is also the first report of C. erinacei in Australia. Future work to monitor the prevalence of Cryptosporidium species and subtypes in animals in these catchments is warranted. Copyright © 2018 Elsevier Ltd. All rights reserved.
TCW: Transcriptome Computational Workbench
Soderlund, Carol; Nelson, William; Willer, Mark; Gang, David R.
2013-01-01
Background The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. Methodology The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. Conclusion It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the transcriptome. TCW is freely available from www.agcol.arizona.edu/software/tcw. PMID:23874959
TCW: transcriptome computational workbench.
Soderlund, Carol; Nelson, William; Willer, Mark; Gang, David R
2013-01-01
The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the transcriptome. TCW is freely available from www.agcol.arizona.edu/software/tcw.
[Detection of UGT1A1*28 Polymorphism Using Fragment Analysis].
Huang, Ying; Su, Jian; Huang, Xiaosui; Lu, Danxia; Xie, Zhi; Yang, Suqing; Guo, Weibang; Lv, Zhiyi; Wu, Hongsui; Zhang, Xuchao
2017-12-20
Uridine-diphosphoglucuronosyl transferase 1A1 (UGT1A1), UGT1A1*28 polymorphism can reduce UGT1A1 enzymatic activity, which may lead to severe toxicities in patients who receive irinotecan. This study tries to build a fragment analysis method to detect UGT1A1*28 polymorphism. A total of 286 blood specimens from the lung cancer patients who were hospitalized in Guangdong General Hospital between April 2014 to May 2015 were detected UGT1A1*28 polymorphism by fragment analysis method. Comparing with Sanger sequencing, precision and accuracy of the fragment analysis method were 100%. Of the 286 patients, 236 (82.5% harbored TA6/6 genotype, 48 (16.8%) TA 6/7 genotype and 2 (0.7%) TA7/7 genotype. Our data suggest hat the fragment analysis method is robust for detecting UGT1A1*28 polymorphism in clinical practice. It's simple, time-saving, and easy-to-carry.
Hajibabaei, Mehrdad; Shokralla, Shadi; Zhou, Xin; Singer, Gregory A. C.; Baird, Donald J.
2011-01-01
Timely and accurate biodiversity analysis poses an ongoing challenge for the success of biomonitoring programs. Morphology-based identification of bioindicator taxa is time consuming, and rarely supports species-level resolution especially for immature life stages. Much work has been done in the past decade to develop alternative approaches for biodiversity analysis using DNA sequence-based approaches such as molecular phylogenetics and DNA barcoding. On-going assembly of DNA barcode reference libraries will provide the basis for a DNA-based identification system. The use of recently introduced next-generation sequencing (NGS) approaches in biodiversity science has the potential to further extend the application of DNA information for routine biomonitoring applications to an unprecedented scale. Here we demonstrate the feasibility of using 454 massively parallel pyrosequencing for species-level analysis of freshwater benthic macroinvertebrate taxa commonly used for biomonitoring. We designed our experiments in order to directly compare morphology-based, Sanger sequencing DNA barcoding, and next-generation environmental barcoding approaches. Our results show the ability of 454 pyrosequencing of mini-barcodes to accurately identify all species with more than 1% abundance in the pooled mixture. Although the approach failed to identify 6 rare species in the mixture, the presence of sequences from 9 species that were not represented by individuals in the mixture provides evidence that DNA based analysis may yet provide a valuable approach in finding rare species in bulk environmental samples. We further demonstrate the application of the environmental barcoding approach by comparing benthic macroinvertebrates from an urban region to those obtained from a conservation area. Although considerable effort will be required to robustly optimize NGS tools to identify species from bulk environmental samples, our results indicate the potential of an environmental barcoding approach for biomonitoring programs. PMID:21533287
High-throughput sequence alignment using Graphics Processing Units
Schatz, Michael C; Trapnell, Cole; Delcher, Arthur L; Varshney, Amitabh
2007-01-01
Background The recent availability of new, less expensive high-throughput DNA sequencing technologies has yielded a dramatic increase in the volume of sequence data that must be analyzed. These data are being generated for several purposes, including genotyping, genome resequencing, metagenomics, and de novo genome assembly projects. Sequence alignment programs such as MUMmer have proven essential for analysis of these data, but researchers will need ever faster, high-throughput alignment tools running on inexpensive hardware to keep up with new sequence technologies. Results This paper describes MUMmerGPU, an open-source high-throughput parallel pairwise local sequence alignment program that runs on commodity Graphics Processing Units (GPUs) in common workstations. MUMmerGPU uses the new Compute Unified Device Architecture (CUDA) from nVidia to align multiple query sequences against a single reference sequence stored as a suffix tree. By processing the queries in parallel on the highly parallel graphics card, MUMmerGPU achieves more than a 10-fold speedup over a serial CPU version of the sequence alignment kernel, and outperforms the exact alignment component of MUMmer on a high end CPU by 3.5-fold in total application time when aligning reads from recent sequencing projects using Solexa/Illumina, 454, and Sanger sequencing technologies. Conclusion MUMmerGPU is a low cost, ultra-fast sequence alignment program designed to handle the increasing volume of data produced by new, high-throughput sequencing technologies. MUMmerGPU demonstrates that even memory-intensive applications can run significantly faster on the relatively low-cost GPU than on the CPU. PMID:18070356
Optimization and quality control of genome-wide Hi-C library preparation.
Zhang, Xiang-Yuan; He, Chao; Ye, Bing-Yu; Xie, De-Jian; Shi, Ming-Lei; Zhang, Yan; Shen, Wen-Long; Li, Ping; Zhao, Zhi-Hu
2017-09-20
Highest-throughput chromosome conformation capture (Hi-C) is one of the key assays for genome- wide chromatin interaction studies. It is a time-consuming process that involves many steps and many different kinds of reagents, consumables, and equipments. At present, the reproducibility is unsatisfactory. By optimizing the key steps of the Hi-C experiment, such as crosslinking, pretreatment of digestion, inactivation of restriction enzyme, and in situ ligation etc., we established a robust Hi-C procedure and prepared two biological replicates of Hi-C libraries from the GM12878 cells. After preliminary quality control by Sanger sequencing, the two replicates were high-throughput sequenced. The bioinformatics analysis of the raw sequencing data revealed the mapping-ability and pair-mate rate of the raw data were around 90% and 72%, respectively. Additionally, after removal of self-circular ligations and dangling-end products, more than 96% of the valid pairs were reached. Genome-wide interactome profiling shows clear topological associated domains (TADs), which is consistent with previous reports. Further correlation analysis showed that the two biological replicates strongly correlate with each other in terms of both bin coverage and all bin pairs. All these results indicated that the optimized Hi-C procedure is robust and stable, which will be very helpful for the wide applications of the Hi-C assay.
[The principle and application of the single-molecule real-time sequencing technology].
Yanhu, Liu; Lu, Wang; Li, Yu
2015-03-01
Last decade witnessed the explosive development of the third-generation sequencing strategy, including single-molecule real-time sequencing (SMRT), true single-molecule sequencing (tSMSTM) and the single-molecule nanopore DNA sequencing. In this review, we summarize the principle, performance and application of the SMRT sequencing technology. Compared with the traditional Sanger method and the next-generation sequencing (NGS) technologies, the SMRT approach has several advantages, including long read length, high speed, PCR-free and the capability of direct detection of epigenetic modifications. However, the disadvantage of its low accuracy, most of which resulted from insertions and deletions, is also notable. So, the raw sequence data need to be corrected before assembly. Up to now, the SMRT is a good fit for applications in the de novo genomic sequencing and the high-quality assemblies of small genomes. In the future, it is expected to play an important role in epigenetics, transcriptomic sequencing, and assemblies of large genomes.
The mitogenome of Onchocerca volvulus from the Brazilian Amazonia focus.
Crainey, James L; Silva, Túllio R R da; Encinas, Fernando; Marín, Michel A; Vicente, Ana Carolina P; Luz, Sérgio L B
2016-01-01
We report here the first complete mitochondria genome of Onchocerca volvulus from a focus outside of Africa. An O. volvulus mitogenome from the Brazilian Amazonia focus was obtained using a combination of high-throughput and Sanger sequencing technologies. Comparisons made between this mitochondrial genome and publicly available mitochondrial sequences identified 46 variant nucleotide positions and suggested that our Brazilian mitogenome is more closely related to Cameroon-origin mitochondria than West African-origin mitochondria. As well as providing insights into the origins of Latin American onchocerciasis, the Brazilian Amazonia focus mitogenome may also have value as an epidemiological resource.
Exome Sequencing Identifies Potentially Druggable Mutations in Nasopharyngeal Carcinoma.
Chow, Yock Ping; Tan, Lu Ping; Chai, San Jiun; Abdul Aziz, Norazlin; Choo, Siew Woh; Lim, Paul Vey Hong; Pathmanathan, Rajadurai; Mohd Kornain, Noor Kaslina; Lum, Chee Lun; Pua, Kin Choo; Yap, Yoke Yeow; Tan, Tee Yong; Teo, Soo Hwang; Khoo, Alan Soo-Beng; Patel, Vyomesh
2017-03-03
In this study, we first performed whole exome sequencing of DNA from 10 untreated and clinically annotated fresh frozen nasopharyngeal carcinoma (NPC) biopsies and matched bloods to identify somatically mutated genes that may be amenable to targeted therapeutic strategies. We identified a total of 323 mutations which were either non-synonymous (n = 238) or synonymous (n = 85). Furthermore, our analysis revealed genes in key cancer pathways (DNA repair, cell cycle regulation, apoptosis, immune response, lipid signaling) were mutated, of which those in the lipid-signaling pathway were the most enriched. We next extended our analysis on a prioritized sub-set of 37 mutated genes plus top 5 mutated cancer genes listed in COSMIC using a custom designed HaloPlex target enrichment panel with an additional 88 NPC samples. Our analysis identified 160 additional non-synonymous mutations in 37/42 genes in 66/88 samples. Of these, 99/160 mutations within potentially druggable pathways were further selected for validation. Sanger sequencing revealed that 77/99 variants were true positives, giving an accuracy of 78%. Taken together, our study indicated that ~72% (n = 71/98) of NPC samples harbored mutations in one of the four cancer pathways (EGFR-PI3K-Akt-mTOR, NOTCH, NF-κB, DNA repair) which may be potentially useful as predictive biomarkers of response to matched targeted therapies.
Chopperla, Ramakrishna; Singh, Sonam; Mohanty, Sasmita; Reddy, Nanja; Padaria, Jasdeep C; Solanke, Amolkumar U
2017-10-01
Basic leucine zipper (bZIP) transcription factors comprise one of the largest gene families in plants. They play a key role in almost every aspect of plant growth and development and also in biotic and abiotic stress tolerance. In this study, we report isolation and characterization of EcbZIP17 , a group B bZIP transcription factor from a climate smart cereal, finger millet ( Eleusine coracana L.). The genomic sequence of EcbZIP17 is 2662 bp long encompassing two exons and one intron with ORF of 1722 bp and peptide length of 573 aa. This gene is homologous to AtbZIP17 ( Arabidopsis ), ZmbZIP17 (maize) and OsbZIP60 (rice) which play a key role in endoplasmic reticulum (ER) stress pathway. In silico analysis confirmed the presence of basic leucine zipper (bZIP) and transmembrane (TM) domains in the EcbZIP17 protein. Allele mining of this gene in 16 different genotypes by Sanger sequencing revealed no variation in nucleotide sequence, including the 618 bp long intron. Expression analysis of EcbZIP17 under heat stress exhibited similar pattern of expression in all the genotypes across time intervals with highest upregulation after 4 h. The present study established the conserved nature of EcbZIP17 at nucleotide and expression level.
Exome Sequencing Identifies Potentially Druggable Mutations in Nasopharyngeal Carcinoma
Chow, Yock Ping; Tan, Lu Ping; Chai, San Jiun; Abdul Aziz, Norazlin; Choo, Siew Woh; Lim, Paul Vey Hong; Pathmanathan, Rajadurai; Mohd Kornain, Noor Kaslina; Lum, Chee Lun; Pua, Kin Choo; Yap, Yoke Yeow; Tan, Tee Yong; Teo, Soo Hwang; Khoo, Alan Soo-Beng; Patel, Vyomesh
2017-01-01
In this study, we first performed whole exome sequencing of DNA from 10 untreated and clinically annotated fresh frozen nasopharyngeal carcinoma (NPC) biopsies and matched bloods to identify somatically mutated genes that may be amenable to targeted therapeutic strategies. We identified a total of 323 mutations which were either non-synonymous (n = 238) or synonymous (n = 85). Furthermore, our analysis revealed genes in key cancer pathways (DNA repair, cell cycle regulation, apoptosis, immune response, lipid signaling) were mutated, of which those in the lipid-signaling pathway were the most enriched. We next extended our analysis on a prioritized sub-set of 37 mutated genes plus top 5 mutated cancer genes listed in COSMIC using a custom designed HaloPlex target enrichment panel with an additional 88 NPC samples. Our analysis identified 160 additional non-synonymous mutations in 37/42 genes in 66/88 samples. Of these, 99/160 mutations within potentially druggable pathways were further selected for validation. Sanger sequencing revealed that 77/99 variants were true positives, giving an accuracy of 78%. Taken together, our study indicated that ~72% (n = 71/98) of NPC samples harbored mutations in one of the four cancer pathways (EGFR-PI3K-Akt-mTOR, NOTCH, NF-κB, DNA repair) which may be potentially useful as predictive biomarkers of response to matched targeted therapies. PMID:28256603
High-throughput sequencing in veterinary infection biology and diagnostics.
Belák, S; Karlsson, O E; Leijon, M; Granberg, F
2013-12-01
Sequencing methods have improved rapidly since the first versions of the Sanger techniques, facilitating the development of very powerful tools for detecting and identifying various pathogens, such as viruses, bacteria and other microbes. The ongoing development of high-throughput sequencing (HTS; also known as next-generation sequencing) technologies has resulted in a dramatic reduction in DNA sequencing costs, making the technology more accessible to the average laboratory. In this White Paper of the World Organisation for Animal Health (OIE) Collaborating Centre for the Biotechnology-based Diagnosis of Infectious Diseases in Veterinary Medicine (Uppsala, Sweden), several approaches and examples of HTS are summarised, and their diagnostic applicability is briefly discussed. Selected future aspects of HTS are outlined, including the need for bioinformatic resources, with a focus on improving the diagnosis and control of infectious diseases in veterinary medicine.
NASA Astrophysics Data System (ADS)
Li, Ren; Zhou, Mingxing; Li, Jine; Wang, Zihua; Zhang, Weikai; Yue, Chunyan; Ma, Yan; Peng, Hailin; Wei, Zewen; Hu, Zhiyuan
2018-03-01
EGFR mutations companion diagnostics have been proved to be crucial for the efficacy of tyrosine kinase inhibitor targeted cancer therapies. To uncover multiple mutations occurred in minority of EGFR-mutated cells, which may be covered by the noises from majority of un-mutated cells, is currently becoming an urgent clinical requirement. Here we present the validation of a microfluidic-chip-based method for detecting EGFR multi-mutations at single-cell level. By trapping and immunofluorescently imaging single cells in specifically designed silicon microwells, the EGFR-expressed cells were easily identified. By in situ lysing single cells, the cell lysates of EGFR-expressed cells were retrieved without cross-contamination. Benefited from excluding the noise from cells without EGFR expression, the simple and cost-effective Sanger's sequencing, but not the expensive deep sequencing of the whole cell population, was used to discover multi-mutations. We verified the new method with precisely discovering three most important EGFR drug-related mutations from a sample in which EGFR-mutated cells only account for a small percentage of whole cell population. The microfluidic chip is capable of discovering not only the existence of specific EGFR multi-mutations, but also other valuable single-cell-level information: on which specific cells the mutations occurred, or whether different mutations coexist on the same cells. This microfluidic chip constitutes a promising method to promote simple and cost-effective Sanger's sequencing to be a routine test before performing targeted cancer therapy.[Figure not available: see fulltext.
Brassac, Jonathan; Blattner, Frank R
2015-09-01
Polyploidization is an important speciation mechanism in the barley genus Hordeum. To analyze evolutionary changes after allopolyploidization, knowledge of parental relationships is essential. One chloroplast and 12 nuclear single-copy loci were amplified by polymerase chain reaction (PCR) in all Hordeum plus six out-group species. Amplicons from each of 96 individuals were pooled, sheared, labeled with individual-specific barcodes and sequenced in a single run on a 454 platform. Reference sequences were obtained by cloning and Sanger sequencing of all loci for nine supplementary individuals. The 454 reads were assembled into contigs representing the 13 loci and, for polyploids, also homoeologues. Phylogenetic analyses were conducted for all loci separately and for a concatenated data matrix of all loci. For diploid taxa, a Bayesian concordance analysis and a coalescent-based dated species tree was inferred from all gene trees. Chloroplast matK was used to determine the maternal parent in allopolyploid taxa. The relative performance of different multilocus analyses in the presence of incomplete lineage sorting and hybridization was also assessed. The resulting multilocus phylogeny reveals for the first time species phylogeny and progenitor-derivative relationships of all di- and polyploid Hordeum taxa within a single analysis. Our study proves that it is possible to obtain a multilocus species-level phylogeny for di- and polyploid taxa by combining PCR with next-generation sequencing, without cloning and without creating a heavy load of sequence data. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
A Glimpse into the Satellite DNA Library in Characidae Fish (Teleostei, Characiformes)
Utsunomia, Ricardo; Ruiz-Ruano, Francisco J.; Silva, Duílio M. Z. A.; Serrano, Érica A.; Rosa, Ivana F.; Scudeler, Patrícia E. S.; Hashimoto, Diogo T.; Oliveira, Claudio; Camacho, Juan Pedro M.; Foresti, Fausto
2017-01-01
Satellite DNA (satDNA) is an abundant fraction of repetitive DNA in eukaryotic genomes and plays an important role in genome organization and evolution. In general, satDNA sequences follow a concerted evolutionary pattern through the intragenomic homogenization of different repeat units. In addition, the satDNA library hypothesis predicts that related species share a series of satDNA variants descended from a common ancestor species, with differential amplification of different satDNA variants. The finding of a same satDNA family in species belonging to different genera within Characidae fish provided the opportunity to test both concerted evolution and library hypotheses. For this purpose, we analyzed here sequence variation and abundance of this satDNA family in ten species, by a combination of next generation sequencing (NGS), PCR and Sanger sequencing, and fluorescence in situ hybridization (FISH). We found extensive between-species variation for the number and size of pericentromeric FISH signals. At genomic level, the analysis of 1000s of DNA sequences obtained by Illumina sequencing and PCR amplification allowed defining 150 haplotypes which were linked in a common minimum spanning tree, where different patterns of concerted evolution were apparent. This also provided a glimpse into the satDNA library of this group of species. In consistency with the library hypothesis, different variants for this satDNA showed high differences in abundance between species, from highly abundant to simply relictual variants. PMID:28855916
Content Is King: Databases Preserve the Collective Information of Science.
Yates, John R
2018-04-01
Databases store sequence information experimentally gathered to create resources that further science. In the last 20 years databases have become critical components of fields like proteomics where they provide the basis for large-scale and high-throughput proteomic informatics. Amos Bairoch, winner of the Association of Biomolecular Resource Facilities Frederick Sanger Award, has created some of the important databases proteomic research depends upon for accurate interpretation of data.
Thulin, Sara; Olcén, Per; Fredlund, Hans; Unemo, Magnus
2008-01-01
A segment of penA in Neisseria meningitidis strains (n = 127), including two nucleotide sites closely associated to reduced susceptibility to penicillins, was amplified and pyrosequenced. All results were in concordance with Sanger sequencing, and a high correlation between alterations in the two Peni-specific sites and reduced susceptibility to penicillins was identified. PMID:18070955
Assessment of an automated capillary system for Plasmodium vivax microsatellite genotyping.
Manrique, Paulo; Hoshi, Mari; Fasabi, Manuel; Nolasco, Oscar; Yori, Pablo; Calderón, Martiza; Gilman, Robert H; Kosek, Margaret N; Vinetz, Joseph M; Gamboa, Dionicia
2015-08-21
Several platforms have been used to generate the primary data for microsatellite analysis of malaria parasite genotypes. Each has relative advantages but share a limitation of being time- and cost-intensive. A commercially available automated capillary gel cartridge system was assessed in the microsatellite analysis of Plasmodium vivax diversity in the Peruvian Amazon. The reproducibility and accuracy of a commercially-available automated capillary system, QIAxcel, was assessed using a sequenced PCR product of 227 base pairs. This product was measured 42 times, then 27 P. vivax samples from Peruvian Amazon subjects were analyzed with this instrument using five informative microsatellites. Results from the QIAxcel system were compared with a Sanger-type sequencing machine, the ABI PRISM(®) 3100 Genetic Analyzer. Significant differences were seen between the sequenced amplicons and the results from the QIAxcel instrument. Different runs, plates and cartridges yielded significantly different results. Additionally, allele size decreased with each run by 0.045, or 1 bp, every three plates. QIAxcel and ABI PRISM systems differed in giving different values than those obtained by ABI PRISM, and too many (i.e. inaccurate) alleles per locus were also seen with the automated instrument. While P. vivax diversity could generally be estimated using an automated capillary gel cartridge system, the data demonstrate that this system is not sufficiently precise for reliably identifying parasite strains via microsatellite analysis. This conclusion reached after systematic analysis was due both to inadequate precision and poor reproducibility in measuring PCR product size.
Nouchi, A; Nguyen, T; Valantin, M A; Simon, A; Sayon, S; Agher, R; Calvez, V; Katlama, C; Marcelin, A G; Soulie, C
2018-05-29
To investigate the dynamics of HIV-1 variants archived in cells harbouring drug resistance-associated mutations (DRAMs) to lamivudine/emtricitabine, etravirine and rilpivirine in patients under effective ART free from selective pressure on these DRAMs, in order to assess the possibility of recycling molecules with resistance history. We studied 25 patients with at least one DRAM to lamivudine/emtricitabine, etravirine and/or rilpivirine identified on an RNA sequence in their history and with virological control for at least 5 years under a regimen excluding all drugs from the resistant class. Longitudinal ultra-deep sequencing (UDS) and Sanger sequencing of the reverse transcriptase region were performed on cell-associated HIV-1 DNA samples taken over the 5 years of follow-up. Viral variants harbouring the analysed DRAMs were no longer detected by UDS over the 5 years in 72% of patients, with viruses susceptible to the molecules of interest found after 5 years in 80% of patients with UDS and in 88% of patients with Sanger. Residual viraemia with <50 copies/mL was detected in 52% of patients. The median HIV DNA level remained stable (2.4 at baseline versus 2.1 log10 copies/106 cells 5 years later). These results show a clear trend towards clearance of archived DRAMs to reverse transcriptase inhibitors in cell-associated HIV-1 DNA after a long period of virological control, free from therapeutic selective pressure on these DRAMs, reflecting probable residual replication in some reservoirs of the fittest viruses and leading to persistent evolution of the archived HIV-1 DNA resistance profile.
Mensa-Vilaro, Anna; Teresa Bosque, María; Magri, Giuliana; Honda, Yoshitaka; Martínez-Banaclocha, Helios; Casorran-Berges, Marta; Sintes, Jordi; González-Roca, Eva; Ruiz-Ortiz, Estibaliz; Heike, Toshio; Martínez-Garcia, Juan J; Baroja-Mazo, Alberto; Cerutti, Andrea; Nishikomori, Ryuta; Yagüe, Jordi; Pelegrín, Pablo; Delgado-Beltran, Concha; Aróstegui, Juan I
2016-12-01
Gain-of-function NLRP3 mutations cause cryopyrin-associated periodic syndrome (CAPS), with gene mosaicism playing a relevant role in the pathogenesis. This study was undertaken to characterize the genetic cause underlying late-onset but otherwise typical CAPS. We studied a 64-year-old patient who presented with recurrent episodes of urticaria-like rash, fever, conjunctivitis, and oligoarthritis at age 56 years. DNA was extracted from both unfractionated blood and isolated leukocyte and CD34+ subpopulations. Genetic studies were performed using both the Sanger method of DNA sequencing and next-generation sequencing (NGS) methods. In vitro and ex vivo analyses were performed to determine the consequences that the presence of the variant have in the normal structure or function of the protein of the detected variant. NGS analyses revealed the novel p.Gln636Glu NLRP3 variant in unfractionated blood, with an allele frequency (18.4%) compatible with gene mosaicism. Sanger sequence chromatograms revealed a small peak corresponding to the variant allele. Amplicon-based deep sequencing revealed somatic NLRP3 mosaicism restricted to myeloid cells (31.8% in monocytes, 24.6% in neutrophils, and 11.2% in circulating CD34+ common myeloid progenitor cells) and its complete absence in lymphoid cells. Functional analyses confirmed the gain-of-function behavior of the gene variant and hyperactivity of the NLRP3 inflammasome in the patient. Treatment with anakinra resulted in good control of the disease. We identified the novel gain-of-function p.Gln636Glu NLRP3 mutation, which was detected as a somatic mutation restricted to myeloid cells, as the cause of late-onset but otherwise typical CAPS. Our results expand the diversity of CAPS toward milder phenotypes than previously reported, including those starting during adulthood. © 2016, American College of Rheumatology.
Prenatal diagnosis for a Chinese family with a de novo DMD gene mutation
Li, Tao; Zhang, Zhao-jing; Ma, Xin; Lv, Xue; Xiao, Hai; Guo, Qian-nan; Liu, Hong-yan; Wang, Hong-dan; Wu, Dong; Lou, Gui-yu; Wang, Xin; Zhang, Chao-yang; Liao, Shi-xiu
2017-01-01
Abstract Background: Patients with Duchenne muscular dystrophy (DMD) usually have severe and fatal symptoms. At present, there is no effective treatment for DMD, thus it is very important to avoid the birth of children with DMD by effective prenatal diagnosis. We identified a de novo DMD gene mutation in a Chinese family, and make a prenatal diagnosis. Methods: First, multiplex ligation-dependent probe amplification (MLPA) was applied to analyze DMD gene exon deletion/duplication in all family members. The coding sequences of 79 exons in DMD gene were analyzed by Sanger sequencing in the patient; and then according to DMD gene exon mutation in the patient, DMD gene sequencing was performed in the family members. On the basis of results above, the pathogenic mutation in DMD gene was identified. Results: MLPA showed no DMD gene exon deletion/duplication in all family members. Sanger sequencing revealed c.2767_2767delT [p.Ser923LeufsX26] mutation in DMD gene of the patient. Heterozygous deletion mutation (T/-) at this locus was observed in the pregnant woman and her mother and younger sister. The analyses of amniotic fluid samples indicated negative Y chromosome sex-determining gene, no DMD gene exon deletion/duplication, no mutations at c.2767 locus, and the inherited maternal X chromosome different from that of the patient. Conclusion: The pathogenic mutation in DMD gene, c.2767_2767delT [p.Ser923LeufsX26], identified in this family is a de novo mutation. On the basis of specific conditions, it is necessary to select suitable methods to make prenatal diagnosis more effective, accurate, and economic. PMID:29390271
Thurber, Mary I; Ghai, Ria R; Hyeroba, David; Weny, Geoffrey; Tumukunde, Alex; Chapman, Colin A; Wiseman, Roger W; Dinis, Jorge; Steeil, James; Greiner, Ellis C; Friedrich, Thomas C; O'Connor, David H; Goldberg, Tony L
2013-07-01
Hemoparasites of the apicomplexan family Plasmodiidae include the etiological agents of malaria, as well as a suite of non-human primate parasites from which the human malaria agents evolved. Despite the significance of these parasites for global health, little information is available about their ecology in multi-host communities. Primates were investigated in Kibale National Park, Uganda, where ecological relationships among host species are well characterized. Blood samples were examined for parasites of the genera Plasmodium and Hepatocystis using microscopy and PCR targeting the parasite mitochondrial cytochrome b gene, followed by Sanger sequencing. To assess co-infection, "deep sequencing" of a variable region within cytochrome b was performed. Out of nine black-and-white colobus (Colobus guereza), one blue guenon (Cercopithecus mitis), five grey-cheeked mangabeys (Lophocebus albigena), 23 olive baboons (Papio anubis), 52 red colobus (Procolobus rufomitratus) and 12 red-tailed guenons (Cercopithecus ascanius), 79 infections (77.5%) were found, all of which were Hepatocystis spp. Sanger sequencing revealed 25 different parasite haplotypes that sorted phylogenetically into six species-specific but morphologically similar lineages. "Deep sequencing" revealed mixed-lineage co-infections in baboons and red colobus (41.7% and 64.7% of individuals, respectively) but not in other host species. One lineage infecting red colobus also infected baboons, but always as the minor variant, suggesting directional cross-species transmission. Hepatocystis parasites in this primate community are a diverse assemblage of cryptic lineages, some of which co-infect hosts and at least one of which can cross primate species barriers. Copyright © 2013 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Implementing a genomic data management system using iRODS in the Wellcome Trust Sanger Institute
2011-01-01
Background Increasingly large amounts of DNA sequencing data are being generated within the Wellcome Trust Sanger Institute (WTSI). The traditional file system struggles to handle these increasing amounts of sequence data. A good data management system therefore needs to be implemented and integrated into the current WTSI infrastructure. Such a system enables good management of the IT infrastructure of the sequencing pipeline and allows biologists to track their data. Results We have chosen a data grid system, iRODS (Rule-Oriented Data management systems), to act as the data management system for the WTSI. iRODS provides a rule-based system management approach which makes data replication much easier and provides extra data protection. Unlike the metadata provided by traditional file systems, the metadata system of iRODS is comprehensive and allows users to customize their own application level metadata. Users and IT experts in the WTSI can then query the metadata to find and track data. The aim of this paper is to describe how we designed and used (from both system and user viewpoints) iRODS as a data management system. Details are given about the problems faced and the solutions found when iRODS was implemented. A simple use case describing how users within the WTSI use iRODS is also introduced. Conclusions iRODS has been implemented and works as the production system for the sequencing pipeline of the WTSI. Both biologists and IT experts can now track and manage data, which could not previously be achieved. This novel approach allows biologists to define their own metadata and query the genomic data using those metadata. PMID:21906284
DMD mutation spectrum analysis in 613 Chinese patients with dystrophinopathy.
Guo, Ruolan; Zhu, Guosheng; Zhu, Huimin; Ma, Ruiyu; Peng, Ying; Liang, Desheng; Wu, Lingqian
2015-08-01
Dystrophinopathy is a group of inherited diseases caused by mutations in the DMD gene. Within the dystrophinopathy spectrum, Duchenne and Becker muscular dystrophies are common X-linked recessive disorders that mainly feature striated muscle necrosis. We combined multiplex ligation-dependent probe amplification with Sanger sequencing to detect large deletions/duplications and point mutations in the DMD gene in 613 Chinese patients. A total of 571 (93.1%) patients were diagnosed, including 428 (69.8%) with large deletions/duplications and 143 (23.3%) with point mutations. Deletion/duplication breakpoints gathered mostly in introns 44-55. Reading frame rules could explain 88.6% of deletion mutations. We identified seventy novel point mutations that had not been previously reported. Spectrum expansion and genotype-phenotype analysis of DMD mutations on such a large sample size in Han Chinese population would provide new insights into the pathogenic mechanism underlying dystrophinopathies.
Wang, Shi-Yuan; Zhang, Qi; Zhang, Xiang; Zhao, Pei-Quan
2016-01-01
AIM To make a comprehensive analysis of the potential pathogenic genes related with Leber congenital amaurosis (LCA) in Chinese. METHODS LCA subjects and their families were retrospectively collected from 2013 to 2015. Firstly, whole-exome sequencing was performed in patients who had underwent gene mutation screening with nothing found, and then homozygous sites was selected, candidate sites were annotated, and pathogenic analysis was conducted using softwares including Sorting Tolerant from Intolerant (SIFT), Polyphen-2, Mutation assessor, Condel, and Functional Analysis through Hidden Markov Models (FATHMM). Furthermore, Gene Ontology function and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of pathogenic genes were performed followed by co-segregation analysis using Fisher exact Test. Sanger sequencing was used to validate single-nucleotide variations (SNVs). Expanded verification was performed in the rest patients. RESULTS Totally 51 LCA families with 53 patients and 24 family members were recruited. A total of 104 SNVs (66 LCA-related genes and 15 co-segregated genes) were submitted for expand verification. The frequencies of homozygous mutation of KRT12 and CYP1A1 were simultaneously observed in 3 families. Enrichment analysis showed that the potential pathogenic genes were mainly enriched in functions related to cell adhesion, biological adhesion, retinoid metabolic process, and eye development biological adhesion. Additionally, WFS1 and STAU2 had the highest homozygous frequencies. CONCLUSION LCA is a highly heterogeneous disease. Mutations in KRT12, CYP1A1, WFS1, and STAU2 may be involved in the development of LCA. PMID:27672588
Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.
Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong
2014-05-01
We characterized mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in mango cp genome, 91 found to be protein coding. Sequence analysis revealed cp genome of C. sinensis as closest neighbor of mango. We found 51 short repeats in mango cp genome supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.
Jasim, Anfal A.; Al-Bustan, Suzanne A.; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda
2018-01-01
Common variants of Apolipoprotein A5 (APOA5) have been associated with lipid levels yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. Sanger method was used in the direct sequencing of the full 3.7 Kb APOA5 and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3′ UTR 3222 C>T polymorphism at genomic position 116660500 based on human genome assembly GRCh37/hg:19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism. PMID:29686695
Jasim, Anfal A; Al-Bustan, Suzanne A; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda
2018-01-01
Common variants of Apolipoprotein A5 ( APOA 5) have been associated with lipid levels yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. Sanger method was used in the direct sequencing of the full 3.7 Kb APOA5 and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3' UTR 3222 C>T polymorphism at genomic position 116660500 based on human genome assembly GRCh37/hg:19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism.
Ruane, Sara; Raxworthy, Christopher J; Lemmon, Alan R; Lemmon, Emily Moriarty; Burbrink, Frank T
2015-10-12
Using molecular data generated by high throughput next generation sequencing (NGS) platforms to infer phylogeny is becoming common as costs go down and the ability to capture loci from across the genome goes up. While there is a general consensus that greater numbers of independent loci should result in more robust phylogenetic estimates, few studies have compared phylogenies resulting from smaller datasets for commonly used genetic markers with the large datasets captured using NGS. Here, we determine how a 5-locus Sanger dataset compares with a 377-locus anchored genomics dataset for understanding the evolutionary history of the pseudoxyrhophiine snake radiation centered in Madagascar. The Pseudoxyrhophiinae comprise ~86 % of Madagascar's serpent diversity, yet they are poorly known with respect to ecology, behavior, and systematics. Using the 377-locus NGS dataset and the summary statistics species-tree methods STAR and MP-EST, we estimated a well-supported species tree that provides new insights concerning intergeneric relationships for the pseudoxyrhophiines. We also compared how these and other methods performed with respect to estimating tree topology using datasets with varying numbers of loci. Using Sanger sequencing and an anchored phylogenomics approach, we sequenced datasets comprised of 5 and 377 loci, respectively, for 23 pseudoxyrhophiine taxa. For each dataset, we estimated phylogenies using both gene-tree (concatenation) and species-tree (STAR, MP-EST) approaches. We determined the similarity of resulting tree topologies from the different datasets using Robinson-Foulds distances. In addition, we examined how subsets of these data performed compared to the complete Sanger and anchored datasets for phylogenetic accuracy using the same tree inference methodologies, as well as the program *BEAST to determine if a full coalescent model for species tree estimation could generate robust results with fewer loci compared to the summary statistics species tree approaches. We also examined the individual gene trees in comparison to the 377-locus species tree using the program MetaTree. Using the full anchored dataset under a variety of methods gave us the same, well-supported phylogeny for pseudoxyrhophiines. The African pseudoxyrhophiine Duberria is the sister taxon to the Malagasy pseudoxyrhophiines genera, providing evidence for a monophyletic radiation in Madagascar. In addition, within Madagascar, the two major clades inferred correspond largely to the aglyphous and opisthoglyphous genera, suggesting that feeding specializations associated with tooth venom delivery may have played a major role in the early diversification of this radiation. The comparison of tree topologies from the concatenated and species-tree methods using different datasets indicated the 5-locus dataset cannot beused to infer a correct phylogeny for the pseudoxyrhophiines under any method tested here and that summary statistics methods require 50 or more loci to consistently recover the species-tree inferred using the complete anchored dataset. However, as few as 15 loci may infer the correct topology when using the full coalescent species tree method *BEAST. MetaTree analyses of each gene tree from the Sanger and anchored datasets found that none of the individual gene trees matched the 377-locus species tree, and that no gene trees were identical with respect to topology. Our results suggest that ≥50 loci may be necessary to confidently infer phylogenies when using summaryspecies-tree methods, but that the coalescent-based method *BEAST consistently recovers the same topology using only 15 loci. These results reinforce that datasets with small numbers of markers may result in misleading topologies, and further, that the method of inference used to generate a phylogeny also has a major influence on the number of loci necessary to infer robust species trees.
Bernardinelli, Emanuele; Nofziger, Charity; Patsch, Wolfgang; Rasp, Gerd; Paulmichl, Markus; Dossena, Silvia
2018-01-01
The prevalence and spectrum of sequence alterations in the SLC26A4 gene, which codes for the anion exchanger pendrin, are population-specific and account for at least 50% of cases of non-syndromic hearing loss associated with an enlarged vestibular aqueduct. A cohort of nineteen patients from Austria with hearing loss and a radiological alteration of the vestibular aqueduct underwent Sanger sequencing of SLC26A4 and GJB2, coding for connexin 26. The pathogenicity of sequence alterations detected was assessed by determining ion transport and molecular features of the corresponding SLC26A4 protein variants. In this group, four uncharacterized sequence alterations within the SLC26A4 coding region were found. Three of these lead to protein variants with abnormal functional and molecular features, while one should be considered with no pathogenic potential. Pathogenic SLC26A4 sequence alterations were only found in 12% of patients. SLC26A4 sequence alterations commonly found in other Caucasian populations were not detected. This survey represents the first study on the prevalence and spectrum of SLC26A4 sequence alterations in an Austrian cohort and further suggests that genetic testing should always be integrated with functional characterization and determination of the molecular features of protein variants in order to unequivocally identify or exclude a causal link between genotype and phenotype. PMID:29320412
Koutsis, Georgios; Lynch, David S; Tucci, Arianna; Houlden, Henry; Karadima, Georgia; Panas, Marios
2015-08-15
To present a Greek family in which 5 male and 2 female members developed progressive spastic paraplegia. Plasma very long chain fatty acids (VLCFA) were reportedly normal at first testing in an affected male and for over 30 years the presumed diagnosis was hereditary spastic paraplegia (HSP). Targeted next generation sequencing (NGS) was used as a further diagnostic tool. Targeted exome sequencing in the proband, followed by Sanger sequencing confirmation; mutation segregation testing in multiple family members and plasma VLCFA measurement in the proband. NGS of the proband revealed a novel frameshift mutation in ABCD1 (c.1174_1178del, p.Leu392Serfs*7), bringing an end to diagnostic uncertainty by establishing the diagnosis of adrenomyeloneuropathy (AMN), the myelopathic phenotype of X-linked adrenoleukodystrophy (ALD). The mutation segregated in all family members and the diagnosis of AMN/ALD was confirmed by plasma VLCFA measurement. Confounding factors that delayed the diagnosis are presented. This report highlights the diagnostic utility of NGS in patients with undiagnosed spastic paraplegia, establishing a molecular diagnosis of AMN, allowing proper genetic counseling and management, and overcoming the diagnostic delay that can be rarely caused by false negative VLCFA analysis. Copyright © 2015 Elsevier B.V. All rights reserved.
Whole Genome Sequencing Reveals a De Novo SHANK3 Mutation in Familial Autism Spectrum Disorder
Nemirovsky, Sergio I.; Córdoba, Marta; Zaiat, Jonathan J.; Completa, Sabrina P.; Vega, Patricia A.; González-Morón, Dolores; Medina, Nancy M.; Fabbro, Mónica; Romero, Soledad; Brun, Bianca; Revale, Santiago; Ogara, María Florencia; Pecci, Adali; Marti, Marcelo; Vazquez, Martin; Turjanski, Adrián; Kauffman, Marcelo A.
2015-01-01
Introduction Clinical genomics promise to be especially suitable for the study of etiologically heterogeneous conditions such as Autism Spectrum Disorder (ASD). Here we present three siblings with ASD where we evaluated the usefulness of Whole Genome Sequencing (WGS) for the diagnostic approach to ASD. Methods We identified a family segregating ASD in three siblings with an unidentified cause. We performed WGS in the three probands and used a state-of-the-art comprehensive bioinformatic analysis pipeline and prioritized the identified variants located in genes likely to be related to ASD. We validated the finding by Sanger sequencing in the probands and their parents. Results Three male siblings presented a syndrome characterized by severe intellectual disability, absence of language, autism spectrum symptoms and epilepsy with negative family history for mental retardation, language disorders, ASD or other psychiatric disorders. We found germline mosaicism for a heterozygous deletion of a cytosine in the exon 21 of the SHANK3 gene, resulting in a missense sequence of 5 codons followed by a premature stop codon (NM_033517:c.3259_3259delC, p.Ser1088Profs*6). Conclusions We reported an infrequent form of familial ASD where WGS proved useful in the clinic. We identified a mutation in SHANK3 that underscores its relevance in Autism Spectrum Disorder. PMID:25646853
Whole genome sequencing reveals a de novo SHANK3 mutation in familial autism spectrum disorder.
Nemirovsky, Sergio I; Córdoba, Marta; Zaiat, Jonathan J; Completa, Sabrina P; Vega, Patricia A; González-Morón, Dolores; Medina, Nancy M; Fabbro, Mónica; Romero, Soledad; Brun, Bianca; Revale, Santiago; Ogara, María Florencia; Pecci, Adali; Marti, Marcelo; Vazquez, Martin; Turjanski, Adrián; Kauffman, Marcelo A
2015-01-01
Clinical genomics promise to be especially suitable for the study of etiologically heterogeneous conditions such as Autism Spectrum Disorder (ASD). Here we present three siblings with ASD where we evaluated the usefulness of Whole Genome Sequencing (WGS) for the diagnostic approach to ASD. We identified a family segregating ASD in three siblings with an unidentified cause. We performed WGS in the three probands and used a state-of-the-art comprehensive bioinformatic analysis pipeline and prioritized the identified variants located in genes likely to be related to ASD. We validated the finding by Sanger sequencing in the probands and their parents. Three male siblings presented a syndrome characterized by severe intellectual disability, absence of language, autism spectrum symptoms and epilepsy with negative family history for mental retardation, language disorders, ASD or other psychiatric disorders. We found germline mosaicism for a heterozygous deletion of a cytosine in the exon 21 of the SHANK3 gene, resulting in a missense sequence of 5 codons followed by a premature stop codon (NM_033517:c.3259_3259delC, p.Ser1088Profs*6). We reported an infrequent form of familial ASD where WGS proved useful in the clinic. We identified a mutation in SHANK3 that underscores its relevance in Autism Spectrum Disorder.
2013-01-01
Background Characterising genetic diversity through the analysis of massively parallel sequencing (MPS) data offers enormous potential to significantly improve our understanding of the genetic basis for observed phenotypes, including predisposition to and progression of complex human disease. Great challenges remain in resolving genetic variants that are genuine from the millions of artefactual signals. Results FAVR is a suite of new methods designed to work with commonly used MPS analysis pipelines to assist in the resolution of some of the issues related to the analysis of the vast amount of resulting data, with a focus on relatively rare genetic variants. To the best of our knowledge, no equivalent method has previously been described. The most important and novel aspect of FAVR is the use of signatures in comparator sequence alignment files during variant filtering, and annotation of variants potentially shared between individuals. The FAVR methods use these signatures to facilitate filtering of (i) platform and/or mapping-specific artefacts, (ii) common genetic variants, and, where relevant, (iii) artefacts derived from imbalanced paired-end sequencing, as well as annotation of genetic variants based on evidence of co-occurrence in individuals. We applied conventional variant calling applied to whole-exome sequencing datasets, produced using both SOLiD and TruSeq chemistries, with or without downstream processing by FAVR methods. We demonstrate a 3-fold smaller rare single nucleotide variant shortlist with no detected reduction in sensitivity. This analysis included Sanger sequencing of rare variant signals not evident in dbSNP131, assessment of known variant signal preservation, and comparison of observed and expected rare variant numbers across a range of first cousin pairs. The principles described herein were applied in our recent publication identifying XRCC2 as a new breast cancer risk gene and have been made publically available as a suite of software tools. Conclusions FAVR is a platform-agnostic suite of methods that significantly enhances the analysis of large volumes of sequencing data for the study of rare genetic variants and their influence on phenotypes. PMID:23441864
Deletion in the EVC2 gene causes chondrodysplastic dwarfism in Tyrolean Grey cattle.
Murgiano, Leonardo; Jagannathan, Vidhya; Benazzi, Cinzia; Bolcato, Marilena; Brunetti, Barbara; Muscatello, Luisa Vera; Dittmer, Keren; Piffer, Christian; Gentile, Arcangelo; Drögemüller, Cord
2014-01-01
During the summer of 2013 seven Italian Tyrolean Grey calves were born with abnormally short limbs. Detailed clinical and pathological examination revealed similarities to chondrodysplastic dwarfism. Pedigree analysis showed a common founder, assuming autosomal monogenic recessive transmission of the defective allele. A positional cloning approach combining genome wide association and homozygosity mapping identified a single 1.6 Mb genomic region on BTA 6 that was associated with the disease. Whole genome re-sequencing of an affected calf revealed a single candidate causal mutation in the Ellis van Creveld syndrome 2 (EVC2) gene. This gene is known to be associated with chondrodysplastic dwarfism in Japanese Brown cattle, and dwarfism, abnormal nails and teeth, and dysostosis in humans with Ellis-van Creveld syndrome. Sanger sequencing confirmed the presence of a 2 bp deletion in exon 19 (c.2993_2994ACdel) that led to a premature stop codon in the coding sequence of bovine EVC2, and was concordant with the recessive pattern of inheritance in affected and carrier animals. This loss of function mutation confirms the important role of EVC2 in bone development. Genetic testing can now be used to eliminate this form of chondrodysplastic dwarfism from Tyrolean Grey cattle.
Deletion in the EVC2 Gene Causes Chondrodysplastic Dwarfism in Tyrolean Grey Cattle
Murgiano, Leonardo; Jagannathan, Vidhya; Benazzi, Cinzia; Bolcato, Marilena; Brunetti, Barbara; Muscatello, Luisa Vera; Dittmer, Keren; Piffer, Christian; Gentile, Arcangelo; Drögemüller, Cord
2014-01-01
During the summer of 2013 seven Italian Tyrolean Grey calves were born with abnormally short limbs. Detailed clinical and pathological examination revealed similarities to chondrodysplastic dwarfism. Pedigree analysis showed a common founder, assuming autosomal monogenic recessive transmission of the defective allele. A positional cloning approach combining genome wide association and homozygosity mapping identified a single 1.6 Mb genomic region on BTA 6 that was associated with the disease. Whole genome re-sequencing of an affected calf revealed a single candidate causal mutation in the Ellis van Creveld syndrome 2 (EVC2) gene. This gene is known to be associated with chondrodysplastic dwarfism in Japanese Brown cattle, and dwarfism, abnormal nails and teeth, and dysostosis in humans with Ellis-van Creveld syndrome. Sanger sequencing confirmed the presence of a 2 bp deletion in exon 19 (c.2993_2994ACdel) that led to a premature stop codon in the coding sequence of bovine EVC2, and was concordant with the recessive pattern of inheritance in affected and carrier animals. This loss of function mutation confirms the important role of EVC2 in bone development. Genetic testing can now be used to eliminate this form of chondrodysplastic dwarfism from Tyrolean Grey cattle. PMID:24733244
GNE missense mutation in recessive familial amyotrophic lateral sclerosis.
Köroğlu, Çiğdem; Yılmaz, Rezzak; Sorgun, Mine Hayriye; Solakoğlu, Seyhun; Şener, Özden
2017-12-01
Amyotrophic lateral sclerosis (ALS) is a motor neuron disease eventually leading to death from respiratory failure. Recessive inheritance is very rare. Here, we describe the clinical findings in a consanguineous family with five men afflicted with recessive ALS and the identification of the homozygous mutation responsible for the disorder. The onset of the disease ranged from 12 to 35 years of age, with variable disease progressions. We performed clinical investigations including metabolic and paraneoplastic screening, cranial and cervical imaging, and electrophysiology. We mapped the disease gene to 9p21.1-p12 with a LOD score of 5.2 via linkage mapping using genotype data for single-nucleotide polymorphism markers and performed exome sequence analysis to identify the disease-causing gene variant. We also Sanger sequenced all coding sequences of SIGMAR1, a gene reported as responsible for juvenile ALS in a family. We did not find any mutation in SIGMAR1. Instead, we identified a novel homozygous missense mutation p.(His705Arg) in GNE which was predicted as damaging by online tools. GNE has been associated with inclusion body myopathy and is expressed in many tissues. We propose that the GNE mutation underlies the pathology in the family.
Jang, Mi-Ae; Lee, Taeheon; Lee, Junnam
2015-01-01
Waardenburg syndrome (WS) is a clinically and genetically heterogeneous hereditary auditory pigmentary disorder characterized by congenital sensorineural hearing loss and iris discoloration. Many genes have been linked to WS, including PAX3, MITF, SNAI2, EDNRB, EDN3, and SOX10, and many additional genes have been associated with disorders with phenotypic overlap with WS. To screen all possible genes associated with WS and congenital deafness simultaneously, we performed diagnostic exome sequencing (DES) in a male patient with clinical features consistent with WS. Using DES, we identified a novel missense variant (c.220C>G; p.Arg74Gly) in exon 2 of the PAX3 gene in the patient. Further analysis by Sanger sequencing of the patient and his parents revealed a de novo occurrence of the variant. Our findings show that DES can be a useful tool for the identification of pathogenic gene variants in WS patients and for differentiation between WS and similar disorders. To the best of our knowledge, this is the first report of genetically confirmed WS in Korea. PMID:25932447
Blouin, Arnaud G; Chooi, Kar Mun; Warren, Ben; Napier, Kathryn R; Barrero, Roberto A; MacDiarmid, Robin M
2018-05-01
A novel virus, with characteristics of viruses classified within the genus Vitivirus, was identified from a sample of Vitis vinifera cv. Chardonnay in New Zealand. The virus was detected with high throughput sequencing (small RNA and total RNA) and its sequence was confirmed by Sanger sequencing. Its genome is 7507 nt long (excluding the polyA tail) with an organisation similar to that described for other classifiable members of the genus Vitivirus. The closest relative of the virus is grapevine virus E (GVE) with 65% aa identity in ORF1 (65% nt identity) and 63% aa identity in the coat protein (66% nt identity). The relationship with GVE was confirmed with phylogenetic analysis, showing the new virus branching with GVE, Agave tequilina leaf virus and grapevine virus G (GVG). A limited survey revealed the presence of this virus in multiple plants from the same location where the newly described GVG was discovered, and in most cases both viruses were detected as co-infections. The genetic characteristics of this virus suggest it represents an isolate of a new species within the genus Vitivirus and following the current nomenclature, we propose the name "Grapevine virus I".
MMACHC gene mutation in familial hypogonadism with neurological symptoms.
Shi, Changhe; Shang, Dandan; Sun, Shilei; Mao, Chengyuan; Qin, Jie; Luo, Haiyang; Shao, Mingwei; Chen, Zhengguang; Liu, Yutao; Liu, Xinjing; Song, Bo; Xu, Yuming
2015-12-15
Recent studies have convincingly documented that hypogonadism is a component of various hereditary disorders and is often recognized as an important clinical feature in combination with various neurological symptoms, yet, the causative genes in a few related families are still unknown. High-throughput sequencing has become an efficient method to identify causative genes in related complex hereditary disorders. In this study, we performed exome sequencing in a family presenting hypergonadotropic hypogonadism with neurological presentations of mental retardation, epilepsy, ataxia, and leukodystrophy. After bioinformatic analysis and Sanger sequencing validation, we identified compound heterozygous mutations: c.482G>A (p.R161Q) and c.609G>A (p.W203X) in MMACHC gene in this pedigree. MMACHC was previously confirmed to be responsible for methylmalonic aciduria (MMA) combined with homocystinuria, cblC type (cblC disease), a hereditary vitamin B12 metabolic disorder. Biochemical and gas chromatography-mass spectrometry (GC-MS) examinations in this pedigree further supported the cblC disease diagnosis. These results indicated that hypergonadotropic hypogonadism may be a novel clinical manifestation of cblC disease, but more reports on additional patients are needed to support this hypothesis. Copyright © 2015 Elsevier B.V. All rights reserved.
Huang, Tianhong; Yang, Guilin; Dang, Xiao; Ao, Feijian; Li, Jiankang; He, Yizhou; Tang, Qiyuan; He, Qing
2017-11-01
Alagille syndrome (AGS) is a highly variable, autosomal dominant disease that affects multiple structures including the liver, heart, eyes, bones and face. Targeted region capture sequencing focuses on a panel of known pathogenic genes and provides a rapid, cost‑effective and accurate method for molecular diagnosis. In a Chinese family, this method was used on the proband and Sanger sequencing was applied to validate the candidate mutation. A de novo heterozygous mutation (c.3254_3255insT p.Leu1085PhefsX24) of the jagged 1 gene was identified as the potential disease‑causing gene mutation. In conclusion, the present study suggested that target region capture sequencing is an efficient, reliable and accurate approach for the clinical diagnosis of AGS. Furthermore, these results expand on the understanding of the pathogenesis of AGS.
An efficient approach to BAC based assembly of complex genomes.
Visendi, Paul; Berkman, Paul J; Hayashi, Satomi; Golicz, Agnieszka A; Bayer, Philipp E; Ruperao, Pradeep; Hurgobin, Bhavna; Montenegro, Juan; Chan, Chon-Kit Kenneth; Staňková, Helena; Batley, Jacqueline; Šimková, Hana; Doležel, Jaroslav; Edwards, David
2016-01-01
There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, to variable success. We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.
Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D
2017-01-01
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing.
Hargreaves, Adam D; Mulley, John F
2015-01-01
Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0-2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5' and 3' UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species.
Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing
Hargreaves, Adam D.
2015-01-01
Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0–2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5′ and 3′ UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species. PMID:26623194
Expanding sialidosis spectrum by genome-wide screening: NEU1 mutations in adult-onset myoclonus.
Canafoglia, Laura; Robbiano, Angela; Pareyson, Davide; Panzica, Ferruccio; Nanetti, Lorenzo; Giovagnoli, Anna Rita; Venerando, Anna; Gellera, Cinzia; Franceschetti, Silvana; Zara, Federico
2014-06-03
To identify the genetic cause of a familial form of late-onset action myoclonus in 2 unrelated patients. Both probands had 2 siblings displaying a similar disorder. Extensive laboratory examinations, including biochemical assessment for urine sialic acid in the 2 probands, were negative. Exome sequencing was performed in the probands using an Illumina platform. Segregation analysis of putative mutations was performed in all family members by standard Sanger sequencing protocols. NEU1 mutations were detected in 3 siblings of each family with prominent cortical myoclonus presenting in the third decade of life and having a mild and slowly progressive course. They did not have macular cherry-red spot and their urinary sialic acid excretion was within normal values. Genetic analysis demonstrated a homozygous mutation in family 1 (c.200G>T, p.S67I) and 2 compound heterozygous mutations in family 2 (c.679G>A, p.G227R; c.913C>T, p.R305C). Our observation indicates that sialidosis should be suspected and the NEU1 gene analyzed in patients with isolated action myoclonus presenting in adulthood in the absence of other typical clinical and laboratory findings. © 2014 American Academy of Neurology.
Current molecular genetics strategies for the diagnosis of lysosomal storage disorders.
Giugliani, Roberto; Brusius-Facchin, Ana-Carolina; Pasqualim, Gabriela; Leistner-Segal, Sandra; Riegel, Mariluce; Matte, Ursula
2016-01-01
Lysosomal storage disorders (LSDs) are a group of almost 50 monogenic diseases characterized by mutations causing deficiency of lysosomal enzymes or non-enzyme proteins involved in transport across the lysosomal membrane, protein maturation or lysosomal biogenesis. Usually, affected patients are normal at birth and have a progressive and severe disease with high morbidity and reduced life expectancy. The overall incidence of LSDs is usually estimated as 1:5000, but newborn screening studies are indicating that it could be much higher. Specific therapies were already developed for selected LSDs, making the timely and correct diagnosis very important for successful treatment and also for genetic counseling. In most LSD cases the biochemical techniques provide a reliable diagnosis. However, the identification of pathogenic mutations by genetic analysis is being increasingly recommended to provide additional information. In this paper we discuss the conventional methods for genetic analysis used in the LSDs [restriction fragment length polymorphism (RFLP), amplification-refractory mutation system (ARMS), single strand conformation polymorphism (SSCP), denaturing high performance liquid chromatography (dHPLC), real-time polymerase chain reaction, high resolution melting (HRM), multiplex ligation-dependent probe amplification (MLPA), Sanger sequencing] and also the newer approaches [massive parallel sequencing, array comparative genomic hybridization (CGH)].
Generation and Analysis of the Expressed Sequence Tags from the Mycelium of Ganoderma lucidum
Huang, Yen-Hua; Wu, Hung-Yi; Wu, Keh-Ming; Liu, Tze-Tze; Liou, Ruey-Fen; Tsai, Shih-Feng; Shiao, Ming-Shi; Ho, Low-Tone; Tzean, Shean-Shong; Yang, Ueng-Cheng
2013-01-01
Ganoderma lucidum (G. lucidum) is a medicinal mushroom renowned in East Asia for its potential biological effects. To enable a systematic exploration of the genes associated with the various phenotypes of the fungus, the genome consortium of G. lucidum has carried out an expressed sequence tag (EST) sequencing project. Using a Sanger sequencing based approach, 47,285 ESTs were obtained from in vitro cultures of G. lucidum mycelium of various durations. These ESTs were further clustered and merged into 7,774 non-redundant expressed loci. The features of these expressed contigs were explored in terms of over-representation, alternative splicing, and natural antisense transcripts. Our results provide an invaluable information resource for exploring the G. lucidum transcriptome and its regulation. Many cases of the genes over-represented in fast-growing dikaryotic mycelium are closely related to growth, such as cell wall and bioactive compound synthesis. In addition, the EST-genome alignments containing putative cassette exons and retained introns were manually curated and then used to make inferences about the predominating splice-site recognition mechanism of G. lucidum. Moreover, a number of putative antisense transcripts have been pinpointed, from which we noticed that two cases are likely to reveal hitherto undiscovered biological pathways. To allow users to access the data and the initial analysis of the results of this project, a dedicated web site has been created at http://csb2.ym.edu.tw/est/. PMID:23658685
Jangid, Kamlesh; Kao, Ming-Hung; Lahamge, Aishwarya; Williams, Mark A; Rathbun, Stephen L; Whitman, William B
2016-01-01
K-shuff is a new algorithm for comparing the similarity of gene sequence libraries, providing measures of the structural and compositional diversity as well as the significance of the differences between these measures. Inspired by Ripley's K-function for spatial point pattern analysis, the Intra K-function or IKF measures the structural diversity, including both the richness and overall similarity of the sequences, within a library. The Cross K-function or CKF measures the compositional diversity between gene libraries, reflecting both the number of OTUs shared as well as the overall similarity in OTUs. A Monte Carlo testing procedure then enables statistical evaluation of both the structural and compositional diversity between gene libraries. For 16S rRNA gene libraries from complex bacterial communities such as those found in seawater, salt marsh sediments, and soils, K-shuff yields reproducible estimates of structural and compositional diversity with libraries greater than 50 sequences. Similarly, for pyrosequencing libraries generated from a glacial retreat chronosequence and Illumina® libraries generated from US homes, K-shuff required >300 and 100 sequences per sample, respectively. Power analyses demonstrated that K-shuff is sensitive to small differences in Sanger or Illumina® libraries. This extra sensitivity of K-shuff enabled examination of compositional differences at much deeper taxonomic levels, such as within abundant OTUs. This is especially useful when comparing communities that are compositionally very similar but functionally different. K-shuff will therefore prove beneficial for conventional microbiome analysis as well as specific hypothesis testing.
Expanding the clinical spectrum of COL1A1 mutations in different forms of glaucoma.
Mauri, Lucia; Uebe, Steffen; Sticht, Heinrich; Vossmerbaeumer, Urs; Weisschuh, Nicole; Manfredini, Emanuela; Maselli, Edoardo; Patrosso, Mariacristina; Weinreb, Robert N; Penco, Silvana; Reis, André; Pasutto, Francesca
2016-08-02
Primary congenital glaucoma (PCG) and early onset glaucomas are one of the major causes of children and young adult blindness worldwide. Both autosomal recessive and dominant inheritance have been described with involvement of several genes including CYP1B1, FOXC1, PITX2, MYOC and PAX6. However, mutations in these genes explain only a small fraction of cases suggesting the presence of further candidate genes. To elucidate further genetic causes of these conditions whole exome sequencing (WES) was performed in an Italian patient, diagnosed with PCG and retinal detachment, and his unaffected parents. Sanger sequencing of the complete coding region of COL1A1 was performed in a total of 26 further patients diagnosed with PCG or early onset glaucoma. Exclusion of pathogenic variations in known glaucoma genes as CYP1B1, MYOC, FOXC1, PITX2 and PAX6 was additionally done per Sanger sequencing and Multiple Ligation-dependent Probe Amplification (MLPA) analysis. In the patient diagnosed with PCG and retinal detachment, analysis of WES data identified compound heterozygous variants in COL1A1 (p.Met264Leu; p.Ala1083Thr). Targeted COL1A1 screening of 26 additional patients detected three further heterozygous variants (p.Arg253*, p.Gly767Ser and p.Gly154Val) in three distinct subjects: two of them diagnosed with early onset glaucoma and mild form of osteogenesis imperfecta (OI), one patient with a diagnosis of PCG at age 4 years. All five variants affected evolutionary, highly conserved amino acids indicating important functional restrictions. Molecular modeling predicted that the heterozygous variants are dominant in effect and affect protein stability and thus the amount of available protein, while the compound heterozygous variants act as recessive alleles and impair binding affinity to two main COL1A1 binding proteins: Hsp47 and fibronectin. Dominant inherited mutations in COL1A1 are known causes of connective tissues disorders such as OI. These disorders are also associated with different ocular abnormalities, although recognition of the common pathology for both features is seldom being recognized. Our results expand the role of COL1A1 mutations in different forms of early-onset glaucoma with and without signs of OI. Thus, we suggest including COL1A1 mutation screening in the genetic work-up of glaucoma cases and detailed ophthalmic examinations with fundus analysis in patients with OI.
Poly A tail length analysis of in vitro transcribed mRNA by LC-MS.
Beverly, Michael; Hagen, Caitlin; Slack, Olga
2018-02-01
The 3'-polyadenosine (poly A) tail of in vitro transcribed (IVT) mRNA was studied using liquid chromatography coupled to mass spectrometry (LC-MS). Poly A tails were cleaved from the mRNA using ribonuclease T1 followed by isolation with dT magnetic beads. Extracted tails were then analyzed by LC-MS which provided tail length information at single-nucleotide resolution. A 2100-nt mRNA with plasmid-encoded poly A tail lengths of either 27, 64, 100, or 117 nucleotides was used for these studies as enzymatically added poly A tails showed significant length heterogeneity. The number of As observed in the tails closely matched Sanger sequencing results of the DNA template, and even minor plasmid populations with sequence variations were detected. When the plasmid sequence contained a discreet number of poly As in the tail, analysis revealed a distribution that included tails longer than the encoded tail lengths. These observations were consistent with transcriptional slippage of T7 RNAP taking place within a poly A sequence. The type of RNAP did not alter the observed tail distribution, and comparison of T3, T7, and SP6 showed all three RNAPs produced equivalent tail length distributions. The addition of a sequence at the 3' end of the poly A tail did, however, produce narrower tail length distributions which supports a previously described model of slippage where the 3' end can be locked in place by having a G or C after the poly nucleotide region. Graphical abstract Determination of mRNA poly A tail length using magnetic beads and LC-MS.
Varga, Elizabeth; Chao, Elizabeth C; Yeager, Nicholas D
2015-09-01
Next-generation sequencing (NGS) technology is increasingly utilized to identify therapeutic targets for patients with malignancy. This technology also has the capability to reveal the presence of constitutional genetic alterations, which may have significant implications for patients and their family members. Here we present the case of a 23 year old Caucasian patient with recurrent undifferentiated sarcoma who had NGS-based tumor analysis using an assay which simultaneously analyzed the entire coding sequence of 236 cancer-related genes (3769 exons) plus 47 introns from 19 genes often rearranged or altered in cancer. Pathogenic alterations were reported in tumor as the predicted protein alterations, BRCA2 "R645fs*15″ and MLH1 "E694*". Because constitutional BRCA2 and MLH1 gene mutations are associated with Hereditary Breast Ovarian Cancer Syndrome (HBOCS) and Lynch syndrome respectively, sequence analysis of DNA isolated from peripheral blood was performed. The presence of the alterations, BRCA2 c.1929delG and MLH1 c.2080G>T, corresponding to the previously reported predicted protein alterations, were confirmed by Sanger sequencing in the constitutional DNA. An additional DNA finding was reported in this analysis, MLH1 c.2081A>C at the neighboring nucleotide. Further evaluation of the family revealed that all alterations were paternally inherited and the two MLH1 substitutions were in cis, more appropriately referred to as MLH1 c.2080_2081delGAinsTC, which is classified as a variant of uncertain significance. This case illustrates important considerations related to appropriate interpretation of NGS tumor results and follow-up of patients with potentially deleterious constitutional alterations.
Li, Hong Lian; Gu, Xiao Hui; Li, Bi Jun; Chen, Xiao; Lin, Hao Ran; Xia, Jun Hong
2017-01-01
Hypoxia is a major cause of fish morbidity and mortality in the aquatic environment. Hypoxia-inducible factors are very important modulators in the transcriptional response to hypoxic stress. In this study, we characterized and conducted functional analysis of hypoxia-inducible factor HIF1α and its inhibitor HIF1αn in Nile tilapia (Oreochromis niloticus). By cloning and Sanger sequencing, we obtained the full length cDNA sequences for HIF1α (2686bp) and HIF1αn (1308bp), respectively. The CDS of HIF1α includes 15 exons encoding 768 amino acid residues and the CDS of HIF1αn contains 8 exons encoding 354 amino acid residues. The complete CDS sequences of HIF1α and HIF1αn cloned from tilapia shared very high homology with known genes from other fishes. HIF1α show differentiated expression in different tissues (brain, heart, gill, spleen, liver) and at different hypoxia exposure times (6h, 12h, 24h). HIF1αn expression level under hypoxia is generally increased (6h, 12h, 24h) and shows extremely highly upregulation in brain tissue under hypoxia. A functional determination site analysis in the protein sequences between fish and land animals identified 21 amino acid sites in HIF1α and 2 sites in HIF1αn as significantly associated sites (α = 0.05). Phylogenetic tree-based positive selection analysis suggested 22 sites in HIF1α as positively selected sites with a p-value of at least 95% for fish lineages compared to the land animals. Our study could be important for clarifying the mechanism of fish adaptation to aquatic hypoxia environment.
Li, Hong Lian; Gu, Xiao Hui; Li, Bi Jun; Chen, Xiao; Lin, Hao Ran; Xia, Jun Hong
2017-01-01
Hypoxia is a major cause of fish morbidity and mortality in the aquatic environment. Hypoxia-inducible factors are very important modulators in the transcriptional response to hypoxic stress. In this study, we characterized and conducted functional analysis of hypoxia-inducible factor HIF1α and its inhibitor HIF1αn in Nile tilapia (Oreochromis niloticus). By cloning and Sanger sequencing, we obtained the full length cDNA sequences for HIF1α (2686bp) and HIF1αn (1308bp), respectively. The CDS of HIF1α includes 15 exons encoding 768 amino acid residues and the CDS of HIF1αn contains 8 exons encoding 354 amino acid residues. The complete CDS sequences of HIF1α and HIF1αn cloned from tilapia shared very high homology with known genes from other fishes. HIF1α show differentiated expression in different tissues (brain, heart, gill, spleen, liver) and at different hypoxia exposure times (6h, 12h, 24h). HIF1αn expression level under hypoxia is generally increased (6h, 12h, 24h) and shows extremely highly upregulation in brain tissue under hypoxia. A functional determination site analysis in the protein sequences between fish and land animals identified 21 amino acid sites in HIF1α and 2 sites in HIF1αn as significantly associated sites (α = 0.05). Phylogenetic tree-based positive selection analysis suggested 22 sites in HIF1α as positively selected sites with a p-value of at least 95% for fish lineages compared to the land animals. Our study could be important for clarifying the mechanism of fish adaptation to aquatic hypoxia environment. PMID:28278251
Cold Urticaria, Immunodeficiency, and Autoimmunity Related to PLCG2 Deletions
Ombrello, Michael J.; Remmers, Elaine F.; Sun, Guangping; Freeman, Alexandra F.; Datta, Shrimati; Torabi-Parizi, Parizad; Subramanian, Naeha; Bunney, Tom D.; Baxendale, Rhona W.; Martins, Marta S.; Romberg, Neil; Komarow, Hirsh; Aksentijevich, Ivona; Kim, Hun Sik; Ho, Jason; Cruse, Glenn; Jung, Mi-Yeon; Gilfillan, Alasdair M.; Metcalfe, Dean D.; Nelson, Celeste; O'Brien, Michelle; Wisch, Laura; Stone, Kelly; Douek, Daniel C.; Gandhi, Chhavi; Wanderer, Alan A.; Lee, Hane; Nelson, Stanley F.; Shianna, Kevin V.; Cirulli, Elizabeth T.; Goldstein, David B.; Long, Eric O.; Moir, Susan; Meffre, Eric; Holland, Steven M.; Kastner, Daniel L.; Katan, Matilda; Hoffman, Hal M.; Milner, Joshua D.
2012-01-01
Background Mendelian analysis of disorders of immune regulation can provide insight into molecular pathways associated with host defense and immune tolerance. Methods We identified three families with a dominantly inherited complex of cold-induced urticaria, antibody deficiency, and susceptibility to infection and autoimmunity. Immunophenotyping methods included flow cytometry, analysis of serum immunoglobulins and autoantibodies, lymphocyte stimulation, and enzymatic assays. Genetic studies included linkage analysis, targeted Sanger sequencing, and next-generation whole-genome sequencing. Results Cold urticaria occurred in all affected subjects. Other, variable manifestations included atopy, granulomatous rash, autoimmune thyroiditis, the presence of antinuclear antibodies, sinopulmonary infections, and common variable immunodeficiency. Levels of serum IgM and IgA and circulating natural killer cells and class-switched memory B cells were reduced. Linkage analysis showed a 7-Mb candidate interval on chromosome 16q in one family, overlapping by 3.5 Mb a disease-associated haplotype in a smaller family. This interval includes PLCG2, encoding phospholipase Cγ2 (PLCγ2), a signaling molecule expressed in B cells, natural killer cells, and mast cells. Sequencing of complementary DNA revealed heterozygous transcripts lacking exon 19 in two families and lacking exons 20 through 22 in a third family. Genomic sequencing identified three distinct in-frame deletions that cosegregated with disease. These deletions, located within a region encoding an autoinhibitory domain, result in protein products with constitutive phospholipase activity. PLCG2-expressing cells had diminished cellular signaling at 37°C but enhanced signaling at subphysiologic temperatures. Conclusions Genomic deletions in PLCG2 cause gain of PLCγ2 function, leading to signaling abnormalities in multiple leukocyte subsets and a phenotype encompassing both excessive and deficient immune function. (Funded by the National Institutes of Health Intramural Research Programs and others.) PMID:22236196
McInerney-Leo, A M; Harris, J E; Leo, P J; Marshall, M S; Gardiner, B; Kinning, E; Leong, H Y; McKenzie, F; Ong, W P; Vodopiutz, J; Wicking, C; Brown, M A; Zankl, A; Duncan, E L
2015-12-01
Short-rib thoracic dystrophies (SRTDs) are congenital disorders due to defects in primary cilium function. SRTDs are recessively inherited with mutations identified in 14 genes to date (comprising 398 exons). Conventional mutation detection (usually by iterative Sanger sequencing) is inefficient and expensive, and often not undertaken. Whole exome massive parallel sequencing has been used to identify new genes for SRTD (WDR34, WDR60 and IFT172); however, the clinical utility of whole exome sequencing (WES) has not been established. WES was performed in 11 individuals with SRTDs. Compound heterozygous or homozygous mutations were identified in six confirmed SRTD genes in 10 individuals (IFT172, DYNC2H1, TTC21B, WDR60, WDR34 and NEK1), giving overall sensitivity of 90.9%. WES data from 993 unaffected individuals sequenced using similar technology showed two individuals with rare (minor allele frequency <0.005) compound heterozygous variants of unknown significance in SRTD genes (specificity >99%). Costs for consumables, laboratory processing and bioinformatic analysis were
Rare variants in RTEL1 are associated with familial interstitial pneumonia.
Cogan, Joy D; Kropski, Jonathan A; Zhao, Min; Mitchell, Daphne B; Rives, Lynette; Markin, Cheryl; Garnett, Errine T; Montgomery, Keri H; Mason, Wendi R; McKean, David F; Powers, Julia; Murphy, Elissa; Olson, Lana M; Choi, Leena; Cheng, Dong-Sheng; Blue, Elizabeth Marchani; Young, Lisa R; Lancaster, Lisa H; Steele, Mark P; Brown, Kevin K; Schwarz, Marvin I; Fingerlin, Tasha E; Schwartz, David A; Lawson, William E; Loyd, James E; Zhao, Zhongming; Phillips, John A; Blackwell, Timothy S
2015-03-15
Up to 20% of cases of idiopathic interstitial pneumonia cluster in families, comprising the syndrome of familial interstitial pneumonia (FIP); however, the genetic basis of FIP remains uncertain in most families. To determine if new disease-causing rare genetic variants could be identified using whole-exome sequencing of affected members from FIP families, providing additional insights into disease pathogenesis. Affected subjects from 25 kindreds were selected from an ongoing FIP registry for whole-exome sequencing from genomic DNA. Candidate rare variants were confirmed by Sanger sequencing, and cosegregation analysis was performed in families, followed by additional sequencing of affected individuals from another 163 kindreds. We identified a potentially damaging rare variant in the gene encoding for regulator of telomere elongation helicase 1 (RTEL1) that segregated with disease and was associated with very short telomeres in peripheral blood mononuclear cells in 1 of 25 families in our original whole-exome sequencing cohort. Evaluation of affected individuals in 163 additional kindreds revealed another eight families (4.7%) with heterozygous rare variants in RTEL1 that segregated with clinical FIP. Probands and unaffected carriers of these rare variants had short telomeres (<10% for age) in peripheral blood mononuclear cells and increased T-circle formation, suggesting impaired RTEL1 function. Rare loss-of-function variants in RTEL1 represent a newly defined genetic predisposition for FIP, supporting the importance of telomere-related pathways in pulmonary fibrosis.
Molecular Pathology of Anaplastic Thyroid Carcinomas: A Retrospective Study of 144 Cases.
Bonhomme, Benjamin; Godbert, Yann; Perot, Gaelle; Al Ghuzlan, Abir; Bardet, Stéphane; Belleannée, Geneviève; Crinière, Lise; Do Cao, Christine; Fouilloux, Geneviève; Guyetant, Serge; Kelly, Antony; Leboulleux, Sophie; Buffet, Camille; Leteurtre, Emmanuelle; Michels, Jean-Jacques; Tissier, Frédérique; Toubert, Marie-Elisabeth; Wassef, Michel; Pinard, Clémence; Hostein, Isabelle; Soubeyran, Isabelle
2017-05-01
Anaplastic thyroid carcinoma (ATC) is a rare tumor, with poorly defined oncogenic molecular mechanisms and limited therapeutic options contributing to its poor prognosis. The aims of this retrospective study were to determine the frequency of anaplastic lymphoma kinase (ALK) translocations and to identify the mutational profile of ATC including TERT promoter mutations. One hundred and forty-four ATC cases were collected from 10 centers that are a part of the national French network for management of refractory thyroid tumors. Fluorescence in situ hybridization analysis for ALK rearrangement was performed on tissue microarrays. A panel of 50 genes using next-generation sequencing and TERT promoter mutations using Sanger sequencing were also screened. Fluorescence in situ hybridization was interpretable for 90 (62.5%) cases. One (1.1%) case was positive for an ALK rearrangement with a borderline threshold (15% positive cells). Next-generation sequencing results were interpretable for 94 (65.3%) cases, and Sanger sequencing (TERT) for 98 (68.1%) cases. A total of 210 mutations (intronic and exonic) were identified. TP53 alterations were the most frequent (54.4%). Forty-three percent harbored a mutation in the (H-K-N)RAS genes, 13.8% a mutation in the BRAF gene (essentially p.V600E), 17% a PI3K-AKT pathway mutation, 6.4% both RAS and PI3K pathway mutations, and 4.3% both TP53 and PTEN mutations. Nearly 10% of the cases showed no mutations of the RAS, PI3K-AKT pathways, or TP53, with mutations of ALK, ATM, APC, CDKN2A, ERBB2, RET, or SMAD4, including mutations not yet described in thyroid tumors. Genes encoding potentially druggable targets included: mutations in the ATM gene in four (4.3%) cases, in ERBB2 in one (1.1%) case, in MET in one (1.1%) case, and in ALK in one (1.1%) case. A TERT promoter alteration was found in 53 (54.0%) cases, including 43 C228T and 10 C250T mutations. Three out of our cases did not harbor mutations in the panel of genes with therapeutic interest. This study confirms that ALK rearrangements in ATC are rare and that the mutational landscape of ATC is heterogeneous, with many genes implicated in the follicular epithelial cell dedifferentiation process. This may explain the limited effectiveness of targeted therapeutic options tested so far.
Hennebique, Aurélie; Bidart, Marie; Jarraud, Sophie; Beraud, Laëtitia; Schwebel, Carole; Maurin, Max; Boisset, Sandrine
2017-09-01
The emergence of fluoroquinolone (FQ)-resistant mutants of Legionella pneumophila in infected humans was previously reported using a next-generation DNA sequencing (NGS) approach. This finding could explain part of the therapeutic failures observed in legionellosis patients treated with these antibiotics. The aim of this study was to develop digital PCR (dPCR) assays allowing rapid and accurate detection and quantification of these resistant mutants in respiratory samples, especially when the proportion of mutants in a wild-type background is low. We designed three dPCRgyrA assays to detect and differentiate the wild-type and one of the three gyrA mutations previously described as associated with FQ resistance in L. pneumophila : at positions 248C→T (T83I), 259G→A (D87N), and 259G→C (D87H). To assess the performance of these assays, mixtures of FQ-resistant and -susceptible strains of L. pneumophila were analyzed, and the results were compared with those obtained with Sanger DNA sequencing and real-time quantitative PCR (qPCR) technologies. The dPCRgyrA assays were able to detect mutated gyrA sequences in the presence of wild-type sequences at up to 1:1,000 resistant/susceptible allele ratios. By comparison, Sanger DNA sequencing and qPCR were less sensitive, allowing the detection of gyrA mutants at up to 1:1 and 1:10 ratios, respectively. When testing 38 respiratory samples from 23 legionellosis patients (69.6% treated with an FQ), dPCRgyrA detected small amounts of gyrA mutants in four (10.5%) samples from three (13.0%) patients. These results demonstrate that dPCR is a highly sensitive alternative to quantify FQ resistance in L. pneumophila , and it could be used in clinical practice to detect patients that could be at higher risk of therapeutic failure. Copyright © 2017 American Society for Microbiology.
Variants in the PRPF8 Gene are Associated with Glaucoma.
Micheal, Shazia; Hogewind, Barend F; Khan, Muhammad Imran; Siddiqui, Sorath Noorani; Zafar, Saemah Nuzhat; Akhtar, Farah; Qamar, Raheel; Hoyng, Carel B; den Hollander, Anneke I
2018-05-01
Glaucoma is the cause of irreversible blindness worldwide. Mutations in six genes have been associated with juvenile- and adult-onset familial primary open angle glaucoma (POAG) prior to this report but they explain only a small proportion of the genetic load. The aim of the study is to identify the novel genetic cause of the POAG in the families with adult-onset glaucoma. Whole exome sequencing (WES) was performed on DNA of two affected individuals, and predicted pathogenic variants were evaluated for segregation in four affected and three unaffected Dutch family members by Sanger sequencing. We identified a pathogenic variant (p.Val956Gly) in the PRPF8 gene, which segregates with the disease in Dutch family. Targeted Sanger sequencing of PRPF8 in a panel of 40 POAG families (18 Pakistani and 22 Dutch) revealed two additional nonsynonymous variants (p.Pro13Leu and p.Met25Thr), which segregate with the disease in two other Pakistani families. Both variants were then analyzed in a case-control cohort consisting of Pakistani 320 POAG cases and 250 matched controls. The p.Pro13Leu and p.Met25Thr variants were identified in 14 and 20 cases, respectively, while they were not detected in controls (p values 0.0004 and 0.0001, respectively). Previously, PRPF8 mutations have been associated with autosomal dominant retinitis pigmentosa (RP). The PRPF8 variants associated with POAG are located at the N-terminus, while all RP-associated mutations cluster at the C-terminus, dictating a clear genotype-phenotype correlation.
Ciavarella, Michele; Miccoli, Sara; Prossomariti, Anna; Pippucci, Tommaso; Bonora, Elena; Buscherini, Francesco; Palombo, Flavia; Zuntini, Roberta; Balbi, Tiziana; Ceccarelli, Claudio; Bazzoli, Franco; Ricciardiello, Luigi; Turchetti, Daniela; Piazzi, Giulia
2018-03-01
Germline variants in the APC gene cause familial adenomatous polyposis. Inherited variants in MutYH, POLE, POLD1, NTHL1, and MSH3 genes and somatic APC mosaicism have been reported as alternative causes of polyposis. However, ~30-50% of cases of polyposis remain genetically unsolved. Thus, the aim of this study was to investigate the genetic causes of unexplained adenomatous polyposis. Eight sporadic cases with >20 adenomatous polyps by 35 years of age or >50 adenomatous polyps by 55 years of age, and no causative germline variants in APC and/or MutYH, were enrolled from a cohort of 56 subjects with adenomatous colorectal polyposis. APC gene mosaicism was investigated on DNA from colonic adenomas by Sanger sequencing or Whole Exome Sequencing (WES). Mosaicism extension to other tissues (peripheral blood, saliva, hair follicles) was evaluated using Sanger sequencing and/or digital PCR. APC second hit was investigated in adenomas from mosaic patients. WES was performed on DNA from peripheral blood to identify additional polyposis candidate variants. We identified APC mosaicism in 50% of patients. In three cases mosaicism was restricted to the colon, while in one it also extended to the duodenum and saliva. One patient without APC mosaicism, carrying an APC in-frame deletion of uncertain significance, was found to harbor rare germline variants in OGG1, POLQ, and EXO1 genes. In conclusion, our restrictive selection criteria improved the detection of mosaic APC patients. In addition, we showed for the first time that an oligogenic inheritance of rare variants might have a cooperative role in sporadic colorectal polyposis onset.
Akahori, Masakazu; Itabashi, Takeshi; Nishino, Jo; Yoshitake, Kazutoshi; Ikeo, Kazuho; Tsuneoka, Hiroshi
2014-01-01
Purpose. To investigate genetic and clinical features of patients with rhodopsin (RHO) mutations in two Japanese families with autosomal dominant retinitis pigmentosa (adRP). Methods. Whole-exome sequence analysis was performed in ten adRP families. Identified RHO mutations for the cosegregation analysis were confirmed by Sanger sequencing. Ophthalmic examinations were performed to evaluate the RP phenotypes. The impact of the RHO mutation on the rhodopsin conformation was examined by molecular modeling analysis. Results. In two adRP families, we identified two RHO mutations (c.377G>T (p.W126L) and c.1036G>C (p.A346P)), one of which was novel. Complete cosegregation was confirmed for each mutation exhibiting the RP phenotype in both families. Molecular modeling predicted that the novel mutation (p.W126L) might impair rhodopsin function by affecting its conformational transition in the light-adapted form. Clinical phenotypes showed that patients with p.W126L exhibited sector RP, whereas patients with p.A346P exhibited classic RP. Conclusions. Our findings demonstrated that the novel mutation (p.W126L) may be associated with the phenotype of sector RP. Identification of RHO mutations is a very useful tool for predicting disease severity and providing precise genetic counseling. PMID:25485142
Bernkopf, Marie; Webersinke, Gerald; Tongsook, Chanakan; Koyani, Chintan N.; Rafiq, Muhammad A.; Ayaz, Muhammad; Müller, Doris; Enzinger, Christian; Aslam, Muhammad; Naeem, Farooq; Schmidt, Kurt; Gruber, Karl; Speicher, Michael R.; Malle, Ernst; Macheroux, Peter; Ayub, Muhammad; Vincent, John B.; Windpassinger, Christian; Duba, Hans-Christoph
2014-01-01
We describe the characterization of a gene for mild nonsyndromic autosomal recessive intellectual disability (ID) in two unrelated families, one from Austria, the other from Pakistan. Genome-wide single nucleotide polymorphism microarray analysis enabled us to define a region of homozygosity by descent on chromosome 17q25. Whole-exome sequencing and analysis of this region in an affected individual from the Austrian family identified a 5 bp frameshifting deletion in the METTL23 gene. By means of Sanger sequencing of METTL23, a nonsense mutation was detected in a consanguineous ID family from Pakistan for which homozygosity-by-descent mapping had identified a region on 17q25. Both changes lead to truncation of the putative METTL23 protein, which disrupts the predicted catalytic domain and alters the cellular localization. 3D-modelling of the protein indicates that METTL23 is strongly predicted to function as an S-adenosyl-methionine (SAM)-dependent methyltransferase. Expression analysis of METTL23 indicated a strong association with heat shock proteins, which suggests that these may act as a putative substrate for methylation by METTL23. A number of methyltransferases have been described recently in association with ID. Disruption of METTL23 presented here supports the importance of methylation processes for intact neuronal function and brain development. PMID:24626631
Cho, Sun Young; Law, Chun Yiu; Ng, Kwok Leung; Lam, Ching Wan
2016-04-01
The diagnosis of cranial and nephrogenic diabetes insipidus (DI) can be clinically challenging. The application of molecular genetic analysis can help in resolving diagnostic difficulties. A 3 month-old boy presented with recurrent polyuria was admitted to Intensive Care Unit and was treated as DI. The patient also had a strong family history of polyuria affecting his maternal uncles. Molecular genetic analysis using Single Nucleotide Polymorphism (SNP) array detected a large deletion located at Xq28 region and the breakpoint was identified using PCR and Sanger sequencing. An 11,535 bp novel deletion affecting the entire APVR2 gene and the last intron and exon of the ARHGAP4 gene was confirmed. This large deletion is likely due to the 7-bp microhomology sequence at the junctions of both 5' and 3' breakpoints. No disease-causing mutation was identified for AQP2. We report a novel deletion in a Chinese patient with congenital nephrogenic DI. We suggested that patients with suspected congenital DI should undergo genetic analysis of AVPR2 and AQP2 genes. A definitive diagnosis can benefit patient by treatment of hydrochlorothiazide and amiloride and avoiding unnecessary investigations. Copyright © 2016 Elsevier B.V. All rights reserved.
Thermophilic growth and enzymatic thermostability are polyphyletic traits within Chaetomiaceae.
van den Brink, Joost; Facun, Kryss; de Vries, Michel; Stielow, J Benjamin
2015-12-01
Thermophilic fungi have the potential to produce industrial-relevant thermostable enzymes, in particular for the degradation of plant biomass. Sordariales is one of the few fungal orders containing several thermophilic taxa, of which many have been associated with the production of thermostable enzymes. The evolutionary affiliation of Sordariales fungi, especially between thermophiles and non-thermophilic relatives, is however poorly understood. Phylogenetic analysis within the current study was based on sequence data, derived from a traditional Sanger and highly multiplexed targeted next generation sequencing approach of 45 isolates. The inferred phylogeny and detailed growth analysis rendered the trait 'thermophily' as polyphyletic within Chaetomiaceae (Sordariales, Sordariomycetes), and characteristic to: Myceliophthora spp., Thielavia terrestris, Chaetomium thermophilum, and Mycothermus thermophilus. Compared to mesophiles, the isolates within thermophilic taxa produced enzyme mixtures with the highest thermostability of known cellulase activities. Temperature profiles of the enzyme activities correlated strongly with the optimal growth temperatures of the isolates but not with their phylogenetic relationships. This strong correlation between growth and enzyme characteristics indicated that detailed analysis of growth does give predictive information on enzyme physiology. The variation in growth and enzyme characteristics reveals these fungi as an excellent platform to better understand fungal thermophily and enzyme thermostability. Copyright © 2015 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Hiremath, Pavana J; Farmer, Andrew; Cannon, Steven B; Woodward, Jimmy; Kudapa, Himabindu; Tuteja, Reetu; Kumar, Ashish; Bhanuprakash, Amindala; Mulaosmanovic, Benjamin; Gujaria, Neha; Krishnamurthy, Laxmanan; Gaur, Pooran M; Kavikishor, Polavarapu B; Shah, Trushar; Srinivasan, Ramamurthy; Lohse, Marc; Xiao, Yongli; Town, Christopher D; Cook, Douglas R; May, Gregory D; Varshney, Rajeev K
2011-10-01
Chickpea (Cicer arietinum L.) is an important legume crop in the semi-arid regions of Asia and Africa. Gains in crop productivity have been low however, particularly because of biotic and abiotic stresses. To help enhance crop productivity using molecular breeding techniques, next generation sequencing technologies such as Roche/454 and Illumina/Solexa were used to determine the sequence of most gene transcripts and to identify drought-responsive genes and gene-based molecular markers. A total of 103,215 tentative unique sequences (TUSs) have been produced from 435,018 Roche/454 reads and 21,491 Sanger expressed sequence tags (ESTs). Putative functions were determined for 49,437 (47.8%) of the TUSs, and gene ontology assignments were determined for 20,634 (41.7%) of the TUSs. Comparison of the chickpea TUSs with the Medicago truncatula genome assembly (Mt 3.5.1 build) resulted in 42,141 aligned TUSs with putative gene structures (including 39,281 predicted intron/splice junctions). Alignment of ∼37 million Illumina/Solexa tags generated from drought-challenged root tissues of two chickpea genotypes against the TUSs identified 44,639 differentially expressed TUSs. The TUSs were also used to identify a diverse set of markers, including 728 simple sequence repeats (SSRs), 495 single nucleotide polymorphisms (SNPs), 387 conserved orthologous sequence (COS) markers, and 2088 intron-spanning region (ISR) markers. This resource will be useful for basic and applied research for genome analysis and crop improvement in chickpea. Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd. No claim to original US government works.
Large-scale collection of full-length cDNA and transcriptome analysis in Hevea brasiliensis
Makita, Yuko; Ng, Kiaw Kiaw; Veera Singham, G.; Kawashima, Mika; Hirakawa, Hideki; Sato, Shusei
2017-01-01
Abstract Natural rubber has unique physical properties that cannot be replaced by products from other latex-producing plants or petrochemically produced synthetic rubbers. Rubber from Hevea brasiliensis is the main commercial source for this natural rubber that has a cis-polyisoprene configuration. For sustainable production of enough rubber to meet demand elucidation of the molecular mechanisms involved in the production of latex is vital. To this end, we firstly constructed rubber full-length cDNA libraries of RRIM 600 cultivar and sequenced around 20,000 clones by the Sanger method and over 15,000 contigs by Illumina sequencer. With these data, we updated around 5,500 gene structures and newly annotated around 9,500 transcription start sites. Second, to elucidate the rubber biosynthetic pathways and their transcriptional regulation, we carried out tissue- and cultivar-specific RNA-Seq analysis. By using our recently published genome sequence, we confirmed the expression patterns of the rubber biosynthetic genes. Our data suggest that the cytoplasmic mevalonate (MVA) pathway is the main route for isoprenoid biosynthesis in latex production. In addition to the well-studied polymerization factors, we suggest that rubber elongation factor 8 (REF8) is a candidate factor in cis-polyisoprene biosynthesis. We have also identified 39 transcription factors that may be key regulators in latex production. Expression profile analysis using two additional cultivars, RRIM 901 and PB 350, via an RNA-Seq approach revealed possible expression differences between a high latex-yielding cultivar and a disease-resistant cultivar. PMID:28431015
Long, Xigui; Huang, Yanru; Tan, Hu; Li, Zhuo; Zhang, Rui; Linpeng, Siyuan; Lv, Weigang; Cao, Yingxi; Li, Haoxian; Liang, Desheng; Wu, Lingqian
2018-04-26
To detect the underlying pathogenesis of congenital cataract in a four-generation Chinese family. Whole-exome sequencing (WES) of family members (III:4, IV:4, and IV:6) was performed. Sanger sequencing and bioinformatics analysis were subsequently conducted. Full-length WT-MIP or K228fs-MIP fused to HA markers at the N-terminal was transfected into HeLa cells. Next, quantitative real-time PCR, western blotting and immunofluorescence confocal laser scanning were performed. The age of onset for nonsyndromic cataracts in male patients was by 1-year old, earlier than for female patients, who exhibited onset at adulthood. A novel c.682_683delAA (p.K228fs230X) mutation in main intrinsic protein (MIP) cosegregated with the cataract phenotype. The instability index and unfolded states for truncated MIP were predicted to increase by bioinformatics analysis. The mRNA transcription level of K228fs-MIP was reduced compared with that of WT-MIP, and K228fs-MIP protein expression was also lower than that of WT-MIP. Immunofluorescence images showed that WT-MIP principally localized to the plasma membrane, whereas the mutant protein was trapped in the cytoplasm. Our study generated genetic and primary functional evidence for a novel c.682_683delAA mutation in MIP that expands the variant spectrum of MIP and help us better understand the molecular basis of cataract.
Next-Generation Molecular Testing of Newborn Dried Blood Spots for Cystic Fibrosis.
Lefterova, Martina I; Shen, Peidong; Odegaard, Justin I; Fung, Eula; Chiang, Tsoyu; Peng, Gang; Davis, Ronald W; Wang, Wenyi; Kharrazi, Martin; Schrijver, Iris; Scharfe, Curt
2016-03-01
Newborn screening for cystic fibrosis enables early detection and management of this debilitating genetic disease. Implementing comprehensive CFTR analysis using Sanger sequencing as a component of confirmatory testing of all screen-positive newborns has remained impractical due to relatively lengthy turnaround times and high cost. Here, we describe CFseq, a highly sensitive, specific, rapid (<3 days), and cost-effective assay for comprehensive CFTR gene analysis from dried blood spots, the common newborn screening specimen. The unique design of CFseq integrates optimized dried blood spot sample processing, a novel multiplex amplification method from as little as 1 ng of genomic DNA, and multiplex next-generation sequencing of 96 samples in a single run to detect all relevant CFTR mutation types. Sequence data analysis utilizes publicly available software supplemented by an expert-curated compendium of >2000 CFTR variants. Validation studies across 190 dried blood spots demonstrated 100% sensitivity and a positive predictive value of 100% for single-nucleotide variants and insertions and deletions and complete concordance across the polymorphic poly-TG and consecutive poly-T tracts. Additionally, we accurately detected both a known exon 2,3 deletion and a previously undetected exon 22,23 deletion. CFseq is thus able to replace all existing CFTR molecular assays with a single robust, definitive assay at significant cost and time savings and could be adapted to high-throughput screening of other inherited conditions. Copyright © 2016 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Ataxia telangiectasia presenting as dopa-responsive cervical dystonia
Mohire, Mahavir D.; Schneider, Susanne A.; Stamelou, Maria; Wood, Nicholas W.; Bhatia, Kailash P.
2013-01-01
Objective: To identify the cause of cervical dopa-responsive dystonia (DRD) in a Muslim Indian family inherited in an apparently autosomal recessive fashion, as previously described in this journal. Methods: Previous testing for mutations in the genes known to cause DRD (GCH1, TH, and SPR) had been negative. Whole exome sequencing was performed on all 3 affected individuals for whom DNA was available to identify potentially pathogenic shared variants. Genotyping data obtained for all 3 affected individuals using the OmniExpress single nucleotide polymorphism chip (Illumina, San Diego, CA) were used to perform linkage analysis, autozygosity mapping, and copy number variation analysis. Sanger sequencing was used to confirm all variants. Results: After filtering of the variants, exome sequencing revealed 2 genes harboring potentially pathogenic compound heterozygous variants (ATM and LRRC16A). Of these, the variants in ATM segregated perfectly with the cervical DRD. Both mutations detected in ATM have been shown to be pathogenic, and α-fetoprotein, a marker of ataxia telangiectasia, was increased in all affected individuals. Conclusion: Biallelic mutations in ATM can cause DRD, and mutations in this gene should be considered in the differential diagnosis of unexplained DRD, particularly if the dystonia is cervical and if there is a recessive family history. ATM has previously been reported to cause isolated cervical dystonia, but never, to our knowledge, DRD. Individuals with dystonia related to ataxia telangiectasia may benefit from a trial of levodopa. PMID:23946315
Huang, Tina Y; Piunti, Andrea; Lulla, Rishi R; Qi, Jin; Horbinski, Craig M; Tomita, Tadanori; James, C David; Shilatifard, Ali; Saratsis, Amanda M
2017-04-17
Diffuse midline gliomas (including diffuse intrinsic pontine glioma, DIPG) are highly morbid glial neoplasms of the thalamus or brainstem that typically arise in young children and are not surgically resectable. These tumors are characterized by a high rate of histone H3 mutation, resulting in replacement of lysine 27 with methionine (K27M) in genes encoding H3 variants H3.3 (H3F3A) and H3.1 (HIST1H3B). Detection of these gain-of-function mutations has clinical utility, as they are associated with distinct tumor biology and clinical outcomes. Given the paucity of tumor tissue available for molecular analysis and relative morbidity of midline tumor biopsy, CSF-derived tumor DNA from patients with diffuse midline glioma may serve as a viable alternative for clinical detection of histone H3 mutation. We demonstrate the feasibility of two strategies to detect H3 mutations in CSF-derived tumor DNA from children with brain tumors (n = 11) via either targeted Sanger sequencing of H3F3A and HIST1H3B, or H3F3A c.83 A > T detection via nested PCR with mutation-specific primers. Of the six CSF specimens from children with diffuse midline glioma in our cohort, tumor DNA sufficient in quantity and quality for analysis was isolated from five (83%), with H3.3K27M detected in four (66.7%). In addition, H3.3G34V was identified in tumor DNA from a patient with supratentorial glioblastoma. Test sensitivity (87.5%) and specificity (100%) was validated via immunohistochemical staining and Sanger sequencing in available matched tumor tissue specimens (n = 8). Our results indicate that histone H3 gene mutation is detectable in CSF-derived tumor DNA from children with brain tumors, including diffuse midline glioma, and suggest the feasibility of "liquid biopsy" in lieu of, or to complement, tissue diagnosis, which may prove valuable for stratification to targeted therapies and monitoring treatment response.
Zhang, Chenzi; Yu, Wenjun; Wang, Lin; Zhao, Mingna; Guo, Qiaomei; Lv, Shaogang; Hu, Xiaomeng; Lou, Jiatao
2017-01-01
Introduction: Currently the majority of lung cancer patients are diagnosed as advanced diseases for no sensitive and specific biomarkers exist, noninvasive biomarkers with high sensitivity and specificity are urgently needed in lung cancer diagnosis. Bronchoscopy is a standard procedure of the diagnostic work-up of patients with suspected lung cancer despite of the limited diagnostic accuracy. Besides, epigenetic changes through DNA methylation play an important role in tumorigenesis. Thus, we examined the aberrant methylation of the SHOX2 and RASSF1A in bronchoalveolar lavage fluid (BALF) in comparing with conventional cytology examination and serum CEA in order to evaluate the new diagnostic method. Patients and Methods: BALF and serum samples were collected from 322 patients at the time of diagnosis, 284 of them were pathologically confirmed lung cancer, 35 were benign lung diseases and 3 were malignancies in other systems. For all of the 322 patients, the methylation status of the SHOX2 and RASSF1A gene were detected by a new RT-PCR platform and then confirmed by sanger sequencing. Serum CEA were detected using electrochemiluminescence immunoassay. Results: Profiling data showed the consistency of RT-PCR and sanger sequencing in detecting the methylation of the SHOX2 and RASSF1A. Besides, the combination of SHOX2 and RASSF1A methylation in BALF yielded a diagnostic sensitivity of 81.0% and specificity of 97.4%. When compared with established cytology examination (sensitivity: 68.3%, specificity: 97.4%) and serum biomarker carcinoembryonic antigen (CEA) (sensitivity: 30.6%, specificity: 100.0%), the SHOX2 and RASSF1A methylation panel showed the highest diagnostic efficiency. Notably, the combination of cytology and the SHOX2 and RASSF1A methylation panel could significantly improve the diagnostic efficacy. Conclusion: The methylation analysis of the SHOX2 and RASSF1A panel in BALF with RT-PCR achieved a satisfactory sensitivity and specificity in lung cancer diagnosis, especially in an early stage. It could be used as a promising noninvasive biomarker for auxiliary diagnosis of lung cancer. PMID:29151944
Eastman, Alexander W.; Yuan, Ze-Chun
2015-01-01
Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms has necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, unanimously located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the required effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provided a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing projects. PMID:25653642
ParticleCall: A particle filter for base calling in next-generation sequencing systems
2012-01-01
Background Next-generation sequencing systems are capable of rapid and cost-effective DNA sequencing, thus enabling routine sequencing tasks and taking us one step closer to personalized medicine. Accuracy and lengths of their reads, however, are yet to surpass those provided by the conventional Sanger sequencing method. This motivates the search for computationally efficient algorithms capable of reliable and accurate detection of the order of nucleotides in short DNA fragments from the acquired data. Results In this paper, we consider Illumina’s sequencing-by-synthesis platform which relies on reversible terminator chemistry and describe the acquired signal by reformulating its mathematical model as a Hidden Markov Model. Relying on this model and sequential Monte Carlo methods, we develop a parameter estimation and base calling scheme called ParticleCall. ParticleCall is tested on a data set obtained by sequencing phiX174 bacteriophage using Illumina’s Genome Analyzer II. The results show that the developed base calling scheme is significantly more computationally efficient than the best performing unsupervised method currently available, while achieving the same accuracy. Conclusions The proposed ParticleCall provides more accurate calls than the Illumina’s base calling algorithm, Bustard. At the same time, ParticleCall is significantly more computationally efficient than other recent schemes with similar performance, rendering it more feasible for high-throughput sequencing data analysis. Improvement of base calling accuracy will have immediate beneficial effects on the performance of downstream applications such as SNP and genotype calling. ParticleCall is freely available at https://sourceforge.net/projects/particlecall. PMID:22776067
Jia, Ying; Li, Xiaoge; Yang, Dong; Xu, Yi; Guo, Ying; Li, Xin
2018-01-01
The current study aims to identify the pathogenic sites in a core pedigree of Usher syndrome (USH). A core pedigree of USH was analyzed by whole exome sequencing (WES). Mutations were verified by polymerase chain reaction (PCR) amplification and Sanger sequencing. Two pathogenic variations (c.849+2T>C and c.5994G>A) in MYO7A were successfully identified and individually separated from parents. One variant (c.849+2T>C) was nonsense mutation, causing the protein terminated in advance, and the other one (c.5994G>A) located near the boundary of exon could cause aberrant splicing. This study provides a meaningful exploration for identification of clinical core genetic pedigrees. Copyright © 2017 Elsevier B.V. All rights reserved.
Creager, Hannah M; Becker, Ericka A; Sandman, Kelly K; Karl, Julie A; Lank, Simon M; Bimber, Benjamin N; Wiseman, Roger W; Hughes, Austin L; O'Connor, Shelby L; O'Connor, David H
2011-09-01
In recent years, the use of cynomolgus macaques in biomedical research has increased greatly. However, with the exception of the Mauritian population, knowledge of the MHC class II genetics of the species remains limited. Here, using cDNA cloning and Sanger sequencing, we identified 127 full-length MHC class II alleles in a group of 12 Indonesian and 12 Vietnamese cynomolgus macaques. Forty two of these were completely novel to cynomolgus macaques while 61 extended the sequence of previously identified alleles from partial to full length. This more than doubles the number of full-length cynomolgus macaque MHC class II alleles available in GenBank, significantly expanding the allele library for the species and laying the groundwork for future evolutionary and functional studies.
Holt, Kathryn E; Teo, Yik Y; Li, Heng; Nair, Satheesh; Dougan, Gordon; Wain, John; Parkhill, Julian
2009-08-15
Here, we present a method for estimating the frequencies of SNP alleles present within pooled samples of DNA using high-throughput short-read sequencing. The method was tested on real data from six strains of the highly monomorphic pathogen Salmonella Paratyphi A, sequenced individually and in a pool. A variety of read mapping and quality-weighting procedures were tested to determine the optimal parameters, which afforded > or =80% sensitivity of SNP detection and strong correlation with true SNP frequency at poolwide read depth of 40x, declining only slightly at read depths 20-40x. The method was implemented in Perl and relies on the opensource software Maq for read mapping and SNP calling. The Perl script is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/pools/.
Genetic heterogeneity of RPMI-8402, a T-acute lymphoblastic leukemia cell line
STOCZYNSKA-FIDELUS, EWELINA; PIASKOWSKI, SYLWESTER; PAWLOWSKA, ROZA; SZYBKA, MALGORZATA; PECIAK, JOANNA; HULAS-BIGOSZEWSKA, KRYSTYNA; WINIECKA-KLIMEK, MARTA; RIESKE, PIOTR
2016-01-01
Thorough examination of genetic heterogeneity of cell lines is uncommon. In order to address this issue, the present study analyzed the genetic heterogeneity of RPMI-8402, a T-acute lymphoblastic leukemia (T-ALL) cell line. For this purpose, traditional techniques such as fluorescence in situ hybridization and immunocytochemistry were used, in addition to more advanced techniques, including cell sorting, Sanger sequencing and massive parallel sequencing. The results indicated that the RPMI-8402 cell line consists of several genetically different cell subpopulations. Furthermore, massive parallel sequencing of RPMI-8402 provided insight into the evolution of T-ALL carcinogenesis, since this cell line exhibited the genetic heterogeneity typical of T-ALL. Therefore, the use of cell lines for drug testing in future studies may aid the progress of anticancer drug research. PMID:26870252
Multiplexed capillary electrophoresis system
Yeung, Edward S.; Li, Qingbo; Lu, Xiandan
1998-04-21
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification ("base calling") is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations.
Multiplexed capillary electrophoresis system
Yeung, Edward S.; Chang, Huan-Tsang; Fung, Eliza N.; Li, Qingbo; Lu, Xiandan
1996-12-10
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification ("base calling") is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations.
Rfam: Wikipedia, clans and the “decimal” release
Gardner, Paul P.; Daub, Jennifer; Tate, John; Moore, Benjamin L.; Osuch, Isabelle H.; Griffiths-Jones, Sam; Finn, Robert D.; Nawrocki, Eric P.; Kolbe, Diana L.; Eddy, Sean R.; Bateman, Alex
2011-01-01
The Rfam database aims to catalogue non-coding RNAs through the use of sequence alignments and statistical profile models known as covariance models. In this contribution, we discuss the pros and cons of using the online encyclopedia, Wikipedia, as a source of community-derived annotation. We discuss the addition of groupings of related RNA families into clans and new developments to the website. Rfam is available on the Web at http://rfam.sanger.ac.uk. PMID:21062808
Multiplexed capillary electrophoresis system
Yeung, E.S.; Li, Q.; Lu, X.
1998-04-21
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification (``base calling``) is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations. 19 figs.
Multiplexed capillary electrophoresis system
Yeung, E.S.; Chang, H.T.; Fung, E.N.; Li, Q.; Lu, X.
1996-12-10
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification (``base calling``) is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations. 19 figs.
Xu, Xiaojing; Yang, Xiaoxu; Wu, Qixi; Liu, Aijie; Yang, Xiaoling; Ye, Adam Yongxin; Huang, August Yue; Li, Jiarui; Wang, Meng; Yu, Zhe; Wang, Sheng; Zhang, Zhichao; Wu, Xiru
2015-01-01
ABSTRACT The majority of children with Dravet syndrome (DS) are caused by de novo SCN1A mutations. To investigate the origin of the mutations, we developed and applied a new method that combined deep amplicon resequencing with a Bayesian model to detect and quantify allelic fractions with improved sensitivity. Of 174 SCN1A mutations in DS probands which were considered “de novo” by Sanger sequencing, we identified 15 cases (8.6%) of parental mosaicism. We identified another five cases of parental mosaicism that were also detectable by Sanger sequencing. Fraction of mutant alleles in the 20 cases of parental mosaicism ranged from 1.1% to 32.6%. Thirteen (65% of 20) mutations originated paternally and seven (35% of 20) maternally. Twelve (60% of 20) mosaic parents did not have any epileptic symptoms. Their mutant allelic fractions were significantly lower than those in mosaic parents with epileptic symptoms (P = 0.016). We identified mosaicism with varied allelic fractions in blood, saliva, urine, hair follicle, oral epithelium, and semen, demonstrating that postzygotic mutations could affect multiple somatic cells as well as germ cells. Our results suggest that more sensitive tools for detecting low‐level mosaicism in parents of families with seemingly “de novo” mutations will allow for better informed genetic counseling. PMID:26096185
Abal-Fabeiro, J L; Maside, X; Llovo, J; Bello, X; Torres, M; Treviño, M; Moldes, L; Muñoz, A; Carracedo, A; Bartolomé, C
2014-04-01
The epidemiological study of human cryptosporidiosis requires the characterization of species and subtypes involved in human disease in large sample collections. Molecular genotyping is costly and time-consuming, making the implementation of low-cost, highly efficient technologies increasingly necessary. Here, we designed a protocol based on MALDI-TOF mass spectrometry for the high-throughput genotyping of a panel of 55 single nucleotide variants (SNVs) selected as markers for the identification of common gp60 subtypes of four Cryptosporidium species that infect humans. The method was applied to a panel of 608 human and 63 bovine isolates and the results were compared with control samples typed by Sanger sequencing. The method allowed the identification of species in 610 specimens (90·9%) and gp60 subtype in 605 (90·2%). It displayed excellent performance, with sensitivity and specificity values of 87·3 and 98·0%, respectively. Up to nine genotypes from four different Cryptosporidium species (C. hominis, C. parvum, C. meleagridis and C. felis) were detected in humans; the most common ones were C. hominis subtype Ib, and C. parvum IIa (61·3 and 28·3%, respectively). 96·5% of the bovine samples were typed as IIa. The method performs as well as the widely used Sanger sequencing and is more cost-effective and less time consuming.
Kudapa, Himabindu; Bharti, Arvind K; Cannon, Steven B; Farmer, Andrew D; Mulaosmanovic, Benjamin; Kramer, Robin; Bohra, Abhishek; Weeks, Nathan T; Crow, John A; Tuteja, Reetu; Shah, Trushar; Dutta, Sutapa; Gupta, Deepak K; Singh, Archana; Gaikwad, Kishor; Sharma, Tilak R; May, Gregory D; Singh, Nagendra K; Varshney, Rajeev K
2012-09-01
A comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes. The resultant transcriptome assembly, referred to as CcTA v2, comprised 21 434 transcript assembly contigs (TACs) with an N50 of 1510 bp, the largest one being ~8 kb. Of the 21 434 TACs, 16 622 (77.5%) could be mapped on to the soybean genome build 1.0.9 under fairly stringent alignment parameters. Based on knowledge of intron junctions, 10 009 primer pairs were designed from 5033 TACs for amplifying intron spanning regions (ISRs). By using in silico mapping of BAC-end-derived SSR loci of pigeonpea on the soybean genome as a reference, putative mapping positions at the chromosome level were predicted for 6284 ISR markers, covering all 11 pigeonpea chromosomes. A subset of 128 ISR markers were analyzed on a set of eight genotypes. While 116 markers were validated, 70 markers showed one to three alleles, with an average of 0.16 polymorphism information content (PIC) value. In summary, the CcTA v2 transcript assembly and ISR markers will serve as a useful resource to accelerate genetic research and breeding applications in pigeonpea.
Shin, Saeam; Kim, Juwon; Kim, Yoonjung; Cho, Sun-Mi; Lee, Kyung-A
2017-10-26
EGFR mutation is an emerging biomarker for treatment selection in non-small-cell lung cancer (NSCLC) patients. However, optimal mutation detection is hindered by complications associated with the biopsy procedure, tumor heterogeneity and limited sensitivity of test methodology. In this study, we evaluated the diagnostic utility of real-time PCR using malignant pleural effusion samples. A total of 77 pleural fluid samples from 77 NSCLC patients were tested using the cobas EGFR mutation test (Roche Molecular Systems). Pleural fluid was centrifuged, and separated cell pellets and supernatants were tested in parallel. Results were compared with Sanger sequencing and/or peptide nucleic acid (PNA)-mediated PCR clamping of matched tumor tissue or pleural fluid samples. All samples showed valid real-time PCR results in one or more DNA samples extracted from cell pellets and supernatants. Compared with other molecular methods, the sensitivity of real-time PCR method was 100%. Concordance rate of real-time PCR and Sanger sequencing plus PNA-mediated PCR clamping was 98.7%. We have confirmed that real-time PCR using pleural fluid had a high concordance rate compared to conventional methods, with no failed samples. Our data demonstrated that the parallel real-time PCR testing using supernatant and cell pellet could offer reliable and robust surrogate strategy when tissue is not available.
Genetic Characterization of a Panel of Diverse HIV-1 Isolates at Seven International Sites
Chen, Yue; Sanchez, Ana M.; Sabino, Ester; Hunt, Gillian; Ledwaba, Johanna; Hackett, John; Swanson, Priscilla; Hewlett, Indira; Ragupathy, Viswanath; Vikram Vemula, Sai; Zeng, Peibin; Tee, Kok-Keng; Chow, Wei Zhen; Ji, Hezhao; Sandstrom, Paul; Denny, Thomas N.; Busch, Michael P.; Gao, Feng
2016-01-01
HIV-1 subtypes and drug resistance are routinely tested by many international surveillance groups. However, results from different sites often vary. A systematic comparison of results from multiple sites is needed to determine whether a standardized protocol is required for consistent and accurate data analysis. A panel of well-characterized HIV-1 isolates (N = 50) from the External Quality Assurance Program Oversight Laboratory (EQAPOL) was assembled for evaluation at seven international sites. This virus panel included seven subtypes, six circulating recombinant forms (CRFs), nine unique recombinant forms (URFs) and three group O viruses. Seven viruses contained 10 major drug resistance mutations (DRMs). HIV-1 isolates were prepared at a concentration of 107 copies/ml and compiled into blinded panels. Subtypes and DRMs were determined with partial or full pol gene sequences by conventional Sanger sequencing and/or Next Generation Sequencing (NGS). Subtype and DRM results were reported and decoded for comparison with full-length genome sequences generated by EQAPOL. The partial pol gene was amplified by RT-PCR and sequenced for 89.4%-100% of group M viruses at six sites. Subtyping results of majority of the viruses (83%-97.9%) were correctly determined for the partial pol sequences. All 10 major DRMs in seven isolates were detected at these six sites. The complete pol gene sequence was also obtained by NGS at one site. However, this method missed six group M viruses and sequences contained host chromosome fragments. Three group O viruses were only characterized with additional group O-specific RT-PCR primers employed by one site. These results indicate that PCR protocols and subtyping tools should be standardized to efficiently amplify diverse viruses and more consistently assign virus genotypes, which is critical for accurate global subtype and drug resistance surveillance. Targeted NGS analysis of partial pol sequences can serve as an alternative approach, especially for detection of low-abundance DRMs. PMID:27314585
Genetic Characterization of a Panel of Diverse HIV-1 Isolates at Seven International Sites.
Hora, Bhavna; Keating, Sheila M; Chen, Yue; Sanchez, Ana M; Sabino, Ester; Hunt, Gillian; Ledwaba, Johanna; Hackett, John; Swanson, Priscilla; Hewlett, Indira; Ragupathy, Viswanath; Vikram Vemula, Sai; Zeng, Peibin; Tee, Kok-Keng; Chow, Wei Zhen; Ji, Hezhao; Sandstrom, Paul; Denny, Thomas N; Busch, Michael P; Gao, Feng
2016-01-01
HIV-1 subtypes and drug resistance are routinely tested by many international surveillance groups. However, results from different sites often vary. A systematic comparison of results from multiple sites is needed to determine whether a standardized protocol is required for consistent and accurate data analysis. A panel of well-characterized HIV-1 isolates (N = 50) from the External Quality Assurance Program Oversight Laboratory (EQAPOL) was assembled for evaluation at seven international sites. This virus panel included seven subtypes, six circulating recombinant forms (CRFs), nine unique recombinant forms (URFs) and three group O viruses. Seven viruses contained 10 major drug resistance mutations (DRMs). HIV-1 isolates were prepared at a concentration of 107 copies/ml and compiled into blinded panels. Subtypes and DRMs were determined with partial or full pol gene sequences by conventional Sanger sequencing and/or Next Generation Sequencing (NGS). Subtype and DRM results were reported and decoded for comparison with full-length genome sequences generated by EQAPOL. The partial pol gene was amplified by RT-PCR and sequenced for 89.4%-100% of group M viruses at six sites. Subtyping results of majority of the viruses (83%-97.9%) were correctly determined for the partial pol sequences. All 10 major DRMs in seven isolates were detected at these six sites. The complete pol gene sequence was also obtained by NGS at one site. However, this method missed six group M viruses and sequences contained host chromosome fragments. Three group O viruses were only characterized with additional group O-specific RT-PCR primers employed by one site. These results indicate that PCR protocols and subtyping tools should be standardized to efficiently amplify diverse viruses and more consistently assign virus genotypes, which is critical for accurate global subtype and drug resistance surveillance. Targeted NGS analysis of partial pol sequences can serve as an alternative approach, especially for detection of low-abundance DRMs.
HIV-1 transmission linkage in an HIV-1 prevention clinical trial
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leitner, Thomas; Campbell, Mary S; Mullins, James I
2009-01-01
HIV-1 sequencing has been used extensively in epidemiologic and forensic studies to investigate patterns of HIV-1 transmission. However, the criteria for establishing genetic linkage between HIV-1 strains in HIV-1 prevention trials have not been formalized. The Partners in Prevention HSV/HIV Transmission Study (ClinicaITrials.gov NCT00194519) enrolled 3408 HIV-1 serodiscordant heterosexual African couples to determine the efficacy of genital herpes suppression with acyclovir in reducing HIV-1 transmission. The trial analysis required laboratory confirmation of HIV-1 linkage between enrolled partners in couples in which seroconversion occurred. Here we describe the process and results from HIV-1 sequencing studies used to perform transmission linkage determinationmore » in this clinical trial. Consensus Sanger sequencing of env (C2-V3-C3) and gag (p17-p24) genes was performed on plasma HIV-1 RNA from both partners within 3 months of seroconversion; env single molecule or pyrosequencing was also performed in some cases. For linkage, we required monophyletic clustering between HIV-1 sequences in the transmitting and seroconverting partners, and developed a Bayesian algorithm using genetic distances to evaluate the posterior probability of linkage of participants sequences. Adjudicators classified transmissions as linked, unlinked, or indeterminate. Among 151 seroconversion events, we found 108 (71.5%) linked, 40 (26.5%) unlinked, and 3 (2.0%) to have indeterminate transmissions. Nine (8.3%) were linked by consensus gag sequencing only and 8 (7.4%) required deep sequencing of env. In this first use of HIV-1 sequencing to establish endpoints in a large clinical trial, more than one-fourth of transmissions were unlinked to the enrolled partner, illustrating the relevance of these methods in the design of future HIV-1 prevention trials in serodiscordant couples. A hierarchy of sequencing techniques, analysis methods, and expert adjudication contributed to the linkage determination process.« less
Crispo, M; Mulet, A P; Tesson, L; Barrera, N; Cuadro, F; dos Santos-Neto, P C; Nguyen, T H; Crénéguy, A; Brusselle, L; Anegón, I; Menchaca, A
2015-01-01
While CRISPR/Cas9 technology has proven to be a valuable system to generate gene-targeted modified animals in several species, this tool has been scarcely reported in farm animals. Myostatin is encoded by MSTN gene involved in the inhibition of muscle differentiation and growth. We determined the efficiency of the CRISPR/Cas9 system to edit MSTN in sheep and generate knock-out (KO) animals with the aim to promote muscle development and body growth. We generated CRISPR/Cas9 mRNAs specific for ovine MSTN and microinjected them into the cytoplasm of ovine zygotes. When embryo development of CRISPR/Cas9 microinjected zygotes (n = 216) was compared with buffer injected embryos (n = 183) and non microinjected embryos (n = 173), cleavage rate was lower for both microinjected groups (P<0.05) and neither was affected by CRISPR/Cas9 content in the injected medium. Embryo development to blastocyst was not affected by microinjection and was similar among the experimental groups. From 20 embryos analyzed by Sanger sequencing, ten were mutant (heterozygous or mosaic; 50% efficiency). To obtain live MSTN KO lambs, 53 blastocysts produced after zygote CRISPR/Cas9 microinjection were transferred to 29 recipient females resulting in 65.5% (19/29) of pregnant ewes and 41.5% (22/53) of newborns. From 22 born lambs analyzed by T7EI and Sanger sequencing, ten showed indel mutations at MSTN gene. Eight showed mutations in both alleles and five of them were homozygous for indels generating out-of frame mutations that resulted in premature stop codons. Western blot analysis of homozygous KO founders confirmed the absence of myostatin, showing heavier body weight than wild type counterparts. In conclusion, our results demonstrate that CRISPR/Cas9 system was a very efficient tool to generate gene KO sheep. This technology is quick and easy to perform and less expensive than previous techniques, and can be applied to obtain genetically modified animal models of interest for biomedicine and livestock.
DNA demethylation activates genes in seed maternal integument development in rice (Oryza sativa L.).
Wang, Yifeng; Lin, Haiyan; Tong, Xiaohong; Hou, Yuxuan; Chang, Yuxiao; Zhang, Jian
2017-11-01
DNA methylation is an important epigenetic modification that regulates various plant developmental processes. Rice seed integument determines the seed size. However, the role of DNA methylation in its development remains largely unknown. Here, we report the first dynamic DNA methylomic profiling of rice maternal integument before and after pollination by using a whole-genome bisulfite deep sequencing approach. Analysis of DNA methylation patterns identified 4238 differentially methylated regions underpin 4112 differentially methylated genes, including GW2, DEP1, RGB1 and numerous other regulators participated in maternal integument development. Bisulfite sanger sequencing and qRT-PCR of six differentially methylated genes revealed extensive occurrence of DNA hypomethylation triggered by double fertilization at IAP compared with IBP, suggesting that DNA demethylation might be a key mechanism to activate numerous maternal controlling genes. These results presented here not only greatly expanded the rice methylome dataset, but also shed novel insight into the regulatory roles of DNA methylation in rice seed maternal integument development. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Naseer, Muhammad Imran; Rasool, Mahmood; Jan, Mohammed M; Chaudhary, Adeel G; Pushparaj, Peter Natesan; Abuzenadah, Adel M; Al-Qahtani, Mohammad H
2016-12-15
PGAP2 (Post-GPI Attachment to Proteins 2) gene is involved in lipid remodeling steps of Glycosylphosphatidylinositol (GPI)-anchor maturation. At the surface of the cell this gene is required for proper expression of GPI-anchored proteins. Hyperphosphatasia with mental retardation syndrome-3 is an autosomal recessive disorder usually characterized by severe mental retardation. Mutations in the PGAP2 gene cause hyperphosphatasia mental retardation syndrome-3. We have identified a large consanguineous family from Saudi origin segregating developmental delay, intellectual disability, epilepsy and microcephaly. Whole exome sequencing with 100× coverage was performed on two affected siblings of the family. Data analysis in the patient revealed a novel missense mutation c.191C>T in PGAP2 gene resulting in Alanine to Valine substitution (Ala64Val). The mutation was reconfirmed and validated by subsequent Sanger sequencing method. The mutation was ruled out in 100 unrelated healthy controls. We suggest that this pathogenic mutation disrupts the proper function of the gene proteins resulting in the disease state. Copyright © 2016 Elsevier B.V. All rights reserved.
Whole-genome analysis of a patient with early-stage small-cell lung cancer.
Han, J-Y; Lee, Y-S; Kim, B C; Lee, G K; Lee, S; Kim, E-H; Kim, H-M; Bhak, J
2014-12-01
We performed whole-genome sequencing (WGS) of a case of early-stage small-cell lung cancer (SCLC) to analyze the genomic features. WGS revealed a lot of single-nucleotide variations (SNVs), small insertion/deletions and chromosomal abnormality. Chromosomes 4p, 5q, 13q, 15q, 17p and 22q contained many block deletions. Especially, copy loss was observed in tumor suppressor genes RB1 and TP53, and copy gain in oncogene hTERT. Somatic mutations were found in TP53 and CREBBP. Novel nonsynonymous (ns) SNVs in C6ORF103 and SLC5A4 genes were also found. Sanger sequencing of the SLC5A4 gene in 23 independent SCLC samples showed another nsSNV in the SLC5A4 gene, indicating that nsSNVs in the SLC5A4 gene are recurrent in SCLC. WGS of an early-stage SCLC identified novel recurrent mutations and validated known variations, including copy number variations. These findings provide insight into the genomic landscape contributing to SCLC development.
Supply, Philip; Marceau, Michael; Mangenot, Sophie; Roche, David; Rouanet, Carine; Khanna, Varun; Majlessi, Laleh; Criscuolo, Alexis; Tap, Julien; Pawlik, Alexandre; Fiette, Laurence; Orgeur, Mickael; Fabre, Michel; Parmentier, Cécile; Frigui, Wafa; Simeone, Roxane; Boritsch, Eva C.; Debrie, Anne-Sophie; Willery, Eve; Walker, Danielle; Quail, Michael A.; Ma, Laurence; Bouchier, Christiane; Salvignol, Grégory; Sayes, Fadel; Cascioferro, Alessandro; Seemann, Torsten; Barbe, Valérie; Locht, Camille; Gutierrez, Maria-Cristina; Leclerc, Claude; Bentley, Stephen; Stinear, Timothy P.; Brisse, Sylvain; Médigue, Claudine; Parkhill, Julian; Cruveiller, Stéphane; Brosch, Roland
2013-01-01
Global spread and genetic monomorphism are hallmarks of Mycobacterium tuberculosis, the agent of human tuberculosis. In contrast, Mycobacterium canettii, and related tubercle bacilli that also cause human tuberculosis and exhibit unusual smooth colony morphology, are restricted to East-Africa. Here, we sequenced and analyzed the genomes of five representative strains of smooth tubercle bacilli (STB) using Sanger (4-5x coverage), 454/Roche (13-18x coverage) and/or Illumina DNA sequencing (45-105x coverage). We show that STB are highly recombinogenic and evolutionary early-branching, with larger genome sizes, 25-fold more SNPs, fewer molecular scars and distinct CRISPR-Cas systems relative to M. tuberculosis. Despite the differences, all tuberculosis-causing mycobacteria share a highly conserved core genome. Mouse-infection experiments revealed that STB are less persistent and virulent than M. tuberculosis. We conclude that M. tuberculosis emerged from an ancestral, STB-like pool of mycobacteria by gain of persistence and virulence mechanisms and we provide genome-wide insights into the molecular events involved. PMID:23291586
Boileau, Catherine; Guo, Dong-Chuan; Hanna, Nadine; Regalado, Ellen S; Detaint, Delphine; Gong, Limin; Varret, Mathilde; Prakash, Siddharth K; Li, Alexander H; d'Indy, Hyacintha; Braverman, Alan C; Grandchamp, Bernard; Kwartler, Callie S; Gouya, Laurent; Santos-Cortez, Regie Lyn P; Abifadel, Marianne; Leal, Suzanne M; Muti, Christine; Shendure, Jay; Gross, Marie-Sylvie; Rieder, Mark J; Vahanian, Alec; Nickerson, Deborah A; Michel, Jean Baptiste; Jondeau, Guillaume; Milewicz, Dianna M
2012-07-08
A predisposition for thoracic aortic aneurysms leading to acute aortic dissections can be inherited in families in an autosomal dominant manner. Genome-wide linkage analysis of two large unrelated families with thoracic aortic disease followed by whole-exome sequencing of affected relatives identified causative mutations in TGFB2. These mutations-a frameshift mutation in exon 6 and a nonsense mutation in exon 4-segregated with disease with a combined logarithm of odds (LOD) score of 7.7. Sanger sequencing of 276 probands from families with inherited thoracic aortic disease identified 2 additional TGFB2 mutations. TGFB2 encodes transforming growth factor (TGF)-β2, and the mutations are predicted to cause haploinsufficiency for TGFB2; however, aortic tissue from cases paradoxically shows increased TGF-β2 expression and immunostaining. Thus, haploinsufficiency for TGFB2 predisposes to thoracic aortic disease, suggesting that the initial pathway driving disease is decreased cellular TGF-β2 levels leading to a secondary increase in TGF-β2 production in the diseased aorta.
Barclay, Sarah F; Rand, Casey M; Borch, Lauren A; Nguyen, Lisa; Gray, Paul A; Gibson, William T; Wilson, Richard J A; Gordon, Paul M K; Aung, Zaw; Berry-Kravis, Elizabeth M; Ize-Ludlow, Diego; Weese-Mayer, Debra E; Bech-Hansen, N Torben
2015-08-25
Rapid-onset Obesity with Hypothalamic Dysfunction, Hypoventilation, and Autonomic Dysregulation (ROHHAD) is thought to be a genetic disease caused by de novo mutations, though causative mutations have yet to be identified. We searched for de novo coding mutations among a carefully-diagnosed and clinically homogeneous cohort of 35 ROHHAD patients. We sequenced the exomes of seven ROHHAD trios, plus tumours from four of these patients and the unaffected monozygotic (MZ) twin of one (discovery cohort), to identify constitutional and somatic de novo sequence variants. We further analyzed this exome data to search for candidate genes under autosomal dominant and recessive models, and to identify structural variations. Candidate genes were tested by exome or Sanger sequencing in a replication cohort of 28 ROHHAD singletons. The analysis of the trio-based exomes found 13 de novo variants. However, no two patients had de novo variants in the same gene, and additional patient exomes and mutation analysis in the replication cohort did not provide strong genetic evidence to implicate any of these sequence variants in ROHHAD. Somatic comparisons revealed no coding differences between any blood and tumour samples, or between the two discordant MZ twins. Neither autosomal dominant nor recessive analysis yielded candidate genes for ROHHAD, and we did not identify any potentially causative structural variations. Clinical exome sequencing is highly unlikely to be a useful diagnostic test in patients with true ROHHAD. As ROHHAD has a high risk for fatality if not properly managed, it remains imperative to expand the search for non-exomic genetic risk factors, as well as to investigate other possible mechanisms of disease. In so doing, we will be able to confirm objectively the ROHHAD diagnosis and to contribute to our understanding of obesity, respiratory control, hypothalamic function, and autonomic regulation.
Weyhrauch, Derek L; Ye, Dan; Boczek, Nicole J; Tester, David J; Gavrilova, Ralitza H; Patterson, Marc C; Wieben, Eric D; Ackerman, Michael J
2016-02-01
A 4-year-old boy born at 37 weeks' gestation with intrauterine growth retardation presented with developmental delay with pronounced language and gross motor delay, axial hypotonia, and dynamic hypertonia of the extremities. Investigations including the Minnesota Newborn Screen, thyroid stimulating hormone/thyroxin, and inborn errors of metabolism screening were negative. Cerebral magnetic resonance imaging and spectroscopy were normal. Genetic testing was negative for coagulopathy, Smith-Lemli-Opitz, fragile X, and Prader-Willi/Angelman syndromes. Whole genome array analysis was unremarkable. Whole exome sequencing was performed through a commercial testing laboratory to elucidate the underlying etiology for the child's presentation. A de novo mutation was hypothesized. In attempt to establish pathogenicity of our candidate variant, cellular electrophysiologic functional analysis of the putative de novo mutation was performed using patch-clamp technology. Whole exome sequencing revealed a p.P1353L variant in the CACNA1A gene, which encodes for the α1-subunit of the brain-specific P/Q-type calcium channel (CaV2.1). This presynaptic high-voltage-gated channel couples neuronal excitation to the vesicular release of neurotransmitter and is implicated in several neurologic disorders. DNA Sanger sequencing confirmed that the de novo mutation was absent in both parents and present in the child only. Electrophysiologic analysis of P1353L-CACNA1A demonstrated near complete loss of function, with a 95% reduction in peak current density. Whole exome sequencing coupled with cellular electrophysiologic functional analysis of a de novoCACNA1A missense mutation has elucidated the probable underlying pathophysiologic mechanism responsible for the child's phenotype. Genetic testing of CACNA1A in patients with congenital hypotonia and developmental delay may be warranted. Copyright © 2016. Published by Elsevier Inc.
Szopa, Magdalena; Ludwig-Galezowska, Agnieszka H; Radkowski, Piotr; Skupien, Jan; Machlowska, Julita; Klupa, Tomasz; Wolkow, Pawel; Borowiec, Maciej; Mlynarski, Wojciech; Malecki, Maciej T
2016-02-01
Until now only a few families with early onset autosomal diabetes due to the NEUROD1 gene mutations have been identified. Moreover, only some of them meet strict MODY (maturity-onset diabetes of the young) criteria. Next-generation sequencing (NGS) provides an opportunity to detect more pathogenic mutations in this gene. Here, we evaluated the segregation of the Arg103Pro mutation in the NEUROD1 gene in a pedigree in which it was detected, and described the clinical characteristics of the mutation carriers. We included 156 diabetic probands of MODY families, among them 52 patients earlier tested for GCK-MODY and/or HNF1A-MODY by Sanger sequencing with negative results. Genetic testing was performed by targeted NGS sequencing using a panel of 28 monogenic diabetes genes. As detected by NGS, one patient had the missense Arg103Pro (CGC/CCC) mutation in the gene NEUROD1 changing the amino-acid structure of the DNA binding domain of this transcription factor. We confirmed this sequence difference by Sanger sequencing. This family had previously been tested with negative results for HNF1A gene mutations. 17 additional members of this family were invited for further testing. We confirmed the presence of the mutation in 11 subjects. Seven adult mutation carriers (all but one) from three generations had been already diagnosed with diabetes. There were 3 individuals with the Arg103Pro mutation diagnosed before the age of 30 years in the family. The range of age of the four unaffected mutation carriers (3 minors and 1 adult) was 3-48 years. Interestingly, one mutation carrier had a history of transient neonatal hypoglycemia, of which the clinical course resembled episodes typical for HNF4A-MODY. We report a family with autosomal dominant diabetes related to a new NEUROD1 mutation, one of very few meeting MODY criteria. The use of the NGS method will facilitate identification of more families with rare forms of MODY. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Le, Thuy; Chiarella, Jennifer; Simen, Birgitte B; Hanczaruk, Bozena; Egholm, Michael; Landry, Marie L; Dieckhaus, Kevin; Rosen, Marc I; Kozal, Michael J
2009-06-29
It is largely unknown how frequently low-abundance HIV drug-resistant variants at levels under limit of detection of conventional genotyping (<20% of quasi-species) are present in antiretroviral-experienced persons experiencing virologic failure. Further, the clinical implications of low-abundance drug-resistant variants at time of virologic failure are unknown. Plasma samples from 22 antiretroviral-experienced subjects collected at time of virologic failure (viral load 1380 to 304,000 copies/mL) were obtained from a specimen bank (from 2004-2007). The prevalence and profile of drug-resistant mutations were determined using Sanger sequencing and ultra-deep pyrosequencing. Genotypes were interpreted using Stanford HIV database algorithm. Antiretroviral treatment histories were obtained by chart review and correlated with drug-resistant mutations. Low-abundance drug-resistant mutations were detected in all 22 subjects by deep sequencing and only in 3 subjects by Sanger sequencing. In total they accounted for 90 of 247 mutations (36%) detected by deep sequencing; the majority of these (95%) were not detected by standard genotyping. A mean of 4 additional mutations per subject were detected by deep sequencing (p<0.0001, 95%CI: 2.85-5.53). The additional low-abundance drug-resistant mutations increased a subject's genotypic resistance to one or more antiretrovirals in 17 of 22 subjects (77%). When correlated with subjects' antiretroviral treatment histories, the additional low-abundance drug-resistant mutations correlated with the failing antiretroviral drugs in 21% subjects and correlated with historical antiretroviral use in 79% subjects (OR, 13.73; 95% CI, 2.5-74.3, p = 0.0016). Low-abundance HIV drug-resistant mutations in antiretroviral-experienced subjects at time of virologic failure can increase a subject's overall burden of resistance, yet commonly go unrecognized by conventional genotyping. The majority of unrecognized resistant mutations correlate with historical antiretroviral use. Ultra-deep sequencing can provide important historical resistance information for clinicians when planning subsequent antiretroviral regimens for highly treatment-experienced patients, particularly when their prior treatment histories and longitudinal genotypes are not available.
Le, Thuy; Chiarella, Jennifer; Simen, Birgitte B.; Hanczaruk, Bozena; Egholm, Michael; Landry, Marie L.; Dieckhaus, Kevin; Rosen, Marc I.; Kozal, Michael J.
2009-01-01
Background It is largely unknown how frequently low-abundance HIV drug-resistant variants at levels under limit of detection of conventional genotyping (<20% of quasi-species) are present in antiretroviral-experienced persons experiencing virologic failure. Further, the clinical implications of low-abundance drug-resistant variants at time of virologic failure are unknown. Methodology/Principal Findings Plasma samples from 22 antiretroviral-experienced subjects collected at time of virologic failure (viral load 1380 to 304,000 copies/mL) were obtained from a specimen bank (from 2004–2007). The prevalence and profile of drug-resistant mutations were determined using Sanger sequencing and ultra-deep pyrosequencing. Genotypes were interpreted using Stanford HIV database algorithm. Antiretroviral treatment histories were obtained by chart review and correlated with drug-resistant mutations. Low-abundance drug-resistant mutations were detected in all 22 subjects by deep sequencing and only in 3 subjects by Sanger sequencing. In total they accounted for 90 of 247 mutations (36%) detected by deep sequencing; the majority of these (95%) were not detected by standard genotyping. A mean of 4 additional mutations per subject were detected by deep sequencing (p<0.0001, 95%CI: 2.85–5.53). The additional low-abundance drug-resistant mutations increased a subject's genotypic resistance to one or more antiretrovirals in 17 of 22 subjects (77%). When correlated with subjects' antiretroviral treatment histories, the additional low-abundance drug-resistant mutations correlated with the failing antiretroviral drugs in 21% subjects and correlated with historical antiretroviral use in 79% subjects (OR, 13.73; 95% CI, 2.5–74.3, p = 0.0016). Conclusions/Significance Low-abundance HIV drug-resistant mutations in antiretroviral-experienced subjects at time of virologic failure can increase a subject's overall burden of resistance, yet commonly go unrecognized by conventional genotyping. The majority of unrecognized resistant mutations correlate with historical antiretroviral use. Ultra-deep sequencing can provide important historical resistance information for clinicians when planning subsequent antiretroviral regimens for highly treatment-experienced patients, particularly when their prior treatment histories and longitudinal genotypes are not available. PMID:19562031
FAST: FAST Analysis of Sequences Toolbox
Lawrence, Travis J.; Kauffman, Kyle T.; Amrine, Katherine C. H.; Carper, Dana L.; Lee, Raymond S.; Becich, Peter J.; Canales, Claudia J.; Ardell, David H.
2015-01-01
FAST (FAST Analysis of Sequences Toolbox) provides simple, powerful open source command-line tools to filter, transform, annotate and analyze biological sequence data. Modeled after the GNU (GNU's Not Unix) Textutils such as grep, cut, and tr, FAST tools such as fasgrep, fascut, and fastr make it easy to rapidly prototype expressive bioinformatic workflows in a compact and generic command vocabulary. Compact combinatorial encoding of data workflows with FAST commands can simplify the documentation and reproducibility of bioinformatic protocols, supporting better transparency in biological data science. Interface self-consistency and conformity with conventions of GNU, Matlab, Perl, BioPerl, R, and GenBank help make FAST easy and rewarding to learn. FAST automates numerical, taxonomic, and text-based sorting, selection and transformation of sequence records and alignment sites based on content, index ranges, descriptive tags, annotated features, and in-line calculated analytics, including composition and codon usage. Automated content- and feature-based extraction of sites and support for molecular population genetic statistics make FAST useful for molecular evolutionary analysis. FAST is portable, easy to install and secure thanks to the relative maturity of its Perl and BioPerl foundations, with stable releases posted to CPAN. Development as well as a publicly accessible Cookbook and Wiki are available on the FAST GitHub repository at https://github.com/tlawrence3/FAST. The default data exchange format in FAST is Multi-FastA (specifically, a restriction of BioPerl FastA format). Sanger and Illumina 1.8+ FastQ formatted files are also supported. FAST makes it easier for non-programmer biologists to interactively investigate and control biological data at the speed of thought. PMID:26042145
Fernández-Lainez, Cynthia; Aláez-Verson, Carmen; Ibarra-González, Isabel; Enríquez-Flores, Sergio; Carrillo-Sanchez, Karol; Flores-Lagunes, Leonardo; Guillén-López, Sara; Belmont-Martínez, Leticia; Vela-Amieva, Marcela
2018-04-16
Maple syrup urine disease (MSUD) is a metabolic disorder caused by mutations in three of the branched-chain α-keto acid dehydrogenase complex (BCKDC) genes. Classical MSUD symptom can be observed immediately after birth and include ketoacidosis, irritability, lethargy, and coma, which can lead to death or irreversible neurodevelopmental delay in survivors. The molecular diagnosis of MSUD can be time-consuming and difficult to establish using conventional Sanger sequencing because it could be due to pathogenic variants of any of the BCKDC genes. Next-generation sequencing-based methodologies have revolutionized the molecular diagnosis of inborn errors in metabolism and offer a superior approach for genotyping these patients. Here, we report an MSUD case whose molecular diagnosis was performed by clinical exome sequencing (CES), and the possible structural pathogenic effect of a novel E1α subunit pathogenic variant was analyzed using in silico analysis of α and β subunit crystallographic structure. Molecular analysis revealed a new homozygous non-sense c.1267C>T or p.Gln423Ter variant of BCKDHA. The novel BCKDHA variant is considered pathogenic because it caused a premature stop codon that probably led to the loss of the last 22 amino acid residues of the E1α subunit C-terminal end. In silico analysis of this region showed that it is in contact with several residues of the E1β subunit mainly through polar contacts, hydrogen bonds, and hydrophobic interactions. CES strategy could benefit the patients and families by offering precise and prompt diagnosis and better genetic counseling. Copyright © 2018 Elsevier B.V. All rights reserved.
A novel NOTCH3 mutation identified in patients with oral cancer by whole exome sequencing.
Yi, Yanjun; Tian, Zhuowei; Ju, Houyu; Ren, Guoxin; Hu, Jingzhou
2017-06-01
Oral cancer is a serious disease caused by environmental factors and/or susceptible genes. In the present study, in order to identify useful genetic biomarkers for cancer prediction and prevention, and for personalized treatment, we detected somatic mutations in 5 pairs of oral cancer tissues and blood samples using whole exome sequencing (WES). Finally, we confirmed a novel nonsense single-nucleotide polymorphism (SNP; chr19:15288426A>C) in the NOTCH3 gene with sanger sequencing, which resulted in a N1438T mutation in the protein sequence. Using multiple in silico analyses, this variant was found to mildly damaging effects on the NOTCH3 gene, which was supported by the results from analyses using PANTHER, SNAP and SNPs&GO. However, further analysis using Mutation Taster revealed that this SNP had a probability of 0.9997 to be 'disease causing'. In addition, we performed 3D structure simulation analysis and the results suggested that this variant had little effect on the solubility and hydrophobicity of the protein and thus on its function; however, it decreased the stability of the protein by increasing the total energy following minimization (-1,051.39 kcal/mol for the mutant and -1,229.84 kcal/mol for the native) and decreasing one stabilizing residue of the protein. Less stability of the N1438T mutant was also supported by analysis using I-Mutant with a DDG value of -1.67. Overall, the present study identified and confirmed a novel mutation in the NOTCH3 gene, which may decrease the stability of NOTCH3, and may thus prove to be helpful in cancer prognosis.
HLA Diversity in the 1000 Genomes Dataset
Gourraud, Pierre-Antoine; Khankhanian, Pouya; Cereb, Nezih; Yang, Soo Young; Feolo, Michael; Maiers, Martin; D. Rioux, John; Hauser, Stephen; Oksenberg, Jorge
2014-01-01
The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation by sequencing at a level that should allow the genome-wide detection of most variants with frequencies as low as 1%. However, in the major histocompatibility complex (MHC), only the top 10 most frequent haplotypes are in the 1% frequency range whereas thousands of haplotypes are present at lower frequencies. Given the limitation of both the coverage and the read length of the sequences generated by the 1000 Genomes Project, the highly variable positions that define HLA alleles may be difficult to identify. We used classical Sanger sequencing techniques to type the HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 genes in the available 1000 Genomes samples and combined the results with the 103,310 variants in the MHC region genotyped by the 1000 Genomes Project. Using pairwise identity-by-descent distances between individuals and principal component analysis, we established the relationship between ancestry and genetic diversity in the MHC region. As expected, both the MHC variants and the HLA phenotype can identify the major ancestry lineage, informed mainly by the most frequent HLA haplotypes. To some extent, regions of the genome with similar genetic or similar recombination rate have similar properties. An MHC-centric analysis underlines departures between the ancestral background of the MHC and the genome-wide picture. Our analysis of linkage disequilibrium (LD) decay in these samples suggests that overestimation of pairwise LD occurs due to a limited sampling of the MHC diversity. This collection of HLA-specific MHC variants, available on the dbMHC portal, is a valuable resource for future analyses of the role of MHC in population and disease studies. PMID:24988075
HLA diversity in the 1000 genomes dataset.
Gourraud, Pierre-Antoine; Khankhanian, Pouya; Cereb, Nezih; Yang, Soo Young; Feolo, Michael; Maiers, Martin; Rioux, John D; Hauser, Stephen; Oksenberg, Jorge
2014-01-01
The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation by sequencing at a level that should allow the genome-wide detection of most variants with frequencies as low as 1%. However, in the major histocompatibility complex (MHC), only the top 10 most frequent haplotypes are in the 1% frequency range whereas thousands of haplotypes are present at lower frequencies. Given the limitation of both the coverage and the read length of the sequences generated by the 1000 Genomes Project, the highly variable positions that define HLA alleles may be difficult to identify. We used classical Sanger sequencing techniques to type the HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 genes in the available 1000 Genomes samples and combined the results with the 103,310 variants in the MHC region genotyped by the 1000 Genomes Project. Using pairwise identity-by-descent distances between individuals and principal component analysis, we established the relationship between ancestry and genetic diversity in the MHC region. As expected, both the MHC variants and the HLA phenotype can identify the major ancestry lineage, informed mainly by the most frequent HLA haplotypes. To some extent, regions of the genome with similar genetic or similar recombination rate have similar properties. An MHC-centric analysis underlines departures between the ancestral background of the MHC and the genome-wide picture. Our analysis of linkage disequilibrium (LD) decay in these samples suggests that overestimation of pairwise LD occurs due to a limited sampling of the MHC diversity. This collection of HLA-specific MHC variants, available on the dbMHC portal, is a valuable resource for future analyses of the role of MHC in population and disease studies.
Mukda, Ekchol; Trachoo, Objoon; Pasomsub, Ekawat; Tiyasirichokchai, Rawiphorn; Iemwimangsa, Nareenart; Sosothikul, Darintr; Chantratita, Wasun; Pakakasama, Samart
2017-08-01
In the present study, we used exome sequencing to analyze PRF1, UNC13D, STX11, and STXBP2, as well as genes associated with primary immunodeficiency disease (RAB27A, LYST, AP3B1, SH2D1A, ITK, CD27, XIAP, and MAGT1) in Thai children with hemophagocytic lymphohistiocytosis (HLH). We performed mutation analysis of HLH-associated genes in 25 Thai children using an exome sequencing method. Genetic variations found within these target genes were compared to exome sequencing data from 133 healthy individuals. Variants identified with minor allele frequencies <5% and novel mutations were confirmed using Sanger sequencing. Exome sequencing data revealed 101 non-synonymous single nucleotide polymorphisms (SNPs) in all subjects. These SNPs were classified as pathogenic (n = 1), likely pathogenic (n = 16), variant of unknown significance (n = 12), or benign variant (n = 72). Homozygous, compound heterozygous, and double-gene heterozygous variants, involving mutations in PRF1 (n = 3), UNC13D (n = 2), STXBP2 (n = 3), LYST (n = 3), XIAP (n = 2), AP3B1 (n = 1), RAB27A (n = 1), and MAGT1 (n = 1), were demonstrated in 12 patients. Novel mutations were found in most patients in this study. In conclusion, exome sequencing demonstrated the ability to identify rare genetic variants in HLH patients. This method is useful in the detection of mutations in multi-gene associated diseases.
Efficient analysis of mouse genome sequences reveal many nonsense variants
Steeland, Sophie; Timmermans, Steven; Van Ryckeghem, Sara; Hulpiau, Paco; Saeys, Yvan; Van Montagu, Marc; Vandenbroucke, Roosmarijn E.; Libert, Claude
2016-01-01
Genetic polymorphisms in coding genes play an important role when using mouse inbred strains as research models. They have been shown to influence research results, explain phenotypical differences between inbred strains, and increase the amount of interesting gene variants present in the many available inbred lines. SPRET/Ei is an inbred strain derived from Mus spretus that has ∼1% sequence difference with the C57BL/6J reference genome. We obtained a listing of all SNPs and insertions/deletions (indels) present in SPRET/Ei from the Mouse Genomes Project (Wellcome Trust Sanger Institute) and processed these data to obtain an overview of all transcripts having nonsynonymous coding sequence variants. We identified 8,883 unique variants affecting 10,096 different transcripts from 6,328 protein-coding genes, which is about 28% of all coding genes. Because only a subset of these variants results in drastic changes in proteins, we focused on variations that are nonsense mutations that ultimately resulted in a gain of a stop codon. These genes were identified by in silico changing the C57BL/6J coding sequences to the SPRET/Ei sequences, converting them to amino acid (AA) sequences, and comparing the AA sequences. All variants and transcripts affected were also stored in a database, which can be browsed using a SPRET/Ei M. spretus variants web tool (www.spretus.org), including a manual. We validated the tool by demonstrating the loss of function of three proteins predicted to be severely truncated, namely Fas, IRAK2, and IFNγR1. PMID:27147605
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
DOE Office of Scientific and Technical Information (OSTI.GOV)
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted.more » PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.« less
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.; ...
2017-07-18
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted.more » PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.« less
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Richard A.; Brown, Steven D.
2017-01-01
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. PMID:28769883
Capillaries for use in a multiplexed capillary electrophoresis system
Yeung, Edward S.; Chang, Huan-Tsang; Fung, Eliza N.
1997-12-09
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification ("base calling") is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations.
Capillaries for use in a multiplexed capillary electrophoresis system
Yeung, E.S.; Chang, H.T.; Fung, E.N.
1997-12-09
The invention provides a side-entry optical excitation geometry for use in a multiplexed capillary electrophoresis system. A charge-injection device is optically coupled to capillaries in the array such that the interior of a capillary is imaged onto only one pixel. In Sanger-type 4-label DNA sequencing reactions, nucleotide identification (``base calling``) is improved by using two long-pass filters to split fluorescence emission into two emission channels. A binary poly(ethyleneoxide) matrix is used in the electrophoretic separations. 19 figs.
Exome capture sequencing identifies a novel mutation in BBS4
Wang, Hui; Chen, Xianfeng; Dudinsky, Lynn; Patenia, Claire; Chen, Yiyun; Li, Yumei; Wei, Yue; Abboud, Emad B.; Al-Rajhi, Ali A.; Lewis, Richard Alan; Lupski, James R.; Mardon, Graeme; Gibbs, Richard A.; Perkins, Brian D.
2011-01-01
Purpose Leber congenital amaurosis (LCA) is one of the most severe eye dystrophies characterized by severe vision loss at an early stage and accounts for approximately 5% of all retinal dystrophies. The purpose of this study was to identify a novel LCA disease allele or gene and to develop an approach combining genetic mapping with whole exome sequencing. Methods Three patients from King Khaled Eye Specialist Hospital (KKESH205) underwent whole genome single nucleotide polymorphism genotyping, and a single candidate region was identified. Taking advantage of next-generation high-throughput DNA sequencing technologies, whole exome capture sequencing was performed on patient KKESH205#7. Sanger direct sequencing was used during the validation step. The zebrafish model was used to examine the function of the mutant allele. Results A novel missense mutation in Bardet-Biedl syndrome 4 protein (BBS4) was identified in a consanguineous family from Saudi Arabia. This missense mutation in the fifth exon (c.253G>C;p.E85Q) of BBS4 is likely a disease-causing mutation as it segregates with the disease. The mutation is not found in the single nucleotide polymorphism (SNP) database, the 1000 Genomes Project, or matching normal controls. Functional analysis of this mutation in zebrafish indicates that the G253C allele is pathogenic. Coinjection of the G253C allele cannot rescue the mislocalization of rhodopsin in the retina when BBS4 is knocked down by morpholino injection. Immunofluorescence analysis in cell culture shows that this missense mutation in BBS4 does not cause obvious defects in protein expression or pericentriolar localization. Conclusions This mutation likely mainly reduces or abolishes BBS4 function in the retina. Further studies of this allele will provide important insights concerning the pleiotropic nature of BBS4 function. PMID:22219648
Park, Kyung-Hwa; Greenwood-Quaintance, Kerryl E; Uhl, James R; Cunningham, Scott A; Chia, Nicholas; Jeraldo, Patricio R; Sampathkumar, Priya; Nelson, Heidi; Patel, Robin
2017-01-01
Staphylococcus aureus is a leading cause of bacteremia in hospitalized patients. Whether or not S. aureus bacteremia (SAB) is associated with clonality, implicating potential nosocomial transmission, has not, however, been investigated. Herein, we examined the epidemiology of SAB using whole genome sequencing (WGS). 152 SAB isolates collected over the course of 2015 at a single large Minnesota medical center were studied. Staphylococcus protein A (spa) typing was performed by PCR/Sanger sequencing; multilocus sequence typing (MLST) and core genome MLST (cgMLST) were determined by WGS. Forty-eight isolates (32%) were methicillin-resistant S. aureus (MRSA). The isolates encompassed 66 spa types, clustered into 11 spa clonal complexes (CCs) and 10 singleton types. 88% of 48 MRSA isolates belonged to spa CC-002 or -008. Methicillin-susceptible S. aureus (MSSA) isolates were more genotypically diverse, with 61% distributed across four spa CCs (CC-002, CC-012, CC-008 and CC-084). By MLST, there was 31 sequence types (STs), including 18 divided into 6 CCs and 13 singleton STs. Amongst MSSA isolates, the common MLST clones were CC5 (23%), CC30 (19%), CC8 (15%) and CC15 (11%). Common MRSA clones were CC5 (67%) and CC8 (25%); there were no MRSA isolates in CC45 or CC30. By cgMLST analysis, there were 9 allelic differences between two isolates, with the remaining 150 isolates differing from each other by over 40 alleles. The two isolates were retroactively epidemiologically linked by medical record review. Overall, cgMLST analysis resulted in higher resolution epidemiological typing than did multilocus sequence or spa typing.
Analysis of Litopenaeus vannamei Transcriptome Using the Next-Generation DNA Sequencing Technique
Li, Chaozheng; Weng, Shaoping; Chen, Yonggui; Yu, Xiaoqiang; Lü, Ling; Zhang, Haiqing; He, Jianguo; Xu, Xiaopeng
2012-01-01
Background Pacific white shrimp (Litopenaeus vannamei), the major species of farmed shrimps in the world, has been attracting extensive studies, which require more and more genome background knowledge. The now available transcriptome data of L. vannamei are insufficient for research requirements, and have not been adequately assembled and annotated. Methodology/Principal Findings This is the first study that used a next-generation high-throughput DNA sequencing technique, the Solexa/Illumina GA II method, to analyze the transcriptome from whole bodies of L. vannamei larvae. More than 2.4 Gb of raw data were generated, and 109,169 unigenes with a mean length of 396 bp were assembled using the SOAP denovo software. 73,505 unigenes (>200 bp) with good quality sequences were selected and subjected to annotation analysis, among which 37.80% can be matched in NCBI Nr database, 37.3% matched in Swissprot, and 44.1% matched in TrEMBL. Using BLAST and BLAST2Go softwares, 11,153 unigenes were classified into 25 Clusters of Orthologous Groups of proteins (COG) categories, 8171 unigenes were assigned into 51 Gene ontology (GO) functional groups, and 18,154 unigenes were divided into 220 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. To primarily verify part of the results of assembly and annotations, 12 assembled unigenes that are homologous to many embryo development-related genes were chosen and subjected to RT-PCR for electrophoresis and Sanger sequencing analyses, and to real-time PCR for expression profile analyses during embryo development. Conclusions/Significance The L. vannamei transcriptome analyzed using the next-generation sequencing technique enriches the information of L. vannamei genes, which will facilitate our understanding of the genome background of crustaceans, and promote the studies on L. vannamei. PMID:23071809
Barbano, Raffaela; Pasculli, Barbara; Coco, Michelina; Fontana, Andrea; Copetti, Massimiliano; Rendina, Michelina; Valori, Vanna Maria; Graziano, Paolo; Maiello, Evaristo; Fazio, Vito Michele; Parrella, Paola
2015-01-01
BRAF codon 600 mutation testing of melanoma patients is mandatory for the choice of the most appropriate therapy in the clinical setting. Competitive allele specific TaqMan PCR (Cast-PCR) technology allows not only the selective amplification of minor alleles, but it also blocks the amplification of non-mutant allele. We genotyped codon 600 of the BRAF gene in 54 patients’ samples by Cast-PCR and bidirectional direct sequence analysis. All the mutations detected by sequencing were also identified by Cast-PCR. In addition, Cast-PCR assay detected four samples carrying mutations and was able to clearly identify two mutations of uncertain interpretation by Sanger sequencing. The limit of detection of Cast-PCR was evaluated by constructing dilution curves of BRAFV600E and BRAFV600K mutated clinical samples mixed with a not-mutated specimens. Both mutations could be detected until a 1:100 mutated/not mutated ratio. Cloning and sequencing of the clones was used to confirm mutations on representative discrepant cases. Cast PCR performances were not affected by intratumour heterogeneity, and less affected by melanin content. Our results indicate that Cast-PCR is a reliable diagnostic tool for the identification of melanoma patients as eligible to be treated with TKIs and might be implemented in the clinical setting as elective screening method. PMID:26690267