Sample records for disease sequence analysis

  1. Phenotype–genotype correlation in Hirschsprung disease is illuminated by comparative analysis of the RET protein sequence

    PubMed Central

    Kashuk, Carl S.; Stone, Eric A.; Grice, Elizabeth A.; Portnoy, Matthew E.; Green, Eric D.; Sidow, Arend; Chakravarti, Aravinda; McCallion, Andrew S.

    2005-01-01

    The ability to discriminate between deleterious and neutral amino acid substitutions in the genes of patients remains a significant challenge in human genetics. The increasing availability of genomic sequence data from multiple vertebrate species allows inclusion of sequence conservation and physicochemical properties of residues to be used for functional prediction. In this study, the RET receptor tyrosine kinase serves as a model disease gene in which a broad spectrum (≥116) of disease-associated mutations has been identified among patients with Hirschsprung disease and multiple endocrine neoplasia type 2. We report the alignment of the human RET protein sequence with the orthologous sequences of 12 non-human vertebrates (eight mammalian, one avian, and three teleost species), their comparative analysis, the evolutionary topology of the RET protein, and predicted tolerance for all published missense mutations. We show that, although evolutionary conservation alone provides significant information to predict the effect of a RET mutation, a model that combines comparative sequence data with analysis of physiochemical properties in a quantitative framework provides far greater accuracy. Although the ability to discern the impact of a mutation is imperfect, our analyses permit substantial discrimination between predicted functional classes of RET mutations and disease severity even for a multigenic disease such as Hirschsprung disease. PMID:15956201

  2. Median network analysis of defectively sequenced entire mitochondrial genomes from early and contemporary disease studies.

    PubMed

    Bandelt, Hans-Jürgen; Yao, Yong-Gang; Bravi, Claudio M; Salas, Antonio; Kivisild, Toomas

    2009-03-01

    Sequence analysis of the mitochondrial genome has become a routine method in the study of mitochondrial diseases. Quite often, the sequencing efforts in the search of pathogenic or disease-associated mutations are affected by technical and interpretive problems, caused by sample mix-up, contamination, biochemical problems, incomplete sequencing, misdocumentation and insufficient reference to previously published data. To assess data quality in case studies of mitochondrial diseases, it is recommended to compare any mtDNA sequence under consideration to their phylogenetically closest lineages available in the Web. The median network method has proven useful for visualizing potential problems with the data. We contrast some early reports of complete mtDNA sequences to more recent total mtDNA sequencing efforts in studies of various mitochondrial diseases. We conclude that the quality of complete mtDNA sequences generated in the medical field in the past few years is somewhat unsatisfactory and may even fall behind that of pioneer manual sequencing in the early nineties. Our study provides a paradigm for an a posteriori evaluation of sequence quality and for detection of potential problems with inferring a pathogenic status of a particular mutation.

  3. Single nucleotide polymorphisms from Theobroma cacao expressed sequence tags associated with witches' broom disease in cacao.

    PubMed

    Lima, L S; Gramacho, K P; Carels, N; Novais, R; Gaiotto, F A; Lopes, U V; Gesteira, A S; Zaidan, H A; Cascardo, J C M; Pires, J L; Micheli, F

    2009-07-14

    In order to increase the efficiency of cacao tree resistance to witches' broom disease, which is caused by Moniliophthora perniciosa (Tricholomataceae), we looked for molecular markers that could help in the selection of resistant cacao genotypes. Among the different markers useful for developing marker-assisted selection, single nucleotide polymorphisms (SNPs) constitute the most common type of sequence difference between alleles and can be easily detected by in silico analysis from expressed sequence tag libraries. We report the first detection and analysis of SNPs from cacao-M. perniciosa interaction expressed sequence tags, using bioinformatics. Selection based on analysis of these SNPs should be useful for developing cacao varieties resistant to this devastating disease.

  4. Identification of a Heterozygous SPG11 Mutation by Clinical Exome Sequencing in a Patient With Hereditary Spastic Paraplegia: A Case Report.

    PubMed

    Oh, Ja-Young; Do, Hyun Jung; Lee, Seungok; Jang, Ja-Hyun; Cho, Eun-Hae; Jang, Dae-Hyun

    2016-12-01

    Next-generation sequencing, such as whole-genome sequencing, whole-exome sequencing, and targeted panel sequencing have been applied for diagnosis of many genetic diseases, and are in the process of replacing the traditional methods of genetic analysis. Clinical exome sequencing (CES), which provides not only sequence variation data but also clinical interpretation, aids in reaching a final conclusion with regards to genetic diagnosis. Sequencing of genes with clinical relevance rather than whole exome sequencing might be more suitable for the diagnosis of known hereditary disease with genetic heterogeneity. Here, we present the clinical usefulness of CES for the diagnosis of hereditary spastic paraplegia (HSP). We report a case of patient who was strongly suspected of having HSP based on her clinical manifestations. HSP is one of the diseases with high genetic heterogeneity, the 72 different loci and 59 discovered genes identified so far. Therefore, traditional approach for diagnosis of HSP with genetic analysis is very challenging and time-consuming. CES with TruSight One Sequencing Panel, which enriches about 4,800 genes with clinical relevance, revealed compound heterozygous mutations in SPG11 . One workflow and one procedure can provide the results of genetic analysis, and CES with enrichment of clinically relevant genes is a cost-effective and time-saving diagnostic tool for diseases with genetic heterogeneity, including HSP.

  5. Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting

    NASA Astrophysics Data System (ADS)

    Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.

    1997-05-01

    Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate laser mass spectrometry can possible be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic disease can be a very efficient step to reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.

  6. Using whole-exome sequencing to identify variants inherited from mosaic parents

    PubMed Central

    Rios, Jonathan J; Delgado, Mauricio R

    2015-01-01

    Whole-exome sequencing (WES) has allowed the discovery of genes and variants causing rare human disease. This is often achieved by comparing nonsynonymous variants between unrelated patients, and particularly for sporadic or recessive disease, often identifies a single or few candidate genes for further consideration. However, despite the potential for this approach to elucidate the genetic cause of rare human disease, a majority of patients fail to realize a genetic diagnosis using standard exome analysis methods. Although genetic heterogeneity contributes to the difficulty of exome sequence analysis between patients, it remains plausible that rare human disease is not caused by de novo or recessive variants. Multiple human disorders have been described for which the variant was inherited from a phenotypically normal mosaic parent. Here we highlight the potential for exome sequencing to identify a reasonable number of candidate genes when dominant disease variants are inherited from a mosaic parent. We show the power of WES to identify a limited number of candidate genes using this disease model and how sequence coverage affects identification of mosaic variants by WES. We propose this analysis as an alternative to discover genetic causes of rare human disorders for which typical WES approaches fail to identify likely pathogenic variants. PMID:24986828

  7. A functional U-statistic method for association analysis of sequencing data.

    PubMed

    Jadhav, Sneha; Tong, Xiaoran; Lu, Qing

    2017-11-01

    Although sequencing studies hold great promise for uncovering novel variants predisposing to human diseases, the high dimensionality of the sequencing data brings tremendous challenges to data analysis. Moreover, for many complex diseases (e.g., psychiatric disorders) multiple related phenotypes are collected. These phenotypes can be different measurements of an underlying disease, or measurements characterizing multiple related diseases for studying common genetic mechanism. Although jointly analyzing these phenotypes could potentially increase the power of identifying disease-associated genes, the different types of phenotypes pose challenges for association analysis. To address these challenges, we propose a nonparametric method, functional U-statistic method (FU), for multivariate analysis of sequencing data. It first constructs smooth functions from individuals' sequencing data, and then tests the association of these functions with multiple phenotypes by using a U-statistic. The method provides a general framework for analyzing various types of phenotypes (e.g., binary and continuous phenotypes) with unknown distributions. Fitting the genetic variants within a gene using a smoothing function also allows us to capture complexities of gene structure (e.g., linkage disequilibrium, LD), which could potentially increase the power of association analysis. Through simulations, we compared our method to the multivariate outcome score test (MOST), and found that our test attained better performance than MOST. In a real data application, we apply our method to the sequencing data from Minnesota Twin Study (MTS) and found potential associations of several nicotine receptor subunit (CHRN) genes, including CHRNB3, associated with nicotine dependence and/or alcohol dependence. © 2017 WILEY PERIODICALS, INC.

  8. Sequence analysis of Jembrana disease virus strains reveals a genetically stable lentivirus.

    PubMed

    Desport, Moira; Stewart, Meredith E; Mikosza, Andrew S; Sheridan, Carol A; Peterson, Shane E; Chavand, Olivier; Hartaningsih, Nining; Wilcox, Graham E

    2007-06-01

    Jembrana disease virus (JDV) is a lentivirus associated with an acute disease syndrome with a 20% case fatality rate in Bos javanicus (Bali cattle) in Indonesia, occurring after a short incubation period and with no recurrence of the disease after recovery. Partial regions of gag and pol and the entire env were examined for sequence variation in DNA samples from cases of Jembrana disease obtained from Bali, Sumatra and South Kalimantan in Indonesian Borneo. A high level of nucleotide conservation (97-100%) was observed in gag sequences from samples taken in Bali and Sumatra, indicating that the source of JDV in Sumatra was most likely to have originated from Bali. The pol sequences and, unexpectedly, the env sequences from Bali samples were also well conserved with low nucleotide (96-99%) and amino acid substitutions (95-99%). However, the sample from South Kalimantan (JDV(KAL/01)) contained more divergent sequences, particularly in env (88% identity). Phylogenetic analysis revealed that the JDV(KAL/01)env sequences clustered with the sequence from the Pulukan sample (Bali) from 2001. JDV appears to be remarkably stable genetically and has undergone minor genetic changes over a period of nearly 20 years in Bali despite becoming endemic in the cattle population of the island.

  9. Sequencing, Assembly and Analysis of Human Microbial Communities

    ScienceCinema

    Petrosino, Joe

    2018-02-02

    Joe Petrosino of Baylor College of Medicine discusses using next generation sequencing technologies to study human microbial communities associated with health and disease on June 4, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.

  10. Using ProMED-Mail and MedWorm blogs for cross-domain pattern analysis in epidemic intelligence.

    PubMed

    Stewart, Avaré; Denecke, Kerstin

    2010-01-01

    In this work we motivate the use of medical blog user generated content for gathering facts about disease reporting events to support biosurveillance investigation. Given the characteristics of blogs, the extraction of such events is made more difficult due to noise and data abundance. We address the problem of automatically inferring disease reporting event extraction patterns in this more noisy setting. The sublanguage used in outbreak reports is exploited to align with the sequences of disease reporting sentences in blogs. Based our Cross Domain Pattern Analysis Framework, experimental results show that Phase-Level sequences tend to produce more overlap across the domains than Word-Level sequences. The cross domain alignment process is effective at filtering noisy sequences from blogs and extracting good candidate sequence patterns from an abundance of text.

  11. Genome-Wide Prediction and Analysis of 3D-Domain Swapped Proteins in the Human Genome from Sequence Information.

    PubMed

    Upadhyay, Atul Kumar; Sowdhamini, Ramanathan

    2016-01-01

    3D-domain swapping is one of the mechanisms of protein oligomerization and the proteins exhibiting this phenomenon have many biological functions. These proteins, which undergo domain swapping, have acquired much attention owing to their involvement in human diseases, such as conformational diseases, amyloidosis, serpinopathies, proteionopathies etc. Early realisation of proteins in the whole human genome that retain tendency to domain swap will enable many aspects of disease control management. Predictive models were developed by using machine learning approaches with an average accuracy of 78% (85.6% of sensitivity, 87.5% of specificity and an MCC value of 0.72) to predict putative domain swapping in protein sequences. These models were applied to many complete genomes with special emphasis on the human genome. Nearly 44% of the protein sequences in the human genome were predicted positive for domain swapping. Enrichment analysis was performed on the positively predicted sequences from human genome for their domain distribution, disease association and functional importance based on Gene Ontology (GO). Enrichment analysis was also performed to infer a better understanding of the functional importance of these sequences. Finally, we developed hinge region prediction, in the given putative domain swapped sequence, by using important physicochemical properties of amino acids.

  12. Rapid Detection of Rare Deleterious Variants by Next Generation Sequencing with Optional Microarray SNP Genotype Data

    PubMed Central

    Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.

    2015-01-01

    ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133

  13. Genetic Diagnosis in Consanguineous Families With Kidney Disease by Homozygosity Mapping Coupled With Whole-Exome Sequencing

    PubMed Central

    Al-Romaih, Khaldoun I.; Genovese, Giulio; Al-Mojalli, Hamad; Al-Othman, Saleh; Al-Manea, Hadeel; Al-Suleiman, Mohammed; Al-Jondubi, Mohammed; Atallah, Nourah; Al-Rodhyan, Maha; Weins, Astrid; Pollak, Martin R.; Adra, Chaker N.

    2011-01-01

    Background Accurate diagnosis of the primary cause of an individual’s kidney disease can be essential for proper management. Some kidney diseases have overlapping histopathological features despite being caused by defects in different genes. In this report we describe two consanguineous Saudi Arabian families in which individuals presented with kidney failure and mixed clinical and histological features initially thought consistent with focal segmental glomerulosclerosis. Study Design Case series. Setting and participants We studied members of two apparently unrelated families from Saudi Arabia with kidney disease. Measurements Whole-genome single-nucleotide polymorphism analysis followed by targeted isolation and sequencing of exons using genomic DNA samples from affected members of these families, followed by additional focused genotyping and sequence analysis. Results The two apparently unrelated families shared a region of homozygosity on chromosome 2q13. Exome sequence from the affected individuals lacked any sequence reads from the NPHP1 gene, which is located within this homozygous region. Additional PCR based genotyping confirmed that affected individuals had NPHP1 deletions, rather than defects in a known FSGS-associated gene. Limitations The methods used here may not result in a clear genetic diagnosis in many cases of apparent familial kidney disease. Conclusions This analysis demonstrates the power of new high-throughput genotyping and sequencing technologies to aid in the rapid genetic diagnosis of individuals with an inherited form of kidney disease. We believe it is likely that such tools may become useful clinical genetic tools and alter the manner in which diagnoses are made in nephrology. PMID:21658830

  14. Non-exomic and synonymous variants in ABCA4 are an important cause of Stargardt disease

    PubMed Central

    Braun, Terry A.; Mullins, Robert F.; Wagner, Alex H.; Andorf, Jeaneen L.; Johnston, Rebecca M.; Bakall, Benjamin B.; Deluca, Adam P.; Fishman, Gerald A.; Lam, Byron L.; Weleber, Richard G.; Cideciyan, Artur V.; Jacobson, Samuel G.; Sheffield, Val C.; Tucker, Budd A.; Stone, Edwin M.

    2013-01-01

    Mutations in ABCA4 cause Stargardt disease and other blinding autosomal recessive retinal disorders. However, sequencing of the complete coding sequence in patients with clinical features of Stargardt disease sometimes fails to detect one or both mutations. For example, among 208 individuals with clear clinical evidence of ABCA4 disease ascertained at a single institution, 28 had only one disease-causing allele identified in the exons and splice junctions of the primary retinal transcript of the gene. Haplotype analysis of these 28 probands revealed 3 haplotypes shared among ten families, suggesting that 18 of the 28 missing alleles were rare enough to be present only once in the cohort. We hypothesized that mutations near rare alternate splice junctions in ABCA4 might cause disease by increasing the probability of mis-splicing at these sites. Next-generation sequencing of RNA extracted from human donor eyes revealed more than a dozen alternate exons that are occasionally incorporated into the ABCA4 transcript in normal human retina. We sequenced the genomic DNA containing 15 of these minor exons in the 28 one-allele subjects and observed five instances of two different variations in the splice signals of exon 36.1 that were not present in normal individuals (P < 10−6). Analysis of RNA obtained from the keratinocytes of patients with these mutations revealed the predicted alternate transcript. This study illustrates the utility of RNA sequence analysis of human donor tissue and patient-derived cell lines to identify mutations that would be undetectable by exome sequencing. PMID:23918662

  15. Identification of novel mutations in the α-galactosidase A gene in patients with Fabry disease: pitfalls of mutation analyses in patients with low α-galactosidase A activity.

    PubMed

    Yoshimitsu, Makoto; Higuchi, Koji; Miyata, Masaaki; Devine, Sean; Mattman, Andre; Sirrs, Sandra; Medin, Jeffrey A; Tei, Chuwa; Takenaka, Toshihiro

    2011-05-01

    Fabry disease is an X-linked lysosomal storage disorder caused by mutations of the α-galactosidase A (GLA) gene, and the disease is a relatively prevalent cause of left ventricular hypertrophy followed by conduction abnormalities and arrhythmias. Mutation analysis of the GLA gene is a valuable tool for accurate diagnosis of affected families. In this study, we carried out molecular studies of 10 unrelated families diagnosed with Fabry disease. Genetic analysis of the GLA gene using conventional genomic sequencing was performed in 9 hemizygous males and 6 heterozygous females. In patients with no mutations in coding DNA sequence, multiplex ligation-dependent probe amplification (MLPA) and/or cDNA sequencing were performed. We identified a novel exon 2 deletion (IVS1_IVS2) in a heterozygous female by MLPA, which was undetectable by conventional sequencing methods. In addition, the g.9331G>A mutation that has previously been found only in patients with cardiac Fabry disease was found in 3 unrelated, newly-diagnosed, cardiac Fabry patients by sequencing GLA genomic DNA and cDNA. Two other novel mutations, g.8319A>G and 832delA were also found in addition to 4 previously reported mutations (R112C, C142Y, M296I, and G373D) in 6 other families. We could identify GLA gene mutations in all hemizygotes and heterozygotes from 10 families with Fabry disease. Mutations in 4 out of 10 families could not be identified by classical genomic analysis, which focuses on exons and the flanking region. Instead, these data suggest that MLPA analysis and cDNA sequence should be considered in genetic testing surveys of patients with Fabry disease. Copyright © 2011 Japanese College of Cardiology. Published by Elsevier Ltd. All rights reserved.

  16. The European Classical Swine Fever Virus Database: Blueprint for a Pathogen-Specific Sequence Database with Integrated Sequence Analysis Tools

    PubMed Central

    Postel, Alexander; Schmeiser, Stefanie; Zimmermann, Bernd; Becher, Paul

    2016-01-01

    Molecular epidemiology has become an indispensable tool in the diagnosis of diseases and in tracing the infection routes of pathogens. Due to advances in conventional sequencing and the development of high throughput technologies, the field of sequence determination is in the process of being revolutionized. Platforms for sharing sequence information and providing standardized tools for phylogenetic analyses are becoming increasingly important. The database (DB) of the European Union (EU) and World Organisation for Animal Health (OIE) Reference Laboratory for classical swine fever offers one of the world’s largest semi-public virus-specific sequence collections combined with a module for phylogenetic analysis. The classical swine fever (CSF) DB (CSF-DB) became a valuable tool for supporting diagnosis and epidemiological investigations of this highly contagious disease in pigs with high socio-economic impacts worldwide. The DB has been re-designed and now allows for the storage and analysis of traditionally used, well established genomic regions and of larger genomic regions including complete viral genomes. We present an application example for the analysis of highly similar viral sequences obtained in an endemic disease situation and introduce the new geographic “CSF Maps” tool. The concept of this standardized and easy-to-use DB with an integrated genetic typing module is suited to serve as a blueprint for similar platforms for other human or animal viruses. PMID:27827988

  17. MetaDP: a comprehensive web server for disease prediction of 16S rRNA metagenomic datasets.

    PubMed

    Xu, Xilin; Wu, Aiping; Zhang, Xinlei; Su, Mingming; Jiang, Taijiao; Yuan, Zhe-Ming

    2016-01-01

    High-throughput sequencing-based metagenomics has garnered considerable interest in recent years. Numerous methods and tools have been developed for the analysis of metagenomic data. However, it is still a daunting task to install a large number of tools and complete a complicated analysis, especially for researchers with minimal bioinformatics backgrounds. To address this problem, we constructed an automated software named MetaDP for 16S rRNA sequencing data analysis, including data quality control, operational taxonomic unit clustering, diversity analysis, and disease risk prediction modeling. Furthermore, a support vector machine-based prediction model for intestinal bowel syndrome (IBS) was built by applying MetaDP to microbial 16S sequencing data from 108 children. The success of the IBS prediction model suggests that the platform may also be applied to other diseases related to gut microbes, such as obesity, metabolic syndrome, or intestinal cancer, among others (http://metadp.cn:7001/).

  18. Next Generation Sequencing Technology and Genomewide Data Analysis: Perspectives for Retinal Research

    PubMed Central

    Chaitankar, Vijender; Karakülah, Gökhan; Ratnapriya, Rinki; Giuste, Felipe O.; Brooks, Matthew J.; Swaroop, Anand

    2016-01-01

    The advent of high throughput next generation sequencing (NGS) has accelerated the pace of discovery of disease-associated genetic variants and genomewide profiling of expressed sequences and epigenetic marks, thereby permitting systems-based analyses of ocular development and disease. Rapid evolution of NGS and associated methodologies presents significant challenges in acquisition, management, and analysis of large data sets and for extracting biologically or clinically relevant information. Here we illustrate the basic design of commonly used NGS-based methods, specifically whole exome sequencing, transcriptome, and epigenome profiling, and provide recommendations for data analyses. We briefly discuss systems biology approaches for integrating multiple data sets to elucidate gene regulatory or disease networks. While we provide examples from the retina, the NGS guidelines reviewed here are applicable to other tissues/cell types as well. PMID:27297499

  19. Complete genome sequence of a new enamovirus from Argentina infecting alfalfa plants showing dwarfism symptoms.

    PubMed

    Bejerman, Nicolás; Giolitti, Fabián; Trucco, Verónica; de Breuil, Soledad; Dietzgen, Ralf G; Lenardon, Sergio

    2016-07-01

    Alfalfa dwarf disease, probably caused by synergistic interactions of mixed virus infections, is a major and emergent disease that threatens alfalfa production in Argentina. Deep sequencing of diseased alfalfa plant samples from the central region of Argentina resulted in the identification of a new virus genome resembling enamoviruses in sequence and genome structure. Phylogenetic analysis suggests that it is a new member of the genus Enamovirus, family Luteoviridae. The virus is tentatively named "alfalfa enamovirus 1" (AEV-1). The availability of the AEV-1 genome sequence will make it possible to assess the genetic variability of this virus and to construct an infectious clone to investigate its role in alfalfa dwarfism disease.

  20. Evaluation of the Bacterial Diversity in the Human Tongue Coating Based on Genus-Specific Primers for 16S rRNA Sequencing.

    PubMed

    Sun, Beili; Zhou, Dongrui; Tu, Jing; Lu, Zuhong

    2017-01-01

    The characteristics of tongue coating are very important symbols for disease diagnosis in traditional Chinese medicine (TCM) theory. As a habitat of oral microbiota, bacteria on the tongue dorsum have been proved to be the cause of many oral diseases. The high-throughput next-generation sequencing (NGS) platforms have been widely applied in the analysis of bacterial 16S rRNA gene. We developed a methodology based on genus-specific multiprimer amplification and ligation-based sequencing for microbiota analysis. In order to validate the efficiency of the approach, we thoroughly analyzed six tongue coating samples from lung cancer patients with different TCM types, and more than 600 genera of bacteria were detected by this platform. The results showed that ligation-based parallel sequencing combined with enzyme digestion and multiamplification could expand the effective length of sequencing reads and could be applied in the microbiota analysis.

  1. High Throughput Sequence Analysis for Disease Resistance in Maize

    USDA-ARS?s Scientific Manuscript database

    Preliminary results of a computational analysis of high throughput sequencing data from Zea mays and the fungus Aspergillus are reported. The Illumina Genome Analyzer was used to sequence RNA samples from two strains of Z. mays (Va35 and Mp313) collected over a time course as well as several specie...

  2. Complete genome sequence and phylogenetic analyses of an aquabirnavirus isolated from a diseased marbled eel culture in Taiwan.

    PubMed

    Wen, Chiu-Ming

    2017-08-01

    An aquabirnavirus was isolated from diseased marbled eels (Anguilla marmorata; MEIPNV1310) with gill haemorrhages and associated mortality. Its genome segment sequences were obtained through next-generation sequencing and compared with published aquabirnavirus sequences. The results indicated that the genome sequence of MEIPNV1310 contains segment A (3099 nucleotides) and segment B (2789 nucleotides). Phylogenetic analysis showed that MEIPNV1310 is closely related to the infectious pancreatic necrosis Ab strain within genogroup II. This genome sequence is beneficial for studying the geographic distribution and evolution of aquabirnaviruses.

  3. Partial sequencing of sodA gene and its application to identification of Streptococcus dysgalactiae subsp. dysgalactiae isolated from farmed fish.

    PubMed

    Nomoto, R; Kagawa, H; Yoshida, T

    2008-01-01

    To investigate the difference between Lancefield group C Streptococcus dysgalactiae (GCSD) strains isolated from diseased fish and animals by sequencing and phylogenetic analysis of the sodA gene. The sodA gene of Strep. dysgalactiae strains isolated from fish and animals were amplified and its nucleotide sequences were determined. Although 100% sequence identity was observed among fish GCSD strains, the determined sequences from animal isolates showed variations against fish isolate sequences. Thus, all fish GCSD strains were clearly separated from the GCSD strains of other origin by using phylogenetic tree analysis. In addition, the original primer set was designed based on the determined sequences for specifically amplify the sodA gene of fish GCSD strains. The primer set yield amplification products from only fish GCSD strains. By sequencing analysis of the sodA gene, the genetic divergence between Strep. dysgalactiae strains isolated from fish and mammals was demonstrated. Moreover, an original oligonucletide primer set, which could simply detect the genotype of fish GCSD strains was designed. This study shows that Strep. dysgalactiae isolated from diseased fish could be distinguished from conventional GCSD strains by the difference in the sequence of the sodA gene.

  4. An international effort towards developing standards for best practices in analysis, interpretation and reporting of clinical genome sequencing results in the CLARITY Challenge.

    PubMed

    Brownstein, Catherine A; Beggs, Alan H; Homer, Nils; Merriman, Barry; Yu, Timothy W; Flannery, Katherine C; DeChene, Elizabeth T; Towne, Meghan C; Savage, Sarah K; Price, Emily N; Holm, Ingrid A; Luquette, Lovelace J; Lyon, Elaine; Majzoub, Joseph; Neupert, Peter; McCallie, David; Szolovits, Peter; Willard, Huntington F; Mendelsohn, Nancy J; Temme, Renee; Finkel, Richard S; Yum, Sabrina W; Medne, Livija; Sunyaev, Shamil R; Adzhubey, Ivan; Cassa, Christopher A; de Bakker, Paul I W; Duzkale, Hatice; Dworzyński, Piotr; Fairbrother, William; Francioli, Laurent; Funke, Birgit H; Giovanni, Monica A; Handsaker, Robert E; Lage, Kasper; Lebo, Matthew S; Lek, Monkol; Leshchiner, Ignaty; MacArthur, Daniel G; McLaughlin, Heather M; Murray, Michael F; Pers, Tune H; Polak, Paz P; Raychaudhuri, Soumya; Rehm, Heidi L; Soemedi, Rachel; Stitziel, Nathan O; Vestecka, Sara; Supper, Jochen; Gugenmus, Claudia; Klocke, Bernward; Hahn, Alexander; Schubach, Max; Menzel, Mortiz; Biskup, Saskia; Freisinger, Peter; Deng, Mario; Braun, Martin; Perner, Sven; Smith, Richard J H; Andorf, Janeen L; Huang, Jian; Ryckman, Kelli; Sheffield, Val C; Stone, Edwin M; Bair, Thomas; Black-Ziegelbein, E Ann; Braun, Terry A; Darbro, Benjamin; DeLuca, Adam P; Kolbe, Diana L; Scheetz, Todd E; Shearer, Aiden E; Sompallae, Rama; Wang, Kai; Bassuk, Alexander G; Edens, Erik; Mathews, Katherine; Moore, Steven A; Shchelochkov, Oleg A; Trapane, Pamela; Bossler, Aaron; Campbell, Colleen A; Heusel, Jonathan W; Kwitek, Anne; Maga, Tara; Panzer, Karin; Wassink, Thomas; Van Daele, Douglas; Azaiez, Hela; Booth, Kevin; Meyer, Nic; Segal, Michael M; Williams, Marc S; Tromp, Gerard; White, Peter; Corsmeier, Donald; Fitzgerald-Butt, Sara; Herman, Gail; Lamb-Thrush, Devon; McBride, Kim L; Newsom, David; Pierson, Christopher R; Rakowsky, Alexander T; Maver, Aleš; Lovrečić, Luca; Palandačić, Anja; Peterlin, Borut; Torkamani, Ali; Wedell, Anna; Huss, Mikael; Alexeyenko, Andrey; Lindvall, Jessica M; Magnusson, Måns; Nilsson, Daniel; Stranneheim, Henrik; Taylan, Fulya; Gilissen, Christian; Hoischen, Alexander; van Bon, Bregje; Yntema, Helger; Nelen, Marcel; Zhang, Weidong; Sager, Jason; Zhang, Lu; Blair, Kathryn; Kural, Deniz; Cariaso, Michael; Lennon, Greg G; Javed, Asif; Agrawal, Saloni; Ng, Pauline C; Sandhu, Komal S; Krishna, Shuba; Veeramachaneni, Vamsi; Isakov, Ofer; Halperin, Eran; Friedman, Eitan; Shomron, Noam; Glusman, Gustavo; Roach, Jared C; Caballero, Juan; Cox, Hannah C; Mauldin, Denise; Ament, Seth A; Rowen, Lee; Richards, Daniel R; San Lucas, F Anthony; Gonzalez-Garay, Manuel L; Caskey, C Thomas; Bai, Yu; Huang, Ying; Fang, Fang; Zhang, Yan; Wang, Zhengyuan; Barrera, Jorge; Garcia-Lobo, Juan M; González-Lamuño, Domingo; Llorca, Javier; Rodriguez, Maria C; Varela, Ignacio; Reese, Martin G; De La Vega, Francisco M; Kiruluta, Edward; Cargill, Michele; Hart, Reece K; Sorenson, Jon M; Lyon, Gholson J; Stevenson, David A; Bray, Bruce E; Moore, Barry M; Eilbeck, Karen; Yandell, Mark; Zhao, Hongyu; Hou, Lin; Chen, Xiaowei; Yan, Xiting; Chen, Mengjie; Li, Cong; Yang, Can; Gunel, Murat; Li, Peining; Kong, Yong; Alexander, Austin C; Albertyn, Zayed I; Boycott, Kym M; Bulman, Dennis E; Gordon, Paul M K; Innes, A Micheil; Knoppers, Bartha M; Majewski, Jacek; Marshall, Christian R; Parboosingh, Jillian S; Sawyer, Sarah L; Samuels, Mark E; Schwartzentruber, Jeremy; Kohane, Isaac S; Margulies, David M

    2014-03-25

    There is tremendous potential for genome sequencing to improve clinical diagnosis and care once it becomes routinely accessible, but this will require formalizing research methods into clinical best practices in the areas of sequence data generation, analysis, interpretation and reporting. The CLARITY Challenge was designed to spur convergence in methods for diagnosing genetic disease starting from clinical case history and genome sequencing data. DNA samples were obtained from three families with heritable genetic disorders and genomic sequence data were donated by sequencing platform vendors. The challenge was to analyze and interpret these data with the goals of identifying disease-causing variants and reporting the findings in a clinically useful format. Participating contestant groups were solicited broadly, and an independent panel of judges evaluated their performance. A total of 30 international groups were engaged. The entries reveal a general convergence of practices on most elements of the analysis and interpretation process. However, even given this commonality of approach, only two groups identified the consensus candidate variants in all disease cases, demonstrating a need for consistent fine-tuning of the generally accepted methods. There was greater diversity of the final clinical report content and in the patient consenting process, demonstrating that these areas require additional exploration and standardization. The CLARITY Challenge provides a comprehensive assessment of current practices for using genome sequencing to diagnose and report genetic diseases. There is remarkable convergence in bioinformatic techniques, but medical interpretation and reporting are areas that require further development by many groups.

  5. An international effort towards developing standards for best practices in analysis, interpretation and reporting of clinical genome sequencing results in the CLARITY Challenge

    PubMed Central

    2014-01-01

    Background There is tremendous potential for genome sequencing to improve clinical diagnosis and care once it becomes routinely accessible, but this will require formalizing research methods into clinical best practices in the areas of sequence data generation, analysis, interpretation and reporting. The CLARITY Challenge was designed to spur convergence in methods for diagnosing genetic disease starting from clinical case history and genome sequencing data. DNA samples were obtained from three families with heritable genetic disorders and genomic sequence data were donated by sequencing platform vendors. The challenge was to analyze and interpret these data with the goals of identifying disease-causing variants and reporting the findings in a clinically useful format. Participating contestant groups were solicited broadly, and an independent panel of judges evaluated their performance. Results A total of 30 international groups were engaged. The entries reveal a general convergence of practices on most elements of the analysis and interpretation process. However, even given this commonality of approach, only two groups identified the consensus candidate variants in all disease cases, demonstrating a need for consistent fine-tuning of the generally accepted methods. There was greater diversity of the final clinical report content and in the patient consenting process, demonstrating that these areas require additional exploration and standardization. Conclusions The CLARITY Challenge provides a comprehensive assessment of current practices for using genome sequencing to diagnose and report genetic diseases. There is remarkable convergence in bioinformatic techniques, but medical interpretation and reporting are areas that require further development by many groups. PMID:24667040

  6. Sequence analysis of ORF IV RTBV isolated from tungro infected Oryza sativa L. cv Ciherang

    NASA Astrophysics Data System (ADS)

    Hastilestari, Bernadetta Rina; Astuti, Dwi; Estiati, Amy; Nugroho, Satya

    2015-09-01

    The Effort to increase rice production is often constrained by pest and disease such as Tungro. The Tungro disease is caused by the joint infection with two dissimilar viruses; a bacil-form-DNA virus, the Rice tungro bacilliform virus(RTBV) and the spherical RNA virus, Rice tungro spherical virus (RTSV) and transmitted by Green leafhopper (Nephotettix virescens). The symptom of disease is caused by the presence of RTBV. The genome of RTBV consists of four Open reading frames (ORFs) which encode functional proteins. Of the four, ORF IV is unique because it exists only in RTBV. The most efficient method of generating disease resistance plants is to look for natural sources of resistance genes in wild or germplasm and then transfer the gene and the accompanying resistance in cultivated crop varieties. The aim of this study is, therefore, to isolate and analyze of 1170 bp gene of ORF 4 of Tungro virus isolated from an Indonesian rice cultivar, Ciherang (Oryza sativa L. cv Indica). DNA sequencing analysis using BLAST showed 94% similarity with the reference sequence gen bank Acc.M65026.1. The comparisons and mutation analysis of DNA sequences were discussed in this research.

  7. Genetic discovery in Xylella fastidiosa through sequence analysis of selected randomly amplified polymorphic DNAs.

    PubMed

    Chen, Jianchi; Civerolo, Edwin L; Jarret, Robert L; Van Sluys, Marie-Anne; de Oliveira, Mariana C

    2005-02-01

    Xylella fastidiosa causes many important plant diseases including Pierce's disease (PD) in grape and almond leaf scorch disease (ALSD). DNA-based methodologies, such as randomly amplified polymorphic DNA (RAPD) analysis, have been playing key roles in genetic information collection of the bacterium. This study further analyzed the nucleotide sequences of selected RAPDs from X. fastidiosa strains in conjunction with the available genome sequence databases and unveiled several previously unknown novel genetic traits. These include a sequence highly similar to those in the phage family of Podoviridae. Genome comparisons among X. fastidiosa strains suggested that the "phage" is currently active. Two other RAPDs were also related to horizontal gene transfer: one was part of a broadly distributed cryptic plasmid and the other was associated with conjugal transfer. One RAPD inferred a genomic rearrangement event among X. fastidiosa PD strains and another identified a single nucleotide polymorphism of evolutionary value.

  8. Integrated sequence analysis pipeline provides one-stop solution for identifying disease-causing mutations.

    PubMed

    Hu, Hao; Wienker, Thomas F; Musante, Luciana; Kalscheuer, Vera M; Kahrizi, Kimia; Najmabadi, Hossein; Ropers, H Hilger

    2014-12-01

    Next-generation sequencing has greatly accelerated the search for disease-causing defects, but even for experts the data analysis can be a major challenge. To facilitate the data processing in a clinical setting, we have developed a novel medical resequencing analysis pipeline (MERAP). MERAP assesses the quality of sequencing, and has optimized capacity for calling variants, including single-nucleotide variants, insertions and deletions, copy-number variation, and other structural variants. MERAP identifies polymorphic and known causal variants by filtering against public domain databases, and flags nonsynonymous and splice-site changes. MERAP uses a logistic model to estimate the causal likelihood of a given missense variant. MERAP considers the relevant information such as phenotype and interaction with known disease-causing genes. MERAP compares favorably with GATK, one of the widely used tools, because of its higher sensitivity for detecting indels, its easy installation, and its economical use of computational resources. Upon testing more than 1,200 individuals with mutations in known and novel disease genes, MERAP proved highly reliable, as illustrated here for five families with disease-causing variants. We believe that the clinical implementation of MERAP will expedite the diagnostic process of many disease-causing defects. © 2014 WILEY PERIODICALS, INC.

  9. Selected Insights from Application of Whole Genome Sequencing for Outbreak Investigations

    PubMed Central

    Le, Vien Thi Minh; Diep, Binh An

    2014-01-01

    Purpose of review The advent of high-throughput whole genome sequencing has the potential to revolutionize the conduct of outbreak investigation. Because of its ultimate pathogen strain resolution, whole genome sequencing could augment traditional epidemiologic investigations of infectious disease outbreaks. Recent findings The combination of whole genome sequencing and intensive epidemiologic analysis provided new insights on the sources and transmission dynamics of large-scale epidemics caused by Escherichia coli and Vibrio cholerae, nosocomial outbreaks caused by methicillin-resistant Staphylococcus aureus, Klebsiella pneumonia, and Mycobacterium abscessus, community-centered outbreaks caused by Mycobacterium tuberculosis, and natural disaster-associated outbreak caused by environmentally acquired molds. Summary When combined with traditional epidemiologic investigation, whole genome sequencing has proven useful for elucidating sources and transmission dynamics of disease outbreaks. Development of a fully automated bioinformatics pipeline for analysis of whole genome sequence data is much needed to make this powerful tool more widely accessible. PMID:23856896

  10. Identification of the PLA2G6 c.1579G>A Missense Mutation in Papillon Dog Neuroaxonal Dystrophy Using Whole Exome Sequencing Analysis

    PubMed Central

    Tsuboi, Masaya; Watanabe, Manabu; Nibe, Kazumi; Yoshimi, Natsuko; Kato, Akihisa; Sakaguchi, Masahiro; Yamato, Osamu; Tanaka, Miyuu; Kuwamura, Mitsuru; Kushida, Kazuya; Harada, Tomoyuki; Chambers, James Kenn; Sugano, Sumio; Uchida, Kazuyuki; Nakayama, Hiroyuki

    2017-01-01

    Whole exome sequencing (WES) has become a common tool for identifying genetic causes of human inherited disorders, and it has also recently been applied to canine genome research. We conducted WES analysis of neuroaxonal dystrophy (NAD), a neurodegenerative disease that sporadically occurs worldwide in Papillon dogs. The disease is considered an autosomal recessive monogenic disease, which is histopathologically characterized by severe axonal swelling, known as “spheroids,” throughout the nervous system. By sequencing all eleven DNA samples from one NAD-affected Papillon dog and her parents, two unrelated NAD-affected Papillon dogs, and six unaffected control Papillon dogs, we identified 10 candidate mutations. Among them, three candidates were determined to be “deleterious” by in silico pathogenesis evaluation. By subsequent massive screening by TaqMan genotyping analysis, only the PLA2G6 c.1579G>A mutation had an association with the presence or absence of the disease, suggesting that it may be a causal mutation of canine NAD. As a human homologue of this gene is a causative gene for infantile neuroaxonal dystrophy, this canine phenotype may serve as a good animal model for human disease. The results of this study also indicate that WES analysis is a powerful tool for exploring canine hereditary diseases, especially in rare monogenic hereditary diseases. PMID:28107443

  11. Analysis of the 9p21.3 sequence associated with coronary artery disease reveals a tendency for duplication in a CAD patient

    PubMed Central

    Kouprina, Natalay; Noskov, Vladimir N.; Waterfall, Joshua J.; Walker, Robert L.; Meltzer, Paul S.; Topol, Eric J.; Larionov, Vladimir

    2018-01-01

    Tandem segmental duplications (SDs) greater than 10 kb are widespread in complex genomes. They provide material for gene divergence and evolutionary adaptation, while formation of specific de novo SDs is a hallmark of cancer and some human diseases. Most SDs map to distinct genomic regions termed ‘duplication blocks’. SDs organization within these blocks is often poorly characterized as they are mosaics of ancestral duplicons juxtaposed with younger duplicons arising from more recent duplication events. Structural and functional analysis of SDs is further hampered as long repetitive DNA structures are underrepresented in existing BAC and YAC libraries. We applied Transformation-Associated Recombination (TAR) cloning, a versatile technique for large DNA manipulation, to selectively isolate the coronary artery disease (CAD) interval sequence within the 9p21.3 chromosome locus from a patient with coronary artery disease and normal individuals. Four tandem head-to-tail duplicons, each ∼50 kb long, were recovered in the patient but not in normal individuals. Sequence analysis revealed that the repeats varied by 10-15 SNPs between each other and by 82 SNPs between the human genome sequence (version hg19). SNPs polymorphism within the junctions between repeats allowed two junction types to be distinguished, Type 1 and Type 2, which were found at a 2:1 ratio. The junction sequences contained an Alu element, a sequence previously shown to play a role in duplication. Knowledge of structural variation in the CAD interval from more patients could help link this locus to cardiovascular diseases susceptibility, and maybe relevant to other cases of regional amplification, including cancer. PMID:29632643

  12. The complete genome sequence of a virus associated with cotton blue disease, cotton leafroll dwarf virus, confirms that it is a new member of the genus Polerovirus.

    PubMed

    Distéfano, Ana J; Bonacic Kresic, Ivan; Hopp, H Esteban

    2010-11-01

    Cotton blue disease is the most important virus disease of cotton in the southern part of America. The complete nucleotide sequence of the ssRNA genome of the cotton blue disease-associated virus was determined for the first time. It comprised 5,866 nucleotides, and the deduced genomic organization resembled that of members of the genus Polerovirus. Sequence homology comparison and phylogenetic analysis confirm that this virus (previous proposed name cotton leafroll dwarf virus) is a member of a new species within the genus Polerovirus.

  13. Complete genome sequence of the plant pathogen Erwinia amylovora strain ATCC 49946

    USDA-ARS?s Scientific Manuscript database

    Erwinia amylovora causes the economically important disease fire blight that affects rosaceous plants, especially pear and apple. Here we report the complete genome sequence and annotation of strain ATCC 49946. The analysis of the sequence and its comparison with sequenced genomes of closely related...

  14. [Association of phytoplasma with Bermuda grass white-leaf disease].

    PubMed

    Tan, Weijun; Chen, Yong; Zhang, Wu; Han, Chengchou; Tan, Zhiyuan; Zhang, Juming

    2008-10-01

    Bermuda grass white leaf is an important disease on Bermuda grass all over the world. The aim of this research is to identify the pathogen which leads to Bermuda grass white leaf occurring on the Chinese mainland. PCR amplification technique, sequence analysis and Southern hybridization were used. A 1.3 kb fragment was amplified by PCR phytoplasma universal primers and total DNA sample extracted from ill Bermuda grass as the amplified template. Sequence analysis of the amplified fragment indicated it clustered into Candidatus Phytoplasm Cynodontis. Southern hybridization analysis showed differential cingulums. The pathogen of Bermuda grass white leaf on the Chinese mainland contains phytoplasma, which provides a scientific basis for further identification, prevention and control of the disease.

  15. Applying phylogenetic analysis to viral livestock diseases: moving beyond molecular typing.

    PubMed

    Olvera, Alex; Busquets, Núria; Cortey, Marti; de Deus, Nilsa; Ganges, Llilianne; Núñez, José Ignacio; Peralta, Bibiana; Toskano, Jennifer; Dolz, Roser

    2010-05-01

    Changes in livestock production systems in recent years have altered the presentation of many diseases resulting in the need for more sophisticated control measures. At the same time, new molecular assays have been developed to support the diagnosis of animal viral disease. Nucleotide sequences generated by these diagnostic techniques can be used in phylogenetic analysis to infer phenotypes by sequence homology and to perform molecular epidemiology studies. In this review, some key elements of phylogenetic analysis are highlighted, such as the selection of the appropriate neutral phylogenetic marker, the proper phylogenetic method and different techniques to test the reliability of the resulting tree. Examples are given of current and future applications of phylogenetic reconstructions in viral livestock diseases. Copyright 2009 Elsevier Ltd. All rights reserved.

  16. Complete genome sequence of genotype VI Newcastle disease viruses isolated from pigeons in Pakistan

    USDA-ARS?s Scientific Manuscript database

    Two complete genome sequences of Newcastle disease virus (NDV) are described here. Virulent isolates pigeon/Pakistan/Lahore/21A/2015 and pigeon/Pakistan/Lahore/25A/2015 were obtained from racing pigeons sampled in the Pakistani province of Punjab during 2015. Phylogenetic analysis of the fusion prot...

  17. Complete Genome Sequences of Newcastle Disease Virus Strains Circulating in Chicken Populations of Indonesia

    PubMed Central

    Xiao, Sa; Paldurai, Anandan; Nayak, Baibaswata; Samuel, Arthur; Bharoto, Eny E.; Prajitno, Teguh Y.; Collins, Peter L.

    2012-01-01

    Eight highly virulent Newcastle disease virus (NDV) strains were isolated from vaccinated commercial chickens in Indonesia during outbreaks in 2009 and 2010. The complete genome sequences of two NDV strains and the sequences of the surface protein genes (F and HN) of six other strains were determined. Phylogenetic analysis classified them into two new subgroups of genotype VII in the class II cluster that were genetically distinct from vaccine strains. This is the first report of complete genome sequences of NDV strains isolated from chickens in Indonesia. PMID:22532534

  18. Patient perspectives on whole-genome sequencing for undiagnosed diseases.

    PubMed

    Boeldt, Debra L; Cheung, Cynthia; Ariniello, Lauren; Darst, Burcu F; Topol, Sarah; Schork, Nicholas J; Philis-Tsimikas, Athena; Torkamani, Ali; Fortmann, Addie L; Bloss, Cinnamon S

    2017-01-01

    This study assessed perspectives on whole-genome sequencing (WGS) for rare disease diagnosis and the process of receiving genetic results. Semistructured interviews were conducted with adult patients and parents of minor patients affected by idiopathic diseases (n = 10 cases). Three main themes were identified through qualitative data analysis and interpretation: perceived benefits of WGS; perceived drawbacks of WGS; and perceptions of the return of results from WGS. Findings suggest that patients and their families have important perspectives on the use of WGS in diagnostic odyssey cases. These perspectives could inform clinical sequencing research study designs as well as the appropriate deployment of patient and family support services in the context of clinical genome sequencing.

  19. Multiplexed resequencing analysis to identify rare variants in pooled DNA with barcode indexing using next-generation sequencer.

    PubMed

    Mitsui, Jun; Fukuda, Yoko; Azuma, Kyo; Tozaki, Hirokazu; Ishiura, Hiroyuki; Takahashi, Yuji; Goto, Jun; Tsuji, Shoji

    2010-07-01

    We have recently found that multiple rare variants of the glucocerebrosidase gene (GBA) confer a robust risk for Parkinson disease, supporting the 'common disease-multiple rare variants' hypothesis. To develop an efficient method of identifying rare variants in a large number of samples, we applied multiplexed resequencing using a next-generation sequencer to identification of rare variants of GBA. Sixteen sets of pooled DNAs from six pooled DNA samples were prepared. Each set of pooled DNAs was subjected to polymerase chain reaction to amplify the target gene (GBA) covering 6.5 kb, pooled into one tube with barcode indexing, and then subjected to extensive sequence analysis using the SOLiD System. Individual samples were also subjected to direct nucleotide sequence analysis. With the optimization of data processing, we were able to extract all the variants from 96 samples with acceptable rates of false-positive single-nucleotide variants.

  20. Comprehensive Rare Variant Analysis via Whole-Genome Sequencing to Determine the Molecular Pathology of Inherited Retinal Disease.

    PubMed

    Carss, Keren J; Arno, Gavin; Erwood, Marie; Stephens, Jonathan; Sanchis-Juan, Alba; Hull, Sarah; Megy, Karyn; Grozeva, Detelina; Dewhurst, Eleanor; Malka, Samantha; Plagnol, Vincent; Penkett, Christopher; Stirrups, Kathleen; Rizzo, Roberta; Wright, Genevieve; Josifova, Dragana; Bitner-Glindzicz, Maria; Scott, Richard H; Clement, Emma; Allen, Louise; Armstrong, Ruth; Brady, Angela F; Carmichael, Jenny; Chitre, Manali; Henderson, Robert H H; Hurst, Jane; MacLaren, Robert E; Murphy, Elaine; Paterson, Joan; Rosser, Elisabeth; Thompson, Dorothy A; Wakeling, Emma; Ouwehand, Willem H; Michaelides, Michel; Moore, Anthony T; Webster, Andrew R; Raymond, F Lucy

    2017-01-05

    Inherited retinal disease is a common cause of visual impairment and represents a highly heterogeneous group of conditions. Here, we present findings from a cohort of 722 individuals with inherited retinal disease, who have had whole-genome sequencing (n = 605), whole-exome sequencing (n = 72), or both (n = 45) performed, as part of the NIHR-BioResource Rare Diseases research study. We identified pathogenic variants (single-nucleotide variants, indels, or structural variants) for 404/722 (56%) individuals. Whole-genome sequencing gives unprecedented power to detect three categories of pathogenic variants in particular: structural variants, variants in GC-rich regions, which have significantly improved coverage compared to whole-exome sequencing, and variants in non-coding regulatory regions. In addition to previously reported pathogenic regulatory variants, we have identified a previously unreported pathogenic intronic variant in CHM in two males with choroideremia. We have also identified 19 genes not previously known to be associated with inherited retinal disease, which harbor biallelic predicted protein-truncating variants in unsolved cases. Whole-genome sequencing is an increasingly important comprehensive method with which to investigate the genetic causes of inherited retinal disease. Copyright © 2017. Published by Elsevier Inc.

  1. Helicos BioSciences.

    PubMed

    Milos, Patrice

    2008-04-01

    Helicos BioSciences Corporation is a life sciences company developing revolutionary new single molecule sequencing technology to provide the path to the US$1000 genome. True Single Molecule Sequencing (tSMS) will drive advancements in pharmacogenomics that can enable a better understanding of an individual's susceptibility to disease, develop more effective disease diagnoses and differentiate response to disease therapies. During 2007, genome-wide disease-association studies, the encylopedia of DNA elements (ENCODE) and the published genome sequence of two individuals have revealed human genome variation far more extensive than originally believed. These also demonstrated that common variations explain only a fraction of the genetic basis of disease. Therefore, the capability to understand an individual genome is critical in setting the foundation for the next great revolution in healthcare. Helicos is committed to this vision and will provide cost-effective genome sequencing and comprehensive analysis of the transcribed genome that can unlock the era of personalized healthcare.

  2. Validation of Metagenomic Next-Generation Sequencing Tests for Universal Pathogen Detection.

    PubMed

    Schlaberg, Robert; Chiu, Charles Y; Miller, Steve; Procop, Gary W; Weinstock, George

    2017-06-01

    - Metagenomic sequencing can be used for detection of any pathogens using unbiased, shotgun next-generation sequencing (NGS), without the need for sequence-specific amplification. Proof-of-concept has been demonstrated in infectious disease outbreaks of unknown causes and in patients with suspected infections but negative results for conventional tests. Metagenomic NGS tests hold great promise to improve infectious disease diagnostics, especially in immunocompromised and critically ill patients. - To discuss challenges and provide example solutions for validating metagenomic pathogen detection tests in clinical laboratories. A summary of current regulatory requirements, largely based on prior guidance for NGS testing in constitutional genetics and oncology, is provided. - Examples from 2 separate validation studies are provided for steps from assay design, and validation of wet bench and bioinformatics protocols, to quality control and assurance. - Although laboratory and data analysis workflows are still complex, metagenomic NGS tests for infectious diseases are increasingly being validated in clinical laboratories. Many parallels exist to NGS tests in other fields. Nevertheless, specimen preparation, rapidly evolving data analysis algorithms, and incomplete reference sequence databases are idiosyncratic to the field of microbiology and often overlooked.

  3. E-Learning for Rare Diseases: An Example Using Fabry Disease.

    PubMed

    Cimmaruta, Chiara; Liguori, Ludovica; Monticelli, Maria; Andreotti, Giuseppina; Citro, Valentina

    2017-09-24

    Rare diseases represent a challenge for physicians because patients are rarely seen, and they can manifest with symptoms similar to those of common diseases. In this work, genetic confirmation of diagnosis is derived from DNA sequencing. We present a tutorial for the molecular analysis of a rare disease using Fabry disease as an example. An exonic sequence derived from a hypothetical male patient was matched against human reference data using a genome browser. The missense mutation was identified by running BlastX, and information on the affected protein was retrieved from the database UniProt. The pathogenic nature of the mutation was assessed with PolyPhen-2. Disease-specific databases were used to assess whether the missense mutation led to a severe phenotype, and whether pharmacological therapy was an option. An inexpensive bioinformatics approach is presented to get the reader acquainted with the diagnosis of Fabry disease. The reader is introduced to the field of pharmacological chaperones, a therapeutic approach that can be applied only to certain Fabry genotypes. The principle underlying the analysis of exome sequencing can be explained in simple terms using web applications and databases which facilitate diagnosis and therapeutic choices.

  4. Suppressive subtractive hybridization approach revealed differential expression of hypersensitive response and reactive oxygen species production genes in tea (Camellia sinensis (L.) O. Kuntze) leaves during Pestalotiopsis thea infection.

    PubMed

    Senthilkumar, Palanisamy; Thirugnanasambantham, Krishnaraj; Mandal, Abul Kalam Azad

    2012-12-01

    Tea (Camellia sinensis (L.) O. Kuntze) is an economically important plant cultivated for its leaves. Infection of Pestalotiopsis theae in leaves causes gray blight disease and enormous loss to the tea industry. We used suppressive subtractive hybridization (SSH) technique to unravel the differential gene expression pattern during gray blight disease development in tea. Complementary DNA from P. theae-infected and uninfected leaves of disease tolerant cultivar UPASI-10 was used as tester and driver populations respectively. Subtraction efficiency was confirmed by comparing abundance of β-actin gene. A total of 377 and 720 clones with insert size >250 bp from forward and reverse library respectively were sequenced and analyzed. Basic Local Alignment Search Tool analysis revealed 17 sequences in forward SSH library have high degree of similarity with disease and hypersensitive response related genes and 20 sequences with hypothetical proteins while in reverse SSH library, 23 sequences have high degree of similarity with disease and stress response-related genes and 15 sequences with hypothetical proteins. Functional analysis indicated unknown (61 and 59 %) or hypothetical functions (23 and 18 %) for most of the differentially regulated genes in forward and reverse SSH library, respectively, while others have important role in different cellular activities. Majority of the upregulated genes are related to hypersensitive response and reactive oxygen species production. Based on these expressed sequence tag data, putative role of differentially expressed genes were discussed in relation to disease. We also demonstrated the efficiency of SSH as a tool in enriching gray blight disease related up- and downregulated genes in tea. The present study revealed that many genes related to disease resistance were suppressed during P. theae infection and enhancing these genes by the application of inducers may impart better disease tolerance to the plants.

  5. Resolving the Origin of Rabbit Hemorrhagic Disease Virus: Insights from an Investigation of the Viral Stocks Released in Australia

    PubMed Central

    Eden, John-Sebastian; Read, Andrew J.; Duckworth, Janine A.; Strive, Tanja

    2015-01-01

    To resolve the evolutionary history of rabbit hemorrhagic disease virus (RHDV), we performed a genomic analysis of the viral stocks imported and released as a biocontrol measure in Australia, as well as a global phylogenetic analysis. Importantly, conflicts were identified between the sequences determined here and those previously published that may have affected evolutionary rate estimates. By removing likely erroneous sequences, we show that RHDV emerged only shortly before its initial description in China. PMID:26378178

  6. Complete genomic sequence of a tobacco rattle virus isolate from Michigan-grown potatoes

    USDA-ARS?s Scientific Manuscript database

    Tobacco rattle virus (TRV) causes stem mottle on potato leaves and necrotic arcs and rings in potato tubers, known as corky ringspot disease. Recently, TRV was reported in Michigan potato tubers cv. FL1879 exhibiting corky ringspot disease. Sequence analysis of the RNA-1-encoded 16 kDa gene of the...

  7. Identification of a disease complex involving a novel monopartite begomovirus with beta- and alphasatellites associated with okra leaf curl disease in Oman.

    PubMed

    Akhtar, Sohail; Khan, Akhtar J; Singh, Achuit S; Briddon, Rob W

    2014-05-01

    Okra leaf curl disease (OLCD) is an important viral disease of okra in tropical and subtropical areas. The disease is caused by begomovirus-satellite complexes. A begomovirus and associated betasatellite and alphasatellite were identified in symptomatic okra plants from Barka, in the Al-Batinah region of Oman. Analysis of the begomovirus sequences showed them to represent a new begomovirus most closely related to cotton leaf curl Gezira virus (CLCuGeV), a begomovirus of African origin. The sequences showed less than 85 % nucleotide sequence identity to CLCuGeV isolates. The name okra leaf curl Oman virus (OLCOMV) is proposed for the new virus. Further analysis revealed that the OLCOMV is a recombinant begomovirus that evolved by the recombination of CLCuGeV isolates with tomato yellow leaf curl virus-Oman (TYLCV-OM). An alpha- and a betasatellite were also identified from the same plant sample, which were also unique when compared to sequences available in the databases. However, although the betasatellite appeared to be of African origin, the alphasatellite was most closely related to alphasatellites originating from South Asia. This is the first report of a begomovirus-satellite complex infecting okra in Oman.

  8. A survey of tools for variant analysis of next-generation genome sequencing data

    PubMed Central

    Pabinger, Stephan; Dander, Andreas; Fischer, Maria; Snajder, Rene; Sperk, Michael; Efremova, Mirjana; Krabichler, Birgit; Speicher, Michael R.; Zschocke, Johannes

    2014-01-01

    Recent advances in genome sequencing technologies provide unprecedented opportunities to characterize individual genomic landscapes and identify mutations relevant for diagnosis and therapy. Specifically, whole-exome sequencing using next-generation sequencing (NGS) technologies is gaining popularity in the human genetics community due to the moderate costs, manageable data amounts and straightforward interpretation of analysis results. While whole-exome and, in the near future, whole-genome sequencing are becoming commodities, data analysis still poses significant challenges and led to the development of a plethora of tools supporting specific parts of the analysis workflow or providing a complete solution. Here, we surveyed 205 tools for whole-genome/whole-exome sequencing data analysis supporting five distinct analytical steps: quality assessment, alignment, variant identification, variant annotation and visualization. We report an overview of the functionality, features and specific requirements of the individual tools. We then selected 32 programs for variant identification, variant annotation and visualization, which were subjected to hands-on evaluation using four data sets: one set of exome data from two patients with a rare disease for testing identification of germline mutations, two cancer data sets for testing variant callers for somatic mutations, copy number variations and structural variations, and one semi-synthetic data set for testing identification of copy number variations. Our comprehensive survey and evaluation of NGS tools provides a valuable guideline for human geneticists working on Mendelian disorders, complex diseases and cancers. PMID:23341494

  9. Whole-Genome Sequencing in Outbreak Analysis

    PubMed Central

    Turner, Stephen D.; Riley, Margaret F.; Petri, William A.; Hewlett, Erik L.

    2015-01-01

    SUMMARY In addition to the ever-present concern of medical professionals about epidemics of infectious diseases, the relative ease of access and low cost of obtaining, producing, and disseminating pathogenic organisms or biological toxins mean that bioterrorism activity should also be considered when facing a disease outbreak. Utilization of whole-genome sequencing (WGS) in outbreak analysis facilitates the rapid and accurate identification of virulence factors of the pathogen and can be used to identify the path of disease transmission within a population and provide information on the probable source. Molecular tools such as WGS are being refined and advanced at a rapid pace to provide robust and higher-resolution methods for identifying, comparing, and classifying pathogenic organisms. If these methods of pathogen characterization are properly applied, they will enable an improved public health response whether a disease outbreak was initiated by natural events or by accidental or deliberate human activity. The current application of next-generation sequencing (NGS) technology to microbial WGS and microbial forensics is reviewed. PMID:25876885

  10. Determination of disease phenotypes and pathogenic variants from exome sequence data in the CAGI 4 gene panel challenge.

    PubMed

    Kundu, Kunal; Pal, Lipika R; Yin, Yizhou; Moult, John

    2017-09-01

    The use of gene panel sequence for diagnostic and prognostic testing is now widespread, but there are so far few objective tests of methods to interpret these data. We describe the design and implementation of a gene panel sequencing data analysis pipeline (VarP) and its assessment in a CAGI4 community experiment. The method was applied to clinical gene panel sequencing data of 106 patients, with the goal of determining which of 14 disease classes each patient has and the corresponding causative variant(s). The disease class was correctly identified for 36 cases, including 10 where the original clinical pipeline did not find causative variants. For a further seven cases, we found strong evidence of an alternative disease to that tested. Many of the potentially causative variants are missense, with no previous association with disease, and these proved the hardest to correctly assign pathogenicity or otherwise. Post analysis showed that three-dimensional structure data could have helped for up to half of these cases. Over-reliance on HGMD annotation led to a number of incorrect disease assignments. We used a largely ad hoc method to assign probabilities of pathogenicity for each variant, and there is much work still to be done in this area. © 2017 The Authors. **Human Mutation published by Wiley Periodicals, Inc.

  11. Identification and complete genome sequence analysis of a genotype XIV Newcastle disease virus from Nigeria

    USDA-ARS?s Scientific Manuscript database

    The first complete genome sequence of a strain of Newcastle disease virus from genotype XIV is reported here. Strain duck/Nigeria/NG-695/KG.LOM.11-16/2009 was isolated from an apparently healthy domestic duck from a live bird market in Kogi State, Nigeria, in 2009. This strain is classified as a m...

  12. Complete genome sequence of a genotype XVII Newcastle disease virus, isolated from an apparently healthy domestic duck in Nigeria

    USDA-ARS?s Scientific Manuscript database

    The first complete genome sequence of a strain of Newcastle disease virus (NDV) of genotype XVII is described here. A velogenic strain (duck/Nigeria/903/KUDU-113/1992) was isolated from an apparently healthy free-roaming domestic duck sampled in Kuru, Nigeria, in 1992. Phylogenetic analysis of the f...

  13. Complete Genome Sequence of a Highly Virulent Newcastle Disease Virus Currently Circulating in Mexico

    PubMed Central

    Xiao, Sa; Paldurai, Anandan; Nayak, Baibaswata; Mirande, Armando; Collins, Peter L.

    2013-01-01

    The complete genome sequence was determined for a highly virulent Newcastle disease virus strain from vaccinated chicken farms in Mexico during outbreaks in 2010. On the basis of phylogenetic analysis this strain was classified into genotype V in the class II cluster that was closely related to Mexican strains that appeared in 2004–2006. PMID:23409252

  14. Phylogenetic analysis and molecular methods for the detection of lymphocystis disease virus from yellow perch, Perca flavescens (Mitchell).

    PubMed

    Palmer, L J; Hogan, N S; van den Heuvel, M R

    2012-09-01

    Lymphocystis disease is a prevalent, non-fatal disease that affects many teleost fish and is caused by the DNA virus lymphocystis disease virus (LCDV). Lymphocystis-like lesions have been observed in yellow perch, Perca flavescens (Mitchell), in lakes in northern Alberta, Canada. In an effort to confirm the identity of the virus causing these lesions, DNA was extracted from these lesions and PCR with genotype generic LCDV primers specific to the major capsid protein (MCP) gene was performed. A 1357-base pair nucleotide sequence corresponding to a peptide length of 452 amino acids of the MCP gene was sequenced, confirming the lesions as being lymphocystis disease lesions. Phylogenetic analysis of the generated amino acid sequence revealed the perch LCDV isolate to be a distinct and novel genotype. From the obtained sequence, a real-time PCR identification method was developed using fluorgenic LUX primers. The identification method was used to detect the presence/absence of LCDV in yellow perch from two lakes, one where lymphocystis disease was observed to occur and the other where the disease had not been observed. All samples of fin, spleen and liver tested negative for LCDV in the lake where lymphocystis disease had not been observed. The second lake had a 2.6% incidence of LCD, and virus was detected in tissue samples from all individuals tested regardless of whether they were expressing the disease or not. However, estimated viral copy number in spleen and liver of symptomatic perch was four orders of magnitude higher than that in asymptomatic perch. © 2012 Blackwell Publishing Ltd.

  15. Identification and nucleotide sequence analysis of the repetitive DNA element in the genome of fish lymphocystis disease virus.

    PubMed

    Schnitzler, P; Delius, H; Scholz, J; Touray, M; Orth, E; Darai, G

    1987-12-01

    The genome of the fish lymphocystis disease virus (FLDV) was screened for the existence of repetitive DNA sequences using a defined and complete gene library of the viral genome (98 kbp) by DNA-DNA hybridization, heteroduplex analysis, and restriction fine mapping. A repetitive DNA sequence was detected at the coordinates 0.034 to 0.057 and 0.718 to 0.736 map units (m.u.) of the FLDV genome. The first region (0.034 to 0.057 m.u.) corresponds to the 5' terminus of the EcoRI FLDV DNA fragment B (0.034 to 0.165 m.u.) and the second region (0.718 to 0.736 m.u.) is identical to the EcoRI DNA fragment M of the viral genome. The DNA nucleotide sequence of the EcoRI FLDV DNA fragment M was determined. This analysis revealed the presence of many short direct and inverted repetitions, e.g., a 18-mer direct repetition (TTTAAAATTTAATTAA) that started at nucleotide positions 812 and 942 and a 14-mer inverted repeat (TTAAATTTAAATTT) at nucleotide positions 820 and 959. Only short open reading frames were detected within this region. The DNA repetitions are discussed as sequences that play a possible regulatory role for virus replication. Furthermore, hybridization experiments revealed that the repetitive DNA sequences are conserved in the genome of different strains of fish lymphocystis disease virus isolated from two species of Pleuronectidae (flounder and dab).

  16. Mitochondrial Disease Sequence Data Resource (MSeqDR): a global grass-roots consortium to facilitate deposition, curation, annotation, and integrated analysis of genomic data for the mitochondrial disease clinical and research communities.

    PubMed

    Falk, Marni J; Shen, Lishuang; Gonzalez, Michael; Leipzig, Jeremy; Lott, Marie T; Stassen, Alphons P M; Diroma, Maria Angela; Navarro-Gomez, Daniel; Yeske, Philip; Bai, Renkui; Boles, Richard G; Brilhante, Virginia; Ralph, David; DaRe, Jeana T; Shelton, Robert; Terry, Sharon F; Zhang, Zhe; Copeland, William C; van Oven, Mannis; Prokisch, Holger; Wallace, Douglas C; Attimonelli, Marcella; Krotoski, Danuta; Zuchner, Stephan; Gai, Xiaowu

    2015-03-01

    Success rates for genomic analyses of highly heterogeneous disorders can be greatly improved if a large cohort of patient data is assembled to enhance collective capabilities for accurate sequence variant annotation, analysis, and interpretation. Indeed, molecular diagnostics requires the establishment of robust data resources to enable data sharing that informs accurate understanding of genes, variants, and phenotypes. The "Mitochondrial Disease Sequence Data Resource (MSeqDR) Consortium" is a grass-roots effort facilitated by the United Mitochondrial Disease Foundation to identify and prioritize specific genomic data analysis needs of the global mitochondrial disease clinical and research community. A central Web portal (https://mseqdr.org) facilitates the coherent compilation, organization, annotation, and analysis of sequence data from both nuclear and mitochondrial genomes of individuals and families with suspected mitochondrial disease. This Web portal provides users with a flexible and expandable suite of resources to enable variant-, gene-, and exome-level sequence analysis in a secure, Web-based, and user-friendly fashion. Users can also elect to share data with other MSeqDR Consortium members, or even the general public, either by custom annotation tracks or through the use of a convenient distributed annotation system (DAS) mechanism. A range of data visualization and analysis tools are provided to facilitate user interrogation and understanding of genomic, and ultimately phenotypic, data of relevance to mitochondrial biology and disease. Currently available tools for nuclear and mitochondrial gene analyses include an MSeqDR GBrowse instance that hosts optimized mitochondrial disease and mitochondrial DNA (mtDNA) specific annotation tracks, as well as an MSeqDR locus-specific database (LSDB) that curates variant data on more than 1300 genes that have been implicated in mitochondrial disease and/or encode mitochondria-localized proteins. MSeqDR is integrated with a diverse array of mtDNA data analysis tools that are both freestanding and incorporated into an online exome-level dataset curation and analysis resource (GEM.app) that is being optimized to support needs of the MSeqDR community. In addition, MSeqDR supports mitochondrial disease phenotyping and ontology tools, and provides variant pathogenicity assessment features that enable community review, feedback, and integration with the public ClinVar variant annotation resource. A centralized Web-based informed consent process is being developed, with implementation of a Global Unique Identifier (GUID) system to integrate data deposited on a given individual from different sources. Community-based data deposition into MSeqDR has already begun. Future efforts will enhance capabilities to incorporate phenotypic data that enhance genomic data analyses. MSeqDR will fill the existing void in bioinformatics tools and centralized knowledge that are necessary to enable efficient nuclear and mtDNA genomic data interpretation by a range of shareholders across both clinical diagnostic and research settings. Ultimately, MSeqDR is focused on empowering the global mitochondrial disease community to better define and explore mitochondrial diseases. Copyright © 2014 Elsevier Inc. All rights reserved.

  17. Mitochondrial Disease Sequence Data Resource (MSeqDR): A global grass-roots consortium to facilitate deposition, curation, annotation, and integrated analysis of genomic data for the mitochondrial disease clinical and research communities

    PubMed Central

    Falk, Marni J.; Shen, Lishuang; Gonzalez, Michael; Leipzig, Jeremy; Lott, Marie T.; Stassen, Alphons P.M.; Diroma, Maria Angela; Navarro-Gomez, Daniel; Yeske, Philip; Bai, Renkui; Boles, Richard G.; Brilhante, Virginia; Ralph, David; DaRe, Jeana T.; Shelton, Robert; Terry, Sharon; Zhang, Zhe; Copeland, William C.; van Oven, Mannis; Prokisch, Holger; Wallace, Douglas C.; Attimonelli, Marcella; Krotoski, Danuta; Zuchner, Stephan; Gai, Xiaowu

    2014-01-01

    Success rates for genomic analyses of highly heterogeneous disorders can be greatly improved if a large cohort of patient data is assembled to enhance collective capabilities for accurate sequence variant annotation, analysis, and interpretation. Indeed, molecular diagnostics requires the establishment of robust data resources to enable data sharing that informs accurate understanding of genes, variants, and phenotypes. The “Mitochondrial Disease Sequence Data Resource (MSeqDR) Consortium” is a grass-roots effort facilitated by the United Mitochondrial Disease Foundation to identify and prioritize specific genomic data analysis needs of the global mitochondrial disease clinical and research community. A central Web portal (https://mseqdr.org) facilitates the coherent compilation, organization, annotation, and analysis of sequence data from both nuclear and mitochondrial genomes of individuals and families with suspected mitochondrial disease. This Web portal provides users with a flexible and expandable suite of resources to enable variant-, gene-, and exome-level sequence analysis in a secure, Web-based, and user-friendly fashion. Users can also elect to share data with other MSeqDR Consortium members, or even the general public, either by custom annotation tracks or through use of a convenient distributed annotation system (DAS) mechanism. A range of data visualization and analysis tools are provided to facilitate user interrogation and understanding of genomic, and ultimately phenotypic, data of relevance to mitochondrial biology and disease. Currently available tools for nuclear and mitochondrial gene analyses include an MSeqDR GBrowse instance that hosts optimized mitochondrial disease and mitochondrial DNA (mtDNA) specific annotation tracks, as well as an MSeqDR locus-specific database (LSDB) that curates variant data on more than 1,300 genes that have been implicated in mitochondrial disease and/or encode mitochondria-localized proteins. MSeqDR is integrated with a diverse array of mtDNA data analysis tools that are both freestanding and incorporated into an online exome-level dataset curation and analysis resource (GEM.app) that is being optimized to support needs of the MSeqDR community. In addition, MSeqDR supports mitochondrial disease phenotyping and ontology tools, and provides variant pathogenicity assessment features that enable community review, feedback, and integration with the public ClinVar variant annotation resource. A centralized Web-based informed consent process is being developed, with implementation of a Global Unique Identifier (GUID) system to integrate data deposited on a given individual from different sources. Community-based data deposition into MSeqDR has already begun. Future efforts will enhance capabilities to incorporate phenotypic data that enhance genomic data analyses. MSeqDR will fill the existing void in bioinformatics tools and centralized knowledge that are necessary to enable efficient nuclear and mtDNA genomic data interpretation by a range of shareholders across both clinical diagnostic and research settings. Ultimately, MSeqDR is focused on empowering the global mitochondrial disease community to better define and explore mitochondrial disease. PMID:25542617

  18. Spatiotemporal Phylogenetic Analysis and Molecular Characterisation of Infectious Bursal Disease Viruses Based on the VP2 Hyper-Variable Region

    PubMed Central

    Dolz, Roser; Valle, Rosa; Perera, Carmen L.; Bertran, Kateri; Frías, Maria T.; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J.

    2013-01-01

    Background Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Methodology/Principal Findings Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. Conclusions/Significance To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide. PMID:23805195

  19. Spatiotemporal Phylogenetic Analysis and Molecular Characterisation of Infectious Bursal Disease Viruses Based on the VP2 Hyper-Variable Region.

    PubMed

    Alfonso-Morales, Abdulahi; Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L; Bertran, Kateri; Frías, Maria T; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J

    2013-01-01

    Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide.

  20. Dynamic assessment of microbial ecology (DAME): A shiny app for analysis and visualization of microbial sequencing data

    USDA-ARS?s Scientific Manuscript database

    A new renaissance in knowledge about the role of commensal microbiota in health and disease is well underway facilitated by culture-independent sequencing technologies; however, microbial sequencing data poses new challenges (e.g., taxonomic hierarchy, overdispersion) not generally seen in more trad...

  1. A core microbiome associated with the peritoneal tumors of pseudomyxoma peritonei

    PubMed Central

    2013-01-01

    Background Pseudomyxoma peritonei (PMP) is a malignancy characterized by dissemination of mucus-secreting cells throughout the peritoneum. This disease is associated with significant morbidity and mortality and despite effective treatment options for early-stage disease, patients with PMP often relapse. Thus, there is a need for additional treatment options to reduce relapse rate and increase long-term survival. A previous study identified the presence of both typed and non-culturable bacteria associated with PMP tissue and determined that increased bacterial density was associated with more severe disease. These findings highlighted the possible role for bacteria in PMP disease. Methods To more clearly define the bacterial communities associated with PMP disease, we employed a sequenced-based analysis to profile the bacterial populations found in PMP tumor and mucin tissue in 11 patients. Sequencing data were confirmed by in situ hybridization at multiple taxonomic depths and by culturing. A pilot clinical study was initiated to determine whether the addition of antibiotic therapy affected PMP patient outcome. Main results We determined that the types of bacteria present are highly conserved in all PMP patients; the dominant phyla are the Proteobacteria, Actinobacteria, Firmicutes and Bacteroidetes. A core set of taxon-specific sequences were found in all 11 patients; many of these sequences were classified into taxonomic groups that also contain known human pathogens. In situ hybridization directly confirmed the presence of bacteria in PMP at multiple taxonomic depths and supported our sequence-based analysis. Furthermore, culturing of PMP tissue samples allowed us to isolate 11 different bacterial strains from eight independent patients, and in vitro analysis of subset of these isolates suggests that at least some of these strains may interact with the PMP-associated mucin MUC2. Finally, we provide evidence suggesting that targeting these bacteria with antibiotic treatment may increase the survival of PMP patients. Conclusions Using 16S amplicon-based sequencing, direct in situ hybridization analysis and culturing methods, we have identified numerous bacterial taxa that are consistently present in all PMP patients tested. Combined with data from a pilot clinical study, these data support the hypothesis that adding antimicrobials to the standard PMP treatment could improve PMP patient survival. PMID:23844722

  2. Impact of exome sequencing in inflammatory bowel disease

    PubMed Central

    Cardinale, Christopher J; Kelsen, Judith R; Baldassano, Robert N; Hakonarson, Hakon

    2013-01-01

    Approaches to understanding the genetic contribution to inflammatory bowel disease (IBD) have continuously evolved from family- and population-based epidemiology, to linkage analysis, and most recently, to genome-wide association studies (GWAS). The next stage in this evolution seems to be the sequencing of the exome, that is, the regions of the human genome which encode proteins. The GWAS approach has been very fruitful in identifying at least 163 loci as being associated with IBD, and now, exome sequencing promises to take our genetic understanding to the next level. In this review we will discuss the possible contributions that can be made by an exome sequencing approach both at the individual patient level to aid with disease diagnosis and future therapies, as well as in advancing knowledge of the pathogenesis of IBD. PMID:24187447

  3. Exome sequencing links corticospinal motor neuron disease to common neurodegenerative disorders.

    PubMed

    Novarino, Gaia; Fenstermaker, Ali G; Zaki, Maha S; Hofree, Matan; Silhavy, Jennifer L; Heiberg, Andrew D; Abdellateef, Mostafa; Rosti, Basak; Scott, Eric; Mansour, Lobna; Masri, Amira; Kayserili, Hulya; Al-Aama, Jumana Y; Abdel-Salam, Ghada M H; Karminejad, Ariana; Kara, Majdi; Kara, Bulent; Bozorgmehri, Bita; Ben-Omran, Tawfeg; Mojahedi, Faezeh; El Din Mahmoud, Iman Gamal; Bouslam, Naima; Bouhouche, Ahmed; Benomar, Ali; Hanein, Sylvain; Raymond, Laure; Forlani, Sylvie; Mascaro, Massimo; Selim, Laila; Shehata, Nabil; Al-Allawi, Nasir; Bindu, P S; Azam, Matloob; Gunel, Murat; Caglayan, Ahmet; Bilguvar, Kaya; Tolun, Aslihan; Issa, Mahmoud Y; Schroth, Jana; Spencer, Emily G; Rosti, Rasim O; Akizu, Naiara; Vaux, Keith K; Johansen, Anide; Koh, Alice A; Megahed, Hisham; Durr, Alexandra; Brice, Alexis; Stevanin, Giovanni; Gabriel, Stacy B; Ideker, Trey; Gleeson, Joseph G

    2014-01-31

    Hereditary spastic paraplegias (HSPs) are neurodegenerative motor neuron diseases characterized by progressive age-dependent loss of corticospinal motor tract function. Although the genetic basis is partly understood, only a fraction of cases can receive a genetic diagnosis, and a global view of HSP is lacking. By using whole-exome sequencing in combination with network analysis, we identified 18 previously unknown putative HSP genes and validated nearly all of these genes functionally or genetically. The pathways highlighted by these mutations link HSP to cellular transport, nucleotide metabolism, and synapse and axon development. Network analysis revealed a host of further candidate genes, of which three were mutated in our cohort. Our analysis links HSP to other neurodegenerative disorders and can facilitate gene discovery and mechanistic understanding of disease.

  4. [Methods, challenges and opportunities for big data analyses of microbiome].

    PubMed

    Sheng, Hua-Fang; Zhou, Hong-Wei

    2015-07-01

    Microbiome is a novel research field related with a variety of chronic inflamatory diseases. Technically, there are two major approaches to analysis of microbiome: metataxonome by sequencing the 16S rRNA variable tags, and metagenome by shot-gun sequencing of the total microbial (mainly bacterial) genome mixture. The 16S rRNA sequencing analyses pipeline includes sequence quality control, diversity analyses, taxonomy and statistics; metagenome analyses further includes gene annotation and functional analyses. With the development of the sequencing techniques, the cost of sequencing will decrease, and big data analyses will become the central task. Data standardization, accumulation, modeling and disease prediction are crucial for future exploit of these data. Meanwhile, the information property in these data, and the functional verification with culture-dependent and culture-independent experiments remain the focus in future research. Studies of human microbiome will bring a better understanding of the relations between the human body and the microbiome, especially in the context of disease diagnosis and therapy, which promise rich research opportunities.

  5. Targeted Analysis of Whole Genome Sequence Data to Diagnose Genetic Cardiomyopathy

    DOE PAGES

    Golbus, Jessica R.; Puckelwartz, Megan J.; Dellefave-Castillo, Lisa; ...

    2014-09-01

    Background—Cardiomyopathy is highly heritable but genetically diverse. At present, genetic testing for cardiomyopathy uses targeted sequencing to simultaneously assess the coding regions of more than 50 genes. New genes are routinely added to panels to improve the diagnostic yield. With the anticipated $1000 genome, it is expected that genetic testing will shift towards comprehensive genome sequencing accompanied by targeted gene analysis. Therefore, we assessed the reliability of whole genome sequencing and targeted analysis to identify cardiomyopathy variants in 11 subjects with cardiomyopathy. Methods and Results—Whole genome sequencing with an average of 37× coverage was combined with targeted analysis focused onmore » 204 genes linked to cardiomyopathy. Genetic variants were scored using multiple prediction algorithms combined with frequency data from public databases. This pipeline yielded 1-14 potentially pathogenic variants per individual. Variants were further analyzed using clinical criteria and/or segregation analysis. Three of three previously identified primary mutations were detected by this analysis. In six subjects for whom the primary mutation was previously unknown, we identified mutations that segregated with disease, had clinical correlates, and/or had additional pathological correlation to provide evidence for causality. For two subjects with previously known primary mutations, we identified additional variants that may act as modifiers of disease severity. In total, we identified the likely pathological mutation in 9 of 11 (82%) subjects. We conclude that these pilot data demonstrate that ~30-40× coverage whole genome sequencing combined with targeted analysis is feasible and sensitive to identify rare variants in cardiomyopathy-associated genes.« less

  6. Evidence for a close phylogenetic relationship between Melissococcus pluton, the causative agent of European foulbrood disease, and the genus Enterococcus.

    PubMed

    Cai, J; Collins, M D

    1994-04-01

    The 16S rRNA gene sequence of Melissococcus pluton, the causative agent of European foulbrood disease, was determined in order to investigate the phylogenetic relationships between this organism and other low-G + C-content gram-positive bacteria. A comparative sequence analysis revealed that M. pluton is a close phylogenetic relative of the genus Enterococcus.

  7. Whole-genome sequence analysis of Zika virus, amplified from urine of traveler from the Philippines.

    PubMed

    Gu, Se Hun; Song, Dong Hyun; Lee, Daesang; Jang, Jeyoun; Kim, Min Young; Jung, Jaehun; Woo, Koung In; Kim, Mirang; Seog, Woong; Oh, Hong Sang; Choi, Byung Seop; Ahn, Jong-Seong; Park, Quehn; Jeong, Seong Tae

    2017-12-01

    Zika virus (ZIKV) (genus Flavivirus, family Flaviviridae) is an emerging pathogen associated with microcephaly and Guillain-Barré syndrome. The rapid spread of ZIKV disease in over 60 countries and the large numbers of travel-associated cases have caused worldwide concern. Thus, intensified surveillance of cases among immigrants and tourists from ZIKV-endemic areas is important for disease control and prevention. In this study, using Next Generation Sequencing, we reported the first whole-genome sequence of ZIKV strain AFMC-U, amplified from the urine of a traveler returning to Korea from the Philippines. Phylogenetic analysis showed geographic-specific clustering. Our results underscore the importance of examining urine in the diagnosis of ZIKV infection.

  8. Comprehensive analysis of the T-cell receptor beta chain gene in rhesus monkey by high throughput sequencing

    PubMed Central

    Li, Zhoufang; Liu, Guangjie; Tong, Yin; Zhang, Meng; Xu, Ying; Qin, Li; Wang, Zhanhui; Chen, Xiaoping; He, Jiankui

    2015-01-01

    Profiling immune repertoires by high throughput sequencing enhances our understanding of immune system complexity and immune-related diseases in humans. Previously, cloning and Sanger sequencing identified limited numbers of T cell receptor (TCR) nucleotide sequences in rhesus monkeys, thus their full immune repertoire is unknown. We applied multiplex PCR and Illumina high throughput sequencing to study the TCRβ of rhesus monkeys. We identified 1.26 million TCRβ sequences corresponding to 643,570 unique TCRβ sequences and 270,557 unique complementarity-determining region 3 (CDR3) gene sequences. Precise measurements of CDR3 length distribution, CDR3 amino acid distribution, length distribution of N nucleotide of junctional region, and TCRV and TCRJ gene usage preferences were performed. A comprehensive profile of rhesus monkey immune repertoire might aid human infectious disease studies using rhesus monkeys. PMID:25961410

  9. Recognition of Potentially Novel Human Disease-Associated Pathogens by Implementation of Systematic 16S rRNA Gene Sequencing in the Diagnostic Laboratory▿ †

    PubMed Central

    Keller, Peter M.; Rampini, Silvana K.; Büchler, Andrea C.; Eich, Gerhard; Wanner, Roger M.; Speck, Roberto F.; Böttger, Erik C.; Bloemberg, Guido V.

    2010-01-01

    Clinical isolates that are difficult to identify by conventional means form a valuable source of novel human pathogens. We report on a 5-year study based on systematic 16S rRNA gene sequence analysis. We found 60 previously unknown 16S rRNA sequences corresponding to potentially novel bacterial taxa. For 30 of 60 isolates, clinical relevance was evaluated; 18 of the 30 isolates analyzed were considered to be associated with human disease. PMID:20631113

  10. The Promise of Whole Genome Pathogen Sequencing for the Molecular Epidemiology of Emerging Aquaculture Pathogens

    PubMed Central

    Bayliss, Sion C.; Verner-Jeffreys, David W.; Bartie, Kerry L.; Aanensen, David M.; Sheppard, Samuel K.; Adams, Alexandra; Feil, Edward J.

    2017-01-01

    Aquaculture is the fastest growing food-producing sector, and the sustainability of this industry is critical both for global food security and economic welfare. The management of infectious disease represents a key challenge. Here, we discuss the opportunities afforded by whole genome sequencing of bacterial and viral pathogens of aquaculture to mitigate disease emergence and spread. We outline, by way of comparison, how sequencing technology is transforming the molecular epidemiology of pathogens of public health importance, emphasizing the importance of community-oriented databases and analysis tools. PMID:28217117

  11. Simultaneous virus identification and characterization of severe unexplained pneumonia cases using a metagenomics sequencing technique.

    PubMed

    Zou, Xiaohui; Tang, Guangpeng; Zhao, Xiang; Huang, Yan; Chen, Tao; Lei, Mingyu; Chen, Wenbing; Yang, Lei; Zhu, Wenfei; Zhuang, Li; Yang, Jing; Feng, Zhaomin; Wang, Dayan; Wang, Dingming; Shu, Yuelong

    2017-03-01

    Many viruses can cause respiratory diseases in humans. Although great advances have been achieved in methods of diagnosis, it remains challenging to identify pathogens in unexplained pneumonia (UP) cases. In this study, we applied next-generation sequencing (NGS) technology and a metagenomic approach to detect and characterize respiratory viruses in UP cases from Guizhou Province, China. A total of 33 oropharyngeal swabs were obtained from hospitalized UP patients and subjected to NGS. An unbiased metagenomic analysis pipeline identified 13 virus species in 16 samples. Human rhinovirus C was the virus most frequently detected and was identified in seven samples. Human measles virus, adenovirus B 55 and coxsackievirus A10 were also identified. Metagenomic sequencing also provided virus genomic sequences, which enabled genotype characterization and phylogenetic analysis. For cases of multiple infection, metagenomic sequencing afforded information regarding the quantity of each virus in the sample, which could be used to evaluate each viruses' role in the disease. Our study highlights the potential of metagenomic sequencing for pathogen identification in UP cases.

  12. Complete Genome Sequence of the Circulatory Foot-and-Mouth Disease Virus Serotype Asia1 in Bangladesh

    PubMed Central

    Ali, M. Rahmat; Alam, A. S. M. Rubayet Ul; Amin, M. Al; Ullah, Huzzat; Siddique, Mohammad Anwar; Momtaz, Samina; Sultana, Munawar

    2017-01-01

    ABSTRACT The complete genome sequence of foot-and-mouth disease virus (FMDV) serotype Asia1 isolated from Bangladesh is reported here. Genome analysis revealed amino acid substitutions in the VP1 antigenic region and deletions in both the 5′ and 3′ untranslated regions (UTRs) compared to the genome of the existing vaccine strain (GenBank accession no. AY304994). PMID:29074654

  13. Identification and Complete Genome Sequence Analysis of a Genotype XIV Newcastle Disease Virus from Nigeria

    PubMed Central

    Shittu, Ismaila; Sharma, Poonam; Volkening, Jeremy D.; Solomon, Ponman; Sulaiman, Lanre K.; Joannis, Tony M.; Williams-Coplin, Dawn; Miller, Patti J.; Dimitrov, Kiril M.

    2016-01-01

    The first complete genome sequence of a strain of Newcastle disease virus (NDV) from genotype XIV is reported here. Strain duck/Nigeria/NG-695/KG.LOM.11-16/2009 was isolated from an apparently healthy domestic duck from a live bird market in Kogi State, Nigeria, in 2009. This strain is classified as a member of subgenotype XIVb of class II. PMID:26823576

  14. Complete Genome Sequence of Genotype VI Newcastle Disease Viruses Isolated from Pigeons in Pakistan

    PubMed Central

    Wajid, Abdul; Rehmani, Shafqat Fatima; Sharma, Poonam; Goraichuk, Iryna V.; Dimitrov, Kiril M.

    2016-01-01

    Two complete genome sequences of Newcastle disease virus (NDV) are described here. Virulent isolates pigeon/Pakistan/Lahore/21A/2015 and pigeon/Pakistan/Lahore/25A/2015 were obtained from racing pigeons sampled in the Pakistani province of Punjab during 2015. Phylogenetic analysis of the fusion protein genes and complete genomes classified the isolates as members of NDV class II, genotype VI. PMID:27540069

  15. Laser Desorption Mass Spectrometry for DNA Sequencing and Analysis

    NASA Astrophysics Data System (ADS)

    Chen, C. H. Winston; Taranenko, N. I.; Golovlev, V. V.; Isola, N. R.; Allman, S. L.

    1998-03-01

    Rapid DNA sequencing and/or analysis is critically important for biomedical research. In the past, gel electrophoresis has been the primary tool to achieve DNA analysis and sequencing. However, gel electrophoresis is a time-consuming and labor-extensive process. Recently, we have developed and used laser desorption mass spectrometry (LDMS) to achieve sequencing of ss-DNA longer than 100 nucleotides. With LDMS, we succeeded in sequencing DNA in seconds instead of hours or days required by gel electrophoresis. In addition to sequencing, we also applied LDMS for the detection of DNA probes for hybridization LDMS was also used to detect short tandem repeats for forensic applications. Clinical applications for disease diagnosis such as cystic fibrosis caused by base deletion and point mutation have also been demonstrated. Experimental details will be presented in the meeting. abstract.

  16. Extensive sequence analysis of CFTR, SCNN1A, SCNN1B, SCNN1G and SERPINA1 suggests an oligogenic basis for cystic fibrosis-like phenotypes.

    PubMed

    Ramos, M D; Trujillano, D; Olivar, R; Sotillo, F; Ossowski, S; Manzanares, J; Costa, J; Gartner, S; Oliva, C; Quintana, E; Gonzalez, M I; Vazquez, C; Estivill, X; Casals, T

    2014-07-01

    The term cystic fibrosis (CF)-like disease is used to describe patients with a borderline sweat test and suggestive CF clinical features but without two CFTR(cystic fibrosis transmembrane conductance regulator) mutations. We have performed the extensive molecular analysis of four candidate genes (SCNN1A, SCNN1B, SCNN1G and SERPINA1) in a cohort of 10 uncharacterized patients with CF and CF-like disease. We have used whole-exome sequencing to characterize mutations in the CFTR gene and these four candidate genes. CFTR molecular analysis allowed a complete characterization of three of four CF patients. Candidate variants in SCNN1A, SCNN1B, SCNN1G and SERPINA1 in six patients with CF-like phenotypes were confirmed by Sanger sequencing and were further supported by in silico predictive analysis, pedigree studies, sweat test in other family members, and analysis in CF patients and healthy subjects. Our results suggest that CF-like disease probably results from complex genotypes in several genes in an oligogenic form, with rare variants interacting with environmental factors. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  17. Genetic Variation in Cardiomyopathy and Cardiovascular Disorders.

    PubMed

    McNally, Elizabeth M; Puckelwartz, Megan J

    2015-01-01

    With the wider deployment of massively-parallel, next-generation sequencing, it is now possible to survey human genome data for research and clinical purposes. The reduced cost of producing short-read sequencing has now shifted the burden to data analysis. Analysis of genome sequencing remains challenged by the complexity of the human genome, including redundancy and the repetitive nature of genome elements and the large amount of variation in individual genomes. Public databases of human genome sequences greatly facilitate interpretation of common and rare genetic variation, although linking database sequence information to detailed clinical information is limited by privacy and practical issues. Genetic variation is a rich source of knowledge for cardiovascular disease because many, if not all, cardiovascular disorders are highly heritable. The role of rare genetic variation in predicting risk and complications of cardiovascular diseases has been well established for hypertrophic and dilated cardiomyopathy, where the number of genes that are linked to these disorders is growing. Bolstered by family data, where genetic variants segregate with disease, rare variation can be linked to specific genetic variation that offers profound diagnostic information. Understanding genetic variation in cardiomyopathy is likely to help stratify forms of heart failure and guide therapy. Ultimately, genetic variation may be amenable to gene correction and gene editing strategies.

  18. A new arenavirus in a cluster of fatal transplant-associated diseases.

    PubMed

    Palacios, Gustavo; Druce, Julian; Du, Lei; Tran, Thomas; Birch, Chris; Briese, Thomas; Conlan, Sean; Quan, Phenix-Lan; Hui, Jeffrey; Marshall, John; Simons, Jan Fredrik; Egholm, Michael; Paddock, Christopher D; Shieh, Wun-Ju; Goldsmith, Cynthia S; Zaki, Sherif R; Catton, Mike; Lipkin, W Ian

    2008-03-06

    Three patients who received visceral-organ transplants from a single donor on the same day died of a febrile illness 4 to 6 weeks after transplantation. Culture, polymerase-chain-reaction (PCR) and serologic assays, and oligonucleotide microarray analysis for a wide range of infectious agents were not informative. We evaluated RNA obtained from the liver and kidney transplant recipients. Unbiased high-throughput sequencing was used to identify microbial sequences not found by means of other methods. The specificity of sequences for a new candidate pathogen was confirmed by means of culture and by means of PCR, immunohistochemical, and serologic analyses. High-throughput sequencing yielded 103,632 sequences, of which 14 represented an Old World arenavirus. Additional sequence analysis showed that this new arenavirus was related to lymphocytic choriomeningitis viruses. Specific PCR assays based on a unique sequence confirmed the presence of the virus in the kidneys, liver, blood, and cerebrospinal fluid of the recipients. Immunohistochemical analysis revealed arenavirus antigen in the liver and kidney transplants in the recipients. IgM and IgG antiviral antibodies were detected in the serum of the donor. Seroconversion was evident in serum specimens obtained from one recipient at two time points. Unbiased high-throughput sequencing is a powerful tool for the discovery of pathogens. The use of this method during an outbreak of disease facilitated the identification of a new arenavirus transmitted through solid-organ transplantation. Copyright 2008 Massachusetts Medical Society.

  19. Genome analysis of the foxtail millet pathogen Sclerospora graminicola reveals the complex effector repertoire of graminicolous downy mildews.

    PubMed

    Kobayashi, Michie; Hiraka, Yukie; Abe, Akira; Yaegashi, Hiroki; Natsume, Satoshi; Kikuchi, Hideko; Takagi, Hiroki; Saitoh, Hiromasa; Win, Joe; Kamoun, Sophien; Terauchi, Ryohei

    2017-11-22

    Downy mildew, caused by the oomycete pathogen Sclerospora graminicola, is an economically important disease of Gramineae crops including foxtail millet (Setaria italica). Plants infected with S. graminicola are generally stunted and often undergo a transformation of flower organs into leaves (phyllody or witches' broom), resulting in serious yield loss. To establish the molecular basis of downy mildew disease in foxtail millet, we carried out whole-genome sequencing and an RNA-seq analysis of S. graminicola. Sequence reads were generated from S. graminicola using an Illumina sequencing platform and assembled de novo into a draft genome sequence comprising approximately 360 Mbp. Of this sequence, 73% comprised repetitive elements, and a total of 16,736 genes were predicted from the RNA-seq data. The predicted genes included those encoding effector-like proteins with high sequence similarity to those previously identified in other oomycete pathogens. Genes encoding jacalin-like lectin-domain-containing secreted proteins were enriched in S. graminicola compared to other oomycetes. Of a total of 1220 genes encoding putative secreted proteins, 91 significantly changed their expression levels during the infection of plant tissues compared to the sporangia and zoospore stages of the S. graminicola lifecycle. We established the draft genome sequence of a downy mildew pathogen that infects Gramineae plants. Based on this sequence and our transcriptome analysis, we generated a catalog of in planta-induced candidate effector genes, providing a solid foundation from which to identify the effectors causing phyllody.

  20. First molecular detection and characterization of Marek's disease virus in red-crowned cranes (Grus japonensis): a case report.

    PubMed

    Lian, Xue; Ming, Xin; Xu, Jiarong; Cheng, Wangkun; Zhang, Xunhai; Chen, Hongjun; Ding, Chan; Jung, Yong-Sam; Qian, Yingjuan

    2018-04-03

    Marek's disease virus (MDV) resides in the genus Mardivirus in the family Herpesviridae. MDV is a highly contagious virus that can cause neurological lesions, lymphocytic proliferation, immune suppression, and death in avian species, including Galliformes (chickens, quails, partridges, and pheasants), Strigiformes (owls), Anseriformes (ducks, geese, and swans), and Falconiformes (kestrels). In 2015, two red-crowned cranes died in Nanjing (Jiangsu, China). It was determined that the birds were infected with Marek's disease virus by histopathological examination, polymerase chain reaction (PCR), gene sequencing and sequence analysis of tissue samples from two cranes. Gross lesions included diffuse nodules in the skin, muscle, liver, spleen, kidney, gizzard and heart, along with liver enlargement and gizzard mucosa hemorrhage. Histopathological assay showed that infiltrative lymphocytes and mitotic figures existed in liver and heart. The presence of MDV was confirmed by PCR. The sequence analysis of the Meq gene showed 100% identity with Md5, while the VP22 gene showed the highest homology with CVI988. Furthermore, the phylogenetic analysis of the VP22 and Meq genes suggested that the MDV (from cranes) belongs to MDV serotype 1. We describe the first molecular detection of Marek's disease in red-crowned cranes based on the findings previously described. To our knowledge, this is also the first molecular identification of Marek's disease virus in the order Gruiformes and represents detection of a novel MDV strain.

  1. Mutational analysis of the Wolfram syndrome gene in two families with chromosome 4p-linked bipolar affective disorder.

    PubMed

    Evans, K L; Lawson, D; Meitinger, T; Blackwood, D H; Porteous, D J

    2000-04-03

    Bipolar affective disorder (BPAD) is a complex disease with a significant genetic component. Heterozygous carriers of Wolfram syndrome (WFS) are at increased risk of psychiatric illness. A gene for WFS (WFS1) has recently been cloned and mapped to chromosome 4p, in the general region we previously reported as showing linkage to BPAD. Here we present sequence analysis of the WFS1 coding sequence in five affected individuals from two chromosome 4p-linked families. This resulted in the identification of six polymorphisms, two of which are predicted to change the amino acid sequence of the WFS1 protein, however none of the changes segregated with disease status. Am. J. Med. Genet. (Neuropsychiatr. Genet.) 96:158-160, 2000. Copyright 2000 Wiley-Liss, Inc.

  2. An insight into the sialome of the blood-sucking bug Triatoma infestans, a vector of Chagas' disease

    PubMed Central

    Assumpção, Teresa C. F.; Francischetti, Ivo M. B.; Andersen, John F.; Schwarz, Alexandra; Santana, Jaime M.; Ribeiro, José M. C.

    2008-01-01

    Triatoma infestans is a hemiptera, vector of Chagas’ disease, that feeds exclusively on vertebrate blood in all life stages. Hematophagous insects’ salivary glands (SG) produce potent pharmacological compounds that counteract host hemostasis, including anti-clotting, anti-platelet, and vasodilatory molecules. To obtain a further insight into the salivary biochemical and pharmacological complexity of this insect, a cDNA library from its salivary glands was randomly sequenced. Also, salivary proteins were submitted to two dimentional gel (2D-gel) electrophoresis followed by MS analysis. We present the analysis of a set of 1,534 (SG) cDNA sequences, 645 of which coded for proteins of a putative secretory nature. Most salivary proteins described as lipocalins matched peptide sequences obtained from proteomic results. PMID:18207082

  3. Alkaptonuria and Pompe disease in one patient: metabolic and molecular analysis.

    PubMed

    Zouheir Habbal, Mohammad; Bou Assi, Tarek; Mansour, Hicham

    2013-04-29

    Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (C.670C>T and C.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual.

  4. Alkaptonuria and pompe disease in one patient: metabolic and molecular analysis

    PubMed Central

    Habbal, Mohammad Zouheir; Bou Assi, Tarek; Mansour, Hicham

    2013-01-01

    Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (C.670C>T and C.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual. PMID:23632174

  5. Live births after simultaneous avoidance of monogenic diseases and chromosome abnormality by next-generation sequencing with linkage analyses.

    PubMed

    Yan, Liying; Huang, Lei; Xu, Liya; Huang, Jin; Ma, Fei; Zhu, Xiaohui; Tang, Yaqiong; Liu, Mingshan; Lian, Ying; Liu, Ping; Li, Rong; Lu, Sijia; Tang, Fuchou; Qiao, Jie; Xie, X Sunney

    2015-12-29

    In vitro fertilization (IVF), preimplantation genetic diagnosis (PGD), and preimplantation genetic screening (PGS) help patients to select embryos free of monogenic diseases and aneuploidy (chromosome abnormality). Next-generation sequencing (NGS) methods, while experiencing a rapid cost reduction, have improved the precision of PGD/PGS. However, the precision of PGD has been limited by the false-positive and false-negative single-nucleotide variations (SNVs), which are not acceptable in IVF and can be circumvented by linkage analyses, such as short tandem repeats or karyomapping. It is noteworthy that existing methods of detecting SNV/copy number variation (CNV) and linkage analysis often require separate procedures for the same embryo. Here we report an NGS-based PGD/PGS procedure that can simultaneously detect a single-gene disorder and aneuploidy and is capable of linkage analysis in a cost-effective way. This method, called "mutated allele revealed by sequencing with aneuploidy and linkage analyses" (MARSALA), involves multiple annealing and looping-based amplification cycles (MALBAC) for single-cell whole-genome amplification. Aneuploidy is determined by CNVs, whereas SNVs associated with the monogenic diseases are detected by PCR amplification of the MALBAC product. The false-positive and -negative SNVs are avoided by an NGS-based linkage analysis. Two healthy babies, free of the monogenic diseases of their parents, were born after such embryo selection. The monogenic diseases originated from a single base mutation on the autosome and the X-chromosome of the disease-carrying father and mother, respectively.

  6. Comparative sequence analysis revealed altered chromosomal organization and a novel insertion sequence encoding DNA modification and potentially stress-related functions in an Escherichia coli O157:H7 foodborne isolate

    USDA-ARS?s Scientific Manuscript database

    We recently described the complete genome of enterohemorrhagic Escherichia coli (EHEC) O157:H7 strain NADC 6564, an isolate of strain 86-24 linked to the 1986 disease outbreak. In the current study, we compared the chromosomal sequence of NADC 6564 to the well-characterized chromosomal sequences of ...

  7. Precision Medicine: Functional Advancements.

    PubMed

    Caskey, Thomas

    2018-01-29

    Precision medicine was conceptualized on the strength of genomic sequence analysis. High-throughput functional metrics have enhanced sequence interpretation and clinical precision. These technologies include metabolomics, magnetic resonance imaging, and I rhythm (cardiac monitoring), among others. These technologies are discussed and placed in clinical context for the medical specialties of internal medicine, pediatrics, obstetrics, and gynecology. Publications in these fields support the concept of a higher level of precision in identifying disease risk. Precise disease risk identification has the potential to enable intervention with greater specificity, resulting in disease prevention-an important goal of precision medicine.

  8. A weighted U-statistic for genetic association analyses of sequencing data.

    PubMed

    Wei, Changshuai; Li, Ming; He, Zihuai; Vsevolozhskaya, Olga; Schaid, Daniel J; Lu, Qing

    2014-12-01

    With advancements in next-generation sequencing technology, a massive amount of sequencing data is generated, which offers a great opportunity to comprehensively investigate the role of rare variants in the genetic etiology of complex diseases. Nevertheless, the high-dimensional sequencing data poses a great challenge for statistical analysis. The association analyses based on traditional statistical methods suffer substantial power loss because of the low frequency of genetic variants and the extremely high dimensionality of the data. We developed a Weighted U Sequencing test, referred to as WU-SEQ, for the high-dimensional association analysis of sequencing data. Based on a nonparametric U-statistic, WU-SEQ makes no assumption of the underlying disease model and phenotype distribution, and can be applied to a variety of phenotypes. Through simulation studies and an empirical study, we showed that WU-SEQ outperformed a commonly used sequence kernel association test (SKAT) method when the underlying assumptions were violated (e.g., the phenotype followed a heavy-tailed distribution). Even when the assumptions were satisfied, WU-SEQ still attained comparable performance to SKAT. Finally, we applied WU-SEQ to sequencing data from the Dallas Heart Study (DHS), and detected an association between ANGPTL 4 and very low density lipoprotein cholesterol. © 2014 WILEY PERIODICALS, INC.

  9. Deep Sequencing to Identify the Causes of Viral Encephalitis

    PubMed Central

    Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.

    2014-01-01

    Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691

  10. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Golbus, Jessica R.; Puckelwartz, Megan J.; Dellefave-Castillo, Lisa

    Background—Cardiomyopathy is highly heritable but genetically diverse. At present, genetic testing for cardiomyopathy uses targeted sequencing to simultaneously assess the coding regions of more than 50 genes. New genes are routinely added to panels to improve the diagnostic yield. With the anticipated $1000 genome, it is expected that genetic testing will shift towards comprehensive genome sequencing accompanied by targeted gene analysis. Therefore, we assessed the reliability of whole genome sequencing and targeted analysis to identify cardiomyopathy variants in 11 subjects with cardiomyopathy. Methods and Results—Whole genome sequencing with an average of 37× coverage was combined with targeted analysis focused onmore » 204 genes linked to cardiomyopathy. Genetic variants were scored using multiple prediction algorithms combined with frequency data from public databases. This pipeline yielded 1-14 potentially pathogenic variants per individual. Variants were further analyzed using clinical criteria and/or segregation analysis. Three of three previously identified primary mutations were detected by this analysis. In six subjects for whom the primary mutation was previously unknown, we identified mutations that segregated with disease, had clinical correlates, and/or had additional pathological correlation to provide evidence for causality. For two subjects with previously known primary mutations, we identified additional variants that may act as modifiers of disease severity. In total, we identified the likely pathological mutation in 9 of 11 (82%) subjects. We conclude that these pilot data demonstrate that ~30-40× coverage whole genome sequencing combined with targeted analysis is feasible and sensitive to identify rare variants in cardiomyopathy-associated genes.« less

  11. Identification and molecular analysis of infectious bursal disease in broiler farms in the Kurdistan Regional Government of Iraq.

    PubMed

    Amin, Oumed Gerjis M; Jackwood, Daral J

    2014-10-01

    The present study was undertaken to characterize field isolates of infectious bursal disease virus (IBDV). The identification was done using reverse transcription-polymerase chain reaction (RT-PCR) and partial sequencing of the VP2 gene. Pooled bursal samples were collected from commercial broiler farms located in the Kurdistan Regional Government (KRG) of Iraq. The genetic material of the IBDV was detected in 10 out of 29 field samples. Sequences of the hypervariable VP2 region were determined for 10 of these viruses. Molecular analysis of the VP2 gene of five IBDVs showed amino acid sequences consistent with the very virulent (vv) IBDV. Two samples were identified as classic vaccine viruses, and three samples were classic vaccine viruses that appear to have mutated during replication in the field. Phylogenetic analysis showed that all five field IBDV strains of the present study were closely related to each other. On the basis of nucleotide sequencing and phylogenetic analysis, it is very likely that IBD-causing viruses in this part of Iraq are of the very virulent type. These IBDVs appear to be evolving relative to their type strains.

  12. Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism

    PubMed Central

    Yin, Ling; An, Yunhe; Qu, Junjie; Li, Xinlong; Zhang, Yali; Dry, Ian; Wu, Huijuan; Lu, Jiang

    2017-01-01

    Plasmopara viticola causes downy mildew disease of grapevine which is one of the most devastating diseases of viticulture worldwide. Here we report a 101.3 Mb whole genome sequence of P. viticola isolate ‘JL-7-2’ obtained by a combination of Illumina and PacBio sequencing technologies. The P. viticola genome contains 17,014 putative protein-coding genes and has ~26% repetitive sequences. A total of 1,301 putative secreted proteins, including 100 putative RXLR effectors and 90 CRN effectors were identified in this genome. In the secretome, 261 potential pathogenicity genes and 95 carbohydrate-active enzymes were predicted. Transcriptional analysis revealed that most of the RXLR effectors, pathogenicity genes and carbohydrate-active enzymes were significantly up-regulated during infection. Comparative genomic analysis revealed that P. viticola evolved independently from the Arabidopsis downy mildew pathogen Hyaloperonospora arabidopsidis. The availability of the P. viticola genome provides a valuable resource not only for comparative genomic analysis and evolutionary studies among oomycetes, but also enhance our knowledge on the mechanism of interactions between this biotrophic pathogen and its host. PMID:28417959

  13. [Analysis of gene mutation in a Chinese family with Norrie disease].

    PubMed

    Zhang, Tian-xiao; Zhao, Xiu-li; Hua, Rui; Zhang, Jin-song; Zhang, Xue

    2012-09-01

    To detect the pathogenic mutation in a Chinese family with Norrie disease. Clinical diagnosis was based on familial history, clinical sign and B ultrasonic examination. Peripheral blood samples were obtained from all available members in a Chinese family with Norrie disease. Genomic DNA was extracted from lymphocytes by the standard SDS-proteinase K-phenol/chloroform method. Two coding exons and all intron-exon boundaries of the NDP gene were PCR amplified using three pairs of primers and subjected to automatic DNA sequence. The causative mutation was confirmed by restriction enzyme analysis and genotyping analysis in all members. Sequence analysis of NDP gene revealed a missense mutation c.220C > T (p.Arg74Cys) in the proband and his mother. Further mutation identification by restriction enzyme analysis and genotyping analysis showed that the proband was homozygote of this mutation. His mother and other four unaffected members (III3, IV4, III5 and II2) were carriers of this mutation. The mutant amino acid located in the C-terminal cystine knot-like domain, which was critical motif for the structure and function of NDP. A NDP missense mutation was identified in a Chinese family with Norrie disease.

  14. Significance of functional disease-causal/susceptible variants identified by whole-genome analyses for the understanding of human diseases.

    PubMed

    Hitomi, Yuki; Tokunaga, Katsushi

    2017-01-01

    Human genome variation may cause differences in traits and disease risks. Disease-causal/susceptible genes and variants for both common and rare diseases can be detected by comprehensive whole-genome analyses, such as whole-genome sequencing (WGS), using next-generation sequencing (NGS) technology and genome-wide association studies (GWAS). Here, in addition to the application of an NGS as a whole-genome analysis method, we summarize approaches for the identification of functional disease-causal/susceptible variants from abundant genetic variants in the human genome and methods for evaluating their functional effects in human diseases, using an NGS and in silico and in vitro functional analyses. We also discuss the clinical applications of the functional disease causal/susceptible variants to personalized medicine.

  15. Population-Sequencing as a Biomarker of Burkholderia mallei and Burkholderia pseudomallei Evolution through Microbial Forensic Analysis.

    PubMed

    Jakupciak, John P; Wells, Jeffrey M; Karalus, Richard J; Pawlowski, David R; Lin, Jeffrey S; Feldman, Andrew B

    2013-01-01

    Large-scale genomics projects are identifying biomarkers to detect human disease. B. pseudomallei and B. mallei are two closely related select agents that cause melioidosis and glanders. Accurate characterization of metagenomic samples is dependent on accurate measurements of genetic variation between isolates with resolution down to strain level. Often single biomarker sensitivity is augmented by use of multiple or panels of biomarkers. In parallel with single biomarker validation, advances in DNA sequencing enable analysis of entire genomes in a single run: population-sequencing. Potentially, direct sequencing could be used to analyze an entire genome to serve as the biomarker for genome identification. However, genome variation and population diversity complicate use of direct sequencing, as well as differences caused by sample preparation protocols including sequencing artifacts and mistakes. As part of a Department of Homeland Security program in bacterial forensics, we examined how to implement whole genome sequencing (WGS) analysis as a judicially defensible forensic method for attributing microbial sample relatedness; and also to determine the strengths and limitations of whole genome sequence analysis in a forensics context. Herein, we demonstrate use of sequencing to provide genetic characterization of populations: direct sequencing of populations.

  16. Population-Sequencing as a Biomarker of Burkholderia mallei and Burkholderia pseudomallei Evolution through Microbial Forensic Analysis

    PubMed Central

    Jakupciak, John P.; Wells, Jeffrey M.; Karalus, Richard J.; Pawlowski, David R.; Lin, Jeffrey S.; Feldman, Andrew B.

    2013-01-01

    Large-scale genomics projects are identifying biomarkers to detect human disease. B. pseudomallei and B. mallei are two closely related select agents that cause melioidosis and glanders. Accurate characterization of metagenomic samples is dependent on accurate measurements of genetic variation between isolates with resolution down to strain level. Often single biomarker sensitivity is augmented by use of multiple or panels of biomarkers. In parallel with single biomarker validation, advances in DNA sequencing enable analysis of entire genomes in a single run: population-sequencing. Potentially, direct sequencing could be used to analyze an entire genome to serve as the biomarker for genome identification. However, genome variation and population diversity complicate use of direct sequencing, as well as differences caused by sample preparation protocols including sequencing artifacts and mistakes. As part of a Department of Homeland Security program in bacterial forensics, we examined how to implement whole genome sequencing (WGS) analysis as a judicially defensible forensic method for attributing microbial sample relatedness; and also to determine the strengths and limitations of whole genome sequence analysis in a forensics context. Herein, we demonstrate use of sequencing to provide genetic characterization of populations: direct sequencing of populations. PMID:24455204

  17. A de novo whole gene deletion of XIAP detected by exome sequencing analysis in very early onset inflammatory bowel disease: a case report.

    PubMed

    Kelsen, Judith R; Dawany, Noor; Martinez, Alejandro; Martinez, Alejuandro; Grochowski, Christopher M; Maurer, Kelly; Rappaport, Eric; Piccoli, David A; Baldassano, Robert N; Mamula, Petar; Sullivan, Kathleen E; Devoto, Marcella

    2015-11-18

    Children with very early-onset inflammatory bowel disease (VEO-IBD), those diagnosed at less than 5 years of age, are a unique population. A subset of these patients present with a distinct phenotype and more severe disease than older children and adults. Host genetics is thought to play a more prominent role in this young population, and monogenic defects in genes related to primary immunodeficiencies are responsible for the disease in a small subset of patients with VEO-IBD. We report a child who presented at 3 weeks of life with very early-onset inflammatory bowel disease (VEO-IBD). He had a complicated disease course and remained unresponsive to medical and surgical therapy. The refractory nature of his disease, together with his young age of presentation, prompted utilization of whole exome sequencing (WES) to detect an underlying monogenic primary immunodeficiency and potentially target therapy to the identified defect. Copy number variation analysis (CNV) was performed using the eXome-Hidden Markov Model. Whole exome sequencing revealed 1,380 nonsense and missense variants in the patient. Plausible candidate variants were not detected following analysis of filtered variants, therefore, we performed CNV analysis of the WES data, which led us to identify a de novo whole gene deletion in XIAP. This is the first reported whole gene deletion in XIAP, the causal gene responsible for XLP2 (X-linked lymphoproliferative Disease 2). XLP2 is a syndrome resulting in VEO-IBD and can increase susceptibility to hemophagocytic lymphohistocytosis (HLH). This identification allowed the patient to be referred for bone marrow transplantation, potentially curative for his disease and critical to prevent the catastrophic sequela of HLH. This illustrates the unique etiology of VEO-IBD, and the subsequent effects on therapeutic options. This cohort requires careful and thorough evaluation for monogenic defects and primary immunodeficiencies.

  18. Morphological and molecular characterization of fungal pathogen, Magnaphorthe oryzae

    NASA Astrophysics Data System (ADS)

    Hasan, Nor'Aishah; Rafii, Mohd Y.; Rahim, Harun A.; Ali, Nusaibah Syd; Mazlan, Norida; Abdullah, Shamsiah

    2016-02-01

    Rice is arguably the most crucial food crops supplying quarter of calories intake. Fungal pathogen, Magnaphorthe oryzae promotes blast disease unconditionally to gramineous host including rice species. This disease spurred an outbreaks and constant threat to cereal production. Global rice yield declining almost 10-30% including Malaysia. As Magnaphorthe oryzae and its host is model in disease plant study, the rice blast pathosystem has been the subject of intense interest to overcome the importance of the disease to world agriculture. Therefore, in this study, our prime objective was to isolate samples of Magnaphorthe oryzae from diseased leaf obtained from MARDI Seberang Perai, Penang, Malaysia. Molecular identification was performed by sequences analysis from internal transcribed spacer (ITS) region of nuclear ribosomal RNA genes. Phylogenetic affiliation of the isolated samples were analyzed by comparing the ITS sequences with those deposited in the GenBank database. The sequence of the isolate demonstrated at least 99% nucleotide identity with the corresponding sequence in GenBank for Magnaphorthe oryzae. Morphological observed under microscope demonstrated that the structure of conidia followed similar characteristic as M. oryzae. Finding in this study provide useful information for breeding programs, epidemiology studies and improved disease management.

  19. Morphological and molecular characterization of fungal pathogen, Magnaphorthe oryzae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hasan, Nor’Aishah, E-mail: aishahnh@ns.uitm.edu.my; Rafii, Mohd Y., E-mail: mrafii@upm.edu.my; Department of Crop Science, Universiti Putra Malaysia

    2016-02-01

    Rice is arguably the most crucial food crops supplying quarter of calories intake. Fungal pathogen, Magnaphorthe oryzae promotes blast disease unconditionally to gramineous host including rice species. This disease spurred an outbreaks and constant threat to cereal production. Global rice yield declining almost 10-30% including Malaysia. As Magnaphorthe oryzae and its host is model in disease plant study, the rice blast pathosystem has been the subject of intense interest to overcome the importance of the disease to world agriculture. Therefore, in this study, our prime objective was to isolate samples of Magnaphorthe oryzae from diseased leaf obtained from MARDI Seberangmore » Perai, Penang, Malaysia. Molecular identification was performed by sequences analysis from internal transcribed spacer (ITS) region of nuclear ribosomal RNA genes. Phylogenetic affiliation of the isolated samples were analyzed by comparing the ITS sequences with those deposited in the GenBank database. The sequence of the isolate demonstrated at least 99% nucleotide identity with the corresponding sequence in GenBank for Magnaphorthe oryzae. Morphological observed under microscope demonstrated that the structure of conidia followed similar characteristic as M. oryzae. Finding in this study provide useful information for breeding programs, epidemiology studies and improved disease management.« less

  20. A novel missense Norrie disease mutation associated with a severe ocular phenotype.

    PubMed

    Khan, Arif O; Shamsi, Farrukh A; Al-Saif, Amr; Kambouris, Marios

    2004-01-01

    Clinical findings and pedigree analysis led to the diagnosis of severe Norrie disease in two brothers. DNA sequencing demonstrated a novel missense mutation (703G>T) that significantly alters predicted protein structure. Less severe retinal developmental disease may be associated with milder mutations in the Norrie disease gene.

  1. MutAid: Sanger and NGS Based Integrated Pipeline for Mutation Identification, Validation and Annotation in Human Molecular Genetics.

    PubMed

    Pandey, Ram Vinay; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas

    2016-01-01

    Traditional Sanger sequencing as well as Next-Generation Sequencing have been used for the identification of disease causing mutations in human molecular research. The majority of currently available tools are developed for research and explorative purposes and often do not provide a complete, efficient, one-stop solution. As the focus of currently developed tools is mainly on NGS data analysis, no integrative solution for the analysis of Sanger data is provided and consequently a one-stop solution to analyze reads from both sequencing platforms is not available. We have therefore developed a new pipeline called MutAid to analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and analysis of Sanger and NGS data under a single platform. It is capable of analyzing reads from multiple patients in a single run to create a list of potential disease causing base substitutions as well as insertions and deletions. MutAid has been developed for expert and non-expert users and supports four sequencing platforms including Sanger, Illumina, 454 and Ion Torrent. Furthermore, for NGS data analysis, five read mappers including BWA, TMAP, Bowtie, Bowtie2 and GSNAP and four variant callers including GATK-HaplotypeCaller, SAMTOOLS, Freebayes and VarScan2 pipelines are supported. MutAid is freely available at https://sourceforge.net/projects/mutaid.

  2. MutAid: Sanger and NGS Based Integrated Pipeline for Mutation Identification, Validation and Annotation in Human Molecular Genetics

    PubMed Central

    Pandey, Ram Vinay; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas

    2016-01-01

    Traditional Sanger sequencing as well as Next-Generation Sequencing have been used for the identification of disease causing mutations in human molecular research. The majority of currently available tools are developed for research and explorative purposes and often do not provide a complete, efficient, one-stop solution. As the focus of currently developed tools is mainly on NGS data analysis, no integrative solution for the analysis of Sanger data is provided and consequently a one-stop solution to analyze reads from both sequencing platforms is not available. We have therefore developed a new pipeline called MutAid to analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and analysis of Sanger and NGS data under a single platform. It is capable of analyzing reads from multiple patients in a single run to create a list of potential disease causing base substitutions as well as insertions and deletions. MutAid has been developed for expert and non-expert users and supports four sequencing platforms including Sanger, Illumina, 454 and Ion Torrent. Furthermore, for NGS data analysis, five read mappers including BWA, TMAP, Bowtie, Bowtie2 and GSNAP and four variant callers including GATK-HaplotypeCaller, SAMTOOLS, Freebayes and VarScan2 pipelines are supported. MutAid is freely available at https://sourceforge.net/projects/mutaid. PMID:26840129

  3. SIMPLEX: Cloud-Enabled Pipeline for the Comprehensive Analysis of Exome Sequencing Data

    PubMed Central

    Fischer, Maria; Snajder, Rene; Pabinger, Stephan; Dander, Andreas; Schossig, Anna; Zschocke, Johannes; Trajanoski, Zlatko; Stocker, Gernot

    2012-01-01

    In recent studies, exome sequencing has proven to be a successful screening tool for the identification of candidate genes causing rare genetic diseases. Although underlying targeted sequencing methods are well established, necessary data handling and focused, structured analysis still remain demanding tasks. Here, we present a cloud-enabled autonomous analysis pipeline, which comprises the complete exome analysis workflow. The pipeline combines several in-house developed and published applications to perform the following steps: (a) initial quality control, (b) intelligent data filtering and pre-processing, (c) sequence alignment to a reference genome, (d) SNP and DIP detection, (e) functional annotation of variants using different approaches, and (f) detailed report generation during various stages of the workflow. The pipeline connects the selected analysis steps, exposes all available parameters for customized usage, performs required data handling, and distributes computationally expensive tasks either on a dedicated high-performance computing infrastructure or on the Amazon cloud environment (EC2). The presented application has already been used in several research projects including studies to elucidate the role of rare genetic diseases. The pipeline is continuously tested and is publicly available under the GPL as a VirtualBox or Cloud image at http://simplex.i-med.ac.at; additional supplementary data is provided at http://www.icbi.at/exome. PMID:22870267

  4. Next-generation sequencing for identification of candidate genes for Fusarium wilt and sterility mosaic disease in pigeonpea (Cajanus cajan).

    PubMed

    Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Kumar, Vinay; Kale, Sandip M; Sinha, Pallavi; Chitikineni, Annapurna; Pazhamala, Lekha T; Garg, Vanika; Sharma, Mamta; Sameer Kumar, Chanda Venkata; Parupalli, Swathi; Vechalapu, Suryanarayana; Patil, Suyash; Muniswamy, Sonnappa; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Dharmaraj, Pallavi Subbanna; Varshney, Rajeev K

    2016-05-01

    To map resistance genes for Fusarium wilt (FW) and sterility mosaic disease (SMD) in pigeonpea, sequencing-based bulked segregant analysis (Seq-BSA) was used. Resistant (R) and susceptible (S) bulks from the extreme recombinant inbred lines of ICPL 20096 × ICPL 332 were sequenced. Subsequently, SNP index was calculated between R- and S-bulks with the help of draft genome sequence and reference-guided assembly of ICPL 20096 (resistant parent). Seq-BSA has provided seven candidate SNPs for FW and SMD resistance in pigeonpea. In parallel, four additional genotypes were re-sequenced and their combined analysis with R- and S-bulks has provided a total of 8362 nonsynonymous (ns) SNPs. Of 8362 nsSNPs, 60 were found within the 2-Mb flanking regions of seven candidate SNPs identified through Seq-BSA. Haplotype analysis narrowed down to eight nsSNPs in seven genes. These eight nsSNPs were further validated by re-sequencing 11 genotypes that are resistant and susceptible to FW and SMD. This analysis revealed association of four candidate nsSNPs in four genes with FW resistance and four candidate nsSNPs in three genes with SMD resistance. Further, In silico protein analysis and expression profiling identified two most promising candidate genes namely C.cajan_01839 for SMD resistance and C.cajan_03203 for FW resistance. Identified candidate genomic regions/SNPs will be useful for genomics-assisted breeding in pigeonpea. © 2015 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  5. The Initial Evaluation of Patients After Positive Newborn Screening: Recommended Algorithms Leading to a Confirmed Diagnosis of Pompe Disease.

    PubMed

    Burton, Barbara K; Kronn, David F; Hwu, Wuh-Liang; Kishnani, Priya S

    2017-07-01

    Newborn screening (NBS) for Pompe disease is done through analysis of acid α-glucosidase (GAA) activity in dried blood spots. When GAA levels are below established cutoff values, then second-tier testing is required to confirm or refute a diagnosis of Pompe disease. This article in the "Newborn Screening, Diagnosis, and Treatment for Pompe Disease" guidance supplement provides recommendations for confirmatory testing after a positive NBS result indicative of Pompe disease is obtained. Two algorithms were developed by the Pompe Disease Newborn Screening Working Group, a group of international experts on both NBS and Pompe disease, based on whether DNA sequencing is performed as part of the screening method. Using the recommendations in either algorithm will lead to 1 of 3 diagnoses: classic infantile-onset Pompe disease, late-onset Pompe disease, or no disease/not affected/carrier. Mutation analysis of the GAA gene is essential for confirming the biochemical diagnosis of Pompe disease. For NBS laboratories that do not have DNA sequencing capabilities, the responsibility of obtaining sequencing of the GAA gene will fall on the referral center. The recommendations for confirmatory testing and the initial evaluation are intended for a broad global audience. However, the Working Group recognizes that clinical practices, standards of care, and resource capabilities vary not only regionally, but also by testing centers. Individual patient needs and health status as well as local/regional insurance reimbursement programs and regulations also must be considered. Copyright © 2017 by the American Academy of Pediatrics.

  6. How next-generation sequencing and multiscale data analysis will transform infectious disease management.

    PubMed

    Pak, Theodore R; Kasarskis, Andrew

    2015-12-01

    Recent reviews have examined the extent to which routine next-generation sequencing (NGS) on clinical specimens will improve the capabilities of clinical microbiology laboratories in the short term, but do not explore integrating NGS with clinical data from electronic medical records (EMRs), immune profiling data, and other rich datasets to create multiscale predictive models. This review introduces a range of "omics" and patient data sources relevant to managing infections and proposes 3 potentially disruptive applications for these data in the clinical workflow. The combined threats of healthcare-associated infections and multidrug-resistant organisms may be addressed by multiscale analysis of NGS and EMR data that is ideally updated and refined over time within each healthcare organization. Such data and analysis should form the cornerstone of future learning health systems for infectious disease. © The Author 2015. Published by Oxford University Press on behalf of the Infectious Diseases Society of America.

  7. Single-Cell Sequencing of the Healthy and Diseased Heart Reveals Ckap4 as a New Modulator of Fibroblasts Activation.

    PubMed

    Gladka, Monika M; Molenaar, Bas; de Ruiter, Hesther; van der Elst, Stefan; Tsui, Hoyee; Versteeg, Danielle; Lacraz, Grègory P A; Huibers, Manon M H; van Oudenaarden, Alexander; van Rooij, Eva

    2018-01-31

    Background -Genome-wide transcriptome analysis has greatly advanced our understanding of the regulatory networks underlying basic cardiac biology and mechanisms driving disease. However, so far, the resolution of studying gene expression patterns in the adult heart has been limited to the level of extracts from whole tissues. The use of tissue homogenates inherently causes the loss of any information on cellular origin or cell type-specific changes in gene expression. Recent developments in RNA amplification strategies provide a unique opportunity to use small amounts of input RNA for genome-wide sequencing of single cells. Methods -Here, we present a method to obtain high quality RNA from digested cardiac tissue from adult mice for automated single-cell sequencing of both the healthy and diseased heart. Results -After optimization, we were able to perform single-cell sequencing on adult cardiac tissue under both homeostatic conditions and after ischemic injury. Clustering analysis based on differential gene expression unveiled known and novel markers of all main cardiac cell types. Based on differential gene expression we were also able to identify multiple subpopulations within a certain cell type. Furthermore, applying single-cell sequencing on both the healthy and the injured heart indicated the presence of disease-specific cell subpopulations. As such, we identified cytoskeleton associated protein 4 ( Ckap4 ) as a novel marker for activated fibroblasts that positively correlates with known myofibroblast markers in both mouse and human cardiac tissue. Ckap4 inhibition in activated fibroblasts treated with TGFβ triggered a greater increase in the expression of genes related to activated fibroblasts compared to control, suggesting a role of Ckap4 in modulating fibroblast activation in the injured heart. Conclusions -Single-cell sequencing on both the healthy and diseased adult heart allows us to study transcriptomic differences between cardiac cells, as well as cell type-specific changes in gene expression during cardiac disease. This new approach provides a wealth of novel insights into molecular changes that underlie the cellular processes relevant for cardiac biology and pathophysiology. Applying this technology could lead to the discovery of new therapeutic targets relevant for heart disease.

  8. Patome: a database server for biological sequence annotation and analysis in issued patents and published patent applications.

    PubMed

    Lee, Byungwook; Kim, Taehyung; Kim, Seon-Kyu; Lee, Kwang H; Lee, Doheon

    2007-01-01

    With the advent of automated and high-throughput techniques, the number of patent applications containing biological sequences has been increasing rapidly. However, they have attracted relatively little attention compared to other sequence resources. We have built a database server called Patome, which contains biological sequence data disclosed in patents and published applications, as well as their analysis information. The analysis is divided into two steps. The first is an annotation step in which the disclosed sequences were annotated with RefSeq database. The second is an association step where the sequences were linked to Entrez Gene, OMIM and GO databases, and their results were saved as a gene-patent table. From the analysis, we found that 55% of human genes were associated with patenting. The gene-patent table can be used to identify whether a particular gene or disease is related to patenting. Patome is available at http://www.patome.org/; the information is updated bimonthly.

  9. Patome: a database server for biological sequence annotation and analysis in issued patents and published patent applications

    PubMed Central

    Lee, Byungwook; Kim, Taehyung; Kim, Seon-Kyu; Lee, Kwang H.; Lee, Doheon

    2007-01-01

    With the advent of automated and high-throughput techniques, the number of patent applications containing biological sequences has been increasing rapidly. However, they have attracted relatively little attention compared to other sequence resources. We have built a database server called Patome, which contains biological sequence data disclosed in patents and published applications, as well as their analysis information. The analysis is divided into two steps. The first is an annotation step in which the disclosed sequences were annotated with RefSeq database. The second is an association step where the sequences were linked to Entrez Gene, OMIM and GO databases, and their results were saved as a gene–patent table. From the analysis, we found that 55% of human genes were associated with patenting. The gene–patent table can be used to identify whether a particular gene or disease is related to patenting. Patome is available at ; the information is updated bimonthly. PMID:17085479

  10. Polyomavirus BK non-coding control region rearrangements in health and disease.

    PubMed

    Sharma, Preety M; Gupta, Gaurav; Vats, Abhay; Shapiro, Ron; Randhawa, Parmjeet S

    2007-08-01

    BK virus is an increasingly recognized pathogen in transplanted patients. DNA sequencing of this virus shows considerable genomic variability. To understand the clinical significance of rearrangements in the non-coding control region (NCCR) of BK virus (BKV), we report a meta-analysis of 507 sequences, including 40 sequences generated in our own laboratory, for associations between rearrangements and disease, tissue tropism, geographic origin, and viral genotype. NCCR rearrangements were less frequent in (a) asymptomatic BKV viruria compared to patients viral nephropathy (1.7% vs. 22.5%), and (b) viral genotype 1 compared to other genotypes (2.4% vs. 11.2%). Rearrangements were commoner in malignancy (78.6%), and Norwegians (45.7%), and less common in East Indians (0%), and Japanese (4.3%). A surprising number of rearranged sequences were reported from mononuclear cells of healthy subjects, whereas most plasma sequences were archetypal. This difference could not be related to potential recombinase activity in lymphocytes, as consensus recombination signal sequences could not be found in the NCCR region. NCCR rearrangements are neither required nor a sufficient condition to produce clinical disease. BKV nephropathy and hemorrhagic cystitis are not associated with any unique NCCR configuration or nucleotide sequence.

  11. CoryneBase: Corynebacterium Genomic Resources and Analysis Tools at Your Fingertips

    PubMed Central

    Tan, Mui Fern; Jakubovics, Nick S.; Wee, Wei Yee; Mutha, Naresh V. R.; Wong, Guat Jah; Ang, Mia Yang; Yazdi, Amir Hessam; Choo, Siew Woh

    2014-01-01

    Corynebacteria are used for a wide variety of industrial purposes but some species are associated with human diseases. With increasing number of corynebacterial genomes having been sequenced, comparative analysis of these strains may provide better understanding of their biology, phylogeny, virulence and taxonomy that may lead to the discoveries of beneficial industrial strains or contribute to better management of diseases. To facilitate the ongoing research of corynebacteria, a specialized central repository and analysis platform for the corynebacterial research community is needed to host the fast-growing amount of genomic data and facilitate the analysis of these data. Here we present CoryneBase, a genomic database for Corynebacterium with diverse functionality for the analysis of genomes aimed to provide: (1) annotated genome sequences of Corynebacterium where 165,918 coding sequences and 4,180 RNAs can be found in 27 species; (2) access to comprehensive Corynebacterium data through the use of advanced web technologies for interactive web interfaces; and (3) advanced bioinformatic analysis tools consisting of standard BLAST for homology search, VFDB BLAST for sequence homology search against the Virulence Factor Database (VFDB), Pairwise Genome Comparison (PGC) tool for comparative genomic analysis, and a newly designed Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomic analysis. CoryneBase offers the access of a range of Corynebacterium genomic resources as well as analysis tools for comparative genomics and pathogenomics. It is publicly available at http://corynebacterium.um.edu.my/. PMID:24466021

  12. Targeted next-generation sequencing in steroid-resistant nephrotic syndrome: mutations in multiple glomerular genes may influence disease severity.

    PubMed

    Bullich, Gemma; Trujillano, Daniel; Santín, Sheila; Ossowski, Stephan; Mendizábal, Santiago; Fraga, Gloria; Madrid, Álvaro; Ariceta, Gema; Ballarín, José; Torra, Roser; Estivill, Xavier; Ars, Elisabet

    2015-09-01

    Genetic diagnosis of steroid-resistant nephrotic syndrome (SRNS) using Sanger sequencing is complicated by the high genetic heterogeneity and phenotypic variability of this disease. We aimed to improve the genetic diagnosis of SRNS by simultaneously sequencing 26 glomerular genes using massive parallel sequencing and to study whether mutations in multiple genes increase disease severity. High-throughput mutation analysis was performed in 50 SRNS and/or focal segmental glomerulosclerosis (FSGS) patients, a validation cohort of 25 patients with known pathogenic mutations, and a discovery cohort of 25 uncharacterized patients with probable genetic etiology. In the validation cohort, we identified the 42 previously known pathogenic mutations across NPHS1, NPHS2, WT1, TRPC6, and INF2 genes. In the discovery cohort, disease-causing mutations in SRNS/FSGS genes were found in nine patients. We detected three patients with mutations in an SRNS/FSGS gene and COL4A3. Two of them were familial cases and presented a more severe phenotype than family members with mutation in only one gene. In conclusion, our results show that massive parallel sequencing is feasible and robust for genetic diagnosis of SRNS/FSGS. Our results indicate that patients carrying mutations in an SRNS/FSGS gene and also in COL4A3 gene have increased disease severity.

  13. Pathogenesis, Molecular Genetics, and Genomics of Mycobacterium avium subsp. paratuberculosis, the Etiologic Agent of Johne’s Disease

    PubMed Central

    Rathnaiah, Govardhan; Zinniel, Denise K.; Bannantine, John P.; Stabel, Judith R.; Gröhn, Yrjö T.; Collins, Michael T.; Barletta, Raúl G.

    2017-01-01

    Mycobacterium avium subsp. paratuberculosis (MAP) is the etiologic agent of Johne’s disease in ruminants causing chronic diarrhea, malnutrition, and muscular wasting. Neonates and young animals are infected primarily by the fecal–oral route. MAP attaches to, translocates via the intestinal mucosa, and is phagocytosed by macrophages. The ensuing host cellular immune response leads to granulomatous enteritis characterized by a thick and corrugated intestinal wall. We review various tissue culture systems, ileal loops, and mice, goats, and cattle used to study MAP pathogenesis. MAP can be detected in clinical samples by microscopy, culturing, PCR, and an enzyme-linked immunosorbent assay. There are commercial vaccines that reduce clinical disease and shedding, unfortunately, their efficacies are limited and may not engender long-term protective immunity. Moreover, the potential linkage with Crohn’s disease and other human diseases makes MAP a concern as a zoonotic pathogen. Potential therapies with anti-mycobacterial agents are also discussed. The completion of the MAP K-10 genome sequence has greatly improved our understanding of MAP pathogenesis. The analysis of this sequence has identified a wide range of gene functions involved in virulence, lipid metabolism, transcriptional regulation, and main metabolic pathways. We also review the transposons utilized to generate random transposon mutant libraries and the recent advances in the post-genomic era. This includes the generation and characterization of allelic exchange mutants, transcriptomic analysis, transposon mutant banks analysis, new efforts to generate comprehensive mutant libraries, and the application of transposon site hybridization mutagenesis and transposon sequencing for global analysis of the MAP genome. Further analysis of candidate vaccine strains development is also provided with critical discussions on their benefits and shortcomings, and strategies to develop a highly efficacious live-attenuated vaccine capable of differentiating infected from vaccinated animals. PMID:29164142

  14. Genomic and clinical profiling of a national nephrotic syndrome cohort advocates a precision medicine approach to disease management.

    PubMed

    Bierzynska, Agnieszka; McCarthy, Hugh J; Soderquest, Katrina; Sen, Ethan S; Colby, Elizabeth; Ding, Wen Y; Nabhan, Marwa M; Kerecuk, Larissa; Hegde, Shivram; Hughes, David; Marks, Stephen; Feather, Sally; Jones, Caroline; Webb, Nicholas J A; Ognjanovic, Milos; Christian, Martin; Gilbert, Rodney D; Sinha, Manish D; Lord, Graham M; Simpson, Michael; Koziell, Ania B; Welsh, Gavin I; Saleem, Moin A

    2017-04-01

    Steroid Resistant Nephrotic Syndrome (SRNS) in children and young adults has differing etiologies with monogenic disease accounting for 2.9-30% in selected series. Using whole exome sequencing we sought to stratify a national population of children with SRNS into monogenic and non-monogenic forms, and further define those groups by detailed phenotypic analysis. Pediatric patients with SRNS were identified via a national United Kingdom Renal Registry. Whole exome sequencing was performed on 187 patients, of which 12% have a positive family history with a focus on the 53 genes currently known to be associated with nephrotic syndrome. Genetic findings were correlated with individual case disease characteristics. Disease causing variants were detected in 26.2% of patients. Most often this occurred in the three most common SRNS-associated genes: NPHS1, NPHS2, and WT1 but also in 14 other genes. The genotype did not always correlate with expected phenotype since mutations in OCRL, COL4A3, and DGKE associated with specific syndromes were detected in patients with isolated renal disease. Analysis by primary/presumed compared with secondary steroid resistance found 30.8% monogenic disease in primary compared with none in secondary SRNS permitting further mechanistic stratification. Genetic SRNS progressed faster to end stage renal failure, with no documented disease recurrence post-transplantation within this cohort. Primary steroid resistance in which no gene mutation was identified had a 47.8% risk of recurrence. In this unbiased pediatric population, whole exome sequencing allowed screening of all current candidate genes. Thus, deep phenotyping combined with whole exome sequencing is an effective tool for early identification of SRNS etiology, yielding an evidence-based algorithm for clinical management. Copyright © 2016 International Society of Nephrology. Published by Elsevier Inc. All rights reserved.

  15. Increased Probability of Co-Occurrence of Two Rare Diseases in Consanguineous Families and Resolution of a Complex Phenotype by Next Generation Sequencing

    PubMed Central

    Lal, Dennis; Neubauer, Bernd A.; Toliat, Mohammad R.; Altmüller, Janine; Thiele, Holger; Nürnberg, Peter; Kamrath, Clemens; Schänzer, Anne; Sander, Thomas; Hahn, Andreas; Nothnagel, Michael

    2016-01-01

    Massively parallel sequencing of whole genomes and exomes has facilitated a direct assessment of causative genetic variation, now enabling the identification of genetic factors involved in rare diseases (RD) with Mendelian inheritance patterns on an almost routine basis. Here, we describe the illustrative case of a single consanguineous family where this strategy suffered from the difficulty to distinguish between two etiologically distinct disorders, namely the co-occurrence of hereditary hypophosphatemic rickets (HRR) and congenital myopathies (CM), by their phenotypic manifestation alone. We used parametric linkage analysis, homozygosity mapping and whole exome-sequencing to identify mutations underlying HRR and CM. We also present an approximate approach for assessing the probability of co-occurrence of two unlinked recessive RD in a single family as a function of the degree of consanguinity and the frequency of the disease-causing alleles. Linkage analysis and homozygosity mapping yielded elusive results when assuming a single RD, but whole-exome sequencing helped to identify two mutations in two genes, namely SLC34A3 and SEPN1, that segregated independently in this family and that have previously been linked to two etiologically different diseases. We assess the increase in chance co-occurrence of rare diseases due to consanguinity, i.e. under circumstances that generally favor linkage mapping of recessive disease, and show that this probability can increase by several orders of magnitudes. We conclude that such potential co-occurrence represents an underestimated risk when analyzing rare or undefined diseases in consanguineous families and should be given more consideration in the clinical and genetic evaluation. PMID:26789268

  16. Molecular characterization and phylogenetic analysis of infectious bursal disease viruses isolated from chicken in South China in 2011.

    PubMed

    Liu, Di; Zhang, Xiang-Bin; Yan, Zhuan-Qiang; Chen, Feng; Ji, Jun; Qin, Jian-Ping; Li, Hai-Yan; Lu, Jun-Peng; Xue, Yu; Liu, Jia-Jia; Xie, Qing-Mei; Ma, Jing-Yun; Xue, Chun-Yi; Bee, Ying-Zuo

    2013-06-01

    Infectious bursal disease virus (IBDV) is a double-stranded RNA virus that causes immunosuppressive disease in young chickens. Thousands of cases of IBDV infection are reported each year in South China, and these infections can result in considerable economic losses to the poultry industry. To monitor variations of the virus during the outbreaks, 30 IBDVs were identified from vaccinated chicken flocks from nine provinces in South China in 2011. VP2 fragments from different virus strains were sequenced and analyzed by comparison with the published sequences of IBDV strains from China and around the world. Phylogenetic analysis of hypervariable regions of the VP2 (vVP2) gene showed that 29 of the isolates were very virulent (vv) IBDVs, and were closely related to vvIBDV strains from Europe and Asia. Alignment analysis of the deduced amino acid (aa) sequences of vVP2 showed the 29 vv isolates had high uniformity, indicated low variability and slow evolution of the virus. The non-vvIBDV isolate JX2-11 was associated with higher than expected mortality, and had high deduced aa sequence similarity (99.2 %) with the attenuated vaccine strain B87 (BJ). The present study has demonstrated the continued circulation of IBDV strains in South China, and emphasizes the importance of reinforcing IBDV surveillance.

  17. Community and gene composition of a human dental plaque microbiota obtained by metagenomic sequencing

    PubMed Central

    Xie, G.; Chain, P.S.G.; Lo, C.; Liu, K-L.; Gans, J.; Merritt, J.; Qi, F.

    2010-01-01

    SUMMARY Human dental plaque is a complex microbial community containing an estimated 700 to 19,000 species/phylotypes. Despite numerous studies analysing species richness in healthy and diseased human subjects, the true genomic composition of the human dental plaque microbiota remains unknown. Here we report a metagenomic analysis of a healthy human plaque sample using a combination of second-generation sequencing platforms. A total of 860 million base pairs of non-human sequences were generated. Various analysis tools revealed the presence of 12 well-characterized phyla, members of the TM-7 and BRC1 clade, and sequences that could not be classified. Both pathogens and opportunistic pathogens were identified, supporting the ecological plaque hypothesis for oral diseases. Mapping the metagenomic reads to sequenced reference genomes demonstrated that 4% of the reads could be assigned to the sequenced species. Preliminary annotation identified genes belonging to all known functional categories. Interestingly, although 73% of the total assembled contig sequences were predicted to code for proteins, only 51% of them could be assigned a functional role. Furthermore, ~ 2.8% of the total predicted genes coded for proteins involved in resistance to antibiotics and toxic compounds, suggesting that the oral cavity is an important reservoir for antimicrobial resistance. PMID:21040513

  18. Community and gene composition of a human dental plaque microbiota obtained by metagenomic sequencing.

    PubMed

    Xie, G; Chain, P S G; Lo, C-C; Liu, K-L; Gans, J; Merritt, J; Qi, F

    2010-12-01

    Human dental plaque is a complex microbial community containing an estimated 700 to 19,000 species/phylotypes. Despite numerous studies analysing species richness in healthy and diseased human subjects, the true genomic composition of the human dental plaque microbiota remains unknown. Here we report a metagenomic analysis of a healthy human plaque sample using a combination of second-generation sequencing platforms. A total of 860 million base pairs of non-human sequences were generated. Various analysis tools revealed the presence of 12 well-characterized phyla, members of the TM-7 and BRC1 clade, and sequences that could not be classified. Both pathogens and opportunistic pathogens were identified, supporting the ecological plaque hypothesis for oral diseases. Mapping the metagenomic reads to sequenced reference genomes demonstrated that 4% of the reads could be assigned to the sequenced species. Preliminary annotation identified genes belonging to all known functional categories. Interestingly, although 73% of the total assembled contig sequences were predicted to code for proteins, only 51% of them could be assigned a functional role. Furthermore, ~2.8% of the total predicted genes coded for proteins involved in resistance to antibiotics and toxic compounds, suggesting that the oral cavity is an important reservoir for antimicrobial resistance. © 2010 John Wiley & Sons A/S.

  19. A novel tandem repeat sequence located on human chromosome 4p: isolation and characterization.

    PubMed

    Kogi, M; Fukushige, S; Lefevre, C; Hadano, S; Ikeda, J E

    1997-06-01

    In an effort to analyze the genomic region of the distal half of human chromosome 4p, to where Huntington disease and other diseases have been mapped, we have isolated the cosmid clone (CRS447) that was likely to contain a region with specific repeat sequences. Clone CRS447 was subjected to detailed analysis, including chromosome mapping, restriction mapping, and DNA sequencing. Chromosome mapping by both a human-CHO hybrid cell panel and FISH revealed that CRS447 was predominantly located in the 4p15.1-15.3 region. CRS447 was shown to consist of tandem repeats of 4.7-kb units present on chromosome 4p. A single EcoRI unit was subcloned (pRS447), and the complete sequence was determined as 4752 nucleotides. When pRS447 was used as a probe, the number of copies of this repeat per haploid genome was estimated to be 50-70. Sequence analysis revealed that it contained two internal CA repeats and one putative ORF. Database search established that this sequence was unreported. However, two homologous STS markers were found in the database. We concluded that CRS447/pRS447 is a novel tandem repeat sequence that is mainly specific to human chromosome 4p.

  20. Study of infectious diseases in archaeological bone material - A dataset.

    PubMed

    Pucu, Elisa; Cascardo, Paula; Chame, Marcia; Felice, Gisele; Guidon, Niéde; Cleonice Vergne, Maria; Campos, Guadalupe; Roberto Machado-Silva, José; Leles, Daniela

    2017-08-01

    Bones of human and ground sloth remains were analyzed for presence of Trypanosoma cruzi by conventional PCR using primers TC, TC1 and TC2. Sequence results amplified a fragment with the same product size as the primers (300 and 350pb). Amplified PCR product was sequenced and analyzed on GenBank, using Blast. Although these sequences did not match with these parasites they showed high amplification with species of bacteria. This article presents the methodology used and the alignment of the sequences. The display of this dataset will allow further analysis of our results and discussion presented in the manuscript "Finding the unexpected: a critical view on molecular diagnosis of infectious diseases in archaeological samples" (Pucu et al. 2017) [1].

  1. Complete Genome Sequence of a Genotype XVII Newcastle Disease Virus, Isolated from an Apparently Healthy Domestic Duck in Nigeria

    PubMed Central

    Shittu, Ismaila; Sharma, Poonam; Joannis, Tony M.; Volkening, Jeremy D.; Odaibo, Georgina N.; Olaleye, David O.; Williams-Coplin, Dawn; Solomon, Ponman; Abolnik, Celia; Miller, Patti J.; Dimitrov, Kiril M.

    2016-01-01

    The first complete genome sequence of a strain of Newcastle disease virus (NDV) of genotype XVII is described here. A velogenic strain (duck/Nigeria/903/KUDU-113/1992) was isolated from an apparently healthy free-roaming domestic duck sampled in Kuru, Nigeria, in 1992. Phylogenetic analysis of the fusion protein gene and complete genome classified the isolate as a member of NDV class II, genotype XVII. PMID:26847901

  2. Usage of mitochondrial D-loop variation to predict risk for Huntington disease.

    PubMed

    Mousavizadeh, Kazem; Rajabi, Peyman; Alaee, Mahsa; Dadgar, Sepideh; Houshmand, Massoud

    2015-08-01

    Huntington's disease (HD) is an inherited autosomal neurodegenerative disease caused by the abnormal expansion of the CAG repeats in the Huntingtin (Htt) gene. It has been proven that mitochondrial dysfunction is contributed to the pathogenesis of Huntington's disease. The mitochondrial displacement loop (D-loop) is proven to accumulate mutations at a higher rate than other regions of mtDNA. Thus, we hypothesized that specific SNPs in the D-loop may contribute to the pathogenesis of Huntington's disease. In the present study, 30 patients with Huntington's disease and 463 healthy controls were evaluated for mitochondrial mutation sites within the D-loop region using PCR-sequencing method. Sequence analysis revealed 35 variations in HD group from Cambridge Mitochondrial Sequences. A significant difference (p < 0.05) was seen between patients and control group in eight SNPs. Polymorphisms at C16069T, T16126C, T16189C, T16519C and C16223T were correlated with an increased risk of HD while SNPs at C16150T, T16086C and T16195C were associated with a decreased risk of Huntington's disease.

  3. Sequence analysis of Leukemia DNA

    NASA Astrophysics Data System (ADS)

    Nacong, Nasria; Lusiyanti, Desy; Irawan, Muhammad. Isa

    2018-03-01

    Cancer is a very deadly disease, one of which is leukemia disease or better known as blood cancer. The cancer cell can be detected by taking DNA in laboratory test. This study focused on local alignment of leukemia and non leukemia data resulting from NCBI in the form of DNA sequences by using Smith-Waterman algorithm. SmithWaterman algorithm was invented by TF Smith and MS Waterman in 1981. These algorithms try to find as much as possible similarity of a pair of sequences, by giving a negative value to the unequal base pair (mismatch), and positive values on the same base pair (match). So that will obtain the maximum positive value as the end of the alignment, and the minimum value as the initial alignment. This study will use sequences of leukemia and 3 sequences of non leukemia.

  4. Inferring Higher Functional Information for RIKEN Mouse Full-Length cDNA Clones With FACTS

    PubMed Central

    Nagashima, Takeshi; Silva, Diego G.; Petrovsky, Nikolai; Socha, Luis A.; Suzuki, Harukazu; Saito, Rintaro; Kasukawa, Takeya; Kurochkin, Igor V.; Konagaya, Akihiko; Schönbach, Christian

    2003-01-01

    FACTS (Functional Association/Annotation of cDNA Clones from Text/Sequence Sources) is a semiautomated knowledge discovery and annotation system that integrates molecular function information derived from sequence analysis results (sequence inferred) with functional information extracted from text. Text-inferred information was extracted from keyword-based retrievals of MEDLINE abstracts and by matching of gene or protein names to OMIM, BIND, and DIP database entries. Using FACTS, we found that 47.5% of the 60,770 RIKEN mouse cDNA FANTOM2 clone annotations were informative for text searches. MEDLINE queries yielded molecular interaction-containing sentences for 23.1% of the clones. When disease MeSH and GO terms were matched with retrieved abstracts, 22.7% of clones were associated with potential diseases, and 32.5% with GO identifiers. A significant number (23.5%) of disease MeSH-associated clones were also found to have a hereditary disease association (OMIM Morbidmap). Inferred neoplastic and nervous system disease represented 49.6% and 36.0% of disease MeSH-associated clones, respectively. A comparison of sequence-based GO assignments with informative text-based GO assignments revealed that for 78.2% of clones, identical GO assignments were provided for that clone by either method, whereas for 21.8% of clones, the assignments differed. In contrast, for OMIM assignments, only 28.5% of clones had identical sequence-based and text-based OMIM assignments. Sequence, sentence, and term-based functional associations are included in the FACTS database (http://facts.gsc.riken.go.jp/), which permits results to be annotated and explored through web-accessible keyword and sequence search interfaces. The FACTS database will be a critical tool for investigating the functional complexity of the mouse transcriptome, cDNA-inferred interactome (molecular interactions), and pathome (pathologies). PMID:12819151

  5. Clan Genomics and the Complex Architecture of Human Disease

    PubMed Central

    Belmont, John W.; Boerwinkle, Eric

    2013-01-01

    Human diseases are caused by alleles that encompass the full range of variant types, from single-nucleotide changes to copy-number variants, and these variations span a broad frequency spectrum, from the very rare to the common. The picture emerging from analysis of whole-genome sequences, the 1000 Genomes Project pilot studies, and targeted genomic sequencing derived from very large sample sizes reveals an abundance of rare and private variants. One implication of this realization is that recent mutation may have a greater influence on disease susceptibility or protection than is conferred by variations that arose in distant ancestors. PMID:21962505

  6. Quantifying Transmission.

    PubMed

    Woolhouse, Mark

    2017-07-01

    Transmissibility is the defining characteristic of infectious diseases. Quantifying transmission matters for understanding infectious disease epidemiology and designing evidence-based disease control programs. Tracing individual transmission events can be achieved by epidemiological investigation coupled with pathogen typing or genome sequencing. Individual infectiousness can be estimated by measuring pathogen loads, but few studies have directly estimated the ability of infected hosts to transmit to uninfected hosts. Individuals' opportunities to transmit infection are dependent on behavioral and other risk factors relevant given the transmission route of the pathogen concerned. Transmission at the population level can be quantified through knowledge of risk factors in the population or phylogeographic analysis of pathogen sequence data. Mathematical model-based approaches require estimation of the per capita transmission rate and basic reproduction number, obtained by fitting models to case data and/or analysis of pathogen sequence data. Heterogeneities in infectiousness, contact behavior, and susceptibility can have substantial effects on the epidemiology of an infectious disease, so estimates of only mean values may be insufficient. For some pathogens, super-shedders (infected individuals who are highly infectious) and super-spreaders (individuals with more opportunities to transmit infection) may be important. Future work on quantifying transmission should involve integrated analyses of multiple data sources.

  7. Identification of a previously undescribed divergent virus from the Flaviviridae family in an outbreak of equine serum hepatitis.

    PubMed

    Chandriani, Sanjay; Skewes-Cox, Peter; Zhong, Weidong; Ganem, Donald E; Divers, Thomas J; Van Blaricum, Anita J; Tennant, Bud C; Kistler, Amy L

    2013-04-09

    Theiler's disease is an acute hepatitis in horses that is associated with the administration of equine blood products; its etiologic agent has remained unknown for nearly a century. Here, we used massively parallel sequencing to explore samples from a recent Theiler's disease outbreak. Metatranscriptomic analysis of the short sequence reads identified a 10.5-kb sequence from a previously undescribed virus of the Flaviviridae family, which we designate "Theiler's disease-associated virus" (TDAV). Phylogenetic analysis clusters TDAV with GB viruses of the recently proposed Pegivirus genus, although it shares only 35.3% amino acid identity with its closest relative, GB virus D. An epidemiological survey of additional horses from three separate locations supports an association between TDAV infection and acute serum hepatitis. Experimental inoculation of horses with TDAV-positive plasma provides evidence that several weeks of viremia preceded liver injury and that liver disease may not be directly related to the level of viremia. Like hepatitis C virus, the best characterized Flaviviridae species known to cause hepatitis, we find TDAV is capable of efficient parenteral transmission, engendering acute and chronic infections associated with a diversity of clinical presentations ranging from subclinical infection to clinical hepatitis.

  8. SNP-VISTA: An Interactive SNPs Visualization Tool

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shah, Nameeta; Teplitsky, Michael V.; Pennacchio, Len A.

    2005-07-05

    Recent advances in sequencing technologies promise better diagnostics for many diseases as well as better understanding of evolution of microbial populations. Single Nucleotide Polymorphisms(SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it is possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease and then screen for causative mutations.In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmentalmore » samples makes possible more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at http://genome.lbl.gov/vista/snpvista.« less

  9. Characterization of bacterial knot disease caused by Pseudomonas savastanoi pv. savastanoi on pomegranate (Punica granatum L.) trees: a new host of the pathogen.

    PubMed

    Bozkurt, I A; Soylu, S; Mirik, M; Ulubas Serce, C; Baysal, Ö

    2014-11-01

    This study aimed to isolate and identify the causal organism causing hyperplastic outgrowths (knots) on stems and branches of pomegranate trees in the Eastern Mediterranean region of Turkey. Bacterial colonies were isolated from young knots on plates containing selective nutrient media. Biochemical tests, fatty acid analysis and PCR were performed to identify possible causal disease agent. Representative isolates were identified as Pseudomonas.pv.savastanoi (Psv) using biochemical tests, fatty acid profiling and PCR. Following inoculation of pomegranate plants (cv. hicaz) with bacterial suspensions, 25 of 54 bacterial isolates caused typical knots at the site of inoculation. PCR analysis, using specific primer for Psv, generated a single amplicon from all isolates. The similarity of the sequence of Turkish pomegranate isolate was 99% similar to the corresponding gene sequences of Psv in the databases. Based on symptoms, biochemical, molecular, pathogenicity tests and sequence analyses, the disease agent of knots observed on the pomegranate trees is Psv. To the best of our knowledge, this research has revealed pomegranate as a natural host of Psv, which extends the list of host plant species affected by the pathogen in the world and Turkey. Pomegranate trees were affected by the disease with outgrowths (galls or knot) disease. Currently, there is no published study on disease agent(s) causing the galls or knots on pomegranate trees in worldwide. Bacterial colonies were isolated from young knots. The causal agent of the knot Pseudomonas savastanoi pv.savastanoi (Psv) was identified based on symptoms, biochemical, molecular methods, pathogenicity tests and sequence analysis. To the best of our knowledge, this is the first report of Psv on pomegranate as a natural host, which extends the growing list of plant species affected by this bacterium in the world and Turkey. © 2014 The Society for Applied Microbiology.

  10. Molecular characterization of banana bunchy top virus isolate from Sri Lanka and its genetic relationship with other isolates.

    PubMed

    Wickramaarachchi, W A R T; Shankarappa, K S; Rangaswamy, K T; Maruthi, M N; Rajapakse, R G A S; Ghosh, Saptarshi

    2016-06-01

    Bunchy top disease of banana caused by Banana bunchy top virus (BBTV, genus Babuvirus family Nanoviridae) is one of the most important constraints in production of banana in the different parts of the world. Six genomic DNA components of BBTV isolate from Kandy, Sri Lanka (BBTV-K) were amplified by polymerase chain reaction (PCR) with specific primers using total DNA extracted from banana tissues showing typical symptoms of bunchy top disease. The amplicons were of expected size of 1.0-1.1 kb, which were cloned and sequenced. Analysis of sequence data revealed the presence of six DNA components; DNA-R, DNA-U3, DNA-S, DNA-N, DNA-M and DNA-C for Sri Lanka isolate. Comparisons of sequence data of DNA components followed by the phylogenetic analysis, grouped Sri Lanka-(Kandy) isolate in the Pacific Indian Oceans (PIO) group. Sri Lanka-(Kandy) isolate of BBTV is classified a new member of PIO group based on analysis of six components of the virus.

  11. Application of circular consensus sequencing and network analysis to characterize the bovine IgG repertoire

    USDA-ARS?s Scientific Manuscript database

    Background: Vertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens. Next generation sequencing methods provide unique approaches to a number of immuno-based research areas including antibody discovery and engineering, disease surve...

  12. Mutation Spectrum of the ABCA4 Gene in a Greek Cohort with Stargardt Disease: Identification of Novel Mutations and Evidence of Three Prevalent Mutated Alleles

    PubMed Central

    Vassiliki, Kokkinou; George, Koutsodontis; Polixeni, Stamatiou; Christoforos, Giatzakis; Minas, Aslanides Ioannis; Stavrenia, Koukoula; Ioannis, Datseris

    2018-01-01

    Aim To evaluate the frequency and pattern of disease-associated mutations of ABCA4 gene among Greek patients with presumed Stargardt disease (STGD1). Materials and Methods A total of 59 patients were analyzed for ABCA4 mutations using the ABCR400 microarray and PCR-based sequencing of all coding exons and flanking intronic regions. MLPA analysis as well as sequencing of two regions in introns 30 and 36 reported earlier to harbor deep intronic disease-associated variants was used in 4 selected cases. Results An overall detection rate of at least one mutant allele was achieved in 52 of the 59 patients (88.1%). Direct sequencing improved significantly the complete characterization rate, that is, identification of two mutations compared to the microarray analysis (93.1% versus 50%). In total, 40 distinct potentially disease-causing variants of the ABCA4 gene were detected, including six previously unreported potentially pathogenic variants. Among the disease-causing variants, in this cohort, the most frequent was c.5714+5G>A representing 16.1%, while p.Gly1961Glu and p.Leu541Pro represented 15.2% and 8.5%, respectively. Conclusions By using a combination of methods, we completely molecularly diagnosed 48 of the 59 patients studied. In addition, we identified six previously unreported, potentially pathogenic ABCA4 mutations. PMID:29854428

  13. Integrated systems analysis reveals a molecular network underlying autism spectrum disorders

    PubMed Central

    Li, Jingjing; Shi, Minyi; Ma, Zhihai; Zhao, Shuchun; Euskirchen, Ghia; Ziskin, Jennifer; Urban, Alexander; Hallmayer, Joachim; Snyder, Michael

    2014-01-01

    Autism is a complex disease whose etiology remains elusive. We integrated previously and newly generated data and developed a systems framework involving the interactome, gene expression and genome sequencing to identify a protein interaction module with members strongly enriched for autism candidate genes. Sequencing of 25 patients confirmed the involvement of this module in autism, which was subsequently validated using an independent cohort of over 500 patients. Expression of this module was dichotomized with a ubiquitously expressed subcomponent and another subcomponent preferentially expressed in the corpus callosum, which was significantly affected by our identified mutations in the network center. RNA-sequencing of the corpus callosum from patients with autism exhibited extensive gene mis-expression in this module, and our immunochemical analysis showed that the human corpus callosum is predominantly populated by oligodendrocyte cells. Analysis of functional genomic data further revealed a significant involvement of this module in the development of oligodendrocyte cells in mouse brain. Our analysis delineates a natural network involved in autism, helps uncover novel candidate genes for this disease and improves our understanding of its molecular pathology. PMID:25549968

  14. EventThread: Visual Summarization and Stage Analysis of Event Sequence Data.

    PubMed

    Guo, Shunan; Xu, Ke; Zhao, Rongwen; Gotz, David; Zha, Hongyuan; Cao, Nan

    2018-01-01

    Event sequence data such as electronic health records, a person's academic records, or car service records, are ordered series of events which have occurred over a period of time. Analyzing collections of event sequences can reveal common or semantically important sequential patterns. For example, event sequence analysis might reveal frequently used care plans for treating a disease, typical publishing patterns of professors, and the patterns of service that result in a well-maintained car. It is challenging, however, to visually explore large numbers of event sequences, or sequences with large numbers of event types. Existing methods focus on extracting explicitly matching patterns of events using statistical analysis to create stages of event progression over time. However, these methods fail to capture latent clusters of similar but not identical evolutions of event sequences. In this paper, we introduce a novel visualization system named EventThread which clusters event sequences into threads based on tensor analysis and visualizes the latent stage categories and evolution patterns by interactively grouping the threads by similarity into time-specific clusters. We demonstrate the effectiveness of EventThread through usage scenarios in three different application domains and via interviews with an expert user.

  15. Sequencing of a Patient with Balanced Chromosome Abnormalities and Neurodevelopmental Disease Identifies Disruption of Multiple High Risk Loci by Structural Variation

    PubMed Central

    Blake, Jonathon; Riddell, Andrew; Theiss, Susanne; Gonzalez, Alexis Perez; Haase, Bettina; Jauch, Anna; Janssen, Johannes W. G.; Ibberson, David; Pavlinic, Dinko; Moog, Ute; Benes, Vladimir; Runz, Heiko

    2014-01-01

    Balanced chromosome abnormalities (BCAs) occur at a high frequency in healthy and diseased individuals, but cost-efficient strategies to identify BCAs and evaluate whether they contribute to a phenotype have not yet become widespread. Here we apply genome-wide mate-pair library sequencing to characterize structural variation in a patient with unclear neurodevelopmental disease (NDD) and complex de novo BCAs at the karyotype level. Nucleotide-level characterization of the clinically described BCA breakpoints revealed disruption of at least three NDD candidate genes (LINC00299, NUP205, PSMD14) that gave rise to abnormal mRNAs and could be assumed as disease-causing. However, unbiased genome-wide analysis of the sequencing data for cryptic structural variation was key to reveal an additional submicroscopic inversion that truncates the schizophrenia- and bipolar disorder-associated brain transcription factor ZNF804A as an equally likely NDD-driving gene. Deep sequencing of fluorescent-sorted wild-type and derivative chromosomes confirmed the clinically undetected BCA. Moreover, deep sequencing further validated a high accuracy of mate-pair library sequencing to detect structural variants larger than 10 kB, proposing that this approach is powerful for clinical-grade genome-wide structural variant detection. Our study supports previous evidence for a role of ZNF804A in NDD and highlights the need for a more comprehensive assessment of structural variation in karyotypically abnormal individuals and patients with neurocognitive disease to avoid diagnostic deception. PMID:24625750

  16. Differentiation of BHV-1 isolates from vaccine virus by high-resolution melting analysis.

    PubMed

    Ostertag-Hill, Claire; Fang, Liang; Izume, Satoko; Lee, Megan; Reed, Aimee; Jin, Ling

    2015-02-16

    An efficacious bovine herpesvirus type-1 (BHV-1) vaccine has been used for many years. However, in the past few years, abortion and respiratory diseases have occurred after administration of the modified live vaccine. To investigate whether BHV-1 isolates from disease outbreaks are identical to those of the vaccines used, selected regions of the BHV-1 genome were investigated by high-resolution melting (HRM) analysis and PCR-DNA sequencing. When a target region within the thymidine kinase (TK) gene was examined by HRM analysis, 6 out of the 11 isolates from abortion cases and 22 out of the 25 isolates from bovine respiratory disease (BRD) cases had different melting curves compared to the vaccine virus. Surprisingly, when a conserved region within the US6 gene that encodes glycoprotein D (gD) was examined by HRM analysis, 5 out of the 11 abortion isolates and 18 out of the 23 BRD isolates had different melting curves from the vaccine virus. To determine whether SNPs within the coding regions of glycoprotein E (gE) and TK genes can be used to differentiate the isolates from the vaccine virus, PCR-DNA sequencing was used to examine these SNPs in all the isolates. This revealed that only 1 out of 11 of the abortion isolates and 4 out of 24 of the BRD isolates are different in the target region of gE from the vaccine virus, while 5 out of 11 abortion isolates and 4 out of 22 BRD isolates are different in the target region of TK from the vaccine virus. No DNA sequence differences were observed in glycoprotein G (gG) region between disease and vaccine isolates. Our study demonstrated that many disease isolates had genetic differences from the vaccine virus in regions examined by HRM and PCR-DNA sequencing analysis. In addition, many isolates contained more than one type of mutation and were composed of mixed variants. Our study suggests that a mixture of variants were present in isolates collected post-vaccination. HRM is a rapid diagnostic method that can be used for rapid differentiation of clinical isolates from vaccine strains. Copyright © 2014 Elsevier B.V. All rights reserved.

  17. Making the Leap from Research Laboratory to Clinic: Challenges and Opportunities for Next-Generation Sequencing in Infectious Disease Diagnostics

    PubMed Central

    Goldberg, Brittany; Sichtig, Heike; Geyer, Chelsie; Ledeboer, Nathan

    2015-01-01

    ABSTRACT Next-generation DNA sequencing (NGS) has progressed enormously over the past decade, transforming genomic analysis and opening up many new opportunities for applications in clinical microbiology laboratories. The impact of NGS on microbiology has been revolutionary, with new microbial genomic sequences being generated daily, leading to the development of large databases of genomes and gene sequences. The ability to analyze microbial communities without culturing organisms has created the ever-growing field of metagenomics and microbiome analysis and has generated significant new insights into the relation between host and microbe. The medical literature contains many examples of how this new technology can be used for infectious disease diagnostics and pathogen analysis. The implementation of NGS in medical practice has been a slow process due to various challenges such as clinical trials, lack of applicable regulatory guidelines, and the adaptation of the technology to the clinical environment. In April 2015, the American Academy of Microbiology (AAM) convened a colloquium to begin to define these issues, and in this document, we present some of the concepts that were generated from these discussions. PMID:26646014

  18. Identification of feline polycystic kidney disease mutation using fret probes and melting curve analysis.

    PubMed

    Criado-Fornelio, A; Buling, A; Barba-Carretero, J C

    2009-02-01

    We developed and validated a real-time polymerase chain reaction (PCR) assay using fluorescent hybridization probes and melting curve analysis to identify the PKD1 exon 29 (C-->A) mutation, which is implicated in polycystic kidney disease of cats. DNA was isolated from peripheral blood of 20 Persian cats. The employ of the new real-time PCR and melting curve analysis in these samples indicated that 13 cats (65%) were wild type homozygotes and seven cats (35%) were heterozygotes. Both PCR-RFLP and sequencing procedures were in full agreement with real-time PCR test results. Sequence analysis showed that the mutant gene had the expected base change compared to the wild type gene. The new procedure is not only very reliable but also faster than the techniques currently applied for diagnosis of the mutation.

  19. Neutralizing antibodies against West Nile virus identified directly from human B cells by single-cell analysis and next generation sequencing

    PubMed Central

    Tsioris, Konstantinos; Gupta, Namita T.; Ogunniyi, Adebola O.; Zimnisky, Ross M.; Qian, Feng; Yao, Yi; Wang, Xiaomei; Stern, Joel N. H.; Chari, Raj; Briggs, Adrian W.; Clouser, Christopher R.; Vigneault, Francois; Church, George M.; Garcia, Melissa N.; Murray, Kristy O.; Montgomery, Ruth R.; Kleinstein, Steven H.; Love, J. Christopher

    2015-01-01

    West Nile virus infection (WNV) is an emerging mosquito-borne disease that can lead to severe neurological illness and currently has no available treatment or vaccine. Using microengraving, an integrated single-cell analysis method, we analyzed a cohort of subjects infected with WNV - recently infected and post-convalescent subjects - and efficiently identified four novel WNV neutralizing antibodies. We also assessed the humoral response to WNV on a single-cell and repertoire level by integrating next generation sequencing (NGS) into our analysis. The results from single-cell analysis indicate persistence of WNV-specific memory B cells and antibody-secreting cells in post-convalescent subjects. These cells exhibited class-switched antibody isotypes. Furthermore, the results suggest that the antibody response itself does not predict the clinical severity of the disease (asymptomatic or symptomatic). Using the nucleotide coding sequences for WNV-specific antibodies derived from single cells, we revealed the ontogeny of expanded WNV-specific clones in the repertoires of recently infected subjects through NGS and bioinformatic analysis. This analysis also indicated that the humoral response to WNV did not depend on an anamnestic response, due to an unlikely previous exposure to the virus. The innovative and integrative approach presented here to analyze the evolution of neutralizing antibodies from natural infection on a single-cell and repertoire level can also be applied to vaccine studies, and could potentially aid the development of therapeutic antibodies and our basic understanding of other infectious diseases. PMID:26481611

  20. Neutralizing antibodies against West Nile virus identified directly from human B cells by single-cell analysis and next generation sequencing.

    PubMed

    Tsioris, Konstantinos; Gupta, Namita T; Ogunniyi, Adebola O; Zimnisky, Ross M; Qian, Feng; Yao, Yi; Wang, Xiaomei; Stern, Joel N H; Chari, Raj; Briggs, Adrian W; Clouser, Christopher R; Vigneault, Francois; Church, George M; Garcia, Melissa N; Murray, Kristy O; Montgomery, Ruth R; Kleinstein, Steven H; Love, J Christopher

    2015-12-01

    West Nile virus (WNV) infection is an emerging mosquito-borne disease that can lead to severe neurological illness and currently has no available treatment or vaccine. Using microengraving, an integrated single-cell analysis method, we analyzed a cohort of subjects infected with WNV - recently infected and post-convalescent subjects - and efficiently identified four novel WNV neutralizing antibodies. We also assessed the humoral response to WNV on a single-cell and repertoire level by integrating next generation sequencing (NGS) into our analysis. The results from single-cell analysis indicate persistence of WNV-specific memory B cells and antibody-secreting cells in post-convalescent subjects. These cells exhibited class-switched antibody isotypes. Furthermore, the results suggest that the antibody response itself does not predict the clinical severity of the disease (asymptomatic or symptomatic). Using the nucleotide coding sequences for WNV-specific antibodies derived from single cells, we revealed the ontogeny of expanded WNV-specific clones in the repertoires of recently infected subjects through NGS and bioinformatic analysis. This analysis also indicated that the humoral response to WNV did not depend on an anamnestic response, due to an unlikely previous exposure to the virus. The innovative and integrative approach presented here to analyze the evolution of neutralizing antibodies from natural infection on a single-cell and repertoire level can also be applied to vaccine studies, and could potentially aid the development of therapeutic antibodies and our basic understanding of other infectious diseases.

  1. First report of Pantoea ananatis (Syn. Erwinia uredovora) being associated with peanut rust in Georgia

    USDA-ARS?s Scientific Manuscript database

    Peanut rust is caused by the fungus Puccinia arachidis. This disease, if not treated can cause severe damage and defoliation. While sequencing DNA of urediniospores of the rust fungus, BLAST analysis detected many sequences corresponding to the bacterial species Pantoea ananatis. This bacterium, ...

  2. The Papillomavirus Episteme: a central resource for papillomavirus sequence data and analysis.

    PubMed

    Van Doorslaer, Koenraad; Tan, Qina; Xirasagar, Sandhya; Bandaru, Sandya; Gopalan, Vivek; Mohamoud, Yasmin; Huyen, Yentram; McBride, Alison A

    2013-01-01

    The goal of the Papillomavirus Episteme (PaVE) is to provide an integrated resource for the analysis of papillomavirus (PV) genome sequences and related information. The PaVE is a freely accessible, web-based tool (http://pave.niaid.nih.gov) created around a relational database, which enables storage, analysis and exchange of sequence information. From a design perspective, the PaVE adopts an Open Source software approach and stresses the integration and reuse of existing tools. Reference PV genome sequences have been extracted from publicly available databases and reannotated using a custom-created tool. To date, the PaVE contains 241 annotated PV genomes, 2245 genes and regions, 2004 protein sequences and 47 protein structures, which users can explore, analyze or download. The PaVE provides scientists with the data and tools needed to accelerate scientific progress for the study and treatment of diseases caused by PVs.

  3. Molecular characterization of infectious bursal disease viruses from Pakistan.

    PubMed

    Shabbir, Muhammad Zubair; Ali, Muhammad; Abbas, Muhammad; Chaudhry, Umer Naveed; Zia-Ur-Rehman; Munir, Muhammad

    2016-07-01

    Since the first report of infectious bursal disease in Pakistan in 1987, outbreaks have been common even in vaccinated flocks. Despite appropriate administration of vaccines, concerns arise if the circulating strains are different from the ones used in the vaccine. Here, we sequenced the hypervariable region (HVR) of the VP2 gene of circulating strains of infectious bursal disease virus (IBDV) originating from outbreaks (n = 4) in broiler flocks in Pakistan. Nucleotide sequencing followed by phylogeny and deduced amino acid sequence analysis showed the circulating strains to be very virulent (vv) and identified characteristic residues at position 222 (A), 242 (I), 256 (I), 294 (I) and 299 (S). In addition, a substitution at positions 221 (Q→H) was found to be exclusive to Pakistani strains in our analysis, although a larger dataset is required to confirm this finding. Compared to vaccine strains that are commonly used in Pakistan, substitution mutations were found at key amino acid positions in VP2 that may be responsible for potential changes in neutralization epitopes and vaccine failure.

  4. Marek’s Disease Virus influences the core gut microbiome of the chicken during the early and late phases of viral replication

    USDA-ARS?s Scientific Manuscript database

    Marek’s disease (MD) is an important neoplastic disease of chickens caused by the Marek’s disease virus (MDV), an oncogenic alphaherpesvirus. In this study, dysbiosis induced by MDV on the core gut flora of chicken was assessed using next generation sequence (NGS) analysis. Total fecal and cecum-der...

  5. Genomics and metagenomics in medical microbiology.

    PubMed

    Padmanabhan, Roshan; Mishra, Ajay Kumar; Raoult, Didier; Fournier, Pierre-Edouard

    2013-12-01

    Over the last two decades, sequencing tools have evolved from laborious time-consuming methodologies to real-time detection and deciphering of genomic DNA. Genome sequencing, especially using next generation sequencing (NGS) has revolutionized the landscape of microbiology and infectious disease. This deluge of sequencing data has not only enabled advances in fundamental biology but also helped improve diagnosis, typing of pathogen, virulence and antibiotic resistance detection, and development of new vaccines and culture media. In addition, NGS also enabled efficient analysis of complex human micro-floras, both commensal, and pathological, through metagenomic methods, thus helping the comprehension and management of human diseases such as obesity. This review summarizes technological advances in genomics and metagenomics relevant to the field of medical microbiology. Copyright © 2013 Elsevier B.V. All rights reserved.

  6. Analysis of the beak and feather disease viral genome indicates the existence of several genotypes which have a complex psittacine host specificity.

    PubMed

    de Kloet, E; de Kloet, S R

    2004-12-01

    A study was made of the phylogenetic relationships between fifteen complete nucleotide sequences as well as 43 nucleotide sequences of the putative coat protein gene of different strains belonging to the virus species Beak and feather disease virus obtained from 39 individuals of 16 psittacine species. The species included among others, cockatoos ( Cacatuini), African grey parrots ( Psittacus erithacus) and peach-faced lovebirds ( Agapornis roseicollis), which were infected at different geographical locations, within and outside Australia, the native origin of the virus. The derived amino acid sequences of the putative coat protein were highly diverse, with differences between some strains amounting to 50 of the 250 amino acids. Phylogenetic analysis demonstrated that the putative coat gene sequences form six clusters which show a varying degree of psittacine species specificity. Most, but not all strains infecting African grey parrots formed a single cluster as did the strains infecting the cockatoos. Strains infecting the lovebirds clustered with those infecting such Australasian species as Eclectus roratus, Psittacula kramerii and Psephotus haematogaster. Although individual birds included in this study were, where studied, often infected by closely related strains, infection by highly diverged trains was also detected. The possible relationship between BFD viral strains and clinical disease signs is discussed.

  7. Nucleotide sequence analysis of the L gene of Newcastle disease virus: homologies with Sendai and vesicular stomatitis viruses.

    PubMed Central

    Yusoff, K; Millar, N S; Chambers, P; Emmerson, P T

    1987-01-01

    The nucleotide sequence of the L gene of the Beaudette C strain of Newcastle disease virus (NDV) has been determined. The L gene is 6704 nucleotides long and encodes a protein of 2204 amino acids with a calculated molecular weight of 248822. Mung bean nuclease mapping of the 5' terminus of the L gene mRNA indicates that the transcription of the L gene is initiated 11 nucleotides upstream of the translational start site. Comparison with the amino acid sequences of the L genes of Sendai virus and vesicular stomatitis virus (VSV) suggests that there are several regions of homology between the sequences. These data provide further evidence for an evolutionary relationship between the Paramyxoviridae and the Rhabdoviridae. A non-coding sequence of 46 nucleotides downstream of the presumed polyadenylation site of the L gene may be part of a negative strand leader RNA. Images PMID:3035486

  8. Whole genome sequence and comparative analysis of Borrelia burgdorferi MM1

    PubMed Central

    Jabbari, Neda; Reddy, Panga Jaipal; Hood, Leroy

    2018-01-01

    Lyme disease is caused by spirochaetes of the Borrelia burgdorferi sensu lato genospecies. Complete genome assemblies are available for fewer than ten strains of Borrelia burgdorferi sensu stricto, the primary cause of Lyme disease in North America. MM1 is a sensu stricto strain originally isolated in the midwestern United States. Aside from a small number of genes, the complete genome sequence of this strain has not been reported. Here we present the complete genome sequence of MM1 in relation to other sensu stricto strains and in terms of its Multi Locus Sequence Typing. Our results indicate that MM1 is a new sequence type which contains a conserved main chromosome and 15 plasmids. Our results include the first contiguous 28.5 kb assembly of lp28-8, a linear plasmid carrying the vls antigenic variation system, from a Borrelia burgdorferi sensu stricto strain. PMID:29889842

  9. Characterisation and Next-generation Sequencing Analysis of Unknown Arboviruses

    DTIC Science & Technology

    2012-09-01

    on the development of real- time PCR detection assays for Vibrio cholerae, a water-borne bacterium responsible for severe enteric disease. From...specific sequence [22]. The length of time from harvesting virus to generating samples that are ready for sequencing takes about two weeks, which is a...two viruses, and on day 4 post infection significant and widespread cytopathic effect was observed. The viruses were harvested by ultracentrifugation

  10. Annotate-it: a Swiss-knife approach to annotation, analysis and interpretation of single nucleotide variation in human disease

    PubMed Central

    2012-01-01

    The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org. PMID:23013645

  11. Cracking the Code of Human Diseases Using Next-Generation Sequencing: Applications, Challenges, and Perspectives

    PubMed Central

    Precone, Vincenza; Del Monaco, Valentina; Esposito, Maria Valeria; De Palma, Fatima Domenica Elisa; Ruocco, Anna; D'Argenio, Valeria

    2015-01-01

    Next-generation sequencing (NGS) technologies have greatly impacted on every field of molecular research mainly because they reduce costs and increase throughput of DNA sequencing. These features, together with the technology's flexibility, have opened the way to a variety of applications including the study of the molecular basis of human diseases. Several analytical approaches have been developed to selectively enrich regions of interest from the whole genome in order to identify germinal and/or somatic sequence variants and to study DNA methylation. These approaches are now widely used in research, and they are already being used in routine molecular diagnostics. However, some issues are still controversial, namely, standardization of methods, data analysis and storage, and ethical aspects. Besides providing an overview of the NGS-based approaches most frequently used to study the molecular basis of human diseases at DNA level, we discuss the principal challenges and applications of NGS in the field of human genomics. PMID:26665001

  12. [Two novel pathogenic mutations of GAN gene identified in a patient with giant axonal neuropathy].

    PubMed

    Wang, Juan; Ma, Qingwen; Cai, Qin; Liu, Yanna; Wang, Wei; Ren, Zhaorui

    2016-06-01

    To explore the disease-causing mutations in a patient suspected for giant axonal neuropathy(GAN). Target sequence capture sequencing was used to screen potential mutations in genomic DNA extracted from peripheral blood sample of the patient. Sanger sequencing was applied to confirm the detected mutation. The mutation was verified among 400 GAN alleles from 200 healthy individuals by Sanger sequencing. The function of the mutations was predicted by bioinformatics analysis. The patient was identified as a compound heterozygote carrying two novel pathogenic GAN mutations, i.e., c.778G>T (p.Glu260Ter) and c.277G>A (p.Gly93Arg). Sanger sequencing confirmed that the c.778G>T (p.Glu260Ter) mutation was inherited from his father, while c.277G>A (p.Gly93Arg) was inherited from his mother. The same mutations was not found in the 200 healthy individuals. Bioinformatics analysis predicted that the two mutations probably caused functional abnormality of gigaxonin. Two novel GAN mutations were detected in a patient with GAN. Both mutations are pathogenic and can cause abnormalities of gigaxonin structure and function, leading to pathogenesis of GAN. The results may also offer valuable information for similar diseases.

  13. Cloning and characterization of a Prevotella melaninogenica hemolysin.

    PubMed Central

    Allison, H E; Hillman, J D

    1997-01-01

    Hemolysins have been proven to be important virulence factors in many medically relevant pathogenic organisms. Their production has also been implicated in the etiology of periodontal disease. Hemolytic strain 361B of Prevotella melaninogenica, a putative etiologic agent of periodontal disease, was used in this study. The cloning, sequencing, and characterization of phyA, the structural gene for a P. melaninogenica hemolysin, is described. No extensive sequence homology could be identified between phyA and any reported sequence at either the nucleotide or amino acid level. As predicted from sequence analysis, this gene produces a 39-kDa protein which has hemolytic activity as measured by zymogram analysis. Unlike many Ca2+-dependent bacterial hemolysins, both the cloned and native PhyA proteins were enhanced by the presence of EDTA in a dose-dependent fashion with 40 mM EDTA allowing maximum activity. Ca2+ and Mg2+ were found to be inhibitory. The hemolytic activity also was found to have a dose-dependent endpoint. Through recovery of hemolytic activity from a spent reaction, this endpoint was shown to be the result of end product inhibition. This is the first report describing the cloning and sequencing of a gene from P. melaninogenica. PMID:9199448

  14. Cloning and characterization of a Prevotella melaninogenica hemolysin.

    PubMed

    Allison, H E; Hillman, J D

    1997-07-01

    Hemolysins have been proven to be important virulence factors in many medically relevant pathogenic organisms. Their production has also been implicated in the etiology of periodontal disease. Hemolytic strain 361B of Prevotella melaninogenica, a putative etiologic agent of periodontal disease, was used in this study. The cloning, sequencing, and characterization of phyA, the structural gene for a P. melaninogenica hemolysin, is described. No extensive sequence homology could be identified between phyA and any reported sequence at either the nucleotide or amino acid level. As predicted from sequence analysis, this gene produces a 39-kDa protein which has hemolytic activity as measured by zymogram analysis. Unlike many Ca2+-dependent bacterial hemolysins, both the cloned and native PhyA proteins were enhanced by the presence of EDTA in a dose-dependent fashion with 40 mM EDTA allowing maximum activity. Ca2+ and Mg2+ were found to be inhibitory. The hemolytic activity also was found to have a dose-dependent endpoint. Through recovery of hemolytic activity from a spent reaction, this endpoint was shown to be the result of end product inhibition. This is the first report describing the cloning and sequencing of a gene from P. melaninogenica.

  15. Characterization of the in vitro expressed autoimmune rippling muscle disease immunogenic domain of human titin encoded by TTN exons 248-249

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zelinka, L.; McCann, S.; Budde, J.

    2011-08-05

    Highlights: {yields} Affinity purification of the autoimmune rippling muscle disease immunogenic domain of titin. {yields} Partial sequence analysis confirms that the peptides is in the I band region of titin. {yields} This region of the human titin shows high degree of homology to mouse titin N2-A. -- Abstract: Autoimmune rippling muscle disease (ARMD) is an autoimmune neuromuscular disease associated with myasthenia gravis (MG). Past studies in our laboratory recognized a very high molecular weight skeletal muscle protein antigen identified by ARMD patient antisera as the titin isoform. These past studies used antisera from ARMD and MG patients as probes tomore » screen a human skeletal muscle cDNA library and several pBluescript clones revealed supporting expression of immunoreactive peptides. This study characterizes the products of subcloning the titin immunoreactive domain into pGEX-3X and the subsequent fusion protein. Sequence analysis of the fusion gene indicates the cloned titin domain (GenBank ID: (EU428784)) is in frame and is derived from a sequence of N2-A spanning the exons 248-250 an area that encodes the fibronectin III domain. PCR and EcoR1 restriction mapping studies have demonstrated that the inserted cDNA is of a size that is predicted by bioinformatics analysis of the subclone. Expression of the fusion protein result in the isolation of a polypeptide of 52 kDa consistent with the predicted inferred amino acid sequence. Immunoblot experiments of the fusion protein, using rippling muscle/myasthenia gravis antisera, demonstrate that only the titin domain is immunoreactive.« less

  16. Refined identification of Vibrio bacterial flora from Acanthasther planci based on biochemical profiling and analysis of housekeeping genes.

    PubMed

    Rivera-Posada, J A; Pratchett, M; Cano-Gomez, A; Arango-Gomez, J D; Owens, L

    2011-09-09

    We used a polyphasic approach for precise identification of bacterial flora (Vibrionaceae) isolated from crown-of-thorns starfish (COTS) from Lizard Island (Great Barrier Reef, Australia) and Guam (U.S.A., Western Pacific Ocean). Previous 16S rRNA gene phylogenetic analysis was useful to allocate and identify isolates within the Photobacterium, Splendidus and Harveyi clades but failed in the identification of Vibrio harveyi-like isolates. Species of the V harveyi group have almost indistinguishable phenotypes and genotypes, and thus, identification by standard biochemical tests and 16S rRNA gene analysis is commonly inaccurate. Biochemical profiling and sequence analysis of additional topA and mreB housekeeping genes were carried out for definitive identification of 19 bacterial isolates recovered from sick and wild COTS. For 8 isolates, biochemical profiles and topA and mreB gene sequence alignments with the closest relatives (GenBank) confirmed previous 16S rRNA-based identification: V. fortis and Photobacterium eurosenbergii species (from wild COTS), and V natriegens (from diseased COTS). Further phylogenetic analysis based on topA and mreB concatenated sequences served to identify the remaining 11 V harveyi-like isolates: V. owensii and V. rotiferianus (from wild COTS), and V. owensii, V. rotiferianus, and V. harveyi (from diseased COTS). This study further confirms the reliability of topA-mreB gene sequence analysis for identification of these close species, and it reveals a wider distribution range of the potentially pathogenic V. harveyi group.

  17. HIV-1 Transmission during Early Infection in Men Who Have Sex with Men: A Phylodynamic Analysis

    DOE PAGES

    Volz, Erik M.; Ionides, Edward; Romero-Severson, Ethan O.; ...

    2013-12-10

    Conventional epidemiological surveillance of infectious diseases is focused on characterization of incident infections and estimation of the number of prevalent infections. Advances in methods for the analysis of the population-level genetic variation of viruses can potentially provide information about donors, not just recipients, of infection. Genetic sequences from many viruses are increasingly abundant, especially HIV, which is routinely sequenced for surveillance of drug resistance mutations. In this study, we conducted a phylodynamic analysis of HIV genetic sequence data and surveillance data from a US population of men who have sex with men (MSM) and estimated incidence and transmission rates bymore » stage of infection.« less

  18. Parallel gene analysis with allele-specific padlock probes and tag microarrays

    PubMed Central

    Banér, Johan; Isaksson, Anders; Waldenström, Erik; Jarvius, Jonas; Landegren, Ulf; Nilsson, Mats

    2003-01-01

    Parallel, highly specific analysis methods are required to take advantage of the extensive information about DNA sequence variation and of expressed sequences. We present a scalable laboratory technique suitable to analyze numerous target sequences in multiplexed assays. Sets of padlock probes were applied to analyze single nucleotide variation directly in total genomic DNA or cDNA for parallel genotyping or gene expression analysis. All reacted probes were then co-amplified and identified by hybridization to a standard tag oligonucleotide array. The technique was illustrated by analyzing normal and pathogenic variation within the Wilson disease-related ATP7B gene, both at the level of DNA and RNA, using allele-specific padlock probes. PMID:12930977

  19. HIV-1 Transmission during Early Infection in Men Who Have Sex with Men: A Phylodynamic Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Volz, Erik M.; Ionides, Edward; Romero-Severson, Ethan O.

    Conventional epidemiological surveillance of infectious diseases is focused on characterization of incident infections and estimation of the number of prevalent infections. Advances in methods for the analysis of the population-level genetic variation of viruses can potentially provide information about donors, not just recipients, of infection. Genetic sequences from many viruses are increasingly abundant, especially HIV, which is routinely sequenced for surveillance of drug resistance mutations. In this study, we conducted a phylodynamic analysis of HIV genetic sequence data and surveillance data from a US population of men who have sex with men (MSM) and estimated incidence and transmission rates bymore » stage of infection.« less

  20. Multi-Locus Sequence Typing of Bartonella henselae Isolates from Three Continents Reveals Hypervirulent and Feline-Associated Clones

    PubMed Central

    Arvand, Mardjan; Feil, Edward J.; Giladi, Michael; Boulouis, Henri-Jean; Viezens, Juliane

    2007-01-01

    Bartonella henselae is a zoonotic pathogen and the causative agent of cat scratch disease and a variety of other disease manifestations in humans. Previous investigations have suggested that a limited subset of B. henselae isolates may be associated with human disease. In the present study, 182 human and feline B. henselae isolates from Europe, North America and Australia were analysed by multi-locus sequence typing (MLST) to detect any associations between sequence type (ST), host species and geographical distribution of the isolates. A total of 14 sequence types were detected, but over 66% (16/24) of the isolates recovered from human disease corresponded to a single genotype, ST1, and this type was detected in all three continents. In contrast, 27.2% (43/158) of the feline isolates corresponded to ST7, but this ST was not recovered from humans and was restricted to Europe. The difference in host association of STs 1 (human) and 7 (feline) was statistically significant (P≤0.001). eBURST analysis assigned the 14 STs to three clonal lineages, which contained two or more STs, and a singleton comprising ST7. These groups were broadly consistent with a neighbour-joining tree, although splits decomposition analysis was indicative of a history of recombination. These data indicate that B. henselae lineages differ in their virulence properties for humans and contribute to a better understanding of the population structure of B. henselae. PMID:18094753

  1. Molecular differentiation and phylogenetic analysis of the Egyptian foot-and-mouth disease virus SAT2.

    PubMed

    El-Shehawy, Laila I; Abu-Elnaga, Hany I; Rizk, Sonia A; Abd El-Kreem, Ahmed S; Mohamed, A A; Fawzy, Hossam G

    2014-03-01

    In February 2012, a massive new foot-and-mouth disease (FMD) outbreak struck Egypt. In this work, one-step RT-PCR assays were used for in-house detection and differentiation of foot-and-mouth disease virus (FMDV) in Egypt in this year using pan-serotypic and serotype-targeting sequence primers. FMDV SAT2 was the dominant virus in the examined isolates from the epidemic. The complete VP1 coding regions of two isolates were sequenced. The two isolates had 99.2 % sequence identity to most contemporary Egyptian SAT2 reference viruses, whereas they had 89.7-90.1 % identity to the SAT2/EGY/2/2012 isolate, which was collected from Alexandria, Egypt, and previously sequenced by WRLFMD. Phylogenetic analysis showed that Egypt had one topotype and two lineage of FMDV SAT2 in 2012. The Egyptian and the Palestinian 2012 strains were associated mainly with topotype VII, lineage SAT2/VII/Ghb-12, while the virus isolated from Alexandria Governorate belonged to the SAT2/VII/Alx-12 lineage. Topotype VII also comprised lineages that included strains isolated from Libya in 2012 and 2003. Furthermore, within the same topotype, the Egyptian SAT2/2012 isolates were related to strains from Saudi Arabia, Sudan, Eritrea, Cameroon and Nigeria. Nevertheless, more epidemiological work with neighboring countries is needed to prevent cross-border spread of disease and to reach a precise conclusion about the origin of the 2012 FMDV SAT2 emergency in the Middle East.

  2. Comparative Analysis of Expressed Genes from Cacao Meristems Infected by Moniliophthora perniciosa

    PubMed Central

    Gesteira, Abelmon S.; Micheli, Fabienne; Carels, Nicolas; Da Silva, Aline C.; Gramacho, Karina P.; Schuster, Ivan; Macêdo, Joci N.; Pereira, Gonçalo A. G.; Cascardo, Júlio C. M.

    2007-01-01

    Background and Aims Witches' broom disease is caused by the hemibiotrophic basidiomycete Moniliophthora perniciosa, and is one of the most important diseases of cacao in the western hemisphere. Because very little is known about the global process of such disease development, expressed sequence tags (ESTs) were used to identify genes expressed during the Theobroma cacao–Moniliophthora perniciosa interaction. Methods Two cDNA libraries corresponding to the resistant (RT) and susceptible (SP) cacao–M. perniciosa interactions were constructed from total RNA, using the DB SMART Creator cDNA library kit (Clontech). Clones were randomly selected, sequenced from the 5′ end and analysed using bioinformatics tools including in silico analysis of the differential gene expression. Key Results A total of 6884 ESTs were generated from the RT and SP cDNA libraries. These ESTs were composed of 2585 singlets and 341 contigs for a total of 2926 non-redundant sequences. The redundancy of the libraries was low and their specificity high when compared with the few other cacao libraries already published. Sequence analysis allowed the assignment of a putative functional category for 54 % of sequences, whereas approx. 22 % of sequences corresponded to unknown function and approx. 24 % of sequences did not show any significant similarity with other proteins present in the database. Despite the similar overall distribution of the sequences in functional categories between the two libraries, qualitative differences were observed. Genes involved during the defence response to pathogen infection or in programmed cell death were identified, such as pathogenesis related-proteins, trypsin inhibitor or oxalate oxidase, and some of them showed an in silico differential expression between the resistant and the susceptible interactions. Conclusions As far as is known this is the first EST resource from the cacao–M. perniciosa interaction and it is believed that it will provide a significant contribution to the understanding of the molecular mechanisms of the resistance and susceptibility of cacao to M. perniciosa, to develop strategies to control witches broom, and as a source of polymorphism for molecular marker development and marker-assisted selection. PMID:17557832

  3. Whole-Exome Sequencing to Identify Novel Biological Pathways Associated With Infertility After Pelvic Inflammatory Disease.

    PubMed

    Taylor, Brandie D; Zheng, Xiaojing; Darville, Toni; Zhong, Wujuan; Konganti, Kranti; Abiodun-Ojo, Olayinka; Ness, Roberta B; O'Connell, Catherine M; Haggerty, Catherine L

    2017-01-01

    Ideal management of sexually transmitted infections (STI) may require risk markers for pathology or vaccine development. Previously, we identified common genetic variants associated with chlamydial pelvic inflammatory disease (PID) and reduced fecundity. As this explains only a proportion of the long-term morbidity risk, we used whole-exome sequencing to identify biological pathways that may be associated with STI-related infertility. We obtained stored DNA from 43 non-Hispanic black women with PID from the PID Evaluation and Clinical Health Study. Infertility was assessed at a mean of 84 months. Principal component analysis revealed no population stratification. Potential covariates did not significantly differ between groups. Sequencing kernel association test was used to examine associations between aggregates of variants on a single gene and infertility. The results from the sequencing kernel association test were used to choose "focus genes" (P < 0.01; n = 150) for subsequent Ingenuity Pathway Analysis to identify "gene sets" that are enriched in biologically relevant pathways. Pathway analysis revealed that focus genes were enriched in canonical pathways including, IL-1 signaling, P2Y purinergic receptor signaling, and bone morphogenic protein signaling. Focus genes were enriched in pathways that impact innate and adaptive immunity, protein kinase A activity, cellular growth, and DNA repair. These may alter host resistance or immunopathology after infection. Targeted sequencing of biological pathways identified in this study may provide insight into STI-related infertility.

  4. Complete genome sequence of the European sheatfish virus.

    PubMed

    Mavian, Carla; López-Bueno, Alberto; Fernández Somalo, María Pilar; Alcamí, Antonio; Alejo, Alí

    2012-06-01

    Viral diseases are an increasing threat to the thriving aquaculture industry worldwide. An emerging group of fish pathogens is formed by several ranaviruses, which have been isolated at different locations from freshwater and seawater fish species since 1985. We report the complete genome sequence of European sheatfish ranavirus (ESV), the first ranavirus isolated in Europe, which causes high mortality rates in infected sheatfish (Silurus glanis) and in other species. Analysis of the genome sequence shows that ESV belongs to the amphibian-like ranaviruses and is closely related to the epizootic hematopoietic necrosis virus (EHNV), a disease agent geographically confined to the Australian continent and notifiable to the World Organization for Animal Health.

  5. Complete Genome Sequence of the European Sheatfish Virus

    PubMed Central

    Mavian, Carla; López-Bueno, Alberto; Fernández Somalo, María Pilar; Alcamí, Antonio

    2012-01-01

    Viral diseases are an increasing threat to the thriving aquaculture industry worldwide. An emerging group of fish pathogens is formed by several ranaviruses, which have been isolated at different locations from freshwater and seawater fish species since 1985. We report the complete genome sequence of European sheatfish ranavirus (ESV), the first ranavirus isolated in Europe, which causes high mortality rates in infected sheatfish (Silurus glanis) and in other species. Analysis of the genome sequence shows that ESV belongs to the amphibian-like ranaviruses and is closely related to the epizootic hematopoietic necrosis virus (EHNV), a disease agent geographically confined to the Australian continent and notifiable to the World Organization for Animal Health. PMID:22570241

  6. Transcriptomic analysis of Ruditapes philippinarum hemocytes reveals cytoskeleton disruption after in vitro Vibrio tapetis challenge.

    PubMed

    Brulle, Franck; Jeffroy, Fanny; Madec, Stéphanie; Nicolas, Jean-Louis; Paillard, Christine

    2012-10-01

    The Manila clam, Ruditapes philippinarum, is an economically-important, commercial shellfish; harvests are diminished in some European waters by a pathogenic bacterium, Vibrio tapetis, that causes Brown Ring disease. To identify molecular characteristics associated with susceptibility or resistance to Brown Ring disease, Suppression Subtractive Hybridization (SSH) analyzes were performed to construct cDNA libraries enriched in up- or down-regulated transcripts from clam immune cells, hemocytes, after a 3-h in vitro challenge with cultured V. tapetis. Nine hundred and ninety eight sequences from the two libraries were sequenced, and an in silico analysis identified 235 unique genes. BLAST and "Gene ontology" classification analyzes revealed that 60.4% of the Expressed Sequence Tags (ESTs) have high similarities with genes involved in various physiological functions, such as immunity, apoptosis and cytoskeleton organization; whereas, 39.6% remain unidentified. From the 235 unique genes, we selected 22 candidates based upon physiological function and redundancy in the libraries. Then, Real-Time PCR analysis identified 3 genes related to cytoskeleton organization showing significant variation in expression attributable to V. tapetis exposure. Disruption in regulation of these genes is consistent with the etiologic agent of Brown Ring disease in Manila clams. Copyright © 2012 Elsevier Ltd. All rights reserved.

  7. Somatic hypermutation and antigen-driven selection of B cells are altered in autoimmune diseases.

    PubMed

    Zuckerman, Neta S; Hazanov, Helena; Barak, Michal; Edelman, Hanna; Hess, Shira; Shcolnik, Hadas; Dunn-Walters, Deborah; Mehr, Ramit

    2010-12-01

    B cells have been found to play a critical role in the pathogenesis of several autoimmune (AI) diseases. A common feature amongst many AI diseases is the formation of ectopic germinal centers (GC) within the afflicted tissue or organ, in which activated B cells expand and undergo somatic hypermutation (SHM) and antigen-driven selection on their immunoglobulin variable region (IgV) genes. However, it is not yet clear whether these processes occurring in ectopic GCs are identical to those in normal GCs. The analysis of IgV mutations has aided in revealing many aspects concerning B cell expansion, mutation and selection in GC reactions. We have applied several mutation analysis methods, based on lineage tree construction, to a large set of data, containing IgV productive and non-productive heavy and light chain sequences from several different tissues, to examine three of the most profoundly studied AI diseases - Rheumatoid Arthritis (RA), Multiple Sclerosis (MS) and Sjögren's Syndrome (SS). We have found that RA and MS sequences exhibited normal mutation spectra and targeting motifs, but a stricter selection compared to normal controls, which was more apparent in RA. SS sequence analysis results deviated from normal controls in both mutation spectra and indications of selection, also showing differences between light and heavy chain IgV and between different tissues. The differences revealed between AI diseases and normal control mutation patterns may result from the different microenvironmental influences to which ectopic GCs are exposed, relative to those in normal secondary lymphoid tissues. Copyright © 2010 Elsevier Ltd. All rights reserved.

  8. Analysis of the mitochondrial genome of cheetahs (Acinonyx jubatus) with neurodegenerative disease.

    PubMed

    Burger, Pamela A; Steinborn, Ralf; Walzer, Christian; Petit, Thierry; Mueller, Mathias; Schwarzenberger, Franz

    2004-08-18

    The complete mitochondrial genome of Acinonyx jubatus was sequenced and mitochondrial DNA (mtDNA) regions were screened for polymorphisms as candidates for the cause of a neurodegenerative demyelinating disease affecting captive cheetahs. The mtDNA reference sequences were established on the basis of the complete sequences of two diseased and two nondiseased animals as well as partial sequences of 26 further individuals. The A. jubatus mitochondrial genome is 17,047-bp long and shows a high sequence similarity (91%) to the domestic cat. Based on single nucleotide polymorphisms (SNPs) in the control region (CR) and pedigree information, the 18 myelopathic and 12 non-myelopathic cheetahs included in this study were classified into haplotypes I, II and III. In view of the phenotypic comparability of the neurodegenerative disease observed in cheetahs and human mtDNA-associated diseases, specific coding regions including the tRNAs leucine UUR, lysine, serine UCN, and partial complex I and V sequences were screened. We identified a heteroplasmic and a homoplasmic SNP at codon 507 in the subunit 5 (MTND5) of complex I. The heteroplasmic haplotype I-specific valine to methionine substitution represents a nonconservative amino acid change and was found in 11 myelopathic and eight non-myelopathic cheetahs with levels ranging from 29% to 79%. The homoplasmic conservative amino acid substitution valine to alanine was identified in two myelopathic animals of haplotype II. In addition, a synonymous SNP in the codon 76 of the MTND4L gene was found in the single haplotype III animal. The amino acid exchanges in the MTND5 gene were not associated with the occurrence of neurodegenerative disease in captive cheetahs.

  9. Molecular Characterization of Watermelon Chlorotic Stunt Virus (WmCSV) from Palestine

    PubMed Central

    Ali-Shtayeh, Mohammed S.; Jamous, Rana M.; Mallah, Omar B.; Abu-Zeitoun, Salam Y.

    2014-01-01

    The incidence of watermelon chlorotic stunt disease and molecular characterization of the Palestinian isolate of Watermelon chlorotic stunt virus (WmCSV-[PAL]) are described in this study. Symptomatic leaf samples obtained from watermelon Citrullus lanatus (Thunb.), and cucumber (Cucumis sativus L.) plants were tested for WmCSV-[PAL] infection by polymerase chain reaction (PCR) and Rolling Circle Amplification (RCA). Disease incidence ranged between 25%–98% in watermelon fields in the studied area, 77% of leaf samples collected from Jenin were found to be mixed infected with WmCSV-[PAL] and SLCV. The full-length DNA-A and DNA-B genomes of WmCSV-[PAL] were amplified and sequenced, and the sequences were deposited in the GenBank. Sequence analysis of virus genomes showed that DNA-A and DNA-B had 97.6%–99.42% and 93.16%–98.26% nucleotide identity with other virus isolates in the region, respectively. Sequence analysis also revealed that the Palestinian isolate of WmCSV shared the highest nucleotide identity with an isolate from Israel suggesting that the virus was introduced to Palestine from Israel. PMID:24956181

  10. Indel variant analysis of short-read sequencing data with Scalpel

    PubMed Central

    Fang, Han; Bergmann, Ewa A; Arora, Kanika; Vacic, Vladimir; Zody, Michael C; Iossifov, Ivan; O’Rawe, Jason A; Wu, Yiyang; Barron, Laura T Jimenez; Rosenbaum, Julie; Ronemus, Michael; Lee, Yoon-ha; Wang, Zihua; Dikoglu, Esra; Jobanputra, Vaidehi; Lyon, Gholson J; Wigler, Michael; Schatz, Michael C; Narzisi, Giuseppe

    2017-01-01

    As the second most common type of variation in the human genome, insertions and deletions (indels) have been linked to many diseases, but the discovery of indels of more than a few bases in size from short-read sequencing data remains challenging. Scalpel (http://scalpel.sourceforge.net) is an open-source software for reliable indel detection based on the microassembly technique. It has been successfully used to discover mutations in novel candidate genes for autism, and it is extensively used in other large-scale studies of human diseases. This protocol gives an overview of the algorithm and describes how to use Scalpel to perform highly accurate indel calling from whole-genome and whole-exome sequencing data. We provide detailed instructions for an exemplary family-based de novo study, but we also characterize the other two supported modes of operation: single-sample and somatic analysis. Indel normalization, visualization and annotation of the mutations are also illustrated. Using a standard server, indel discovery and characterization in the exonic regions of the example sequencing data can be completed in ~5 h after read mapping. PMID:27854363

  11. Exome Sequencing Analysis Reveals Variants in Primary Immunodeficiency Genes in Patients With Very Early Onset Inflammatory Bowel Disease

    PubMed Central

    Kelsen, Judith R.; Dawany, Noor; Moran, Christopher J.; Petersen, Britt-Sabina; Sarmady, Mahdi; Sasson, Ariella; Pauly-Hubbard, Helen; Martinez, Alejandro; Maurer, Kelly; Soong, Joanne; Rappaport, Eric; Franke, Andre; Keller, Andreas; Winter, Harland S.; Mamula, Petar; Piccoli, David; Artis, David; Sonnenberg, Gregory F.; Daly, Mark; Sullivan, Kathleen E.; Baldassano, Robert N.; Devoto, Marcella

    2016-01-01

    Background & Aims Very early onset inflammatory bowel disease (VEO-IBD), IBD diagnosed ≤5 y of age, frequently presents with a different and more severe phenotype than older-onset IBD. We investigated whether patients with VEO-IBD carry rare or novel variants in genes associated with immunodeficiencies that might contribute to disease development. Methods Patients with VEO-IBD and parents (when available) were recruited from the Children's Hospital of Philadelphia from March 2013 through July 2014. We analyzed DNA from 125 patients with VEO-IBD (ages 3 weeks to 4 y) and 19 parents, 4 of whom also had IBD. Exome capture was performed by Agilent SureSelect V4, and sequencing was performed using the Illumina HiSeq platform. Alignment to human genome GRCh37 was achieved followed by post-processing and variant calling. Following functional annotation, candidate variants were analyzed for change in protein function, minor allele frequency <0.1%, and scaled combined annotation dependent depletion scores ≤10. We focused on genes associated with primary immunodeficiencies and related pathways. An additional 210 exome samples from patients with pediatric IBD (n=45) or adult-onset Crohn's disease (n=20) and healthy individuals (controls, n=145) were obtained from the University of Kiel, Germany and used as control groups. Results Four-hundred genes and regions associated with primary immunodeficiency, covering approximately 6500 coding exons totaling > 1 Mbp of coding sequence, were selected from the whole exome data. Our analysis revealed novel and rare variants within these genes that could contribute to the development of VEO-IBD, including rare heterozygous missense variants in IL10RA and previously unidentified variants in MSH5 and CD19. Conclusions In an exome sequence analysis of patients with VEO-IBD and their parents, we identified variants in genes that regulate B- and T-cell functions and could contribute to pathogenesis. Our analysis could lead to the identification of previously unidentified IBD-associated variants. PMID:26193622

  12. Chameleon sequences in neurodegenerative diseases.

    PubMed

    Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Salari, Ali

    2016-03-25

    Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases. Copyright © 2016 Elsevier Inc. All rights reserved.

  13. Chameleon sequences in neurodegenerative diseases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bahramali, Golnaz; Goliaei, Bahram, E-mail: goliaei@ut.ac.ir; Minuchehr, Zarrin, E-mail: minuchehr@nigeb.ac.ir

    2016-03-25

    Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to “helix to strand (HE)”, “helix tomore » coil (HC)” and “strand to coil (CE)” alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.« less

  14. Genome sequencing reveals loci under artificial selection that underlie disease phenotypes in the laboratory rat.

    PubMed

    Atanur, Santosh S; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R; Kaisaki, Pamela J; Otto, Georg W; Ma, Man Chun John; Keane, Thomas M; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J

    2013-08-01

    Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Genome Sequencing Reveals Loci under Artificial Selection that Underlie Disease Phenotypes in the Laboratory Rat

    PubMed Central

    Atanur, Santosh S.; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R.; Kaisaki, Pamela J.; Otto, Georg W.; Ma, Man Chun John; Keane, Thomas M.; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R.; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J.; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J.

    2013-01-01

    Summary Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. PaperClip PMID:23890820

  16. Analysis of full-length sequences of two Citrus yellow mosaic badnavirus isolates infecting Citrus jambhiri (Rough Lemon) and Citrus sinensis L. Osbeck (Sweet Orange) from a nursery in India.

    PubMed

    Anthony Johnson, A M; Borah, B K; Sai Gopal, D V R; Dasgupta, I

    2012-12-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus is the causative agent of mosaic disease among Citrus species in southern India. Despite its reported prevalence in several citrus species, complete information on clear functional genomics or functional information of full-length genomes from all the CMBV isolates infecting citrus species are not available in publicly accessible databases. CMBV isolates from Rough Lemon and Sweet Orange collected from a nursery were cloned and sequenced. The analysis revealed high sequence homology of the two CMBV isolates with previously reported CMBV sequences implying that they represent new variants. Based on computational analysis of the predicted secondary structures, the possible functions of some CMBV proteins have been analyzed.

  17. Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.

    PubMed

    Doan, Ryan; Cohen, Noah D; Sawyer, Jason; Ghaffari, Noushin; Johnson, Charlie D; Dindot, Scott V

    2012-02-17

    The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.

  18. Newborn Sequencing in Genomic Medicine and Public Health

    PubMed Central

    Agrawal, Pankaj B.; Bailey, Donald B.; Beggs, Alan H.; Brenner, Steven E.; Brower, Amy M.; Cakici, Julie A.; Ceyhan-Birsoy, Ozge; Chan, Kee; Chen, Flavia; Currier, Robert J.; Dukhovny, Dmitry; Green, Robert C.; Harris-Wai, Julie; Holm, Ingrid A.; Iglesias, Brenda; Joseph, Galen; Kingsmore, Stephen F.; Koenig, Barbara A.; Kwok, Pui-Yan; Lantos, John; Leeder, Steven J.; Lewis, Megan A.; McGuire, Amy L.; Milko, Laura V.; Mooney, Sean D.; Parad, Richard B.; Pereira, Stacey; Petrikin, Joshua; Powell, Bradford C.; Powell, Cynthia M.; Puck, Jennifer M.; Rehm, Heidi L.; Risch, Neil; Roche, Myra; Shieh, Joseph T.; Veeraraghavan, Narayanan; Watson, Michael S.; Willig, Laurel; Yu, Timothy W.; Urv, Tiina; Wise, Anastasia L.

    2017-01-01

    The rapid development of genomic sequencing technologies has decreased the cost of genetic analysis to the extent that it seems plausible that genome-scale sequencing could have widespread availability in pediatric care. Genomic sequencing provides a powerful diagnostic modality for patients who manifest symptoms of monogenic disease and an opportunity to detect health conditions before their development. However, many technical, clinical, ethical, and societal challenges should be addressed before such technology is widely deployed in pediatric practice. This article provides an overview of the Newborn Sequencing in Genomic Medicine and Public Health Consortium, which is investigating the application of genome-scale sequencing in newborns for both diagnosis and screening. PMID:28096516

  19. Building a genome analysis pipeline to predict disease risk and prevent disease.

    PubMed

    Bromberg, Y

    2013-11-01

    Reduced costs and increased speed and accuracy of sequencing can bring the genome-based evaluation of individual disease risk to the bedside. While past efforts have identified a number of actionable mutations, the bulk of genetic risk remains hidden in sequence data. The biggest challenge facing genomic medicine today is the development of new techniques to predict the specifics of a given human phenome (set of all expressed phenotypes) encoded by each individual variome (full set of genome variants) in the context of the given environment. Numerous tools exist for the computational identification of the functional effects of a single variant. However, the pipelines taking advantage of full genomic, exomic, transcriptomic (and other) sequences have only recently become a reality. This review looks at the building of methodologies for predicting "variome"-defined disease risk. It also discusses some of the challenges for incorporating such a pipeline into everyday medical practice. © 2013. Published by Elsevier Ltd. All rights reserved.

  20. Retinitis Pigmentosa with EYS Mutations Is the Most Prevalent Inherited Retinal Dystrophy in Japanese Populations.

    PubMed

    Arai, Yuuki; Maeda, Akiko; Hirami, Yasuhiko; Ishigami, Chie; Kosugi, Shinji; Mandai, Michiko; Kurimoto, Yasuo; Takahashi, Masayo

    2015-01-01

    The aim of this study was to gain information about disease prevalence and to identify the responsible genes for inherited retinal dystrophies (IRD) in Japanese populations. Clinical and molecular evaluations were performed on 349 patients with IRD. For segregation analyses, 63 of their family members were employed. Bioinformatics data from 1,208 Japanese individuals were used as controls. Molecular diagnosis was obtained by direct sequencing in a stepwise fashion utilizing one or two panels of 15 and 27 genes for retinitis pigmentosa patients. If a specific clinical diagnosis was suspected, direct sequencing of disease-specific genes, that is, ABCA4 for Stargardt disease, was conducted. Limited availability of intrafamily information and decreasing family size hampered identifying inherited patterns. Differential disease profiles with lower prevalence of Stargardt disease from European and North American populations were obtained. We found 205 sequence variants in 159 of 349 probands with an identification rate of 45.6%. This study found 43 novel sequence variants. In silico analysis suggests that 20 of 25 novel missense variants are pathogenic. EYS mutations had the highest prevalence at 23.5%. c.4957_4958insA and c.8868C>A were the two major EYS mutations identified in this cohort. EYS mutations are the most prevalent among Japanese patients with IRD.

  1. High‑throughput sequencing analyses of oral microbial diversity in healthy people and patients with dental caries and periodontal disease.

    PubMed

    Chen, Tingtao; Shi, Yan; Wang, Xiaolei; Wang, Xin; Meng, Fanjing; Yang, Shaoguo; Yang, Jian; Xin, Hongbo

    2017-07-01

    Recurrence of oral diseases caused by antibiotics has brought about an urgent requirement to explore the oral microbial diversity in the human oral cavity. In the present study, the high‑throughput sequencing method was adopted to compare the microbial diversity of healthy people and oral patients and sequence analysis was performed by UPARSE software package. The Venn results indicated that a mean of 315 operational taxonomic units (OTUs) was obtained, and 73, 64, 53, 19 and 18 common OTUs belonging to Firmicutes, Bacteroidetes, Proteobacteria, Actinobacteria and Fusobacteria, respectively, were identified in healthy people. Moreover, the reduction of Firmicutes and the increase of Proteobacteria in the children group, and the increase of Firmicutes and the reduction of Proteobacteria in the youth and adult groups, indicated that the age bracket and oral disease had largely influenced the tooth development and microbial development in the oral cavity. In addition, the traditional 'pathogenic bacteria' of Firmicutes, Proteobacteria and Bacteroidetes (accounted for >95% of the total sequencing number in each group) indicated that the 'harmful' bacteria may exert beneficial effects on oral health. Therefore, the data will provide certain clues for curing some oral diseases by the strategy of adjusting the disturbed microbial compositions in oral disease to healthy level.

  2. Protein Sequencing with Tandem Mass Spectrometry

    NASA Astrophysics Data System (ADS)

    Ziady, Assem G.; Kinter, Michael

    The recent introduction of electrospray ionization techniques that are suitable for peptides and whole proteins has allowed for the design of mass spectrometric protocols that provide accurate sequence information for proteins. The advantages gained by these approaches over traditional Edman Degradation sequencing include faster analysis and femtomole, sometimes attomole, sensitivity. The ability to efficiently identify proteins has allowed investigators to conduct studies on their differential expression or modification in response to various treatments or disease states. In this chapter, we discuss the use of electrospray tandem mass spectrometry, a technique whereby protein-derived peptides are subjected to fragmentation in the gas phase, revealing sequence information for the protein. This powerful technique has been instrumental for the study of proteins and markers associated with various disorders, including heart disease, cancer, and cystic fibrosis. We use the study of protein expression in cystic fibrosis as an example.

  3. A comparative molecular characterization of AMDV strains isolated from cases of clinical and subclinical infection.

    PubMed

    Kowalczyk, Marek; Jakubczak, Andrzej; Horecka, Beata; Kostro, Krzysztof

    2018-05-29

    The Aleutian mink disease virus (AMDV) is one of the most serious threats to modern mink breeding. The disease can have various courses, from progressive to subclinical infections. The objective of the study was to provide a comparative molecular characterization of isolates of AMDV from farms with a clinical and subclinical course of the disease. The qPCR analysis showed a difference of two orders of magnitude between the number of copies of the viral DNA on the farm with the clinical course of the disease (10 5 ) and the farm with the subclinical course (10 3 ). The sequencing results confirm a high level of homogeneity within each farm and variation between them. The phylogenetic analysis indicates that the variants belonging to different farms are closely related and occupy different branches of the same clade. The in silico analysis of the effect of differences in the sequence encoding the VP2 protein between the farms revealed no effect of the polymorphism on its functionality. The close phylogenetic relationship between the isolates from the two farms, the synonymous nature of most of the polymorphisms and the potentially minor effect on the functionality of the protein indicate that the differences in the clinical picture may be due not only to polymorphisms in the nucleotide and amino acid sequences, but also to the stage of infection on the farm and the degree of stabilization of the pathogen-host relationship.

  4. Identification of a monopartite begomovirus associated with yellow vein mosaic of Mentha longifolia in Saudi Arabia.

    PubMed

    Sohrab, Sayed Sartaj; Daur, Ihsanullah

    2018-02-01

    Mentha is a very important crop grown and used extensively for many purposes in the Kingdom of Saudi Arabia. Begomoviruses are whitefly-transmitted viruses causing serious disease in many important plants exhibiting variable symptoms with significant economic loss globally. During farmers' field survey, yellow vein mosaic disease was observed in Mentha longifolia plants growing near tomato fields in Saudi Arabia. The causative agent was identified in 11 out of 19 samples using begomovirus-specific primers and the association of begomovirus with yellow vein mosaic disease in M. longifolia was confirmed. The full-length viral genome and betasatellite were amplified, cloned, and sequenced bidirectionally. The full DNA-A genome was found to have 2785 nucleotides with 1365 bp-associated betasatellite molecule. An attempt was made to amplify DNA-B, but none of the samples produced any positive amplicon of expected size which indicated the presence of monopartite begomovirus. The sequence identity matrix and phylogenetic analysis, based on full genome showed the highest identity (99.6%) with Tomato yellow leaf curl virus (TYLCV) and in phylogenetic analysis it formed a closed cluster with Tomato leaf curl virus infecting tomato and Corchorus crop in Saudi Arabia. The sequence analysis results of betasatellites showed the highest identity (98.9%) with Tomato yellow leaf curl betasatellites infecting tomato and phylogenetic analysis using betasatellites formed a close cluster with Tomato yellow leaf curl betasatellites infecting tomato and Corchorus crops, which has already been reported to cause yellow vein mosaic and leaf curl disease in many cultivated and weed crops growing in Saudi Arabia. The identified begomovirus associated with yellow vein mosaic disease in mentha could be a mutated strain of TYLCV and tentatively designated as TYLCV-Mentha isolate. Based on published data and latest information, this is the first report of identification of Tomato yellow leaf curl virus associated with yellow vein mosaic disease of M. longifolia from Saudi Arabia.

  5. Identical mitochondrial somatic mutations unique to chronic periodontitis and coronary artery disease

    PubMed Central

    Pallavi, Tokala; Chandra, Rampalli Viswa; Reddy, Aileni Amarender; Reddy, Bavigadda Harish; Naveen, Anumala

    2016-01-01

    Context: The inflammatory processes involved in chronic periodontitis and coronary artery diseases (CADs) are similar and produce reactive oxygen species that may result in similar somatic mutations in mitochondrial deoxyribonucleic acid (mtDNA). Aims: The aims of the present study were to identify somatic mtDNA mutations in periodontal and cardiac tissues from subjects undergoing coronary artery bypass surgery and determine what fraction was identical and unique to these tissues. Settings and Design: The study population consisted of 30 chronic periodontitis subjects who underwent coronary artery surgery after an angiogram had indicated CAD. Materials and Methods: Gingival tissue samples were taken from the site with deepest probing depth; coronary artery tissue samples were taken during the coronary artery bypass grafting procedures, and blood samples were drawn during this surgical procedure. These samples were stored under aseptic conditions and later transported for mtDNA analysis. Statistical Analysis Used: Complete mtDNA sequences were obtained and aligned with the revised Cambridge reference sequence (NC_012920) using sequence analysis and auto assembler tools. Results: Among the complete mtDNA sequences, a total of 162 variations were spread across the whole mitochondrial genome and present only in the coronary artery and the gingival tissue samples but not in the blood samples. Among the 162 variations, 12 were novel and four of the 12 novel variations were found in mitochondrial NADH dehydrogenase subunit 5 complex I gene (33.3%). Conclusions: Analysis of mtDNA mutations indicated 162 variants unique to periodontitis and CAD. Of these, 12 were novel and may have resulted from destructive oxidative forces common to these two diseases. PMID:27041832

  6. Insights Into the Etiology of Polerovirus-Induced Pepper Yellows Disease.

    PubMed

    Lotos, Leonidas; Olmos, Antonio; Orfanidou, Chrysoula; Efthimiou, Konstantinos; Avgelis, Apostolos; Katis, Nikolaos I; Maliogka, Varvara I

    2017-12-01

    The study of an emerging yellows disease of pepper crops (pepper yellows disease [PYD]) in Greece led to the identification of a polerovirus closely related to Pepper vein yellows virus (PeVYV). Recovery of its full genome sequence by next-generation sequencing of small interfering RNAs allowed its characterization as a new poleroviruses, which was provisionally named Pepper yellows virus (PeYV). Transmission experiments revealed its association with the disease. Sequence similarity and phylogenetic analysis highlighted the common ancestry of the three poleroviruses (PeVYV, PeYV, and Pepper yellow leaf curl virus [PYLCV]) currently reported to be associated with PYD, even though significant genetic differences were identified among them, especially in the C-terminal region of P5 and the 3' noncoding region. Most of the differences observed can be attributed to a modular type of evolution, which produces mosaic-like variants giving rise to these different poleroviruses Overall, similar to other polerovirus-related diseases, PYD is caused by at least three species (PeVYV, PeYV, and PYLCV) belonging to this group of closely related pepper-infecting viruses.

  7. Pooled Sequencing of 531 Genes in Inflammatory Bowel Disease Identifies an Associated Rare Variant in BTNL2 and Implicates Other Immune Related Genes

    PubMed Central

    Prescott, Natalie J.; Lehne, Benjamin; Stone, Kristina; Lee, James C.; Taylor, Kirstin; Knight, Jo; Papouli, Efterpi; Mirza, Muddassar M.; Simpson, Michael A.; Spain, Sarah L.; Lu, Grace; Fraternali, Franca; Bumpstead, Suzannah J.; Gray, Emma; Amar, Ariella; Bye, Hannah; Green, Peter; Chung-Faye, Guy; Hayee, Bu’Hussain; Pollok, Richard; Satsangi, Jack; Parkes, Miles; Barrett, Jeffrey C.; Mansfield, John C.; Sanderson, Jeremy; Lewis, Cathryn M.; Weale, Michael E.; Schlitt, Thomas; Mathew, Christopher G.

    2015-01-01

    The contribution of rare coding sequence variants to genetic susceptibility in complex disorders is an important but unresolved question. Most studies thus far have investigated a limited number of genes from regions which contain common disease associated variants. Here we investigate this in inflammatory bowel disease by sequencing the exons and proximal promoters of 531 genes selected from both genome-wide association studies and pathway analysis in pooled DNA panels from 474 cases of Crohn’s disease and 480 controls. 80 variants with evidence of association in the sequencing experiment or with potential functional significance were selected for follow up genotyping in 6,507 IBD cases and 3,064 population controls. The top 5 disease associated variants were genotyped in an extension panel of 3,662 IBD cases and 3,639 controls, and tested for association in a combined analysis of 10,147 IBD cases and 7,008 controls. A rare coding variant p.G454C in the BTNL2 gene within the major histocompatibility complex was significantly associated with increased risk for IBD (p = 9.65x10−10, OR = 2.3[95% CI = 1.75–3.04]), but was independent of the known common associated CD and UC variants at this locus. Rare (<1%) and low frequency (1–5%) variants in 3 additional genes showed suggestive association (p<0.005) with either an increased risk (ARIH2 c.338-6C>T) or decreased risk (IL12B p.V298F, and NICN p.H191R) of IBD. These results provide additional insights into the involvement of the inhibition of T cell activation in the development of both sub-phenotypes of inflammatory bowel disease. We suggest that although rare coding variants may make a modest overall contribution to complex disease susceptibility, they can inform our understanding of the molecular pathways that contribute to pathogenesis. PMID:25671699

  8. Exome sequence analysis suggests genetic burden contributes to phenotypic variability and complex neuropathy

    PubMed Central

    Gonzaga-Jauregui, Claudia; Harel, Tamar; Gambin, Tomasz; Kousi, Maria; Griffin, Laurie B.; Francescatto, Ludmila; Ozes, Burcak; Karaca, Ender; Jhangiani, Shalini; Bainbridge, Matthew N.; Lawson, Kim S.; Pehlivan, Davut; Okamoto, Yuji; Withers, Marjorie; Mancias, Pedro; Slavotinek, Anne; Reitnauer, Pamela J; Goksungur, Meryem T.; Shy, Michael; Crawford, Thomas O.; Koenig, Michel; Willer, Jason; Flores, Brittany N.; Pediaditrakis, Igor; Us, Onder; Wiszniewski, Wojciech; Parman, Yesim; Antonellis, Anthony; Muzny, Donna M.; Katsanis, Nicholas; Battaloglu, Esra; Boerwinkle, Eric; Gibbs, Richard A.; Lupski, James R.

    2015-01-01

    Charcot-Marie-Tooth (CMT) disease is a clinically and genetically heterogeneous distal symmetric polyneuropathy. Whole-exome sequencing (WES) of 40 individuals from 37 unrelated families with CMT-like peripheral neuropathy refractory to molecular diagnosis identified apparent causal mutations in ~45% (17/37) of families. Three candidate disease genes are proposed, supported by a combination of genetic and in vivo studies. Aggregate analysis of mutation data revealed a significantly increased number of rare variants across 58 neuropathy associated genes in subjects versus controls; confirmed in a second ethnically discrete neuropathy cohort, suggesting mutation burden potentially contributes to phenotypic variability. Neuropathy genes shown to have highly penetrant Mendelizing variants (HMPVs) and implicated by burden in families were shown to interact genetically in a zebrafish assay exacerbating the phenotype established by the suppression of single genes. Our findings suggest that the combinatorial effect of rare variants contributes to disease burden and variable expressivity. PMID:26257172

  9. How Next-Generation Sequencing and Multiscale Data Analysis Will Transform Infectious Disease Management

    PubMed Central

    Pak, Theodore R.; Kasarskis, Andrew

    2015-01-01

    Recent reviews have examined the extent to which routine next-generation sequencing (NGS) on clinical specimens will improve the capabilities of clinical microbiology laboratories in the short term, but do not explore integrating NGS with clinical data from electronic medical records (EMRs), immune profiling data, and other rich datasets to create multiscale predictive models. This review introduces a range of “omics” and patient data sources relevant to managing infections and proposes 3 potentially disruptive applications for these data in the clinical workflow. The combined threats of healthcare-associated infections and multidrug-resistant organisms may be addressed by multiscale analysis of NGS and EMR data that is ideally updated and refined over time within each healthcare organization. Such data and analysis should form the cornerstone of future learning health systems for infectious disease. PMID:26251049

  10. High Resolution Melt analysis for mutation screening in PKD1 and PKD2

    PubMed Central

    2011-01-01

    Background Autosomal dominant polycystic kidney disease (ADPKD) is the most common hereditary kidney disorder. It is characterized by focal development and progressive enlargement of renal cysts leading to end-stage renal disease. PKD1 and PKD2 have been implicated in ADPKD pathogenesis but genetic features and the size of PKD1 make genetic diagnosis tedious. Methods We aim to prove that high resolution melt analysis (HRM), a recent technique in molecular biology, can facilitate molecular diagnosis of ADPKD. We screened for mutations in PKD1 and PKD2 with HRM in 37 unrelated patients with ADPKD. Results We identified 440 sequence variants in the 37 patients. One hundred and thirty eight were different. We found 28 pathogenic mutations (25 in PKD1 and 3 in PKD2 ) within 28 different patients, which is a diagnosis rate of 75% consistent with literature mean direct sequencing diagnosis rate. We describe 52 new sequence variants in PKD1 and two in PKD2. Conclusion HRM analysis is a sensitive and specific method for molecular diagnosis of ADPKD. HRM analysis is also costless and time sparing. Thus, this method is efficient and might be used for mutation pre-screening in ADPKD genes. PMID:22008521

  11. A Phylogenetic and Phenotypic Analysis of Salmonella enterica Serovar Weltevreden, an Emerging Agent of Diarrheal Disease in Tropical Regions

    PubMed Central

    Makendi, Carine; Page, Andrew J.; Wren, Brendan W.; Le Thi Phuong, Tu; Clare, Simon; Hale, Christine; Goulding, David; Klemm, Elizabeth J.; Pickard, Derek; Okoro, Chinyere; Hunt, Martin; Thompson, Corinne N.; Phu Huong Lan, Nguyen; Tran Do Hoang, Nhu; Thwaites, Guy E.; Le Hello, Simon; Brisabois, Anne; Weill, François-Xavier; Baker, Stephen; Dougan, Gordon

    2016-01-01

    Salmonella enterica serovar Weltevreden (S. Weltevreden) is an emerging cause of diarrheal and invasive disease in humans residing in tropical regions. Despite the regional and international emergence of this Salmonella serovar, relatively little is known about its genetic diversity, genomics or virulence potential in model systems. Here we used whole genome sequencing and bioinformatics analyses to define the phylogenetic structure of a diverse global selection of S. Weltevreden. Phylogenetic analysis of more than 100 isolates demonstrated that the population of S. Weltevreden can be segregated into two main phylogenetic clusters, one associated predominantly with continental Southeast Asia and the other more internationally dispersed. Subcluster analysis suggested the local evolution of S. Weltevreden within specific geographical regions. Four of the isolates were sequenced using long read sequencing to produce high quality reference genomes. Phenotypic analysis in Hep-2 cells and in a murine infection model indicated that S. Weltevreden were significantly attenuated in these models compared to the classical S. Typhimurium reference strain SL1344. Our work outlines novel insights into this important emerging pathogen and provides a baseline understanding for future research studies. PMID:26867150

  12. Sarcococca blight: Use of whole genome sequencing as a strategy for fungal disease diagnosis

    USDA-ARS?s Scientific Manuscript database

    Early and accurate diagnosis of new plant pathogens is vital for the rapid implementation of effective mitigation strategies and appropriate regulatory responses. Most commonly, pathogen identification relies on morphology and DNA marker analysis. However, for new diseases, these approaches may not...

  13. Multi-loci diagnosis of acute lymphoblastic leukaemia with high-throughput sequencing and bioinformatics analysis.

    PubMed

    Ferret, Yann; Caillault, Aurélie; Sebda, Shéhérazade; Duez, Marc; Grardel, Nathalie; Duployez, Nicolas; Villenet, Céline; Figeac, Martin; Preudhomme, Claude; Salson, Mikaël; Giraud, Mathieu

    2016-05-01

    High-throughput sequencing (HTS) is considered a technical revolution that has improved our knowledge of lymphoid and autoimmune diseases, changing our approach to leukaemia both at diagnosis and during follow-up. As part of an immunoglobulin/T cell receptor-based minimal residual disease (MRD) assessment of acute lymphoblastic leukaemia patients, we assessed the performance and feasibility of the replacement of the first steps of the approach based on DNA isolation and Sanger sequencing, using a HTS protocol combined with bioinformatics analysis and visualization using the Vidjil software. We prospectively analysed the diagnostic and relapse samples of 34 paediatric patients, thus identifying 125 leukaemic clones with recombinations on multiple loci (TRG, TRD, IGH and IGK), including Dd2/Dd3 and Intron/KDE rearrangements. Sequencing failures were halved (14% vs. 34%, P = 0.0007), enabling more patients to be monitored. Furthermore, more markers per patient could be monitored, reducing the probability of false negative MRD results. The whole analysis, from sample receipt to clinical validation, was shorter than our current diagnostic protocol, with equal resources. V(D)J recombination was successfully assigned by the software, even for unusual recombinations. This study emphasizes the progress that HTS with adapted bioinformatics tools can bring to the diagnosis of leukaemia patients. © 2016 John Wiley & Sons Ltd.

  14. Single-Exome sequencing identified a novel RP2 mutation in a child with X-linked retinitis pigmentosa.

    PubMed

    Lim, Hassol; Park, Young-Mi; Lee, Jong-Keuk; Taek Lim, Hyun

    2016-10-01

    To present an efficient and successful application of a single-exome sequencing study in a family clinically diagnosed with X-linked retinitis pigmentosa. Exome sequencing study based on clinical examination data. An 8-year-old proband and his family. The proband and his family members underwent comprehensive ophthalmologic examinations. Exome sequencing was undertaken in the proband using Agilent SureSelect Human All Exon Kit and Illumina HiSeq 2000 platform. Bioinformatic analysis used Illumina pipeline with Burrows-Wheeler Aligner-Genome Analysis Toolkit (BWA-GATK), followed by ANNOVAR to perform variant functional annotation. All variants passing filter criteria were validated by Sanger sequencing to confirm familial segregation. Analysis of exome sequence data identified a novel frameshift mutation in RP2 gene resulting in a premature stop codon (c.665delC, p.Pro222fsTer237). Sanger sequencing revealed this mutation co-segregated with the disease phenotype in the child's family. We identified a novel causative mutation in RP2 from a single proband's exome sequence data analysis. This study highlights the effectiveness of the whole-exome sequencing in the genetic diagnosis of X-linked retinitis pigmentosa, over the conventional sequencing methods. Even using a single exome, exome sequencing technology would be able to pinpoint pathogenic variant(s) for X-linked retinitis pigmentosa, when properly applied with aid of adequate variant filtering strategy. Copyright © 2016 Canadian Ophthalmological Society. Published by Elsevier Inc. All rights reserved.

  15. Phytoplasma phylogenetics based on analysis of secA and 23S rRNA gene sequences for improved resolution of candidate species of 'Candidatus Phytoplasma'.

    PubMed

    Hodgetts, Jennifer; Boonham, Neil; Mumford, Rick; Harrison, Nigel; Dickinson, Matthew

    2008-08-01

    Phytoplasma phylogenetics has focused primarily on sequences of the non-coding 16S rRNA gene and the 16S-23S rRNA intergenic spacer region (16-23S ISR), and primers that enable amplification of these regions from all phytoplasmas by PCR are well established. In this study, primers based on the secA gene have been developed into a semi-nested PCR assay that results in a sequence of the expected size (about 480 bp) from all 34 phytoplasmas examined, including strains representative of 12 16Sr groups. Phylogenetic analysis of secA gene sequences showed similar clustering of phytoplasmas when compared with clusters resolved by similar sequence analyses of a 16-23S ISR-23S rRNA gene contig or of the 16S rRNA gene alone. The main differences between trees were in the branch lengths, which were elongated in the 16-23S ISR-23S rRNA gene tree when compared with the 16S rRNA gene tree and elongated still further in the secA gene tree, despite this being a shorter sequence. The improved resolution in the secA gene-derived phylogenetic tree resulted in the 16SrII group splitting into two distinct clusters, while phytoplasmas associated with coconut lethal yellowing-type diseases split into three distinct groups, thereby supporting past proposals that they represent different candidate species within 'Candidatus Phytoplasma'. The ability to differentiate 16Sr groups and subgroups by virtual RFLP analysis of secA gene sequences suggests that this gene may provide an informative alternative molecular marker for pathogen identification and diagnosis of phytoplasma diseases.

  16. Diagnosis of autosomal dominant polycystic kidney disease using efficient PKD1 and PKD2 targeted next-generation sequencing.

    PubMed

    Trujillano, Daniel; Bullich, Gemma; Ossowski, Stephan; Ballarín, José; Torra, Roser; Estivill, Xavier; Ars, Elisabet

    2014-09-01

    Molecular diagnostics of autosomal dominant polycystic kidney disease (ADPKD) relies on mutation screening of PKD1 and PKD2, which is complicated by extensive allelic heterogeneity and the presence of six highly homologous sequences of PKD1. To date, specific sequencing of PKD1 requires laborious long-range amplifications. The high cost and long turnaround time of PKD1 and PKD2 mutation analysis using conventional techniques limits its widespread application in clinical settings. We performed targeted next-generation sequencing (NGS) of PKD1 and PKD2. Pooled barcoded DNA patient libraries were enriched by in-solution hybridization with PKD1 and PKD2 capture probes. Bioinformatics analysis was performed using an in-house developed pipeline. We validated the assay in a cohort of 36 patients with previously known PKD1 and PKD2 mutations and five control individuals. Then, we used the same assay and bioinformatics analysis in a discovery cohort of 12 uncharacterized patients. We detected 35 out of 36 known definitely, highly likely, and likely pathogenic mutations in the validation cohort, including two large deletions. In the discovery cohort, we detected 11 different pathogenic mutations in 10 out of 12 patients. This study demonstrates that laborious long-range PCRs of the repeated PKD1 region can be avoided by in-solution enrichment of PKD1 and PKD2 and NGS. This strategy significantly reduces the cost and time for simultaneous PKD1 and PKD2 sequence analysis, facilitating routine genetic diagnostics of ADPKD.

  17. Identifying Group-Specific Sequences for Microbial Communities Using Long k-mer Sequence Signatures

    PubMed Central

    Wang, Ying; Fu, Lei; Ren, Jie; Yu, Zhaoxia; Chen, Ting; Sun, Fengzhu

    2018-01-01

    Comparing metagenomic samples is crucial for understanding microbial communities. For different groups of microbial communities, such as human gut metagenomic samples from patients with a certain disease and healthy controls, identifying group-specific sequences offers essential information for potential biomarker discovery. A sequence that is present, or rich, in one group, but absent, or scarce, in another group is considered “group-specific” in our study. Our main purpose is to discover group-specific sequence regions between control and case groups as disease-associated markers. We developed a long k-mer (k ≥ 30 bps)-based computational pipeline to detect group-specific sequences at strain resolution free from reference sequences, sequence alignments, and metagenome-wide de novo assembly. We called our method MetaGO: Group-specific oligonucleotide analysis for metagenomic samples. An open-source pipeline on Apache Spark was developed with parallel computing. We applied MetaGO to one simulated and three real metagenomic datasets to evaluate the discriminative capability of identified group-specific markers. In the simulated dataset, 99.11% of group-specific logical 40-mers covered 98.89% disease-specific regions from the disease-associated strain. In addition, 97.90% of group-specific numerical 40-mers covered 99.61 and 96.39% of differentially abundant genome and regions between two groups, respectively. For a large-scale metagenomic liver cirrhosis (LC)-associated dataset, we identified 37,647 group-specific 40-mer features. Any one of the features can predict disease status of the training samples with the average of sensitivity and specificity higher than 0.8. The random forests classification using the top 10 group-specific features yielded a higher AUC (from ∼0.8 to ∼0.9) than that of previous studies. All group-specific 40-mers were present in LC patients, but not healthy controls. All the assembled 11 LC-specific sequences can be mapped to two strains of Veillonella parvula: UTDB1-3 and DSM2008. The experiments on the other two real datasets related to Inflammatory Bowel Disease and Type 2 Diabetes in Women consistently demonstrated that MetaGO achieved better prediction accuracy with fewer features compared to previous studies. The experiments showed that MetaGO is a powerful tool for identifying group-specific k-mers, which would be clinically applicable for disease prediction. MetaGO is available at https://github.com/VVsmileyx/MetaGO. PMID:29774017

  18. Improved diagonal queue medical image steganography using Chaos theory, LFSR, and Rabin cryptosystem.

    PubMed

    Jain, Mamta; Kumar, Anil; Choudhary, Rishabh Charan

    2017-06-01

    In this article, we have proposed an improved diagonal queue medical image steganography for patient secret medical data transmission using chaotic standard map, linear feedback shift register, and Rabin cryptosystem, for improvement of previous technique (Jain and Lenka in Springer Brain Inform 3:39-51, 2016). The proposed algorithm comprises four stages, generation of pseudo-random sequences (pseudo-random sequences are generated by linear feedback shift register and standard chaotic map), permutation and XORing using pseudo-random sequences, encryption using Rabin cryptosystem, and steganography using the improved diagonal queues. Security analysis has been carried out. Performance analysis is observed using MSE, PSNR, maximum embedding capacity, as well as by histogram analysis between various Brain disease stego and cover images.

  19. The Gap Procedure: for the identification of phylogenetic clusters in HIV-1 sequence data.

    PubMed

    Vrbik, Irene; Stephens, David A; Roger, Michel; Brenner, Bluma G

    2015-11-04

    In the context of infectious disease, sequence clustering can be used to provide important insights into the dynamics of transmission. Cluster analysis is usually performed using a phylogenetic approach whereby clusters are assigned on the basis of sufficiently small genetic distances and high bootstrap support (or posterior probabilities). The computational burden involved in this phylogenetic threshold approach is a major drawback, especially when a large number of sequences are being considered. In addition, this method requires a skilled user to specify the appropriate threshold values which may vary widely depending on the application. This paper presents the Gap Procedure, a distance-based clustering algorithm for the classification of DNA sequences sampled from individuals infected with the human immunodeficiency virus type 1 (HIV-1). Our heuristic algorithm bypasses the need for phylogenetic reconstruction, thereby supporting the quick analysis of large genetic data sets. Moreover, this fully automated procedure relies on data-driven gaps in sorted pairwise distances to infer clusters, thus no user-specified threshold values are required. The clustering results obtained by the Gap Procedure on both real and simulated data, closely agree with those found using the threshold approach, while only requiring a fraction of the time to complete the analysis. Apart from the dramatic gains in computational time, the Gap Procedure is highly effective in finding distinct groups of genetically similar sequences and obviates the need for subjective user-specified values. The clusters of genetically similar sequences returned by this procedure can be used to detect patterns in HIV-1 transmission and thereby aid in the prevention, treatment and containment of the disease.

  20. Norrie disease: linkage analysis using a 4.2-kb RFLP detected by a human ornithine aminotransferase cDNA probe.

    PubMed

    Ngo, J T; Bateman, J B; Cortessis, V; Sparkes, R S; Mohandas, T; Inana, G; Spence, M A

    1989-05-01

    Previous study has shown that the usual DNA marker for Norrie disease, the L1.28 probe which identifies the DXS7 locus, can recombine with the disease locus. In this study, we used a human ornithine aminotransferase (OAT) cDNA which detects OAT-related DNA sequences mapped to the same region on the X chromosome as that of the L1.28 probe to investigate the family with Norrie disease who exhibited the recombinational event. When genomic DNA from this family was digested with the PvuII restriction endonuclease, we found a restriction fragment length polymorphism (RFLP) of 4.2 kb in size. This fragment was absent in the affected males and cosegregated with the disease locus; we calculated a lod score of 0.602, at theta = 0.00. No deletion could be detected by chromosomal analysis or on Southern blots with other enzymes. These results suggest that one of the OAT-related sequences on the X chromosome may be in close proximity to the Norrie disease locus and represent the first report which indicates that the OAT cDNA may be useful for the identification of carrier status and/or prenatal diagnosis.

  1. The complete genome sequence of a new polerovirus in strawberry plants from eastern Canada showing strawberry decline symptoms.

    PubMed

    Xiang, Yu; Bernardy, Mike; Bhagwat, Basdeo; Wiersma, Paul A; DeYoung, Robyn; Bouthillier, Michel

    2015-02-01

    Strawberry decline disease, probably caused by synergistic reactions of mixed virus infections, threatens the North American strawberry industry. Deep sequencing of strawberry plant samples from eastern Canada resulted in the identification of a new virus genome resembling poleroviruses in sequence and genome structure. Phylogenetic analysis suggests that it is a new member of the genus Polerovirus, family Luteoviridae. The virus is tentatively named "strawberry polerovirus 1" (SPV1).

  2. Methylation pattern of fish lymphocystis disease virus DNA.

    PubMed

    Wagner, H; Simon, D; Werner, E; Gelderblom, H; Darai, C; Flügel, R M

    1985-03-01

    The content and distribution of 5-methylcytosine in DNA from fish lymphocystis disease virus was analyzed by high-pressure liquid chromatography, nearest-neighbor analysis, and with restriction endonucleases. We found that 22% of all C residues were methylated, including methylation of the following dinucleotide sequences: CpG to 75%, CpC to ca. 1%, and CpA to 2 to 5%. Comparison of relative digestion of viral DNA with MspI and HpaII indicated that CCGG sequences were almost completely methylated at the inner C. The degree of methylation of GCGC was much lower. The methylation pattern of fish lymphocystis disease virus DNA differed from that of the host cell DNA.

  3. Methylation pattern of fish lymphocystis disease virus DNA.

    PubMed Central

    Wagner, H; Simon, D; Werner, E; Gelderblom, H; Darai, C; Flügel, R M

    1985-01-01

    The content and distribution of 5-methylcytosine in DNA from fish lymphocystis disease virus was analyzed by high-pressure liquid chromatography, nearest-neighbor analysis, and with restriction endonucleases. We found that 22% of all C residues were methylated, including methylation of the following dinucleotide sequences: CpG to 75%, CpC to ca. 1%, and CpA to 2 to 5%. Comparison of relative digestion of viral DNA with MspI and HpaII indicated that CCGG sequences were almost completely methylated at the inner C. The degree of methylation of GCGC was much lower. The methylation pattern of fish lymphocystis disease virus DNA differed from that of the host cell DNA. Images PMID:3973962

  4. Genetic diversity and population structure of the endangered marsupial Sarcophilus harrisii (Tasmanian devil)

    PubMed Central

    Miller, Webb; Hayes, Vanessa M.; Ratan, Aakrosh; Petersen, Desiree C.; Wittekindt, Nicola E.; Miller, Jason; Walenz, Brian; Knight, James; Qi, Ji; Zhao, Fangqing; Wang, Qingyu; Bedoya-Reina, Oscar C.; Katiyar, Neerja; Tomsho, Lynn P.; Kasson, Lindsay McClellan; Hardie, Rae-Anne; Woodbridge, Paula; Tindall, Elizabeth A.; Bertelsen, Mads Frost; Dixon, Dale; Pyecroft, Stephen; Helgen, Kristofer M.; Lesk, Arthur M.; Pringle, Thomas H.; Patterson, Nick; Zhang, Yu; Kreiss, Alexandre; Woods, Gregory M.; Jones, Menna E.; Schuster, Stephan C.

    2011-01-01

    The Tasmanian devil (Sarcophilus harrisii) is threatened with extinction because of a contagious cancer known as Devil Facial Tumor Disease. The inability to mount an immune response and to reject these tumors might be caused by a lack of genetic diversity within a dwindling population. Here we report a whole-genome analysis of two animals originating from extreme northwest and southeast Tasmania, the maximal geographic spread, together with the genome from a tumor taken from one of them. A 3.3-Gb de novo assembly of the sequence data from two complementary next-generation sequencing platforms was used to identify 1 million polymorphic genomic positions, roughly one-quarter of the number observed between two genetically distant human genomes. Analysis of 14 complete mitochondrial genomes from current and museum specimens, as well as mitochondrial and nuclear SNP markers in 175 animals, suggests that the observed low genetic diversity in today's population preceded the Devil Facial Tumor Disease disease outbreak by at least 100 y. Using a genetically characterized breeding stock based on the genome sequence will enable preservation of the extant genetic diversity in future Tasmanian devil populations. PMID:21709235

  5. [Recombinant OspC identification and antigenicity detection from Borrelia burgdorferi PD91 in China].

    PubMed

    Chen, Jian; Wan, Kang-Lin

    2003-10-01

    To recombine OspC gene from Borrelia burgdorferi PD91 of China and expressed it in E. coli for early diagnosis of Lyme disease. The OspC gene was amplified from the genome of Borrelia burgdorferi PD91 strain by polymerase chain reaction and recombined with plasmid PET-11D. The recombinant plasmid PET-11D-OspC was identified with PCR, restriction endonuclease analysis and sequencing. The antigenicity was verified with Western Blot. OspC gene was cloned correctly into vector PET-11D. The resultant sequence was definitely different from the published sequence. The recombinant OspC seemed to have had strong antigenicity. The findings laid basis for the studies on early diagnosis of Lyme disease.

  6. Association analysis of rare variants near the APOE region with CSF and neuroimaging biomarkers of Alzheimer's disease.

    PubMed

    Nho, Kwangsik; Kim, Sungeun; Horgusluoglu, Emrin; Risacher, Shannon L; Shen, Li; Kim, Dokyoon; Lee, Seunggeun; Foroud, Tatiana; Shaw, Leslie M; Trojanowski, John Q; Aisen, Paul S; Petersen, Ronald C; Jack, Clifford R; Weiner, Michael W; Green, Robert C; Toga, Arthur W; Saykin, Andrew J

    2017-05-24

    The APOE ε4 allele is the most significant common genetic risk factor for late-onset Alzheimer's disease (LOAD). The region surrounding APOE on chromosome 19 has also shown consistent association with LOAD. However, no common variants in the region remain significant after adjusting for APOE genotype. We report a rare variant association analysis of genes in the vicinity of APOE with cerebrospinal fluid (CSF) and neuroimaging biomarkers of LOAD. Whole genome sequencing (WGS) was performed on 817 blood DNA samples from the Alzheimer's Disease Neuroimaging Initiative (ADNI). Sequence data from 757 non-Hispanic Caucasian participants was used in the present analysis. We extracted all rare variants (MAF (minor allele frequency) < 0.05) within a 312 kb window in APOE's vicinity encompassing 12 genes. We assessed CSF and neuroimaging (MRI and PET) biomarkers as LOAD-related quantitative endophenotypes. Gene-based analyses of rare variants were performed using the optimal Sequence Kernel Association Test (SKAT-O). A total of 3,334 rare variants (MAF < 0.05) were found within the APOE region. Among them, 72 rare non-synonymous variants were observed. Eight genes spanning the APOE region were significantly associated with CSF Aβ 1-42 (p < 1.0 × 10 -3 ). After controlling for APOE genotype and adjusting for multiple comparisons, 4 genes (CBLC, BCAM, APOE, and RELB) remained significant. Whole-brain surface-based analysis identified highly significant clusters associated with rare variants of CBLC in the temporal lobe region including the entorhinal cortex, as well as frontal lobe regions. Whole-brain voxel-wise analysis of amyloid PET identified significant clusters in the bilateral frontal and parietal lobes showing associations of rare variants of RELB with cortical amyloid burden. Rare variants within genes spanning the APOE region are significantly associated with LOAD-related CSF Aβ 1-42 and neuroimaging biomarkers after adjusting for APOE genotype. These findings warrant further investigation and illustrate the role of next generation sequencing and quantitative endophenotypes in assessing rare variants which may help explain missing heritability in AD and other complex diseases.

  7. Mutations in ABCR (ABCA4) in patients with Stargardt macular degeneration or cone-rod degeneration.

    PubMed

    Briggs, C E; Rucinski, D; Rosenfeld, P J; Hirose, T; Berson, E L; Dryja, T P

    2001-09-01

    To determine the spectrum of ABCR mutations associated with Stargardt macular degeneration and cone-rod degeneration (CRD). One hundred eighteen unrelated patients with recessive Stargardt macular degeneration and eight with recessive CRD were screened for mutations in ABCR (ABCA4) by single-strand conformation polymorphism analysis. Variants were characterized by direct genomic sequencing. Segregation analysis was performed on the families of 20 patients in whom at least two or more likely pathogenic sequence changes were identified. The authors found 77 sequence changes likely to be pathogenic: 21 null mutations (15 novel), 55 missense changes (26 novel), and one deletion of a consensus glycosylation site (also novel). Fifty-two patients with Stargardt macular degeneration (44% of those screened) and five with CRD each had two of these sequence changes or were homozygous for one of them. Segregation analyses in the families of 19 of these patients were informative and revealed that the index cases and all available affected siblings were compound heterozygotes or homozygotes. The authors found one instance of an apparently de novo mutation, Ile824Thr, in a patient. Thirty-seven (31%) of the 118 patients with Stargardt disease and one with CRD had only one likely pathogenic sequence change. Twenty-nine patients with Stargardt disease (25%) and two with CRD had no identified sequence changes. This report of 42 novel mutations brings the growing number of identified likely pathogenic sequence changes in ABCR to approximately 250.

  8. Comparative analysis of the Flavobacterium columnare genomovar I and II genomes

    USDA-ARS?s Scientific Manuscript database

    Columnaris disease caused by Gram-negative rod Flavobacterium columnare is one of the most common diseases of catfish. F. columnare is also a common problem in other cultured fish species worldwide. F. columnare has three major genomovars; we have sequenced a representative strain from genomovar I (...

  9. Effects of the HN gene c-terminal extensions on the Newcastle disease virus virulence

    USDA-ARS?s Scientific Manuscript database

    The hemagglutinin-neuraminidase (HN) of Newcastle disease virus (NDV) is a multifunctional protein that has receptor recognition, neuraminidase and fusion promotion activities. Sequence analysis revealed that the HN gene of many extremely low virulence NDV strains encodes a larger open reading frame...

  10. Border Disease Virus among Chamois, Spain

    PubMed Central

    Rosell, Rosa; Cabezón, Oscar; Mentaberre, Gregorio; Casas, Encarna; Velarde, Roser; Lavín, Santiago

    2009-01-01

    Approximately 3,000 Pyrenean chamois (Rupicapra pyrenaica pyrenaica) died in northeastern Spain during 2005–2007. Border disease virus infection was identified by reverse transcription–PCR and sequencing analysis. These results implicate this virus as the primary cause of death, similar to findings in the previous epizootic in 2001. PMID:19239761

  11. Genotype diversity of hepatitis C virus (HCV) in HCV-associated liver disease patients in Indonesia.

    PubMed

    Utama, Andi; Tania, Navessa Padma; Dhenni, Rama; Gani, Rino Alvani; Hasan, Irsan; Sanityoso, Andri; Lelosutan, Syafruddin A R; Martamala, Ruswhandi; Lesmana, Laurentius Adrianus; Sulaiman, Ali; Tai, Susan

    2010-09-01

    Hepatitis C virus (HCV) genotype distribution in Indonesia has been reported. However, the identification of HCV genotype was based on 5'-UTR or NS5B sequence. This study was aimed to observe HCV core sequence variation among HCV-associated liver disease patients in Jakarta, and to analyse the HCV genotype diversity based on the core sequence. Sixty-eight chronic hepatitis (CH), 48 liver cirrhosis (LC) and 34 hepatocellular carcinoma (HCC) were included in this study. HCV core variation was analysed by direct sequencing. Alignment of HCV core sequences demonstrated that the core sequence was relatively varied among the genotype. Indeed, 237 bases of the core sequence could classify the HCV subtype; however, 236 bases failed to differentiate several subtypes. Based on 237 bases of the core sequences, the HCV strains were classified into genotypes 1 (subtypes 1a, 1b and 1c), 2 (subtypes 2a, 2e and 2f) and 3 (subtypes 3a and 3k). The HCV 1b (47.3%) was the most prevalent, followed by subtypes 1c (18.7%), 3k (10.7%), 2a (10.0%), 1a (6.7%), 2e (5.3%), 2f (0.7%) and 3a (0.7%). HCV 1b was the most common in all patients, and the prevalence increased with the severity of liver disease (36.8% in CH, 54.2% in LC and 58.8% in HCC). These results were similar to a previous report based on NS5B sequence analysis. Hepatitis C virus core sequence (237 bases) could identify the HCV subtype and the prevalence of HCV subtype based on core sequence was similar to those based on the NS5B region.

  12. Molecular Genetics of the Usher Syndrome in Lebanon: Identification of 11 Novel Protein Truncating Mutations by Whole Exome Sequencing

    PubMed Central

    Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima

    2014-01-01

    Background Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Methods Whole exome sequencing followed by expanded familial validation by Sanger sequencing. Results We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Conclusion Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes. PMID:25211151

  13. Molecular genetics of the Usher syndrome in Lebanon: identification of 11 novel protein truncating mutations by whole exome sequencing.

    PubMed

    Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima

    2014-01-01

    Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Whole exome sequencing followed by expanded familial validation by Sanger sequencing. We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes.

  14. Whole-exome sequencing analysis of Waardenburg syndrome in a Chinese family.

    PubMed

    Chen, Dezhong; Zhao, Na; Wang, Jing; Li, Zhuoyu; Wu, Changxin; Fu, Jie; Xiao, Han

    2017-01-01

    Waardenburg syndrome (WS) is a dominantly inherited, genetically heterogeneous auditory-pigmentary syndrome characterized by non-progressive sensorineural hearing loss and iris discoloration. By whole-exome sequencing (WES), we identified a nonsense mutation (c.598C>T) in PAX3 gene, predicted to be disease causing by in silico analysis. This is the first report of genetically diagnosed case of WS PAX3 c.598C>T nonsense mutation in Chinese ethnic origin by WES and in silico functional prediction methods.

  15. Whole-exome sequencing analysis of Waardenburg syndrome in a Chinese family

    PubMed Central

    Chen, Dezhong; Zhao, Na; Wang, Jing; Li, Zhuoyu; Wu, Changxin; Fu, Jie; Xiao, Han

    2017-01-01

    Waardenburg syndrome (WS) is a dominantly inherited, genetically heterogeneous auditory-pigmentary syndrome characterized by non-progressive sensorineural hearing loss and iris discoloration. By whole-exome sequencing (WES), we identified a nonsense mutation (c.598C>T) in PAX3 gene, predicted to be disease causing by in silico analysis. This is the first report of genetically diagnosed case of WS PAX3 c.598C>T nonsense mutation in Chinese ethnic origin by WES and in silico functional prediction methods. PMID:28690861

  16. Structural analysis of key gap junction domains--Lessons from genome data and disease-linked mutants.

    PubMed

    Bai, Donglin

    2016-02-01

    A gap junction (GJ) channel is formed by docking of two GJ hemichannels and each of these hemichannels is a hexamer of connexins. All connexin genes have been identified in human, mouse, and rat genomes and their homologous genes in many other vertebrates are available in public databases. The protein sequences of these connexins align well with high sequence identity in the same connexin across different species. Domains in closely related connexins and several residues in all known connexins are also well-conserved. These conserved residues form signatures (also known as sequence logos) in these domains and are likely to play important biological functions. In this review, the sequence logos of individual connexins, groups of connexins with common ancestors, and all connexins are analyzed to visualize natural evolutionary variations and the hot spots for human disease-linked mutations. Several gap junction domains are homologous, likely forming similar structures essential for their function. The availability of a high resolution Cx26 GJ structure and the subsequently-derived homology structure models for other connexin GJ channels elevated our understanding of sequence logos at the three-dimensional GJ structure level, thus facilitating the understanding of how disease-linked connexin mutants might impair GJ structure and function. This knowledge will enable the design of complementary variants to rescue disease-linked mutants. Copyright © 2015 Elsevier Ltd. All rights reserved.

  17. Genomic Diversity and Evolution of the Lyssaviruses

    PubMed Central

    Delmas, Olivier; Holmes, Edward C.; Talbi, Chiraz; Larrous, Florence; Dacheux, Laurent; Bouchier, Christiane; Bourhy, Hervé

    2008-01-01

    Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as ‘Lagos Bat’. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses. PMID:18446239

  18. Analysis of protein-coding genetic variation in 60,706 humans.

    PubMed

    Lek, Monkol; Karczewski, Konrad J; Minikel, Eric V; Samocha, Kaitlin E; Banks, Eric; Fennell, Timothy; O'Donnell-Luria, Anne H; Ware, James S; Hill, Andrew J; Cummings, Beryl B; Tukiainen, Taru; Birnbaum, Daniel P; Kosmicki, Jack A; Duncan, Laramie E; Estrada, Karol; Zhao, Fengmei; Zou, James; Pierce-Hoffman, Emma; Berghout, Joanne; Cooper, David N; Deflaux, Nicole; DePristo, Mark; Do, Ron; Flannick, Jason; Fromer, Menachem; Gauthier, Laura; Goldstein, Jackie; Gupta, Namrata; Howrigan, Daniel; Kiezun, Adam; Kurki, Mitja I; Moonshine, Ami Levy; Natarajan, Pradeep; Orozco, Lorena; Peloso, Gina M; Poplin, Ryan; Rivas, Manuel A; Ruano-Rubio, Valentin; Rose, Samuel A; Ruderfer, Douglas M; Shakir, Khalid; Stenson, Peter D; Stevens, Christine; Thomas, Brett P; Tiao, Grace; Tusie-Luna, Maria T; Weisburd, Ben; Won, Hong-Hee; Yu, Dongmei; Altshuler, David M; Ardissino, Diego; Boehnke, Michael; Danesh, John; Donnelly, Stacey; Elosua, Roberto; Florez, Jose C; Gabriel, Stacey B; Getz, Gad; Glatt, Stephen J; Hultman, Christina M; Kathiresan, Sekar; Laakso, Markku; McCarroll, Steven; McCarthy, Mark I; McGovern, Dermot; McPherson, Ruth; Neale, Benjamin M; Palotie, Aarno; Purcell, Shaun M; Saleheen, Danish; Scharf, Jeremiah M; Sklar, Pamela; Sullivan, Patrick F; Tuomilehto, Jaakko; Tsuang, Ming T; Watkins, Hugh C; Wilson, James G; Daly, Mark J; MacArthur, Daniel G

    2016-08-18

    Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

  19. Clinical evaluation incorporating a personal genome

    PubMed Central

    Ashley, Euan A.; Butte, Atul J.; Wheeler, Matthew T.; Chen, Rong; Klein, Teri E.; Dewey, Frederick E.; Dudley, Joel T.; Ormond, Kelly E.; Pavlovic, Aleksandra; Hudgins, Louanne; Gong, Li; Hodges, Laura M.; Berlin, Dorit S.; Thorn, Caroline F.; Sangkuhl, Katrin; Hebert, Joan M.; Woon, Mark; Sagreiya, Hersh; Whaley, Ryan; Morgan, Alexander A.; Pushkarev, Dmitry; Neff, Norma F; Knowles, Joshua W.; Chou, Mike; Thakuria, Joseph; Rosenbaum, Abraham; Zaranek, Alexander Wait; Church, George; Greely, Henry T.; Quake, Stephen R.; Altman, Russ B.

    2010-01-01

    Background The cost of genomic information has fallen steeply but the path to clinical translation of risk estimates for common variants found in genome wide association studies remains unclear. Since the speed and cost of sequencing complete genomes is rapidly declining, more comprehensive means of analyzing these data in concert with rare variants for genetic risk assessment and individualisation of therapy are required. Here, we present the first integrated analysis of a complete human genome in a clinical context. Methods An individual with a family history of vascular disease and early sudden death was evaluated. Clinical assessment included risk prediction for coronary artery disease, screening for causes of sudden cardiac death, and genetic counselling. Genetic analysis included the development of novel methods for the integration of whole genome sequence data including 2.6 million single nucleotide polymorphisms and 752 copy number variations. The algorithm focused on predicting genetic risk of genes associated with known Mendelian disease, recognised drug responses, and pathogenicity for novel variants. In addition, since integration of risk ratios derived from case control studies is challenging, we estimated posterior probabilities from age and sex appropriate prior probability and likelihood ratios derived for each genotype. In addition, we developed a visualisation approach to account for gene-environment interactions and conditionally dependent risks. Findings We found increased genetic risk for myocardial infarction, type II diabetes and certain cancers. Rare variants in LPA are consistent with the family history of coronary artery disease. Pharmacogenomic analysis suggested a positive response to lipid lowering therapy, likely clopidogrel resistance, and a low initial dosing requirement for warfarin. Many variants of uncertain significance were reported. Interpretation Although challenges remain, our results suggest that whole genome sequencing can yield useful and clinically relevant information for individual patients, especially for those with a strong family history of significant disease. PMID:20435227

  20. Deep Sequencing in Infectious Diseases: Immune and Pathogen Repertoires for the Improvement of Patient Outcomes.

    PubMed

    Burkholder, William F; Newell, Evan W; Poidinger, Michael; Chen, Swaine; Fink, Katja

    2017-01-01

    The inaugural workshop "Deep Sequencing in Infectious Diseases: Immune and Pathogen Repertoires for the Improvement of Patient Outcomes" was held in Singapore on 13-14 October 2016. The aim of the workshop was to discuss the latest trends in using high-throughput sequencing, bioinformatics, and allied technologies to analyze immune and pathogen repertoires and their interplay within the host, bringing together key international players in the field and Singapore-based researchers and clinician-scientists. The focus was in particular on the application of these technologies for the improvement of patient diagnosis, prognosis and treatment, and for other broad public health outcomes. The presentations by scientists and clinicians showed the potential of deep sequencing technology to capture the coevolution of adaptive immunity and pathogens. For clinical applications, some key challenges remain, such as the long turnaround time and relatively high cost of deep sequencing for pathogen identification and characterization and the lack of international standardization in immune repertoire analysis.

  1. Sequence analysis of infectious pancreatic necrosis virus isolated from Iranian reared rainbow trout (Oncorhynchus mykiss) in 2012.

    PubMed

    Dadar, Maryam; Peyghan, Rahim; Memari, Hamid Rajabi; Shapouri, Masod Reza Seifi Abad; Hasanzadeh, Reza; Goudarzi, Laleh Moazzami; Vakharia, Vikram N

    2013-12-01

    Infectious pancreatic necrosis virus (IPNV) is the causal agent of a highly contagious disease that affects many species of fish and shellfish. This virus causes economically significant diseases of farmed rainbow trout, Oncorhynchus mykiss (Walbaum), in Iran, which is often associated with the transmission of pathogens from European resources. In this study, moribund rainbow trout fry samples were collected during an outbreak of IPNV in three different fish farms in north and west provinces of Iran in 2012; and we investigated the full genome sequence of Iranian IPNV and compared it with previously identified IPNV sequences. The sequences of different structural and nonstructural-protein genes were compared to those of other aquatic birnaviruses sequenced to date. Our results show that the Iranian isolate falls within genogroup 5, serotype A2 strain SP, having 99% identity with the strain 1146 from Spain. These results suggest that the Iranian isolate may have originated from Europe.

  2. Deep Sequencing in Infectious Diseases: Immune and Pathogen Repertoires for the Improvement of Patient Outcomes

    PubMed Central

    Burkholder, William F.; Newell, Evan W.; Poidinger, Michael; Chen, Swaine; Fink, Katja

    2017-01-01

    The inaugural workshop “Deep Sequencing in Infectious Diseases: Immune and Pathogen Repertoires for the Improvement of Patient Outcomes” was held in Singapore on 13–14 October 2016. The aim of the workshop was to discuss the latest trends in using high-throughput sequencing, bioinformatics, and allied technologies to analyze immune and pathogen repertoires and their interplay within the host, bringing together key international players in the field and Singapore-based researchers and clinician-scientists. The focus was in particular on the application of these technologies for the improvement of patient diagnosis, prognosis and treatment, and for other broad public health outcomes. The presentations by scientists and clinicians showed the potential of deep sequencing technology to capture the coevolution of adaptive immunity and pathogens. For clinical applications, some key challenges remain, such as the long turnaround time and relatively high cost of deep sequencing for pathogen identification and characterization and the lack of international standardization in immune repertoire analysis. PMID:28620372

  3. Novel USH2A compound heterozygous mutations cause RP/USH2 in a Chinese family.

    PubMed

    Liu, Xiaowen; Tang, Zhaohui; Li, Chang; Yang, Kangjuan; Gan, Guanqi; Zhang, Zibo; Liu, Jingyu; Jiang, Fagang; Wang, Qing; Liu, Mugen

    2010-03-17

    To identify the disease-causing gene in a four-generation Chinese family affected with retinitis pigmentosa (RP). Linkage analysis was performed with a panel of microsatellite markers flanking the candidate genetic loci of RP. These loci included 38 known RP genes. The complete coding region and exon-intron boundaries of Usher syndrome 2A (USH2A) were sequenced with the proband DNA to screen the disease-causing gene mutation. Restriction fragment length polymorphism (RFLP) analysis and direct DNA sequence analysis were done to demonstrate co-segregation of the USH2A mutations with the family disease. One hundred normal controls were used without the mutations. The disease-causing gene in this Chinese family was linked to the USH2A locus on chromosome 1q41. Direct DNA sequence analysis of USH2A identified two novel mutations in the patients: one missense mutation p.G1734R in exon 26 and a splice site mutation, IVS32+1G>A, which was found in the donor site of intron 32 of USH2A. Neither the p.G1734R nor the IVS32+1G>A mutation was found in the unaffected family members or the 100 normal controls. One patient with a homozygous mutation displayed only RP symptoms until now, while three patients with compound heterozygous mutations in the family of study showed both RP and hearing impairment. This study identified two novel mutations: p.G1734R and IVS32+1G>A of USH2A in a four-generation Chinese RP family. In this study, the heterozygous mutation and the homozygous mutation in USH2A may cause Usher syndrome Type II or RP, respectively. These two mutations expand the mutant spectrum of USH2A.

  4. Newborn Sequencing in Genomic Medicine and Public Health.

    PubMed

    Berg, Jonathan S; Agrawal, Pankaj B; Bailey, Donald B; Beggs, Alan H; Brenner, Steven E; Brower, Amy M; Cakici, Julie A; Ceyhan-Birsoy, Ozge; Chan, Kee; Chen, Flavia; Currier, Robert J; Dukhovny, Dmitry; Green, Robert C; Harris-Wai, Julie; Holm, Ingrid A; Iglesias, Brenda; Joseph, Galen; Kingsmore, Stephen F; Koenig, Barbara A; Kwok, Pui-Yan; Lantos, John; Leeder, Steven J; Lewis, Megan A; McGuire, Amy L; Milko, Laura V; Mooney, Sean D; Parad, Richard B; Pereira, Stacey; Petrikin, Joshua; Powell, Bradford C; Powell, Cynthia M; Puck, Jennifer M; Rehm, Heidi L; Risch, Neil; Roche, Myra; Shieh, Joseph T; Veeraraghavan, Narayanan; Watson, Michael S; Willig, Laurel; Yu, Timothy W; Urv, Tiina; Wise, Anastasia L

    2017-02-01

    The rapid development of genomic sequencing technologies has decreased the cost of genetic analysis to the extent that it seems plausible that genome-scale sequencing could have widespread availability in pediatric care. Genomic sequencing provides a powerful diagnostic modality for patients who manifest symptoms of monogenic disease and an opportunity to detect health conditions before their development. However, many technical, clinical, ethical, and societal challenges should be addressed before such technology is widely deployed in pediatric practice. This article provides an overview of the Newborn Sequencing in Genomic Medicine and Public Health Consortium, which is investigating the application of genome-scale sequencing in newborns for both diagnosis and screening. Copyright © 2017 by the American Academy of Pediatrics.

  5. Functional annotation of HOT regions in the human genome: implications for human disease and cancer

    PubMed Central

    Li, Hao; Chen, Hebing; Liu, Feng; Ren, Chao; Wang, Shengqi; Bo, Xiaochen; Shu, Wenjie

    2015-01-01

    Advances in genome-wide association studies (GWAS) and large-scale sequencing studies have resulted in an impressive and growing list of disease- and trait-associated genetic variants. Most studies have emphasised the discovery of genetic variation in coding sequences, however, the noncoding regulatory effects responsible for human disease and cancer biology have been substantially understudied. To better characterise the cis-regulatory effects of noncoding variation, we performed a comprehensive analysis of the genetic variants in HOT (high-occupancy target) regions, which are considered to be one of the most intriguing findings of recent large-scale sequencing studies. We observed that GWAS variants that map to HOT regions undergo a substantial net decrease and illustrate development-specific localisation during haematopoiesis. Additionally, genetic risk variants are disproportionally enriched in HOT regions compared with LOT (low-occupancy target) regions in both disease-relevant and cancer cells. Importantly, this enrichment is biased toward disease- or cancer-specific cell types. Furthermore, we observed that cancer cells generally acquire cancer-specific HOT regions at oncogenes through diverse mechanisms of cancer pathogenesis. Collectively, our findings demonstrate the key roles of HOT regions in human disease and cancer and represent a critical step toward further understanding disease biology, diagnosis, and therapy. PMID:26113264

  6. Functional annotation of HOT regions in the human genome: implications for human disease and cancer.

    PubMed

    Li, Hao; Chen, Hebing; Liu, Feng; Ren, Chao; Wang, Shengqi; Bo, Xiaochen; Shu, Wenjie

    2015-06-26

    Advances in genome-wide association studies (GWAS) and large-scale sequencing studies have resulted in an impressive and growing list of disease- and trait-associated genetic variants. Most studies have emphasised the discovery of genetic variation in coding sequences, however, the noncoding regulatory effects responsible for human disease and cancer biology have been substantially understudied. To better characterise the cis-regulatory effects of noncoding variation, we performed a comprehensive analysis of the genetic variants in HOT (high-occupancy target) regions, which are considered to be one of the most intriguing findings of recent large-scale sequencing studies. We observed that GWAS variants that map to HOT regions undergo a substantial net decrease and illustrate development-specific localisation during haematopoiesis. Additionally, genetic risk variants are disproportionally enriched in HOT regions compared with LOT (low-occupancy target) regions in both disease-relevant and cancer cells. Importantly, this enrichment is biased toward disease- or cancer-specific cell types. Furthermore, we observed that cancer cells generally acquire cancer-specific HOT regions at oncogenes through diverse mechanisms of cancer pathogenesis. Collectively, our findings demonstrate the key roles of HOT regions in human disease and cancer and represent a critical step toward further understanding disease biology, diagnosis, and therapy.

  7. Comprehensive molecular diagnosis of Epstein-Barr virus-associated lymphoproliferative diseases using next-generation sequencing.

    PubMed

    Ono, Shintaro; Nakayama, Manabu; Kanegane, Hirokazu; Hoshino, Akihiro; Shimodera, Saeko; Shibata, Hirofumi; Fujino, Hisanori; Fujino, Takahiro; Yunomae, Yuta; Okano, Tsubasa; Yamashita, Motoi; Yasumi, Takahiro; Izawa, Kazushi; Takagi, Masatoshi; Imai, Kohsuke; Zhang, Kejian; Marsh, Rebecca; Picard, Capucine; Latour, Sylvain; Ohara, Osamu; Morio, Tomohiro

    2018-05-18

    Epstein-Barr virus (EBV) is associated with several life-threatening diseases, such as lymphoproliferative disease (LPD), particularly in immunocompromised hosts. Some categories of primary immunodeficiency diseases (PIDs) including X-linked lymphoproliferative syndrome (XLP), are characterized by susceptibility and vulnerability to EBV infection. The number of genetically defined PIDs is rapidly increasing, and clinical genetic testing plays an important role in establishing a definitive diagnosis. Whole-exome sequencing is performed for diagnosing rare genetic diseases, but is both expensive and time-consuming. Low-cost, high-throughput gene analysis systems are thus necessary. We developed a comprehensive molecular diagnostic method using a two-step tailed polymerase chain reaction (PCR) and a next-generation sequencing (NGS) platform to detect mutations in 23 candidate genes responsible for XLP or XLP-like diseases. Samples from 19 patients suspected of having EBV-associated LPD were used in this comprehensive molecular diagnosis. Causative gene mutations (involving PRF1 and SH2D1A) were detected in two of the 19 patients studied. This comprehensive diagnosis method effectively detected mutations in all coding exons of 23 genes with sufficient read numbers for each amplicon. This comprehensive molecular diagnostic method using PCR and NGS provides a rapid, accurate, low-cost diagnosis for patients with XLP or XLP-like diseases.

  8. Rapid Whole-Genome Sequencing for Genetic Disease Diagnosis in Neonatal Intensive Care Units

    PubMed Central

    Saunders, Carol Jean; Miller, Neil Andrew; Soden, Sarah Elizabeth; Dinwiddie, Darrell Lee; Noll, Aaron; Alnadi, Noor Abu; Andraws, Nevene; Patterson, Melanie LeAnn; Krivohlavek, Lisa Ann; Fellis, Joel; Humphray, Sean; Saffrey, Peter; Kingsbury, Zoya; Weir, Jacqueline Claire; Betley, Jason; Grocock, Russell James; Margulies, Elliott Harrison; Farrow, Emily Gwendolyn; Artman, Michael; Safina, Nicole Pauline; Petrikin, Joshua Erin; Hall, Kevin Peter; Kingsmore, Stephen Francis

    2014-01-01

    Monogenic diseases are frequent causes of neonatal morbidity and mortality, and disease presentations are often undifferentiated at birth. More than 3500 monogenic diseases have been characterized, but clinical testing is available for only some of them and many feature clinical and genetic heterogeneity. Hence, an immense unmet need exists for improved molecular diagnosis in infants. Because disease progression is extremely rapid, albeit heterogeneous, in newborns, molecular diagnoses must occur quickly to be relevant for clinical decision-making. We describe 50-hour differential diagnosis of genetic disorders by whole-genome sequencing (WGS) that features automated bioinformatic analysis and is intended to be a prototype for use in neonatal intensive care units. Retrospective 50-hour WGS identified known molecular diagnoses in two children. Prospective WGS disclosed potential molecular diagnosis of a severe GJB2-related skin disease in one neonate; BRAT1-related lethal neonatal rigidity and multifocal seizure syndrome in another infant; identified BCL9L as a novel, recessive visceral heterotaxy gene (HTX6) in a pedigree; and ruled out known candidate genes in one infant. Sequencing of parents or affected siblings expedited the identification of disease genes in prospective cases. Thus, rapid WGS can potentially broaden and foreshorten differential diagnosis, resulting in fewer empirical treatments and faster progression to genetic and prognostic counseling. PMID:23035047

  9. Mycobacterium tuberculosis and whole genome sequencing: a practical guide and online tools available for the clinical microbiologist.

    PubMed

    Satta, G; Atzeni, A; McHugh, T D

    2017-02-01

    Whole genome sequencing (WGS) has the potential to revolutionize the diagnosis of Mycobacterium tuberculosis infection but the lack of bioinformatic expertise among clinical microbiologists is a barrier for adoption. Software products for analysis should be simple, free of charge, able to accept data directly from the sequencer (FASTQ files) and to provide the basic functionalities all-in-one. The main aim of this narrative review is to provide a practical guide for the clinical microbiologist, with little or no practical experience of WGS analysis, with a specific focus on software products tailor-made for M. tuberculosis analysis. With sequencing performed by an external provider, it is now feasible to implement WGS analysis in the routine clinical practice of any microbiology laboratory, with the potential to detect resistance weeks before traditional phenotypic culture methods, but the clinical microbiologist should be aware of the limitations of this approach. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.

  10. Native T1 reference values for nonischemic cardiomyopathies and populations with increased cardiovascular risk: A systematic review and meta‐analysis

    PubMed Central

    Slart, Riemer H.J.A.; Hulleman, Enzo V.; Dierckx, Rudi A.J.O.; Velthuis, Birgitta K.; van der Harst, Pim; Sosnovik, David E.; Borra, Ronald J.H.; Prakken, Niek H.J.

    2017-01-01

    Background Although cardiac MR and T1 mapping are increasingly used to diagnose diffuse fibrosis based cardiac diseases, studies reporting T1 values in healthy and diseased myocardium, particular in nonischemic cardiomyopathies (NICM) and populations with increased cardiovascular risk, seem contradictory. Purpose To determine the range of native myocardial T1 value ranges in patients with NICM and populations with increased cardiovascular risk. Study Type Systemic review and meta‐analysis. Population Patients with NICM, including hypertrophic cardiomyopathy (HCM) and dilated cardiomyopathy (DCM), and patients with myocarditis (MC), iron overload, amyloidosis, Fabry disease, and populations with hypertension (HT), diabetes mellitus (DM), and obesity. Field Strength/Sequence (Shortened) modified Look–Locker inversion‐recovery MR sequence at 1.5 or 3T. Assessment PubMed and Embase were searched following the PRISMA guidelines. Statistical Tests The summary of standard mean difference (SMD) between the diseased and a healthy control populations was generated using a random‐effects model in combination with meta‐regression analysis. Results The SMD for HCM, DCM, and MC patients were significantly increased (1.41, 1.48, and 1.96, respectively, P < 0.01) compared with healthy controls. The SMD for HT patients with and without left‐ventricle hypertrophy (LVH) together was significantly increased (0.19, P = 0.04), while for HT patients without LVH the SMD was zero (0.03, P = 0.52). The number of studies on amyloidosis, iron overload, Fabry disease, and HT patients with LVH did not meet the requirement to perform a meta‐analysis. However, most studies reported a significantly increased T1 for amyloidosis and HT patients with LVH and a significant decreased T1 for iron overload and Fabry disease patients. Data Conclusions Native T1 mapping by using an (Sh)MOLLI sequence can potentially assess myocardial changes in HCM, DCM, MC, iron overload, amyloidosis, and Fabry disease compared to controls. In addition, it can help to diagnose left‐ventricular remodeling in HT patients. Level of Evidence: 2 Technical Efficacy: Stage 3 J. Magn. Reson. Imaging 2018;47:891–912. PMID:29131444

  11. Application of industrial scale genomics to discovery of therapeutic targets in heart failure.

    PubMed

    Mehraban, F; Tomlinson, J E

    2001-12-01

    In recent years intense activity in both academic and industrial sectors has provided a wealth of information on the human genome with an associated impressive increase in the number of novel gene sequences deposited in sequence data repositories and patent applications. This genomic industrial revolution has transformed the way in which drug target discovery is now approached. In this article we discuss how various differential gene expression (DGE) technologies are being utilized for cardiovascular disease (CVD) drug target discovery. Other approaches such as sequencing cDNA from cardiovascular derived tissues and cells coupled with bioinformatic sequence analysis are used with the aim of identifying novel gene sequences that may be exploited towards target discovery. Additional leverage from gene sequence information is obtained through identification of polymorphisms that may confer disease susceptibility and/or affect drug responsiveness. Pharmacogenomic studies are described wherein gene expression-based techniques are used to evaluate drug response and/or efficacy. Industrial-scale genomics supports and addresses not only novel target gene discovery but also the burgeoning issues in pharmaceutical and clinical cardiovascular medicine relative to polymorphic gene responses.

  12. Human Genome Sequencing in Health and Disease

    PubMed Central

    Gonzaga-Jauregui, Claudia; Lupski, James R.; Gibbs, Richard A.

    2013-01-01

    Following the “finished,” euchromatic, haploid human reference genome sequence, the rapid development of novel, faster, and cheaper sequencing technologies is making possible the era of personalized human genomics. Personal diploid human genome sequences have been generated, and each has contributed to our better understanding of variation in the human genome. We have consequently begun to appreciate the vastness of individual genetic variation from single nucleotide to structural variants. Translation of genome-scale variation into medically useful information is, however, in its infancy. This review summarizes the initial steps undertaken in clinical implementation of personal genome information, and describes the application of whole-genome and exome sequencing to identify the cause of genetic diseases and to suggest adjuvant therapies. Better analysis tools and a deeper understanding of the biology of our genome are necessary in order to decipher, interpret, and optimize clinical utility of what the variation in the human genome can teach us. Personal genome sequencing may eventually become an instrument of common medical practice, providing information that assists in the formulation of a differential diagnosis. We outline herein some of the remaining challenges. PMID:22248320

  13. Immunoreactivity of polyclonal antibodies generated against the carboxy terminus of the predicted amino acid sequence of the Huntington disease gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Alkatib, G.; Graham, R.; Pelmear-Telenius, A.

    1994-09-01

    A cDNA fragment spanning the 3{prime}-end of the Huntington disease gene (from 8052 to 9252) was cloned into a prokaryotic expression vector containing the E. Coli lac promoter and a portion of the coding sequence for {beta}-galactosidase. The truncated {beta}-galactosidase gene was cleaved with BamHl and fused in frame to the BamHl fragment of the Huntington disease gene 3{prime}-end. Expression analysis of proteins made in E. Coli revealed that 20-30% of the total cellular proteins was represented by the {beta}-galactosidase-huntingtin fusion protein. The identity of the Huntington disease protein amino acid sequences was confirmed by protein sequence analysis. Affinity chromatographymore » was used to purify large quantities of the fusion protein from bacterial cell lysates. Affinity-purified proteins were used to immunize New Zealand white rabbits for antibody production. The generated polyclonal antibodies were used to immunoprecipitate the Huntington disease gene product expressed in a neuroblastoma cell line. In this cell line the antibodies precipitated two protein bands of apparent gel migrations of 200 and 150 kd which together, correspond to the calculated molecular weight of the Huntington disease gene product (350 kd). Immunoblotting experiments revealed the presence of a large precursor protein in the range of 350-750 kd which is in agreement with the predicted molecular weight of the protein without post-translational modifications. These results indicate that the huntingtin protein is cleaved into two subunits in this neuroblastoma cell line and implicate that cleavage of a large precursor protein may contribute to its biological activity. Experiments are ongoing to determine the precursor-product relationship and to examine the synthesis of the huntingtin protein in freshly isolated rat brains, and to determine cellular and subcellular distribution of the gene product.« less

  14. VDJServer: A Cloud-Based Analysis Portal and Data Commons for Immune Repertoire Sequences and Rearrangements.

    PubMed

    Christley, Scott; Scarborough, Walter; Salinas, Eddie; Rounds, William H; Toby, Inimary T; Fonner, John M; Levin, Mikhail K; Kim, Min; Mock, Stephen A; Jordan, Christopher; Ostmeyer, Jared; Buntzman, Adam; Rubelt, Florian; Davila, Marco L; Monson, Nancy L; Scheuermann, Richard H; Cowell, Lindsay G

    2018-01-01

    Recent technological advances in immune repertoire sequencing have created tremendous potential for advancing our understanding of adaptive immune response dynamics in various states of health and disease. Immune repertoire sequencing produces large, highly complex data sets, however, which require specialized methods and software tools for their effective analysis and interpretation. VDJServer is a cloud-based analysis portal for immune repertoire sequence data that provide access to a suite of tools for a complete analysis workflow, including modules for preprocessing and quality control of sequence reads, V(D)J gene segment assignment, repertoire characterization, and repertoire comparison. VDJServer also provides sophisticated visualizations for exploratory analysis. It is accessible through a standard web browser via a graphical user interface designed for use by immunologists, clinicians, and bioinformatics researchers. VDJServer provides a data commons for public sharing of repertoire sequencing data, as well as private sharing of data between users. We describe the main functionality and architecture of VDJServer and demonstrate its capabilities with use cases from cancer immunology and autoimmunity. VDJServer provides a complete analysis suite for human and mouse T-cell and B-cell receptor repertoire sequencing data. The combination of its user-friendly interface and high-performance computing allows large immune repertoire sequencing projects to be analyzed with no programming or software installation required. VDJServer is a web-accessible cloud platform that provides access through a graphical user interface to a data management infrastructure, a collection of analysis tools covering all steps in an analysis, and an infrastructure for sharing data along with workflows, results, and computational provenance. VDJServer is a free, publicly available, and open-source licensed resource.

  15. Diagnostics based on nucleic acid sequence variant profiling: PCR, hybridization, and NGS approaches.

    PubMed

    Khodakov, Dmitriy; Wang, Chunyan; Zhang, David Yu

    2016-10-01

    Nucleic acid sequence variations have been implicated in many diseases, and reliable detection and quantitation of DNA/RNA biomarkers can inform effective therapeutic action, enabling precision medicine. Nucleic acid analysis technologies being translated into the clinic can broadly be classified into hybridization, PCR, and sequencing, as well as their combinations. Here we review the molecular mechanisms of popular commercial assays, and their progress in translation into in vitro diagnostics. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  16. Comparative analysis of the prion protein gene sequences in African lion.

    PubMed

    Wu, Chang-De; Pang, Wan-Yong; Zhao, De-Ming

    2006-10-01

    The prion protein gene of African lion (Panthera Leo) was first cloned and polymorphisms screened. The results suggest that the prion protein gene of eight African lions is highly homogenous. The amino acid sequences of the prion protein (PrP) of all samples tested were identical. Four single nucleotide polymorphisms (C42T, C81A, C420T, T600C) in the prion protein gene (Prnp) of African lion were found, but no amino acid substitutions. Sequence analysis showed that the higher homology is observed to felis catus AF003087 (96.7%) and to sheep number M31313.1 (96.2%) Genbank accessed. With respect to all the mammalian prion protein sequences compared, the African lion prion protein sequence has three amino acid substitutions. The homology might in turn affect the potential intermolecular interactions critical for cross species transmission of prion disease.

  17. Genomic analysis of Meckel–Gruber syndrome in Arabs reveals marked genetic heterogeneity and novel candidate genes

    PubMed Central

    Shaheen, Ranad; Faqeih, Eissa; Alshammari, Muneera J; Swaid, Abdulrahman; Al-Gazali, Lihadh; Mardawi, Elham; Ansari, Shinu; Sogaty, Sameera; Seidahmed, Mohammed Z; AlMotairi, Muhammed I; Farra, Chantal; Kurdi, Wesam; Al-Rasheed, Shatha; Alkuraya, Fowzan S

    2013-01-01

    Meckel–Gruber syndrome (MKS, OMIM #249000) is a multiple congenital malformation syndrome that represents the severe end of the ciliopathy phenotypic spectrum. Despite the relatively common occurrence of this syndrome among Arabs, little is known about its genetic architecture in this population. This is a series of 18 Arab families with MKS, who were evaluated clinically and studied using autozygome-guided mutation analysis and exome sequencing. We show that autozygome-guided candidate gene analysis identified the underlying mutation in the majority (n=12, 71%). Exome sequencing revealed a likely pathogenic mutation in three novel candidate MKS disease genes. These include C5orf42, Ellis–van-Creveld disease gene EVC2 and SEC8 (also known as EXOC4), which encodes an exocyst protein with an established role in ciliogenesis. This is the largest and most comprehensive genomic study on MKS in Arabs and the results, in addition to revealing genetic and allelic heterogeneity, suggest that previously reported disease genes and the novel candidates uncovered by this study account for the overwhelming majority of MKS patients in our population. PMID:23169490

  18. Fine genetic mapping of spot blotch resistance gene Sb3 in wheat (Triticum aestivum).

    PubMed

    Lu, Ping; Liang, Yong; Li, Delin; Wang, Zhengzhong; Li, Wenbin; Wang, Guoxin; Wang, Yong; Zhou, Shenghui; Wu, Qiuhong; Xie, Jingzhong; Zhang, Deyun; Chen, Yongxing; Li, Miaomiao; Zhang, Yan; Sun, Qixin; Han, Chenggui; Liu, Zhiyong

    2016-03-01

    Spot blotch disease resistance gene Sb3 was mapped to a 0.15 centimorgan (cM) genetic interval spanning a 602 kb physical genomic region on chromosome 3BS. Wheat spot blotch disease, caused by B. sorokiniana, is a devastating disease that can cause severe yield losses. Although inoculum levels can be reduced by planting disease-free seed, treatment of plants with fungicides and crop rotation, genetic resistance is likely to be a robust, economical and environmentally friendly tool in the control of spot blotch. The winter wheat line 621-7-1 confers immune resistance against B. sorokiniana. Genetic analysis indicates that the spot blotch resistance of 621-7-1 is controlled by a single dominant gene, provisionally designated Sb3. Bulked segregant analysis (BSA) and simple sequence repeat (SSR) mapping showed that Sb3 is located on chromosome arm 3BS linked with markers Xbarc133 and Xbarc147. Seven and twelve new polymorphic markers were developed from the Chinese Spring 3BS shotgun survey sequence contigs and 3BS reference sequences, respectively. Finally, Sb3 was mapped in a 0.15 cM genetic interval spanning a 602 kb physical genomic region of Chinese Spring chromosome 3BS. The genetic and physical maps of Sb3 provide a framework for map-based cloning and marker-assisted selection (MAS) of the spot blotch resistance.

  19. Mutational and structural analysis of diffuse large B-cell lymphoma using whole genome sequencing | Office of Cancer Genomics

    Cancer.gov

    Abstract: Diffuse large B-cell lymphoma (DLBCL) is a genetically heterogeneous cancer comprising at least two molecular subtypes that differ in gene expression and distribution of mutations. Recently, application of genome/exome sequencing and RNA-seq to DLBCL has revealed numerous genes that are recurrent targets of somatic point mutation in this disease.

  20. Draft Genome Sequence of the Fish Pathogen Flavobacterium columnare Genomovar III Strain PH-97028 (=CIP 109753).

    PubMed

    Criscuolo, Alexis; Chesneau, Olivier; Clermont, Dominique; Bizet, Chantal

    2018-04-05

    Flavobacterium columnare strain PH-97028 (=CIP 109753) is a genomovar III reference strain that was isolated from a diseased Ayu fish in Japan. We report here the analysis of the first available genomovar III sequence of this species to aid in identification, epidemiological tracking, and virulence studies. Copyright © 2018 Criscuolo et al.

  1. in silico Whole Genome Sequencer & Analyzer (iWGS): a computational pipeline to guide the design and analysis of de novo genome sequencing studies

    USDA-ARS?s Scientific Manuscript database

    The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding it...

  2. Is High Resolution Melting Analysis (HRMA) Accurate for Detection of Human Disease-Associated Mutations? A Meta Analysis

    PubMed Central

    Ma, Feng-Li; Jiang, Bo; Song, Xiao-Xiao; Xu, An-Gao

    2011-01-01

    Background High Resolution Melting Analysis (HRMA) is becoming the preferred method for mutation detection. However, its accuracy in the individual clinical diagnostic setting is variable. To assess the diagnostic accuracy of HRMA for human mutations in comparison to DNA sequencing in different routine clinical settings, we have conducted a meta-analysis of published reports. Methodology/Principal Findings Out of 195 publications obtained from the initial search criteria, thirty-four studies assessing the accuracy of HRMA were included in the meta-analysis. We found that HRMA was a highly sensitive test for detecting disease-associated mutations in humans. Overall, the summary sensitivity was 97.5% (95% confidence interval (CI): 96.8–98.5; I2 = 27.0%). Subgroup analysis showed even higher sensitivity for non-HR-1 instruments (sensitivity 98.7% (95%CI: 97.7–99.3; I2 = 0.0%)) and an eligible sample size subgroup (sensitivity 99.3% (95%CI: 98.1–99.8; I2 = 0.0%)). HRMA specificity showed considerable heterogeneity between studies. Sensitivity of the techniques was influenced by sample size and instrument type but by not sample source or dye type. Conclusions/Significance These findings show that HRMA is a highly sensitive, simple and low-cost test to detect human disease-associated mutations, especially for samples with mutations of low incidence. The burden on DNA sequencing could be significantly reduced by the implementation of HRMA, but it should be recognized that its sensitivity varies according to the number of samples with/without mutations, and positive results require DNA sequencing for confirmation. PMID:22194806

  3. An NGS Workflow Blueprint for DNA Sequencing Data and Its Application in Individualized Molecular Oncology

    PubMed Central

    Li, Jian; Batcha, Aarif Mohamed Nazeer; Grüning, Björn; Mansmann, Ulrich R.

    2015-01-01

    Next-generation sequencing (NGS) technologies that have advanced rapidly in the past few years possess the potential to classify diseases, decipher the molecular code of related cell processes, identify targets for decision-making on targeted therapy or prevention strategies, and predict clinical treatment response. Thus, NGS is on its way to revolutionize oncology. With the help of NGS, we can draw a finer map for the genetic basis of diseases and can improve our understanding of diagnostic and prognostic applications and therapeutic methods. Despite these advantages and its potential, NGS is facing several critical challenges, including reduction of sequencing cost, enhancement of sequencing quality, improvement of technical simplicity and reliability, and development of semiautomated and integrated analysis workflow. In order to address these challenges, we conducted a literature research and summarized a four-stage NGS workflow for providing a systematic review on NGS-based analysis, explaining the strength and weakness of diverse NGS-based software tools, and elucidating its potential connection to individualized medicine. By presenting this four-stage NGS workflow, we try to provide a minimal structural layout required for NGS data storage and reproducibility. PMID:27081306

  4. Whole Genome Re-Sequencing and Characterization of Powdery Mildew Disease-Associated Allelic Variation in Melon.

    PubMed

    Natarajan, Sathishkumar; Kim, Hoy-Taek; Thamilarasan, Senthil Kumar; Veerappan, Karpagam; Park, Jong-In; Nou, Ill-Sup

    2016-01-01

    Powdery mildew is one of the most common fungal diseases in the world. This disease frequently affects melon (Cucumis melo L.) and other Cucurbitaceous family crops in both open field and greenhouse cultivation. One of the goals of genomics is to identify the polymorphic loci responsible for variation in phenotypic traits. In this study, powdery mildew disease assessment scores were calculated for four melon accessions, 'SCNU1154', 'Edisto47', 'MR-1', and 'PMR5'. To investigate the genetic variation of these accessions, whole genome re-sequencing using the Illumina HiSeq 2000 platform was performed. A total of 754,759,704 quality-filtered reads were generated, with an average of 82.64% coverage relative to the reference genome. Comparisons of the sequences for the melon accessions revealed around 7.4 million single nucleotide polymorphisms (SNPs), 1.9 million InDels, and 182,398 putative structural variations (SVs). Functional enrichment analysis of detected variations classified them into biological process, cellular component and molecular function categories. Further, a disease-associated QTL map was constructed for 390 SNPs and 45 InDels identified as related to defense-response genes. Among them 112 SNPs and 12 InDels were observed in powdery mildew responsive chromosomes. Accordingly, this whole genome re-sequencing study identified SNPs and InDels associated with defense genes that will serve as candidate polymorphisms in the search for sources of resistance against powdery mildew disease and could accelerate marker-assisted breeding in melon.

  5. Sequence heterogeneity in the two 16S rRNA genes of Phormium yellow leaf phytoplasma.

    PubMed Central

    Liefting, L W; Andersen, M T; Beever, R E; Gardner, R C; Forster, R L

    1996-01-01

    Phormium yellow leaf (PYL) phytoplasma causes a lethal disease of the monocotyledon, New Zealand flax (Phormium tenax). The 16S rRNA genes of PYL phytoplasma were amplified from infected flax by PCR and cloned, and the nucleotide sequences were determined. DNA sequencing and Southern hybridization analysis of genomic DNA indicated the presence of two copies of the 16S rRNA gene. The two 16S rRNA genes exhibited sequence heterogeneity in 4 nucleotide positions and could be distinguished by the restriction enzymes BpmI and BsrI. This is the first record in which sequence heterogeneity in the 16S rRNA genes of a phytoplasma has been determined by sequence analysis. A phylogenetic tree based on 16S rRNA gene sequences showed that PYL phytoplasma is most closely related to the stolbur and German grapevine yellows phytoplasmas, which form the stolbur subgroup of the aster yellows group. This phylogenetic position of PYL phytoplasma was supported by 16S/23S spacer region sequence data. PMID:8795200

  6. Genome and secretome analysis of the hemibiotrophic fungal pathogen, Moniliophthora roreri, which causes frosty pod rot disease of cacao: mechanisms of the biotrophic and necrotrophic phases.

    PubMed

    Meinhardt, Lyndel W; Costa, Gustavo Gilson Lacerda; Thomazella, Daniela P T; Teixeira, Paulo José P L; Carazzolle, Marcelo Falsarella; Schuster, Stephan C; Carlson, John E; Guiltinan, Mark J; Mieczkowski, Piotr; Farmer, Andrew; Ramaraj, Thiruvarangan; Crozier, Jayne; Davis, Robert E; Shao, Jonathan; Melnick, Rachel L; Pereira, Gonçalo A G; Bailey, Bryan A

    2014-02-27

    The basidiomycete Moniliophthora roreri is the causal agent of Frosty pod rot (FPR) disease of cacao (Theobroma cacao), the source of chocolate, and FPR is one of the most destructive diseases of this important perennial crop in the Americas. This hemibiotroph infects only cacao pods and has an extended biotrophic phase lasting up to sixty days, culminating in plant necrosis and sporulation of the fungus without the formation of a basidiocarp. We sequenced and assembled 52.3 Mb into 3,298 contigs that represent the M. roreri genome. Of the 17,920 predicted open reading frames (OFRs), 13,760 were validated by RNA-Seq. Using read count data from RNA sequencing of cacao pods at 30 and 60 days post infection, differential gene expression was estimated for the biotrophic and necrotrophic phases of this plant-pathogen interaction. The sequencing data were used to develop a genome based secretome for the infected pods. Of the 1,535 genes encoding putative secreted proteins, 1,355 were expressed in the biotrophic and necrotrophic phases. Analysis of the data revealed secretome gene expression that correlated with infection and intercellular growth in the biotrophic phase and invasive growth and plant cellular death in the necrotrophic phase. Genome sequencing and RNA-Seq was used to determine and validate the Moniliophthora roreri genome and secretome. High sequence identity between Moniliophthora roreri genes and Moniliophthora perniciosa genes supports the taxonomic relationship with Moniliophthora perniciosa and the relatedness of this fungus to other basidiomycetes. Analysis of RNA-Seq data from infected plant tissues revealed differentially expressed genes in the biotrophic and necrotrophic phases. The secreted protein genes that were upregulated in the biotrophic phase are primarily associated with breakdown of the intercellular matrix and modification of the fungal mycelia, possibly to mask the fungus from plant defenses. Based on the transcriptome data, the upregulated secreted proteins in the necrotrophic phase are hypothesized to be actively attacking the plant cell walls and plant cellular components resulting in necrosis. These genes are being used to develop a new understanding of how this disease interaction progresses and to identify potential targets to reduce the impact of this devastating disease.

  7. Genome and secretome analysis of the hemibiotrophic fungal pathogen, Moniliophthora roreri, which causes frosty pod rot disease of cacao: mechanisms of the biotrophic and necrotrophic phases

    PubMed Central

    2014-01-01

    Background The basidiomycete Moniliophthora roreri is the causal agent of Frosty pod rot (FPR) disease of cacao (Theobroma cacao), the source of chocolate, and FPR is one of the most destructive diseases of this important perennial crop in the Americas. This hemibiotroph infects only cacao pods and has an extended biotrophic phase lasting up to sixty days, culminating in plant necrosis and sporulation of the fungus without the formation of a basidiocarp. Results We sequenced and assembled 52.3 Mb into 3,298 contigs that represent the M. roreri genome. Of the 17,920 predicted open reading frames (OFRs), 13,760 were validated by RNA-Seq. Using read count data from RNA sequencing of cacao pods at 30 and 60 days post infection, differential gene expression was estimated for the biotrophic and necrotrophic phases of this plant-pathogen interaction. The sequencing data were used to develop a genome based secretome for the infected pods. Of the 1,535 genes encoding putative secreted proteins, 1,355 were expressed in the biotrophic and necrotrophic phases. Analysis of the data revealed secretome gene expression that correlated with infection and intercellular growth in the biotrophic phase and invasive growth and plant cellular death in the necrotrophic phase. Conclusions Genome sequencing and RNA-Seq was used to determine and validate the Moniliophthora roreri genome and secretome. High sequence identity between Moniliophthora roreri genes and Moniliophthora perniciosa genes supports the taxonomic relationship with Moniliophthora perniciosa and the relatedness of this fungus to other basidiomycetes. Analysis of RNA-Seq data from infected plant tissues revealed differentially expressed genes in the biotrophic and necrotrophic phases. The secreted protein genes that were upregulated in the biotrophic phase are primarily associated with breakdown of the intercellular matrix and modification of the fungal mycelia, possibly to mask the fungus from plant defenses. Based on the transcriptome data, the upregulated secreted proteins in the necrotrophic phase are hypothesized to be actively attacking the plant cell walls and plant cellular components resulting in necrosis. These genes are being used to develop a new understanding of how this disease interaction progresses and to identify potential targets to reduce the impact of this devastating disease. PMID:24571091

  8. Human ETS2 gene on chromosome 21 is not rearranged in Alzheimer disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sacchi, N.; Nalbantoglu, J.; Sergovich, F.R.

    1988-10-01

    The human ETS2 gene, a member of the ETS gene family, with sequence homology with the retroviral ets sequence of the avian erythroblastosis retrovirus E26 is located on chromosome 21. Molecular genetic analysis of Down syndrome (DS) patients with partial trisomy 21 allowed us to reinforce the supposition that ETS2 may be a gene of the minimal DS genetic region. It was originally proposed that a duplication of a portion of the DS region represents the genetic basis of Alzheimer disease, a condition associated also with DS. No evidence of either rearrangements or duplications of ETS2 could be detected inmore » DNA from fibroblasts and brain tissue of Alzheimer disease patients with either the sporadic or the familiar form of the disease. Thus, an altered ETS2 gene dosage does not seem to be a genetic cause or component of Alzheimer disease.« less

  9. Molecular epidemiology of goat pox viruses.

    PubMed

    Roy, P; Jaisree, S; Balakrishnan, S; Senthilkumar, K; Mahaprabhu, R; Mishra, A; Maity, B; Ghosh, T K; Karmakar, A P

    2018-02-01

    Goat pox disease outbreaks were observed in different places affecting Black Bengal Goats in West Bengal (WB) and Tellicherry, Vembur and non-descriptive breeds in Tamil Nadu (TN) causing severe lesions and mortality up to 30%. Clinical specimens from all the outbreaks were screened by polymerase chain reaction followed by restriction fragment length polymorphism (PCR-RFLP) and confirmed the diseases as Goat Pox. Virus isolation in Vero cell line was done with randomly selected ten samples, cytopathic effects (CPE) characterized by syncytia and intracytoplasmic inclusion bodies were observed after several blind passages. Nucleotide sequence of complete p32 gene using randomly selected two isolates and three clinical specimens revealed presence of Goat pox virus (GTPV)-specific signature residues in all the sequences. Phylogenetic analysis using the present five sequences along with GenBank data of GTPV complete p32 gene sequences showed all the GTPV sequences cluster together except Pellor strain (NC004003) and FZ Chinese strain (KC951854). The five sequences either from WB or TN cluster more closely with GTPV isolates of Maharashtra state that were responsible for cross species outbreak of pox disease in both sheep (KF468759) and goats (KF468762) in India during the year 2010. All the Indian goat pox viruses, including the Mukteswar strain, isolated in 1946 and sequence reported in 2004 clustered together with the GTPVs causing the recent outbreaks. It was observed that GTPVs caused similar clinical manifestation irrespective of their geographical locations and breed characteristics, no variation observed among the Indian isolates based on p32 gene over the period of seventy years and disease outbreaks could not be observed or reported in vaccinated goats. © 2017 Blackwell Verlag GmbH.

  10. Novel compound heterozygous mutations in MYO7A in a Chinese family with Usher syndrome type 1.

    PubMed

    Liu, Fei; Li, Pengcheng; Liu, Ying; Li, Weirong; Wong, Fulton; Du, Rong; Wang, Lei; Li, Chang; Jiang, Fagang; Tang, Zhaohui; Liu, Mugen

    2013-01-01

    To identify the disease-causing mutation(s) in a Chinese family with autosomal recessive Usher syndrome type 1 (USH1). An ophthalmic examination and an audiometric test were conducted to ascertain the phenotype of two affected siblings. The microsatellite marker D11S937, which is close to the candidate gene MYO7A (USH1B locus), was selected for genotyping. From the DNA of the proband, all coding exons and exon-intron boundaries of MYO7A were sequenced to identify the disease-causing mutation(s). Restriction fragment length polymorphism (RFLP) analysis was performed to exclude the alternative conclusion that the mutations are non-pathogenic rare polymorphisms. Based on severe hearing impairment, unintelligible speech, and retinitis pigmentosa, a clinical diagnosis of Usher syndrome type 1 was made. The genotyping results did not exclude the USH1B locus, which suggested that the MYO7A gene was likely the gene associated with the disease-causing mutation(s) in the family. With direct DNA sequencing of MYO7A, two novel compound heterozygous mutations (c.3742G>A and c.6051+1G>A) of MYO7A were identified in the proband. DNA sequence analysis and RFLP analysis of other family members showed that the mutations cosegregated with the disease. Unaffected members, including the parents, uncle, and sister of the proband, carry only one of the two mutations. The mutations were not present in the controls (100 normal Chinese subjects=200 chromosomes) according to the RFLP analysis. In this study, we identified two novel mutations, c.3742G>A (p.E1248K) and c.6051+1G>A (donor splice site mutation in intron 44), of MYO7A in a Chinese non-consanguineous family with USH1. The mutations cosegregated with the disease and most likely cause the phenotype in the two affected siblings who carry these mutations compound heterozygously. Our finding expands the mutational spectrum of MYO7A.

  11. Role of the human endogenous retrovirus HERV-K18 in autoimmune disease susceptibility: study in the Spanish population and meta-analysis.

    PubMed

    de la Hera, Belén; Varadé, Jezabel; García-Montojo, Marta; Lamas, José Ramón; de la Encarnación, Ana; Arroyo, Rafael; Fernández-Gutiérrez, Benjamín; Alvarez-Lafuente, Roberto; Urcelay, Elena

    2013-01-01

    Human endogenous retroviruses (HERVs) are genomic sequences that resulted from ancestral germ-line infections by exogenous retroviruses and therefore are transmitted in a Mendelian fashion. Increased HERV expression and antibodies to HERV antigens have been found in various autoimmune diseases. HERV-K18 in chromosome 1 was previously associated with type one diabetes and multiple sclerosis (MS). The etiology of these complex conditions has not been completely elucidated even after the powerful genome wide association studies (GWAS) performed. Nonetheless, this approach does not scrutinize the repetitive sequences within the genome, and part of the missing heritability could lie behind these sequences. We aimed at evaluating the role of HERV-K18 in chromosome 1 on autoimmune disease susceptibility. Two HERV-K18 SNPs (97Y/C and 154W/Stop substitutions) conforming three haplotypes were genotyped in Spanish cohorts of multiple sclerosis (n = 942), rheumatoid arthritis (n = 462) and ethnically matched controls (n = 601). Our findings were pooled in a meta-analysis including 5312 autoimmune patients and 4032 controls. Significant associations of both HERV-K18 polymorphisms in chromosome 1 with MS patients stratified by HLA-DRB1*15:01 were observed [97Y/C p = 0.02; OR (95% CI) = 1.5 (1.04-2.17) and 154W/Stop: p = 0.001; OR (95% CI) = 1.6 (1.19-2.16)]. Combined meta-analysis of the previously published association studies of HERV-K18 with different autoimmune diseases, together with data derived from Spanish cohorts, yielded a significant association of the HERV-K18.3 haplotype [97Y-154W: p(M-H) = 0.0008; OR(M-H) (95% CI) = 1.22 (1.09-1.38)]. Association of the HERV-K18.3 haplotype in chromosome 1 with autoimmune-disease susceptibility was confirmed through meta-analysis.

  12. Ultra-Deep Sequencing Analysis of the Hepatitis A Virus 5'-Untranslated Region among Cases of the Same Outbreak from a Single Source

    PubMed Central

    Wu, Shuang; Nakamoto, Shingo; Kanda, Tatsuo; Jiang, Xia; Nakamura, Masato; Miyamura, Tatsuo; Shirasawa, Hiroshi; Sugiura, Nobuyuki; Takahashi-Nakaguchi, Azusa; Gonoi, Tohru; Yokosuka, Osamu

    2014-01-01

    Hepatitis A virus (HAV) is a causative agent of acute viral hepatitis for which an effective vaccine has been developed. Here we describe ultra-deep pyrosequences (UDPSs) of HAV 5'-untranslated region (5'UTR) among cases of the same outbreak, which arose from a single source, associated with a revolving sushi bar. We determined the reference sequence from HAV-derived clone from an attendant by the Sanger method. Sixteen UDPSs from this outbreak and one from another sporadic case were compared with this reference. Nucleotide errors yielded a UDPS error rate of < 1%. This study confirmed that nucleotide substitutions of this region are transition mutations in outbreak cases, that insertion was observed only in non-severe cases, and that these nucleotide substitutions were different from those of the sporadic case. Analysis of UDPSs detected low-prevalence HAV variations in 5'UTR, but no specific mutations associated with severity in these outbreak cases. To our surprise, HAV strains in this outbreak conserved HAV IRES sequence even if we performed analysis of UDPSs. UDPS analysis of HAV 5'UTR gave us no association between the disease severity of hepatitis A and HAV 5'UTR substitutions. It might be more interesting to perform ultra-deep sequencing of full length HAV genome in order to reveal possible unknown genomic determinants associated with disease severity. Further studies will be needed. PMID:24396287

  13. Single-Cell Sequencing Technologies for Cardiac Stem Cell Studies.

    PubMed

    Liu, Tiantian; Wu, Hongjin; Wu, Shixiu; Wang, Charles

    2017-11-01

    Today with the rapid advancements in stem cell studies and the promising potential of using stem cells in clinical therapy, there is an increasing demand for in-depth comprehensive analysis on individual cell transcriptome and epigenome, as they play critical roles in a number of cell functions such as cell differentiation, growth, and reprogramming. The development of single-cell sequencing technologies has helped in revealing some exciting new perspectives in stem cells and regenerative medicine research. Among the various potential applications, single-cell analysis for cardiac stem cells (CSCs) holds tremendous promises in understanding the mechanisms of heart development and regeneration, which might light up the path toward cell therapy for cardiovascular diseases. This review briefly highlights the recent progresses in single-cell sequencing analysis technologies and their applications in CSC research.

  14. 'Candidatus Phytoplasma sudamericanum' a novel taxon from diseased passion fruit (Passiflora edulis f. flavicarpa Deg.)

    USDA-ARS?s Scientific Manuscript database

    Symptoms of abnormal proliferation of shoots resulting in formation of witches’ broom growths were observed in diseased plants of passion fruit (Passiflora edulis f. flavicarpa Deg.) in Brazil. RFLP analysis of 16S rRNA gene sequences amplified in polymerase chain reactions containing template DNAs...

  15. Analysis of the genome sequence of Phomopsis longicolla: A fungal pathogen causing Phomopsis seed decay in soybean

    USDA-ARS?s Scientific Manuscript database

    Phomopsis longicolla T. W. Hobbs (syn. Diaporthe longicolla) is a seed-borne fungus causing Phomopsis seed decay in soybean. This disease is one of the most devastating diseases reducing soybean seed quality worldwide. To facilitate investigation of the genomic basis of pathogenicity and to understa...

  16. Identification of species of viridans group streptococci in clinical blood culture isolates by sequence analysis of the RNase P RNA gene, rnpB.

    PubMed

    Westling, Katarina; Julander, Inger; Ljungman, Per; Vondracek, Martin; Wretlind, Bengt; Jalal, Shah

    2008-03-01

    Viridans group streptococci (VGS) cause severe diseases such as infective endocarditis and septicaemia. Genetically, VGS species are very close to each other and it is difficult to identify them to species level with conventional methods. The aims of the present study were to use sequence analysis of the RNase P RNA gene (rnpB) to identify VGS species in clinical blood culture isolates, and to compare the results with the API 20 Strep system that is based on phenotypical characteristics. Strains from patients with septicaemia or endocarditis were analysed with PCR amplification and sequence analysis of the rnpB gene. Clinical data were registered as well. One hundred and thirty two VGS clinical blood culture isolates from patients with septicaemia (n=95) or infective endocarditis (n=36) were analysed; all but one were identified by rnpB. Streptococcus oralis, Streptococcus sanguinis and Streptococcus gordonii strains were most common in the patients with infective endocarditis. In the isolates from patients with haematological diseases, Streptococcus mitis and S. oralis dominated. In addition in 76 of the isolates it was possible to compare the results from rnpB analysis and the API 20 Strep system. In 39/76 (51%) of the isolates the results were concordant to species level; in 55 isolates there were no results from API 20 Strep. Sequence analysis of the RNase P RNA gene (rnpB) showed that almost all isolates could be identified. This could be of importance for evaluation of the portal of entry in patients with septicaemia or infective endocarditis.

  17. Connecting the Human Variome Project to nutrigenomics.

    PubMed

    Kaput, Jim; Evelo, Chris T; Perozzi, Giuditta; van Ommen, Ben; Cotton, Richard

    2010-12-01

    Nutrigenomics is the science of analyzing and understanding gene-nutrient interactions, which because of the genetic heterogeneity, varying degrees of interaction among gene products, and the environmental diversity is a complex science. Although much knowledge of human diversity has been accumulated, estimates suggest that ~90% of genetic variation has not yet been characterized. Identification of the DNA sequence variants that contribute to nutrition-related disease risk is essential for developing a better understanding of the complex causes of disease in humans, including nutrition-related disease. The Human Variome Project (HVP; http://www.humanvariomeproject.org/) is an international effort to systematically identify genes, their mutations, and their variants associated with phenotypic variability and indications of human disease or phenotype. Since nutrigenomic research uses genetic information in the design and analysis of experiments, the HVP is an essential collaborator for ongoing studies of gene-nutrient interactions. With the advent of next generation sequencing methodologies and the understanding of the undiscovered variation in human genomes, the nutrigenomic community will be generating novel sequence data and results. The guidelines and practices of the HVP can guide and harmonize these efforts.

  18. Connecting the Human Variome Project to nutrigenomics

    PubMed Central

    Evelo, Chris T.; Perozzi, Giuditta; van Ommen, Ben; Cotton, Richard

    2010-01-01

    Nutrigenomics is the science of analyzing and understanding gene–nutrient interactions, which because of the genetic heterogeneity, varying degrees of interaction among gene products, and the environmental diversity is a complex science. Although much knowledge of human diversity has been accumulated, estimates suggest that ~90% of genetic variation has not yet been characterized. Identification of the DNA sequence variants that contribute to nutrition-related disease risk is essential for developing a better understanding of the complex causes of disease in humans, including nutrition-related disease. The Human Variome Project (HVP; http://www.humanvariomeproject.org/) is an international effort to systematically identify genes, their mutations, and their variants associated with phenotypic variability and indications of human disease or phenotype. Since nutrigenomic research uses genetic information in the design and analysis of experiments, the HVP is an essential collaborator for ongoing studies of gene–nutrient interactions. With the advent of next generation sequencing methodologies and the understanding of the undiscovered variation in human genomes, the nutrigenomic community will be generating novel sequence data and results. The guidelines and practices of the HVP can guide and harmonize these efforts. PMID:28300226

  19. A Systematic Bayesian Integration of Epidemiological and Genetic Data

    PubMed Central

    Lau, Max S. Y.; Marion, Glenn; Streftaris, George; Gibson, Gavin

    2015-01-01

    Genetic sequence data on pathogens have great potential to inform inference of their transmission dynamics ultimately leading to better disease control. Where genetic change and disease transmission occur on comparable timescales additional information can be inferred via the joint analysis of such genetic sequence data and epidemiological observations based on clinical symptoms and diagnostic tests. Although recently introduced approaches represent substantial progress, for computational reasons they approximate genuine joint inference of disease dynamics and genetic change in the pathogen population, capturing partially the joint epidemiological-evolutionary dynamics. Improved methods are needed to fully integrate such genetic data with epidemiological observations, for achieving a more robust inference of the transmission tree and other key epidemiological parameters such as latent periods. Here, building on current literature, a novel Bayesian framework is proposed that infers simultaneously and explicitly the transmission tree and unobserved transmitted pathogen sequences. Our framework facilitates the use of realistic likelihood functions and enables systematic and genuine joint inference of the epidemiological-evolutionary process from partially observed outbreaks. Using simulated data it is shown that this approach is able to infer accurately joint epidemiological-evolutionary dynamics, even when pathogen sequences and epidemiological data are incomplete, and when sequences are available for only a fraction of exposures. These results also characterise and quantify the value of incomplete and partial sequence data, which has important implications for sampling design, and demonstrate the abilities of the introduced method to identify multiple clusters within an outbreak. The framework is used to analyse an outbreak of foot-and-mouth disease in the UK, enhancing current understanding of its transmission dynamics and evolutionary process. PMID:26599399

  20. GNE missense mutation in recessive familial amyotrophic lateral sclerosis.

    PubMed

    Köroğlu, Çiğdem; Yılmaz, Rezzak; Sorgun, Mine Hayriye; Solakoğlu, Seyhun; Şener, Özden

    2017-12-01

    Amyotrophic lateral sclerosis (ALS) is a motor neuron disease eventually leading to death from respiratory failure. Recessive inheritance is very rare. Here, we describe the clinical findings in a consanguineous family with five men afflicted with recessive ALS and the identification of the homozygous mutation responsible for the disorder. The onset of the disease ranged from 12 to 35 years of age, with variable disease progressions. We performed clinical investigations including metabolic and paraneoplastic screening, cranial and cervical imaging, and electrophysiology. We mapped the disease gene to 9p21.1-p12 with a LOD score of 5.2 via linkage mapping using genotype data for single-nucleotide polymorphism markers and performed exome sequence analysis to identify the disease-causing gene variant. We also Sanger sequenced all coding sequences of SIGMAR1, a gene reported as responsible for juvenile ALS in a family. We did not find any mutation in SIGMAR1. Instead, we identified a novel homozygous missense mutation p.(His705Arg) in GNE which was predicted as damaging by online tools. GNE has been associated with inclusion body myopathy and is expressed in many tissues. We propose that the GNE mutation underlies the pathology in the family.

  1. Exome sequencing and genome-wide linkage analysis in 17 families illustrate the complex contribution of TTN truncating variants to dilated cardiomyopathy.

    PubMed

    Norton, Nadine; Li, Duanxiang; Rampersaud, Evadnie; Morales, Ana; Martin, Eden R; Zuchner, Stephan; Guo, Shengru; Gonzalez, Michael; Hedges, Dale J; Robertson, Peggy D; Krumm, Niklas; Nickerson, Deborah A; Hershberger, Ray E

    2013-04-01

    BACKGROUND- Familial dilated cardiomyopathy (DCM) is a genetically heterogeneous disease with >30 known genes. TTN truncating variants were recently implicated in a candidate gene study to cause 25% of familial and 18% of sporadic DCM cases. METHODS AND RESULTS- We used an unbiased genome-wide approach using both linkage analysis and variant filtering across the exome sequences of 48 individuals affected with DCM from 17 families to identify genetic cause. Linkage analysis ranked the TTN region as falling under the second highest genome-wide multipoint linkage peak, multipoint logarithm of odds, 1.59. We identified 6 TTN truncating variants carried by individuals affected with DCM in 7 of 17 DCM families (logarithm of odds, 2.99); 2 of these 7 families also had novel missense variants that segregated with disease. Two additional novel truncating TTN variants did not segregate with DCM. Nucleotide diversity at the TTN locus, including missense variants, was comparable with 5 other known DCM genes. The average number of missense variants in the exome sequences from the DCM cases or the ≈5400 cases from the Exome Sequencing Project was ≈23 per individual. The average number of TTN truncating variants in the Exome Sequencing Project was 0.014 per individual. We also identified a region (chr9q21.11-q22.31) with no known DCM genes with a maximum heterogeneity logarithm of odds score of 1.74. CONCLUSIONS- These data suggest that TTN truncating variants contribute to DCM cause. However, the lack of segregation of all identified TTN truncating variants illustrates the challenge of determining variant pathogenicity even with full exome sequencing.

  2. SeqHBase: a big data toolset for family based sequencing data analysis.

    PubMed

    He, Min; Person, Thomas N; Hebbring, Scott J; Heinzen, Ethan; Ye, Zhan; Schrodi, Steven J; McPherson, Elizabeth W; Lin, Simon M; Peissig, Peggy L; Brilliant, Murray H; O'Rawe, Jason; Robison, Reid J; Lyon, Gholson J; Wang, Kai

    2015-04-01

    Whole-genome sequencing (WGS) and whole-exome sequencing (WES) technologies are increasingly used to identify disease-contributing mutations in human genomic studies. It can be a significant challenge to process such data, especially when a large family or cohort is sequenced. Our objective was to develop a big data toolset to efficiently manipulate genome-wide variants, functional annotations and coverage, together with conducting family based sequencing data analysis. Hadoop is a framework for reliable, scalable, distributed processing of large data sets using MapReduce programming models. Based on Hadoop and HBase, we developed SeqHBase, a big data-based toolset for analysing family based sequencing data to detect de novo, inherited homozygous, or compound heterozygous mutations that may contribute to disease manifestations. SeqHBase takes as input BAM files (for coverage at every site), variant call format (VCF) files (for variant calls) and functional annotations (for variant prioritisation). We applied SeqHBase to a 5-member nuclear family and a 10-member 3-generation family with WGS data, as well as a 4-member nuclear family with WES data. Analysis times were almost linearly scalable with number of data nodes. With 20 data nodes, SeqHBase took about 5 secs to analyse WES familial data and approximately 1 min to analyse WGS familial data. These results demonstrate SeqHBase's high efficiency and scalability, which is necessary as WGS and WES are rapidly becoming standard methods to study the genetics of familial disorders. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  3. Cloning and sequence analysis of the Antheraea pernyi nucleopolyhedrovirus gp64 gene.

    PubMed

    Wang, Wenbing; Zhu, Shanying; Wang, Liqun; Yu, Feng; Shen, Weide

    2005-12-01

    Frequent outbreaks of the purulence disease of Chinese oak silkworm are reported in Middle and Northeast China. The disease is produced by the pathogen Antheraea pernyi nucleopolyhedrovirus (AnpeNPV). To obtain molecular information of the virus, the polyhedra of AnpeNPV were purified and characterized. The genomic DNA of AnpeNPV was extracted and digested with HindIII. The genome size of AnpeNPV is estimated at 128 kb. Based on the analysis of DNA fragments digested with HindIII, 23 fragments were bigger than 564 bp. A genomic library was generated using HindIII and the positive clones were sequenced and analysed. The gp64 gene, encoding the baculovirus envelope protein GP64, was found in an insert. The nucleotide sequence analysis indicated that the AnpeNPV gp64 gene consists of a 1,530 nucleotide open reading frame (ORF), encoding a protein of 509 amino acids. Of the eight gp64 homologues, the AnpeNPV gp64 ORF shared the most sequence similarity with the gp64 gene of Anticarsia gemmatalis NPV, but not Bombyx mori NPV. The upstream region of the AnpeNPV gp64 ORF encoded the conserved transcriptional elements for early and late stage of the viral infection cycle. These results indicated that AnpeNPV belongs to group I NPV and was far removed in molecular phylogeny from the BmNPV.

  4. Familial cases of Norrie disease detected by copy number analysis.

    PubMed

    Arai, Eisuke; Fujimaki, Takuro; Yanagawa, Ai; Fujiki, Keiko; Yokoyama, Toshiyuki; Okumura, Akihisa; Shimizu, Toshiaki; Murakami, Akira

    2014-09-01

    Norrie disease (ND, MIM#310600) is an X-linked disorder characterized by severe vitreoretinal dysplasia at birth. We report the results of causative NDP gene analysis in three male siblings with Norrie disease and describe the associated phenotypes. Three brothers with suspected Norrie disease and their mother presented for clinical examination. After obtaining informed consent, DNA was extracted from the peripheral blood of the proband, one of his brothers and his unaffected mother. Exons 1-3 of the NDP gene were amplified by polymerase chain reaction (PCR), and direct sequencing was performed. Multiplex ligation-dependent probe amplification (MLPA) was also performed to search for copy number variants in the NDP gene. The clinical findings of the three brothers included no light perception, corneal opacity, shallow anterior chamber, leukocoria, total retinal detachment and mental retardation. Exon 2 of the NDP gene was not amplified in the proband and one brother, even when the PCR primers for exon 2 were changed, whereas the other two exons showed no mutations by direct sequencing. MLPA analysis showed deletion of exon 2 of the NDP gene in the proband and one brother, while there was only one copy of exon 2 in the mother. Norrie disease was diagnosed in three patients from a Japanese family by clinical examination and was confirmed by genetic analysis. To localize the defect, confirmation of copy number variation by the MLPA method was useful in the present study.

  5. Design of association studies with pooled or un-pooled next-generation sequencing data.

    PubMed

    Kim, Su Yeon; Li, Yingrui; Guo, Yiran; Li, Ruiqiang; Holmkvist, Johan; Hansen, Torben; Pedersen, Oluf; Wang, Jun; Nielsen, Rasmus

    2010-07-01

    Most common hereditary diseases in humans are complex and multifactorial. Large-scale genome-wide association studies based on SNP genotyping have only identified a small fraction of the heritable variation of these diseases. One explanation may be that many rare variants (a minor allele frequency, MAF <5%), which are not included in the common genotyping platforms, may contribute substantially to the genetic variation of these diseases. Next-generation sequencing, which would allow the analysis of rare variants, is now becoming so cheap that it provides a viable alternative to SNP genotyping. In this paper, we present cost-effective protocols for using next-generation sequencing in association mapping studies based on pooled and un-pooled samples, and identify optimal designs with respect to total number of individuals, number of individuals per pool, and the sequencing coverage. We perform a small empirical study to evaluate the pooling variance in a realistic setting where pooling is combined with exon-capturing. To test for associations, we develop a likelihood ratio statistic that accounts for the high error rate of next-generation sequencing data. We also perform extensive simulations to determine the power and accuracy of this method. Overall, our findings suggest that with a fixed cost, sequencing many individuals at a more shallow depth with larger pool size achieves higher power than sequencing a small number of individuals in higher depth with smaller pool size, even in the presence of high error rates. Our results provide guidelines for researchers who are developing association mapping studies based on next-generation sequencing. (c) 2010 Wiley-Liss, Inc.

  6. Construction of a yeast artificial chromosome contig encompassing the chromosome 14 Alzheimer`s disease locus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sharma, V.; Bonnycastle, L.; Poorkai, P.

    1994-09-01

    We have constructed a yeast artificial chromosome (YAC) contig of chromosome 14q24.3 which encompasses the chromosome 14 Alzheimer`s disease locus (AD3). Determined by linkage analysis of early-onset Alzheimer`s disease kindreds, this interval is bounded by the genetic markers D14S61-D14S63 and spans approximately 15 centimorgans. The contig consists of 29 markers and 74 YACs of which 57 are defined by one or more sequence tagged sites (STSs). The STS markers comprise 5 genes, 16 short tandem repeat polymorphisms and 8 cDNA clones. An additional number of genes, expressed sequence tags and cDNA fragments have been identified and localized to the contigmore » by hybridization and sequence analysis of anonymous clones isolated by cDNA direct selection techniques. A minimal contig of about 15 YACs averaging 0.5-1.5 megabase in length will span this interval and is, at first approximation, in rough agreement with the genetic map. For two regions of the contig, our coverage has relied on L1/THE fingerprint and Alu-PCR hybridization data of YACs provided by CEPH/Genethon. We are currently developing sequence tagged sites from these to confirm the overlaps revealed by the fingerprint data. Among the genes which map to the contig are transforming growth factor beta 3, c-fos, and heat shock protein 2A (HSPA2). C-fos is not a candidate gene for AD3 based on the sequence analysis of affected and unaffected individuals. HSPA2 maps to the proximal edge of the contig and Calmodulin 1, a candidate gene from 4q24.3, maps outside of the region. The YAC contig is a framework physical map from which cosmid or P1 clone contigs can be constructed. As more genes and cDNAs are mapped, a highly resolved transcription map will emerge, a necessary step towards positionally cloning the AD3 gene.« less

  7. 'Candidatus Phytoplasma phoenicium' associated with almond witches'-broom disease: from draft genome to genetic diversity among strain populations.

    PubMed

    Quaglino, Fabio; Kube, Michael; Jawhari, Maan; Abou-Jawdah, Yusuf; Siewert, Christin; Choueiri, Elia; Sobh, Hana; Casati, Paola; Tedeschi, Rosemarie; Lova, Marina Molino; Alma, Alberto; Bianco, Piero Attilio

    2015-07-30

    Almond witches'-broom (AlmWB), a devastating disease of almond, peach and nectarine in Lebanon, is associated with 'Candidatus Phytoplasma phoenicium'. In the present study, we generated a draft genome sequence of 'Ca. P. phoenicium' strain SA213, representative of phytoplasma strain populations from different host plants, and determined the genetic diversity among phytoplasma strain populations by phylogenetic analyses of 16S rRNA, groEL, tufB and inmp gene sequences. Sequence-based typing and phylogenetic analysis of the gene inmp, coding an integral membrane protein, distinguished AlmWB-associated phytoplasma strains originating from diverse host plants, whereas their 16S rRNA, tufB and groEL genes shared 100 % sequence identity. Moreover, dN/dS analysis indicated positive selection acting on inmp gene. Additionally, the analysis of 'Ca. P. phoenicium' draft genome revealed the presence of integral membrane proteins and effector-like proteins and potential candidates for interaction with hosts. One of the integral membrane proteins was predicted as BI-1, an inhibitor of apoptosis-promoting Bax factor. Bioinformatics analyses revealed the presence of putative BI-1 in draft and complete genomes of other 'Ca. Phytoplasma' species. The genetic diversity within 'Ca. P. phoenicium' strain populations in Lebanon suggested that AlmWB disease could be associated with phytoplasma strains derived from the adaptation of an original strain to diverse hosts. Moreover, the identification of a putative inhibitor of apoptosis-promoting Bax factor (BI-1) in 'Ca. P. phoenicium' draft genome and within genomes of other 'Ca. Phytoplasma' species suggested its potential role as a phytoplasma fitness-increasing factor by modification of the host-defense response.

  8. Recovery of divergent avian bornaviruses from cases of proventricular dilatation disease: Identification of a candidate etiologic agent

    PubMed Central

    Kistler, Amy L; Gancz, Ady; Clubb, Susan; Skewes-Cox, Peter; Fischer, Kael; Sorber, Katherine; Chiu, Charles Y; Lublin, Avishai; Mechani, Sara; Farnoushi, Yigal; Greninger, Alexander; Wen, Christopher C; Karlene, Scott B; Ganem, Don; DeRisi, Joseph L

    2008-01-01

    Background Proventricular dilatation disease (PDD) is a fatal disorder threatening domesticated and wild psittacine birds worldwide. It is characterized by lymphoplasmacytic infiltration of the ganglia of the central and peripheral nervous system, leading to central nervous system disorders as well as disordered enteric motility and associated wasting. For almost 40 years, a viral etiology for PDD has been suspected, but to date no candidate etiologic agent has been reproducibly linked to the disease. Results Analysis of 2 PDD case-control series collected independently on different continents using a pan-viral microarray revealed a bornavirus hybridization signature in 62.5% of the PDD cases (5/8) and none of the controls (0/8). Ultra high throughput sequencing was utilized to recover the complete viral genome sequence from one of the virus-positive PDD cases. This revealed a bornavirus-like genome organization for this agent with a high degree of sequence divergence from all prior bornavirus isolates. We propose the name avian bornavirus (ABV) for this agent. Further specific ABV PCR analysis of an additional set of independently collected PDD cases and controls yielded a significant difference in ABV detection rate among PDD cases (71%, n = 7) compared to controls (0%, n = 14) (P = 0.01; Fisher's Exact Test). Partial sequence analysis of a total of 16 ABV isolates we have now recovered from these and an additional set of cases reveals at least 5 distinct ABV genetic subgroups. Conclusion These studies clearly demonstrate the existence of an avian reservoir of remarkably diverse bornaviruses and provide a compelling candidate in the search for an etiologic agent of PDD. PMID:18671869

  9. Natural Variation of Epstein-Barr Virus Genes, Proteins, and Primary MicroRNA.

    PubMed

    Correia, Samantha; Palser, Anne; Elgueta Karstegl, Claudio; Middeldorp, Jaap M; Ramayanti, Octavia; Cohen, Jeffrey I; Hildesheim, Allan; Fellner, Maria Dolores; Wiels, Joelle; White, Robert E; Kellam, Paul; Farrell, Paul J

    2017-08-01

    Viral gene sequences from an enlarged set of about 200 Epstein-Barr virus (EBV) strains, including many primary isolates, have been used to investigate variation in key viral genetic regions, particularly LMP1, Zp, gp350, EBNA1, and the BART microRNA (miRNA) cluster 2. Determination of type 1 and type 2 EBV in saliva samples from people from a wide range of geographic and ethnic backgrounds demonstrates a small percentage of healthy white Caucasian British people carrying predominantly type 2 EBV. Linkage of Zp and gp350 variants to type 2 EBV is likely to be due to their genes being adjacent to the EBNA3 locus, which is one of the major determinants of the type 1/type 2 distinction. A novel classification of EBNA1 DNA binding domains, named QCIGP, results from phylogeny analysis of their protein sequences but is not linked to the type 1/type 2 classification. The BART cluster 2 miRNA region is classified into three major variants through single-nucleotide polymorphisms (SNPs) in the primary miRNA outside the mature miRNA sequences. These SNPs can result in altered levels of expression of some miRNAs from the BART variant frequently present in Chinese and Indonesian nasopharyngeal carcinoma (NPC) samples. The EBV genetic variants identified here provide a basis for future, more directed analysis of association of specific EBV variations with EBV biology and EBV-associated diseases. IMPORTANCE Incidence of diseases associated with EBV varies greatly in different parts of the world. Thus, relationships between EBV genome sequence variation and health, disease, geography, and ethnicity of the host may be important for understanding the role of EBV in diseases and for development of an effective EBV vaccine. This paper provides the most comprehensive analysis so far of variation in specific EBV genes relevant to these diseases and proposed EBV vaccines. By focusing on variation in LMP1, Zp, gp350, EBNA1, and the BART miRNA cluster 2, new relationships with the known type 1/type 2 strains are demonstrated, and a novel classification of EBNA1 and the BART miRNAs is proposed. Copyright © 2017 Correia et al.

  10. BayesPI-BAR: a new biophysical model for characterization of regulatory sequence variations

    PubMed Central

    Wang, Junbai; Batmanov, Kirill

    2015-01-01

    Sequence variations in regulatory DNA regions are known to cause functionally important consequences for gene expression. DNA sequence variations may have an essential role in determining phenotypes and may be linked to disease; however, their identification through analysis of massive genome-wide sequencing data is a great challenge. In this work, a new computational pipeline, a Bayesian method for protein–DNA interaction with binding affinity ranking (BayesPI-BAR), is proposed for quantifying the effect of sequence variations on protein binding. BayesPI-BAR uses biophysical modeling of protein–DNA interactions to predict single nucleotide polymorphisms (SNPs) that cause significant changes in the binding affinity of a regulatory region for transcription factors (TFs). The method includes two new parameters (TF chemical potentials or protein concentrations and direct TF binding targets) that are neglected by previous methods. The new method is verified on 67 known human regulatory SNPs, of which 47 (70%) have predicted true TFs ranked in the top 10. Importantly, the performance of BayesPI-BAR, which uses principal component analysis to integrate multiple predictions from various TF chemical potentials, is found to be better than that of existing programs, such as sTRAP and is-rSNP, when evaluated on the same SNPs. BayesPI-BAR is a publicly available tool and is able to carry out parallelized computation, which helps to investigate a large number of TFs or SNPs and to detect disease-associated regulatory sequence variations in the sea of genome-wide noncoding regions. PMID:26202972

  11. Targeted exome sequencing reveals novel USH2A mutations in Chinese patients with simplex Usher syndrome.

    PubMed

    Shu, Hai-Rong; Bi, Huai; Pan, Yang-Chun; Xu, Hang-Yu; Song, Jian-Xin; Hu, Jie

    2015-09-16

    Usher syndrome (USH) is an autosomal recessive disorder characterized by hearing impairment and vision dysfunction due to retinitis pigmentosa. Phenotypic and genetic heterogeneities of this disease make it impractical to obtain a genetic diagnosis by conventional Sanger sequencing. In this study, we applied a next-generation sequencing approach to detect genetic abnormalities in patients with USH. Two unrelated Chinese families were recruited, consisting of two USH afflicted patients and four unaffected relatives. We selected 199 genes related to inherited retinal diseases as targets for deep exome sequencing. Through systematic data analysis using an established bioinformatics pipeline, all variants that passed filter criteria were validated by Sanger sequencing and co-segregation analysis. A homozygous frameshift mutation (c.4382delA, p.T1462Lfs*2) was revealed in exon20 of gene USH2A in the F1 family. Two compound heterozygous mutations, IVS47 + 1G > A and c.13156A > T (p.I4386F), located in intron 48 and exon 63 respectively, of USH2A, were identified as causative mutations for the F2 family. Of note, the missense mutation c.13156A > T has not been reported so far. In conclusion, targeted exome sequencing precisely and rapidly identified the genetic defects in two Chinese USH families and this technique can be applied as a routine examination for these disorders with significant clinical and genetic heterogeneity.

  12. In Silico Detection of Sequence Variations Modifying Transcriptional Regulation

    PubMed Central

    Andersen, Malin C; Engström, Pär G; Lithwick, Stuart; Arenillas, David; Eriksson, Per; Lenhard, Boris; Wasserman, Wyeth W; Odeberg, Jacob

    2008-01-01

    Identification of functional genetic variation associated with increased susceptibility to complex diseases can elucidate genes and underlying biochemical mechanisms linked to disease onset and progression. For genes linked to genetic diseases, most identified causal mutations alter an encoded protein sequence. Technological advances for measuring RNA abundance suggest that a significant number of undiscovered causal mutations may alter the regulation of gene transcription. However, it remains a challenge to separate causal genetic variations from linked neutral variations. Here we present an in silico driven approach to identify possible genetic variation in regulatory sequences. The approach combines phylogenetic footprinting and transcription factor binding site prediction to identify variation in candidate cis-regulatory elements. The bioinformatics approach has been tested on a set of SNPs that are reported to have a regulatory function, as well as background SNPs. In the absence of additional information about an analyzed gene, the poor specificity of binding site prediction is prohibitive to its application. However, when additional data is available that can give guidance on which transcription factor is involved in the regulation of the gene, the in silico binding site prediction improves the selection of candidate regulatory polymorphisms for further analyses. The bioinformatics software generated for the analysis has been implemented as a Web-based application system entitled RAVEN (regulatory analysis of variation in enhancers). The RAVEN system is available at http://www.cisreg.ca for all researchers interested in the detection and characterization of regulatory sequence variation. PMID:18208319

  13. Exome sequencing analysis reveals variants in primary immunodeficiency genes in patients with very early onset inflammatory bowel disease.

    PubMed

    Kelsen, Judith R; Dawany, Noor; Moran, Christopher J; Petersen, Britt-Sabina; Sarmady, Mahdi; Sasson, Ariella; Pauly-Hubbard, Helen; Martinez, Alejandro; Maurer, Kelly; Soong, Joanne; Rappaport, Eric; Franke, Andre; Keller, Andreas; Winter, Harland S; Mamula, Petar; Piccoli, David; Artis, David; Sonnenberg, Gregory F; Daly, Mark; Sullivan, Kathleen E; Baldassano, Robert N; Devoto, Marcella

    2015-11-01

    Very early onset inflammatory bowel disease (VEO-IBD), IBD diagnosed at 5 years of age or younger, frequently presents with a different and more severe phenotype than older-onset IBD. We investigated whether patients with VEO-IBD carry rare or novel variants in genes associated with immunodeficiencies that might contribute to disease development. Patients with VEO-IBD and parents (when available) were recruited from the Children's Hospital of Philadelphia from March 2013 through July 2014. We analyzed DNA from 125 patients with VEO-IBD (age, 3 wk to 4 y) and 19 parents, 4 of whom also had IBD. Exome capture was performed by Agilent SureSelect V4, and sequencing was performed using the Illumina HiSeq platform. Alignment to human genome GRCh37 was achieved followed by postprocessing and variant calling. After functional annotation, candidate variants were analyzed for change in protein function, minor allele frequency less than 0.1%, and scaled combined annotation-dependent depletion scores of 10 or less. We focused on genes associated with primary immunodeficiencies and related pathways. An additional 210 exome samples from patients with pediatric IBD (n = 45) or adult-onset Crohn's disease (n = 20) and healthy individuals (controls, n = 145) were obtained from the University of Kiel, Germany, and used as control groups. Four hundred genes and regions associated with primary immunodeficiency, covering approximately 6500 coding exons totaling more than 1 Mbp of coding sequence, were selected from the whole-exome data. Our analysis showed novel and rare variants within these genes that could contribute to the development of VEO-IBD, including rare heterozygous missense variants in IL10RA and previously unidentified variants in MSH5 and CD19. In an exome sequence analysis of patients with VEO-IBD and their parents, we identified variants in genes that regulate B- and T-cell functions and could contribute to pathogenesis. Our analysis could lead to the identification of previously unidentified IBD-associated variants. Copyright © 2015 AGA Institute. Published by Elsevier Inc. All rights reserved.

  14. Characterization of a novel chicken muscle disorder through differential gene expression and pathway analysis using RNA-sequencing.

    PubMed

    Mutryn, Marie F; Brannick, Erin M; Fu, Weixuan; Lee, William R; Abasht, Behnam

    2015-05-21

    Improvements in poultry production within the past 50 years have led to increased muscle yield and growth rate, which may be contributing to an increased rate and development of new muscle disorders in chickens. Previously reported muscle disorders and conditions are generally associated with poor meat quality traits and have a significant negative economic impact on the poultry industry. Recently, a novel myopathy phenotype has emerged which is characterized by palpably "hard" or tough breast muscle. The objective of this study is to identify the underlying biological mechanisms that contribute to this emerging muscle disorder colloquially referred to as "Wooden Breast", through the use of RNA-sequencing technology. We constructed cDNA libraries from five affected and six unaffected breast muscle samples from a line of commercial broiler chickens. After paired-end sequencing of samples using the Illumina Hiseq platform, we used Tophat to align the resulting sequence reads to the chicken reference genome and then used Cufflinks to find significant changes in gene transcript expression between each group. By comparing our gene list to previously published histology findings on this disorder and using Ingenuity Pathways Analysis (IPA®), we aim to develop a characteristic gene expression profile for this novel disorder through analyzing genes, gene families, and predicted biological pathways. Over 1500 genes were differentially expressed between affected and unaffected birds. There was an average of approximately 98 million reads per sample, across all samples. Results from the IPA analysis suggested "Diseases and Disorders" such as connective tissue disorders, "Molecular and Cellular Functions" such as cellular assembly and organization, cellular function and maintenance, and cellular movement, "Physiological System Development and Function" such as tissue development, and embryonic development, and "Top Canonical Pathways" such as, coagulation system, axonal guidance signaling, and acute phase response signaling, are associated with the Wooden Breast disease. There is convincing evidence by RNA-seq analysis to support localized hypoxia, oxidative stress, increased intracellular calcium, as well as the possible presence of muscle fiber-type switching, as key features of Wooden Breast Disease, which are supported by reported microscopic lesions of the disease.

  15. [Fine mapping of complex disease susceptibility loci].

    PubMed

    Song, Qingfeng; Zhang, Hongxing; Ma, Yilong; Zhou, Gangqiao

    2014-01-01

    Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers have identified more than 3800 susceptibility loci for more than 660 diseases or traits. However, the most significantly associated variants or causative variants in these loci and their biological functions have remained to be clarified. These causative variants can help to elucidate the pathogenesis and discover new biomarkers of complex diseases. One of the main goals in the post-GWAS era is to identify the causative variants and susceptibility genes, and clarify their functional aspects by fine mapping. For common variants, imputation or re-sequencing based strategies were implemented to increase the number of analyzed variants and help to identify the most significantly associated variants. In addition, functional element, expression quantitative trait locus (eQTL) and haplotype analyses were performed to identify functional common variants and susceptibility genes. For rare variants, fine mapping was carried out by re-sequencing, rare haplotype analysis, family-based analysis, burden test, etc.This review summarizes the strategies and problems for fine mapping.

  16. Isolation and characterization of a novel herpesvirus from a free-ranging eastern grey kangaroo (Macropus giganteus).

    PubMed

    Vaz, Paola Karinna; Motha, Julian; McCowan, Christina; Ficorilli, Nino; Whiteley, Pam Lizette; Wilks, Colin Reginald; Hartley, Carol Anne; Gilkerson, James Rudkin; Browning, Glenn Francis; Devlin, Joanne Maree

    2013-01-01

    We isolated a macropodid herpesvirus from a free-ranging eastern grey kangaroo (Macropus giganteous) displaying clinical signs of respiratory disease and possibly neurologic disease. Sequence analysis of the herpesvirus glycoprotein G (gG) and glycoprotein B (gB) genes revealed that the virus was an alphaherpesvirus most closely related to macropodid herpesvirus 2 (MaHV-2) with 82.7% gG and 94.6% gB amino acid sequence identity. Serologic analyses showed similar cross-neutralization patterns to those of MaHV-2. The two viruses had different growth characteristics in cell culture. Most notably, this virus formed significantly larger plaques and extensive syncytia when compared with MaHV-2. No syncytia were observed for MaHV-2. Restriction endonuclease analysis of whole viral genomes demonstrated distinct restriction endonuclease cleavage patterns for all three macropodid herpesviruses. These studies suggest that a distinct macropodid alphaherpesvirus may be capable of infecting and causing disease in eastern grey kangaroos.

  17. Genotyping microarray (gene chip) for the ABCR (ABCA4) gene.

    PubMed

    Jaakson, K; Zernant, J; Külm, M; Hutchinson, A; Tonisson, N; Glavac, D; Ravnik-Glavac, M; Hawlina, M; Meltzer, M R; Caruso, R C; Testa, F; Maugeri, A; Hoyng, C B; Gouras, P; Simonelli, F; Lewis, R A; Lupski, J R; Cremers, F P M; Allikmets, R

    2003-11-01

    Genetic variation in the ABCR (ABCA4) gene has been associated with five distinct retinal phenotypes, including Stargardt disease/fundus flavimaculatus (STGD/FFM), cone-rod dystrophy (CRD), and age-related macular degeneration (AMD). Comparative genetic analyses of ABCR variation and diagnostics have been complicated by substantial allelic heterogeneity and by differences in screening methods. To overcome these limitations, we designed a genotyping microarray (gene chip) for ABCR that includes all approximately 400 disease-associated and other variants currently described, enabling simultaneous detection of all known ABCR variants. The ABCR genotyping microarray (the ABCR400 chip) was constructed by the arrayed primer extension (APEX) technology. Each sequence change in ABCR was included on the chip by synthesis and application of sequence-specific oligonucleotides. We validated the chip by screening 136 confirmed STGD patients and 96 healthy controls, each of whom we had analyzed previously by single strand conformation polymorphism (SSCP) technology and/or heteroduplex analysis. The microarray was >98% effective in determining the existing genetic variation and was comparable to direct sequencing in that it yielded many sequence changes undetected by SSCP. In STGD patient cohorts, the efficiency of the array to detect disease-associated alleles was between 54% and 78%, depending on the ethnic composition and degree of clinical and molecular characterization of a cohort. In addition, chip analysis suggested a high carrier frequency (up to 1:10) of ABCR variants in the general population. The ABCR genotyping microarray is a robust, cost-effective, and comprehensive screening tool for variation in one gene in which mutations are responsible for a substantial fraction of retinal disease. The ABCR chip is a prototype for the next generation of screening and diagnostic tools in ophthalmic genetics, bridging clinical and scientific research. Copyright 2003 Wiley-Liss, Inc.

  18. 'Candidatus Phytoplasma noviguineense', a novel taxon associated with Bogia coconut syndrome and banana wilt disease on the island of New Guinea.

    PubMed

    Miyazaki, Akio; Shigaki, Toshiro; Koinuma, Hiroaki; Iwabuchi, Nozomu; Rauka, Gou Bue; Kembu, Alfred; Saul, Josephine; Watanabe, Kiyoto; Nijo, Takamichi; Maejima, Kensaku; Yamaji, Yasuyuki; Namba, Shigetou

    2018-01-01

    Bogia coconut syndrome (BCS) is one of the lethal yellowing (LY)-type diseases associated with phytoplasma presence that are seriously threatening coconut cultivation worldwide. It has recently emerged, and is rapidly spreading in northern parts of the island of New Guinea. BCS-associated phytoplasmas collected in different regions were compared in terms of 16S rRNA gene sequences, revealing high identity among them represented by strain BCS-Bo R . Comparative analysis of the 16S rRNA gene sequences revealed that BCS-Bo R shared less than a 97.5 % similarity with other species of 'Candidatus Phytoplasma', with a maximum value of 96.08 % (with strain LY; GenBank accession no. U18747). This result indicates the necessity and propriety of a novel taxon for BCS phytoplasmas according to the recommendations of the IRPCM. Phylogenetic analysis was also conducted on 16S rRNA gene sequences, resulting in a monophyletic cluster composed of BCS-Bo R and other LY-associated phytoplasmas. Other phytoplasmas on the island of New Guinea associated with banana wilt and arecanut yellow leaf diseases showed high similarities to BCS-Bo R and were closely related to BCS phytoplasmas. Based on the uniqueness of their 16S rRNA gene sequences, a novel taxon 'Ca.Phytoplasma noviguineense' is proposed for these phytoplasmas found on the island of New Guinea, with strain BCS-Bo R (GenBank accession no. LC228755) as the reference strain. The novel taxon is described in detail, including information on the symptoms of associated diseases and additional genetic features of the secY gene and rp operon.

  19. Diverse Array of New Viral Sequences Identified in Worldwide Populations of the Asian Citrus Psyllid (Diaphorina citri) Using Viral Metagenomics

    PubMed Central

    Nouri, Shahideh; Salem, Nidá; Nigg, Jared C.

    2015-01-01

    ABSTRACT The Asian citrus psyllid, Diaphorina citri, is the natural vector of the causal agent of Huanglongbing (HLB), or citrus greening disease. Together; HLB and D. citri represent a major threat to world citrus production. As there is no cure for HLB, insect vector management is considered one strategy to help control the disease, and D. citri viruses might be useful. In this study, we used a metagenomic approach to analyze viral sequences associated with the global population of D. citri. By sequencing small RNAs and the transcriptome coupled with bioinformatics analysis, we showed that the virus-like sequences of D. citri are diverse. We identified novel viral sequences belonging to the picornavirus superfamily, the Reoviridae, Parvoviridae, and Bunyaviridae families, and an unclassified positive-sense single-stranded RNA virus. Moreover, a Wolbachia prophage-related sequence was identified. This is the first comprehensive survey to assess the viral community from worldwide populations of an agricultural insect pest. Our results provide valuable information on new putative viruses, some of which may have the potential to be used as biocontrol agents. IMPORTANCE Insects have the most species of all animals, and are hosts to, and vectors of, a great variety of known and unknown viruses. Some of these most likely have the potential to be important fundamental and/or practical resources. In this study, we used high-throughput next-generation sequencing (NGS) technology and bioinformatics analysis to identify putative viruses associated with Diaphorina citri, the Asian citrus psyllid. D. citri is the vector of the bacterium causing Huanglongbing (HLB), currently the most serious threat to citrus worldwide. Here, we report several novel viral sequences associated with D. citri. PMID:26676774

  20. Diverse Array of New Viral Sequences Identified in Worldwide Populations of the Asian Citrus Psyllid (Diaphorina citri) Using Viral Metagenomics.

    PubMed

    Nouri, Shahideh; Salem, Nidá; Nigg, Jared C; Falk, Bryce W

    2015-12-16

    The Asian citrus psyllid, Diaphorina citri, is the natural vector of the causal agent of Huanglongbing (HLB), or citrus greening disease. Together; HLB and D. citri represent a major threat to world citrus production. As there is no cure for HLB, insect vector management is considered one strategy to help control the disease, and D. citri viruses might be useful. In this study, we used a metagenomic approach to analyze viral sequences associated with the global population of D. citri. By sequencing small RNAs and the transcriptome coupled with bioinformatics analysis, we showed that the virus-like sequences of D. citri are diverse. We identified novel viral sequences belonging to the picornavirus superfamily, the Reoviridae, Parvoviridae, and Bunyaviridae families, and an unclassified positive-sense single-stranded RNA virus. Moreover, a Wolbachia prophage-related sequence was identified. This is the first comprehensive survey to assess the viral community from worldwide populations of an agricultural insect pest. Our results provide valuable information on new putative viruses, some of which may have the potential to be used as biocontrol agents. Insects have the most species of all animals, and are hosts to, and vectors of, a great variety of known and unknown viruses. Some of these most likely have the potential to be important fundamental and/or practical resources. In this study, we used high-throughput next-generation sequencing (NGS) technology and bioinformatics analysis to identify putative viruses associated with Diaphorina citri, the Asian citrus psyllid. D. citri is the vector of the bacterium causing Huanglongbing (HLB), currently the most serious threat to citrus worldwide. Here, we report several novel viral sequences associated with D. citri. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  1. Assessment of clinical analytical sensitivity and specificity of next-generation sequencing for detection of simple and complex mutations.

    PubMed

    Chin, Ephrem L H; da Silva, Cristina; Hegde, Madhuri

    2013-02-19

    Detecting mutations in disease genes by full gene sequence analysis is common in clinical diagnostic laboratories. Sanger dideoxy terminator sequencing allows for rapid development and implementation of sequencing assays in the clinical laboratory, but it has limited throughput, and due to cost constraints, only allows analysis of one or at most a few genes in a patient. Next-generation sequencing (NGS), on the other hand, has evolved rapidly, although to date it has mainly been used for large-scale genome sequencing projects and is beginning to be used in the clinical diagnostic testing. One advantage of NGS is that many genes can be analyzed easily at the same time, allowing for mutation detection when there are many possible causative genes for a specific phenotype. In addition, regions of a gene typically not tested for mutations, like deep intronic and promoter mutations, can also be detected. Here we use 20 previously characterized Sanger-sequenced positive controls in disease-causing genes to demonstrate the utility of NGS in a clinical setting using standard PCR based amplification to assess the analytical sensitivity and specificity of the technology for detecting all previously characterized changes (mutations and benign SNPs). The positive controls chosen for validation range from simple substitution mutations to complex deletion and insertion mutations occurring in autosomal dominant and recessive disorders. The NGS data was 100% concordant with the Sanger sequencing data identifying all 119 previously identified changes in the 20 samples. We have demonstrated that NGS technology is ready to be deployed in clinical laboratories. However, NGS and associated technologies are evolving, and clinical laboratories will need to invest significantly in staff and infrastructure to build the necessary foundation for success.

  2. Detection of novel mutations that cause autosomal dominant retinitis pigmentosa in candidate genes by long-range PCR amplification and next-generation sequencing

    PubMed Central

    Dias, Miguel de Sousa; Hernan, Imma; Pascual, Beatriz; Borràs, Emma; Mañé, Begoña; Gamundi, Maria José

    2013-01-01

    Purpose To devise an effective method for detecting mutations in 12 genes (CA4, CRX, IMPDH1, NR2E3, RP9, PRPF3, PRPF8, PRPF31, PRPH2, RHO, RP1, and TOPORS) commonly associated with autosomal dominant retinitis pigmentosa (adRP) that account for more than 95% of known mutations. Methods We used long-range PCR (LR-PCR) amplification and next-generation sequencing (NGS) performed in a GS Junior 454 benchtop sequencing platform. Twenty LR-PCR fragments, between 3,000 and 10,000 bp, containing all coding exons and flanking regions of the 12 genes, were obtained from DNA samples of patients with adRP. Sequencing libraries were prepared with an enzymatic (Fragmentase technology) method. Results Complete coverage of the coding and flanking sequences of the 12 genes assayed was obtained with NGS, with an average sequence depth of 380× (ranging from 128× to 1,077×). Five previous known mutations in the adRP genes were detected with a sequence variation percentage between 35% and 65%. We also performed a parallel sequence analysis of four samples, three of them new patients with index adRP, in which two novel mutations were detected in RHO (p.Asn73del) and PRPF31 (p.Ile109del). Conclusions The results demonstrate that genomic LR-PCR amplification together with NGS is an effective method for analyzing individual patient samples for mutations in a monogenic heterogeneous disease such as adRP. This approach proved effective for the parallel analysis of adRP and has been introduced as routine. Additionally, this approach could be extended to other heterogeneous genetic diseases. PMID:23559859

  3. [Analysis of MAT1A gene mutations in a child affected with simple hypermethioninemia].

    PubMed

    Sun, Yun; Ma, Dingyuan; Wang, Yanyun; Yang, Bin; Jiang, Tao

    2017-02-10

    To detect potential mutations of MAT1A gene in a child suspected with simple hypermethioninemia by MS/MS neonatal screening. Clinical data of the child was collected. Genomic DNA was extracted by a standard method and subjected to targeted sequencing using an Ion Ampliseq TM Inherited Disease Panel. Detected mutations were verified by Sanger sequencing. The child showed no clinical features except evaluated methionine. A novel compound mutation of the MAT1A gene, i.e., c.345delA and c.529C>T, was identified in the child. His father and mother were found to be heterozygous for the c.345delA mutation and c.529C>T mutation, respectively. The compound mutation c.345delA and c.529C>T of the MAT1A gene probably underlie the disease in the child. The semi-conductor sequencing has provided an important means for the diagnosis of hereditary diseases.

  4. Text mining facilitates database curation - extraction of mutation-disease associations from Bio-medical literature.

    PubMed

    Ravikumar, Komandur Elayavilli; Wagholikar, Kavishwar B; Li, Dingcheng; Kocher, Jean-Pierre; Liu, Hongfang

    2015-06-06

    Advances in the next generation sequencing technology has accelerated the pace of individualized medicine (IM), which aims to incorporate genetic/genomic information into medicine. One immediate need in interpreting sequencing data is the assembly of information about genetic variants and their corresponding associations with other entities (e.g., diseases or medications). Even with dedicated effort to capture such information in biological databases, much of this information remains 'locked' in the unstructured text of biomedical publications. There is a substantial lag between the publication and the subsequent abstraction of such information into databases. Multiple text mining systems have been developed, but most of them focus on the sentence level association extraction with performance evaluation based on gold standard text annotations specifically prepared for text mining systems. We developed and evaluated a text mining system, MutD, which extracts protein mutation-disease associations from MEDLINE abstracts by incorporating discourse level analysis, using a benchmark data set extracted from curated database records. MutD achieves an F-measure of 64.3% for reconstructing protein mutation disease associations in curated database records. Discourse level analysis component of MutD contributed to a gain of more than 10% in F-measure when compared against the sentence level association extraction. Our error analysis indicates that 23 of the 64 precision errors are true associations that were not captured by database curators and 68 of the 113 recall errors are caused by the absence of associated disease entities in the abstract. After adjusting for the defects in the curated database, the revised F-measure of MutD in association detection reaches 81.5%. Our quantitative analysis reveals that MutD can effectively extract protein mutation disease associations when benchmarking based on curated database records. The analysis also demonstrates that incorporating discourse level analysis significantly improved the performance of extracting the protein-mutation-disease association. Future work includes the extension of MutD for full text articles.

  5. New splicing mutation in the choline kinase beta (CHKB) gene causing a muscular dystrophy detected by whole-exome sequencing.

    PubMed

    Oliveira, Jorge; Negrão, Luís; Fineza, Isabel; Taipa, Ricardo; Melo-Pires, Manuel; Fortuna, Ana Maria; Gonçalves, Ana Rita; Froufe, Hugo; Egas, Conceição; Santos, Rosário; Sousa, Mário

    2015-06-01

    Muscular dystrophies (MDs) are a group of hereditary muscle disorders that include two particularly heterogeneous subgroups: limb-girdle MD and congenital MD, linked to 52 different genes (seven common to both subgroups). Massive parallel sequencing technology may avoid the usual stepwise gene-by-gene analysis. We report the whole-exome sequencing (WES) analysis of a patient with childhood-onset progressive MD, also presenting mental retardation and dilated cardiomyopathy. Conventional sequencing had excluded eight candidate genes. WES of the trio (patient and parents) was performed using the ion proton sequencing system. Data analysis resorted to filtering steps using the GEMINI software revealed a novel silent variant in the choline kinase beta (CHKB) gene. Inspection of sequence alignments ultimately identified the causal variant (CHKB:c.1031+3G>C). This splice site mutation was confirmed using Sanger sequencing and its effect was further evaluated with gene expression analysis. On reassessment of the muscle biopsy, typical abnormal mitochondrial oxidative changes were observed. Mutations in CHKB have been shown to cause phosphatidylcholine deficiency in myofibers, causing a rare form of CMD (only 21 patients reported). Notwithstanding interpretative difficulties that need to be overcome before the integration of WES in the diagnostic workflow, this work corroborates its utility in solving cases from highly heterogeneous groups of diseases, in which conventional diagnostic approaches fail to provide a definitive diagnosis.

  6. Epidemiological survey of idiopathic scoliosis and sequence alignment analysis of multiple candidate genes.

    PubMed

    Yang, Tao; Jia, Quanzhang; Guo, Hong; Xu, Jianzhong; Bai, Yun; Yang, Kai; Luo, Fei; Zhang, Zehua; Hou, Tianyong

    2012-06-01

    To investigate the effects of genetic factors on idiopathic scoliosis (IS) and genetic modes through genetic epidemiological survey on IS in Chongqing City, China, and to determine whether SH3GL1, GADD45B, and FGF22 in the chromosome 19p13.3 are the pathogenic genes of IS through genetic sequence analysis. 214 nuclear families were investigated to analyse the age incidence, familial aggregation, and heritability. SH3GL1, GADD45B, and FGF22 were chosen as candidate genes for mutation screening in 56 IS patients of 214 families. The sequence alignment analysis was performed to determine mutations and predict the protein structure. The average age of onset of 10.8 years suggests that IS is a early onset disease. Incidences of IS in first-, second-, third-degree relatives and the overall incidence in families (5.68%) were also significantly higher than that of the general population (1.04%). The U test indicated a significant difference, suggesting that IS has a familial aggregation. The heritability of first-degree relatives (77.68 ±10.39%), second-degree relatives (69.89 ±3.14%), and third-degree relatives (62.14 ±11.92%) illustrated that genetic factors play an important role in IS pathogenesis. The incidence of first-degree relatives (10.01%), second-degree relatives (2.55%) and third-degree relatives (1.76%) illustrated that IS is not in simple accord with monogenic Mendel's law but manifests as traits of multifactorial hereditary diseases. Sequence alignment of exons of SH3GL1, GADD45B, and FGF22 showed 17 base mutations, of which 16 mutations do not induce open reading frame (ORF) shift or amino acid changes whereas one mutation (C→T)occurred in SH3GL1 results in formation of the termination codon, which induces variation of protein reading frame. Prediction analysis of protein sequence showed that the SH3GL1 mutant encoded a truncated protein, thus affecting the protein structure. IS is a multifactorial genetic disease and SH3GL1 may be one of the pathogenic genes for IS.

  7. dbWGFP: a database and web server of human whole-genome single nucleotide variants and their functional predictions.

    PubMed

    Wu, Jiaxin; Wu, Mengmeng; Li, Lianshuo; Liu, Zhuo; Zeng, Wanwen; Jiang, Rui

    2016-01-01

    The recent advancement of the next generation sequencing technology has enabled the fast and low-cost detection of all genetic variants spreading across the entire human genome, making the application of whole-genome sequencing a tendency in the study of disease-causing genetic variants. Nevertheless, there still lacks a repository that collects predictions of functionally damaging effects of human genetic variants, though it has been well recognized that such predictions play a central role in the analysis of whole-genome sequencing data. To fill this gap, we developed a database named dbWGFP (a database and web server of human whole-genome single nucleotide variants and their functional predictions) that contains functional predictions and annotations of nearly 8.58 billion possible human whole-genome single nucleotide variants. Specifically, this database integrates 48 functional predictions calculated by 17 popular computational methods and 44 valuable annotations obtained from various data sources. Standalone software, user-friendly query services and free downloads of this database are available at http://bioinfo.au.tsinghua.edu.cn/dbwgfp. dbWGFP provides a valuable resource for the analysis of whole-genome sequencing, exome sequencing and SNP array data, thereby complementing existing data sources and computational resources in deciphering genetic bases of human inherited diseases. © The Author(s) 2016. Published by Oxford University Press.

  8. Exome capture sequencing identifies a novel mutation in BBS4

    PubMed Central

    Wang, Hui; Chen, Xianfeng; Dudinsky, Lynn; Patenia, Claire; Chen, Yiyun; Li, Yumei; Wei, Yue; Abboud, Emad B.; Al-Rajhi, Ali A.; Lewis, Richard Alan; Lupski, James R.; Mardon, Graeme; Gibbs, Richard A.; Perkins, Brian D.

    2011-01-01

    Purpose Leber congenital amaurosis (LCA) is one of the most severe eye dystrophies characterized by severe vision loss at an early stage and accounts for approximately 5% of all retinal dystrophies. The purpose of this study was to identify a novel LCA disease allele or gene and to develop an approach combining genetic mapping with whole exome sequencing. Methods Three patients from King Khaled Eye Specialist Hospital (KKESH205) underwent whole genome single nucleotide polymorphism genotyping, and a single candidate region was identified. Taking advantage of next-generation high-throughput DNA sequencing technologies, whole exome capture sequencing was performed on patient KKESH205#7. Sanger direct sequencing was used during the validation step. The zebrafish model was used to examine the function of the mutant allele. Results A novel missense mutation in Bardet-Biedl syndrome 4 protein (BBS4) was identified in a consanguineous family from Saudi Arabia. This missense mutation in the fifth exon (c.253G>C;p.E85Q) of BBS4 is likely a disease-causing mutation as it segregates with the disease. The mutation is not found in the single nucleotide polymorphism (SNP) database, the 1000 Genomes Project, or matching normal controls. Functional analysis of this mutation in zebrafish indicates that the G253C allele is pathogenic. Coinjection of the G253C allele cannot rescue the mislocalization of rhodopsin in the retina when BBS4 is knocked down by morpholino injection. Immunofluorescence analysis in cell culture shows that this missense mutation in BBS4 does not cause obvious defects in protein expression or pericentriolar localization. Conclusions This mutation likely mainly reduces or abolishes BBS4 function in the retina. Further studies of this allele will provide important insights concerning the pleiotropic nature of BBS4 function. PMID:22219648

  9. India Allele Finder: a web-based annotation tool for identifying common alleles in next-generation sequencing data of Indian origin.

    PubMed

    Zhang, Jimmy F; James, Francis; Shukla, Anju; Girisha, Katta M; Paciorkowski, Alex R

    2017-06-27

    We built India Allele Finder, an online searchable database and command line tool, that gives researchers access to variant frequencies of Indian Telugu individuals, using publicly available fastq data from the 1000 Genomes Project. Access to appropriate population-based genomic variant annotation can accelerate the interpretation of genomic sequencing data. In particular, exome analysis of individuals of Indian descent will identify population variants not reflected in European exomes, complicating genomic analysis for such individuals. India Allele Finder offers improved ease-of-use to investigators seeking to identify and annotate sequencing data from Indian populations. We describe the use of India Allele Finder to identify common population variants in a disease quartet whole exome dataset, reducing the number of candidate single nucleotide variants from 84 to 7. India Allele Finder is freely available to investigators to annotate genomic sequencing data from Indian populations. Use of India Allele Finder allows efficient identification of population variants in genomic sequencing data, and is an example of a population-specific annotation tool that simplifies analysis and encourages international collaboration in genomics research.

  10. Parkin Somatic Mutations Link Melanoma and Parkinson's Disease.

    PubMed

    Levin, Lotan; Srour, Shani; Gartner, Jared; Kapitansky, Oxana; Qutob, Nouar; Dror, Shani; Golan, Tamar; Dayan, Roy; Brener, Ronen; Ziv, Tamar; Khaled, Mehdi; Schueler-Furman, Ora; Samuels, Yardena; Levy, Carmit

    2016-06-20

    Epidemiological studies suggest a direct link between melanoma and Parkinson's disease (PD); however, the underlying molecular basis is unknown. Since mutations in Parkin are the major driver of early-onset PD and Parkin was recently reported to play a role in cancer development, we hypothesized that Parkin links melanoma and PD. By analyzing whole exome/genome sequencing of Parkin from 246 melanoma patients, we identified five non-synonymous mutations, three synonymous mutations, and one splice region variant in Parkin in 3.6% of the samples. In vitro analysis showed that wild-type Parkin plays a tumor suppressive role in melanoma development resulting in cell-cycle arrest, reduction of metabolic activity, and apoptosis. Using a mass spectrometry-based analysis, we identified potential Parkin substrates in melanoma and generated a functional protein association network. The activity of mutated Parkin was assessed by protein structure modeling and examination of Parkin E3 ligase activity. The Parkin-E28K mutation impairs Parkin ubiquitination activity and abolishes its tumor suppressive effect. Taken together, our analysis of genomic sequence and in vitro data indicate that Parkin is a potential link between melanoma and Parkinson's disease. Our findings suggest new approaches for early diagnosis and treatment against both diseases. Copyright © 2016. Published by Elsevier Ltd.

  11. Implication of common and disease specific variants in CLU, CR1, and PICALM.

    PubMed

    Ferrari, Raffaele; Moreno, Jorge H; Minhajuddin, Abu T; O'Bryant, Sid E; Reisch, Joan S; Barber, Robert C; Momeni, Parastoo

    2012-08-01

    Two recent genome-wide association studies (GWAS) for late onset Alzheimer's disease (LOAD) revealed 3 new genes: clusterin (CLU), phosphatidylinositol binding clathrin assembly protein (PICALM), and complement receptor 1 (CR1). In order to evaluate association with these genome-wide association study-identified genes and to isolate the variants contributing to the pathogenesis of LOAD, we genotyped the top single nucleotide polymorphisms (SNPs), rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), and sequenced the entire coding regions of these genes in our cohort of 342 LOAD patients and 277 control subjects. We confirmed the association of rs3851179 (PICALM) (p = 7.4 × 10(-3)) with the disease status. Through sequencing we identified 18 variants in CLU, 3 of which were found exclusively in patients; 8 variants (out of 65) in CR1 gene were only found in patients and the 16 variants identified in PICALM gene were present in both patients and controls. In silico analysis of the variants in PICALM did not predict any damaging effect on the protein. The haplotype analysis of the variants in each gene predicted a common haplotype when the 3 single nucleotide polymorphisms rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), respectively, were included. For each gene the haplotype structure and size differed between patients and controls. In conclusion, we confirmed association of CLU, CR1, and PICALM genes with the disease status in our cohort through identification of a number of disease-specific variants among patients through the sequencing of the coding region of these genes. Published by Elsevier Inc.

  12. Transcriptome Analysis of Orbital Adipose Tissue in Active Thyroid Eye Disease Using Next Generation RNA Sequencing Technology

    PubMed Central

    Lee, Bradford W.; Kumar, Virender B.; Biswas, Pooja; Ko, Audrey C.; Alameddine, Ramzi M.; Granet, David B.; Ayyagari, Radha; Kikkawa, Don O.; Korn, Bobby S.

    2018-01-01

    Objective: This study utilized Next Generation Sequencing (NGS) to identify differentially expressed transcripts in orbital adipose tissue from patients with active Thyroid Eye Disease (TED) versus healthy controls. Method: This prospective, case-control study enrolled three patients with severe, active thyroid eye disease undergoing orbital decompression, and three healthy controls undergoing routine eyelid surgery with removal of orbital fat. RNA Sequencing (RNA-Seq) was performed on freshly obtained orbital adipose tissue from study patients to analyze the transcriptome. Bioinformatics analysis was performed to determine pathways and processes enriched for the differential expression profile. Quantitative Reverse Transcriptase-Polymerase Chain Reaction (qRT-PCR) was performed to validate the differential expression of selected genes identified by RNA-Seq. Results: RNA-Seq identified 328 differentially expressed genes associated with active thyroid eye disease, many of which were responsible for mediating inflammation, cytokine signaling, adipogenesis, IGF-1 signaling, and glycosaminoglycan binding. The IL-5 and chemokine signaling pathways were highly enriched, and very-low-density-lipoprotein receptor activity and statin medications were implicated as having a potential role in TED. Conclusion: This study is the first to use RNA-Seq technology to elucidate differential gene expression associated with active, severe TED. This study suggests a transcriptional basis for the role of statins in modulating differentially expressed genes that mediate the pathogenesis of thyroid eye disease. Furthermore, the identification of genes with altered levels of expression in active, severe TED may inform the molecular pathways central to this clinical phenotype and guide the development of novel therapeutic agents. PMID:29760827

  13. Association analysis of bacterial leaf spot resistance and SNP markers derived from expressed sequence tags (ESTs) in lettuce (Lactuca sativa L.)

    USDA-ARS?s Scientific Manuscript database

    Bacterial leaf spot of lettuce, caused by Xanthomonas campestris pv. vitians, is a devastating disease of lettuce worldwide. Since there are no chemicals available for effective control of the disease, host-plant resistance is highly desirable to protect lettuce production. A total of 179 lettuce ge...

  14. Simple, quick and cost-efficient: A universal RT-PCR and sequencing strategy for genomic characterisation of foot-and-mouth disease viruses.

    PubMed

    Dill, V; Beer, M; Hoffmann, B

    2017-08-01

    Foot-and-mouth disease (FMD) is a major contributor to poverty and food insecurity in Africa and Asia, and it is one of the biggest threats to agriculture in highly developed countries. As FMD is extremely contagious, strategies for its prevention, early detection, and the immediate characterisation of outbreak strains are of great importance. The generation of whole-genome sequences enables phylogenetic characterisation, the epidemiological tracing of virus transmission pathways and is supportive in disease control strategies. This study describes the development and validation of a rapid, universal and cost-efficient RT-PCR system to generate genome sequences of FMDV, reaching from the IRES to the end of the open reading frame. The method was evaluated using twelve different virus strains covering all seven serotypes of FMDV. Additionally, samples from experimentally infected animals were tested to mimic diagnostic field samples. All primer pairs showed a robust amplification with a high sensitivity for all serotypes. In summary, the described assay is suitable for the generation of FMDV sequences from all serotypes to allow immediate phylogenetic analysis, detailed genotyping and molecular epidemiology. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. The clinical application of genome-wide sequencing for monogenic diseases in Canada: Position Statement of the Canadian College of Medical Geneticists

    PubMed Central

    Boycott, Kym; Hartley, Taila; Adam, Shelin; Bernier, Francois; Chong, Karen; Fernandez, Bridget A; Friedman, Jan M; Geraghty, Michael T; Hume, Stacey; Knoppers, Bartha M; Laberge, Anne-Marie; Majewski, Jacek; Mendoza-Londono, Roberto; Meyn, M Stephen; Michaud, Jacques L; Nelson, Tanya N; Richer, Julie; Sadikovic, Bekim; Skidmore, David L; Stockley, Tracy; Taylor, Sherry; van Karnebeek, Clara; Zawati, Ma'n H; Lauzon, Julie; Armour, Christine M

    2015-01-01

    Purpose and scope The aim of this Position Statement is to provide recommendations for Canadian medical geneticists, clinical laboratory geneticists, genetic counsellors and other physicians regarding the use of genome-wide sequencing of germline DNA in the context of clinical genetic diagnosis. This statement has been developed to facilitate the clinical translation and development of best practices for clinical genome-wide sequencing for genetic diagnosis of monogenic diseases in Canada; it does not address the clinical application of this technology in other fields such as molecular investigation of cancer or for population screening of healthy individuals. Methods of statement development Two multidisciplinary groups consisting of medical geneticists, clinical laboratory geneticists, genetic counsellors, ethicists, lawyers and genetic researchers were assembled to review existing literature and guidelines on genome-wide sequencing for clinical genetic diagnosis in the context of monogenic diseases, and to make recommendations relevant to the Canadian context. The statement was circulated for comment to the Canadian College of Medical Geneticists (CCMG) membership-at-large and, following incorporation of feedback, approved by the CCMG Board of Directors. The CCMG is a Canadian organisation responsible for certifying medical geneticists and clinical laboratory geneticists, and for establishing professional and ethical standards for clinical genetics services in Canada. Results and conclusions Recommendations include (1) clinical genome-wide sequencing is an appropriate approach in the diagnostic assessment of a patient for whom there is suspicion of a significant monogenic disease that is associated with a high degree of genetic heterogeneity, or where specific genetic tests have failed to provide a diagnosis; (2) until the benefits of reporting incidental findings are established, we do not endorse the intentional clinical analysis of disease-associated genes other than those linked to the primary indication; and (3) clinicians should provide genetic counselling and obtain informed consent prior to undertaking clinical genome-wide sequencing. Counselling should include discussion of the limitations of testing, likelihood and implications of diagnosis and incidental findings, and the potential need for further analysis to facilitate clinical interpretation, including studies performed in a research setting. These recommendations will be routinely re-evaluated as knowledge of diagnostic and clinical utility of clinical genome-wide sequencing improves. While the document was developed to direct practice in Canada, the applicability of the statement is broader and will be of interest to clinicians and health jurisdictions internationally. PMID:25951830

  16. The clinical application of genome-wide sequencing for monogenic diseases in Canada: Position Statement of the Canadian College of Medical Geneticists.

    PubMed

    Boycott, Kym; Hartley, Taila; Adam, Shelin; Bernier, Francois; Chong, Karen; Fernandez, Bridget A; Friedman, Jan M; Geraghty, Michael T; Hume, Stacey; Knoppers, Bartha M; Laberge, Anne-Marie; Majewski, Jacek; Mendoza-Londono, Roberto; Meyn, M Stephen; Michaud, Jacques L; Nelson, Tanya N; Richer, Julie; Sadikovic, Bekim; Skidmore, David L; Stockley, Tracy; Taylor, Sherry; van Karnebeek, Clara; Zawati, Ma'n H; Lauzon, Julie; Armour, Christine M

    2015-07-01

    The aim of this Position Statement is to provide recommendations for Canadian medical geneticists, clinical laboratory geneticists, genetic counsellors and other physicians regarding the use of genome-wide sequencing of germline DNA in the context of clinical genetic diagnosis. This statement has been developed to facilitate the clinical translation and development of best practices for clinical genome-wide sequencing for genetic diagnosis of monogenic diseases in Canada; it does not address the clinical application of this technology in other fields such as molecular investigation of cancer or for population screening of healthy individuals. Two multidisciplinary groups consisting of medical geneticists, clinical laboratory geneticists, genetic counsellors, ethicists, lawyers and genetic researchers were assembled to review existing literature and guidelines on genome-wide sequencing for clinical genetic diagnosis in the context of monogenic diseases, and to make recommendations relevant to the Canadian context. The statement was circulated for comment to the Canadian College of Medical Geneticists (CCMG) membership-at-large and, following incorporation of feedback, approved by the CCMG Board of Directors. The CCMG is a Canadian organisation responsible for certifying medical geneticists and clinical laboratory geneticists, and for establishing professional and ethical standards for clinical genetics services in Canada. Recommendations include (1) clinical genome-wide sequencing is an appropriate approach in the diagnostic assessment of a patient for whom there is suspicion of a significant monogenic disease that is associated with a high degree of genetic heterogeneity, or where specific genetic tests have failed to provide a diagnosis; (2) until the benefits of reporting incidental findings are established, we do not endorse the intentional clinical analysis of disease-associated genes other than those linked to the primary indication; and (3) clinicians should provide genetic counselling and obtain informed consent prior to undertaking clinical genome-wide sequencing. Counselling should include discussion of the limitations of testing, likelihood and implications of diagnosis and incidental findings, and the potential need for further analysis to facilitate clinical interpretation, including studies performed in a research setting. These recommendations will be routinely re-evaluated as knowledge of diagnostic and clinical utility of clinical genome-wide sequencing improves. While the document was developed to direct practice in Canada, the applicability of the statement is broader and will be of interest to clinicians and health jurisdictions internationally. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  17. Evolutionary Influenced Interaction Pattern as Indicator for the Investigation of Natural Variants Causing Nephrogenic Diabetes Insipidus

    PubMed Central

    Labudde, Dirk

    2015-01-01

    The importance of short membrane sequence motifs has been shown in many works and emphasizes the related sequence motif analysis. Together with specific transmembrane helix-helix interactions, the analysis of interacting sequence parts is helpful for understanding the process during membrane protein folding and in retaining the three-dimensional fold. Here we present a simple high-throughput analysis method for deriving mutational information of interacting sequence parts. Applied on aquaporin water channel proteins, our approach supports the analysis of mutational variants within different interacting subsequences and finally the investigation of natural variants which cause diseases like, for example, nephrogenic diabetes insipidus. In this work we demonstrate a simple method for massive membrane protein data analysis. As shown, the presented in silico analyses provide information about interacting sequence parts which are constrained by protein evolution. We present a simple graphical visualization medium for the representation of evolutionary influenced interaction pattern pairs (EIPPs) adapted to mutagen investigations of aquaporin-2, a protein whose mutants are involved in the rare endocrine disorder known as nephrogenic diabetes insipidus, and membrane proteins in general. Furthermore, we present a new method to derive new evolutionary variations within EIPPs which can be used for further mutagen laboratory investigations. PMID:26180540

  18. Evolutionary Influenced Interaction Pattern as Indicator for the Investigation of Natural Variants Causing Nephrogenic Diabetes Insipidus.

    PubMed

    Grunert, Steffen; Labudde, Dirk

    2015-01-01

    The importance of short membrane sequence motifs has been shown in many works and emphasizes the related sequence motif analysis. Together with specific transmembrane helix-helix interactions, the analysis of interacting sequence parts is helpful for understanding the process during membrane protein folding and in retaining the three-dimensional fold. Here we present a simple high-throughput analysis method for deriving mutational information of interacting sequence parts. Applied on aquaporin water channel proteins, our approach supports the analysis of mutational variants within different interacting subsequences and finally the investigation of natural variants which cause diseases like, for example, nephrogenic diabetes insipidus. In this work we demonstrate a simple method for massive membrane protein data analysis. As shown, the presented in silico analyses provide information about interacting sequence parts which are constrained by protein evolution. We present a simple graphical visualization medium for the representation of evolutionary influenced interaction pattern pairs (EIPPs) adapted to mutagen investigations of aquaporin-2, a protein whose mutants are involved in the rare endocrine disorder known as nephrogenic diabetes insipidus, and membrane proteins in general. Furthermore, we present a new method to derive new evolutionary variations within EIPPs which can be used for further mutagen laboratory investigations.

  19. CDC Vital Signs: Recipe for Food Safety

    MedlinePlus

    ... KB] Building public health capacity for advanced genome sequencing and analysis, which will make it possible to ... Content source: National Center for Emerging and Zoonotic Infectious Diseases Page maintained by: Office of the Associate ...

  20. Prolonged and mixed non-O157 Escherichia coli infection in an Australian household.

    PubMed

    Staples, M; Graham, R M A; Doyle, C J; Smith, H V; Jennison, A V

    2012-05-01

    An Australian family was identified through a Public Health follow up on a Shiga-toxigenic Escherichia coli (STEC) positive bloody diarrhoea case, with three of the four family members experiencing either symptomatic or asymptomatic STEC shedding. Bacterial isolates were submitted to stx sequence sub-typing, multi-locus variable number tandem repeat analysis (MLVA), multi-locus sequence typing (MLST) and binary typing. The analysis revealed that there were multiple strains of STEC being shed by the family members, with similar virulence gene profiles and the same serogroup but differing in their MLVA and MLST profiles. This study illustrates the potentially complicated nature of non-O157 STEC infections and the importance of molecular epidemiology in understanding disease clusters. © 2012 QUEENSLAND HEALTH. Clinical Microbiology and Infection © 2012 European Society of Clinical Microbiology and Infectious Diseases.

  1. Lack of detection of a putative retrovirus associated with haemic neoplasia in the soft shell clam Mya arenaria.

    PubMed

    AboElkhair, M; Iwamoto, T; Clark, K F; McKenna, P; Siah, A; Greenwood, S J; Berthe, F C J; Casey, J W; Cepica, A

    2012-01-01

    Haemic neoplasia (HN) is a leukemia-like disease that affects at least 20 species of marine bivalves including soft shell clam, Mya arenaria. Since the disease was discovered in 1969, the etiology remains unknown. A retroviral etiology has been suggested based on the detection of reverse transcriptase activity and electron microscopic observation of retroviral-like particles using negative staining. To date, however no virus isolate and no retroviral sequence from HN has been obtained. Moreover, transmission of the disease by cell-free filtrate from affected clams has not been reproduced. In the current study, we reinvestigated the association of HN with a putative retrovirus. Sucrose gradient centrifugation followed by assessment of reverse transcriptase activity, electrophoretic analysis of protein and RNA, and electron microscopic examinations of fractions corresponding to retroviral density were employed. Detection of retroviral pol sequences using degenerate RT-PCR approaches was also attempted. Our results showed visible bands at the expected density of retrovirus in HN-positive and HN-negative clam tissues and both with reverse transcriptase activity. Electron microscopy, RNA analysis, protein analysis, and PCR systems targeting the pol gene of retroviruses did not however provide clear evidence supporting presence of a retrovirus. We point out that the retrovirus etiology of HN of Mya arenaria proposed some 25 years ago should be reconsidered in the absence of a virus isolate or virus sequences. Copyright © 2011 Elsevier Inc. All rights reserved.

  2. Construction of high-quality recombination maps with low-coverage genomic sequencing for joint linkage analysis in maize

    USDA-ARS?s Scientific Manuscript database

    A genome-wide association study (GWAS) is the foremost strategy used for finding genes that control human diseases and agriculturally important traits, but it often reports false positives. In contrast, its complementary method, linkage analysis, provides direct genetic confirmation, but with limite...

  3. PolyPhred analysis software for mutation detection from fluorescence-based sequence data.

    PubMed

    Montgomery, Kate T; Iartchouck, Oleg; Li, Li; Loomis, Stephanie; Obourn, Vanessa; Kucherlapati, Raju

    2008-10-01

    The ability to search for genetic variants that may be related to human disease is one of the most exciting consequences of the availability of the sequence of the human genome. Large cohorts of individuals exhibiting certain phenotypes can be studied and candidate genes resequenced. However, the challenge of analyzing sequence data from many individuals with accuracy, speed, and economy is great. This unit describes one set of software tools: Phred, Phrap, PolyPhred, and Consed. Coverage includes the advantages and disadvantages of these analysis tools, details for obtaining and using the software, and the results one may expect. The software is being continually updated to permit further automation of mutation analysis. Currently, however, at least some manual review is required if one wishes to identify 100% of the variants in a sample set.

  4. Whole Exome Analysis of Early Onset Alzheimer’s Disease

    DTIC Science & Technology

    2016-04-01

    Early Onset Alzheimer’s Disease 5a. CONTRACT NUMBER W81XWH-12-1-0013 5b. GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR( S ) Margaret A. Pericak...relationship between SORL1, AD, and Parkinsonism . 16 Appendix V: ABCA7 Frameshift Deletion Associated with Alzheimer’s Disease in African Americans...onset Alzheimer disease identified using whole-exome sequencing G. W. Beecham1, B. W. Kunkle1, B. Vardarajan2, P. L. Whitehead1, S . Rolati1, E. R

  5. CWDPRNP: A tool for cervid prion sequence analysis in program R

    USGS Publications Warehouse

    Miller, William L.; Walter, W. David

    2017-01-01

    Chronic wasting disease is a fatal, neurological disease caused by an infectious prion protein, which affects economically and ecologically important members of the family Cervidae. Single nucleotide polymorphisms within the prion protein gene have been linked to differential susceptibility to the disease in many species. Wildlife managers are seeking to determine the frequencies of disease-associated alleles and genotypes and delineate spatial genetic patterns. The CWDPRNP package, implemented in program R, provides a unified framework for analyzing prion protein gene variability and spatial structure.

  6. Next-Generation Sequencing: The Translational Medicine Approach from “Bench to Bedside to Population”

    PubMed Central

    Beigh, Mohammad Muzafar

    2016-01-01

    Humans have predicted the relationship between heredity and diseases for a long time. Only in the beginning of the last century, scientists begin to discover the connotations between different genes and disease phenotypes. Recent trends in next-generation sequencing (NGS) technologies have brought a great momentum in biomedical research that in turn has remarkably augmented our basic understanding of human biology and its associated diseases. State-of-the-art next generation biotechnologies have started making huge strides in our current understanding of mechanisms of various chronic illnesses like cancers, metabolic disorders, neurodegenerative anomalies, etc. We are experiencing a renaissance in biomedical research primarily driven by next generation biotechnologies like genomics, transcriptomics, proteomics, metabolomics, lipidomics etc. Although genomic discoveries are at the forefront of next generation omics technologies, however, their implementation into clinical arena had been painstakingly slow mainly because of high reaction costs and unavailability of requisite computational tools for large-scale data analysis. However rapid innovations and steadily lowering cost of sequence-based chemistries along with the development of advanced bioinformatics tools have lately prompted launching and implementation of large-scale massively parallel genome sequencing programs in different fields ranging from medical genetics, infectious biology, agriculture sciences etc. Recent advances in large-scale omics-technologies is bringing healthcare research beyond the traditional “bench to bedside” approach to more of a continuum that will include improvements, in public healthcare and will be primarily based on predictive, preventive, personalized, and participatory medicine approach (P4). Recent large-scale research projects in genetic and infectious disease biology have indicated that massively parallel whole-genome/whole-exome sequencing, transcriptome analysis, and other functional genomic tools can reveal large number of unique functional elements and/or markers that otherwise would be undetected by traditional sequencing methodologies. Therefore, latest trends in the biomedical research is giving birth to the new branch in medicine commonly referred to as personalized and/or precision medicine. Developments in the post-genomic era are believed to completely restructure the present clinical pattern of disease prevention and treatment as well as methods of diagnosis and prognosis. The next important step in the direction of the precision/personalized medicine approach should be its early adoption in clinics for future medical interventions. Consequently, in coming year’s next generation biotechnologies will reorient medical practice more towards disease prediction and prevention approaches rather than curing them at later stages of their development and progression, even at wider population level(s) for general public healthcare system. PMID:28930123

  7. Photoreceptor dysplasia (pd) in miniature schnauzer dogs: evaluation of candidate genes by molecular genetic analysis.

    PubMed

    Zhang, Q; Baldwin, V J; Acland, G M; Parshall, C J; Haskel, J; Aguirre, G D; Ray, K

    1999-01-01

    Photoreceptor dysplasia (pd) is one of a group of at least six distinct autosomal and one X-linked retinal disorders identified in dogs which are collectively known as progressive retinal atrophy (PRA). It is an early onset retinal disease identified in miniature schnauzer dogs, and pedigree analysis and breeding studies have established autosomal recessive inheritance of the disease. Using a gene-based approach, a number of retina-expressed genes, including some members of the phototransduction pathway, have been causally implicated in retinal diseases of humans and other animals. Here we examined seven such potential candidate genes (opsin, RDS/peripherin, ROM1, rod cGMP-gated cation channel alpha-subunit, and three subunits of transducin) for their causal association with the pd locus by testing segregation of intragenic markers with the disease locus, or, in the absence of informative polymorphisms, sequencing of the coding regions of the genes. Based on these results, we have conclusively excluded four photoreceptor-specific genes as candidates for pd by linkage analysis. For three other photoreceptor-specific genes, we did not find any mutation in the coding sequences of the genes and have excluded them provisionally. Formal exclusion would require investigation of the levels of expression of the candidate genes in pd-affected dogs relative to age-matched controls. At present we are building suitable informative pedigrees for the disease locus with a sufficient number of meiosis to be useful for genomewide screening. This should identify markers linked to the disease locus and eventually permit progress toward the identification of the photoreceptor dysplasia gene and the disease-causing mutation.

  8. Microbiota and Metatranscriptome Changes Accompanying the Onset of Gingivitis

    PubMed Central

    2018-01-01

    ABSTRACT Over half of adults experience gingivitis, a mild yet treatable form of periodontal disease caused by the overgrowth of oral microbes. Left untreated, gingivitis can progress to a more severe and irreversible disease, most commonly chronic periodontitis. While periodontal diseases are associated with a shift in the oral microbiota composition, it remains unclear how this shift impacts microbiota function early in disease progression. Here, we analyzed the transition from health to gingivitis through both 16S v4-v5 rRNA amplicon and metatranscriptome sequencing of subgingival plaque samples from individuals undergoing an experimental gingivitis treatment. Beta-diversity analysis of 16S rRNA reveals that samples cluster based on disease severity and patient but not by oral hygiene status. Significant shifts in the abundance of several genera occurred during disease transition, suggesting a dysbiosis due to development of gingivitis. Comparing taxonomic abundance with transcriptomic activity revealed concordance of bacterial diversity composition between the two quantification assays in samples originating from both healthy and diseased teeth. Metatranscriptome sequencing analysis indicates that during the early stages of transition to gingivitis, a number of virulence-related transcripts were significantly differentially expressed in individual and across pooled patient samples. Upregulated genes include those involved in proteolytic and nucleolytic processes, while expression levels of those involved in surface structure assembly and other general virulence functions leading to colonization or adaptation within the host are more dynamic. These findings help characterize the transition from health to periodontal disease and identify genes associated with early disease. PMID:29666288

  9. Characterizing ncRNAs in Human Pathogenic Protists Using High-Throughput Sequencing Technology

    PubMed Central

    Collins, Lesley Joan

    2011-01-01

    ncRNAs are key genes in many human diseases including cancer and viral infection, as well as providing critical functions in pathogenic organisms such as fungi, bacteria, viruses, and protists. Until now the identification and characterization of ncRNAs associated with disease has been slow or inaccurate requiring many years of testing to understand complicated RNA and protein gene relationships. High-throughput sequencing now offers the opportunity to characterize miRNAs, siRNAs, small nucleolar RNAs (snoRNAs), and long ncRNAs on a genomic scale, making it faster and easier to clarify how these ncRNAs contribute to the disease state. However, this technology is still relatively new, and ncRNA discovery is not an application of high priority for streamlined bioinformatics. Here we summarize background concepts and practical approaches for ncRNA analysis using high-throughput sequencing, and how it relates to understanding human disease. As a case study, we focus on the parasitic protists Giardia lamblia and Trichomonas vaginalis, where large evolutionary distance has meant difficulties in comparing ncRNAs with those from model eukaryotes. A combination of biological, computational, and sequencing approaches has enabled easier classification of ncRNA classes such as snoRNAs, but has also aided the identification of novel classes. It is hoped that a higher level of understanding of ncRNA expression and interaction may aid in the development of less harsh treatment for protist-based diseases. PMID:22303390

  10. Next generation sequencing in women affected by nonsyndromic premature ovarian failure displays new potential causative genes and mutations.

    PubMed

    Fonseca, Dora Janeth; Patiño, Liliana Catherine; Suárez, Yohjana Carolina; de Jesús Rodríguez, Asid; Mateus, Heidi Eliana; Jiménez, Karen Marcela; Ortega-Recalde, Oscar; Díaz-Yamal, Ivonne; Laissue, Paul

    2015-07-01

    To identify new molecular actors involved in nonsyndromic premature ovarian failure (POF) etiology. This is a retrospective case-control cohort study. University research group and IVF medical center. Twelve women affected by nonsyndromic POF. The control group included 176 women whose menopause had occurred after age 50 and had no antecedents regarding gynecological disease. A further 345 women from the same ethnic origin (general population group) were also recruited to assess allele frequency for potentially deleterious sequence variants. Next generation sequencing (NGS), Sanger sequencing, and bioinformatics analysis. The complete coding regions of 70 candidate genes were massively sequenced, via NGS, in POF patients. Bioinformatics and genetics were used to confirm NGS results and to identify potential sequence variants related to the disease pathogenesis. We have identified mutations in two novel genes, ADAMTS19 and BMPR2, that are potentially related to POF origin. LHCGR mutations, which might have contributed to the phenotype, were also detected. We thus recommend NGS as a powerful tool for identifying new molecular actors in POF and for future diagnostic/prognostic purposes. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.

  11. The primary structure of the thymidine kinase gene of fish lymphocystis disease virus.

    PubMed

    Schnitzler, P; Handermann, M; Szépe, O; Darai, G

    1991-06-01

    The DNA nucleotide sequence of the thymidine kinase (TK) gene of fish lymphocystis disease virus (FLDV) which has been localized between the coordinates 0.678 to 0.688 of the viral genome was determined. The analysis of the DNA nucleotide sequence located between the recognition sites of HindIII (0.669 map unit; nucleotide position 1) and AccI (nucleotide position 2032) revealed the presence of an open reading frame of 954 bp on the lower strand of this region between nucleotide positions 1868 (ATG) and 915 (TAA). It encodes for a protein of 318 amino acid residues. The evolutionary relationships of the TK gene of FLDV to the other known TK genes was investigated using the method of progressive sequence alignment. These analyses revealed a high degree of diversity between the protein sequence of FLDV TK gene and the amino acid composition of other TKs tested. However, significant conservations were detected at several regions of amino acid residues of the FLDV TK protein when compared to the amino acid sequence of TKs of African swine fever virus, fowlpox virus, shope fibroma virus, and vaccinia virus and to the amino acid sequences of the cellular cytoplasmic TK of chicken, mouse, and man.

  12. Sequence of Radiotherapy and Chemotherapy in Breast Cancer After Breast-Conserving Surgery

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jobsen, Jan J., E-mail: J.Jobsen@mst.nl; Palen, Job van der; Department of Research Methodology, Measurement and Data Analysis, Faculty of Behavioural Science, University of Twente

    2012-04-01

    Purpose: The optimal sequence of radiotherapy and chemotherapy in breast-conserving therapy is unknown. Methods and Materials: From 1983 through 2007, a total of 641 patients with 653 instances of breast-conserving therapy (BCT), received both chemotherapy and radiotherapy and are the basis of this analysis. Patients were divided into three groups. Groups A and B comprised patients treated before 2005, Group A radiotherapy first and Group B chemotherapy first. Group C consisted of patients treated from 2005 onward, when we had a fixed sequence of radiotherapy first, followed by chemotherapy. Results: Local control did not show any differences among the threemore » groups. For distant metastasis, no difference was shown between Groups A and B. Group C, when compared with Group A, showed, on univariate and multivariate analyses, a significantly better distant metastasis-free survival. The same was noted for disease-free survival. With respect to disease-specific survival, no differences were shown on multivariate analysis among the three groups. Conclusion: Radiotherapy, as an integral part of the primary treatment of BCT, should be administered first, followed by adjuvant chemotherapy.« less

  13. Design of DNA pooling to allow incorporation of covariates in rare variants analysis.

    PubMed

    Guan, Weihua; Li, Chun

    2014-01-01

    Rapid advances in next-generation sequencing technologies facilitate genetic association studies of an increasingly wide array of rare variants. To capture the rare or less common variants, a large number of individuals will be needed. However, the cost of a large scale study using whole genome or exome sequencing is still high. DNA pooling can serve as a cost-effective approach, but with a potential limitation that the identity of individual genomes would be lost and therefore individual characteristics and environmental factors could not be adjusted in association analysis, which may result in power loss and a biased estimate of genetic effect. For case-control studies, we propose a design strategy for pool creation and an analysis strategy that allows covariate adjustment, using multiple imputation technique. Simulations show that our approach can obtain reasonable estimate for genotypic effect with only slight loss of power compared to the much more expensive approach of sequencing individual genomes. Our design and analysis strategies enable more powerful and cost-effective sequencing studies of complex diseases, while allowing incorporation of covariate adjustment.

  14. Phylogenetic Analysis of Aedes aegypti Based on Mitochondrial ND4 Gene Sequences in Almadinah, Saudi Arabia.

    PubMed

    Ali, Khalil H Al; El-Badry, Ayman A; Ali, Mouhanad Al; El-Sayed, Wael S M; El-Beshbishy, Hesham A

    2016-06-01

    Aedes aegypti is the main vector of the yellow fever and dengue virus. This mosquito has become the major indirect cause of morbidity and mortality of the human worldwide. Dengue virus activity has been reported recently in the western areas of Saudi Arabia. There is no vaccine for dengue virus until now, and the control of the disease depends on the control of the vector. The present study has aimed to perform phylogenetic analysis of Aedes aegypti based on mitochondrial NADH dehydrogenase subunit 4 ( ND4 ) gene at Almadinah, Saudi Arabia in order to get further insight into the epidemiology and transmission of this vector. Mitochondrial ND4 gene was sequenced in the eight isolated Aedes aegypti mosquitoes from Almadinah, Saudi Arabia, sequences were aligned, and phylogenetic analysis were performed and compared with 54 sequences of Aedes reported in the previous studies from Mexico, Thailand, Brazil, and Africa. Our results suggest that increased gene flow among Aedes aegypti populations occurs between Africa and Saudi Arabia. Phylogenetic relationship analysis showed two genetically distinct Aedes aegypti in Saudi Arabia derived from dual African ancestor.

  15. Differential Expression and Functional Analysis of High-Throughput -Omics Data Using Open Source Tools.

    PubMed

    Kebschull, Moritz; Fittler, Melanie Julia; Demmer, Ryan T; Papapanou, Panos N

    2017-01-01

    Today, -omics analyses, including the systematic cataloging of messenger RNA and microRNA sequences or DNA methylation patterns in a cell population, organ, or tissue sample, allow for an unbiased, comprehensive genome-level analysis of complex diseases, offering a large advantage over earlier "candidate" gene or pathway analyses. A primary goal in the analysis of these high-throughput assays is the detection of those features among several thousand that differ between different groups of samples. In the context of oral biology, our group has successfully utilized -omics technology to identify key molecules and pathways in different diagnostic entities of periodontal disease.A major issue when inferring biological information from high-throughput -omics studies is the fact that the sheer volume of high-dimensional data generated by contemporary technology is not appropriately analyzed using common statistical methods employed in the biomedical sciences.In this chapter, we outline a robust and well-accepted bioinformatics workflow for the initial analysis of -omics data generated using microarrays or next-generation sequencing technology using open-source tools. Starting with quality control measures and necessary preprocessing steps for data originating from different -omics technologies, we next outline a differential expression analysis pipeline that can be used for data from both microarray and sequencing experiments, and offers the possibility to account for random or fixed effects. Finally, we present an overview of the possibilities for a functional analysis of the obtained data.

  16. Virulence and Draft Genome Sequence Overview of Multiple Strains of the Swine Pathogen Haemophilus parasuis

    PubMed Central

    Brockmeier, Susan L.; Register, Karen B.; Kuehn, Joanna S.; Nicholson, Tracy L.; Loving, Crystal L.; Bayles, Darrell O.; Shore, Sarah M.; Phillips, Gregory J.

    2014-01-01

    Haemophilus parasuis is the cause of Glässer's disease in swine, which is characterized by systemic infection resulting in polyserositis, meningitis, and arthritis. Investigation of this animal disease is complicated by the enormous differences in the severity of disease caused by H. parasuis strains, ranging from lethal systemic disease to subclinical carriage. To identify differences in genotype that could account for virulence phenotypes, we established the virulence of, and performed whole genome sequence analysis on, 11 H. parasuis strains. Virulence was assessed by evaluating morbidity and mortality following intranasal challenge of Caesarean-derived, colostrum-deprived (CDCD) pigs. Genomic DNA from strains Nagasaki (serotype 5), 12939 (serotype 1), SW140 (serotype 2), 29755 (serotype 5), MN-H (serotype 13), 84-15995 (serotype 15), SW114 (serotype 3), H465 (serotype 11), D74 (serotype 9), and 174 (serotype 7) was used to generate Illumina paired-end libraries for genomic sequencing and de novo assembly. H. parasuis strains Nagasaki, 12939, SH0165 (serotype 5), SW140, 29755, and MN-H exhibited a high level of virulence. Despite minor differences in expression of disease among these groups, all pigs challenged with these strains developed clinical signs consistent with Glässer's disease between 1–7 days post-challenge. H. parasuis strains 84-15995 and SW114 were moderately virulent, in that approximately half of the pigs infected with each developed Glässer's disease. H. parasuis strains H465, D74, and 174 were minimally virulent or avirulent in the CDCD pig model. Comparative genomic analysis among strains identified several noteworthy differences in coding regions. These coding regions include predicted outer membrane, metabolism, and pilin or adhesin related genes, some of which likely contributed to the differences in virulence and systemic disease observed following challenge. These data will be useful for identifying H. parasuis virulence factors and vaccine targets. PMID:25137096

  17. Virulence and draft genome sequence overview of multiple strains of the swine pathogen Haemophilus parasuis.

    PubMed

    Brockmeier, Susan L; Register, Karen B; Kuehn, Joanna S; Nicholson, Tracy L; Loving, Crystal L; Bayles, Darrell O; Shore, Sarah M; Phillips, Gregory J

    2014-01-01

    Haemophilus parasuis is the cause of Glässer's disease in swine, which is characterized by systemic infection resulting in polyserositis, meningitis, and arthritis. Investigation of this animal disease is complicated by the enormous differences in the severity of disease caused by H. parasuis strains, ranging from lethal systemic disease to subclinical carriage. To identify differences in genotype that could account for virulence phenotypes, we established the virulence of, and performed whole genome sequence analysis on, 11 H. parasuis strains. Virulence was assessed by evaluating morbidity and mortality following intranasal challenge of Caesarean-derived, colostrum-deprived (CDCD) pigs. Genomic DNA from strains Nagasaki (serotype 5), 12939 (serotype 1), SW140 (serotype 2), 29755 (serotype 5), MN-H (serotype 13), 84-15995 (serotype 15), SW114 (serotype 3), H465 (serotype 11), D74 (serotype 9), and 174 (serotype 7) was used to generate Illumina paired-end libraries for genomic sequencing and de novo assembly. H. parasuis strains Nagasaki, 12939, SH0165 (serotype 5), SW140, 29755, and MN-H exhibited a high level of virulence. Despite minor differences in expression of disease among these groups, all pigs challenged with these strains developed clinical signs consistent with Glässer's disease between 1-7 days post-challenge. H. parasuis strains 84-15995 and SW114 were moderately virulent, in that approximately half of the pigs infected with each developed Glässer's disease. H. parasuis strains H465, D74, and 174 were minimally virulent or avirulent in the CDCD pig model. Comparative genomic analysis among strains identified several noteworthy differences in coding regions. These coding regions include predicted outer membrane, metabolism, and pilin or adhesin related genes, some of which likely contributed to the differences in virulence and systemic disease observed following challenge. These data will be useful for identifying H. parasuis virulence factors and vaccine targets.

  18. Mining biological databases for candidate disease genes

    NASA Astrophysics Data System (ADS)

    Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

    2001-07-01

    The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).

  19. Streptococcus mitis Strains Causing Severe Clinical Disease in Cancer Patients

    PubMed Central

    Sahasrabhojane, Pranoti; Saldana, Miguel; Yao, Hui; Su, Xiaoping; Horstmann, Nicola; Thompson, Erika; Flores, Anthony R.

    2014-01-01

    The genetically diverse viridans group streptococci (VGS) are increasingly recognized as the cause of a variety of human diseases. We used a recently developed multilocus sequence analysis scheme to define the species of 118 unique VGS strains causing bacteremia in patients with cancer; Streptococcus mitis (68 patients) and S. oralis (22 patients) were the most frequently identified strains. Compared with patients infected with non–S. mitis strains, patients infected with S. mitis strains were more likely to have moderate or severe clinical disease (e.g., VGS shock syndrome). Combined with the sequence data, whole-genome analyses showed that S. mitis strains may more precisely be considered as >2 species. Furthermore, we found that multiple S. mitis strains induced disease in neutropenic mice in a dose-dependent fashion. Our data define the prominent clinical effect of the group of organisms currently classified as S. mitis and lay the groundwork for increased understanding of this understudied pathogen. PMID:24750901

  20. Whole-genome CNV analysis: advances in computational approaches.

    PubMed

    Pirooznia, Mehdi; Goes, Fernando S; Zandi, Peter P

    2015-01-01

    Accumulating evidence indicates that DNA copy number variation (CNV) is likely to make a significant contribution to human diversity and also play an important role in disease susceptibility. Recent advances in genome sequencing technologies have enabled the characterization of a variety of genomic features, including CNVs. This has led to the development of several bioinformatics approaches to detect CNVs from next-generation sequencing data. Here, we review recent advances in CNV detection from whole genome sequencing. We discuss the informatics approaches and current computational tools that have been developed as well as their strengths and limitations. This review will assist researchers and analysts in choosing the most suitable tools for CNV analysis as well as provide suggestions for new directions in future development.

  1. Gold nanoparticles for high-throughput genotyping of long-range haplotypes

    NASA Astrophysics Data System (ADS)

    Chen, Peng; Pan, Dun; Fan, Chunhai; Chen, Jianhua; Huang, Ke; Wang, Dongfang; Zhang, Honglu; Li, You; Feng, Guoyin; Liang, Peiji; He, Lin; Shi, Yongyong

    2011-10-01

    Completion of the Human Genome Project and the HapMap Project has led to increasing demands for mapping complex traits in humans to understand the aetiology of diseases. Identifying variations in the DNA sequence, which affect how we develop disease and respond to pathogens and drugs, is important for this purpose, but it is difficult to identify these variations in large sample sets. Here we show that through a combination of capillary sequencing and polymerase chain reaction assisted by gold nanoparticles, it is possible to identify several DNA variations that are associated with age-related macular degeneration and psoriasis on significant regions of human genomic DNA. Our method is accurate and promising for large-scale and high-throughput genetic analysis of susceptibility towards disease and drug resistance.

  2. Isolation of Vibrio alginolyticus and Vibrio splendidus from captive-bred seahorses with disease symptoms.

    PubMed

    Balcázar, José L; Gallo-Bueno, Alfonso; Planas, Miquel; Pintado, José

    2010-02-01

    Vibrio species isolated from diseased seahorses were characterized by PCR amplification of repetitive bacterial DNA elements (rep-PCR) and identified by 16S ribosomal RNA gene sequence analysis. The results demonstrated that Vibrio alginolyticus and Vibrio splendidus were predominant in the lesions of these seahorses. To our knowledge, this is the first time that these bacterial species have been associated with disease symptoms in captive-bred seahorses.

  3. Whole genome sequencing distinguishes between relapse and reinfection in recurrent leprosy cases

    PubMed Central

    Bührer-Sékula, Samira; Benjak, Andrej; Loiseau, Chloé; Singh, Pushpendra; Pontes, Maria A. A.; Gonçalves, Heitor S.; Hungria, Emerith M.; Busso, Philippe; Piton, Jérémie; Silveira, Maria I. S.; Cruz, Rossilene; Schetinni, Antônio; Costa, Maurício B.; Virmond, Marcos C. L.; Diorio, Suzana M.; Dias-Baptista, Ida M. F.; Rosa, Patricia S.; Matsuoka, Masanori; Penna, Maria L. F.; Cole, Stewart T.; Penna, Gerson O.

    2017-01-01

    Background Since leprosy is both treated and controlled by multidrug therapy (MDT) it is important to monitor recurrent cases for drug resistance and to distinguish between relapse and reinfection as a means of assessing therapeutic efficacy. All three objectives can be reached with single nucleotide resolution using next generation sequencing and bioinformatics analysis of Mycobacterium leprae DNA present in human skin. Methodology DNA was isolated by means of optimized extraction and enrichment methods from samples from three recurrent cases in leprosy patients participating in an open-label, randomized, controlled clinical trial of uniform MDT in Brazil (U-MDT/CT-BR). Genome-wide sequencing of M. leprae was performed and the resultant sequence assemblies analyzed in silico. Principal findings In all three cases, no mutations responsible for resistance to rifampicin, dapsone and ofloxacin were found, thus eliminating drug resistance as a possible cause of disease recurrence. However, sequence differences were detected between the strains from the first and second disease episodes in all three patients. In one case, clear evidence was obtained for reinfection with an unrelated strain whereas in the other two cases, relapse appeared more probable. Conclusions/Significance This is the first report of using M. leprae whole genome sequencing to reveal that treated and cured leprosy patients who remain in endemic areas can be reinfected by another strain. Next generation sequencing can be applied reliably to M. leprae DNA extracted from biopsies to discriminate between cases of relapse and reinfection, thereby providing a powerful tool for evaluating different outcomes of therapeutic regimens and for following disease transmission. PMID:28617800

  4. Ocular findings associated with a Cys39Arg mutation in the Norrie disease gene.

    PubMed

    Joos, K M; Kimura, A E; Vandenburgh, K; Bartley, J A; Stone, E M

    1994-12-01

    To diagnose the carriers and noncarriers in a family affected with Norrie disease based on molecular analysis. Family members from three generations, including one affected patient, two obligate carriers, one carrier identified with linkage analysis, one noncarrier identified with linkage analysis, and one female family member with indeterminate carrier status, were examined clinically and electrophysiologically. Linkage analysis had previously failed to determine the carrier status of one female family member in the third generation. Blood samples were screened for mutations in the Norrie disease gene with single-strand conformation polymorphism analysis. The mutation was characterized by dideoxy-termination sequencing. Ophthalmoscopy and electroretinographic examination failed to detect the carrier state. The affected individuals and carriers in this family were found to have a transition from thymidine to cytosine in the first nucleotide of codon 39 of the Norrie disease gene, causing a cysteine-to-arginine mutation. Single-strand conformation polymorphism analysis identified a patient of indeterminate status (by linkage) to be a noncarrier of Norrie disease. Ophthalmoscopy and electroretinography could not identify carriers of this Norrie disease mutation. Single-strand conformation polymorphism analysis was more sensitive and specific than linkage analysis in identifying carriers in this family.

  5. Characterization of class II β chain major histocompatibility complex genes in a family of Hawaiian honeycreepers: 'amakihi (Hemignathus virens).

    PubMed

    Jarvi, Susan I; Bianchi, Kiara R; Farias, Margaret Em; Txakeeyang, Ann; McFarland, Thomas; Belcaid, Mahdi; Asano, Ashley

    2016-07-01

    Hawaiian honeycreepers (Drepanidinae) have evolved in the absence of mosquitoes for over five million years. Through human activity, mosquitoes were introduced to the Hawaiian archipelago less than 200 years ago. Mosquito-vectored diseases such as avian malaria caused by Plasmodium relictum and Avipoxviruses have greatly impacted these vulnerable species. Susceptibility to these diseases is variable among and within species. Due to their function in adaptive immunity, the role of major histocompatibility complex genes (Mhc) in disease susceptibility is under investigation. In this study, we evaluate gene organization and levels of diversity of Mhc class II β chain genes (exon 2) in a captive-reared family of Hawaii 'amakihi (Hemignathus virens). A total of 233 sequences (173 bp) were obtained by PCR+1 amplification and cloning, and 5720 sequences were generated by Roche 454 pyrosequencing. We report a total of 17 alleles originating from a minimum of 14 distinct loci. We detected three linkage groups that appear to represent three distinct haplotypes. Phylogenetic analysis revealed one variable cluster resembling classical Mhc sequences (DAB) and one highly conserved, low variability cluster resembling non-classical Mhc sequences (DBB). High net evolutionary divergence values between DAB and DBB resemble that seen between chicken BLB system and YLB system genes. High amino acid identity among non-classical alleles from 12 species of passerines (DBB) and four species of Galliformes (YLB) was found, suggesting that these non-classical passerine sequences may be related to the Galliforme YLB sequences.

  6. [The genetical evolution of the full length genes of 5 EV 71 strains from 5 Shenzhen patients with hand-food-mouth disease associated with EV71 infection].

    PubMed

    Liu, Wei-long; Yang, Gui-lin; Wei, Qing; Zhang, Ming-xia; Chen, Xin-chun; Liu, Ying-xia; Gao, Yang; Zhou, Bo-ping

    2011-02-01

    To investigate the characteristics of molecular epidemiology and molecular evolution of 5 EV 71 (enterovirus 71, EV71) strains from 5 Shenzhen patients with hand-food-mouth disease associated with EV 71 infection. 5 EV 71 strains were isolated, and sequenced to analyzed the full length gene sequences in order to compare nucleotide and amino acid homology with other EV71 strains from other regions and countries as well as previous strains across the world through bioinformatics software. 5 strains of EV 71 belonged to sub-genotype C4 by analysis of nucleotide sequences of VP1 and VP4 of EV 71. The differences of nucleotide and amino acid sequences were much small with nucleotide homology of 93% and amino acid homology of 98% among these 5 strains. A phylogenetic tree analysis indicated that 2008 Shenzhen epidemic strains were the most close to 2004 Shenzhen circulating strains, and also much close to 1998 Shenzhen epidemic strains and 2008 Fuyang Anhui strains. The dead strain was very close to 2008 Fuyang Anhui epidemic strains. It can be speculated that this epidemic strains of EV 71 probably originate from the same ancient strain in the history, may from 1998 Shenzhen strain.

  7. Genetic Architecture of Vitamin B12 and Folate Levels Uncovered Applying Deeply Sequenced Large Datasets

    PubMed Central

    Thorleifsson, Gudmar; Ahluwalia, Tarunveer S.; Steinthorsdottir, Valgerdur; Bjarnason, Helgi; Gudbjartsson, Daniel F.; Magnusson, Olafur T.; Sparsø, Thomas; Albrechtsen, Anders; Kong, Augustine; Masson, Gisli; Tian, Geng; Cao, Hongzhi; Nie, Chao; Kristiansen, Karsten; Husemoen, Lise Lotte; Thuesen, Betina; Li, Yingrui; Nielsen, Rasmus; Linneberg, Allan; Olafsson, Isleifur; Eyjolfsson, Gudmundur I.; Jørgensen, Torben; Wang, Jun; Hansen, Torben; Thorsteinsdottir, Unnur; Stefánsson, Kari; Pedersen, Oluf

    2013-01-01

    Genome-wide association studies have mainly relied on common HapMap sequence variations. Recently, sequencing approaches have allowed analysis of low frequency and rare variants in conjunction with common variants, thereby improving the search for functional variants and thus the understanding of the underlying biology of human traits and diseases. Here, we used a large Icelandic whole genome sequence dataset combined with Danish exome sequence data to gain insight into the genetic architecture of serum levels of vitamin B12 (B12) and folate. Up to 22.9 million sequence variants were analyzed in combined samples of 45,576 and 37,341 individuals with serum B12 and folate measurements, respectively. We found six novel loci associating with serum B12 (CD320, TCN2, ABCD4, MMAA, MMACHC) or folate levels (FOLR3) and confirmed seven loci for these traits (TCN1, FUT6, FUT2, CUBN, CLYBL, MUT, MTHFR). Conditional analyses established that four loci contain additional independent signals. Interestingly, 13 of the 18 identified variants were coding and 11 of the 13 target genes have known functions related to B12 and folate pathways. Contrary to epidemiological studies we did not find consistent association of the variants with cardiovascular diseases, cancers or Alzheimer's disease although some variants demonstrated pleiotropic effects. Although to some degree impeded by low statistical power for some of these conditions, these data suggest that sequence variants that contribute to the population diversity in serum B12 or folate levels do not modify the risk of developing these conditions. Yet, the study demonstrates the value of combining whole genome and exome sequencing approaches to ascertain the genetic and molecular architectures underlying quantitative trait associations. PMID:23754956

  8. Targeted gene panel sequencing in children with very early onset inflammatory bowel disease--evaluation and prospective analysis.

    PubMed

    Kammermeier, Jochen; Drury, Suzanne; James, Chela T; Dziubak, Robert; Ocaka, Louise; Elawad, Mamoun; Beales, Philip; Lench, Nicholas; Uhlig, Holm H; Bacchelli, Chiara; Shah, Neil

    2014-11-01

    Multiple monogenetic conditions with partially overlapping phenotypes can present with inflammatory bowel disease (IBD)-like intestinal inflammation. With novel genotype-specific therapies emerging, establishing a molecular diagnosis is becoming increasingly important. We have introduced targeted next-generation sequencing (NGS) technology as a prospective screening tool in children with very early onset IBD (VEOIBD). We evaluated the coverage of 40 VEOIBD genes in two separate cohorts undergoing targeted gene panel sequencing (TGPS) (n=25) and whole exome sequencing (WES) (n=20). TGPS revealed causative mutations in four genes (IL10RA, EPCAM, TTC37 and SKIV2L) discovered unexpected phenotypes and directly influenced clinical decision making by supporting as well as avoiding haematopoietic stem cell transplantation. TGPS resulted in significantly higher median coverage when compared with WES, fewer coverage deficiencies and improved variant detection across established VEOIBD genes. Excluding or confirming known VEOIBD genotypes should be considered early in the disease course in all cases of therapy-refractory VEOIBD, as it can have a direct impact on patient management. To combine both described NGS technologies would compensate for the limitations of WES for disease-specific application while offering the opportunity for novel gene discovery in the research setting. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  9. Molecular and clinical profile of von Willebrand disease in Spain (PCM-EVW-ES): comprehensive genetic analysis by next-generation sequencing of 480 patients

    PubMed Central

    Borràs, Nina; Batlle, Javier; Pérez-Rodríguez, Almudena; López-Fernández, María Fernanda; Rodríguez-Trillo, Ángela; Lourés, Esther; Cid, Ana Rosa; Bonanad, Santiago; Cabrera, Noelia; Moret, Andrés; Parra, Rafael; Mingot-Castellano, María Eva; Balda, Ignacia; Altisent, Carme; Pérez-Montes, Rocío; Fisac, Rosa María; Iruín, Gemma; Herrero, Sonia; Soto, Inmaculada; de Rueda, Beatriz; Jiménez-Yuste, Víctor; Alonso, Nieves; Vilariño, Dolores; Arija, Olga; Campos, Rosa; Paloma, María José; Bermejo, Nuria; Berrueco, Rubén; Mateo, José; Arribalzaga, Karmele; Marco, Pascual; Palomo, Ángeles; Sarmiento, Lizheidy; Iñigo, Belén; Nieto, María del Mar; Vidal, Rosa; Martínez, María Paz; Aguinaco, Reyes; César, Jesús María; Ferreiro, María; García-Frade, Javier; Rodríguez-Huerta, Ana María; Cuesta, Jorge; Rodríguez-González, Ramón; García-Candel, Faustino; Cornudella, Rosa; Aguilar, Carlos; Vidal, Francisco; Corrales, Irene

    2017-01-01

    Molecular diagnosis of patients with von Willebrand disease is pending in most populations due to the complexity and high cost of conventional molecular analyses. The need for molecular and clinical characterization of von Willebrand disease in Spain prompted the creation of a multicenter project (PCM-EVW-ES) that resulted in the largest prospective cohort study of patients with all types of von Willebrand disease. Molecular analysis of relevant regions of the VWF, including intronic and promoter regions, was achieved in the 556 individuals recruited via the development of a simple, innovative, relatively low-cost protocol based on microfluidic technology and next-generation sequencing. A total of 704 variants (237 different) were identified along VWF, 155 of which had not been previously recorded in the international mutation database. The potential pathogenic effect of these variants was assessed by in silico analysis. Furthermore, four short tandem repeats were analyzed in order to evaluate the ancestral origin of recurrent mutations. The outcome of genetic analysis allowed for the reclassification of 110 patients, identification of 37 asymptomatic carriers (important for genetic counseling) and re-inclusion of 43 patients previously excluded by phenotyping results. In total, 480 patients were definitively diagnosed. Candidate mutations were identified in all patients except 13 type 1 von Willebrand disease, yielding a high genotype-phenotype correlation. Our data reinforce the capital importance and usefulness of genetics in von Willebrand disease diagnostics. The progressive implementation of molecular study as the first-line test for routine diagnosis of this condition will lead to increasingly more personalized and effective care for this patient population. PMID:28971901

  10. Molecular and clinical profile of von Willebrand disease in Spain (PCM-EVW-ES): comprehensive genetic analysis by next-generation sequencing of 480 patients.

    PubMed

    Borràs, Nina; Batlle, Javier; Pérez-Rodríguez, Almudena; López-Fernández, María Fernanda; Rodríguez-Trillo, Ángela; Lourés, Esther; Cid, Ana Rosa; Bonanad, Santiago; Cabrera, Noelia; Moret, Andrés; Parra, Rafael; Mingot-Castellano, María Eva; Balda, Ignacia; Altisent, Carme; Pérez-Montes, Rocío; Fisac, Rosa María; Iruín, Gemma; Herrero, Sonia; Soto, Inmaculada; de Rueda, Beatriz; Jiménez-Yuste, Víctor; Alonso, Nieves; Vilariño, Dolores; Arija, Olga; Campos, Rosa; Paloma, María José; Bermejo, Nuria; Berrueco, Rubén; Mateo, José; Arribalzaga, Karmele; Marco, Pascual; Palomo, Ángeles; Sarmiento, Lizheidy; Iñigo, Belén; Nieto, María Del Mar; Vidal, Rosa; Martínez, María Paz; Aguinaco, Reyes; César, Jesús María; Ferreiro, María; García-Frade, Javier; Rodríguez-Huerta, Ana María; Cuesta, Jorge; Rodríguez-González, Ramón; García-Candel, Faustino; Cornudella, Rosa; Aguilar, Carlos; Vidal, Francisco; Corrales, Irene

    2017-12-01

    Molecular diagnosis of patients with von Willebrand disease is pending in most populations due to the complexity and high cost of conventional molecular analyses. The need for molecular and clinical characterization of von Willebrand disease in Spain prompted the creation of a multicenter project (PCM-EVW-ES) that resulted in the largest prospective cohort study of patients with all types of von Willebrand disease. Molecular analysis of relevant regions of the VWF , including intronic and promoter regions, was achieved in the 556 individuals recruited via the development of a simple, innovative, relatively low-cost protocol based on microfluidic technology and next-generation sequencing. A total of 704 variants (237 different) were identified along VWF , 155 of which had not been previously recorded in the international mutation database. The potential pathogenic effect of these variants was assessed by in silico analysis. Furthermore, four short tandem repeats were analyzed in order to evaluate the ancestral origin of recurrent mutations. The outcome of genetic analysis allowed for the reclassification of 110 patients, identification of 37 asymptomatic carriers (important for genetic counseling) and re-inclusion of 43 patients previously excluded by phenotyping results. In total, 480 patients were definitively diagnosed. Candidate mutations were identified in all patients except 13 type 1 von Willebrand disease, yielding a high genotype-phenotype correlation. Our data reinforce the capital importance and usefulness of genetics in von Willebrand disease diagnostics. The progressive implementation of molecular study as the first-line test for routine diagnosis of this condition will lead to increasingly more personalized and effective care for this patient population. Copyright© 2017 Ferrata Storti Foundation.

  11. Quantitative RNA-seq analysis of the Campylobacter jejuni transcriptome

    PubMed Central

    Chaudhuri, Roy R.; Yu, Lu; Kanji, Alpa; Perkins, Timothy T.; Gardner, Paul P.; Choudhary, Jyoti; Maskell, Duncan J.

    2011-01-01

    Campylobacter jejuni is the most common bacterial cause of foodborne disease in the developed world. Its general physiology and biochemistry, as well as the mechanisms enabling it to colonize and cause disease in various hosts, are not well understood, and new approaches are required to understand its basic biology. High-throughput sequencing technologies provide unprecedented opportunities for functional genomic research. Recent studies have shown that direct Illumina sequencing of cDNA (RNA-seq) is a useful technique for the quantitative and qualitative examination of transcriptomes. In this study we report RNA-seq analyses of the transcriptomes of C. jejuni (NCTC11168) and its rpoN mutant. This has allowed the identification of hitherto unknown transcriptional units, and further defines the regulon that is dependent on rpoN for expression. The analysis of the NCTC11168 transcriptome was supplemented by additional proteomic analysis using liquid chromatography-MS. The transcriptomic and proteomic datasets represent an important resource for the Campylobacter research community. PMID:21816880

  12. Multilocus sequence typing of Pseudomonas syringae sensu lato confirms previously described genomospecies and permits rapid identification of P. syringae pv. coriandricola and P. syringae pv. apii causing bacterial leaf spot on parsley.

    PubMed

    Bull, Carolee T; Clarke, Christopher R; Cai, Rongman; Vinatzer, Boris A; Jardini, Teresa M; Koike, Steven T

    2011-07-01

    Since 2002, severe leaf spotting on parsley (Petroselinum crispum) has occurred in Monterey County, CA. Either of two different pathovars of Pseudomonas syringae sensu lato were isolated from diseased leaves from eight distinct outbreaks and once from the same outbreak. Fragment analysis of DNA amplified between repetitive sequence polymerase chain reaction; 16S rDNA sequence analysis; and biochemical, physiological, and host range tests identified the pathogens as Pseudomonas syringae pv. apii and P. syringae pv. coriandricola. Koch's postulates were completed for the isolates from parsley, and host range tests with parsley isolates and pathotype strains demonstrated that P. syringae pv. apii and P. syringae pv. coriandricola cause leaf spot diseases on parsley, celery, and coriander or cilantro. In a multilocus sequence typing (MLST) approach, four housekeeping gene fragments were sequenced from 10 strains isolated from parsley and 56 pathotype strains of P. syringae. Allele sequences were uploaded to the Plant-Associated Microbes Database and a phylogenetic tree was built based on concatenated sequences. Tree topology directly corresponded to P. syringae genomospecies and P. syringae pv. apii was allocated appropriately to genomospecies 3. This is the first demonstration that MLST can accurately allocate new pathogens directly to P. syringae sensu lato genomospecies. According to MLST, P. syringae pv. coriandricola is a member of genomospecies 9, P. cannabina. In a blind test, both P. syringae pv. coriandricola and P. syringae pv. apii isolates from parsley were correctly identified to pathovar. In both cases, MLST described diversity within each pathovar that was previously unknown.

  13. Winnowing DNA for rare sequences: highly specific sequence and methylation based enrichment.

    PubMed

    Thompson, Jason D; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

    2012-01-01

    Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue.

  14. Loeffler 4.0: Diagnostic Metagenomics.

    PubMed

    Höper, Dirk; Wylezich, Claudia; Beer, Martin

    2017-01-01

    A new world of possibilities for "virus discovery" was opened up with high-throughput sequencing becoming available in the last decade. While scientifically metagenomic analysis was established before the start of the era of high-throughput sequencing, the availability of the first second-generation sequencers was the kick-off for diagnosticians to use sequencing for the detection of novel pathogens. Today, diagnostic metagenomics is becoming the standard procedure for the detection and genetic characterization of new viruses or novel virus variants. Here, we provide an overview about technical considerations of high-throughput sequencing-based diagnostic metagenomics together with selected examples of "virus discovery" for animal diseases or zoonoses and metagenomics for food safety or basic veterinary research. © 2017 Elsevier Inc. All rights reserved.

  15. Genome sequence, comparative analysis and haplotype structure of the domestic dog.

    PubMed

    Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S

    2005-12-08

    Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.

  16. Identification of a 'Candidatus Phytoplasma hispanicum'-related strain, associated with yellows-type diseases, in smoke-tree sharpshooter (Homalodisca liturata Ball).

    PubMed

    Servín-Villegas, Rosalía; Caamal-Chan, Maria Goretty; Chavez-Medina, Alicia; Loera-Muro, Abraham; Barraza, Aarón; Medina-Hernández, Diana; Holguín-Peña, Ramón Jaime

    2018-04-11

    The 16SrXIII group from phytoplasma bacteria were identified in salivary glands from Homalodisca liturata, which were collected in El Comitán on the Baja California peninsula in Mexico. We were able to positively identify 15 16S rRNA gene sequences with the corresponding signature sequence of 'CandidatusPhytoplasma' (CAAGAYBATKATGTKTAGCYGGDCT) and in silico restriction fragment length polymorphism (RFLP) profiles (F value estimations) coupled with a phylogenetic analysis to confirm their relatedness to 'CandidatusPhytoplasma hispanicum', which in turn belongs to the 16SrXIII group. A restriction analysis was carried out with AluI and EcoRI to confirm that the five sequences belongs to subgroup D. The rest of the sequences did not exhibit any known RFLP profile related to a subgroup reported in the 16SrXIII group.

  17. Cutaneous Granulomas in Dolphins Caused by Novel Uncultivated Paracoccidioides brasiliensis

    PubMed Central

    Vilela, Raquel; Bossart, Gregory D.; St. Leger, Judy A.; Dalton, Leslie M.; Reif, John S.; Schaefer, Adam M.; McCarthy, Peter J.; Fair, Patricia A.

    2016-01-01

    Cutaneous granulomas in dolphins were believed to be caused by Lacazia loboi, which also causes a similar disease in humans. This hypothesis was recently challenged by reports that fungal DNA sequences from dolphins grouped this pathogen with Paracoccidioides brasiliensis. We conducted phylogenetic analysis of fungi from 6 bottlenose dolphins (Tursiops truncatus) with cutaneous granulomas and chains of yeast cells in infected tissues. Kex gene sequences of P. brasiliensis from dolphins showed 100% homology with sequences from cultivated P. brasiliensis, 73% with those of L. loboi, and 93% with those of P. lutzii. Parsimony analysis placed DNA sequences from dolphins within a cluster with human P. brasiliensis strains. This cluster was the sister taxon to P. lutzii and L. loboi. Our molecular data support previous findings and suggest that a novel uncultivated strain of P. brasiliensis restricted to cutaneous lesions in dolphins is probably the cause of lacaziosis/lobomycosis, herein referred to as paracoccidioidomycosis ceti. PMID:27869614

  18. Cutaneous Granulomas in Dolphins Caused by Novel Uncultivated Paracoccidioides brasiliensis.

    PubMed

    Vilela, Raquel; Bossart, Gregory D; St Leger, Judy A; Dalton, Leslie M; Reif, John S; Schaefer, Adam M; McCarthy, Peter J; Fair, Patricia A; Mendoza, Leonel

    2016-12-01

    Cutaneous granulomas in dolphins were believed to be caused by Lacazia loboi, which also causes a similar disease in humans. This hypothesis was recently challenged by reports that fungal DNA sequences from dolphins grouped this pathogen with Paracoccidioides brasiliensis. We conducted phylogenetic analysis of fungi from 6 bottlenose dolphins (Tursiops truncatus) with cutaneous granulomas and chains of yeast cells in infected tissues. Kex gene sequences of P. brasiliensis from dolphins showed 100% homology with sequences from cultivated P. brasiliensis, 73% with those of L. loboi, and 93% with those of P. lutzii. Parsimony analysis placed DNA sequences from dolphins within a cluster with human P. brasiliensis strains. This cluster was the sister taxon to P. lutzii and L. loboi. Our molecular data support previous findings and suggest that a novel uncultivated strain of P. brasiliensis restricted to cutaneous lesions in dolphins is probably the cause of lacaziosis/lobomycosis, herein referred to as paracoccidioidomycosis ceti.

  19. The draft genome sequence of the ferret (Mustela putorius furo) facilitates study of human respiratory disease.

    PubMed

    Peng, Xinxia; Alföldi, Jessica; Gori, Kevin; Eisfeld, Amie J; Tyler, Scott R; Tisoncik-Go, Jennifer; Brawand, David; Law, G Lynn; Skunca, Nives; Hatta, Masato; Gasper, David J; Kelly, Sara M; Chang, Jean; Thomas, Matthew J; Johnson, Jeremy; Berlin, Aaron M; Lara, Marcia; Russell, Pamela; Swofford, Ross; Turner-Maier, Jason; Young, Sarah; Hourlier, Thibaut; Aken, Bronwen; Searle, Steve; Sun, Xingshen; Yi, Yaling; Suresh, M; Tumpey, Terrence M; Siepel, Adam; Wisely, Samantha M; Dessimoz, Christophe; Kawaoka, Yoshihiro; Birren, Bruce W; Lindblad-Toh, Kerstin; Di Palma, Federica; Engelhardt, John F; Palermo, Robert E; Katze, Michael G

    2014-12-01

    The domestic ferret (Mustela putorius furo) is an important animal model for multiple human respiratory diseases. It is considered the 'gold standard' for modeling human influenza virus infection and transmission. Here we describe the 2.41 Gb draft genome assembly of the domestic ferret, constituting 2.28 Gb of sequence plus gaps. We annotated 19,910 protein-coding genes on this assembly using RNA-seq data from 21 ferret tissues. We characterized the ferret host response to two influenza virus infections by RNA-seq analysis of 42 ferret samples from influenza time-course data and showed distinct signatures in ferret trachea and lung tissues specific to 1918 or 2009 human pandemic influenza virus infections. Using microarray data from 16 ferret samples reflecting cystic fibrosis disease progression, we showed that transcriptional changes in the CFTR-knockout ferret lung reflect pathways of early disease that cannot be readily studied in human infants with cystic fibrosis disease.

  20. The draft genome sequence of the ferret (Mustela putorius furo) facilitates study of human respiratory disease

    PubMed Central

    Peng, Xinxia; Alföldi, Jessica; Gori, Kevin; Eisfeld, Amie J.; Tyler, Scott R.; Tisoncik-Go, Jennifer; Brawand, David; Law, G. Lynn; Skunca, Nives; Hatta, Masato; Gasper, David J.; Kelly, Sara M.; Chang, Jean; Thomas, Matthew J.; Johnson, Jeremy; Berlin, Aaron M.; Lara, Marcia; Russell, Pamela; Swofford, Ross; Turner-Maier, Jason; Young, Sarah; Hourlier, Thibaut; Aken, Bronwen; Searle, Steve; Sun, Xingshen; Yi, Yaling; Suresh, M.; Tumpey, Terrence M.; Siepel, Adam; Wisely, Samantha M.; Dessimoz, Christophe; Kawaoka, Yoshihiro; Birren, Bruce W.; Lindblad-Toh, Kerstin; Di Palma, Federica; Engelhardt, John F.; Palermo, Robert E.; Katze, Michael G.

    2014-01-01

    The domestic ferret (Mustela putorius furo) is an important animal model for multiple human respiratory diseases. It is considered the ‘gold standard’ for modeling human influenza virus infection and transmission1–4. Here we describe the 2.41 Gb draft genome assembly of the domestic ferret, constituting 2.28 Gb of sequence plus gaps. We annotate 19,910 protein-coding genes on this assembly using RNA-seq data from 21 ferret tissues. We characterize the ferret host response to two influenza virus infections by RNA-seq analysis of 42 ferret samples from influenza time courses, and show distinct signatures in ferret trachea and lung tissues specific to 1918 or 2009 human pandemic influenza virus infections. Using microarray data from 16 ferret samples reflecting cystic fibrosis (CF) disease progression, we show that transcriptional changes in the CFTR-knockout ferret lung reflect pathways of early disease that cannot be readily studied in human infants with CF disease. PMID:25402615

  1. A sequence database allowing automated genotyping of Classical swine fever virus isolates.

    PubMed

    Dreier, Sabrina; Zimmermann, Bernd; Moennig, Volker; Greiser-Wilke, Irene

    2007-03-01

    Classical swine fever (CSF) is a highly contagious viral disease of pigs. According to the OIE classification of diseases it is classified as a notifiable (previously List A) disease, thus having the potential for causing severe socio-economic problems and affecting severely the international trade of pigs and pig products. Effective control measures are compulsory, and to expose weaknesses a reliable tracing of the spread of the virus is necessary. Genetic typing has proved to be the method of choice. However, genotyping involves the use of multiple software applications, which is laborious and complex. The implementation of a sequence database, which is accessible by the World Wide Web with the option to type automatically new CSF virus isolates once the sequence is available is described. The sequence to be typed is tested for correct orientation and, if necessary, adjusted to the right length. The alignment and the neighbor-joining phylogenetic analysis with a standard set of sequences can then be calculated. The results are displayed as a graph. As an example, the determination is shown of the genetic subgroup of the isolate obtained from the outbreaks registered in Russia, in 2005. After registration (Irene.greiser-wilke@tiho-hannover.de) the database including the module for genotyping are accessible under http://viro08.tiho-hannover.de/eg/eurl_virus_db.htm.

  2. Whole genome sequences of a male and female supercentenarian, ages greater than 114 years.

    PubMed

    Sebastiani, Paola; Riva, Alberto; Montano, Monty; Pham, Phillip; Torkamani, Ali; Scherba, Eugene; Benson, Gary; Milton, Jacqueline N; Baldwin, Clinton T; Andersen, Stacy; Schork, Nicholas J; Steinberg, Martin H; Perls, Thomas T

    2011-01-01

    Supercentenarians (age 110+ years old) generally delay or escape age-related diseases and disability well beyond the age of 100 and this exceptional survival is likely to be influenced by a genetic predisposition that includes both common and rare genetic variants. In this report, we describe the complete genomic sequences of male and female supercentenarians, both age >114 years old. We show that: (1) the sequence variant spectrum of these two individuals' DNA sequences is largely comparable to existing non-supercentenarian genomes; (2) the two individuals do not appear to carry most of the well-established human longevity enabling variants already reported in the literature; (3) they have a comparable number of known disease-associated variants relative to most human genomes sequenced to-date; (4) approximately 1% of the variants these individuals possess are novel and may point to new genes involved in exceptional longevity; and (5) both individuals are enriched for coding variants near longevity-associated variants that we discovered through a large genome-wide association study. These analyses suggest that there are both common and rare longevity-associated variants that may counter the effects of disease-predisposing variants and extend lifespan. The continued analysis of the genomes of these and other rare individuals who have survived to extremely old ages should provide insight into the processes that contribute to the maintenance of health during extreme aging.

  3. Whole Genome Sequences of a Male and Female Supercentenarian, Ages Greater than 114 Years

    PubMed Central

    Sebastiani, Paola; Riva, Alberto; Montano, Monty; Pham, Phillip; Torkamani, Ali; Scherba, Eugene; Benson, Gary; Milton, Jacqueline N.; Baldwin, Clinton T.; Andersen, Stacy; Schork, Nicholas J.; Steinberg, Martin H.; Perls, Thomas T.

    2012-01-01

    Supercentenarians (age 110+ years old) generally delay or escape age-related diseases and disability well beyond the age of 100 and this exceptional survival is likely to be influenced by a genetic predisposition that includes both common and rare genetic variants. In this report, we describe the complete genomic sequences of male and female supercentenarians, both age >114 years old. We show that: (1) the sequence variant spectrum of these two individuals’ DNA sequences is largely comparable to existing non-supercentenarian genomes; (2) the two individuals do not appear to carry most of the well-established human longevity enabling variants already reported in the literature; (3) they have a comparable number of known disease-associated variants relative to most human genomes sequenced to-date; (4) approximately 1% of the variants these individuals possess are novel and may point to new genes involved in exceptional longevity; and (5) both individuals are enriched for coding variants near longevity-associated variants that we discovered through a large genome-wide association study. These analyses suggest that there are both common and rare longevity-associated variants that may counter the effects of disease-predisposing variants and extend lifespan. The continued analysis of the genomes of these and other rare individuals who have survived to extremely old ages should provide insight into the processes that contribute to the maintenance of health during extreme aging. PMID:22303384

  4. Information Topics of Greatest Interest for Return of Genome Sequencing Results among Women Diagnosed with Breast Cancer at a Young Age.

    PubMed

    Seo, Joann; Ivanovich, Jennifer; Goodman, Melody S; Biesecker, Barbara B; Kaphingst, Kimberly A

    2017-06-01

    We investigated what information women diagnosed with breast cancer at a young age would want to learn when genome sequencing results are returned. We conducted 60 semi-structured interviews with women diagnosed with breast cancer at age 40 or younger. We examined what specific information participants would want to learn across result types and for each type of result, as well as how much information they would want. Genome sequencing was not offered to participants as part of the study. Two coders independently coded interview transcripts; analysis was conducted using NVivo10. Across result types, participants wanted to learn about health implications, risk and prevalence in quantitative terms, causes of variants, and causes of diseases. Participants wanted to learn actionable information for variants affecting risk of preventable or treatable disease, medication response, and carrier status. The amount of desired information differed for variants affecting risk of unpreventable or untreatable disease, with uncertain significance, and not health-related. Women diagnosed with breast cancer at a young age recognize the value of genome sequencing results in identifying potential causes and effective treatments and expressed interest in using the information to help relatives and to further understand their other health risks. Our findings can inform the development of effective feedback strategies for genome sequencing that meet patients' information needs and preferences.

  5. Rare variants in RTEL1 are associated with familial interstitial pneumonia.

    PubMed

    Cogan, Joy D; Kropski, Jonathan A; Zhao, Min; Mitchell, Daphne B; Rives, Lynette; Markin, Cheryl; Garnett, Errine T; Montgomery, Keri H; Mason, Wendi R; McKean, David F; Powers, Julia; Murphy, Elissa; Olson, Lana M; Choi, Leena; Cheng, Dong-Sheng; Blue, Elizabeth Marchani; Young, Lisa R; Lancaster, Lisa H; Steele, Mark P; Brown, Kevin K; Schwarz, Marvin I; Fingerlin, Tasha E; Schwartz, David A; Lawson, William E; Loyd, James E; Zhao, Zhongming; Phillips, John A; Blackwell, Timothy S

    2015-03-15

    Up to 20% of cases of idiopathic interstitial pneumonia cluster in families, comprising the syndrome of familial interstitial pneumonia (FIP); however, the genetic basis of FIP remains uncertain in most families. To determine if new disease-causing rare genetic variants could be identified using whole-exome sequencing of affected members from FIP families, providing additional insights into disease pathogenesis. Affected subjects from 25 kindreds were selected from an ongoing FIP registry for whole-exome sequencing from genomic DNA. Candidate rare variants were confirmed by Sanger sequencing, and cosegregation analysis was performed in families, followed by additional sequencing of affected individuals from another 163 kindreds. We identified a potentially damaging rare variant in the gene encoding for regulator of telomere elongation helicase 1 (RTEL1) that segregated with disease and was associated with very short telomeres in peripheral blood mononuclear cells in 1 of 25 families in our original whole-exome sequencing cohort. Evaluation of affected individuals in 163 additional kindreds revealed another eight families (4.7%) with heterozygous rare variants in RTEL1 that segregated with clinical FIP. Probands and unaffected carriers of these rare variants had short telomeres (<10% for age) in peripheral blood mononuclear cells and increased T-circle formation, suggesting impaired RTEL1 function. Rare loss-of-function variants in RTEL1 represent a newly defined genetic predisposition for FIP, supporting the importance of telomere-related pathways in pulmonary fibrosis.

  6. Candidate Sequence Variants and Fetal Hemoglobin in Children with Sickle Cell Disease Treated with Hydroxyurea

    PubMed Central

    Green, Nancy S.; Ender, Katherine L.; Pashankar, Farzana; Driscoll, Catherine; Giardina, Patricia J.; Mullen, Craig A.; Clark, Lorraine N.; Manwani, Deepa; Crotty, Jennifer; Kisselev, Sergey; Neville, Kathleen A.; Hoppe, Carolyn; Barral, Sandra

    2013-01-01

    Background Fetal hemoglobin level is a heritable complex trait that strongly correlates swith the clinical severity of sickle cell disease. Only few genetic loci have been identified as robustly associated with fetal hemoglobin in patients with sickle cell disease, primarily adults. The sole approved pharmacologic therapy for this disease is hydroxyurea, with effects largely attributable to induction of fetal hemoglobin. Methodology/Principal Findings In a multi-site observational analysis of children with sickle cell disease, candidate single nucleotide polymorphisms associated with baseline fetal hemoglobin levels in adult sickle cell disease were examined in children at baseline and induced by hydroxyurea therapy. For baseline levels, single marker analysis demonstrated significant association with BCL11A and the beta and epsilon globin loci (HBB and HBE, respectively), with an additive attributable variance from these loci of 23%. Among a subset of children on hydroxyurea, baseline fetal hemoglobin levels explained 33% of the variance in induced levels. The variant in HBE accounted for an additional 13% of the variance in induced levels, while variants in the HBB and BCL11A loci did not contribute beyond baseline levels. Conclusions/Significance These findings clarify the overlap between baseline and hydroxyurea-induced fetal hemoglobin levels in pediatric disease. Studies assessing influences of specific sequence variants in these and other genetic loci in larger populations and in unusual hydroxyurea responders are needed to further understand the maintenance and therapeutic induction of fetal hemoglobin in pediatric sickle cell disease. PMID:23409025

  7. Outcome of ABCA4 disease-associated alleles in autosomal recessive retinal dystrophies: retrospective analysis in 420 Spanish families.

    PubMed

    Riveiro-Alvarez, Rosa; Lopez-Martinez, Miguel-Angel; Zernant, Jana; Aguirre-Lamban, Jana; Cantalapiedra, Diego; Avila-Fernandez, Almudena; Gimenez, Ascension; Lopez-Molina, Maria-Isabel; Garcia-Sandoval, Blanca; Blanco-Kelly, Fiona; Corton, Marta; Tatu, Sorina; Fernandez-San Jose, Patricia; Trujillo-Tiebas, Maria-Jose; Ramos, Carmen; Allikmets, Rando; Ayuso, Carmen

    2013-11-01

    To provide a comprehensive overview of all detected mutations in the ABCA4 gene in Spanish families with autosomal recessive retinal disorders, including Stargardt's disease (arSTGD), cone-rod dystrophy (arCRD), and retinitis pigmentosa (arRP), and to assess genotype-phenotype correlation and disease progression in 10 years by considering the type of variants and age at onset. Case series. A total of 420 unrelated Spanish families: 259 arSTGD, 86 arCRD, and 75 arRP. Spanish families were analyzed through a combination of ABCR400 genotyping microarray, denaturing high-performance liquid chromatography, and high-resolution melting scanning. Direct sequencing was used as a confirmation technique for the identified variants. Screening by multiple ligation probe analysis was used to detect possible large deletions or insertions in the ABCA4 gene. Selected families were analyzed further by next generation sequencing. DNA sequence variants, mutation detection rates, haplotypes, age at onset, central or peripheral vision loss, and night blindness. Overall, we detected 70.5% and 36.6% of all expected ABCA4 mutations in arSTGD and arCRD patient cohorts, respectively. In the fraction of the cohort where the ABCA4 gene was sequenced completely, the detection rates reached 73.6% for arSTGD and 66.7% for arCRD. However, the frequency of possibly pathogenic ABCA4 alleles in arRP families was only slightly higher than that in the general population. Moreover, in some families, mutations in other known arRP genes segregated with the disease phenotype. An increasing understanding of causal ABCA4 alleles in arSTGD and arCRD facilitates disease diagnosis and prognosis and also is paramount in selecting patients for emerging clinical trials of therapeutic interventions. Because ABCA4-associated diseases are evolving retinal dystrophies, assessment of age at onset, accurate clinical diagnosis, and genetic testing are crucial. We suggest that ABCA4 mutations may be associated with a retinitis pigmentosa-like phenotype often as a consequence of severe (null) mutations, in cases of long-term, advanced disease, or both. Patients with classical arRP phenotypes, especially from the onset of the disease, should be screened first for mutations in known arRP genes and not ABCA4. Copyright © 2013 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.

  8. Comparative genomic analysis of full genome sequences of two closely related isolates of Clostridium perfringens reveals regions of genome plasticity with prevention potential

    USDA-ARS?s Scientific Manuscript database

    The spore-forming anaerobic Clostridium perfringens (CP) is the primary etiological agent of necrotic enteritis (NE) disease, one of priority enteric diseases in chickens which is responsible for annual losses of $6 billion in the US poultry industry. Our long term goal is to develop a recombinant v...

  9. Bioinformatics and molecular analysis of the evolutionary relationship between bovine rhinitis A viruses and foot-and-mouth disease virus

    USDA-ARS?s Scientific Manuscript database

    Bovine rhinitis viruses (BRV) cause mild respiratory disease of cattle. In this study, a near full length genome sequence of a virus named RS3X, formerly classified as bovine rhinovirus type 1, isolated from infected cattle from the United Kingdom in the 1960s, was obtained and analyzed. Phylogeneti...

  10. Transfer of genetic therapy across human populations: molecular targets for increasing patient coverage in repeat expansion diseases

    PubMed Central

    Varela, Miguel A; Curtis, Helen J; Douglas, Andrew GL; Hammond, Suzan M; O'Loughlin, Aisling J; Sobrido, Maria J; Scholefield, Janine; Wood, Matthew JA

    2016-01-01

    Allele-specific gene therapy aims to silence expression of mutant alleles through targeting of disease-linked single-nucleotide polymorphisms (SNPs). However, SNP linkage to disease varies between populations, making such molecular therapies applicable only to a subset of patients. Moreover, not all SNPs have the molecular features necessary for potent gene silencing. Here we provide knowledge to allow the maximisation of patient coverage by building a comprehensive understanding of SNPs ranked according to their predicted suitability toward allele-specific silencing in 14 repeat expansion diseases: amyotrophic lateral sclerosis and frontotemporal dementia, dentatorubral-pallidoluysian atrophy, myotonic dystrophy 1, myotonic dystrophy 2, Huntington's disease and several spinocerebellar ataxias. Our systematic analysis of DNA sequence variation shows that most annotated SNPs are not suitable for potent allele-specific silencing across populations because of suboptimal sequence features and low variability (>97% in HD). We suggest maximising patient coverage by selecting SNPs with high heterozygosity across populations, and preferentially targeting SNPs that lead to purine:purine mismatches in wild-type alleles to obtain potent allele-specific silencing. We therefore provide fundamental knowledge on strategies for optimising patient coverage of therapeutics for microsatellite expansion disorders by linking analysis of population genetic variation to the selection of molecular targets. PMID:25990798

  11. Transfer of genetic therapy across human populations: molecular targets for increasing patient coverage in repeat expansion diseases.

    PubMed

    Varela, Miguel A; Curtis, Helen J; Douglas, Andrew G L; Hammond, Suzan M; O'Loughlin, Aisling J; Sobrido, Maria J; Scholefield, Janine; Wood, Matthew J A

    2016-02-01

    Allele-specific gene therapy aims to silence expression of mutant alleles through targeting of disease-linked single-nucleotide polymorphisms (SNPs). However, SNP linkage to disease varies between populations, making such molecular therapies applicable only to a subset of patients. Moreover, not all SNPs have the molecular features necessary for potent gene silencing. Here we provide knowledge to allow the maximisation of patient coverage by building a comprehensive understanding of SNPs ranked according to their predicted suitability toward allele-specific silencing in 14 repeat expansion diseases: amyotrophic lateral sclerosis and frontotemporal dementia, dentatorubral-pallidoluysian atrophy, myotonic dystrophy 1, myotonic dystrophy 2, Huntington's disease and several spinocerebellar ataxias. Our systematic analysis of DNA sequence variation shows that most annotated SNPs are not suitable for potent allele-specific silencing across populations because of suboptimal sequence features and low variability (>97% in HD). We suggest maximising patient coverage by selecting SNPs with high heterozygosity across populations, and preferentially targeting SNPs that lead to purine:purine mismatches in wild-type alleles to obtain potent allele-specific silencing. We therefore provide fundamental knowledge on strategies for optimising patient coverage of therapeutics for microsatellite expansion disorders by linking analysis of population genetic variation to the selection of molecular targets.

  12. ENPP1 Mutation Causes Recessive Cole Disease by Altering Melanogenesis.

    PubMed

    Chourabi, Marwa; Liew, Mei Shan; Lim, Shawn; H'mida-Ben Brahim, Dorra; Boussofara, Lobna; Dai, Liang; Wong, Pui Mun; Foo, Jia Nee; Sriha, Badreddine; Robinson, Kim Samirah; Denil, Simon; Common, John Ea; Mamaï, Ons; Ben Khalifa, Youcef; Bollen, Mathieu; Liu, Jianjun; Denguezli, Mohamed; Bonnard, Carine; Saad, Ali; Reversade, Bruno

    2018-02-01

    Cole disease is a genodermatosis of pigmentation following a strict dominant mode of inheritance. In this study, we investigated eight patients affected with an overlapping genodermatosis after recessive inheritance. The patients presented with hypo- and hyperpigmented macules over the body, resembling dyschromatosis universalis hereditaria in addition to punctuate palmoplantar keratosis. By homozygosity mapping and whole-exome sequencing, a biallelic p.Cys120Arg mutation in ectonucleotide pyrophosphatase/phosphodiesterase 1 (ENPP1) was identified in all patients. We found that this mutation, like those causing dominant Cole disease, impairs homodimerization of the ENPP1 enzyme that is mediated by its two somatomedin-B-like domains. Histological analysis revealed structural and molecular changes in affected skin that were likely to originate from defective melanocytes because keratinocytes do not express ENPP1. Consistently, RNA-sequencing analysis of patient-derived primary melanocytes revealed alterations in melanocyte development and in pigmentation signaling pathways. We therefore conclude that germline ENPP1 cysteine-specific mutations, primarily affecting the melanocyte lineage, cause a clinical spectrum of dyschromatosis, in which the p.Cys120Arg allele represents a recessive and more severe form of Cole disease. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  13. The Application of Next-Generation Sequencing for Mutation Detection in Autosomal-Dominant Hereditary Hearing Impairment.

    PubMed

    Gürtler, Nicolas; Röthlisberger, Benno; Ludin, Katja; Schlegel, Christoph; Lalwani, Anil K

    2017-07-01

    Identification of the causative mutation using next-generation sequencing in autosomal-dominant hereditary hearing impairment, as mutation analysis in hereditary hearing impairment by classic genetic methods, is hindered by the high heterogeneity of the disease. Two Swiss families with autosomal-dominant hereditary hearing impairment. Amplified DNA libraries for next-generation sequencing were constructed from extracted genomic DNA, derived from peripheral blood, and enriched by a custom-made sequence capture library. Validated, pooled libraries were sequenced on an Illumina MiSeq instrument, 300 cycles and paired-end sequencing. Technical data analysis was performed with SeqMonk, variant analysis with GeneTalk or VariantStudio. The detection of mutations in genes related to hearing loss by next-generation sequencing was subsequently confirmed using specific polymerase-chain-reaction and Sanger sequencing. Mutation detection in hearing-loss-related genes. The first family harbored the mutation c.5383+5delGTGA in the TECTA-gene. In the second family, a novel mutation c.2614-2625delCATGGCGCCGTG in the WFS1-gene and a second mutation TCOF1-c.1028G>A were identified. Next-generation sequencing successfully identified the causative mutation in families with autosomal-dominant hereditary hearing impairment. The results helped to clarify the pathogenic role of a known mutation and led to the detection of a novel one. NGS represents a feasible approach with great potential future in the diagnostics of hereditary hearing impairment, even in smaller labs.

  14. Molecular characterization of a distinct begomovirus species from Vernonia cinerea and its associated DNA-beta using the bacteriophage Phi 29 DNA polymerase.

    PubMed

    Packialakshmi, R M; Srivastava, N; Girish, K R; Usha, R

    2010-08-01

    Vernonia cinerea plants with yellow vein symptoms were collected around crop fields in Madurai. A portion (550 bp) of the AV1 gene amplified using degenerate primers from the total DNA purified from diseased leaf sample was cloned and sequenced. Specific primers derived from the above sequence were used to amplify 2,745 nucleotides with the typical genome organization of begomoviral DNA A (EMBL Accession No. AM182232). Sequence comparison with other begomoviruses revealed the greatest identity (82.4%) with Emilia yellow vein virus (EmYVV-[Fz1]) from China and less than 80% with all other known begomoviruses. The International Committee on Taxonomy of Viruses (ICTV) has therefore recognized Vernonia yellow vein virus (VeYVV) as a distinct begomovirus species. Conventional PCR could not amplify the DNA B or DNA beta from the diseased tissue. However, the beta DNA (1364 bp) associated with the disease was obtained (Accession No. FN435836) by the rolling circle amplification-restriction fragment length polymorphism method (RCA-RFLP) using Phi 29 DNA polymerase. Sequence analysis shows that DNA beta of VeYVV has the highest identity (56.8%) with DNA beta of Sigesbeckia yellow vein Guangxi betasatellite (SibYVGxB-[CN: Gx111:05]) and 56-53% with DNA beta associated with other begomoviruses. This is the first report of the molecular characterization of VeYVV from V. cinerea in India. The complete molecular characterization, phylogenetic analysis, and putative recombination events in VeYVV are reported.

  15. A KCNH2 branch point mutation causing aberrant splicing contributes to an explanation of genotype-negative long QT syndrome.

    PubMed

    Crotti, Lia; Lewandowska, Marzena A; Schwartz, Peter J; Insolia, Roberto; Pedrazzini, Matteo; Bussani, Erica; Dagradi, Federica; George, Alfred L; Pagani, Franco

    2009-02-01

    Genetic screening of long QT syndrome (LQTS) fails to identify disease-causing mutations in about 30% of patients. So far, molecular screening has focused mainly on coding sequence mutations or on substitutions at canonical splice sites. The purpose of this study was to explore the possibility that intronic variants not at canonical splice sites might affect splicing regulatory elements, lead to aberrant transcripts, and cause LQTS. Molecular screening was performed through DHPLC and sequence analysis. The role of the intronic mutation identified was assessed with a hybrid minigene splicing assay. A three-generation LQTS family was investigated. Molecular screening failed to identify an obvious disease-causing mutation in the coding sequences of the major LQTS genes but revealed an intronic A-to-G substitution in KCNH2 (IVS9-28A/G) cosegregating with the clinical phenotype in family members. In vitro analysis proved that the mutation disrupts the acceptor splice site definition by affecting the branch point (BP) sequence and promoting intron retention. We further demonstrated a tight functional relationship between the BP and the polypyrimidine tract, whose weakness is responsible for the pathological effect of the IVS9-28A/G mutation. We identified a novel BP mutation in KCNH2 that disrupts the intron 9 acceptor splice site definition and causes LQT2. The present finding demonstrates that intronic mutations affecting pre-mRNA processing may contribute to the failure of traditional molecular screening in identifying disease-causing mutations in LQTS subjects and offers a rationale strategy for the reduction of genotype-negative cases.

  16. Integrating 400 million variants from 80,000 human samples with extensive annotations: towards a knowledge base to analyze disease cohorts.

    PubMed

    Hakenberg, Jörg; Cheng, Wei-Yi; Thomas, Philippe; Wang, Ying-Chih; Uzilov, Andrew V; Chen, Rong

    2016-01-08

    Data from a plethora of high-throughput sequencing studies is readily available to researchers, providing genetic variants detected in a variety of healthy and disease populations. While each individual cohort helps gain insights into polymorphic and disease-associated variants, a joint perspective can be more powerful in identifying polymorphisms, rare variants, disease-associations, genetic burden, somatic variants, and disease mechanisms. We have set up a Reference Variant Store (RVS) containing variants observed in a number of large-scale sequencing efforts, such as 1000 Genomes, ExAC, Scripps Wellderly, UK10K; various genotyping studies; and disease association databases. RVS holds extensive annotations pertaining to affected genes, functional impacts, disease associations, and population frequencies. RVS currently stores 400 million distinct variants observed in more than 80,000 human samples. RVS facilitates cross-study analysis to discover novel genetic risk factors, gene-disease associations, potential disease mechanisms, and actionable variants. Due to its large reference populations, RVS can also be employed for variant filtration and gene prioritization. A web interface to public datasets and annotations in RVS is available at https://rvs.u.hpc.mssm.edu/.

  17. Identification of mutations in Colombian patients affected with Fabry disease.

    PubMed

    Uribe, Alfredo; Mateus, Heidi Eliana; Prieto, Juan Carlos; Palacios, Maria Fernanda; Ospina, Sandra Yaneth; Pasqualim, Gabriela; da Silveira Matte, Ursula; Giugliani, Roberto

    2015-12-15

    Fabry Disease (FD) is an X-linked inborn error of glycosphingolipid catabolism, caused by a deficiency of the lisosomal α-galactosidase A (AGAL). The disorder leads to a vascular disease secondary to the involvement of kidney, heart and the central nervous system. The mutation analysis is a valuable tool for diagnosis and genetic counseling. Although more than 600 mutations have been identified, most mutations are private. Our objective was to describe the analysis of nine Colombian patients with Fabry disease by automated sequencing of the seven exons of the GLA gene. Two novel mutations were identified in two patients affected with the classical subtype of FD, in addition to other 6 mutations previously reported. The present study confirms the heterogeneity of mutations in Fabry disease and the importance of molecular analysis for genetic counseling, female heterozygotes detection as well as therapeutic decisions. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. Genetic diagnosis of developmental disorders in the DDD study: a scalable analysis of genome-wide research data.

    PubMed

    Wright, Caroline F; Fitzgerald, Tomas W; Jones, Wendy D; Clayton, Stephen; McRae, Jeremy F; van Kogelenberg, Margriet; King, Daniel A; Ambridge, Kirsty; Barrett, Daniel M; Bayzetinova, Tanya; Bevan, A Paul; Bragin, Eugene; Chatzimichali, Eleni A; Gribble, Susan; Jones, Philip; Krishnappa, Netravathi; Mason, Laura E; Miller, Ray; Morley, Katherine I; Parthiban, Vijaya; Prigmore, Elena; Rajan, Diana; Sifrim, Alejandro; Swaminathan, G Jawahar; Tivey, Adrian R; Middleton, Anna; Parker, Michael; Carter, Nigel P; Barrett, Jeffrey C; Hurles, Matthew E; FitzPatrick, David R; Firth, Helen V

    2015-04-04

    Human genome sequencing has transformed our understanding of genomic variation and its relevance to health and disease, and is now starting to enter clinical practice for the diagnosis of rare diseases. The question of whether and how some categories of genomic findings should be shared with individual research participants is currently a topic of international debate, and development of robust analytical workflows to identify and communicate clinically relevant variants is paramount. The Deciphering Developmental Disorders (DDD) study has developed a UK-wide patient recruitment network involving over 180 clinicians across all 24 regional genetics services, and has performed genome-wide microarray and whole exome sequencing on children with undiagnosed developmental disorders and their parents. After data analysis, pertinent genomic variants were returned to individual research participants via their local clinical genetics team. Around 80,000 genomic variants were identified from exome sequencing and microarray analysis in each individual, of which on average 400 were rare and predicted to be protein altering. By focusing only on de novo and segregating variants in known developmental disorder genes, we achieved a diagnostic yield of 27% among 1133 previously investigated yet undiagnosed children with developmental disorders, whilst minimising incidental findings. In families with developmentally normal parents, whole exome sequencing of the child and both parents resulted in a 10-fold reduction in the number of potential causal variants that needed clinical evaluation compared to sequencing only the child. Most diagnostic variants identified in known genes were novel and not present in current databases of known disease variation. Implementation of a robust translational genomics workflow is achievable within a large-scale rare disease research study to allow feedback of potentially diagnostic findings to clinicians and research participants. Systematic recording of relevant clinical data, curation of a gene-phenotype knowledge base, and development of clinical decision support software are needed in addition to automated exclusion of almost all variants, which is crucial for scalable prioritisation and review of possible diagnostic variants. However, the resource requirements of development and maintenance of a clinical reporting system within a research setting are substantial. Health Innovation Challenge Fund, a parallel funding partnership between the Wellcome Trust and the UK Department of Health. Copyright © 2015 Wright et al. Open Access article distributed under the terms of CC BY. Published by Elsevier Ltd. All rights reserved.

  19. Genetic diagnosis of developmental disorders in the DDD study: a scalable analysis of genome-wide research data

    PubMed Central

    Wright, Caroline F; Fitzgerald, Tomas W; Jones, Wendy D; Clayton, Stephen; McRae, Jeremy F; van Kogelenberg, Margriet; King, Daniel A; Ambridge, Kirsty; Barrett, Daniel M; Bayzetinova, Tanya; Bevan, A Paul; Bragin, Eugene; Chatzimichali, Eleni A; Gribble, Susan; Jones, Philip; Krishnappa, Netravathi; Mason, Laura E; Miller, Ray; Morley, Katherine I; Parthiban, Vijaya; Prigmore, Elena; Rajan, Diana; Sifrim, Alejandro; Swaminathan, G Jawahar; Tivey, Adrian R; Middleton, Anna; Parker, Michael; Carter, Nigel P; Barrett, Jeffrey C; Hurles, Matthew E; FitzPatrick, David R; Firth, Helen V

    2015-01-01

    Summary Background Human genome sequencing has transformed our understanding of genomic variation and its relevance to health and disease, and is now starting to enter clinical practice for the diagnosis of rare diseases. The question of whether and how some categories of genomic findings should be shared with individual research participants is currently a topic of international debate, and development of robust analytical workflows to identify and communicate clinically relevant variants is paramount. Methods The Deciphering Developmental Disorders (DDD) study has developed a UK-wide patient recruitment network involving over 180 clinicians across all 24 regional genetics services, and has performed genome-wide microarray and whole exome sequencing on children with undiagnosed developmental disorders and their parents. After data analysis, pertinent genomic variants were returned to individual research participants via their local clinical genetics team. Findings Around 80 000 genomic variants were identified from exome sequencing and microarray analysis in each individual, of which on average 400 were rare and predicted to be protein altering. By focusing only on de novo and segregating variants in known developmental disorder genes, we achieved a diagnostic yield of 27% among 1133 previously investigated yet undiagnosed children with developmental disorders, whilst minimising incidental findings. In families with developmentally normal parents, whole exome sequencing of the child and both parents resulted in a 10-fold reduction in the number of potential causal variants that needed clinical evaluation compared to sequencing only the child. Most diagnostic variants identified in known genes were novel and not present in current databases of known disease variation. Interpretation Implementation of a robust translational genomics workflow is achievable within a large-scale rare disease research study to allow feedback of potentially diagnostic findings to clinicians and research participants. Systematic recording of relevant clinical data, curation of a gene–phenotype knowledge base, and development of clinical decision support software are needed in addition to automated exclusion of almost all variants, which is crucial for scalable prioritisation and review of possible diagnostic variants. However, the resource requirements of development and maintenance of a clinical reporting system within a research setting are substantial. Funding Health Innovation Challenge Fund, a parallel funding partnership between the Wellcome Trust and the UK Department of Health. PMID:25529582

  20. Phenotype analysis of congenital and neurodevelopmental disorders in the next generation sequencing era.

    PubMed

    Carey, John C

    2017-09-01

    The designation, phenotype, was proposed as a term by Wilhelm Johannsen in 1909. The word is derived from the Greek, phano (showing) and typo (type), phanotypos. Phenotype has become a widely recognized term, even outside of the genetics community, in recent years with the ongoing identification of human disease genes. The term has been defined as the observable constitution of an organism, but sometimes refers to a condition when a person has a particular clinical presentation. Analysis of phenotype is a timely theme because advances in the understanding of the genetic basis of human disease and the emergence of next generation sequencing have spurred a renewed interest in phenotype and the proposal to establish a "Human Phenome Project." This article summarizes the principles of phenotype analysis that are important in medical genetics and describes approaches to comprehensive phenotype analysis in the investigation of patients with human disorders. I discuss the various elements related to disease phenotypes and highlight neurofibromatosis type 1 and the Elements of Morphology Project as illustrations of the principles. In recent years, the notion of "deep phenotyping" has emerged. Currently there are now a number of proposed strategies and resources to approach this concept. Not since the 1960s and 1970s has there been such an exciting time in the history of medicine surrounding the analysis of phenotype in genetic disorders. © 2017 Wiley Periodicals, Inc.

  1. PrionHome: a database of prions and other sequences relevant to prion phenomena.

    PubMed

    Harbi, Djamel; Parthiban, Marimuthu; Gendoo, Deena M A; Ehsani, Sepehr; Kumar, Manish; Schmitt-Ulms, Gerold; Sowdhamini, Ramanathan; Harrison, Paul M

    2012-01-01

    Prions are units of propagation of an altered state of a protein or proteins; prions can propagate from organism to organism, through cooption of other protein copies. Prions contain no necessary nucleic acids, and are important both as both pathogenic agents, and as a potential force in epigenetic phenomena. The original prions were derived from a misfolded form of the mammalian Prion Protein PrP. Infection by these prions causes neurodegenerative diseases. Other prions cause non-Mendelian inheritance in budding yeast, and sometimes act as diseases of yeast. We report the bioinformatic construction of the PrionHome, a database of >2000 prion-related sequences. The data was collated from various public and private resources and filtered for redundancy. The data was then processed according to a transparent classification system of prionogenic sequences (i.e., sequences that can make prions), prionoids (i.e., proteins that propagate like prions between individual cells), and other prion-related phenomena. There are eight PrionHome classifications for sequences. The first four classifications are derived from experimental observations: prionogenic sequences, prionoids, other prion-related phenomena, and prion interactors. The second four classifications are derived from sequence analysis: orthologs, paralogs, pseudogenes, and candidate-prionogenic sequences. Database entries list: supporting information for PrionHome classifications, prion-determinant areas (where relevant), and disordered and compositionally-biased regions. Also included are literature references for the PrionHome classifications, transcripts and genomic coordinates, and structural data (including comparative models made for the PrionHome from manually curated alignments). We provide database usage examples for both vertebrate and fungal prion contexts. Using the database data, we have performed a detailed analysis of the compositional biases in known budding-yeast prionogenic sequences, showing that the only abundant bias pattern is for asparagine bias with subsidiary serine bias. We anticipate that this database will be a useful experimental aid and reference resource. It is freely available at: http://libaio.biol.mcgill.ca/prion.

  2. PrionHome: A Database of Prions and Other Sequences Relevant to Prion Phenomena

    PubMed Central

    Harbi, Djamel; Parthiban, Marimuthu; Gendoo, Deena M. A.; Ehsani, Sepehr; Kumar, Manish; Schmitt-Ulms, Gerold; Sowdhamini, Ramanathan; Harrison, Paul M.

    2012-01-01

    Prions are units of propagation of an altered state of a protein or proteins; prions can propagate from organism to organism, through cooption of other protein copies. Prions contain no necessary nucleic acids, and are important both as both pathogenic agents, and as a potential force in epigenetic phenomena. The original prions were derived from a misfolded form of the mammalian Prion Protein PrP. Infection by these prions causes neurodegenerative diseases. Other prions cause non-Mendelian inheritance in budding yeast, and sometimes act as diseases of yeast. We report the bioinformatic construction of the PrionHome, a database of >2000 prion-related sequences. The data was collated from various public and private resources and filtered for redundancy. The data was then processed according to a transparent classification system of prionogenic sequences (i.e., sequences that can make prions), prionoids (i.e., proteins that propagate like prions between individual cells), and other prion-related phenomena. There are eight PrionHome classifications for sequences. The first four classifications are derived from experimental observations: prionogenic sequences, prionoids, other prion-related phenomena, and prion interactors. The second four classifications are derived from sequence analysis: orthologs, paralogs, pseudogenes, and candidate-prionogenic sequences. Database entries list: supporting information for PrionHome classifications, prion-determinant areas (where relevant), and disordered and compositionally-biased regions. Also included are literature references for the PrionHome classifications, transcripts and genomic coordinates, and structural data (including comparative models made for the PrionHome from manually curated alignments). We provide database usage examples for both vertebrate and fungal prion contexts. Using the database data, we have performed a detailed analysis of the compositional biases in known budding-yeast prionogenic sequences, showing that the only abundant bias pattern is for asparagine bias with subsidiary serine bias. We anticipate that this database will be a useful experimental aid and reference resource. It is freely available at: http://libaio.biol.mcgill.ca/prion. PMID:22363733

  3. Complete genomic sequence of a Tobacco rattle virus isolate from Michigan-grown potatoes.

    PubMed

    Crosslin, James M; Hamm, Philip B; Kirk, William W; Hammond, Rosemarie W

    2010-04-01

    Tobacco rattle virus (TRV) causes stem mottle on potato leaves and necrotic arcs and rings in potato tubers, known as corky ringspot disease. Recently, TRV was reported in Michigan potato tubers cv. FL1879 exhibiting corky ringspot disease. Sequence analysis of the RNA-1-encoded 16-kDa gene of the Michigan isolate, designated MI-1, revealed homology to TRV isolates from Florida and Washington. Here, we report the complete genomic sequence of RNA-1 (6,791 nt) and RNA-2 (3,685 nt) of TRV MI-1. RNA-1 is predicted to contain four open reading frames, and the genome structure and phylogenetic analyses of the RNA-1 nucleotide sequence revealed significant homologies to the known sequences of other TRV-1 isolates. The relationships based on the full-length nucleotide sequence were different from than those based on the 16-kDa gene encoded on genomic RNA-1 and reflect sequence variation within a 20-25-aa residue region of the 16-kDa protein. MI-1 RNA-2 is predicted to contain three ORFs, encoding the coat protein (CP), a 37.6-kDa protein (ORF 2b), and a 33.6-kDa protein (ORF 2c). In addition, it contains a region of similarity to the 3' terminus of RNA-1, including a truncated portion of the 16-kDa cistron. Phylogenetic analysis of RNA-2, based on a comparison of nucleotide sequences with other members of the genus Tobravirus, indicates that TRV MI-1 and other North American isolates cluster as a distinct group. TRV M1-1 is only the second North American isolate for which there is a complete sequence of the genome, and it is distinct from the North American isolate TRV ORY. The relationship of the TRV MI-1 isolate to other tobravirus isolates is discussed.

  4. Whole genome sequencing and phylogenetic analysis of Bluetongue virus serotype 2 strains isolated in the Americas including a novel strain from the western United States.

    PubMed

    Gaudreault, Natasha N; Mayo, Christie E; Jasperson, Dane C; Crossley, Beate M; Breitmeyer, Richard E; Johnson, Donna J; Ostlund, Eileen N; MacLachlan, N James; Wilson, William C

    2014-07-01

    Bluetongue is a potentially fatal arboviral disease of domestic and wild ruminants that is characterized by widespread edema and tissue necrosis. Bluetongue virus (BTV) serotypes 10, 11, 13, and 17 occur throughout much of the United States, whereas serotype 2 (BTV-2) was previously only detected in the southeastern United States. Since 1998, 10 other BTV serotypes have also been isolated from ruminants in the southeastern United States. In 2010, BTV-2 was identified in California for the first time, and preliminary sequence analysis indicated that the virus isolate was closely related to BTV strains circulating in the southeastern United States. In the current study, the whole genome sequence of the California strain of BTV-2 was compared with those of other BTV-2 strains in the Americas. The results of the analysis suggest co-circulation of genetically distinct viruses in the southeastern United States, and further suggest that the 2010 western isolate is closely related to southeastern strains of BTV. Although it remains uncertain as to how this novel virus was translocated to California, the findings of the current study underscore the need for ongoing surveillance of this economically important livestock disease.

  5. DNA Sequences Proximal to Human Mitochondrial DNA Deletion Breakpoints Prevalent in Human Disease Form G-quadruplexes, a Class of DNA Structures Inefficiently Unwound by the Mitochondrial Replicative Twinkle Helicase*

    PubMed Central

    Bharti, Sanjay Kumar; Sommers, Joshua A.; Zhou, Jun; Kaplan, Daniel L.; Spelbrink, Johannes N.; Mergny, Jean-Louis; Brosh, Robert M.

    2014-01-01

    Mitochondrial DNA deletions are prominent in human genetic disorders, cancer, and aging. It is thought that stalling of the mitochondrial replication machinery during DNA synthesis is a prominent source of mitochondrial genome instability; however, the precise molecular determinants of defective mitochondrial replication are not well understood. In this work, we performed a computational analysis of the human mitochondrial genome using the “Pattern Finder” G-quadruplex (G4) predictor algorithm to assess whether G4-forming sequences reside in close proximity (within 20 base pairs) to known mitochondrial DNA deletion breakpoints. We then used this information to map G4P sequences with deletions characteristic of representative mitochondrial genetic disorders and also those identified in various cancers and aging. Circular dichroism and UV spectral analysis demonstrated that mitochondrial G-rich sequences near deletion breakpoints prevalent in human disease form G-quadruplex DNA structures. A biochemical analysis of purified recombinant human Twinkle protein (gene product of c10orf2) showed that the mitochondrial replicative helicase inefficiently unwinds well characterized intermolecular and intramolecular G-quadruplex DNA substrates, as well as a unimolecular G4 substrate derived from a mitochondrial sequence that nests a deletion breakpoint described in human renal cell carcinoma. Although G4 has been implicated in the initiation of mitochondrial DNA replication, our current findings suggest that mitochondrial G-quadruplexes are also likely to be a source of instability for the mitochondrial genome by perturbing the normal progression of the mitochondrial replication machinery, including DNA unwinding by Twinkle helicase. PMID:25193669

  6. Bacterial community composition of chronic periodontitis and novel oral sampling sites for detecting disease indicators

    PubMed Central

    2014-01-01

    Background Periodontitis is an infectious and inflammatory disease of polymicrobial etiology that can lead to the destruction of bones and tissues that support the teeth. The management of chronic periodontitis (CP) relies heavily on elimination or at least control of known pathogenic consortia associated with the disease. Until now, microbial plaque obtained from the subgingival (SubG) sites has been the primary focus for bacterial community analysis using deep sequencing. In addition to the use of SubG plaque, here, we investigated whether plaque obtained from supragingival (SupG) and tongue dorsum sites can serve as alternatives for monitoring CP-associated bacterial biomarkers. Results Using SubG, SupG, and tongue plaque DNA from 11 healthy and 13 diseased subjects, we sequenced V3 regions (approximately 200 bases) of the 16S rRNA gene using Illumina sequencing. After quality filtering, approximately 4.1 million sequences were collapsed into operational taxonomic units (OTUs; sequence identity cutoff of >97%) that were classified to a total of 19 phyla spanning 114 genera. Bacterial community diversity and overall composition was not affected by health or disease, and multiresponse permutation procedure (MRPP) on Bray-Curtis distance measures only supported weakly distinct bacterial communities in SubG and tongue plaque depending on health or disease status (P < 0.05). Nonetheless, in SubG and tongue sites, the relative abundance of Firmicutes was increased significantly from health to disease and members of Synergistetes were found in higher abundance across all sites in disease. Taxa indicative of CP were identified in all three locations (for example, Treponema denticola, Porphyromonas gingivalis, Synergistes oral taxa 362 and 363). Conclusions For the first time, this study demonstrates that SupG and tongue dorsum plaque can serve as alternative sources for detecting and enumerating known and novel bacterial biomarkers of CP. This finding is clinically important because, in contrast with SubG sampling that requires trained professionals, obtaining plaque from SupG and tongue sites is convenient and minimally-invasive and offers a novel means to track CP-biomarker organisms during treatment outcome monitoring. PMID:25225610

  7. Phylogenetic relationship of Ornithobacterium rhinotracheale strains.

    PubMed

    DE Oca-Jimenez, Roberto Montes; Vega-Sanchez, Vicente; Morales-Erasto, Vladimir; Salgado-Miranda, Celene; Blackall, Patrick J; Soriano-Vargas, Edgardo

    2018-04-10

    The bacterium Ornithobacterium rhinotracheale is associated with respiratory disease in wild birds and poultry. In this study, the phylogenetic analysis of nine reference strains of O. rhinotracheale belonging to serovars A to I, and eight Mexican isolates belonging to serovar A, was performed. The analysis was extended to include available sequences from another 23 strains available in the public domain. The analysis showed that the 40 sequences formed six clusters, I to VI. All eight Mexican field isolates were placed in cluster I. One of the reference strains appears to present genetic diversity not previously recognized and was placed in a new genetic cluster. In conclusion, the phylogenetic analysis of O. rhinotracheale strains, based on the 16S rRNA gene, is a suitable tool for epidemiologic studies.

  8. Methods, Tools and Current Perspectives in Proteogenomics *

    PubMed Central

    Ruggles, Kelly V.; Krug, Karsten; Wang, Xiaojing; Clauser, Karl R.; Wang, Jing; Payne, Samuel H.; Fenyö, David; Zhang, Bing; Mani, D. R.

    2017-01-01

    With combined technological advancements in high-throughput next-generation sequencing and deep mass spectrometry-based proteomics, proteogenomics, i.e. the integrative analysis of proteomic and genomic data, has emerged as a new research field. Early efforts in the field were focused on improving protein identification using sample-specific genomic and transcriptomic sequencing data. More recently, integrative analysis of quantitative measurements from genomic and proteomic studies have identified novel insights into gene expression regulation, cell signaling, and disease. Many methods and tools have been developed or adapted to enable an array of integrative proteogenomic approaches and in this article, we systematically classify published methods and tools into four major categories, (1) Sequence-centric proteogenomics; (2) Analysis of proteogenomic relationships; (3) Integrative modeling of proteogenomic data; and (4) Data sharing and visualization. We provide a comprehensive review of methods and available tools in each category and highlight their typical applications. PMID:28456751

  9. Clavibacter michiganensis subsp. phaseoli subsp. nov., pathogenic in bean.

    PubMed

    González, Ana J; Trapiello, Estefanía

    2014-05-01

    A yellow Gram-reaction-positive bacterium isolated from bean seeds (Phaseolus vulgaris L.) was identified as Clavibacter michiganensis by 16S rRNA gene sequencing. Molecular methods were employed in order to identify the subspecies. Such methods included the amplification of specific sequences by PCR, 16S amplified rDNA restriction analysis (ARDRA), RFLP and multilocus sequence analysis as well as the analysis of biochemical and phenotypic traits including API 50CH and API ZYM results. The results showed that strain LPPA 982T did not represent any known subspecies of C. michiganensis. Pathogenicity tests revealed that the strain is a bean pathogen causing a newly identified bacterial disease that we name bacterial bean leaf yellowing. On the basis of these results, strain LPPA 982T is regarded as representing a novel subspecies for which the name Clavibacter michiganensis subsp. phaseoli subsp. nov. is proposed. The type strain is LPPA 982T (=CECT 8144T=LMG 27667T).

  10. High-Throughput Single-Cell RNA Sequencing and Data Analysis.

    PubMed

    Sagar; Herman, Josip Stefan; Pospisilik, John Andrew; Grün, Dominic

    2018-01-01

    Understanding biological systems at a single cell resolution may reveal several novel insights which remain masked by the conventional population-based techniques providing an average readout of the behavior of cells. Single-cell transcriptome sequencing holds the potential to identify novel cell types and characterize the cellular composition of any organ or tissue in health and disease. Here, we describe a customized high-throughput protocol for single-cell RNA-sequencing (scRNA-seq) combining flow cytometry and a nanoliter-scale robotic system. Since scRNA-seq requires amplification of a low amount of endogenous cellular RNA, leading to substantial technical noise in the dataset, downstream data filtering and analysis require special care. Therefore, we also briefly describe in-house state-of-the-art data analysis algorithms developed to identify cellular subpopulations including rare cell types as well as to derive lineage trees by ordering the identified subpopulations of cells along the inferred differentiation trajectories.

  11. Advanced diffusion MRI and biomarkers in the central nervous system: a new approach.

    PubMed

    Martín Noguerol, T; Martínez Barbero, J P

    The introduction of diffusion-weighted sequences has revolutionized the detection and characterization of central nervous system (CNS) disease. Nevertheless, the assessment of diffusion studies of the CNS is often limited to qualitative estimation. Moreover, the pathophysiological complexity of the different entities that affect the CNS cannot always be correctly explained through classical models. The development of new models for the analysis of diffusion sequences provides numerous parameters that enable a quantitative approach to both diagnosis and prognosis as well as to monitoring the response to treatment; these parameters can be considered potential biomarkers of health and disease. In this update, we review the physical bases underlying diffusion studies and diffusion tensor imaging, advanced models for their analysis (intravoxel coherent motion and kurtosis), and the biological significance of the parameters derived. Copyright © 2017 SERAM. Publicado por Elsevier España, S.L.U. All rights reserved.

  12. The evolution of pigeon paramyxovirus type 1 (PPMV-1) in Great Britain: a molecular epidemiological study.

    PubMed

    Aldous, E W; Fuller, C M; Ridgeon, J H; Irvine, R M; Alexander, D J; Brown, I H

    2014-04-01

    Newcastle disease (ND), caused by virulent strains of avian paramyxovirus type 1 (APMV-1), is considered throughout the world as one of the most important animal diseases. For over three decades now, there has been a continuing panzootic caused by a variant virulent APMV-1 strain, so-called pigeon paramyxovirus type 1 (PPMV-1), primarily in racing pigeons, which has also spread to wild birds and poultry. PPMV-1 isolations have been made in Great Britain every year since 1983. In this study, we have completed a comparative phylogenetic analysis based on a 374 nucleotide section of the fusion protein gene of 63 isolates of PPMV-1 that were isolated over a 26-year period; 43 of these were sequenced for this study. Phylogenetic analysis of these sequences revealed that all were closely related and placed in the genetic sublineage 4b (VIb), subdivision 4biif. © 2012 Crown copyright.

  13. Secure Genomic Computation through Site-Wise Encryption

    PubMed Central

    Zhao, Yongan; Wang, XiaoFeng; Tang, Haixu

    2015-01-01

    Commercial clouds provide on-demand IT services for big-data analysis, which have become an attractive option for users who have no access to comparable infrastructure. However, utilizing these services for human genome analysis is highly risky, as human genomic data contains identifiable information of human individuals and their disease susceptibility. Therefore, currently, no computation on personal human genomic data is conducted on public clouds. To address this issue, here we present a site-wise encryption approach to encrypt whole human genome sequences, which can be subject to secure searching of genomic signatures on public clouds. We implemented this method within the Hadoop framework, and tested it on the case of searching disease markers retrieved from the ClinVar database against patients’ genomic sequences. The secure search runs only one order of magnitude slower than the simple search without encryption, indicating our method is ready to be used for secure genomic computation on public clouds. PMID:26306278

  14. Secure Genomic Computation through Site-Wise Encryption.

    PubMed

    Zhao, Yongan; Wang, XiaoFeng; Tang, Haixu

    2015-01-01

    Commercial clouds provide on-demand IT services for big-data analysis, which have become an attractive option for users who have no access to comparable infrastructure. However, utilizing these services for human genome analysis is highly risky, as human genomic data contains identifiable information of human individuals and their disease susceptibility. Therefore, currently, no computation on personal human genomic data is conducted on public clouds. To address this issue, here we present a site-wise encryption approach to encrypt whole human genome sequences, which can be subject to secure searching of genomic signatures on public clouds. We implemented this method within the Hadoop framework, and tested it on the case of searching disease markers retrieved from the ClinVar database against patients' genomic sequences. The secure search runs only one order of magnitude slower than the simple search without encryption, indicating our method is ready to be used for secure genomic computation on public clouds.

  15. Genetic characterization and phylogenetic analysis of porcine circovirus type 2 (PCV2) in Serbia.

    PubMed

    Savic, Bozidar; Milicevic, Vesna; Jakic-Dimic, Dobrila; Bojkovski, Jovan; Prodanovic, Radisa; Kureljusic, Branislav; Potkonjak, Aleksandar; Savic, Borivoje

    2012-01-01

    Porcine circovirus type 2 (PCV2) is the main causative agent of postweaning multisystemic wasting syndrome (PMWS). To characterize and determine the genetic diversity of PCV2 in the porcine population of Serbia, nucleotide and deduced amino acid sequences of the open reading frame 2 (ORF2) of PCV2 collected from the tissues of pigs that either had died as a result of PMWS or did not exhibit disease symptoms were analyzed. Sequencing and phylogenetic analysis showed considerable diversity among PCV2 ORF2 sequences and the existence of two main PCV2 genotypes, PCV2b and PCV2a, with at least three clusters, 1A/B, 1C and 2D. In order to provide further proof that the 1C strain is circulating in the porcine population, the whole viral genome of one PCV2 isolate was sequenced. Genotyping and phylogenetic analysis using the entire viral genome sequences confirmed that there was a PMWS-associated 1C strain emerging in Serbia. Our analysis also showed that PCV2b is dominant in the porcine population, and that it is exclusively associated with PMWS occurrences in the country. These data constitute a useful basis for further epidemiological studies regarding the heterogeneity of PCV2 strains on the European continent.

  16. Novel compound heterozygous mutations in MYO7A in a Chinese family with Usher syndrome type 1

    PubMed Central

    Liu, Fei; Li, Pengcheng; Liu, Ying; Li, Weirong; Wong, Fulton; Du, Rong; Wang, Lei; Li, Chang; Jiang, Fagang; Tang, Zhaohui

    2013-01-01

    Purpose To identify the disease-causing mutation(s) in a Chinese family with autosomal recessive Usher syndrome type 1 (USH1). Methods An ophthalmic examination and an audiometric test were conducted to ascertain the phenotype of two affected siblings. The microsatellite marker D11S937, which is close to the candidate gene MYO7A (USH1B locus), was selected for genotyping. From the DNA of the proband, all coding exons and exon-intron boundaries of MYO7A were sequenced to identify the disease-causing mutation(s). Restriction fragment length polymorphism (RFLP) analysis was performed to exclude the alternative conclusion that the mutations are non-pathogenic rare polymorphisms. Results Based on severe hearing impairment, unintelligible speech, and retinitis pigmentosa, a clinical diagnosis of Usher syndrome type 1 was made. The genotyping results did not exclude the USH1B locus, which suggested that the MYO7A gene was likely the gene associated with the disease-causing mutation(s) in the family. With direct DNA sequencing of MYO7A, two novel compound heterozygous mutations (c.3742G>A and c.6051+1G>A) of MYO7A were identified in the proband. DNA sequence analysis and RFLP analysis of other family members showed that the mutations cosegregated with the disease. Unaffected members, including the parents, uncle, and sister of the proband, carry only one of the two mutations. The mutations were not present in the controls (100 normal Chinese subjects=200 chromosomes) according to the RFLP analysis. Conclusions In this study, we identified two novel mutations, c.3742G>A (p.E1248K) and c.6051+1G>A (donor splice site mutation in intron 44), of MYO7A in a Chinese non-consanguineous family with USH1. The mutations cosegregated with the disease and most likely cause the phenotype in the two affected siblings who carry these mutations compound heterozygously. Our finding expands the mutational spectrum of MYO7A. PMID:23559863

  17. in silico Whole Genome Sequencer & Analyzer (iWGS): A Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhou, Xiaofan; Peris, David; Kominek, Jacek

    The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding its potential for understanding the biology and evolution of the full spectrum of biodiversity. The increasing diversity of sequencing technologies, assays, and de novo assembly algorithms have augmented the complexity of de novo genome sequencing projects in nonmodel organisms. To reduce the costs and challenges in de novo genome sequencing projects and streamline their experimentalmore » design and analysis, we developed iWGS (in silico Whole Genome Sequencer and Analyzer), an automated pipeline for guiding the choice of appropriate sequencing strategy and assembly protocols. iWGS seamlessly integrates the four key steps of a de novo genome sequencing project: data generation (through simulation), data quality control, de novo assembly, and assembly evaluation and validation. The last three steps can also be applied to the analysis of real data. iWGS is designed to enable the user to have great flexibility in testing the range of experimental designs available for genome sequencing projects, and supports all major sequencing technologies and popular assembly tools. Three case studies illustrate how iWGS can guide the design of de novo genome sequencing projects, and evaluate the performance of a wide variety of user-specified sequencing strategies and assembly protocols on genomes of differing architectures. iWGS, along with a detailed documentation, is freely available at https://github.com/zhouxiaofan1983/iWGS.« less

  18. in silico Whole Genome Sequencer & Analyzer (iWGS): A Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies

    DOE PAGES

    Zhou, Xiaofan; Peris, David; Kominek, Jacek; ...

    2016-09-16

    The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding its potential for understanding the biology and evolution of the full spectrum of biodiversity. The increasing diversity of sequencing technologies, assays, and de novo assembly algorithms have augmented the complexity of de novo genome sequencing projects in nonmodel organisms. To reduce the costs and challenges in de novo genome sequencing projects and streamline their experimentalmore » design and analysis, we developed iWGS (in silico Whole Genome Sequencer and Analyzer), an automated pipeline for guiding the choice of appropriate sequencing strategy and assembly protocols. iWGS seamlessly integrates the four key steps of a de novo genome sequencing project: data generation (through simulation), data quality control, de novo assembly, and assembly evaluation and validation. The last three steps can also be applied to the analysis of real data. iWGS is designed to enable the user to have great flexibility in testing the range of experimental designs available for genome sequencing projects, and supports all major sequencing technologies and popular assembly tools. Three case studies illustrate how iWGS can guide the design of de novo genome sequencing projects, and evaluate the performance of a wide variety of user-specified sequencing strategies and assembly protocols on genomes of differing architectures. iWGS, along with a detailed documentation, is freely available at https://github.com/zhouxiaofan1983/iWGS.« less

  19. Analysis of the Legionella longbeachae Genome and Transcriptome Uncovers Unique Strategies to Cause Legionnaires' Disease

    PubMed Central

    Rusniok, Christophe; Lomma, Mariella; Dervins-Ravault, Delphine; Newton, Hayley J.; Sansom, Fiona M.; Jarraud, Sophie; Zidane, Nora; Ma, Laurence; Bouchier, Christiane; Etienne, Jerôme; Hartland, Elizabeth L.; Buchrieser, Carmen

    2010-01-01

    Legionella pneumophila and L. longbeachae are two species of a large genus of bacteria that are ubiquitous in nature. L. pneumophila is mainly found in natural and artificial water circuits while L. longbeachae is mainly present in soil. Under the appropriate conditions both species are human pathogens, capable of causing a severe form of pneumonia termed Legionnaires' disease. Here we report the sequencing and analysis of four L. longbeachae genomes, one complete genome sequence of L. longbeachae strain NSW150 serogroup (Sg) 1, and three draft genome sequences another belonging to Sg1 and two to Sg2. The genome organization and gene content of the four L. longbeachae genomes are highly conserved, indicating strong pressure for niche adaptation. Analysis and comparison of L. longbeachae strain NSW150 with L. pneumophila revealed common but also unexpected features specific to this pathogen. The interaction with host cells shows distinct features from L. pneumophila, as L. longbeachae possesses a unique repertoire of putative Dot/Icm type IV secretion system substrates, eukaryotic-like and eukaryotic domain proteins, and encodes additional secretion systems. However, analysis of the ability of a dotA mutant of L. longbeachae NSW150 to replicate in the Acanthamoeba castellanii and in a mouse lung infection model showed that the Dot/Icm type IV secretion system is also essential for the virulence of L. longbeachae. In contrast to L. pneumophila, L. longbeachae does not encode flagella, thereby providing a possible explanation for differences in mouse susceptibility to infection between the two pathogens. Furthermore, transcriptome analysis revealed that L. longbeachae has a less pronounced biphasic life cycle as compared to L. pneumophila, and genome analysis and electron microscopy suggested that L. longbeachae is encapsulated. These species-specific differences may account for the different environmental niches and disease epidemiology of these two Legionella species. PMID:20174605

  20. Detection of novel divergent arenaviruses in boid snakes with inclusion body disease in The Netherlands.

    PubMed

    Bodewes, R; Kik, M J L; Raj, V Stalin; Schapendonk, C M E; Haagmans, B L; Smits, S L; Osterhaus, A D M E

    2013-06-01

    Arenaviruses are bi-segmented negative-stranded RNA viruses, which were until recently only detected in rodents and humans. Now highly divergent arenaviruses have been identified in boid snakes with inclusion body disease (IBD). Here, we describe the identification of a new species and variants of the highly divergent arenaviruses, which were detected in tissues of captive boid snakes with IBD in The Netherlands by next-generation sequencing. Phylogenetic analysis of the complete sequence of the open reading frames of the four predicted proteins of one of the detected viruses revealed that this virus was most closely related to the recently identified Golden Gate virus, while considerable sequence differences were observed between the highly divergent arenaviruses detected in this study. These findings add to the recent identification of the highly divergent arenaviruses in boid snakes with IBD in the United States and indicate that these viruses also circulate among boid snakes in Europe.

  1. Whole-Genome Sequence Analysis of Streptococcus pneumoniae Strains That Cause Hospital-Acquired Pneumonia Infections.

    PubMed

    Chang, Bin; Morita, Masatomo; Lee, Ken-Ichi; Ohnishi, Makoto

    2018-05-01

    Streptococcus pneumoniae colonizes the nasopharyngeal mucus in healthy individuals and can cause otitis media, pneumonia, and invasive pneumococcal diseases. In this study, we analyzed S. pneumoniae strains that caused 19 pneumonia episodes in long-term inpatients with severe underlying disease in a hospital during a period of 14 months (from January 2014 to February 2015). Serotyping and whole-genome sequencing analyses revealed that 18 of the 19 pneumonia cases were caused by S. pneumoniae strains belonging to 3 genetically distinct groups: clonal complex 9999 (CC9999), sequence type 282 (ST282), and ST166. The CC9999 and ST282 strains appeared to have emerged separately by a capsule switch from the pandemic PMEN 1 strain (Spain 23F -ST81). After all the long-term inpatients were inoculated with the 23-valent pneumococcal polysaccharide vaccine, no other nosocomial pneumonia infections occurred until March 2016. Copyright © 2018 American Society for Microbiology.

  2. Next generation sequencing--implications for clinical practice.

    PubMed

    Raffan, Eleanor; Semple, Robert K

    2011-01-01

    Genetic testing in inherited disease has traditionally relied upon recognition of the presenting clinical syndrome and targeted analysis of genes known to be linked to that syndrome. Consequently, many patients with genetic syndromes remain without a specific diagnosis. New 'next-generation' sequencing (NGS) techniques permit simultaneous sequencing of enormous amounts of DNA. A slew of research publications have recently demonstrated the tremendous power of these technologies in increasing understanding of human genetic disease. These approaches are likely to be increasingly employed in routine diagnostic practice, but the scale of the genetic information yielded about individuals means that caution must be exercised to avoid net harm in this setting. Use of NGS in a research setting will increasingly have a major but indirect beneficial impact on clinical practice. However, important technical, ethical and social challenges need to be addressed through informed professional and public dialogue before it finds its mature niche as a direct tool in the clinical diagnostic armoury.

  3. Intraspecific Genetic Variation and Phylogenetic Analysis of Dirofilaria immitis Samples from Western China Using Complete ND1 and 16S rDNA Gene Sequences

    PubMed Central

    Liu, Tianyu; Liang, Yinan; Zhong, Xiuqin; Wang, Ning; Hu, Dandan; Zhou, Xuan; Gu, Xiaobin; Peng, Xuerong; Yang, Guangyou

    2014-01-01

    Dirofilaria immitis (heartworm) is the causative agent of an important zoonotic disease that is spread by mosquitoes. In this study, molecular and phylogenetic characterization of D. immitis were performed based on complete ND1 and 16S rDNA gene sequences, which provided the foundation for more advanced molecular diagnosis, prevention, and control of heartworm diseases. The mutation rate and evolutionary divergence in adult heartworm samples from seven dogs in western China were analyzed to obtain information on genetic diversity and variability. Phylogenetic relationships were inferred using both maximum parsimony (MP) and Bayes methods based on the complete gene sequences. The results suggest that D. immitis formed an independent monophyletic group in which the 16S rDNA gene has mutated more rapidly than has ND1. PMID:24639299

  4. Full genome sequence of the first bluetongue virus serotype 21 (BTV-21) isolated from China: evidence for genetic reassortment between BTV-21 and bluetongue virus serotype 16 (BTV-16).

    PubMed

    Qin, Shaomin; Yang, Heng; Zhang, Yixuan; Li, Zhanhong; Lin, Jun; Gao, Lin; Liao, Defang; Cao, Yingying; Ren, Pengfei; Li, Huachun; Wu, Jianmin

    2018-05-01

    Bluetongue (BT) is one of the most important insect-borne, non-contagious viral diseases of ruminants and can cause severe disease and death in sheep. Its pathogen, bluetongue virus (BTV) has a double-stranded RNA genome consisting of 10 segments that provides an opportunity for field and vaccine strains of different serotypes to reassort whilst simultaneously infecting the same animal. For the first time, we report the full-length genome sequence of a BTV strain of serotype 21 (5149E) isolated from sentinel cattle in Guangxi Province in China in 2015. Sequence analysis suggested that the isolate 5149E had undergone a reassortment incident and acquired seg-6 from an isolate of BTV-16 which originated from Japan. This study aims to provide more understanding as to the origin and epidemiology of BTV.

  5. [Inverse PCR amplification of the complete major capsid protein gene of lymphocystis disease virus isolated from Rachycentron canadum and the phylogenetic analysis of the virus].

    PubMed

    Fu, Xiao-Zhe; Shi, Cun-Bin; Li, Ning-Qiu; Pan, Hou-Jun; Chang, Ou-Qin; Wu, Shu-Qin

    2007-09-01

    The major capsid protein of lymphocystis disease virus isolated from Rachycentron canadum (LCDV-rc) was amplified and analysed. The 457bp DNA core fragment was amplified with the degenerate primers designed according to the conserved sequences of MCP gene of iridoviruses, then the flaking sequences adjacent to the core region were amplified by inverse PCR, and the complete sequence was obtained by combining all of them. The open reading frame of the gene is 1380bp in length, encoding a putative protein of 459 aa with molecular weight 51.12 kD and pI 6.87. Constructing the phylogenetic tree for comparing the MCP amino acid of iridoviruses, the results indicated that LCDV-rc is most homologous to the other Lymphocystis viruses and all of them constitute a branch. Accordingly LCDV-rc is identified as Lymphocystivirus.

  6. Whole genome sequencing of an African American family highlights toll like receptor 6 variants in Kawasaki disease susceptibility.

    PubMed

    Kim, Jihoon; Shimizu, Chisato; Kingsmore, Stephen F; Veeraraghavan, Narayanan; Levy, Eric; Ribeiro Dos Santos, Andre M; Yang, Hai; Flatley, Jay; Hoang, Long Truong; Hibberd, Martin L; Tremoulet, Adriana H; Harismendy, Olivier; Ohno-Machado, Lucila; Burns, Jane C

    2017-01-01

    Kawasaki disease (KD) is the most common acquired pediatric heart disease. We analyzed Whole Genome Sequences (WGS) from a 6-member African American family in which KD affected two of four children. We sought rare, potentially causative genotypes by sequentially applying the following WGS filters: sequence quality scores, inheritance model (recessive homozygous and compound heterozygous), predicted deleteriousness, allele frequency, genes in KD-associated pathways or with significant associations in published KD genome-wide association studies (GWAS), and with differential expression in KD blood transcriptomes. Biologically plausible genotypes were identified in twelve variants in six genes in the two affected children. The affected siblings were compound heterozygous for the rare variants p.Leu194Pro and p.Arg247Lys in Toll-like receptor 6 (TLR6), which affect TLR6 signaling. The affected children were also homozygous for three common, linked (r2 = 1) intronic single nucleotide variants (SNVs) in TLR6 (rs56245262, rs56083757 and rs7669329), that have previously shown association with KD in cohorts of European descent. Using transcriptome data from pre-treatment whole blood of KD subjects (n = 146), expression quantitative trait loci (eQTL) analyses were performed. Subjects homozygous for the intronic risk allele (A allele of TLR6 rs56245262) had differential expression of Interleukin-6 (IL-6) as a function of genotype (p = 0.0007) and a higher erythrocyte sedimentation rate at diagnosis. TLR6 plays an important role in pathogen-associated molecular pattern recognition, and sequence variations may affect binding affinities that in turn influence KD susceptibility. This integrative genomic approach illustrates how the analysis of WGS in multiplex families with a complex genetic disease allows examination of both the common disease-common variant and common disease-rare variant hypotheses.

  7. Whole-genome sequencing identifies recurrent mutations in chronic lymphocytic leukaemia

    PubMed Central

    Puente, Xose S.; Pinyol, Magda; Quesada, Víctor; Conde, Laura; Ordóñez, Gonzalo R.; Villamor, Neus; Escaramis, Georgia; Jares, Pedro; Beà, Sílvia; González-Díaz, Marcos; Bassaganyas, Laia; Baumann, Tycho; Juan, Manel; López-Guerra, Mónica; Colomer, Dolors; Tubío, José M. C.; López, Cristina; Navarro, Alba; Tornador, Cristian; Aymerich, Marta; Rozman, María; Hernández, Jesús M.; Puente, Diana A.; Freije, José M. P.; Velasco, Gloria; Gutiérrez-Fernández, Ana; Costa, Dolors; Carrió, Anna; Guijarro, Sara; Enjuanes, Anna; Hernández, Lluís; Yagüe, Jordi; Nicolás, Pilar; Romeo-Casabona, Carlos M.; Himmelbauer, Heinz; Castillo, Ester; Dohm, Juliane C.; de Sanjosé, Silvia; Piris, Miguel A.; de Alava, Enrique; Miguel, Jesús San; Royo, Romina; Gelpí, Josep L.; Torrents, David; Orozco, Modesto; Pisano, David G.; Valencia, Alfonso; Guigó, Roderic; Bayés, Mónica; Heath, Simon; Gut, Marta; Klatt, Peter; Marshall, John; Raine, Keiran; Stebbings, Lucy A.; Futreal, P. Andrew; Stratton, Michael R.; Campbell, Peter J.; Gut, Ivo; López-Guillermo, Armando; Estivill, Xavier; Montserrat, Emili; López-Otín, Carlos; Campo, Elías

    2012-01-01

    Chronic lymphocytic leukaemia (CLL), the most frequent leukaemia in adults in Western countries, is a heterogeneous disease with variable clinical presentation and evolution1,2. Two major molecular subtypes can be distinguished, characterized respectively by a high or low number of somatic hypermutations in the variable region of immunoglobulin genes3,4. The molecular changes leading to the pathogenesis of the disease are still poorly understood. Here we performed whole-genome sequencing of four cases of CLL and identified 46 somatic mutations that potentially affect gene function. Further analysis of these mutations in 363 patients with CLL identified four genes that are recurrently mutated: notch 1 (NOTCH1), exportin 1 (XPO1), myeloid differentiation primary response gene 88 (MYD88) and kelch-like 6 (KLHL6). Mutations in MYD88 and KLHL6 are predominant in cases of CLL with mutated immunoglobulin genes, whereas NOTCH1 and XPO1 mutations are mainly detected in patients with unmutated immunoglobulins. The patterns of somatic mutation, supported by functional and clinical analyses, strongly indicate that the recurrent NOTCH1, MYD88 and XPO1 mutations are oncogenic changes that contribute to the clinical evolution of the disease. To our knowledge, this is the first comprehensive analysis of CLL combining whole-genome sequencing with clinical characteristics and clinical outcomes. It highlights the usefulness of this approach for the identification of clinically relevant mutations in cancer. PMID:21642962

  8. High incidence of unrecognized visceral/neurological late-onset Niemann-Pick disease, type C1, predicted by analysis of massively parallel sequencing data sets.

    PubMed

    Wassif, Christopher A; Cross, Joanna L; Iben, James; Sanchez-Pulido, Luis; Cougnoux, Antony; Platt, Frances M; Ory, Daniel S; Ponting, Chris P; Bailey-Wilson, Joan E; Biesecker, Leslie G; Porter, Forbes D

    2016-01-01

    Niemann-Pick disease type C (NPC) is a recessive, neurodegenerative, lysosomal storage disease caused by mutations in either NPC1 or NPC2. The diagnosis is difficult and frequently delayed. Ascertainment is likely incomplete because of both these factors and because the full phenotypic spectrum may not have been fully delineated. Given the recent development of a blood-based diagnostic test and the development of potential therapies, understanding the incidence of NPC and defining at-risk patient populations are important. We evaluated data from four large, massively parallel exome sequencing data sets. Variant sequences were identified and classified as pathogenic or nonpathogenic based on a combination of literature review and bioinformatic analysis. This methodology provided an unbiased approach to determining the allele frequency. Our data suggest an incidence rate for NPC1 and NPC2 of 1/92,104 and 1/2,858,998, respectively. Evaluation of common NPC1 variants, however, suggests that there may be a late-onset NPC1 phenotype with a markedly higher incidence, on the order of 1/19,000-1/36,000. We determined a combined incidence of classical NPC of 1/89,229, or 1.12 affected patients per 100,000 conceptions, but predict incomplete ascertainment of a late-onset phenotype of NPC1. This finding strongly supports the need for increased screening of potential patients.

  9. Sex-specific differences in the occurrence of Fusobacterium nucleatum subspecies and Fusobacterium periodonticum in the oral cavity

    PubMed Central

    Henne, Karsten; Schilling, Hildegard; Stoneking, Mark; Conrads, Georg; Horz, Hans-Peter

    2018-01-01

    The periodontitis-associated species Fusobacterium nucleatum (FN) has been implicated in several extra-oral diseases, including preterm birth and colorectal cancer. Due to its genetic and phenotypic heterogeneity, FN is classified in four subspecies which may differ in their disease potential. Here we compared the prevalence of FN subspecies and the close relative F. periodonticum (FP) via 16S rRNA gene analysis in saliva from 100 healthy individuals (60 females, and 40 males) from eleven countries spanning five continents. By focusing on the most abundant sequence types (i.e. analysis of approximately ten clone sequences each) the average number of FN/FP subspecies per individual differed significantly between females and males, i.e. 2.93 versus 2.5, respectively (P = 0.043). FN subsp. fusiforme/vincentii was significantly more prevalent in females vs males, with 2.85 vs. 1.68 sequence reads per individual, respectively (P = 0.012). A significant age-related difference was observed in females but not in males, i.e. 2.6 subspecies on average in females ≤ 30 years vs. 3.2 in females > 30 (P = 0.0076). Given the link between FN and systemic disorders our findings highlight the need for microbial studies at the subspecies level to further characterize the role of periodontal pathogens in diseases that affect females and males differently, e.g. colorectal cancer. PMID:29755677

  10. Single-Cell RNA-Sequencing in Glioma.

    PubMed

    Johnson, Eli; Dickerson, Katherine L; Connolly, Ian D; Hayden Gephart, Melanie

    2018-04-10

    In this review, we seek to summarize the literature concerning the use of single-cell RNA-sequencing for CNS gliomas. Single-cell analysis has revealed complex tumor heterogeneity, subpopulations of proliferating stem-like cells and expanded our view of tumor microenvironment influence in the disease process. Although bulk RNA-sequencing has guided our initial understanding of glioma genetics, this method does not accurately define the heterogeneous subpopulations found within these tumors. Single-cell techniques have appealing applications in cancer research, as diverse cell types and the tumor microenvironment have important implications in therapy. High cost and difficult protocols prevent widespread use of single-cell RNA-sequencing; however, continued innovation will improve accessibility and expand our of knowledge gliomas.

  11. DNA Barcode Analysis of Thrips (Thysanoptera) Diversity in Pakistan Reveals Cryptic Species Complexes.

    PubMed

    Iftikhar, Romana; Ashfaq, Muhammad; Rasool, Akhtar; Hebert, Paul D N

    2016-01-01

    Although thrips are globally important crop pests and vectors of viral disease, species identifications are difficult because of their small size and inconspicuous morphological differences. Sequence variation in the mitochondrial COI-5' (DNA barcode) region has proven effective for the identification of species in many groups of insect pests. We analyzed barcode sequence variation among 471 thrips from various plant hosts in north-central Pakistan. The Barcode Index Number (BIN) system assigned these sequences to 55 BINs, while the Automatic Barcode Gap Discovery detected 56 partitions, a count that coincided with the number of monophyletic lineages recognized by Neighbor-Joining analysis and Bayesian inference. Congeneric species showed an average of 19% sequence divergence (range = 5.6% - 27%) at COI, while intraspecific distances averaged 0.6% (range = 0.0% - 7.6%). BIN analysis suggested that all intraspecific divergence >3.0% actually involved a species complex. In fact, sequences for three major pest species (Haplothrips reuteri, Thrips palmi, Thrips tabaci), and one predatory thrips (Aeolothrips intermedius) showed deep intraspecific divergences, providing evidence that each is a cryptic species complex. The study compiles the first barcode reference library for the thrips of Pakistan, and examines global haplotype diversity in four important pest thrips.

  12. TGFB2 mutations cause familial thoracic aortic aneurysms and dissections associated with mild systemic features of Marfan syndrome.

    PubMed

    Boileau, Catherine; Guo, Dong-Chuan; Hanna, Nadine; Regalado, Ellen S; Detaint, Delphine; Gong, Limin; Varret, Mathilde; Prakash, Siddharth K; Li, Alexander H; d'Indy, Hyacintha; Braverman, Alan C; Grandchamp, Bernard; Kwartler, Callie S; Gouya, Laurent; Santos-Cortez, Regie Lyn P; Abifadel, Marianne; Leal, Suzanne M; Muti, Christine; Shendure, Jay; Gross, Marie-Sylvie; Rieder, Mark J; Vahanian, Alec; Nickerson, Deborah A; Michel, Jean Baptiste; Jondeau, Guillaume; Milewicz, Dianna M

    2012-07-08

    A predisposition for thoracic aortic aneurysms leading to acute aortic dissections can be inherited in families in an autosomal dominant manner. Genome-wide linkage analysis of two large unrelated families with thoracic aortic disease followed by whole-exome sequencing of affected relatives identified causative mutations in TGFB2. These mutations-a frameshift mutation in exon 6 and a nonsense mutation in exon 4-segregated with disease with a combined logarithm of odds (LOD) score of 7.7. Sanger sequencing of 276 probands from families with inherited thoracic aortic disease identified 2 additional TGFB2 mutations. TGFB2 encodes transforming growth factor (TGF)-β2, and the mutations are predicted to cause haploinsufficiency for TGFB2; however, aortic tissue from cases paradoxically shows increased TGF-β2 expression and immunostaining. Thus, haploinsufficiency for TGFB2 predisposes to thoracic aortic disease, suggesting that the initial pathway driving disease is decreased cellular TGF-β2 levels leading to a secondary increase in TGF-β2 production in the diseased aorta.

  13. Phylogenetic analysis of Newcastle disease viruses from Bangladesh suggests continuing evolution of genotype XIII.

    PubMed

    Barman, Lalita Rani; Nooruzzaman, Mohammed; Sarker, Rahul Deb; Rahman, Md Tazinur; Saife, Md Rajib Bin; Giasuddin, Mohammad; Das, Bidhan Chandra; Das, Priya Mohan; Chowdhury, Emdadul Haque; Islam, Mohammad Rafiqul

    2017-10-01

    A total of 23 Newcastle disease virus (NDV) isolates from Bangladesh taken between 2010 and 2012 were characterized on the basis of partial F gene sequences. All the isolates belonged to genotype XIII of class II NDV but segregated into three sub-clusters. One sub-cluster with 17 isolates aligned with sub-genotype XIIIc. The other two sub-clusters were phylogenetically distinct from the previously described sub-genotypes XIIIa, XIIIb and XIIIc and could be candidates of new sub-genotypes; however, that needs to be validated through full-length F gene sequence data. The results of the present study suggest that genotype XIII NDVs are under continuing evolution in Bangladesh.

  14. Cloning and sequence analysis of a cDNA encoding the alpha-subunit of mouse beta-N-acetylhexosaminidase and comparison with the human enzyme.

    PubMed Central

    Beccari, T; Hoade, J; Orlacchio, A; Stirling, J L

    1992-01-01

    cDNAs encoding the mouse beta-N-acetylhexosaminidase alpha-subunit were isolated from a mouse testis library. The longest of these (1.7 kb) was sequenced and showed 83% similarity with the human alpha-subunit cDNA sequence. The 5' end of the coding sequence was obtained from a genomic DNA clone. Alignment of the human and mouse sequences showed that all three putative N-glycosylation sites are conserved, but that the mouse alpha-subunit has an additional site towards the C-terminus. All eight cysteines in the human sequence are conserved in the mouse. There are an additional two cysteines in the mouse alpha-subunit signal peptide. All amino acids affected in Tay-Sachs-disease mutations are conserved in the mouse. Images Fig. 1. PMID:1379046

  15. Understanding sequence similarity and framework analysis between centromere proteins using computational biology.

    PubMed

    Doss, C George Priya; Chakrabarty, Chiranjib; Debajyoti, C; Debottam, S

    2014-11-01

    Certain mysteries pointing toward their recruitment pathways, cell cycle regulation mechanisms, spindle checkpoint assembly, and chromosome segregation process are considered the centre of attraction in cancer research. In modern times, with the established databases, ranges of computational platforms have provided a platform to examine almost all the physiological and biochemical evidences in disease-associated phenotypes. Using existing computational methods, we have utilized the amino acid residues to understand the similarity within the evolutionary variance of different associated centromere proteins. This study related to sequence similarity, protein-protein networking, co-expression analysis, and evolutionary trajectory of centromere proteins will speed up the understanding about centromere biology and will create a road map for upcoming researchers who are initiating their work of clinical sequencing using centromere proteins.

  16. Beyond DNA Sequencing in Space: Current and Future Omics Capabilities of the Biomolecule Sequencer Payload

    NASA Technical Reports Server (NTRS)

    Wallace, Sarah

    2017-01-01

    Why do we need a DNA sequencer to support the human exploration of space? (A) Operational environmental monitoring; (1) Identification of contaminating microbes, (2) Infectious disease diagnosis, (3) Reduce down mass (sample return for environmental monitoring, crew health, etc.). (B) Research; (1) Human, (2) Animal, (3) Microbes/Cell lines, (4) Plant. (C) Med Ops; (1) Response to countermeasures, (2) Radiation, (3) Real-time analysis can influence medical intervention. (C) Support astrobiology science investigations; (1) Technology superiorly suited to in situ nucleic acid-based life detection, (2) Functional testing for integration into robotics for extraplanetary exploration mission.

  17. Staphylococcus aureus and Staphylococcus epidermidis strain diversity underlying pediatric atopic dermatitis.

    PubMed

    Byrd, Allyson L; Deming, Clay; Cassidy, Sara K B; Harrison, Oliver J; Ng, Weng-Ian; Conlan, Sean; Belkaid, Yasmine; Segre, Julia A; Kong, Heidi H

    2017-07-05

    The heterogeneous course, severity, and treatment responses among patients with atopic dermatitis (AD; eczema) highlight the complexity of this multifactorial disease. Prior studies have used traditional typing methods on cultivated isolates or sequenced a bacterial marker gene to study the skin microbial communities of AD patients. Shotgun metagenomic sequence analysis provides much greater resolution, elucidating multiple levels of microbial community assembly ranging from kingdom to species and strain-level diversification. We analyzed microbial temporal dynamics from a cohort of pediatric AD patients sampled throughout the disease course. Species-level investigation of AD flares showed greater Staphylococcus aureus predominance in patients with more severe disease and Staphylococcus epidermidis predominance in patients with less severe disease. At the strain level, metagenomic sequencing analyses demonstrated clonal S. aureus strains in more severe patients and heterogeneous S. epidermidis strain communities in all patients. To investigate strain-level biological effects of S. aureus , we topically colonized mice with human strains isolated from AD patients and controls. This cutaneous colonization model demonstrated S. aureus strain-specific differences in eliciting skin inflammation and immune signatures characteristic of AD patients. Specifically, S. aureus isolates from AD patients with more severe flares induced epidermal thickening and expansion of cutaneous T helper 2 (T H 2) and T H 17 cells. Integrating high-resolution sequencing, culturing, and animal models demonstrated how functional differences of staphylococcal strains may contribute to the complexity of AD disease. Copyright © 2017 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.

  18. Spatio-temporal Analysis of the Genetic Diversity of Arctic Rabies Viruses and Their Reservoir Hosts in Greenland

    PubMed Central

    Hanke, Dennis; Freuling, Conrad M.; Fischer, Susanne; Hueffer, Karsten; Hundertmark, Kris; Nadin-Davis, Susan; Marston, Denise; Fooks, Anthony R.; Bøtner, Anette; Mettenleiter, Thomas C.; Beer, Martin; Rasmussen, Thomas B.; Müller, Thomas F.; Höper, Dirk

    2016-01-01

    There has been limited knowledge on spatio-temporal epidemiology of zoonotic arctic fox rabies among countries bordering the Arctic, in particular Greenland. Previous molecular epidemiological studies have suggested the occurrence of one particular arctic rabies virus (RABV) lineage (arctic-3), but have been limited by a low number of available samples preventing in-depth high resolution phylogenetic analysis of RABVs at that time. However, an improved knowledge of the evolution, at a molecular level, of the circulating RABVs and a better understanding of the historical perspective of the disease in Greenland is necessary for better direct control measures on the island. These issues have been addressed by investigating the spatio-temporal genetic diversity of arctic RABVs and their reservoir host, the arctic fox, in Greenland using both full and partial genome sequences. Using a unique set of 79 arctic RABV full genome sequences from Greenland, Canada, USA (Alaska) and Russia obtained between 1977 and 2014, a description of the historic context in relation to the genetic diversity of currently circulating RABV in Greenland and neighboring Canadian Northern territories has been provided. The phylogenetic analysis confirmed delineation into four major arctic RABV lineages (arctic 1–4) with viruses from Greenland exclusively grouping into the circumpolar arctic-3 lineage. High resolution analysis enabled distinction of seven geographically distinct subclades (3.I – 3.VII) with two subclades containing viruses from both Greenland and Canada. By combining analysis of full length RABV genome sequences and host derived sequences encoding mitochondrial proteins obtained simultaneously from brain tissues of 49 arctic foxes, the interaction of viruses and their hosts was explored in detail. Such an approach can serve as a blueprint for analysis of infectious disease dynamics and virus-host interdependencies. The results showed a fine-scale spatial population structure in Greenland arctic foxes based on mitochondrial sequences, but provided no evidence for independent isolated evolutionary development of RABV in different arctic fox lineages. These data are invaluable to support future initiatives for arctic fox rabies control and elimination in Greenland. PMID:27459154

  19. Spatio-temporal Analysis of the Genetic Diversity of Arctic Rabies Viruses and Their Reservoir Hosts in Greenland.

    PubMed

    Hanke, Dennis; Freuling, Conrad M; Fischer, Susanne; Hueffer, Karsten; Hundertmark, Kris; Nadin-Davis, Susan; Marston, Denise; Fooks, Anthony R; Bøtner, Anette; Mettenleiter, Thomas C; Beer, Martin; Rasmussen, Thomas B; Müller, Thomas F; Höper, Dirk

    2016-07-01

    There has been limited knowledge on spatio-temporal epidemiology of zoonotic arctic fox rabies among countries bordering the Arctic, in particular Greenland. Previous molecular epidemiological studies have suggested the occurrence of one particular arctic rabies virus (RABV) lineage (arctic-3), but have been limited by a low number of available samples preventing in-depth high resolution phylogenetic analysis of RABVs at that time. However, an improved knowledge of the evolution, at a molecular level, of the circulating RABVs and a better understanding of the historical perspective of the disease in Greenland is necessary for better direct control measures on the island. These issues have been addressed by investigating the spatio-temporal genetic diversity of arctic RABVs and their reservoir host, the arctic fox, in Greenland using both full and partial genome sequences. Using a unique set of 79 arctic RABV full genome sequences from Greenland, Canada, USA (Alaska) and Russia obtained between 1977 and 2014, a description of the historic context in relation to the genetic diversity of currently circulating RABV in Greenland and neighboring Canadian Northern territories has been provided. The phylogenetic analysis confirmed delineation into four major arctic RABV lineages (arctic 1-4) with viruses from Greenland exclusively grouping into the circumpolar arctic-3 lineage. High resolution analysis enabled distinction of seven geographically distinct subclades (3.I - 3.VII) with two subclades containing viruses from both Greenland and Canada. By combining analysis of full length RABV genome sequences and host derived sequences encoding mitochondrial proteins obtained simultaneously from brain tissues of 49 arctic foxes, the interaction of viruses and their hosts was explored in detail. Such an approach can serve as a blueprint for analysis of infectious disease dynamics and virus-host interdependencies. The results showed a fine-scale spatial population structure in Greenland arctic foxes based on mitochondrial sequences, but provided no evidence for independent isolated evolutionary development of RABV in different arctic fox lineages. These data are invaluable to support future initiatives for arctic fox rabies control and elimination in Greenland.

  20. Origin of porcine circovirus type 2 (PCV2) from swine affected by PCV2-associated diseases in Croatia.

    PubMed

    Novosel, D; Tuboly, T; Csagola, A; Lorincz, M; Cubric-Curik, V; Jungic, A; Curik, I; Segalés, J; Cortey, M; Lipej, Z

    2014-04-26

    Porcine circovirus type 2 (PCV2) causes some of the most significant economic losses in pig production. Several multisystemic syndromes have been attributed to PCV2 infection, which are known as PCV2-associated diseases (PCVDs). This study investigated the origin and evolution of PCV2 sequences in domestic pigs and wild boars affected by PCVDs in Croatia. Viral sequences were recovered from three wild boars diagnosed with PCV2-systemic disease (PCV2-SD), 63 fetuses positive for PCV2 DNA as determined by PCR, 14 domestic pigs affected with PCV2-SD (displaying severe interstitial nephritis) and five domestic pigs with proliferative and necrotising pneumonia. Seventeen complete PCV2 genomes were recovered. Phylogenetic and evolutionary analyses based on median-joining phylogenetic networks, amino acid alignments and principal coordinate analysis were performed using complete genomes, as well as complete and partial ORF sequences for ORF1 and ORF2. Two of the 17 PCV2 sequences belonged to PCV2a, 14 to PCV2b and one was unclustered. PCV2b was the predominant genotype in Croatia and has been linked to international trade as a route of introduction. Correlation between particular viral strains with PCVDs is lacking.

  1. Microbial Communities in the Surface Mucopolysaccharide Layer and the Black Band Microbial Mat of Black Band-Diseased Siderastrea siderea

    PubMed Central

    Sekar, Raju; Mills, DeEtta K.; Remily, Elizabeth R.; Voss, Joshua D.; Richardson, Laurie L.

    2006-01-01

    Microbial community profiles and species composition associated with two black band-diseased colonies of the coral Siderastrea siderea were studied by 16S rRNA-targeted gene cloning, sequencing, and amplicon-length heterogeneity PCR (LH-PCR). Bacterial communities associated with the surface mucopolysaccharide layer (SML) of apparently healthy tissues of the infected colonies, together with samples of the black band disease (BBD) infections, were analyzed using the same techniques for comparison. Gene sequences, ranging from 424 to 1,537 bp, were retrieved from all positive clones (n = 43 to 48) in each of the four clone libraries generated and used for comparative sequence analysis. In addition to LH-PCR community profiling, all of the clone sequences were aligned with LH-PCR primer sequences, and the theoretical lengths of the amplicons were determined. Results revealed that the community profiles were significantly different between BBD and SML samples. The SML samples were dominated by γ-proteobacteria (53 to 64%), followed by β-proteobacteria (18 to 21%) and α-proteobacteria (5 to 11%). In contrast, both BBD clone libraries were dominated by α-proteobacteria (58 to 87%), followed by verrucomicrobia (2 to 10%) and 0 to 6% each of δ-proteobacteria, bacteroidetes, firmicutes, and cyanobacteria. Alphaproteobacterial sequence types related to the bacteria associated with toxin-producing dinoflagellates were observed in BBD clone libraries but were not found in the SML libraries. Similarly, sequences affiliated with the family Desulfobacteraceae and toxin-producing cyanobacteria, both believed to be involved in BBD pathogenesis, were found only in BBD libraries. These data provide evidence for an association of numerous toxin-producing heterotrophic microorganisms with BBD of corals. PMID:16957217

  2. Rapid-Onset Obesity with Hypothalamic Dysfunction, Hypoventilation, and Autonomic Dysregulation (ROHHAD): exome sequencing of trios, monozygotic twins and tumours.

    PubMed

    Barclay, Sarah F; Rand, Casey M; Borch, Lauren A; Nguyen, Lisa; Gray, Paul A; Gibson, William T; Wilson, Richard J A; Gordon, Paul M K; Aung, Zaw; Berry-Kravis, Elizabeth M; Ize-Ludlow, Diego; Weese-Mayer, Debra E; Bech-Hansen, N Torben

    2015-08-25

    Rapid-onset Obesity with Hypothalamic Dysfunction, Hypoventilation, and Autonomic Dysregulation (ROHHAD) is thought to be a genetic disease caused by de novo mutations, though causative mutations have yet to be identified. We searched for de novo coding mutations among a carefully-diagnosed and clinically homogeneous cohort of 35 ROHHAD patients. We sequenced the exomes of seven ROHHAD trios, plus tumours from four of these patients and the unaffected monozygotic (MZ) twin of one (discovery cohort), to identify constitutional and somatic de novo sequence variants. We further analyzed this exome data to search for candidate genes under autosomal dominant and recessive models, and to identify structural variations. Candidate genes were tested by exome or Sanger sequencing in a replication cohort of 28 ROHHAD singletons. The analysis of the trio-based exomes found 13 de novo variants. However, no two patients had de novo variants in the same gene, and additional patient exomes and mutation analysis in the replication cohort did not provide strong genetic evidence to implicate any of these sequence variants in ROHHAD. Somatic comparisons revealed no coding differences between any blood and tumour samples, or between the two discordant MZ twins. Neither autosomal dominant nor recessive analysis yielded candidate genes for ROHHAD, and we did not identify any potentially causative structural variations. Clinical exome sequencing is highly unlikely to be a useful diagnostic test in patients with true ROHHAD. As ROHHAD has a high risk for fatality if not properly managed, it remains imperative to expand the search for non-exomic genetic risk factors, as well as to investigate other possible mechanisms of disease. In so doing, we will be able to confirm objectively the ROHHAD diagnosis and to contribute to our understanding of obesity, respiratory control, hypothalamic function, and autonomic regulation.

  3. G-CNV: A GPU-Based Tool for Preparing Data to Detect CNVs with Read-Depth Methods.

    PubMed

    Manconi, Andrea; Manca, Emanuele; Moscatelli, Marco; Gnocchi, Matteo; Orro, Alessandro; Armano, Giuliano; Milanesi, Luciano

    2015-01-01

    Copy number variations (CNVs) are the most prevalent types of structural variations (SVs) in the human genome and are involved in a wide range of common human diseases. Different computational methods have been devised to detect this type of SVs and to study how they are implicated in human diseases. Recently, computational methods based on high-throughput sequencing (HTS) are increasingly used. The majority of these methods focus on mapping short-read sequences generated from a donor against a reference genome to detect signatures distinctive of CNVs. In particular, read-depth based methods detect CNVs by analyzing genomic regions with significantly different read-depth from the other ones. The pipeline analysis of these methods consists of four main stages: (i) data preparation, (ii) data normalization, (iii) CNV regions identification, and (iv) copy number estimation. However, available tools do not support most of the operations required at the first two stages of this pipeline. Typically, they start the analysis by building the read-depth signal from pre-processed alignments. Therefore, third-party tools must be used to perform most of the preliminary operations required to build the read-depth signal. These data-intensive operations can be efficiently parallelized on graphics processing units (GPUs). In this article, we present G-CNV, a GPU-based tool devised to perform the common operations required at the first two stages of the analysis pipeline. G-CNV is able to filter low-quality read sequences, to mask low-quality nucleotides, to remove adapter sequences, to remove duplicated read sequences, to map the short-reads, to resolve multiple mapping ambiguities, to build the read-depth signal, and to normalize it. G-CNV can be efficiently used as a third-party tool able to prepare data for the subsequent read-depth signal generation and analysis. Moreover, it can also be integrated in CNV detection tools to generate read-depth signals.

  4. OVAS: an open-source variant analysis suite with inheritance modelling.

    PubMed

    Mozere, Monika; Tekman, Mehmet; Kari, Jameela; Bockenhauer, Detlef; Kleta, Robert; Stanescu, Horia

    2018-02-08

    The advent of modern high-throughput genetics continually broadens the gap between the rising volume of sequencing data, and the tools required to process them. The need to pinpoint a small subset of functionally important variants has now shifted towards identifying the critical differences between normal variants and disease-causing ones. The ever-increasing reliance on cloud-based services for sequence analysis and the non-transparent methods they utilize has prompted the need for more in-situ services that can provide a safer and more accessible environment to process patient data, especially in circumstances where continuous internet usage is limited. To address these issues, we herein propose our standalone Open-source Variant Analysis Sequencing (OVAS) pipeline; consisting of three key stages of processing that pertain to the separate modes of annotation, filtering, and interpretation. Core annotation performs variant-mapping to gene-isoforms at the exon/intron level, append functional data pertaining the type of variant mutation, and determine hetero/homozygosity. An extensive inheritance-modelling module in conjunction with 11 other filtering components can be used in sequence ranging from single quality control to multi-file penetrance model specifics such as X-linked recessive or mosaicism. Depending on the type of interpretation required, additional annotation is performed to identify organ specificity through gene expression and protein domains. In the course of this paper we analysed an autosomal recessive case study. OVAS made effective use of the filtering modules to recapitulate the results of the study by identifying the prescribed compound-heterozygous disease pattern from exome-capture sequence input samples. OVAS is an offline open-source modular-driven analysis environment designed to annotate and extract useful variants from Variant Call Format (VCF) files, and process them under an inheritance context through a top-down filtering schema of swappable modules, run entirely off a live bootable medium and accessed locally through a web-browser.

  5. ‘Candidatus Phytoplasma luffae’, a novel taxon associated with a witches’-broom disease of loofah, Luffa aegyptica Mill

    USDA-ARS?s Scientific Manuscript database

    The phytoplasma associated with witches’ broom disease of loofah (Luffa aegyptica Mill., syn. L.uffa cylindrica (L.) M.J. Roem.) in Taiwan was classified in group 16SrVIII, subgroup A (16SrVIII-A), based on results from actual and in silico RFLP analysis of 16S rRNA gene sequences. Nucleotide sequ...

  6. Cold Urticaria, Immunodeficiency, and Autoimmunity Related to PLCG2 Deletions

    PubMed Central

    Ombrello, Michael J.; Remmers, Elaine F.; Sun, Guangping; Freeman, Alexandra F.; Datta, Shrimati; Torabi-Parizi, Parizad; Subramanian, Naeha; Bunney, Tom D.; Baxendale, Rhona W.; Martins, Marta S.; Romberg, Neil; Komarow, Hirsh; Aksentijevich, Ivona; Kim, Hun Sik; Ho, Jason; Cruse, Glenn; Jung, Mi-Yeon; Gilfillan, Alasdair M.; Metcalfe, Dean D.; Nelson, Celeste; O'Brien, Michelle; Wisch, Laura; Stone, Kelly; Douek, Daniel C.; Gandhi, Chhavi; Wanderer, Alan A.; Lee, Hane; Nelson, Stanley F.; Shianna, Kevin V.; Cirulli, Elizabeth T.; Goldstein, David B.; Long, Eric O.; Moir, Susan; Meffre, Eric; Holland, Steven M.; Kastner, Daniel L.; Katan, Matilda; Hoffman, Hal M.; Milner, Joshua D.

    2012-01-01

    Background Mendelian analysis of disorders of immune regulation can provide insight into molecular pathways associated with host defense and immune tolerance. Methods We identified three families with a dominantly inherited complex of cold-induced urticaria, antibody deficiency, and susceptibility to infection and autoimmunity. Immunophenotyping methods included flow cytometry, analysis of serum immunoglobulins and autoantibodies, lymphocyte stimulation, and enzymatic assays. Genetic studies included linkage analysis, targeted Sanger sequencing, and next-generation whole-genome sequencing. Results Cold urticaria occurred in all affected subjects. Other, variable manifestations included atopy, granulomatous rash, autoimmune thyroiditis, the presence of antinuclear antibodies, sinopulmonary infections, and common variable immunodeficiency. Levels of serum IgM and IgA and circulating natural killer cells and class-switched memory B cells were reduced. Linkage analysis showed a 7-Mb candidate interval on chromosome 16q in one family, overlapping by 3.5 Mb a disease-associated haplotype in a smaller family. This interval includes PLCG2, encoding phospholipase Cγ2 (PLCγ2), a signaling molecule expressed in B cells, natural killer cells, and mast cells. Sequencing of complementary DNA revealed heterozygous transcripts lacking exon 19 in two families and lacking exons 20 through 22 in a third family. Genomic sequencing identified three distinct in-frame deletions that cosegregated with disease. These deletions, located within a region encoding an autoinhibitory domain, result in protein products with constitutive phospholipase activity. PLCG2-expressing cells had diminished cellular signaling at 37°C but enhanced signaling at subphysiologic temperatures. Conclusions Genomic deletions in PLCG2 cause gain of PLCγ2 function, leading to signaling abnormalities in multiple leukocyte subsets and a phenotype encompassing both excessive and deficient immune function. (Funded by the National Institutes of Health Intramural Research Programs and others.) PMID:22236196

  7. Filovirus RefSeq Entries: Evaluation and Selection of Filovirus Type Variants, Type Sequences, and Names

    PubMed Central

    Kuhn, Jens H.; Andersen, Kristian G.; Bào, Yīmíng; Bavari, Sina; Becker, Stephan; Bennett, Richard S.; Bergman, Nicholas H.; Blinkova, Olga; Bradfute, Steven; Brister, J. Rodney; Bukreyev, Alexander; Chandran, Kartik; Chepurnov, Alexander A.; Davey, Robert A.; Dietzgen, Ralf G.; Doggett, Norman A.; Dolnik, Olga; Dye, John M.; Enterlein, Sven; Fenimore, Paul W.; Formenty, Pierre; Freiberg, Alexander N.; Garry, Robert F.; Garza, Nicole L.; Gire, Stephen K.; Gonzalez, Jean-Paul; Griffiths, Anthony; Happi, Christian T.; Hensley, Lisa E.; Herbert, Andrew S.; Hevey, Michael C.; Hoenen, Thomas; Honko, Anna N.; Ignatyev, Georgy M.; Jahrling, Peter B.; Johnson, Joshua C.; Johnson, Karl M.; Kindrachuk, Jason; Klenk, Hans-Dieter; Kobinger, Gary; Kochel, Tadeusz J.; Lackemeyer, Matthew G.; Lackner, Daniel F.; Leroy, Eric M.; Lever, Mark S.; Mühlberger, Elke; Netesov, Sergey V.; Olinger, Gene G.; Omilabu, Sunday A.; Palacios, Gustavo; Panchal, Rekha G.; Park, Daniel J.; Patterson, Jean L.; Paweska, Janusz T.; Peters, Clarence J.; Pettitt, James; Pitt, Louise; Radoshitzky, Sheli R.; Ryabchikova, Elena I.; Saphire, Erica Ollmann; Sabeti, Pardis C.; Sealfon, Rachel; Shestopalov, Aleksandr M.; Smither, Sophie J.; Sullivan, Nancy J.; Swanepoel, Robert; Takada, Ayato; Towner, Jonathan S.; van der Groen, Guido; Volchkov, Viktor E.; Volchkova, Valentina A.; Wahl-Jensen, Victoria; Warren, Travis K.; Warfield, Kelly L.; Weidmann, Manfred; Nichol, Stuart T.

    2014-01-01

    Sequence determination of complete or coding-complete genomes of viruses is becoming common practice for supporting the work of epidemiologists, ecologists, virologists, and taxonomists. Sequencing duration and costs are rapidly decreasing, sequencing hardware is under modification for use by non-experts, and software is constantly being improved to simplify sequence data management and analysis. Thus, analysis of virus disease outbreaks on the molecular level is now feasible, including characterization of the evolution of individual virus populations in single patients over time. The increasing accumulation of sequencing data creates a management problem for the curators of commonly used sequence databases and an entry retrieval problem for end users. Therefore, utilizing the data to their fullest potential will require setting nomenclature and annotation standards for virus isolates and associated genomic sequences. The National Center for Biotechnology Information’s (NCBI’s) RefSeq is a non-redundant, curated database for reference (or type) nucleotide sequence records that supplies source data to numerous other databases. Building on recently proposed templates for filovirus variant naming [ ()////-], we report consensus decisions from a majority of past and currently active filovirus experts on the eight filovirus type variants and isolates to be represented in RefSeq, their final designations, and their associated sequences. PMID:25256396

  8. Post-mortem whole-exome analysis in a large sudden infant death syndrome cohort with a focus on cardiovascular and metabolic genetic diseases.

    PubMed

    Neubauer, Jacqueline; Lecca, Maria Rita; Russo, Giancarlo; Bartsch, Christine; Medeiros-Domingo, Argelia; Berger, Wolfgang; Haas, Cordula

    2017-04-01

    Sudden infant death syndrome (SIDS) is described as the sudden and unexplained death of an apparently healthy infant younger than one year of age. Genetic studies indicate that up to 35% of SIDS cases might be explained by familial or genetic diseases such as cardiomyopathies, ion channelopathies or metabolic disorders that remained undetected during conventional forensic autopsy procedures. Post-mortem genetic testing by using massive parallel sequencing (MPS) approaches represents an efficient and rapid tool to further investigate unexplained death cases and might help to elucidate pathogenic genetic variants and mechanisms in cases without a conclusive cause of death. In this study, we performed whole-exome sequencing (WES) in 161 European SIDS infants with focus on 192 genes associated with cardiovascular and metabolic diseases. Potentially causative variants were detected in 20% of the SIDS cases. The majority of infants had variants with likely functional effects in genes associated with channelopathies (9%), followed by cardiomyopathies (7%) and metabolic diseases (1%). Although lethal arrhythmia represents the most plausible and likely cause of death, the majority of SIDS cases still remains elusive and might be explained by a multifactorial etiology, triggered by a combination of different genetic and environmental risk factors. As WES is not substantially more expensive than a targeted sequencing approach, it represents an unbiased screening of the exome, which could help to investigate different pathogenic mechanisms within the genetically heterogeneous SIDS cohort. Additionally, re-analysis of the datasets provides the basis to identify new candidate genes in sudden infant death.

  9. Complete genome sequence of a new begomovirus associated with yellow mosaic disease of Hemidesmus indicus in India.

    PubMed

    Reddy, M Sreekanth; Kanakala, S; Srinivas, K P; Hema, M; Malathi, V G; Sreenivasulu, P

    2014-05-01

    The complete DNA A genome of a virus isolate associated with yellow mosaic disease of a medicinal plant, Hemidesmus indicus, from India was cloned and sequenced. The length of DNA A was 2825 nucleotides, 35 nucleotides longer than the unit genome of monopartite begomoviruses. Comparison of the nucleotide sequence of DNA A of the virus isolate with those of other begomoviruses showed maximum sequence identity of 69 % to DNA A of ageratum yellow vein China virus (AYVCNV; AJ558120) and 68 % with tomato yellow leaf curl virus- LBa4 (TYLCV; EF185318), and it formed a distinct clade in phylogenetic analysis. The genome organization of the present virus isolate was found to be similar to that of Old World monopartite begomoviruses. The genome was considered to be monopartite, because association of DNA B and β satellite DNA components was not detected. Based on its sequence identity (<70 %) to all other begomoviruses known to date and ICTV (International Committee on Taxonomy of Viruses) species demarcating criteria (<89 % identity), it is considered a member of a novel begomovirus species, and the tentative name "Hemidesmus yellow mosaic virus" (HeYMV) is proposed.

  10. Congenital Heart Disease: Causes, Diagnosis, Symptoms, and Treatments.

    PubMed

    Sun, RongRong; Liu, Min; Lu, Lei; Zheng, Yi; Zhang, Peiying

    2015-07-01

    The congenital heart disease includes abnormalities in heart structure that occur before birth. Such defects occur in the fetus while it is developing in the uterus during pregnancy. About 500,000 adults have congenital heart disease in USA (WebMD, Congenital heart defects medications, www.WebMD.com/heart-disease/tc/congenital-heart-defects-medications , 2014). 1 in every 100 children has defects in their heart due to genetic or chromosomal abnormalities, such as Down syndrome. The excessive alcohol consumption during pregnancy and use of medications, maternal viral infection, such as Rubella virus, measles (German), in the first trimester of pregnancy, all these are risk factors for congenital heart disease in children, and the risk increases if parent or sibling has a congenital heart defect. These are heart valves defects, atrial and ventricular septa defects, stenosis, the heart muscle abnormalities, and a hole inside wall of the heart which causes defect in blood circulation, heart failure, and eventual death. There are no particular symptoms of congenital heart disease, but shortness of breath and limited ability to do exercise, fatigue, abnormal sound of heart as heart murmur, which is diagnosed by a physician while listening to the heart beats. The echocardiogram or transesophageal echocardiogram, electrocardiogram, chest X-ray, cardiac catheterization, and MRI methods are used to detect congenital heart disease. Several medications are given depending on the severity of this disease, and catheter method and surgery are required for serious cases to repair heart valves or heart transplantation as in endocarditis. For genetic study, first DNA is extracted from blood followed by DNA sequence analysis and any defect in nucleotide sequence of DNA is determined. For congenital heart disease, genes in chromosome 1 show some defects in nucleotide sequence. In this review the causes, diagnosis, symptoms, and treatments of congenital heart disease are described.

  11. High-Throughput Sequencing, a Versatile Weapon to Support Genome-Based Diagnosis in Infectious Diseases: Applications to Clinical Bacteriology

    PubMed Central

    Caboche, Ségolène; Audebert, Christophe; Hot, David

    2014-01-01

    The recent progresses of high-throughput sequencing (HTS) technologies enable easy and cost-reduced access to whole genome sequencing (WGS) or re-sequencing. HTS associated with adapted, automatic and fast bioinformatics solutions for sequencing applications promises an accurate and timely identification and characterization of pathogenic agents. Many studies have demonstrated that data obtained from HTS analysis have allowed genome-based diagnosis, which has been consistent with phenotypic observations. These proofs of concept are probably the first steps toward the future of clinical microbiology. From concept to routine use, many parameters need to be considered to promote HTS as a powerful tool to help physicians and clinicians in microbiological investigations. This review highlights the milestones to be completed toward this purpose. PMID:25437800

  12. Winnowing DNA for Rare Sequences: Highly Specific Sequence and Methylation Based Enrichment

    PubMed Central

    Thompson, Jason D.; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

    2012-01-01

    Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue. PMID:22355378

  13. An image-based model of brain volume biomarker changes in Huntington's disease.

    PubMed

    Wijeratne, Peter A; Young, Alexandra L; Oxtoby, Neil P; Marinescu, Razvan V; Firth, Nicholas C; Johnson, Eileanoir B; Mohan, Amrita; Sampaio, Cristina; Scahill, Rachael I; Tabrizi, Sarah J; Alexander, Daniel C

    2018-05-01

    Determining the sequence in which Huntington's disease biomarkers become abnormal can provide important insights into the disease progression and a quantitative tool for patient stratification. Here, we construct and present a uniquely fine-grained model of temporal progression of Huntington's disease from premanifest through to manifest stages. We employ a probabilistic event-based model to determine the sequence of appearance of atrophy in brain volumes, learned from structural MRI in the Track-HD study, as well as to estimate the uncertainty in the ordering. We use longitudinal and phenotypic data to demonstrate the utility of the patient staging system that the resulting model provides. The model recovers the following order of detectable changes in brain region volumes: putamen, caudate, pallidum, insula white matter, nonventricular cerebrospinal fluid, amygdala, optic chiasm, third ventricle, posterior insula, and basal forebrain. This ordering is mostly preserved even under cross-validation of the uncertainty in the event sequence. Longitudinal analysis performed using 6 years of follow-up data from baseline confirms efficacy of the model, as subjects consistently move to later stages with time, and significant correlations are observed between the estimated stages and nonimaging phenotypic markers. We used a data-driven method to provide new insight into Huntington's disease progression as well as new power to stage and predict conversion. Our results highlight the potential of disease progression models, such as the event-based model, to provide new insight into Huntington's disease progression and to support fine-grained patient stratification for future precision medicine in Huntington's disease.

  14. Hepatitis E virus genotype 3 diversity: phylogenetic analysis and presence of subtype 3b in wild boar in Europe.

    PubMed

    Vina-Rodriguez, Ariel; Schlosser, Josephine; Becher, Dietmar; Kaden, Volker; Groschup, Martin H; Eiden, Martin

    2015-05-22

    An increasing number of indigenous cases of hepatitis E caused by genotype 3 viruses (HEV-3) have been diagnosed all around the word, particularly in industrialized countries. Hepatitis E is a zoonotic disease and accumulating evidence indicates that domestic pigs and wild boars are the main reservoirs of HEV-3. A detailed analysis of HEV-3 subtypes could help to determine the interplay of human activity, the role of animals as reservoirs and cross species transmission. Although complete genome sequences are most appropriate for HEV subtype determination, in most cases only partial genomic sequences are available. We therefore carried out a subtype classification analysis, which uses regions from all three open reading frames of the genome. Using this approach, more than 1000 published HEV-3 isolates were subtyped. Newly recovered HEV partial sequences from hunted German wild boars were also included in this study. These sequences were assigned to genotype 3 and clustered within subtype 3a, 3i and, unexpectedly, one of them within the subtype 3b, a first non-human report of this subtype in Europe.

  15. Sequence, structure and function relationships in flaviviruses as assessed by evolutive aspects of its conserved non-structural protein domains.

    PubMed

    da Fonseca, Néli José; Lima Afonso, Marcelo Querino; Pedersolli, Natan Gonçalves; de Oliveira, Lucas Carrijo; Andrade, Dhiego Souto; Bleicher, Lucas

    2017-10-28

    Flaviviruses are responsible for serious diseases such as dengue, yellow fever, and zika fever. Their genomes encode a polyprotein which, after cleavage, results in three structural and seven non-structural proteins. Homologous proteins can be studied by conservation and coevolution analysis as detected in multiple sequence alignments, usually reporting positions which are strictly necessary for the structure and/or function of all members in a protein family or which are involved in a specific sub-class feature requiring the coevolution of residue sets. This study provides a complete conservation and coevolution analysis on all flaviviruses non-structural proteins, with results mapped on all well-annotated available sequences. A literature review on the residues found in the analysis enabled us to compile available information on their roles and distribution among different flaviviruses. Also, we provide the mapping of conserved and coevolved residues for all sequences currently in SwissProt as a supplementary material, so that particularities in different viruses can be easily analyzed. Copyright © 2017 Elsevier Inc. All rights reserved.

  16. Analysis of the ergosterol biosynthesis pathway cloning, molecular characterization and phylogeny of lanosterol 14 α-demethylase (ERG11) gene of Moniliophthora perniciosa.

    PubMed

    de Oliveira Ceita, Geruza; Vilas-Boas, Laurival Antônio; Castilho, Marcelo Santos; Carazzolle, Marcelo Falsarella; Pirovani, Carlos Priminho; Selbach-Schnadelbach, Alessandra; Gramacho, Karina Peres; Ramos, Pablo Ivan Pereira; Barbosa, Luciana Veiga; Pereira, Gonçalo Amarante Guimarães; Góes-Neto, Aristóteles

    2014-10-01

    The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches' broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea.

  17. Analysis of the ergosterol biosynthesis pathway cloning, molecular characterization and phylogeny of lanosterol 14 α-demethylase (ERG11) gene of Moniliophthora perniciosa

    PubMed Central

    de Oliveira Ceita, Geruza; Vilas-Boas, Laurival Antônio; Castilho, Marcelo Santos; Carazzolle, Marcelo Falsarella; Pirovani, Carlos Priminho; Selbach-Schnadelbach, Alessandra; Gramacho, Karina Peres; Ramos, Pablo Ivan Pereira; Barbosa, Luciana Veiga; Pereira, Gonçalo Amarante Guimarães; Góes-Neto, Aristóteles

    2014-01-01

    The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches’ broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea. PMID:25505843

  18. Differential expression of genes encoding anti-oxidant enzymes in Sydney rock oysters, Saccostrea glomerata (Gould) selected for disease resistance.

    PubMed

    Green, Timothy J; Dixon, Tom J; Devic, Emilie; Adlard, Robert D; Barnes, Andrew C

    2009-05-01

    Sydney rock oysters (Saccostrea glomerata) selectively bred for disease resistance (R) and wild-caught control oysters (W) were exposed to a field infection of disseminating neoplasia. Cumulative mortality of W oysters (31.7%) was significantly greater than R oysters (0.0%) over the 118 days of the experiment. In an attempt to understand the biochemical and molecular pathways involved in disease resistance, differentially expressed sequence tags (ESTs) between R and W S. glomerata hemocytes were identified using the PCR technique, suppression subtractive hybridisation (SSH). Sequencing of 300 clones from two SSH libraries revealed 183 distinct sequences of which 113 shared high similarity to sequences in the public databases. Putative function could be assigned to 64 of the sequences. Expression of nine ESTs homologous to genes previously shown to be involved in bivalve immunity was further studied using quantitative reverse-transcriptase PCR (qRT-PCR). The base-line expression of an extracellular superoxide dismutase (ecSOD) and a small heat shock protein (sHsP) were significantly increased, whilst peroxiredoxin 6 (Prx6) and interferon inhibiting cytokine factor (IK) were significantly decreased in R oysters. From these results it was hypothesised that R oysters would be able to generate the anti-parasitic compound, hydrogen peroxide (H(2)O(2)) faster and to higher concentrations during respiratory burst due to the differential expression of genes for the two anti-oxidant enzymes of ecSOD and Prx6. To investigate this hypothesis, protein extracts from hemolymph were analysed for oxidative burst enzyme activity. Analysis of the cell free hemolymph proteins separated by native-polyacrylamide gel electrophoresis (PAGE) failed to detect true superoxide dismutase (SOD) activity by assaying dismutation of superoxide anion in zymograms. However, the ecSOD enzyme appears to generate hydrogen peroxide, presumably via another process, which is yet to be elucidated. This corroborates our hypothesis, whilst phylogenetic analysis of the complete coding sequence (CDS) of the S. glomerata ecSOD gene is supportive of the atypical nature of the ecSOD enzyme. Results obtained from this work further the current understanding of the molecular mechanisms involved in resistance to disease in this economically important bivalve, and shed further light on the anomalous oxidative processes involved.

  19. Evaluation of microbial community in hydrothermal field by direct DNA sequencing

    NASA Astrophysics Data System (ADS)

    Kawarabayasi, Y.; Maruyama, A.

    2002-12-01

    Many extremophiles have been discovered from terrestrial and marine hydrothermal fields. Some thermophiles can grow beyond 90°C in culture, while direct microscopic analysis occasionally indicates that microbes may survive in much hotter hydrothermal fluids. However, it is very difficult to isolate and cultivate such microbes from the environments, i.e., over 99% of total microbes remains undiscovered. Based on experiences of entire microbial genome analysis (Y.K.) and microbial community analysis (A.M.), we started to find out unique microbes/genes in hydrothermal fields through direct sequencing of environmental DNA fragments. At first, shotgun plasmid libraries were directly constructed with the DNA molecules prepared from mixed microbes collected by an in situ filtration system from low-temperature fluids at RM24 in the Southern East Pacific Rise (S-EPR). A gene amplification (PCR) technique was not used for preventing mutation in the process. The nucleotide sequences of 285 clones indicated that no sequence had identical data in public databases. Among 27 clones determined entire sequences, no ORF was identified on 14 clones like intron in Eukaryote. On four clones, tetra-nucleotide-long multiple tandem repetitive sequences were identified. This type of sequence was identified in some familiar disease in human. The result indicates that living/dead materials with eukaryotic features may exist in this low temperature field. Secondly, shotgun plasmid libraries were constructed from the environmental DNA prepared from Beppu hot springs. In randomly-selected 143 clones used for sequencing, no known sequence was identified. Unlike the clones in S-EPR library, clear ORFs were identified on all nine clones determined the entire sequence. It was found that one clone, H4052, contained the complete Aspartyl-tRNA synthetase. Phylogenetic analysis using amino acid sequences of this gene indicated that this gene was separated from other Euryarchaea before the differentiation of species. Thus, some novel archaeal species are expected to be in this field. The present direct cloning and sequencing technique is now opening a window to the new world in hydrothermal microbial community analysis.

  20. Genetic characterization of poxviruses in Camelus dromedarius in Ethiopia, 2011-2014.

    PubMed

    Gelaye, Esayas; Achenbach, Jenna Elizabeth; Ayelet, Gelagay; Jenberie, Shiferaw; Yami, Martha; Grabherr, Reingard; Loitsch, Angelika; Diallo, Adama; Lamien, Charles Euloge

    2016-10-01

    Camelpox and camel contagious ecthyma are infectious viral diseases of camelids caused by camelpox virus (CMLV) and camel contagious ecthyma virus (CCEV), respectively. Even though, in Ethiopia, pox disease has been creating significant economic losses in camel production, little is known on the responsible pathogens and their genetic diversity. Thus, the present study aimed at isolation, identification and genetic characterization of the causative viruses. Accordingly, clinical case observations, infectious virus isolation, and molecular and phylogenetic analysis of poxviruses infecting camels in three regions and six districts in the country, Afar (Chifra), Oromia (Arero, Miyu and Yabello) and Somali (Gursum and Jijiga) between 2011 and 2014 were undertaken. The full hemagglutinin (HA) and partial A-type inclusion protein (ATIP) genes of CMLV and full major envelope protein (B2L) gene of CCEV of Ethiopian isolates were sequenced, analyzed and compared among each other and to foreign isolates. The viral isolation confirmed the presence of infectious poxviruses. The preliminary screening by PCR showed 27 CMLVs and 20 CCEVs. The sequence analyses showed that the HA and ATIP gene sequences are highly conserved within the local isolates of CMLVs, and formed a single cluster together with isolates from Somalia and Syria. Unlike CMLVs, the B2L gene analysis of Ethiopian CCEV showed few genetic variations. The phylogenetic analysis revealed three clusters of CCEV in Ethiopia with the isolates clustering according to their geographical origins. To our knowledge, this is the first report indicating the existence of CCEV in Ethiopia where camel contagious ecthyma was misdiagnosed as camelpox. Additionally, this study has also disclosed the existence of co-infections with CMLV and CCEV. A comprehensive characterization of poxviruses affecting camels in Ethiopia and the full genome sequencing of representative isolates are recommended to better understand the dynamics of pox diseases of camels and to assist in the implementation of more efficient control measures. Copyright © 2016 Elsevier B.V. All rights reserved.

  1. 2015 Epidemic of Severe Streptococcus agalactiae Sequence Type 283 Infections in Singapore Associated With the Consumption of Raw Freshwater Fish: A Detailed Analysis of Clinical, Epidemiological, and Bacterial Sequencing Data.

    PubMed

    Kalimuddin, Shirin; Chen, Swaine L; Lim, Cindy T K; Koh, Tse Hsien; Tan, Thean Yen; Kam, Michelle; Wong, Christopher W; Mehershahi, Kurosh S; Chau, Man Ling; Ng, Lee Ching; Tang, Wen Ying; Badaruddin, Hishamuddin; Teo, Jeanette; Apisarnthanarak, Anucha; Suwantarat, Nuntra; Ip, Margaret; Holden, Matthew T G; Hsu, Li Yang; Barkham, Timothy

    2017-05-15

    Streptococcus agalactiae (group B Streptococcus [GBS]) has not been described as a foodborne pathogen. However, in 2015, a large outbreak of severe invasive sequence type (ST) 283 GBS infections in adults epidemiologically linked to the consumption of raw freshwater fish occurred in Singapore. We attempted to determine the scale of the outbreak, define the clinical spectrum of disease, and link the outbreak to contaminated fish. Time-series analysis was performed on microbiology laboratory data. Food handlers and fishmongers were screened for enteric carriage of GBS. A retrospective cohort study was conducted to assess differences in demographic and clinical characteristics of patients with invasive ST283 and non-ST283 infections. Whole-genome sequencing was performed on human and fish ST283 isolates from Singapore, Thailand, and Hong Kong. The outbreak was estimated to have started in late January 2015. Within the study cohort of 408 patients, ST283 accounted for 35.8% of cases. Patients with ST283 infection were younger and had fewer comorbidities but were more likely to develop meningoencephalitis, septic arthritis, and spinal infection. Of 82 food handlers and fishmongers screened, none carried ST283. Culture of 43 fish samples yielded 13 ST283-positive samples. Phylogenomic analysis of 161 ST283 isolates from humans and fish revealed they formed a tight clade distinguished by 93 single-nucleotide polymorphisms. ST283 is a zoonotic GBS clone associated with farmed freshwater fish, capable of causing severe disease in humans. It caused a large foodborne outbreak in Singapore and poses both a regional and potentially more widespread threat. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.

  2. GUTSS: An Alignment-Free Sequence Comparison Method for Use in Human Intestinal Microbiome and Fecal Microbiota Transplantation Analysis.

    PubMed

    Brittnacher, Mitchell J; Heltshe, Sonya L; Hayden, Hillary S; Radey, Matthew C; Weiss, Eli J; Damman, Christopher J; Zisman, Timothy L; Suskind, David L; Miller, Samuel I

    2016-01-01

    Comparative analysis of gut microbiomes in clinical studies of human diseases typically rely on identification and quantification of species or genes. In addition to exploring specific functional characteristics of the microbiome and potential significance of species diversity or expansion, microbiome similarity is also calculated to study change in response to therapies directed at altering the microbiome. Established ecological measures of similarity can be constructed from species abundances, however methods for calculating these commonly used ecological measures of similarity directly from whole genome shotgun (WGS) metagenomic sequence are lacking. We present an alignment-free method for calculating similarity of WGS metagenomic sequences that is analogous to the Bray-Curtis index for species, implemented by the General Utility for Testing Sequence Similarity (GUTSS) software application. This method was applied to intestinal microbiomes of healthy young children to measure developmental changes toward an adult microbiome during the first 3 years of life. We also calculate similarity of donor and recipient microbiomes to measure establishment, or engraftment, of donor microbiota in fecal microbiota transplantation (FMT) studies focused on mild to moderate Crohn's disease. We show how a relative index of similarity to donor can be calculated as a measure of change in a patient's microbiome toward that of the donor in response to FMT. Because clinical efficacy of the transplant procedure cannot be fully evaluated without analysis methods to quantify actual FMT engraftment, we developed a method for detecting change in the gut microbiome that is independent of species identification and database bias, sensitive to changes in relative abundance of the microbial constituents, and can be formulated as an index for correlating engraftment success with clinical measures of disease. More generally, this method may be applied to clinical evaluation of human microbiomes and provide potential diagnostic determination of individuals who may be candidates for specific therapies directed at alteration of the microbiome.

  3. CFTR gene mutations in isolated chronic obstructive pulmonary disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pignatti, P.F.; Bombien, C.; Marigo, C.

    1994-09-01

    In order to identify a possible hereditary predisposition to the development of chronic obstructive pulmonary disease (COPD), we have looked for the presence of cystic fibrosis transmembrane regulator (CFTR) gene DNA sequence modifications in 28 unrelated patients with no signs of cystic fibrosis. The known mutations in Italian CF patients, as well as the most frequent worldwide CF mutations, were investigated. In addition, a denaturing gradient gel electrophoresis analysis of about half of the coding sequence of the gene in 56 chromosomes from the patients and in 102 chromosomes from control individuals affected by other pulmonary diseases and from normalmore » controls was performed. Nine different CFTR gene mutations and polymorphisms were found in seven patients, a highly significant increase over controls. Two of the patients were compound heterozygotes. Two frequent CF mutations were detected: deletion F508 and R117H; two rare CF mutations: R1066C and 3667ins4; and five CF sequence variants: R75Q (which was also described as a disease-causing mutation in male sterility cases due to the absence of the vasa deferentia), G576A, 2736 A{r_arrow}G, L997F, and 3271+18C{r_arrow}T. Seven (78%) of the mutations are localized in transmembrane domains. Six (86%) of the patients with defined mutations and polymorphisms had bronchiectasis. These results indicate that CFTR gene mutations and sequence alterations may be involved in the etiopathogenesis of some cases of COPD.« less

  4. Mitochondrial DNA sequence analysis of four Alzheimer`s and Parkinson`s disease patients

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brown, M.D.; Shoffner, J.M.; Wallace, D.C.

    1996-01-22

    The mitochondrial DNA (mtDNA) sequence was determined on 3 patients with Alzheimer`s disease (AD) exhibiting AD plus Parkinson`s disease (PD) neuropathologic changes and one patient with PD. Patient mtDNA sequences were compared to the standard Cambridge sequence to identify base changes. In the first AD + PD patient, 2 of the 15 nucleotide substitutions may contribute to the neuropathology, a nucleotide pair (np) 4336 transition in the tRNA{sup Gln} gene found 7.4 times more frequently in patients than in controls, and a unique np 721 transition in the 12S rRNA gene which was not found in 70 other patients ormore » 905 controls. In the second AD + PD patient, 27 nucleotide substitutions were detected, including an np 3397 transition in the ND1 gene which converts a conserved methionine to a valine. In the third AD + PD patient, 2 polymorphic base substitutions frequently found at increased frequency in Leber`s hereditary optic neuropathy patients were observed, an np 4216 transition in ND1 and an np 13708 transition in the ND5 gene. For the PD patient, 2 novel variants were observed among 25 base substitutions, an np 1709 substitution in the 16S rRNA gene and an np 15851 missense mutation in the cytb gene. Further studies will be required to demonstrate a casual role for these base substitutions in neurodegenerative disease. 68 refs., 2 tabs.« less

  5. Towards pathogenomics: a web-based resource for pathogenicity islands

    PubMed Central

    Yoon, Sung Ho; Park, Young-Kyu; Lee, Soohyun; Choi, Doil; Oh, Tae Kwang; Hur, Cheol-Goo; Kim, Jihyun F.

    2007-01-01

    Pathogenicity islands (PAIs) are genetic elements whose products are essential to the process of disease development. They have been horizontally (laterally) transferred from other microbes and are important in evolution of pathogenesis. In this study, a comprehensive database and search engines specialized for PAIs were established. The pathogenicity island database (PAIDB) is a comprehensive relational database of all the reported PAIs and potential PAI regions which were predicted by a method that combines feature-based analysis and similarity-based analysis. Also, using the PAI Finder search application, a multi-sequence query can be analyzed onsite for the presence of potential PAIs. As of April 2006, PAIDB contains 112 types of PAIs and 889 GenBank accessions containing either partial or all PAI loci previously reported in the literature, which are present in 497 strains of pathogenic bacteria. The database also offers 310 candidate PAIs predicted from 118 sequenced prokaryotic genomes. With the increasing number of prokaryotic genomes without functional inference and sequenced genetic regions of suspected involvement in diseases, this web-based, user-friendly resource has the potential to be of significant use in pathogenomics. PAIDB is freely accessible at . PMID:17090594

  6. Multi-tissue transcriptomics for construction of a comprehensive gene resource for the terrestrial snail Theba pisana.

    PubMed

    Zhao, M; Wang, T; Adamson, K J; Storey, K B; Cummins, S F

    2016-02-08

    The land snail Theba pisana is native to the Mediterranean region but has become one of the most abundant invasive species worldwide. Here, we present three transcriptomes of this agriculture pest derived from three tissues: the central nervous system, hepatopancreas (digestive gland), and foot muscle. Sequencing of the three tissues produced 339,479,092 high quality reads and a global de novo assembly generated a total of 250,848 unique transcripts (unigenes). BLAST analysis mapped 52,590 unigenes to NCBI non-redundant protein databases and further functional analysis annotated 21,849 unigenes with gene ontology. We report that T. pisana transcripts have representatives in all functional classes and a comparison of differentially expressed transcripts amongst all three tissues demonstrates enormous differences in their potential metabolic activities. The genes differentially expressed include those with sequence similarity to those genes associated with multiple bacterial diseases and neurological diseases. To provide a valuable resource that will assist functional genomics study, we have implemented a user-friendly web interface, ThebaDB (http://thebadb.bioinfo-minzhao.org/). This online database allows for complex text queries, sequence searches, and data browsing by enriched functional terms and KEGG mapping.

  7. Investigation of a Possible Link Between Vaccination and the 2010 Sheep Pox Epizootic in Morocco.

    PubMed

    Haegeman, A; Zro, K; Sammin, D; Vandenbussche, F; Ennaji, M M; De Clercq, K

    2016-12-01

    Sheep pox is endemic in most parts of Northern Africa and has the potential to cause severe economic problems. Live attenuated vaccines are used in Morocco, and in many other countries, to control the disease. Sheep pox virus (SPPV) re-appeared in 2010 causing a nodular clinical form previously not observed in Morocco. The severe clinical signs observed during the course of this outbreak and initial reports citing similarity in nucleotide sequence between the Moroccan vaccine strain and field isolates warranted a more in depth analysis of this epizootic. In this study, sequence analysis showed that isolates obtained from four provinces of eastern Morocco were identical, demonstrating that a single SPPV strain was responsible for the 2010 epizootic. In addition, the genome fragments sequenced and phylogenetic analyses undertaken as part of this study showed significant differences between field isolates and the Moroccan vaccine strain. New PCR methods were developed to differentiate between wild-type isolates and vaccine strains of SPPV. Using these methods, no trace of wild-type SPPV was found in the vaccine and no evidence was found to suggest that the vaccine strain was causing clinical disease. © 2015 Blackwell Verlag GmbH.

  8. Mutation analysis of the chromosome 14q24.3 dihydrolipoyl succinyltransferase (DLST) gene in patients with early-onset Alzheimer disease.

    PubMed

    Cruts, M; Backhovens, H; Van Gassen, G; Theuns, J; Wang, S Y; Wehnert, A; van Duijn, C M; Karlsson, T; Hofman, A; Adolfsson, R

    1995-10-13

    Linkage analysis studies have indicated that the chromosome band 14q24.3 harbours a major gene for familial early-onset Alzheimer's disease (AD). Recently we localized the chromosome 14 AD gene (AD3) in the 6.4 cM interval between the markers D14S289 and D14S61. We mapped the gene encoding dihydrolipoyl succinyltransferase (DLST), the E2k component of human alpha-ketoglutarate dehydrogenase complex (KGDHC), in the AD3 candidate region using yeast artificial chromosomes (YACs). The DLST gene is a candidate for the AD3 gene since deficiencies in KGDHC activity have been observed in brain tissue and fibroblasts of AD patients. The 15 exons and the promoter region of the DLST gene were analysed for mutations in chromosome 14 linked AD cases and in two series of unrelated early-onset AD cases (onset age < 55 years). Sequence variations in intronic sequences (introns 3, 5 and 10) or silent mutations in exonic sequences (exons 8 and 14) were identified. However, no AD related mutations were observed, suggesting that the DLST gene is not the chromosome 14 AD3 gene.

  9. CAPA-Gene Products in the Haematophagous Sandfly Phlebotomus papatasi (Scopoli) - Vector for Leishmaniasis Disease

    DTIC Science & Technology

    2012-01-01

    121 glass capillary and the tissues were air-dried. For peptide analysis, 122 a limited amount of matrix solution (-cyano-4-hydroxycinnamic 123 acid ...genomic sequence of P. papatasi was screened with 143 the amino acid sequence RSGNMGLFPFPRVGR using TBLASTN. 144 The genomic data were produced by The...250 have the N-terminus of CAPA-PVK-2 blocked by pyroglutamate 251 (see Table 1). Pyroglutamate may prevent rapid degradation of this 252 peptide

  10. [Analysis of the molecular characteristics and cloning of full-length coding sequence of interleukin-2 in tree shrews].

    PubMed

    Huang, Xiao-Yan; Li, Ming-Li; Xu, Juan; Gao, Yue-Dong; Wang, Wen-Guang; Yin, An-Guo; Li, Xiao-Fei; Sun, Xiao-Mei; Xia, Xue-Shan; Dai, Jie-Jie

    2013-04-01

    While the tree shrew (Tupaia belangeri chinensis) is an excellent animal model for studying the mechanisms of human diseases, but few studies examine interleukin-2 (IL-2), an important immune factor in disease model evaluation. In this study, a 465 bp of the full-length IL-2 cDNA encoding sequence was cloned from the RNA of tree shrew spleen lymphocytes, which were then cultivated and stimulated with ConA (concanavalin). Clustal W 2.0 was used to compare and analyze the sequence and molecular characteristics, and establish the similarity of the overall structure of IL-2 between tree shrews and other mammals. The homology of the IL-2 nucleotide sequence between tree shrews and humans was 93%, and the amino acid homology was 80%. The phylogenetic tree results, derived through the Neighbour-Joining method using MEGA5.0, indicated a close genetic relationship between tree shrews, Homo sapiens, and Macaca mulatta. The three-dimensional structure analysis showed that the surface charges in most regions of tree shrew IL-2 were similar to between tree shrews and humans; however, the N-glycosylation sites and local structures were different, which may affect antibody binding. These results provide a fundamental basis for the future study of IL-2 monoclonal antibody in tree shrews, thereby improving their utility as a model.

  11. Variation of amino acid sequences of serum amyloid a (SAA) and immunohistochemical analysis of amyloid a (AA) in Japanese domestic cats.

    PubMed

    Tei, Meina; Uchida, Kazuyuki; Chambers, James K; Watanabe, Ken-Ichi; Tamamoto, Takashi; Ohno, Koichi; Nakayama, Hiroyuki

    2018-02-02

    Amyloid A (AA) amyloidosis, a fatal systemic amyloid disease, occurs secondary to chronic inflammatory conditions in humans. Although persistently elevated serum amyloid A (SAA) levels are required for its pathogenesis, not all individuals with chronic inflammation necessarily develop AA amyloidosis. Furthermore, many diseases in cats are associated with the elevated production of SAA, whereas only a small number actually develop AA amyloidosis. We hypothesized that a genetic mutation in the SAA gene may strongly contribute to the pathogenesis of feline AA amyloidosis. In the present study, genomic DNA from four Japanese domestic cats (JDCs) with AA amyloidosis and from five without amyloidosis was analyzed using polymerase chain reaction (PCR) amplification and direct sequencing. We identified the novel variation combination of 45R-51A in the deduced amino acid sequences of four JDCs with amyloidosis and five without. However, there was no relationship between amino acid variations and the distribution of AA amyloid deposits, indicating that differences in SAA sequences do not contribute to the pathogenesis of AA amyloidosis. Immunohistochemical analysis using antisera against the three different parts of the feline SAA protein-i.e., the N-terminal, central, and C-terminal regions-revealed that feline AA contained the C-terminus, unlike human AA. These results indicate that the cleavage and degradation of the C-terminus are not essential for amyloid fibril formation in JDCs.

  12. Bi-exponential T2 analysis of healthy and diseased Achilles tendons: an in vivo preliminary magnetic resonance study and correlation with clinical score.

    PubMed

    Juras, Vladimir; Apprich, Sebastian; Szomolanyi, Pavol; Bieri, Oliver; Deligianni, Xeni; Trattnig, Siegfried

    2013-10-01

    To compare mono- and bi-exponential T2 analysis in healthy and degenerated Achilles tendons using a recently introduced magnetic resonance variable-echo-time sequence (vTE) for T2 mapping. Ten volunteers and ten patients were included in the study. A variable-echo-time sequence was used with 20 echo times. Images were post-processed with both techniques, mono- and bi-exponential [T2 m, short T2 component (T2 s) and long T2 component (T2 l)]. The number of mono- and bi-exponentially decaying pixels in each region of interest was expressed as a ratio (B/M). Patients were clinically assessed with the Achilles Tendon Rupture Score (ATRS), and these values were correlated with the T2 values. The means for both T2 m and T2 s were statistically significantly different between patients and volunteers; however, for T2 s, the P value was lower. In patients, the Pearson correlation coefficient between ATRS and T2 s was -0.816 (P = 0.007). The proposed variable-echo-time sequence can be successfully used as an alternative method to UTE sequences with some added benefits, such as a short imaging time along with relatively high resolution and minimised blurring artefacts, and minimised susceptibility artefacts and chemical shift artefacts. Bi-exponential T2 calculation is superior to mono-exponential in terms of statistical significance for the diagnosis of Achilles tendinopathy. • Magnetic resonance imaging offers new insight into healthy and diseased Achilles tendons • Bi-exponential T2 calculation in Achilles tendons is more beneficial than mono-exponential • A short T2 component correlates strongly with clinical score • Variable echo time sequences successfully used instead of ultrashort echo time sequences.

  13. Hepatitis delta genotypes in chronic delta infection in the northeast of Spain (Catalonia).

    PubMed

    Cotrina, M; Buti, M; Jardi, R; Quer, J; Rodriguez, F; Pascual, C; Esteban, R; Guardia, J

    1998-06-01

    Based on genetic analysis of variants obtained around the world, three genotypes of the hepatitis delta virus have been defined. Hepatitis delta virus variants have been associated with different disease patterns and geographic distributions. To determine the prevalence of hepatitis delta virus genotypes in the northeast of Spain (Catalonia) and the correlation with transmission routes and clinical disease, we studied the nucleotide divergence of the consensus sequence of HDV RNA obtained from 33 patients with chronic delta hepatitis (24 were intravenous drug users and nine had no risk factors), and four patients with acute self-limited delta infection. Serum HDV RNA was amplified by the polymerase chain reaction technique and a fragment of 350 nucleotides (nt 910 to 1259) was directly sequenced. Genetic analysis of the nucleotide consensus sequence obtained showed a high degree of conservation among sequences (93% of mean). Comparison of these sequences with those derived from different geographic areas and pertaining to genotypes I, II and III, showed a mean sequence identity of 92% with genotype I, 73% with genotype II and 61% with genotype III. At the amino acid level (aa 115 to 214), the mean identity was 87% with genotype I, 63% with genotype II and 56% with genotype III. Conserved regions included the RNA editing domain, the carboxyl terminal 19 amino acids of the hepatitis delta antigen and the polyadenylation signal of the viral mRNA. Hepatitis delta virus isolates in the northeast of Spain are exclusively genotype I, independently of the transmission route and the type of infection. No hepatitis delta virus subgenotypes were found, suggesting that the origin of hepatitis delta virus infection in our geographical area is homogeneous.

  14. Transcriptomics of In Vitro Immune-Stimulated Hemocytes from the Manila Clam Ruditapes philippinarum Using High-Throughput Sequencing

    PubMed Central

    Moreira, Rebeca; Balseiro, Pablo; Planas, Josep V.; Fuste, Berta; Beltran, Sergi; Novoa, Beatriz; Figueras, Antonio

    2012-01-01

    Background The Manila clam (Ruditapes philippinarum) is a worldwide cultured bivalve species with important commercial value. Diseases affecting this species can result in large economic losses. Because knowledge of the molecular mechanisms of the immune response in bivalves, especially clams, is scarce and fragmentary, we sequenced RNA from immune-stimulated R. philippinarum hemocytes by 454-pyrosequencing to identify genes involved in their immune defense against infectious diseases. Methodology and Principal Findings High-throughput deep sequencing of R. philippinarum using 454 pyrosequencing technology yielded 974,976 high-quality reads with an average read length of 250 bp. The reads were assembled into 51,265 contigs and the 44.7% of the translated nucleotide sequences into protein were annotated successfully. The 35 most frequently found contigs included a large number of immune-related genes, and a more detailed analysis showed the presence of putative members of several immune pathways and processes like the apoptosis, the toll like signaling pathway and the complement cascade. We have found sequences from molecules never described in bivalves before, especially in the complement pathway where almost all the components are present. Conclusions This study represents the first transcriptome analysis using 454-pyrosequencing conducted on R. philippinarum focused on its immune system. Our results will provide a rich source of data to discover and identify new genes, which will serve as a basis for microarray construction and the study of gene expression as well as for the identification of genetic markers. The discovery of new immune sequences was very productive and resulted in a large variety of contigs that may play a role in the defense mechanisms of Ruditapes philippinarum. PMID:22536348

  15. SNP-VISTA: An interactive SNP visualization tool

    PubMed Central

    Shah, Nameeta; Teplitsky, Michael V; Minovitsky, Simon; Pennacchio, Len A; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L

    2005-01-01

    Background Recent advances in sequencing technologies promise to provide a better understanding of the genetics of human disease as well as the evolution of microbial populations. Single Nucleotide Polymorphisms (SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it has become possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease in an attempt to identify causative mutations. In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples enables more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at [1]. Results We have developed and present two modifications of an interactive visualization tool, SNP-VISTA, to aid in the analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein evolutionary conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. Conclusion The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNP data by the user. PMID:16336665

  16. Whole-Genome Sequencing and Concordance Between Antimicrobial Susceptibility Genotypes and Phenotypes of Bacterial Isolates Associated with Bovine Respiratory Disease.

    PubMed

    Owen, Joseph R; Noyes, Noelle; Young, Amy E; Prince, Daniel J; Blanchard, Patricia C; Lehenbauer, Terry W; Aly, Sharif S; Davis, Jessica H; O'Rourke, Sean M; Abdo, Zaid; Belk, Keith; Miller, Michael R; Morley, Paul; Van Eenennaam, Alison L

    2017-09-07

    Extended laboratory culture and antimicrobial susceptibility testing timelines hinder rapid species identification and susceptibility profiling of bacterial pathogens associated with bovine respiratory disease, the most prevalent cause of cattle mortality in the United States. Whole-genome sequencing offers a culture-independent alternative to current bacterial identification methods, but requires a library of bacterial reference genomes for comparison. To contribute new bacterial genome assemblies and evaluate genetic diversity and variation in antimicrobial resistance genotypes, whole-genome sequencing was performed on bovine respiratory disease-associated bacterial isolates ( Histophilus somni , Mycoplasma bovis , Mannheimia haemolytica , and Pasteurella multocida ) from dairy and beef cattle. One hundred genomically distinct assemblies were added to the NCBI database, doubling the available genomic sequences for these four species. Computer-based methods identified 11 predicted antimicrobial resistance genes in three species, with none being detected in M. bovis While computer-based analysis can identify antibiotic resistance genes within whole-genome sequences (genotype), it may not predict the actual antimicrobial resistance observed in a living organism (phenotype). Antimicrobial susceptibility testing on 64 H. somni , M. haemolytica , and P. multocida isolates had an overall concordance rate between genotype and phenotypic resistance to the associated class of antimicrobials of 72.7% ( P < 0.001), showing substantial discordance. Concordance rates varied greatly among different antimicrobial, antibiotic resistance gene, and bacterial species combinations. This suggests that antimicrobial susceptibility phenotypes are needed to complement genomically predicted antibiotic resistance gene genotypes to better understand how the presence of antibiotic resistance genes within a given bacterial species could potentially impact optimal bovine respiratory disease treatment and morbidity/mortality outcomes. Copyright © 2017 Owen et al.

  17. Exome Sequencing and the Management of Neurometabolic Disorders.

    PubMed

    Tarailo-Graovac, Maja; Shyr, Casper; Ross, Colin J; Horvath, Gabriella A; Salvarinova, Ramona; Ye, Xin C; Zhang, Lin-Hua; Bhavsar, Amit P; Lee, Jessica J Y; Drögemöller, Britt I; Abdelsayed, Mena; Alfadhel, Majid; Armstrong, Linlea; Baumgartner, Matthias R; Burda, Patricie; Connolly, Mary B; Cameron, Jessie; Demos, Michelle; Dewan, Tammie; Dionne, Janis; Evans, A Mark; Friedman, Jan M; Garber, Ian; Lewis, Suzanne; Ling, Jiqiang; Mandal, Rupasri; Mattman, Andre; McKinnon, Margaret; Michoulas, Aspasia; Metzger, Daniel; Ogunbayo, Oluseye A; Rakic, Bojana; Rozmus, Jacob; Ruben, Peter; Sayson, Bryan; Santra, Saikat; Schultz, Kirk R; Selby, Kathryn; Shekel, Paul; Sirrs, Sandra; Skrypnyk, Cristina; Superti-Furga, Andrea; Turvey, Stuart E; Van Allen, Margot I; Wishart, David; Wu, Jiang; Wu, John; Zafeiriou, Dimitrios; Kluijtmans, Leo; Wevers, Ron A; Eydoux, Patrice; Lehman, Anna M; Vallance, Hilary; Stockler-Ipsiroglu, Sylvia; Sinclair, Graham; Wasserman, Wyeth W; van Karnebeek, Clara D

    2016-06-09

    Whole-exome sequencing has transformed gene discovery and diagnosis in rare diseases. Translation into disease-modifying treatments is challenging, particularly for intellectual developmental disorder. However, the exception is inborn errors of metabolism, since many of these disorders are responsive to therapy that targets pathophysiological features at the molecular or cellular level. To uncover the genetic basis of potentially treatable inborn errors of metabolism, we combined deep clinical phenotyping (the comprehensive characterization of the discrete components of a patient's clinical and biochemical phenotype) with whole-exome sequencing analysis through a semiautomated bioinformatics pipeline in consecutively enrolled patients with intellectual developmental disorder and unexplained metabolic phenotypes. We performed whole-exome sequencing on samples obtained from 47 probands. Of these patients, 6 were excluded, including 1 who withdrew from the study. The remaining 41 probands had been born to predominantly nonconsanguineous parents of European descent. In 37 probands, we identified variants in 2 genes newly implicated in disease, 9 candidate genes, 22 known genes with newly identified phenotypes, and 9 genes with expected phenotypes; in most of the genes, the variants were classified as either pathogenic or probably pathogenic. Complex phenotypes of patients in five families were explained by coexisting monogenic conditions. We obtained a diagnosis in 28 of 41 probands (68%) who were evaluated. A test of a targeted intervention was performed in 18 patients (44%). Deep phenotyping and whole-exome sequencing in 41 probands with intellectual developmental disorder and unexplained metabolic abnormalities led to a diagnosis in 68%, the identification of 11 candidate genes newly implicated in neurometabolic disease, and a change in treatment beyond genetic counseling in 44%. (Funded by BC Children's Hospital Foundation and others.).

  18. Exome Sequencing and the Management of Neurometabolic Disorders

    PubMed Central

    Tarailo-Graovac, M.; Shyr, C.; Ross, C.J.; Horvath, G.A.; Salvarinova, R.; Ye, X.C.; Zhang, L.-H.; Bhavsar, A.P.; Lee, J.J.Y.; Drögemöller, B.I.; Abdelsayed, M.; Alfadhel, M.; Armstrong, L.; Baumgartner, M.R.; Burda, P.; Connolly, M.B.; Cameron, J.; Demos, M.; Dewan, T.; Dionne, J.; Evans, A.M.; Friedman, J.M.; Garber, I.; Lewis, S.; Ling, J.; Mandal, R.; Mattman, A.; McKinnon, M.; Michoulas, A.; Metzger, D.; Ogunbayo, O.A.; Rakic, B.; Rozmus, J.; Ruben, P.; Sayson, B.; Santra, S.; Schultz, K.R.; Selby, K.; Shekel, P.; Sirrs, S.; Skrypnyk, C.; Superti-Furga, A.; Turvey, S.E.; Van Allen, M.I.; Wishart, D.; Wu, J.; Wu, J.; Zafeiriou, D.; Kluijtmans, L.; Wevers, R.A.; Eydoux, P.; Lehman, A.M.; Vallance, H.; Stockler-Ipsiroglu, S.; Sinclair, G.; Wasserman, W.W.; van Karnebeek, C.D.

    2016-01-01

    BACKGROUND Whole-exome sequencing has transformed gene discovery and diagnosis in rare diseases. Translation into disease-modifying treatments is challenging, particularly for intellectual developmental disorder. However, the exception is inborn errors of metabolism, since many of these disorders are responsive to therapy that targets pathophysiological features at the molecular or cellular level. METHODS To uncover the genetic basis of potentially treatable inborn errors of metabolism, we combined deep clinical phenotyping (the comprehensive characterization of the discrete components of a patient’s clinical and biochemical phenotype) with whole-exome sequencing analysis through a semiautomated bioinformatics pipeline in consecutively enrolled patients with intellectual developmental disorder and unexplained metabolic phenotypes. RESULTS We performed whole-exome sequencing on samples obtained from 47 probands. Of these patients, 6 were excluded, including 1 who withdrew from the study. The remaining 41 probands had been born to predominantly nonconsanguineous parents of European descent. In 37 probands, we identified variants in 2 genes newly implicated in disease, 9 candidate genes, 22 known genes with newly identified phenotypes, and 9 genes with expected phenotypes; in most of the genes, the variants were classified as either pathogenic or probably pathogenic. Complex phenotypes of patients in five families were explained by coexisting monogenic conditions. We obtained a diagnosis in 28 of 41 probands (68%) who were evaluated. A test of a targeted intervention was performed in 18 patients (44%). CONCLUSIONS Deep phenotyping and whole-exome sequencing in 41 probands with intellectual developmental disorder and unexplained metabolic abnormalities led to a diagnosis in 68%, the identification of 11 candidate genes newly implicated in neurometabolic disease, and a change in treatment beyond genetic counseling in 44%. (Funded by BC Children’s Hospital Foundation and others.) PMID:27276562

  19. Molecular Diagnostics in Autosomal Dominant Polycystic Kidney Disease: Utility and Limitations

    PubMed Central

    Zhao, Xiao; Paterson, Andrew D.; Zahirieh, Alireza; He, Ning; Wang, Kairong; Pei, York

    2008-01-01

    Background and objectives: Gene-based mutation screening is now available and has the potential to provide diagnostic confirmation or exclusion of autosomal dominant polycystic kidney disease. This study illustrates its utility and limitations in the clinical setting. Design, setting, participants, & measurements: Using a molecular diagnostic service, genomic DNA of one affected individual from each study family was screened for pathologic PKD1 and PKD2 mutations. Bidirectional sequencing was performed to identify sequence variants in all exons and splice junctions of both genes and to confirm the specific mutations in other family members. In two multiplex families, microsatellite markers were genotyped at both PDK1 and PKD2 loci, and pair-wise and multipoint linkage analysis was performed. Results: Three of five probands studied were referred for assessment of renal cystic disease without a family history of autosomal dominant polycystic kidney disease, and two others were younger at-risk members of families with autosomal dominant polycystic kidney disease being evaluated as living-related kidney donors. Gene-based mutation screening identified pathogenic mutations that provided confirmation or exclusion of disease in three probands, but in the other two, only unclassified variants were identified. In one proband in which mutation screening was indeterminate, DNA linkage studies provided strong evidence for disease exclusion. Conclusions: Gene-based mutation screening or DNA linkage analysis should be considered in individuals in whom the diagnosis of autosomal dominant polycystic kidney disease is uncertain because of a lack of family history or equivocal imaging results and in younger at-risk individuals who are being evaluated as living-related kidney donors. PMID:18077784

  20. The Phantom Menace for Patients with Hepatobiliary Diseases: Shewanella haliotis, Often Misidentified as Shewanella algae in Biochemical Tests and MALDI-TOF Analysis.

    PubMed

    Byun, Jung-Hyun; Park, Hyunwoong; Kim, Sunjoo

    2017-03-24

    Although Shewanella algae has been known to have weak pathogenicity, case reports on infections with this species have been steadily increasing. S. algae and S. haliotis are difficult to distinguish from each other with conventional phenotypic methods. We reviewed the microbiological and clinical features of S. algae and S. haliotis infections at our institute. Bacterial culture and identification reports from patient samples from 2010 to 2014 were reviewed to screen the cases of Shewanella infections. In addition to conventional biochemical tests, 16S rRNA gene sequence analysis and matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry were performed for 19 stored bacterial isolates. Medical records were reviewed for clinical characteristics and laboratory findings. All isolates were identified as S. algae by using VITEK 2. MALDI-TOF also identified all isolates as S. algae with a 99.9 confidence value. In contrast, 16S rRNA analysis identified 10 isolates as S. algae and 9 isolates as S. haliotis. Both S. algae (60%) and S. haliotis (77%) infections were strongly associated with diseases of the hepatobiliary tract and pancreas. To distinguish between S. algae and S. haliotis, 16S rRNA gene sequence analysis seems more accurate than biochemical tests or MALDI-TOF. Patients with underlying diseases in the hepatobiliary tract and pancreas seem to be susceptible to these marine pathogens.

  1. Next-generation sequencing reveals a novel NDP gene mutation in a Chinese family with Norrie disease.

    PubMed

    Huang, Xiaoyan; Tian, Mao; Li, Jiankang; Cui, Ling; Li, Min; Zhang, Jianguo

    2017-11-01

    Norrie disease (ND) is a rare X-linked genetic disorder, the main symptoms of which are congenital blindness and white pupils. It has been reported that ND is caused by mutations in the NDP gene. Although many mutations in NDP have been reported, the genetic cause for many patients remains unknown. In this study, the aim is to investigate the genetic defect in a five-generation family with typical symptoms of ND. To identify the causative gene, next-generation sequencing based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members using Sanger sequencing. We identified a novel missense variant (c.314C>A) located within the NDP gene. The mutation cosegregated within all affected individuals in the family and was not found in unaffected members. By happenstance, in this family, we also detected a known pathogenic variant of retinitis pigmentosa in a healthy individual. c.314C>A mutation of NDP gene is a novel mutation and broadens the genetic spectrum of ND.

  2. Identification and molecular characterization of a new recombinant begomovirus and associated betasatellite DNA infecting Capsicum annuum in India.

    PubMed

    Bhatt, Bhavin S; Chahwala, Fenisha D; Rathod, Sangeeta; Singh, Achuit K

    2016-05-01

    Capsicum annuum (Chilli) is a perennial herbaceous plant that is cultivated as an annual crop throughout the world, including India. Chilli leaf curl disease (ChiLCD) is a major biotic constraint, causing major losses in chilli production. During 2014, leaf samples of chilli plants displaying leaf curl disease were collected from the Ahmedabad district of Gujarat, India. These samples were used to isolate, clone and sequence viral genomic DNA and an associated betasatellite DNA molecule. Sequence analysis showed 90.4 % nucleotide sequence identity to the previously reported chilli leaf curl virus-[India:Guntur:2009] (ChiLCV-[IN:Gun:09]. As per ICTV nomenclature rules, ChiLCV-Ahm represents a new species of begomovirus, and we therefore propose the name chilli leaf curl Ahmedabad virus-[India:Ahmedabad:2014] (ChiLCAV-[IN:Ahm:14]). The associated betasatellite DNA showed a maximum of 93.5 % nucleotide sequence identity to a previously reported tomato leaf curl Bangladesh betasatellite and may be named tomato leaf curl Bangladesh betasatellite-[India:Ahmedabad:Chilli:2014].

  3. Next-generation sequencing reveals a novel NDP gene mutation in a Chinese family with Norrie disease

    PubMed Central

    Huang, Xiaoyan; Tian, Mao; Li, Jiankang; Cui, Ling; Li, Min; Zhang, Jianguo

    2017-01-01

    Purpose: Norrie disease (ND) is a rare X-linked genetic disorder, the main symptoms of which are congenital blindness and white pupils. It has been reported that ND is caused by mutations in the NDP gene. Although many mutations in NDP have been reported, the genetic cause for many patients remains unknown. In this study, the aim is to investigate the genetic defect in a five-generation family with typical symptoms of ND. Methods: To identify the causative gene, next-generation sequencing based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members using Sanger sequencing. Results: We identified a novel missense variant (c.314C>A) located within the NDP gene. The mutation cosegregated within all affected individuals in the family and was not found in unaffected members. By happenstance, in this family, we also detected a known pathogenic variant of retinitis pigmentosa in a healthy individual. Conclusion: c.314C>A mutation of NDP gene is a novel mutation and broadens the genetic spectrum of ND. PMID:29133643

  4. IFRD1 Is a Candidate Gene for SMNA on Chromosome 7q22-q23

    PubMed Central

    Brkanac, Zoran; Spencer, David; Shendure, Jay; Robertson, Peggy D.; Matsushita, Mark; Vu, Tiffany; Bird, Thomas D.; Olson, Maynard V.; Raskind, Wendy H.

    2009-01-01

    We have established strong linkage evidence that supports mapping autosomal-dominant sensory/motor neuropathy with ataxia (SMNA) to chromosome 7q22-q32. SMNA is a rare neurological disorder whose phenotype encompasses both the central and the peripheral nervous system. In order to identify a gene responsible for SMNA, we have undertaken a comprehensive genomic evaluation of the region of linkage, including evaluation for repeat expansion and small deletions or duplications, capillary sequencing of candidate genes, and massively parallel sequencing of all coding exons. We excluded repeat expansion and small deletions or duplications as causative, and through microarray-based hybrid capture and massively parallel short-read sequencing, we identified a nonsynonymous variant in the human interferon-related developmental regulator gene 1 (IFRD1) as a disease-causing candidate. Sequence conservation, animal models, and protein structure evaluation support the involvement of IFRD1 in SMNA. Mutation analysis of IFRD1 in additional patients with similar phenotypes is needed for demonstration of causality and further evaluation of its importance in neurological diseases. PMID:19409521

  5. Strawberry disease lesions in rainbow trout from southern Idaho are associated with DNA from a Rickettsia-like organism.

    PubMed

    Lloyd, Sonja J; LaPatra, Scott E; Snekvik, Kevin R; St-Hilaire, Sophie; Cain, Kenneth D; Call, Douglas R

    2008-11-20

    Strawberry disease (SD) in the USA is a skin disorder of unknown etiology that occurs in rainbow trout Oncorhynchus mykiss and is characterized by bright red inflammatory lesions. To identify a candidate bacterial agent responsible for SD, we constructed 16S rDNA libraries from 7 SD lesion samples and 2 apparently healthy skin samples from SD-affected fish. A 16S rDNA sequence highly similar to members of the order Rickettsiales was present in 3 lesion libraries at 1%, 32% and 54% prevalence, but this sequence was not found in either healthy tissue library. Based on phylogenetic analysis, this Rickettsia-like organism (RLO) sequence is most closely related to 16S rDNA sequences of bacteria that may form a novel lineage within the Rickettsiales. We used nested PCR assays to screen 25 SD-affected fish for RLO or Flavobacterium psychrophilum DNA. Sixteen lesion samples were positive for the RLO sequence and 4 of the matched healthy samples were positive resulting in a significant association between SD lesions and presence of RLO DNA. While F. psychrophilum is reportedly associated with 'cold water strawberry disease' in the UK, we found no significant association between SD lesions and the presence of F. psychrophilum DNA. The statistical association between SD lesions and presence of RLO DNA is not proof of etiology, but these data suggest that RLO may play a role in SD in southern Idaho, USA.

  6. Rare Variants in RTEL1 Are Associated with Familial Interstitial Pneumonia

    PubMed Central

    Cogan, Joy D.; Zhao, Min; Mitchell, Daphne B.; Rives, Lynette; Markin, Cheryl; Garnett, Errine T.; Montgomery, Keri H.; Mason, Wendi R.; McKean, David F.; Powers, Julia; Murphy, Elissa; Olson, Lana M.; Choi, Leena; Cheng, Dong-Sheng; Blue, Elizabeth Marchani; Young, Lisa R.; Lancaster, Lisa H.; Steele, Mark P.; Brown, Kevin K.; Schwarz, Marvin I.; Fingerlin, Tasha E.; Schwartz, David A.; Lawson, William E.; Loyd, James E.; Zhao, Zhongming; Phillips, John A.; Blackwell, Timothy S.

    2015-01-01

    Rationale: Up to 20% of cases of idiopathic interstitial pneumonia cluster in families, comprising the syndrome of familial interstitial pneumonia (FIP); however, the genetic basis of FIP remains uncertain in most families. Objectives: To determine if new disease-causing rare genetic variants could be identified using whole-exome sequencing of affected members from FIP families, providing additional insights into disease pathogenesis. Methods: Affected subjects from 25 kindreds were selected from an ongoing FIP registry for whole-exome sequencing from genomic DNA. Candidate rare variants were confirmed by Sanger sequencing, and cosegregation analysis was performed in families, followed by additional sequencing of affected individuals from another 163 kindreds. Measurements and Main Results: We identified a potentially damaging rare variant in the gene encoding for regulator of telomere elongation helicase 1 (RTEL1) that segregated with disease and was associated with very short telomeres in peripheral blood mononuclear cells in 1 of 25 families in our original whole-exome sequencing cohort. Evaluation of affected individuals in 163 additional kindreds revealed another eight families (4.7%) with heterozygous rare variants in RTEL1 that segregated with clinical FIP. Probands and unaffected carriers of these rare variants had short telomeres (<10% for age) in peripheral blood mononuclear cells and increased T-circle formation, suggesting impaired RTEL1 function. Conclusions: Rare loss-of-function variants in RTEL1 represent a newly defined genetic predisposition for FIP, supporting the importance of telomere-related pathways in pulmonary fibrosis. PMID:25607374

  7. Microsporidian genome analysis reveals evolutionary strategies for obligate intracellular growth

    USDA-ARS?s Scientific Manuscript database

    Microsporidia comprise a large phylum of obligate intracellular eukaryotes that are fungalrelated parasites responsible for widespread disease, and here we address questions about microsporidia biology and evolution. We sequenced three microsporidian genomes from two species, Nematocida parisii and...

  8. Mutations in SURF1 are important genetic causes of Leigh syndrome in Slovak patients.

    PubMed

    Danis, Daniel; Brennerova, Katarina; Skopkova, Martina; Kurdiova, Timea; Ukropec, Jozef; Stanik, Juraj; Kolnikova, Miriam; Gasperikova, Daniela

    2018-04-01

    Leigh syndrome is a progressive early onset neurodegenerative disease typically presenting with psychomotor regression, signs of brainstem and/or basal ganglia disease, lactic acidosis, and characteristic magnetic resonance imaging findings. At molecular level, deficiency of respiratory complexes and/or pyruvate dehydrogenase complex is usually observed. Nuclear gene SURF1 encodes an assembly factor for cytochrome c-oxidase complex of the respiratory chain and autosomal recessive mutations in SURF1 are one of the most frequent causes of cytochrome c-oxidase-related Leigh syndrome cases. Here, we aimed to elucidate the genetic basis of Leigh syndrome in three Slovak families. Three probands presenting with Leigh syndrome were selected for DNA analysis. The first proband, presenting with atypical LS onset without abnormal basal ganglia magnetic resonance imaging findings, was analyzed with whole exome sequencing. In the two remaining probands, SURF1 was screened by Sanger sequencing. Four different heterozygous mutations were identified in SURF1: c.312_321delinsAT:p.(Pro104Profs*1), c.588+1G>A, c.823_833+7del:p. (?) and c.845_846del:p.(Ser282Cysfs*9). All the mutations are predicted to have a loss-of-function effect. We identified disease-causing mutations in all three probands, which points to the important role of SURF1 gene in etiology of Leigh syndrome in Slovakia. Our data showed that patients with atypical Leigh syndrome phenotype without lesions in basal ganglia may benefit from the whole exome sequencing method. In the case of probands presenting the typical phenotype, Sanger sequencing of the SURF1 gene seems to be an effective method of DNA analysis.

  9. Alignment-free design of highly discriminatory diagnostic primer sets for Escherichia coli O104:H4 outbreak strains.

    PubMed

    Pritchard, Leighton; Holden, Nicola J; Bielaszewska, Martina; Karch, Helge; Toth, Ian K

    2012-01-01

    An Escherichia coli O104:H4 outbreak in Germany in summer 2011 caused 53 deaths, over 4000 individual infections across Europe, and considerable economic, social and political impact. This outbreak was the first in a position to exploit rapid, benchtop high-throughput sequencing (HTS) technologies and crowdsourced data analysis early in its investigation, establishing a new paradigm for rapid response to disease threats. We describe a novel strategy for design of diagnostic PCR primers that exploited this rapid draft bacterial genome sequencing to distinguish between E. coli O104:H4 outbreak isolates and other pathogenic E. coli isolates, including the historical hæmolytic uræmic syndrome (HUSEC) E. coli HUSEC041 O104:H4 strain, which possesses the same serotype as the outbreak isolates. Primers were designed using a novel alignment-free strategy against eleven draft whole genome assemblies of E. coli O104:H4 German outbreak isolates from the E. coli O104:H4 Genome Analysis Crowd-Sourcing Consortium website, and a negative sequence set containing 69 E. coli chromosome and plasmid sequences from public databases. Validation in vitro against 21 'positive' E. coli O104:H4 outbreak and 32 'negative' non-outbreak EHEC isolates indicated that individual primer sets exhibited 100% sensitivity for outbreak isolates, with false positive rates of between 9% and 22%. A minimal combination of two primers discriminated between outbreak and non-outbreak E. coli isolates with 100% sensitivity and 100% specificity. Draft genomes of isolates of disease outbreak bacteria enable high throughput primer design and enhanced diagnostic performance in comparison to traditional molecular assays. Future outbreak investigations will be able to harness HTS rapidly to generate draft genome sequences and diagnostic primer sets, greatly facilitating epidemiology and clinical diagnostics. We expect that high throughput primer design strategies will enable faster, more precise responses to future disease outbreaks of bacterial origin, and help to mitigate their societal impact.

  10. Diversity of interferon inducible Mx gene in horses and association of variations with susceptibility vis-à-vis resistance against equine influenza infection.

    PubMed

    Manuja, Balvinder K; Manuja, Anju; Dahiya, Rajni; Singh, Sandeep; Sharma, R C; Gahlot, S K

    2014-10-01

    Equine influenza (EI) is primarily an infection of the upper respiratory tract and is one of the major infectious respiratory diseases of economic importance in equines. Re-emergence of the disease, species jumping by H3N8 virus in canines and possible threat of human pandemic due to the unpredictable nature of the virus have necessitated research on devising strategies for preventing the disease. The myxovirus resistance protein (Mx) has been reported to confer resistance to Orthomyxo virus infection by modifying cellular functions needed along the viral replication pathway. Polymorphisms and differential antiviral activities of Mx gene have been reported in pigs and chicken. Here we report the diversity of Mx gene, its expression in response to stimulation with interferon (IFN) α/β and their association with EI resistance and susceptibility in Marwari horses. Blood samples were collected from horses declared positive for equine influenza and in contact animals with a history of no clinical signs. Mx gene was amplified by reverse transcription from total RNA isolated from peripheral blood mononuclear cells (PBMCs) stimulated with IFN α/β using gene specific primers. The amplified gene products from representative samples were cloned and sequenced. Nucleotide sequences and deduced amino acid sequences were analyzed. Out of a total 24 amino acids substitutions sorting intolerant from tolerant (SIFT) analysis predicted 13 substitutions with functional consequences. Five substitutions (V67A, W123L, E346Y, N347Y, S689N) were observed only in resistant animals. Evolutionary distances based on nucleotide sequences with in equines ranged between 0.3-2.0% and 20-24% with other species. On phylogenetic analysis all equine sequences clustered together while other species formed separate clades. Copyright © 2014 Elsevier B.V. All rights reserved.

  11. RNA sequencing-based longitudinal transcriptomic profiling gives novel insights into the disease mechanism of generalized pustular psoriasis.

    PubMed

    Wang, Lingyan; Yu, Xiaoling; Wu, Chao; Zhu, Teng; Wang, Wenming; Zheng, Xiaofeng; Jin, Hongzhong

    2018-06-05

    Generalized pustular psoriasis (GPP) is a rare, episodic, potentially life-threatening inflammatory disease. However, the pathogenesis of GPP, and universally accepted therapies for treating it, remain undefined. To better understand the disease mechanism of GPP, we performed a transcriptome analysis to profile the gene expression of peripheral blood mononuclear cells (PBMCs) from patients enrolled at the time of diagnosis and receiving follow-up treatment for up to 6 months. RNA sequencing data revealed that gene expression in five GPP patients' PBMCs was profoundly altered following acitretin treatment. Differentially expressed gene (DEG) analysis suggested that genes related to psoriatic inflammation, including CXCL1, CXCL8 (IL-8), S100A8, S100A9, S100A12 and LCN2, were significantly downregulated in patients in remission from GPP. Functional enrichment and annotation analysis unveiled a cluster of DEGs significantly associated with the function of leukocytes, particularly neutrophils. Pathway analysis suggested that a variety of pro-inflammatory pathways were inhibited in patients in remission. This analysis not only reaffirmed known signaling pathways in GPP pathogenesis, but also implicated novel factors and pathways, such as cell cycle regulation pathways. Furthermore, regulator network analysis provided bioinformatics-based support for upstream molecules as potential therapeutic targets such as oncostatin M. This longitudinal analysis of blood transcriptomes provides the first evidence that dysregulated gene expression in peripheral blood may significantly contribute to psoriatic inflammation in GPP patients. Novel canonical pathways and biomarkers identified in the current research may provide insights to help understand GPP pathobiology and advance novel therapeutics.

  12. Proteomic Analysis of Lyme Disease: Global Protein Comparison of Three Strains of Borrelia burgdorferi

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jacobs, Jon M.; Yang, Xiaohua; Luft, Benjamin J.

    2005-04-01

    The Borrelia burgdorferi spirochete is the causative agent of Lyme disease, the most common tick-borne disease in the United States. It has been studied extensively to help understand its pathogenicity of infection and how it can persist in different mammalian hosts. We report the proteomic analysis of the archetype B. burgdorferi B31 strain and two other strains (ND40, and JD-1) having different Borrelia pathotypes using strong cation exchange fractionation of proteolytic peptides followed by high-resolution, reversed phase capillary liquid chromatography coupled with ion trap tandem mass spectrometric (LC-MS/MS) analysis. Protein identification was facilitated by the availability of the complete B31more » genome sequence. A total of 665 Borrelia proteins were identified representing ~38 % coverage of the theoretical B31 proteome. A significant overlap was observed between the identified proteins in direct comparisons between any two strains (>72%), but distinct differences were observed among identified hypothetical and outer membrane proteins of the three strains. Such a concurrent proteomic overview of three Borrelia strains based upon only the B31 genome sequence is shown to provide significant insights into the presence or absence of specific proteins and a broad overall comparison among strains.« less

  13. CNL Disease Resistance Genes in Soybean and Their Evolutionary Divergence

    PubMed Central

    Nepal, Madhav P; Benson, Benjamin V

    2015-01-01

    Disease resistance genes (R-genes) encode proteins involved in detecting pathogen attack and activating downstream defense molecules. Recent availability of soybean genome sequences makes it possible to examine the diversity of gene families including disease-resistant genes. The objectives of this study were to identify coiled-coil NBS-LRR (= CNL) R-genes in soybean, infer their evolutionary relationships, and assess structural as well as functional divergence of the R-genes. Profile hidden Markov models were used for sequence identification and model-based maximum likelihood was used for phylogenetic analysis, and variation in chromosomal positioning, gene clustering, and functional divergence were assessed. We identified 188 soybean CNL genes nested into four clades consistent to their orthologs in Arabidopsis. Gene clustering analysis revealed the presence of 41 gene clusters located on 13 different chromosomes. Analyses of the Ks-values and chromosomal positioning suggest duplication events occurring at varying timescales, and an extrapericentromeric positioning may have facilitated their rapid evolution. Each of the four CNL clades exhibited distinct patterns of gene expression. Phylogenetic analysis further supported the extrapericentromeric positioning effect on the divergence and retention of the CNL genes. The results are important for understanding the diversity and divergence of CNL genes in soybean, which would have implication in soybean crop improvement in future. PMID:25922568

  14. CNL Disease Resistance Genes in Soybean and Their Evolutionary Divergence.

    PubMed

    Nepal, Madhav P; Benson, Benjamin V

    2015-01-01

    Disease resistance genes (R-genes) encode proteins involved in detecting pathogen attack and activating downstream defense molecules. Recent availability of soybean genome sequences makes it possible to examine the diversity of gene families including disease-resistant genes. The objectives of this study were to identify coiled-coil NBS-LRR (= CNL) R-genes in soybean, infer their evolutionary relationships, and assess structural as well as functional divergence of the R-genes. Profile hidden Markov models were used for sequence identification and model-based maximum likelihood was used for phylogenetic analysis, and variation in chromosomal positioning, gene clustering, and functional divergence were assessed. We identified 188 soybean CNL genes nested into four clades consistent to their orthologs in Arabidopsis. Gene clustering analysis revealed the presence of 41 gene clusters located on 13 different chromosomes. Analyses of the K s-values and chromosomal positioning suggest duplication events occurring at varying timescales, and an extrapericentromeric positioning may have facilitated their rapid evolution. Each of the four CNL clades exhibited distinct patterns of gene expression. Phylogenetic analysis further supported the extrapericentromeric positioning effect on the divergence and retention of the CNL genes. The results are important for understanding the diversity and divergence of CNL genes in soybean, which would have implication in soybean crop improvement in future.

  15. Electronic Transport in Single-Stranded DNA Molecule Related to Huntington's Disease

    NASA Astrophysics Data System (ADS)

    Sarmento, R. G.; Silva, R. N. O.; Madeira, M. P.; Frazão, N. F.; Sousa, J. O.; Macedo-Filho, A.

    2018-04-01

    We report a numerical analysis of the electronic transport in single chain DNA molecule consisting of 182 nucleotides. The DNA chains studied were extracted from a segment of the human chromosome 4p16.3, which were modified by expansion of CAG (cytosine-adenine-guanine) triplet repeats to mimics Huntington's disease. The mutated DNA chains were connected between two platinum electrodes to analyze the relationship between charge propagation in the molecule and Huntington's disease. The computations were performed within a tight-binding model, together with a transfer matrix technique, to investigate the current-voltage (I-V) of 23 types of DNA sequence and compare them with the distributions of the related CAG repeat numbers with the disease. All DNA sequences studied have a characteristic behavior of a semiconductor. In addition, the results showed a direct correlation between the current-voltage curves and the distributions of the CAG repeat numbers, suggesting possible applications in the development of DNA-based biosensors for molecular diagnostics.

  16. A polymorphic DNA marker that represents a conserved expressed sequence in the region of the Huntington disease gene.

    PubMed Central

    Hayden, M R; Hewitt, J; Wasmuth, J J; Kastelein, J J; Langlois, S; Conneally, M; Haines, J; Smith, B; Hilbert, C; Allard, D

    1988-01-01

    A polymorphic marker (D4S62) that is genetically closely linked to D4S10 and is in the region of the gene for Huntington disease is described. A four-allele polymorphism is detected when HincII-digested DNA is hybridized with D4S62. D4S62 maps, by Southern blot analysis using somatic-cell hybrids, to 4p16.1 closer to the centromere than does D4S10. The use of the polymorphisms detected by D4S62 increases the informativeness of markers close to the gene for Huntington disease and will be useful for preclinical diagnosis. D4S62 detects transcripts of approximately 6,000 nucleotides in rat, mouse, and monkey liver and brain. This represents the first demonstration of conserved expressed sequences close to the gene for Huntington disease. Images Figure 1 Figure 2 Figure 3 Figure 4 Figure 6 PMID:2892395

  17. Toward a mtDNA locus-specific mutation database using the LOVD platform.

    PubMed

    Elson, Joanna L; Sweeney, Mary G; Procaccio, Vincent; Yarham, John W; Salas, Antonio; Kong, Qing-Peng; van der Westhuizen, Francois H; Pitceathly, Robert D S; Thorburn, David R; Lott, Marie T; Wallace, Douglas C; Taylor, Robert W; McFarland, Robert

    2012-09-01

    The Human Variome Project (HVP) is a global effort to collect and curate all human genetic variation affecting health. Mutations of mitochondrial DNA (mtDNA) are an important cause of neurogenetic disease in humans; however, identification of the pathogenic mutations responsible can be problematic. In this article, we provide explanations as to why and suggest how such difficulties might be overcome. We put forward a case in support of a new Locus Specific Mutation Database (LSDB) implemented using the Leiden Open-source Variation Database (LOVD) system that will not only list primary mutations, but also present the evidence supporting their role in disease. Critically, we feel that this new database should have the capacity to store information on the observed phenotypes alongside the genetic variation, thereby facilitating our understanding of the complex and variable presentation of mtDNA disease. LOVD supports fast queries of both seen and hidden data and allows storage of sequence variants from high-throughput sequence analysis. The LOVD platform will allow construction of a secure mtDNA database; one that can fully utilize currently available data, as well as that being generated by high-throughput sequencing, to link genotype with phenotype enhancing our understanding of mitochondrial disease, with a view to providing better prognostic information. © 2012 Wiley Periodicals, Inc.

  18. Toward a mtDNA Locus-Specific Mutation Database Using the LOVD Platform

    PubMed Central

    Elson, Joanna L.; Sweeney, Mary G.; Procaccio, Vincent; Yarham, John W.; Salas, Antonio; Kong, Qing-Peng; van der Westhuizen, Francois H.; Pitceathly, Robert D.S.; Thorburn, David R.; Lott, Marie T.; Wallace, Douglas C.; Taylor, Robert W.; McFarland, Robert

    2015-01-01

    The Human Variome Project (HVP) is a global effort to collect and curate all human genetic variation affecting health. Mutations of mitochondrial DNA (mtDNA) are an important cause of neurogenetic disease in humans; however, identification of the pathogenic mutations responsible can be problematic. In this article, we provide explanations as to why and suggest how such difficulties might be overcome. We put forward a case in support of a new Locus Specific Mutation Database (LSDB) implemented using the Leiden Open-source Variation Database (LOVD) system that will not only list primary mutations, but also present the evidence supporting their role in disease. Critically, we feel that this new database should have the capacity to store information on the observed phenotypes alongside the genetic variation, thereby facilitating our understanding of the complex and variable presentation of mtDNA disease. LOVD supports fast queries of both seen and hidden data and allows storage of sequence variants from high-throughput sequence analysis. The LOVD platform will allow construction of a secure mtDNA database; one that can fully utilize currently available data, as well as that being generated by high-throughput sequencing, to link genotype with phenotype enhancing our understanding of mitochondrial disease, with a view to providing better prognostic information. PMID:22581690

  19. Somatic mutations in benign breast disease tissue and risk of subsequent invasive breast cancer.

    PubMed

    Rohan, Thomas E; Miller, Christopher A; Li, Tiandao; Wang, Yihong; Loudig, Olivier; Ginsberg, Mindy; Glass, Andrew; Mardis, Elaine

    2018-06-06

    Insights into the molecular pathogenesis of breast cancer might come from molecular analysis of tissue from early stages of the disease. We conducted a case-control study nested in a cohort of women who had biopsy-confirmed benign breast disease (BBD) diagnosed between 1971 and 2006 at Kaiser Permanente Northwest and who were followed to mid-2015 to ascertain subsequent invasive breast cancer (IBC); cases (n = 218) were women with BBD who developed subsequent IBC and controls, individually matched (1:1) to cases, were women with BBD who did not develop IBC in the same follow-up interval as that for the corresponding case. Targeted sequence capture and sequencing were performed for 83 genes of importance in breast cancer. There were no significant case-control differences in mutation burden overall, for non-silent mutations, for individual genes, or with respect either to the nature of the gene mutations or to mutational enrichment at the pathway level. For seven subjects with DNA from the BBD and ipsilateral IBC, virtually no mutations were shared. This study, the first to use a targeted multi-gene sequencing approach on early breast cancer precursor lesions to investigate the genomic basis of the disease, showed that somatic mutations detected in BBD tissue were not associated with breast cancer risk.

  20. Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

    PubMed

    Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

    2004-02-01

    To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.

  1. Myelin protein zero gene sequencing diagnoses Charcot-Marie-Tooth Type 1B disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Su, Y.; Zhang, H.; Madrid, R.

    1994-09-01

    Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, affects about 1 in 2600 people in Norway and is found worldwide. CMT Type 1 (CMT1) has slow nerve conduction with demyelinated Schwann cells. Autosomal dominant CMT Type 1B (CMT1B) results from mutations in the myelin protein zero gene which directs the synthesis of more than half of all Schwann cell protein. This gene was mapped to the chromosome 1q22-1q23.1 borderline by fluorescence in situ hybridization. The first 7 of 7 reported CMT1B mutations are unique. Thus the most effective means to identify CMT1B mutations in at-risk family members and fetuses ismore » to sequence the entire coding sequence in dominant or sporadic CMT patients without the CMT1A duplication. Of the 19 primers used in 16 pars to uniquely amplify the entire MPZ coding sequence, 6 primer pairs were used to amplify and sequence the 6 exons. The DyeDeoxy Terminator cycle sequencing method used with four different color fluorescent lables was superior to manual sequencing because it sequences more bases unambiguously from extracted genomic DNA samples within 24 hours. This protocol was used to test 28 CMT and Dejerine-Sottas patients without CMT1A gene duplication. Sequencing MPZ gene-specific amplified fragments identified 9 polymorphic sites within the 6 exons that encode the 248 amino acid MPZ protein. The large number of major CMT1B mutations identified by single strand sequencing are being verified by reverse strand sequencing and when possible, by restriction enzyme analysis. This protocol can be used to distringuish CMT1B patients from othre CMT phenotypes and to determine the CMT1B status of relatives both presymptomatically and prenatally.« less

  2. Experience of targeted Usher exome sequencing as a clinical test

    PubMed Central

    Besnard, Thomas; García-García, Gema; Baux, David; Vaché, Christel; Faugère, Valérie; Larrieu, Lise; Léonard, Susana; Millan, Jose M; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise

    2014-01-01

    We show that massively parallel targeted sequencing of 19 genes provides a new and reliable strategy for molecular diagnosis of Usher syndrome (USH) and nonsyndromic deafness, particularly appropriate for these disorders characterized by a high clinical and genetic heterogeneity and a complex structure of several of the genes involved. A series of 71 patients including Usher patients previously screened by Sanger sequencing plus newly referred patients was studied. Ninety-eight percent of the variants previously identified by Sanger sequencing were found by next-generation sequencing (NGS). NGS proved to be efficient as it offers analysis of all relevant genes which is laborious to reach with Sanger sequencing. Among the 13 newly referred Usher patients, both mutations in the same gene were identified in 77% of cases (10 patients) and one candidate pathogenic variant in two additional patients. This work can be considered as pilot for implementing NGS for genetically heterogeneous diseases in clinical service. PMID:24498627

  3. Norrie disease: first mutation report and prenatal diagnosis in an Indian family.

    PubMed

    Ghosh, Manju; Sharma, Shipra; Shastri, Shivaram; Arora, Sadhna; Shukla, Rashmi; Gupta, Neerja; Deka, Deepika; Kabra, Madhulika

    2012-11-01

    Norrie Disease (ND) is a rare X-linked recessive disorder characterised by congenital blindness due to severe retinal dysgenesis. Hearing loss and intellectual disability is present in 30-50 % cases. ND is caused by mutations in the NDP gene, located at Xp11.3. The authors describe mutation analysis of a proband with ND and subsequently prenatal diagnosis. Sequence analysis of the NDP gene revealed a hemizygous missense mutation arginine to serine in codon 41 (p.Arg41Ser) in the affected child. Mother was carrier for the mutation. In a subsequent di-chorionic di-amniotic pregnancy, the authors performed prenatal diagnosis by mutation analysis on chorionic villi sample at 11 wk of gestation. The fetuses were unaffected. This is a first mutation report and prenatal diagnosis of a familial case of Norrie disease from India. The importance of genetic testing of Norrie disease for confirmation, carrier testing, prenatal diagnosis and genetic counseling is emphasized.

  4. Host Transcriptional Response to Ebola Virus Infection

    PubMed Central

    Speranza, Emily; Connor, John H

    2017-01-01

    Ebola virus disease (EVD) is a serious illness that causes severe disease in humans and non-human primates (NHPs) and has mortality rates up to 90%. EVD is caused by the Ebolavirus and currently there are no licensed therapeutics or vaccines to treat EVD. Due to its high mortality rates and potential as a bioterrorist weapon, a better understanding of the disease is of high priority. Multiparametric analysis techniques allow for a more complete understanding of a disease and the host response. Analysis of RNA species present in a sample can lead to a greater understanding of activation or suppression of different states of the immune response. Transcriptomic analyses such as microarrays and RNA-Sequencing (RNA-Seq) have been important tools to better understand the global gene expression response to EVD. In this review, we outline the current knowledge gained by transcriptomic analysis of EVD. PMID:28930167

  5. Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

    PubMed

    Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi

    2017-07-01

    PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.

  6. CCAAT/enhancer-binding protein alpha (CEBPA) polymorphisms and mutations in healthy individuals and in patients with peripheral artery disease, ischaemic heart disease and hyperlipidaemia.

    PubMed

    Fuchs, O; Kostecka, A; Provazníková, D; Krásná, B; Kotlín, R; Stanková, M; Kobylka, P; Dostálová, G; Zeman, M; Chochola, M

    2010-01-01

    The CCAAT/enhancer-binding protein alpha, encoded by the intronless CEBPA gene, is a transcription factor that induces expression of genes involved in differentiation of granulocytes, monocytes, adipocytes and hepatocytes. Both mono- and bi-allelic CEBPA mutations were detected in acute myeloid leukaemia and myelodysplastic syndrome. In this study we also identified CEBPA mutations in healthy individuals and in patients with peripheral artery disease, ischaemic heart disease and hyperlipidaemia. We found 16 various deletions with the presence of two direct repeats in CEBPA by analysis of 431 individuals. Three most frequent repeats included in these deletions in CEBPA gene are CGCGAG (493- 498_865-870), GG (486-487_885-886), and GCCAAGCAGC (508-517_907-916), all according to GenBank Accession No. NM_004364.2. In one case we identified that a father with ischaemic heart disease and his healthy son had two identical deletions (493_864del and 508_906del, both according to GenBank Accession No. NM_004364.2) in CEBPA. The occurrence of deletions between two repetitive sequences may be caused by recombination events in the repair process. A double-stranded cut in DNA may initiate these recombination events in adjacent DNA sequences. Four types of polymorphisms in the CEBPA gene were also detected in the screened individuals. Polymorphism in CEBPA gene 690 G>T according to GenBank Accession No. NM_004364.2 is the most frequent type in our analysis. Statistical analysis did not find significant differences in the frequency of polymorphisms in CEBPA in patients and in healthy individuals with the exception of P4 polymorphism (580_585dup according to GenBank Accesion No. NM_004364.2). P4 polymorphism was significantly increased in ischaemic heart disease patients.

  7. Clinical application of next-generation sequencing for Mendelian diseases.

    PubMed

    Jamuar, Saumya Shekhar; Tan, Ene-Choo

    2015-06-16

    Over the past decade, next-generation sequencing (NGS) has led to an exponential increase in our understanding of the genetic basis of Mendelian diseases. NGS allows for the analysis of multiple regions of the genome in one single reaction and has been shown to be a cost-effective and efficient tool in investigating patients with Mendelian diseases. More recently, NGS has been successfully deployed in the clinics, with a reported diagnostic yield of ~25 %. However, recommendations on clinical implementation of NGS are still evolving with numerous key challenges that impede the widespread use of genetics in everyday medicine. These challenges include when to order, on whom to order, what type of test to order, and how to interpret and communicate the results, including incidental findings, to the patient and family. In this review, we discuss these challenges and suggest guidelines on implementing NGS in the routine clinical workflow.

  8. The Sequence and Analysis of Duplication Rich Human Chromosome 16

    DOE R&D Accomplishments Database

    Martin, Joel; Han, Cliff; Gordon, Laurie A.; Terry, Astrid; Prabhakar, Shyam; She, Xinwei; Xie, Gary; Hellsten, Uffe; Man Chan, Yee; Altherr, Michael; Couronne, Olivier; Aerts, Andrea; Bajorek, Eva; Black, Stacey; Blumer, Heather; Branscomb, Elbert; Brown, Nancy C.; Bruno, William J.; Buckingham, Judith M.; Callen, David F.; Campbell, Connie S.; Campbell, Mary L.; Campbell, Evelyn W.; Caoile, Chenier; Challacombe, Jean F.; Chasteen, Leslie A.; Chertkov, Olga; Chi, Han C.; Christensen, Mari; Clark, Lynn M.; Cohn, Judith D.; Denys, Mirian; Detter, John C.; Dickson, Mark; Dimitrijevic-Bussod, Mira; Escobar, Julio; Fawcett, Joseph J.; Flowers, Dave; Fotopulos, Dea; Glavina, Tijana; Gomez, Maria; Gonzales, Eidelyn; Goodstein, David; Goodwin, Lynne A.; Grady, Deborah L.; Grigoriev, Igor; Groza, Matthew; Hammon, Nancy; Hawkins, Trevor; Haydu, Lauren; Hildebrand, Carl E.; Huang, Wayne; Israni, Sanjay; Jett, Jamie; Jewett, Phillip E.; Kadner, Kristen; Kimball, Heather; Kobayashi, Arthur; Krawczyk, Marie-Claude; Leyba, Tina; Longmire, Jonathan L.; Lopez, Frederick; Lou, Yunian; Lowry, Steve; Ludeman, Thom; Mark, Graham A.; Mcmurray, Kimberly L.; Meincke, Linda J.; Morgan, Jenna; Moyzis, Robert K.; Mundt, Mark O.; Munk, A. Christine; Nandkeshwar, Richard D.; Pitluck, Sam; Pollard, Martin; Predki, Paul; Parson-Quintana, Beverly; Ramirez, Lucia; Rash, Sam; Retterer, James; Ricke, Darryl O.; Robinson, Donna L.; Rodriguez, Alex; Salamov, Asaf; Saunders, Elizabeth H.; Scott, Duncan; Shough, Timothy; Stallings, Raymond L.; Stalvey, Malinda; Sutherland, Robert D.; Tapia, Roxanne; Tesmer, Judith G.; Thayer, Nina; Thompson, Linda S.; Tice, Hope; Torney, David C.; Tran-Gyamfi, Mary; Tsai, Ming; Ulanovsky, Levy E.; Ustaszewska, Anna; Vo, Nu; White, P. Scott; Williams, Albert L.; Wills, Patricia L.; Wu, Jung-Rung; Wu, Kevin; Yang, Joan; DeJong, Pieter; Bruce, David; Doggett, Norman; Deaven, Larry; Schmutz, Jeremy; Grimwood, Jane; Richardson, Paul; et al.

    2004-01-01

    We report here the 78,884,754 base pairs of finished human chromosome 16 sequence, representing over 99.9 percent of its euchromatin. Manual annotation revealed 880 protein coding genes confirmed by 1,637 aligned transcripts, 19 tRNA genes, 341 pseudogenes and 3 RNA pseudogenes. These genes include metallothionein, cadherin and iroquois gene families, as well as the disease genes for polycystic kidney disease and acute myelomonocytic leukemia. Several large-scale structural polymorphisms spanning hundreds of kilobasepairs were identified and result in gene content differences across humans. One of the unique features of chromosome 16 is its high level of segmental duplication, ranked among the highest of the human autosomes. While the segmental duplications are enriched in the relatively gene poor pericentromere of the p-arm, some are involved in recent gene duplication and conversion events which are likely to have had an impact on the evolution of primates and human disease susceptibility.

  9. Whole Exome Analysis of Early Onset Alzheimer’s Disease

    DTIC Science & Technology

    2013-04-01

    FTD), FTD with Parkinsonism , and early-onset Alzheimer Disease (EOAD)-like presentations. Using whole exome capture with subsequent sequencing, we...dementia. The MAPT R406W mutation is associated with EOAD-like symptoms and Parkinsonism without FTD, as well as distinct cognitive courses. KEY...OUTCOMES: Carney RM, Kohli MA, Kunkle BW, Naj AC, Gilbert JR, Züchner S, PERICAK-VANCE MA, Parkinsonism and distinct dementia patterns in a

  10. Clavibacter michiganensis subsp. capsici subsp. nov., causing bacterial canker disease in pepper.

    PubMed

    Oh, Eom-Ji; Bae, Chungyun; Lee, Han-Beoyl; Hwang, In Sun; Lee, Hyok-In; Yea, Mi Chi; Yim, Kyu-Ock; Lee, Seungdon; Heu, Sunggi; Cha, Jae-Soon; Oh, Chang-Sik

    2016-10-01

    Clavibacter michiganensis is a Gram-stain-positive bacterium with eight subspecies. One of these subspecies is C. michiganensis subsp. michiganensis, which causes bacterial canker disease in tomato. Bacterial strains showing very similar canker disease symptoms to those of a strain originally classified as C. michiganensis have been isolated from pepper. In this paper, we reclassified strains isolated from pepper. On the basis of phylogenetic analysis with 16S rRNA gene sequences, the strains isolated from pepper were grouped in a separate clade from other subspecies of C. michiganensis. Biochemical, physiological and genetic characteristics of strain PF008T, which is the representative strain of the isolates from pepper, were examined in this study. Based on multi-locus sequence typing and other biochemical and physiological features including colony color, utilization of carbon sources and enzyme activities, strain PF008T was categorically differentiated from eight subspecies of C. michiganensis. Moreover, genome analysis showed that the DNA G+C content of strain PF008T is 73.2 %. These results indicate that PF008T is distinct from other known subspecies of C. michiganensis. Therefore, we propose a novel subspecies, C. michiganensis subsp. capsici, causing bacterial canker disease in pepper, with a type strain of PF008T (=KACC 18448T=LMG 29047T).

  11. Isolating Viral and Host RNA Sequences from Archival Material and Production of cDNA Libraries for High-Throughput DNA Sequencing

    PubMed Central

    Xiao, Yongli; Sheng, Zong-Mei; Taubenberger, Jeffery K.

    2015-01-01

    The vast majority of surgical biopsy and post-mortem tissue samples are formalin-fixed and paraffin-embedded (FFPE), but this process leads to RNA degradation that limits gene expression analysis. As an example, the viral RNA genome of the 1918 pandemic influenza A virus was previously determined in a 9-year effort by overlapping RT-PCR from post-mortem samples. Using the protocols described here, the full genome of the 1918 virus at high coverage was determined in one high-throughput sequencing run of a cDNA library derived from total RNA of a 1918 FFPE sample after duplex-specific nuclease treatments. This basic methodological approach should assist in the analysis of FFPE tissue samples isolated over the past century from a variety of infectious diseases. PMID:26344216

  12. Deleterious ABCA7 mutations and transcript rescue mechanisms in early onset Alzheimer's disease.

    PubMed

    De Roeck, Arne; Van den Bossche, Tobi; van der Zee, Julie; Verheijen, Jan; De Coster, Wouter; Van Dongen, Jasper; Dillen, Lubina; Baradaran-Heravi, Yalda; Heeman, Bavo; Sanchez-Valle, Raquel; Lladó, Albert; Nacmias, Benedetta; Sorbi, Sandro; Gelpi, Ellen; Grau-Rivera, Oriol; Gómez-Tortosa, Estrella; Pastor, Pau; Ortega-Cubero, Sara; Pastor, Maria A; Graff, Caroline; Thonberg, Håkan; Benussi, Luisa; Ghidoni, Roberta; Binetti, Giuliano; de Mendonça, Alexandre; Martins, Madalena; Borroni, Barbara; Padovani, Alessandro; Almeida, Maria Rosário; Santana, Isabel; Diehl-Schmid, Janine; Alexopoulos, Panagiotis; Clarimon, Jordi; Lleó, Alberto; Fortea, Juan; Tsolaki, Magda; Koutroumani, Maria; Matěj, Radoslav; Rohan, Zdenek; De Deyn, Peter; Engelborghs, Sebastiaan; Cras, Patrick; Van Broeckhoven, Christine; Sleegers, Kristel

    2017-09-01

    Premature termination codon (PTC) mutations in the ATP-Binding Cassette, Sub-Family A, Member 7 gene (ABCA7) have recently been identified as intermediate-to-high penetrant risk factor for late-onset Alzheimer's disease (LOAD). High variability, however, is observed in downstream ABCA7 mRNA and protein expression, disease penetrance, and onset age, indicative of unknown modifying factors. Here, we investigated the prevalence and disease penetrance of ABCA7 PTC mutations in a large early onset AD (EOAD)-control cohort, and examined the effect on transcript level with comprehensive third-generation long-read sequencing. We characterized the ABCA7 coding sequence with next-generation sequencing in 928 EOAD patients and 980 matched control individuals. With MetaSKAT rare variant association analysis, we observed a fivefold enrichment (p = 0.0004) of PTC mutations in EOAD patients (3%) versus controls (0.6%). Ten novel PTC mutations were only observed in patients, and PTC mutation carriers in general had an increased familial AD load. In addition, we observed nominal risk reducing trends for three common coding variants. Seven PTC mutations were further analyzed using targeted long-read cDNA sequencing on an Oxford Nanopore MinION platform. PTC-containing transcripts for each investigated PTC mutation were observed at varying proportion (5-41% of the total read count), implying incomplete nonsense-mediated mRNA decay (NMD). Furthermore, we distinguished and phased several previously unknown alternative splicing events (up to 30% of transcripts). In conjunction with PTC mutations, several of these novel ABCA7 isoforms have the potential to rescue deleterious PTC effects. In conclusion, ABCA7 PTC mutations play a substantial role in EOAD, warranting genetic screening of ABCA7 in genetically unexplained patients. Long-read cDNA sequencing revealed both varying degrees of NMD and transcript-modifying events, which may influence ABCA7 dosage, disease severity, and may create opportunities for therapeutic interventions in AD.

  13. Use of eluted peptide sequence data to identify the binding characteristics of peptides to the insulin-dependent diabetes susceptibility allele HLA-DQ8 (DQ 3.2).

    PubMed

    Godkin, A; Friede, T; Davenport, M; Stevanovic, S; Willis, A; Jewell, D; Hill, A; Rammensee, H G

    1997-06-01

    HLA-DQ8 (A1*0301, B1*0302) and -DQ2 (A1*0501, B1*0201) are both associated with diseases such as insulin-dependent diabetes mellitus and coeliac disease. We used the technique of pool sequencing to look at the requirements of peptides binding to HLA-DQ8, and combined these data with naturally sequenced ligands and in vitro binding assays to describe a novel motif for HLA-DQ8. The motif, which has the same basic format as many HLA-DR molecules, consists of four or five anchor regions, in the positions from the N-terminus of the binding core of n, n + 3, n + 5/6 and n + 8, i.e. P1, P4, P6/7 and P9. P1 and P9 require negative or polar residues, with mainly aliphatic residues at P4 and P6/7. The features of the HLA-DQ8 motif were then compared to a pool sequence of peptides eluted from HLA-DQ2. A consensus motif for the binding of a common peptide which may be involved in disease pathogenesis is described. Neither of the disease-associated alleles HLA-DQ2 and -DQ8 have Asp at position 57 of the beta-chain. This Asp, if present, may form a salt bridge with an Arg at position 79 of the alpha-chain and so alter the binding specificity of P9. HLA-DQ2 and -DQ8 both appear to prefer negatively charged amino acids at P9. In contrast, HLA-DQ7 (A1*0301, B1*0301), which is not associated with diabetes, has Asp at beta 57, allowing positively charged amino acids at P9. This analysis of the sequence features of DQ-binding peptides suggests molecular characteristics which may be useful to predict epitopes involved in disease pathogenesis.

  14. Diagnosis and molecular basis of mitochondrial respiratory chain disorders: exome sequencing for disease gene identification.

    PubMed

    Ohtake, A; Murayama, K; Mori, M; Harashima, H; Yamazaki, T; Tamaru, S; Yamashita, Y; Kishita, Y; Nakachi, Y; Kohda, M; Tokuzawa, Y; Mizuno, Y; Moriyama, Y; Kato, H; Okazaki, Y

    2014-04-01

    Mitochondrial disorders have the highest incidence among congenital metabolic diseases, and are thought to occur at a rate of 1 in 5000 births. About 25% of the diseases diagnosed as mitochondrial disorders in the field of pediatrics have mitochondrial DNA abnormalities, while the rest occur due to defects in genes encoded in the nucleus. The most important function of the mitochondria is biosynthesis of ATP. Mitochondrial disorders are nearly synonymous with mitochondrial respiratory chain disorder, as respiratory chain complexes serve a central role in ATP biosynthesis. By next-generation sequencing of the exome, we analyzed 104 patients with mitochondrial respiratory chain disorders. The results of analysis to date were 18 patients with novel variants in genes previously reported to be disease-causing, and 27 patients with mutations in genes suggested to be associated in some way with mitochondria, and it is likely that they are new disease-causing genes in mitochondrial disorders. This article is part of a Special Issue entitled Frontiers of Mitochondrial Research. Copyright © 2014 The Authors. Published by Elsevier B.V. All rights reserved.

  15. Genome-wide gene–gene interaction analysis for next-generation sequencing

    PubMed Central

    Zhao, Jinying; Zhu, Yun; Xiong, Momiao

    2016-01-01

    The critical barrier in interaction analysis for next-generation sequencing (NGS) data is that the traditional pairwise interaction analysis that is suitable for common variants is difficult to apply to rare variants because of their prohibitive computational time, large number of tests and low power. The great challenges for successful detection of interactions with NGS data are (1) the demands in the paradigm of changes in interaction analysis; (2) severe multiple testing; and (3) heavy computations. To meet these challenges, we shift the paradigm of interaction analysis between two SNPs to interaction analysis between two genomic regions. In other words, we take a gene as a unit of analysis and use functional data analysis techniques as dimensional reduction tools to develop a novel statistic to collectively test interaction between all possible pairs of SNPs within two genome regions. By intensive simulations, we demonstrate that the functional logistic regression for interaction analysis has the correct type 1 error rates and higher power to detect interaction than the currently used methods. The proposed method was applied to a coronary artery disease dataset from the Wellcome Trust Case Control Consortium (WTCCC) study and the Framingham Heart Study (FHS) dataset, and the early-onset myocardial infarction (EOMI) exome sequence datasets with European origin from the NHLBI's Exome Sequencing Project. We discovered that 6 of 27 pairs of significantly interacted genes in the FHS were replicated in the independent WTCCC study and 24 pairs of significantly interacted genes after applying Bonferroni correction in the EOMI study. PMID:26173972

  16. Rapid identification of candidate genes for resistance to tomato late blight disease using next-generation sequencing technologies

    PubMed Central

    Arafa, Ramadan A.; Rakha, Mohamed T.; Kamel, Said M.

    2017-01-01

    Tomato late blight caused by Phytophthora infestans (Mont.) de Bary, also known as the Irish famine pathogen, is one of the most destructive plant diseases. Wild relatives of tomato possess useful resistance genes against this disease, and could therefore be used in breeding to improve cultivated varieties. In the genome of a wild relative of tomato, Solanum habrochaites accession LA1777, we identified a new quantitative trait locus for resistance against blight caused by an aggressive Egyptian isolate of P. infestans. Using double-digest restriction site–associated DNA sequencing (ddRAD-Seq) technology, we determined 6,514 genome-wide SNP genotypes of an F2 population derived from an interspecific cross. Subsequent association analysis of genotypes and phenotypes of the mapping population revealed that a 6.8 Mb genome region on chromosome 6 was a candidate locus for disease resistance. Whole-genome resequencing analysis revealed that 298 genes in this region potentially had functional differences between the parental lines. Among of them, two genes with missense mutations, Solyc06g071810.1 and Solyc06g083640.3, were considered to be potential candidates for disease resistance. SNP and SSR markers linking to this region can be used in marker-assisted selection in future breeding programs for late blight disease, including introgression of new genetic loci from wild species. In addition, the approach developed in this study provides a model for identification of other genes for attractive agronomical traits. PMID:29253902

  17. Novel mutations in CRB1 and ABCA4 genes cause Leber congenital amaurosis and Stargardt disease in a Swedish family

    PubMed Central

    Jonsson, Frida; Burstedt, Marie S; Sandgren, Ola; Norberg, Anna; Golovleva, Irina

    2013-01-01

    This study aimed to identify genetic mechanisms underlying severe retinal degeneration in one large family from northern Sweden, members of which presented with early-onset autosomal recessive retinitis pigmentosa and juvenile macular dystrophy. The clinical records of affected family members were analysed retrospectively and ophthalmological and electrophysiological examinations were performed in selected cases. Mutation screening was initially performed with microarrays, interrogating known mutations in the genes associated with recessive retinitis pigmentosa, Leber congenital amaurosis and Stargardt disease. Searching for homozygous regions with putative causative disease genes was done by high-density SNP-array genotyping, followed by segregation analysis of the family members. Two distinct phenotypes of retinal dystrophy, Leber congenital amaurosis and Stargardt disease were present in the family. In the family, four patients with Leber congenital amaurosis were homozygous for a novel c.2557C>T (p.Q853X) mutation in the CRB1 gene, while of two cases with Stargardt disease, one was homozygous for c.5461-10T>C in the ABCA4 gene and another was carrier of the same mutation and a novel ABCA4 mutation c.4773+3A>G. Sequence analysis of the entire ABCA4 gene in patients with Stargardt disease revealed complex alleles with additional sequence variants, which were evaluated by bioinformatics tools. In conclusion, presence of different genetic mechanisms resulting in variable phenotype within the family is not rare and can challenge molecular geneticists, ophthalmologists and genetic counsellors. PMID:23443024

  18. Rolling circle amplification-based analysis of Sri Lankan cassava mosaic virus isolates from Tamil Nadu, India, suggests a low level of genetic variability.

    PubMed

    Kushawaha, Akhilesh Kumar; Rabindran, Ramalingam; Dasgupta, Indranil

    2018-03-01

    Cassava mosaic disease is a widespread disease of cassava in south Asia and the African continent. In India, CMD is known to be caused by two single-stranded DNA viruses (geminiviruses), Indian cassava mosaic virus (ICMV) and Sri Lankan cassava mosdaic virus (SLCMV). Previously, the diversity of ICMV and SLCMV in India has been studied using PCR, a sequence-dependent method. To have a more in-depth study of the variability of the above viruses and to detect any novel geminiviruses associated with CMD, sequence-independent amplification using rolling circle amplification (RCA)-based methods were used. CMD affected cassava plants were sampled across eighty locations in nine districts of the southern Indian state of Tamil Nadu. Twelve complete sequence of coat protein genes of the resident geminiviruses, comprising 256 amino acid residues were generated from the above samples, which indicated changes at only six positions. RCA followed by RFLP of the 80 samples indicated that most samples (47) contained only SLCMV, followed by 8, which were infected jointly with ICMV and SLCMV. In 11 samples, the pattern did not match the expected patterns from either of the two viruses and hence, were variants. Sequence analysis of an average of 700 nucleotides from 31 RCA-generated fragments of the variants indicated identities of 97-99% with the sequence of a previously reported infectious clone of SLCMV. The evidence suggests low levels of genetic variability in the begomoviruses infecting cassava, mainly in the form of scattered single nucleotide changes.

  19. Prevalence and genotypic characterization of bovine Echinococcus granulosus isolates by using cytochrome oxidase 1 (Co1) gene in Hyderabad, Pakistan.

    PubMed

    Ehsan, Muhammad; Akhter, Nasreen; Bhutto, Bachal; Arijo, Abdullah; Ali Gadahi, Javaid

    2017-05-30

    Cystic echinococcosis is an important zoonotic disease; it has serious impacts on animals as well as human health throughout the world. Genotypic characterization of Echinocossus granulosus (E. granulosus) in buffaloes has not been addressed in Pakistan. Therefore, the present study was conducted to evaluate the incidence and genotypic characterization of bovine E. granulosus. Out of 832 buffaloes examined, 112 (13.46%) were found infected. The favorable site for hydatid cyst development was liver (8.65%) followed by lungs (4.80%). The rate of cystic echinococcosis was found higher in females 14.43% than males 9.77%. The females above seven years aged were more infected as compared to the young ones. The partial sequence of mitochondrial cytochrome oxidase 1 (CO1) gene was used for identification and molecular analysis of buffalo's E. granulosus isolates. The alignment of redundant sequences were compared with already identified 10 genotypes available at National Centre for Biotechnology Information (NCBI) GenBank. The sequencing and phylogenetic analysis of all randomly selected buffalo isolates were belong to the G1- G3 complex (E. granulosus sensu stricto). All sequences were diverse from the reference sequence. No one showed complete identity to the buffalo strain (G3), representing substantial microsequence variability in G1, G2 and G3 genotypes. We evaluated the echinococcal infectivity and first time identification of genotypes in buffaloes in Sindh, Pakistan. This study will lead to determine accurate source of this zoonotic disease to humans in Pakistan. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Multilocus sequence analysis reveals extensive genetic variety within Tenacibaculum spp. associated with ulcers in sea-farmed fish in Norway.

    PubMed

    Olsen, Anne Berit; Gulla, Snorre; Steinum, Terje; Colquhoun, Duncan J; Nilsen, Hanne K; Duchaud, Eric

    2017-06-01

    Skin ulcer development in sea-reared salmonids, commonly associated with Tenacibaculum spp., is a significant fish welfare- and economical problem in Norwegian aquaculture. A collection of 89 Tenacibaculum isolates was subjected to multilocus sequence analysis (MLSA). The isolates were retrieved from outbreaks of clinical disease in farms spread along the Norwegian coast line from seven different fish species over a period of 19 years. MLSA analysis reveals considerable genetic diversity, but allows identification of four main clades. One clade encompasses isolates belonging to the species T. dicentrarchi, whereas three clades encompass bacteria that likely represent novel, as yet undescribed species. The study identified T. maritimum in lumpsucker, T. ovolyticum in halibut, and has extended the host and geographic range for T. soleae, isolated from wrasse. The overall lack of clonality and host specificity, with some indication of geographical range restriction argue for local epidemics involving multiple strains. The diversity of Tenacibaculum isolates from fish displaying ulcerative disease may complicate vaccine development. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. Identification and phylogenetic diversity of parvovirus circulating in commercial chicken and turkey flocks in Croatia.

    PubMed

    Bidin, M; Lojkić, I; Bidin, Z; Tiljar, M; Majnarić, D

    2011-12-01

    Phylogenetic diversity of parvovirus detected in commercial chicken and turkey flocks is described. Nine chicken and six turkey flocks from Croatian farms were tested for parvovirus presence. Intestinal samples from one turkey and seven chicken flocks were found positive, and were sequenced. Natural parvovirus infection was more frequently detected in chickens than in turkeys examined in this study. Sequence analysis of 400 nucleotide fragments of the nonstructural gene (NS) showed that our sequences had more similarity with chicken parvovirus (ChPV) (92.3%-99.7%) than turkey parvovirus (TuPV) (89.5%-98.9%) strains. Phylogenetic analysis grouped our sequences in two clades. Also, the higher prevalence of ChPV than TuPV in tested flocks was defined. The necropsy findings suggested a malabsorption syndrome followed by a preascitic condition. Further research of parvovirus infection, pathogenesis, and the possibility of its association with poult enteritis and mortality syndrome (PEMS) and runting and stunting syndrome (RSS) is needed to clarify its significance as an agent of enteric disease.

  2. The biological features and genetic diversity of novel fish rhabdovirus isolates in China.

    PubMed

    Fu, Xiaozhe; Lin, Qiang; Liang, Hongru; Liu, Lihui; Huang, Zhibin; Li, Ningqiu; Su, Jianguo

    2017-09-01

    The Rhabdoviridae is a diverse family of negative-sense single-stranded RNA viruses which infects mammals, birds, reptiles, fish, insects and plants. Herein, we reported the isolation and characterization of 6 novel viruses from diseased fish collected from China including SCRV-QY, SCRV-SS, SCRV-GM, CmRV-FS, MsRV-SS, OmbRV-JM. The typical clinical symptom of diseased fish was hemorrhaging. Efficient propagation of these isolates in a Chinese perch brain cell line was determined by means of observation of cytopathic effect, RT-PCR and electron microscopy. Sequence alignment and phylogenetic analysis of the complete G protein sequences revealed that these isolates were clustered into one monophyletic lineage belonging to the species Siniperca chuatsi rhabdovirus.

  3. Going forward with genetics: recent technological advances and forward genetics in mice.

    PubMed

    Moresco, Eva Marie Y; Li, Xiaohong; Beutler, Bruce

    2013-05-01

    Forward genetic analysis is an unbiased approach for identifying genes essential to defined biological phenomena. When applied to mice, it is one of the most powerful methods to facilitate understanding of the genetic basis of human biology and disease. The speed at which disease-causing mutations can be identified in mutagenized mice has been markedly increased by recent advances in DNA sequencing technology. Creating and analyzing mutant phenotypes may therefore become rate-limiting in forward genetic experimentation. We review the forward genetic approach and its future in the context of recent technological advances, in particular massively parallel DNA sequencing, induced pluripotent stem cells, and haploid embryonic stem cells. Copyright © 2013 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.

  4. Defining the transcriptome assembly and its use for genome dynamics and transcriptome profiling studies in pigeonpea (Cajanus cajan L.).

    PubMed

    Dubey, Anuja; Farmer, Andrew; Schlueter, Jessica; Cannon, Steven B; Abernathy, Brian; Tuteja, Reetu; Woodward, Jimmy; Shah, Trushar; Mulasmanovic, Benjamin; Kudapa, Himabindu; Raju, Nikku L; Gothalwal, Ragini; Pande, Suresh; Xiao, Yongli; Town, Chris D; Singh, Nagendra K; May, Gregory D; Jackson, Scott; Varshney, Rajeev K

    2011-06-01

    This study reports generation of large-scale genomic resources for pigeonpea, a so-called 'orphan crop species' of the semi-arid tropic regions. FLX/454 sequencing carried out on a normalized cDNA pool prepared from 31 tissues produced 494 353 short transcript reads (STRs). Cluster analysis of these STRs, together with 10 817 Sanger ESTs, resulted in a pigeonpea trancriptome assembly (CcTA) comprising of 127 754 tentative unique sequences (TUSs). Functional analysis of these TUSs highlights several active pathways and processes in the sampled tissues. Comparison of the CcTA with the soybean genome showed similarity to 10 857 and 16 367 soybean gene models (depending on alignment methods). Additionally, Illumina 1G sequencing was performed on Fusarium wilt (FW)- and sterility mosaic disease (SMD)-challenged root tissues of 10 resistant and susceptible genotypes. More than 160 million sequence tags were used to identify FW- and SMD-responsive genes. Sequence analysis of CcTA and the Illumina tags identified a large new set of markers for use in genetics and breeding, including 8137 simple sequence repeats, 12 141 single-nucleotide polymorphisms and 5845 intron-spanning regions. Genomic resources developed in this study should be useful for basic and applied research, not only for pigeonpea improvement but also for other related, agronomically important legumes.

  5. Analysis of expressed sequence tags (ESTs) from cocoa (Theobroma cacao L) upon infection with Phytophthora megakarya.

    PubMed

    Naganeeswaran, Sudalaimuthu Asari; Subbian, Elain Apshara; Ramaswamy, Manimekalai

    2012-01-01

    Phytophthora megakarya, the causative agent of cacao black pod disease in West African countries causes an extensive loss of yield. In this study we have analyzed 4 libraries of ESTs derived from Phytophthora megakarya infected cocoa leaf and pod tissues. Totally 6379 redundant sequences were retrieved from ESTtik database and EST processing was performed using seqclean tool. Clustering and assembling using CAP3 generated 3333 non-redundant (907 contigs and 2426 singletons) sequences. The primary sequence analysis of 3333 non-redundant sequences showed that the GC percentage was 42.7 and the sequence length ranged from 101 - 2576 nucleotides. Further, functional analysis (Blast, Interproscan, Gene ontology and KEGG search) were executed and 1230 orthologous genes were annotated. Totally 272 enzymes corresponding to 114 metabolic pathways were identified. Functional annotation revealed that most of the sequences are related to molecular function, stress response and biological processes. The annotated enzymes are aldehyde dehydrogenase (E.C: 1.2.1.3), catalase (E.C: 1.11.1.6), acetyl-CoA C-acetyltransferase (E.C: 2.3.1.9), threonine ammonia-lyase (E.C: 4.3.1.19), acetolactate synthase (E.C: 2.2.1.6), O-methyltransferase (E.C: 2.1.1.68) which play an important role in amino acid biosynthesis and phenyl propanoid biosynthesis. All this information was stored in MySQL database management system to be used in future for reconstruction of biotic stress response pathway in cocoa.

  6. Analysis of the whole mitochondrial genome: translation of the Ion Torrent Personal Genome Machine system to the diagnostic bench?

    PubMed

    Seneca, Sara; Vancampenhout, Kim; Van Coster, Rudy; Smet, Joél; Lissens, Willy; Vanlander, Arnaud; De Paepe, Boel; Jonckheere, An; Stouffs, Katrien; De Meirleir, Linda

    2015-01-01

    Next-generation sequencing (NGS), an innovative sequencing technology that enables the successful analysis of numerous gene sequences in a massive parallel sequencing approach, has revolutionized the field of molecular biology. Although NGS was introduced in a rather recent past, the technology has already demonstrated its potential and effectiveness in many research projects, and is now on the verge of being introduced into the diagnostic setting of routine laboratories to delineate the molecular basis of genetic disease in undiagnosed patient samples. We tested a benchtop device on retrospective genomic DNA (gDNA) samples of controls and patients with a clinical suspicion of a mitochondrial DNA disorder. This Ion Torrent Personal Genome Machine platform is a high-throughput sequencer with a fast turnaround time and reasonable running costs. We challenged the chemistry and technology with the analysis and processing of a mutational spectrum composed of samples with single-nucleotide substitutions, indels (insertions and deletions) and large single or multiple deletions, occasionally in heteroplasmy. The output data were compared with previously obtained conventional dideoxy sequencing results and the mitochondrial revised Cambridge Reference Sequence (rCRS). We were able to identify the majority of all nucleotide alterations, but three false-negative results were also encountered in the data set. At the same time, the poor performance of the PGM instrument in regions associated with homopolymeric stretches generated many false-positive miscalls demanding additional manual curation of the data.

  7. Monitoring of Fasciola Species Contamination in Water Dropwort by cox1 Mitochondrial and ITS-2 rDNA Sequencing Analysis.

    PubMed

    Choi, In-Wook; Kim, Hwang-Yong; Quan, Juan-Hua; Ryu, Jae-Gee; Sun, Rubing; Lee, Young-Ha

    2015-10-01

    Fascioliasis, a food-borne trematode zoonosis, is a disease primarily in cattle and sheep and occasionally in humans. Water dropwort (Oenanthe javanica), an aquatic perennial herb, is a common second intermediate host of Fasciola, and the fresh stems and leaves are widely used as a seasoning in the Korean diet. However, no information regarding Fasciola species contamination in water dropwort is available. Here, we collected 500 samples of water dropwort in 3 areas in Korea during February and March 2015, and the water dropwort contamination of Fasciola species was monitored by DNA sequencing analysis of the Fasciola hepatica and Fasciola gigantica specific mitochondrial cytochrome c oxidase subunit 1 (cox1) and nuclear ribosomal internal transcribed spacer 2 (ITS-2). Among the 500 samples assessed, the presence of F. hepatica cox1 and 1TS-2 markers were detected in 2 samples, and F. hepatica contamination was confirmed by sequencing analysis. The nucleotide sequences of cox1 PCR products from the 2 F. hepatica-contaminated samples were 96.5% identical to the F. hepatica cox1 sequences in GenBank, whereas F. gigantica cox1 sequences were 46.8% similar with the sequence detected from the cox1 positive samples. However, F. gigantica cox1 and ITS-2 markers were not detected by PCR in the 500 samples of water dropwort. Collectively, in this survey of the water dropwort contamination with Fasciola species, very low prevalence of F. hepatica contamination was detected in the samples.

  8. [Using exon combined target region capture sequencing chip to detect the disease-causing genes of retinitis pigmentosa].

    PubMed

    Rong, Weining; Chen, Xuejuan; Li, Huiping; Liu, Yani; Sheng, Xunlun

    2014-06-01

    To detect the disease-causing genes of 10 retinitis pigmentosa pedigrees by using exon combined target region capture sequencing chip. Pedigree investigation study. From October 2010 to December 2013, 10 RP pedigrees were recruited for this study in Ningxia Eye Hospital. All the patients and family members received complete ophthalmic examinations. DNA was abstracted from patients, family members and controls. Using exon combined target region capture sequencing chip to screen the candidate disease-causing mutations. Polymerase chain reaction (PCR) and direct sequencing were used to confirm the disease-causing mutations. Seventy patients and 23 normal family members were recruited from 10 pedigrees. Among 10 RP pedigrees, 1 was autosomal dominant pedigrees and 9 were autosomal recessive pedigrees. 7 mutations related to 5 genes of 5 pedigrees were detected. A frameshift mutation on BBS7 gene was detected in No.2 pedigree, the patients of this pedigree combined with central obesity, polydactyly and mental handicap. No.2 pedigree was diagnosed as Bardet-Biedl syndrome finally. A missense mutation was detected in No.7 and No.10 pedigrees respectively. Because the patients suffered deafness meanwhile, the final diagnosis was Usher syndrome. A missense mutation on C3 gene related to age-related macular degeneration was also detected in No. 7 pedigrees. A nonsense mutation and a missense mutation on CRB1 gene were detected in No. 1 pedigree and a splicesite mutation on PROM1 gene was detected in No. 5 pedigree. Retinitis pigmentosa is a kind of genetic eye disease with diversity clinical phenotypes. Rapid and effective genetic diagnosis technology combined with clinical characteristics analysis is helpful to improve the level of clinical diagnosis of RP.

  9. Noninvasive Prenatal Screening for Genetic Diseases Using Massively Parallel Sequencing of Maternal Plasma DNA

    PubMed Central

    Chitty, Lyn S.; Lo, Y. M. Dennis

    2015-01-01

    The identification of cell-free fetal DNA (cffDNA) in maternal plasma in 1997 heralded the most significant change in obstetric care for decades, with the advent of safer screening and diagnosis based on analysis of maternal blood. Here, we describe how the technological advances offered by next-generation sequencing have allowed for the development of a highly sensitive screening test for aneuploidies as well as definitive prenatal molecular diagnosis for some monogenic disorders. PMID:26187875

  10. [Movement analysis of upper extremity hemiparesis in patients with cerebrovascular disease: a pilot study].

    PubMed

    Molina Rueda, F; Rivas Montero, F M; Pérez de Heredia Torres, M; Alguacil Diego, I M; Molero Sánchez, A; Miangolarra Page, J C

    2012-01-01

    As a result of neurophysiological injury, stroke patients have mobility limitations, mainly on the side of the body contralateral to the lesioned hemisphere. The purpose of this study is to quantify motor compensation strategies in stroke patients during the activity of drinking water from a glass. Four male patient with cerebrovascular disease and four right-handed, healthy male control subjects. The motion analysis was conducted using the Vicon Motion System(®) and surface electromyography equipment ZeroWire Aurion(®). We analysed elbow, shoulder and trunk joint movements and performed a qualitative analysis of the sequence of muscle activation. Trunk, shoulder and elbow movements measured in the stroke patient along the sagittal plane decreased during the drinking from a glass activity, while the movements in the shoulder in the coronal plane and trunk increased. As for the sequence of muscle activation, anterior, middle and posterior deltoid all contracted in the patient group during the task, while the upper trapezius activation remained throughout the activity. Quantitative analysis of movement provides quantitative information on compensation strategies used by stroke patients, and is therefore, clinically relevant. Copyright © 2011 Sociedad Española de Neurología. Published by Elsevier Espana. All rights reserved.

  11. Comprehensive molecular diagnosis of 179 Leber congenital amaurosis and juvenile retinitis pigmentosa patients by targeted next generation sequencing

    PubMed Central

    Wang, Xia; Wang, Hui; Sun, Vincent; Tuan, Han-Fang; Keser, Vafa; Wang, Keqing; Ren, Huanan; Lopez, Irma; Zaneveld, Jacques E; Siddiqui, Sorath; Bowles, Stephanie; Khan, Ayesha; Salvo, Jason; Jacobson, Samuel G; Iannaccone, Alessandro; Wang, Feng; Birch, David; Heckenlively, John R; Fishman, Gerald A; Traboulsi, Elias I; Li, Yumei; Wheaton, Dianna; Koenekoop, Robert K; Chen, Rui

    2014-01-01

    Background Leber congenital amaurosis (LCA) and juvenile retinitis pigmentosa (RP) are inherited retinal diseases that cause early onset severe visual impairment. An accurate molecular diagnosis can refine the clinical diagnosis and allow gene specific treatments. Methods We developed a capture panel that enriches the exonic DNA of 163 known retinal disease genes. Using this panel, we performed targeted next generation sequencing (NGS) for a large cohort of 179 unrelated and prescreened patients with the clinical diagnosis of LCA or juvenile RP. Systematic NGS data analysis, Sanger sequencing validation, and segregation analysis were utilised to identify the pathogenic mutations. Patients were revisited to examine the potential phenotypic ambiguity at the time of initial diagnosis. Results Pathogenic mutations for 72 patients (40%) were identified, including 45 novel mutations. Of these 72 patients, 58 carried mutations in known LCA or juvenile RP genes and exhibited corresponding phenotypes, while 14 carried mutations in retinal disease genes that were not consistent with their initial clinical diagnosis. We revisited patients in the latter case and found that homozygous mutations in PRPH2 can cause LCA/juvenile RP. Guided by the molecular diagnosis, we reclassified the clinical diagnosis in two patients. Conclusions We have identified a novel gene and a large number of novel mutations that are associated with LCA/juvenile RP. Our results highlight the importance of molecular diagnosis as an integral part of clinical diagnosis. PMID:23847139

  12. Comprehensive viral enrichment enables sensitive respiratory virus genomic identification and analysis by next generation sequencing.

    PubMed

    O'Flaherty, Brigid M; Li, Yan; Tao, Ying; Paden, Clinton R; Queen, Krista; Zhang, Jing; Dinwiddie, Darrell L; Gross, Stephen M; Schroth, Gary P; Tong, Suxiang

    2018-06-01

    Next generation sequencing (NGS) technologies have revolutionized the genomics field and are becoming more commonplace for identification of human infectious diseases. However, due to the low abundance of viral nucleic acids (NAs) in relation to host, viral identification using direct NGS technologies often lacks sufficient sensitivity. Here, we describe an approach based on two complementary enrichment strategies that significantly improves the sensitivity of NGS-based virus identification. To start, we developed two sets of DNA probes to enrich virus NAs associated with respiratory diseases. The first set of probes spans the genomes, allowing for identification of known viruses and full genome sequencing, while the second set targets regions conserved among viral families or genera, providing the ability to detect both known and potentially novel members of those virus groups. Efficiency of enrichment was assessed by NGS testing reference virus and clinical samples with known infection. We show significant improvement in viral identification using enriched NGS compared to unenriched NGS. Without enrichment, we observed an average of 0.3% targeted viral reads per sample. However, after enrichment, 50%-99% of the reads per sample were the targeted viral reads for both the reference isolates and clinical specimens using both probe sets. Importantly, dramatic improvements on genome coverage were also observed following virus-specific probe enrichment. The methods described here provide improved sensitivity for virus identification by NGS, allowing for a more comprehensive analysis of disease etiology. © 2018 O'Flaherty et al.; Published by Cold Spring Harbor Laboratory Press.

  13. Right ventricular strain analysis from three-dimensional echocardiography by using temporally diffeomorphic motion estimation.

    PubMed

    Zhang, Zhijun; Zhu, Meihua; Ashraf, Muhammad; Broberg, Craig S; Sahn, David J; Song, Xubo

    2014-12-01

    Quantitative analysis of right ventricle (RV) motion is important for study of the mechanism of congenital and acquired diseases. Unlike left ventricle (LV), motion estimation of RV is more difficult because of its complex shape and thin myocardium. Although attempts of finite element models on MR images and speckle tracking on echocardiography have shown promising results on RV strain analysis, these methods can be improved since the temporal smoothness of the motion is not considered. The authors have proposed a temporally diffeomorphic motion estimation method in which a spatiotemporal transformation is estimated by optimization of a registration energy functional of the velocity field in their earlier work. The proposed motion estimation method is a fully automatic process for general image sequences. The authors apply the method by combining with a semiautomatic myocardium segmentation method to the RV strain analysis of three-dimensional (3D) echocardiographic sequences of five open-chest pigs under different steady states. The authors compare the peak two-point strains derived by their method with those estimated from the sonomicrometry, the results show that they have high correlation. The motion of the right ventricular free wall is studied by using segmental strains. The baseline sequence results show that the segmental strains in their methods are consistent with results obtained by other image modalities such as MRI. The image sequences of pacing steady states show that segments with the largest strain variation coincide with the pacing sites. The high correlation of the peak two-point strains of their method and sonomicrometry under different steady states demonstrates that their RV motion estimation has high accuracy. The closeness of the segmental strain of their method to those from MRI shows the feasibility of their method in the study of RV function by using 3D echocardiography. The strain analysis of the pacing steady states shows the potential utility of their method in study on RV diseases.

  14. Diagnostic use of computational retrotransposon detection: Successful definition of pathogenetic mechanism in a ciliopathy phenotype.

    PubMed

    Takenouchi, Toshiki; Kuchikata, Tomu; Yoshihashi, Hiroshi; Fujiwara, Mineko; Uehara, Tomoko; Miyama, Sahoko; Yamada, Shiro; Kosaki, Kenjiro

    2017-05-01

    Among more than 5,000 human monogenic disorders with known causative genes, transposable element insertion of a Long Interspersed Nuclear Element 1 (LINE1, L1) is known as the mechanistic basis in only 13 genetic conditions. Meckel-Gruber syndrome is a rare ciliopathy characterized by occipital encephalocele and cystic kidney disease. Here, we document a boy with occipital encephalocele, post-axial polydactyly, and multicystic renal disease. A medical exome analysis detected a heterozygous frameshift mutation, c.4582_4583delCG p.(Arg1528Serfs*17) in CC2D2A in the maternally derived allele. The further use of a dedicated bioinformatics algorithm for detecting retrotransposon insertions led to the detection of an L1 insertion affecting exon 7 in the paternally derived allele. The complete sequencing and sequence homology analysis of the inserted L1 element showed that the L1 element was classified as L1HS (L1 human specific) and that the element had intact open reading frames in the two L1-encoded proteins. This observation ranks Meckel-Gruber syndrome as only the 14th disorder to be caused by an L1 insertion among more than 5,000 known human genetic disorders. Although a transposable element detection algorithm is not included in the current best-practice next-generation sequencing analysis, the present observation illustrates the utility of such an algorithm, which would require modest computational time and resources. Whether the seemingly infrequent recognition of L1 insertion in the pathogenesis of human genetic diseases might simply reflect a lack of appropriate detection methods remains to be seen. © 2017 Wiley Periodicals, Inc.

  15. Transcriptome analysis of the honey bee fungal pathogen, Ascosphaera apis: implications for host pathogenesis

    PubMed Central

    2012-01-01

    Background We present a comprehensive transcriptome analysis of the fungus Ascosphaera apis, an economically important pathogen of the Western honey bee (Apis mellifera) that causes chalkbrood disease. Our goals were to further annotate the A. apis reference genome and to identify genes that are candidates for being differentially expressed during host infection versus axenic culture. Results We compared A. apis transcriptome sequence from mycelia grown on liquid or solid media with that dissected from host-infected tissue. 454 pyrosequencing provided 252 Mb of filtered sequence reads from both culture types that were assembled into 10,087 contigs. Transcript contigs, protein sequences from multiple fungal species, and ab initio gene predictions were included as evidence sources in the Maker gene prediction pipeline, resulting in 6,992 consensus gene models. A phylogeny based on 12 of these protein-coding loci further supported the taxonomic placement of Ascosphaera as sister to the core Onygenales. Several common protein domains were less abundant in A. apis compared with related ascomycete genomes, particularly cytochrome p450 and protein kinase domains. A novel gene family was identified that has expanded in some ascomycete lineages, but not others. We manually annotated genes with homologs in other fungal genomes that have known relevance to fungal virulence and life history. Functional categories of interest included genes involved in mating-type specification, intracellular signal transduction, and stress response. Computational and manual annotations have been made publicly available on the Bee Pests and Pathogens website. Conclusions This comprehensive transcriptome analysis substantially enhances our understanding of the A. apis genome and its expression during infection of honey bee larvae. It also provides resources for future molecular studies of chalkbrood disease and ultimately improved disease management. PMID:22747707

  16. Genetic diversity and phylogenetic analysis of Aleutian mink disease virus isolates in north-east China.

    PubMed

    Leng, Xue; Liu, Dongxu; Li, Jianming; Shi, Kun; Zeng, Fanli; Zong, Ying; Liu, Yi; Sun, Zhibo; Zhang, Shanshan; Liu, Yadong; Du, Rui

    2018-05-01

    Aleutian mink disease is the most important disease in the mink-farming industry worldwide. So far, few large-scale molecular epidemiological studies of AMDV, based on the NS1 and VP2 genes, have been conducted in China. Here, eight new Chinese isolates of AMDV from three provinces in north-east China were analyzed to clarify the molecular epidemiology of AMDV. The seroprevalence of AMDV in north-east China was 41.8% according to counterimmuno-electrophoresis. Genetic variation analysis of the eight isolates showed significant non-synonymous substitutions in the NS1 and VP2 genes, especially in the NS1 gene. All eight isolates included the caspase-recognition sequence NS1:285 (DQTD↓S), but not the caspase recognition sequence NS1:227 (INTD↓S). The LN1 and LN2 strains had a new 10-amino-acid deletion in-between amino acids 28-37, while the JL3 strain had a one-amino-acid deletion at position 28 in the VP2 protein, compared with the AMDV-G strain. Phylogenetic analysis based on most of NS1 (1755 bp) and complete VP2 showed that the AMDV genotypes did not cluster according to their pathogenicity or geographic origin. Local and imported ADMV species are all prevalent in mink-farming populations in the north-east of China. This is the first study to report the molecular epidemiology of AMDV in north-east China based on most of NS1 and the complete VP2, and further provides information about polyG deletions and new variations in the amino acid sequences of NS1 and VP2 proteins. This report is a good foundation for further study of AMDV in China.

  17. Occurrence and phylogenetic analysis of bovine respiratory syncytial virus in outbreaks of respiratory disease in Norway

    PubMed Central

    2014-01-01

    Background Bovine respiratory syncytial virus (BRSV) is one of the major pathogens involved in the bovine respiratory disease (BRD) complex. The seroprevalence to BRSV in Norwegian cattle herds is high, but its role in epidemics of respiratory disease is unclear. The aims of the study were to investigate the etiological role of BRSV and other respiratory viruses in epidemics of BRD and to perform phylogenetic analysis of Norwegian BRSV strains. Results BRSV infection was detected either serologically and/or virologically in 18 (86%) of 21 outbreaks and in most cases as a single viral agent. When serology indicated that bovine coronavirus and/or bovine parainfluenza virus 3 were present, the number of BRSV positive animals in the herd was always higher, supporting the view of BRSV as the main pathogen. Sequencing of the G gene of BRSV positive samples showed that the current circulating Norwegian BRSVs belong to genetic subgroup II, along with other North European isolates. One isolate from an outbreak in Norway in 1976 was also investigated. This strain formed a separate branch in subgroup II, clearly different from the current Scandinavian sequences. The currently circulating BRSV could be divided into two different strains that were present in the same geographical area at the same time. The sequence variations between the two strains were in an antigenic important part of the G protein. Conclusion The results demonstrated that BRSV is the most important etiological agent of epidemics of BRD in Norway and that it often acts as the only viral agent. The phylogenetic analysis of the Norwegian strains of BRSV and several previously published isolates supported the theory of geographical and temporal clustering of BRSV. PMID:24423030

  18. Loop mediated isothermal amplification (LAMP) assay for detection of coconut root wilt disease and arecanut yellow leaf disease phytoplasma.

    PubMed

    Nair, Smita; Manimekalai, Ramaswamy; Ganga Raj, Palliyath; Hegde, Vinayaka

    2016-07-01

    The coconut root wilt disease (RWD) and the arecanut yellow leaf disease (YLD) are two major phytoplasma associated diseases affecting palms in South India. Greatly debilitating the palm health, these diseases cause substantial yield reduction and economic loss to farmers. A rapid and robust diagnostic technique is crucial in efficient disease management. We established phytoplasma 16S rDNA targeted loop mediated isothermal amplification (LAMP) and real time LAMP based diagnostics for coconut RWD and arecanut YLD. The LAMP reaction was set at 65 °C and end point detection made using hydroxynaphthol blue (HNB) and agarose gel electrophoresis. Molecular typing of LAMP products were made with restriction enzyme HpyCH4 V. Conventional PCR with LAMP external primers and sequencing of amplicons was carried out. Real time LAMP was performed on the Genei II platform (Optigene Ltd., UK). An annealing curve analysis was programmed at the end of the incubation to check the fidelity of the amplicons. The phytoplasma positive samples produced typical ladder like bands on agarose gel, showed colour change from violet to blue with HNB and produced unique annealing peak at 85 ± 0.5 °C in the real time detection. Restriction digestion produced predicted size fragments. Sequencing and BLASTN analysis confirmed that the amplification corresponded to phytoplasma 16S rRNA gene. LAMP method devised here was found to be more robust compared to conventional nested PCR and hence has potential applications in detection of phytoplasma from symptomatic palm samples and in rapid screening of healthy seedlings.

  19. Genes from the 20Q13 amplicon and their uses

    DOEpatents

    Gray, Joe; Collins, Colin; Hwang, Soo-in; Godfrey, Tony; Kowbel, David; Rommens, Johanna

    1999-01-01

    The present invention relates to cDNA sequences from a region of amplification on chromosome 20 associated with disease. The sequences can be used in hybridization methods for the identification of chromosomal abnormalities associated with various diseases. The sequences can also be used for treatment of diseases.

  20. DNA analysis of an uncommon missense mutation in a Gaucher disease patient of Jewish-Polish-Russian descent

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Choy, F.Y.M.; Wei, C.; Applegarth, D.A.

    1994-06-01

    Gaucher disease is the most frequent lysosomal lipid storage disease. It results from deficient glucocerebrosidase activity and is transmitted as an autosomal recessive trait. Three clinical forms of Gaucher disease have been described: type 1, non-neuronopathic; type 2, acute neuronopathic; and type 3, subacute neuronopathic. We have sequenced the full length cDNA of the glucocerebrosidase gene and identified an uncommon mutation in nucleotide position 1604 (genoma DNA nucleotide position 6683) from a Gaucher disease patient of Jewish-Polish-Russian descent with type 1 Gaucher disease. It is a G{yields}A transition in exon 11 that results in {sup 496}Arg{yields}{sup 496}His of glucocerebrosidase. Thismore » missense mutation is present in the heterozygous form and creates a new cleavage site for the endonuclease HphI. We have developed a simple method to detect the presence of this mutation by using HphI restriction fragment length polymorphism analysis of glucocerebrosidase genomic DNA or cDNA. The mutation in the other Gaucher allele of this patient is an A{yields}G transition at cDNA nucleotide position 1226 which creates an XhoI cleavage site after PCR mismatch amplification. The presence of this mutation was also confirmed by sequence analysis. Based on previous reports that mutation 1226 is present only in type 1 Gaucher disease and the observation that there is no neurological involvement in this patient, we conclude that our patient with the 1226/1604 genotype is diagnosed as having type 1 Gaucher disease. Since it was also postulated that mutation 1226 in the homozygous form will usually result in a good prognosis, we speculate that the orthopedic complications and the unusual presence of glomerulosclerosis in this patient may be attributable to the mutation at nucleotide 1604. This speculation will require a description of more patients with this mutation for confirmation. 32 refs., 5 figs.« less

Top