Sample records for sequence typing identified

  1. Development of chemiluminescent probe hybridization, RT-PCR and nucleic acid cycle sequencing assays of Sabin type 3 isolates to identify base pair 472 Sabin type 3 mutants associated with vaccine associated paralytic poliomyelitis.

    PubMed

    Old, M O; Logan, L H; Maldonado, Y A

    1997-11-01

    Sabin type 3 polio vaccine virus is the most common cause of poliovaccine associated paralytic poliomyelitis. Vaccine associated paralytic poliomyelitis cases have been associated with Sabin type 3 revertants containing a single U to C substitution at bp 472 of Sabin type 3. A rapid method of identification of Sabin type 3 bp 472 mutants is described. An enterovirus group-specific probe for use in a chemiluminescent dot blot hybridization assay was developed to identify enterovirus positive viral lysates. A reverse transcription-polymerase chain reaction (RT-PCR) assay producing a 319 bp PCR product containing the Sabin type 3 bp 472 mutation site was then employed to identify Sabin type 3 isolates. Chemiluminescent nucleic acid cycle sequencing of the purified 319 bp PCR product was then employed to identify nucleic acid sequences at bp 472. The enterovirus group probe hybridization procedure and isolation of the Sabin type 3 PCR product were highly sensitive and specific; nucleic acid cycle sequencing corresponded to the known sequence of stock Sabin type 3 isolates. These methods will be used to identify the Sabin type 3 reversion rate from sequential stool samples of infants obtained after the first and second doses of oral poliovirus vaccine.

  2. Deep Sequencing to Identify the Causes of Viral Encephalitis

    PubMed Central

    Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.

    2014-01-01

    Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691

  3. The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color

    USDA-ARS?s Scientific Manuscript database

    Background: Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. Results: We describe the sequencing and assembly of...

  4. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  5. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  6. Subtyping Salmonella enterica serovar enteritidis isolates from different sources by using sequence typing based on virulence genes and clustered regularly interspaced short palindromic repeats (CRISPRs).

    PubMed

    Liu, Fenyun; Kariyawasam, Subhashinie; Jayarao, Bhushan M; Barrangou, Rodolphe; Gerner-Smidt, Peter; Ribot, Efrain M; Knabel, Stephen J; Dudley, Edward G

    2011-07-01

    Salmonella enterica subsp. enterica serovar Enteritidis is a major cause of food-borne salmonellosis in the United States. Two major food vehicles for S. Enteritidis are contaminated eggs and chicken meat. Improved subtyping methods are needed to accurately track specific strains of S. Enteritidis related to human salmonellosis throughout the chicken and egg food system. A sequence typing scheme based on virulence genes (fimH and sseL) and clustered regularly interspaced short palindromic repeats (CRISPRs)-CRISPR-including multi-virulence-locus sequence typing (designated CRISPR-MVLST)-was used to characterize 35 human clinical isolates, 46 chicken isolates, 24 egg isolates, and 63 hen house environment isolates of S. Enteritidis. A total of 27 sequence types (STs) were identified among the 167 isolates. CRISPR-MVLST identified three persistent and predominate STs circulating among U.S. human clinical isolates and chicken, egg, and hen house environmental isolates in Pennsylvania, and an ST that was found only in eggs and humans. It also identified a potential environment-specific sequence type. Moreover, cluster analysis based on fimH and sseL identified a number of clusters, of which several were found in more than one outbreak, as well as 11 singletons. Further research is needed to determine if CRISPR-MVLST might help identify the ecological origins of S. Enteritidis strains that contaminate chickens and eggs.

  7. Molecular biological studies of adult and metacercarial stages of Petasiger exaeretus Dietz, 1909 (Digenea: Echinostomatidae).

    PubMed

    Cech, Gábor; Molnár, Kálmán; Székely, Csaba

    2017-06-01

    Molnár et al. (2015) reported two types of echinostomatid metacercariae in the lateral line organ of Hungarian fish species. Type 1 metacercariae possessed 27 collar spines and 16 uniform and three larger dorsal spines, whereas Type 2 metacercariae bore 27 collar spines and 19 equal-sized dorsal spines. In the recent work, molecular studies carried out on the ITS region and partial 28S rDNA sequences of two types of echinostomatid metacercariae and the sequences of adult stages of the species of Petasiger Dietz, 1909 collected from cormorants (Phalacrocorax carbo L.) showed that some of the Type 2 metacercariae corresponded to Petasiger exaeretus Dietz, 1909, whereas other morphologically similar metacercariae were identified as Petasiger phalacrocoracis (Yamaguti, 1939). The sequences of the Type 1 metacercariae with three larger dorsal spines could not be identified with any of the known sequences from echinostomatid trematodes.

  8. The genetic architecture of type 2 diabetes.

    PubMed

    Fuchsberger, Christian; Flannick, Jason; Teslovich, Tanya M; Mahajan, Anubha; Agarwala, Vineeta; Gaulton, Kyle J; Ma, Clement; Fontanillas, Pierre; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Denis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; van der Schouw, Yvonne T; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeriya; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana C N; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Burtt, Noël P; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Florez, Jose C; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Boehnke, Michael; Altshuler, David; McCarthy, Mark I

    2016-08-04

    The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of the heritability of this disease. Here, to test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole-genome sequencing in 2,657 European individuals with and without diabetes, and exome sequencing in 12,940 individuals from five ancestry groups. To increase statistical power, we expanded the sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support the idea that lower-frequency variants have a major role in predisposition to type 2 diabetes.

  9. The genetic architecture of type 2 diabetes

    PubMed Central

    Ma, Clement; Fontanillas, Pierre; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Denis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; van der Schouw, Yvonne T; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeriya; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana C N; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Burtt, Noël P; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Florez, Jose C; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Boehnke, Michael; Altshuler, David; McCarthy, Mark I

    2016-01-01

    The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of heritability. To test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole genome sequencing in 2,657 Europeans with and without diabetes, and exome sequencing in a total of 12,940 subjects from five ancestral groups. To increase statistical power, we expanded sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support a major role for lower-frequency variants in predisposition to type 2 diabetes. PMID:27398621

  10. Chromosomal 16S Ribosomal RNA Methyltransferase RmtE1 in Escherichia coli Sequence Type 448

    PubMed Central

    Li, Bin; Pacey, Marissa P.

    2017-01-01

    We identified rmtE1, an uncommon 16S ribosomal methyltransferase gene, in an aminoglycoside- and cephalosporin-resistant Escherichia coli sequence type 448 clinical strain co-harboring blaCMY-2. Long-read sequencing revealed insertion of a 101,257-bp fragment carrying both resistance genes to the chromosome. Our findings underscore E. coli sequence type 448 as a potential high-risk multidrug-resistant clone. PMID:28418308

  11. Genomic characterization of two new enterovirus types, EV-A114 and EV-A121.

    PubMed

    Deshpande, Jagadish M; Sharma, Deepa K; Saxena, Vinay K; Shetty, Sushmitha A; Qureshi, Tarique Husain I H; Nalavade, Uma P

    2016-12-01

    Enteroviruses cause a variety of illnesses of the gastrointestinal tract, central nervous system and cardiovascular system. Phylogenetic analysis of VP1 sequences has identified 106 different human enteroviruses classified into four enterovirus species within the genus Enterovirus of the family Picornaviridae. It is likely that not all enterovirus types have been discovered. Between September 2013 and October 2014, stool samples of 6274 apparently healthy children of up to 5 years of age residing in Gorakhpur district, Uttar Pradesh, India were screened for enteroviruses. Virus isolates obtained in RD and Hep-2c cells were identified by complete VP1 sequencing. Enteroviruses were isolated from 3042 samples. A total of 87 different enterovirus types were identified. Two isolates with 71 and 74 % nucleotide sequence similarity to all other known enteroviruses were recognized as novel types. In this paper we report identification and complete genome sequence analysis of these two isolates classified as EV-A114 and EV-A121.

  12. Sequencing artifacts in the type A influenza databases and attempts to correct them.

    PubMed

    Suarez, David L; Chester, Nikki; Hatfield, Jason

    2014-07-01

    There are over 276 000 influenza gene sequences in public databases, with the quality of the sequences determined by the contributor. As part of a high school class project, influenza sequences with possible errors were identified in the public databases based on the size of the gene being longer than expected, with the hypothesis that these sequences would have an error. Students contacted sequence submitters alerting them of the possible sequence issue(s) and requested they the suspect sequence(s) be correct as appropriate. Type A influenza viruses were screened, and gene segments longer than the accepted size were identified for further analysis. Attention was placed on sequences with additional nucleotides upstream or downstream of the highly conserved non-coding ends of the viral segments. A total of 1081 sequences were identified that met this criterion. Three types of errors were commonly observed: non-influenza primer sequence wasn't removed from the sequence; PCR product was cloned and plasmid sequence was included in the sequence; and Taq polymerase added an adenine at the end of the PCR product. Internal insertions of nucleotide sequence were also commonly observed, but in many cases it was unclear if the sequence was correct or actually contained an error. A total of 215 sequences, or 22.8% of the suspect sequences, were corrected in the public databases in the first year of the student project. Unfortunately 138 additional sequences with possible errors were added to the databases in the second year. Additional awareness of the need for data integrity of sequences submitted to public databases is needed to fully reap the benefits of these large data sets. © 2014 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.

  13. Structural and sequence features of two residue turns in beta-hairpins.

    PubMed

    Madan, Bharat; Seo, Sung Yong; Lee, Sun-Gu

    2014-09-01

    Beta-turns in beta-hairpins have been implicated as important sites in protein folding. In particular, two residue β-turns, the most abundant connecting elements in beta-hairpins, have been a major target for engineering protein stability and folding. In this study, we attempted to investigate and update the structural and sequence properties of two residue turns in beta-hairpins with a large data set. For this, 3977 beta-turns were extracted from 2394 nonhomologous protein chains and analyzed. First, the distribution, dihedral angles and twists of two residue turn types were determined, and compared with previous data. The trend of turn type occurrence and most structural features of the turn types were similar to previous results, but for the first time Type II turns in beta-hairpins were identified. Second, sequence motifs for the turn types were devised based on amino acid positional potentials of two-residue turns, and their distributions were examined. From this study, we could identify code-like sequence motifs for the two residue beta-turn types. Finally, structural and sequence properties of beta-strands in the beta-hairpins were analyzed, which revealed that the beta-strands showed no specific sequence and structural patterns for turn types. The analytical results in this study are expected to be a reference in the engineering or design of beta-hairpin turn structures and sequences. © 2014 Wiley Periodicals, Inc.

  14. Sequence analysis of chloroplast chlB gene of medicinal Ephedra species and its application to authentication of Ephedra Herb.

    PubMed

    Guo, Yahong; Tsuruga, Ayako; Yamaguchi, Shigeharu; Oba, Koji; Iwai, Kasumi; Sekita, Setsuko; Mizukami, Hajime

    2006-06-01

    Chloroplast chlB gene encoding subunit B of light-independent protochlorophyllide reductase was amplified from herbarium and crude drug specimens of Ephedra sinica, E. intermedia, E. equisetina, and E. przewalskii. Sequence comparison of the chlB gene indicated that all the E. sinica specimens have the same sequence type (Type S) distinctive from other species, while there are two sequence types (Type E1 and Type E2) in E. equisetina. E. intermedia and E. prezewalskii revealed an identical sequence type (Type IP). E. sinica was also identified by digesting the chlB fragment with Bcl I. A novel method for DNA authentication of Ephedra Herb based on the sequences of the chloroplast chlB gene and internal transcribed spacer of nuclear rRNA genes was developed and successfully applied for identification of the crude drugs obtained in the Chinese market.

  15. Comparative evaluation of the identification of rapidly growing non-tuberculous mycobacteria by mass spectrometry (MALDI-TOF MS), GenoType Mycobacterium CM/AS assay and partial sequencing of the rpoβ gene with phylogenetic analysis as a reference method.

    PubMed

    Costa-Alcalde, José Javier; Barbeito-Castiñeiras, Gema; González-Alba, José María; Aguilera, Antonio; Galán, Juan Carlos; Pérez-Del-Molino, María Luisa

    2018-06-02

    The American Thoracic Society and the Infectious Diseases Society of America recommend that clinically significant non-tuberculous mycobacteria (NTM) should be identified to the species level in order to determine their clinical significance. The aim of this study was to evaluate identification of rapidly growing NTM (RGM) isolated from clinical samples by using MALDI-TOF MS and a commercial molecular system. The results were compared with identification using a reference method. We included 46 clinical isolates of RGM and identified them using the commercial molecular system GenoType ® CM/AS (Hain, Lifescience, Germany), MALDI-TOF MS (Bruker) and, as reference method, partial rpoβ gene sequencing followed by BLAST and phylogenetic analysis with the 1093 sequences available in the GeneBank. The degree of agreement between GenoType ® and MALDI-TOF MS and the reference method, partial rpoβ sequencing, was 27/43 (62.8%) and 38/43 cases (88.3%) respectively. For all the samples correctly classified by GenoType ® , we obtained the same result with MALDI-TOF MS (27/27). However, MALDI-TOF MS also correctly identified 68.75% (11/16) of the samples that GenoType ® had misclassified (p=0.005). MALDI-TOF MS classified significantly better than GenoType ® . When a MALDI-TOF MS score >1.85 was achieved, MALDI-TOF MS and partial rpoβ gene sequencing were equivalent. GenoType ® was not able to distinguish between species belonging to the M. fortuitum complex. MALDI-TOF MS methodology is simple, rapid and associated with lower consumable costs than GenoType ® . The partial rpoβ sequencing methods with BLAST and phylogenetic analysis were not able to identify some RGM unequivocally. Therefore, sequencing of additional regions would be indicated in these cases. Copyright © 2018 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.

  16. Identification and functional characterization of a novel bipartite nuclear localization sequence in ARID1A

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bateman, Nicholas W.; The John P. Murtha Cancer Center, Walter Reed National Military Medical Center, 8901 Wisconsin Avenue, Bethesda 20889, MD; Shoji, Yutaka

    2016-01-01

    AT-rich interactive domain-containing protein 1A (ARID1A) is a recently identified nuclear tumor suppressor frequently altered in solid tumor malignancies. We have identified a bipartite-like nuclear localization sequence (NLS) that contributes to nuclear import of ARID1A not previously described. We functionally confirm activity using GFP constructs fused with wild-type or mutant NLS sequences. We further show that cyto-nuclear localized, bipartite NLS mutant ARID1A exhibits greater stability than nuclear-localized, wild-type ARID1A. Identification of this undescribed functional NLS within ARID1A contributes vital insights to rationalize the impact of ARID1A missense mutations observed in patient tumors. - Highlights: • We have identified a bipartitemore » nuclear localization sequence (NLS) in ARID1A. • Confirmation of the NLS was performed using GFP constructs. • NLS mutant ARID1A exhibits greater stability than wild-type ARID1A.« less

  17. Piscine reovirus: Genomic and molecular phylogenetic analysis from farmed and wild salmonids collected on the Canada/US Pacific Coast

    USGS Publications Warehouse

    Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul S.; Richmond, Zina; Purcell, Maureen K.; Johns, Robert; Johnson, Stewart C.; Sakasida, Sonja M.

    2015-01-01

    Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period.

  18. Piscine Reovirus: Genomic and Molecular Phylogenetic Analysis from Farmed and Wild Salmonids Collected on the Canada/US Pacific Coast

    PubMed Central

    Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.

    2015-01-01

    Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673

  19. Ribosomal DNA intergenic spacer sequence in foxtail millet, Setaria italica (L.) P. Beauv. and its characterization and application to typing of foxtail millet landraces.

    PubMed

    Fukunaga, Kenji; Ichitani, Katsuyuki; Taura, Satoru; Sato, Muneharu; Kawase, Makoto

    2005-02-01

    We determined the sequence of ribosomal DNA (rDNA) intergenic spacer (IGS) of foxtail millet isolated in our previous study, and identified subrepeats in the polymorphic region. We also developed a PCR-based method for identifying rDNA types based on sequence information and assessed 153 accessions of foxtail millet. Results were congruent with our previous works. This study provides new findings regarding the geographical distribution of rDNA variants. This new method facilitates analyses of numerous foxtail millet accessions. It is helpful for typing of foxtail millet germplasms and elucidating the evolution of this millet.

  20. Streptococcus mutans clonal variation revealed by multilocus sequence typing.

    PubMed

    Nakano, Kazuhiko; Lapirattanakul, Jinthana; Nomura, Ryota; Nemoto, Hirotoshi; Alaluusua, Satu; Grönroos, Lisa; Vaara, Martti; Hamada, Shigeyuki; Ooshima, Takashi; Nakagawa, Ichiro

    2007-08-01

    Streptococcus mutans is the major pathogen of dental caries, a biofilm-dependent infectious disease, and occasionally causes infective endocarditis. S. mutans strains have been classified into four serotypes (c, e, f, and k). However, little is known about the S. mutans population, including the clonal relationships among strains of S. mutans, in relation to the particular clones that cause systemic diseases. To address this issue, we have developed a multilocus sequence typing (MLST) scheme for S. mutans. Eight housekeeping gene fragments were sequenced from each of 102 S. mutans isolates collected from the four serotypes in Japan and Finland. Between 14 and 23 alleles per locus were identified, allowing us theoretically to distinguish more than 1.2 x 10(10) sequence types. We identified 92 sequence types in these 102 isolates, indicating that S. mutans contains a diverse population. Whereas serotype c strains were widely distributed in the dendrogram, serotype e, f, and k strains were differentiated into clonal complexes. Therefore, we conclude that the ancestral strain of S. mutans was serotype c. No geographic specificity was identified. However, the distribution of the collagen-binding protein gene (cnm) and direct evidence of mother-to-child transmission were clearly evident. In conclusion, the superior discriminatory capacity of this MLST scheme for S. mutans may have important practical implications.

  1. An Outbreak of Streptococcus pyogenes in a Mental Health Facility: Advantage of Well-Timed Whole-Genome Sequencing Over emm Typing.

    PubMed

    Bergin, Sarah M; Periaswamy, Balamurugan; Barkham, Timothy; Chua, Hong Choon; Mok, Yee Ming; Fung, Daniel Shuen Sheng; Su, Alex Hsin Chuan; Lee, Yen Ling; Chua, Ming Lai Ivan; Ng, Poh Yong; Soon, Wei Jia Wendy; Chu, Collins Wenhan; Tan, Siyun Lucinda; Meehan, Mary; Ang, Brenda Sze Peng; Leo, Yee Sin; Holden, Matthew T G; De, Partha; Hsu, Li Yang; Chen, Swaine L; de Sessions, Paola Florez; Marimuthu, Kalisvar

    2018-05-09

    OBJECTIVEWe report the utility of whole-genome sequencing (WGS) conducted in a clinically relevant time frame (ie, sufficient for guiding management decision), in managing a Streptococcus pyogenes outbreak, and present a comparison of its performance with emm typing.SETTINGA 2,000-bed tertiary-care psychiatric hospital.METHODSActive surveillance was conducted to identify new cases of S. pyogenes. WGS guided targeted epidemiological investigations, and infection control measures were implemented. Single-nucleotide polymorphism (SNP)-based genome phylogeny, emm typing, and multilocus sequence typing (MLST) were performed. We compared the ability of WGS and emm typing to correctly identify person-to-person transmission and to guide the management of the outbreak.RESULTSThe study included 204 patients and 152 staff. We identified 35 patients and 2 staff members with S. pyogenes. WGS revealed polyclonal S. pyogenes infections with 3 genetically distinct phylogenetic clusters (C1-C3). Cluster C1 isolates were all emm type 4, sequence type 915 and had pairwise SNP differences of 0-5, which suggested recent person-to-person transmissions. Epidemiological investigation revealed that cluster C1 was mediated by dermal colonization and transmission of S. pyogenes in a male residential ward. Clusters C2 and C3 were genomically diverse, with pairwise SNP differences of 21-45 and 26-58, and emm 11 and mostly emm120, respectively. Clusters C2 and C3, which may have been considered person-to-person transmissions by emm typing, were shown by WGS to be unlikely by integrating pairwise SNP differences with epidemiology.CONCLUSIONSWGS had higher resolution than emm typing in identifying clusters with recent and ongoing person-to-person transmissions, which allowed implementation of targeted intervention to control the outbreak.Infect Control Hosp Epidemiol 2018;1-9.

  2. Complete Genome Sequences of Two Methicillin-Resistant Staphylococcus haemolyticus Isolates of Multilocus Sequence Type 25, First Detected by Shotgun Metagenomics.

    PubMed

    Couto, Natacha; Chlebowicz, Monika A; Raangs, Erwin C; Friedrich, Alex W; Rossen, John W

    2018-04-05

    The emergence of nosocomial infections by multidrug-resistant Staphylococcus haemolyticus isolates has been reported in several European countries. Here, we report the first two complete genome sequences of S. haemolyticus sequence type 25 (ST25) isolates 83131A and 83131B. Both isolates were isolated from the same clinical sample and were first identified through shotgun metagenomics. Copyright © 2018 Couto et al.

  3. A sparse differential clustering algorithm for tracing cell type changes via single-cell RNA-sequencing data

    PubMed Central

    Barron, Martin; Zhang, Siyuan

    2018-01-01

    Abstract Cell types in cell populations change as the condition changes: some cell types die out, new cell types may emerge and surviving cell types evolve to adapt to the new condition. Using single-cell RNA-sequencing data that measure the gene expression of cells before and after the condition change, we propose an algorithm, SparseDC, which identifies cell types, traces their changes across conditions and identifies genes which are marker genes for these changes. By solving a unified optimization problem, SparseDC completes all three tasks simultaneously. SparseDC is highly computationally efficient and demonstrates its accuracy on both simulated and real data. PMID:29140455

  4. Exome sequencing identifies a DNAJB6 mutation in a family with dominantly-inherited limb-girdle muscular dystrophy.

    PubMed

    Couthouis, Julien; Raphael, Alya R; Siskind, Carly; Findlay, Andrew R; Buenrostro, Jason D; Greenleaf, William J; Vogel, Hannes; Day, John W; Flanigan, Kevin M; Gitler, Aaron D

    2014-05-01

    Limb-girdle muscular dystrophy primarily affects the muscles of the hips and shoulders (the "limb-girdle" muscles), although it is a heterogeneous disorder that can present with varying symptoms. There is currently no cure. We sought to identify the genetic basis of limb-girdle muscular dystrophy type 1 in an American family of Northern European descent using exome sequencing. Exome sequencing was performed on DNA samples from two affected siblings and one unaffected sibling and resulted in the identification of eleven candidate mutations that co-segregated with the disease. Notably, this list included a previously reported mutation in DNAJB6, p.Phe89Ile, which was recently identified as a cause of limb-girdle muscular dystrophy type 1D. Additional family members were Sanger sequenced and the mutation in DNAJB6 was only found in affected individuals. Subsequent haplotype analysis indicated that this DNAJB6 p.Phe89Ile mutation likely arose independently of the previously reported mutation. Since other published mutations are located close by in the G/F domain of DNAJB6, this suggests that the area may represent a mutational hotspot. Exome sequencing provided an unbiased and effective method for identifying the genetic etiology of limb-girdle muscular dystrophy type 1 in a previously genetically uncharacterized family. This work further confirms the causative role of DNAJB6 mutations in limb-girdle muscular dystrophy type 1D. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. Dissemination of metallo-β-lactamase-producing Pseudomonas aeruginosa of sequence type 235 in Asian countries.

    PubMed

    Kim, Moon Jung; Bae, Il Kwon; Jeong, Seok Hoon; Kim, So Hyun; Song, Jae Hoon; Choi, Jae Young; Yoon, Sang Sun; Thamlikitkul, Visanu; Hsueh, Po-Ren; Yasin, Rohani Md; Lalitha, M K; Lee, Kyungwon

    2013-12-01

    To investigate the epidemiological traits of metallo-β-lactamase (MBL)-producing Pseudomonas aeruginosa (MPPA) clinical isolates collected by the Asian Network for Surveillance of Resistant Pathogens (ANSORP). A total of 16 MPPA clinical isolates were collected from six Asian countries in 2000 to 2009 by ANSORP. The MBL gene was detected by PCR amplification. The genetic organization of the class 1 integron carrying the MBL gene cassette was investigated by PCR mapping and sequencing. Southern blotting, repetitive sequence-based PCR and multilocus sequence typing (MLST) experiments were performed to characterize the isolates. PCR and sequencing experiments detected the blaVIM-2 (n = 12), blaVIM-3 (n = 1), blaIMP-6 (n = 2) and blaIMP-26 (n = 1) genes. The MBL genes were located on the chromosome in all isolates except one. Furthermore, all the MBL genes were located in a class 1 integron. All the MPPA isolates from Malaysia, Thailand, Sri Lanka and Korea were identified as sequence type (ST) 235 by MLST. Three VIM-2-producing isolates from India were identified as ST773, and one isolate harbouring VIM-3 from Taiwan was identified as ST298. P. aeruginosa ST235 might play a role in dissemination of MBL genes in Asian countries.

  6. Muscle RAS oncogene homolog (MRAS) recurrent mutation in Borrmann type IV gastric cancer.

    PubMed

    Yasumoto, Makiko; Sakamoto, Etsuko; Ogasawara, Sachiko; Isobe, Taro; Kizaki, Junya; Sumi, Akiko; Kusano, Hironori; Akiba, Jun; Torimura, Takuji; Akagi, Yoshito; Itadani, Hiraku; Kobayashi, Tsutomu; Hasako, Shinichi; Kumazaki, Masafumi; Mizuarai, Shinji; Oie, Shinji; Yano, Hirohisa

    2017-01-01

    The prognosis of patients with Borrmann type IV gastric cancer (Type IV) is extremely poor. Thus, there is an urgent need to elucidate the molecular mechanisms underlying the oncogenesis of Type IV and to identify new therapeutic targets. Although previous studies using whole-exome and whole-genome sequencing have elucidated genomic alterations in gastric cancer, none has focused on comprehensive genetic analysis of Type IV. To discover cancer-relevant genes in Type IV, we performed whole-exome sequencing and genome-wide copy number analysis on 13 patients with Type IV. Exome sequencing identified 178 somatic mutations in protein-coding sequences or at splice sites. Among the mutations, we found a mutation in muscle RAS oncogene homolog (MRAS), which is predicted to cause molecular dysfunction. MRAS belongs to the Ras subgroup of small G proteins, which includes the prototypic RAS oncogenes. We analyzed an additional 46 Type IV samples to investigate the frequency of MRAS mutation. There were eight nonsynonymous mutations (mutation frequency, 17%), showing that MRAS is recurrently mutated in Type IV. Copy number analysis identified six focal amplifications and one homozygous deletion, including insulin-like growth factor 1 receptor (IGF1R) amplification. The samples with IGF1R amplification had remarkably higher IGF1R mRNA and protein expression levels compared with the other samples. This is the first report of MRAS recurrent mutation in human tumor samples. Our results suggest that MRAS mutation and IGF1R amplification could drive tumorigenesis of Type IV and could be new therapeutic targets. © 2016 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.

  7. Flagellin diversity in Clostridium botulinum groups I and II: a new strategy for strain identification.

    PubMed

    Paul, Catherine J; Twine, Susan M; Tam, Kevin J; Mullen, James A; Kelly, John F; Austin, John W; Logan, Susan M

    2007-05-01

    Strains of Clostridium botulinum are traditionally identified by botulinum neurotoxin type; however, identification of an additional target for typing would improve differentiation. Isolation of flagellar filaments and analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) showed that C. botulinum produced multiple flagellin proteins. Nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS) analysis of in-gel tryptic digests identified peptides in all flagellin bands that matched two homologous tandem flagellin genes identified in the C. botulinum Hall A genome. Designated flaA1 and flaA2, these open reading frames encode the major structural flagellins of C. botulinum. Colony PCR and sequencing of flaA1/A2 variable regions classified 80 environmental and clinical strains into group I or group II and clustered isolates into 12 flagellar types. Flagellar type was distinct from neurotoxin type, and epidemiologically related isolates clustered together. Sequencing a larger PCR product, obtained during amplification of flaA1/A2 from type E strain Bennett identified a second flagellin gene, flaB. LC-MS analysis confirmed that flaB encoded a large type E-specific flagellin protein, and the predicted molecular mass for FlaB matched that observed by SDS-PAGE. In contrast, the molecular mass of FlaA was 2 to 12 kDa larger than the mass predicted by the flaA1/A2 sequence of a given strain, suggesting that FlaA is posttranslationally modified. While identification of FlaB, and the observation by SDS-PAGE of different masses of the FlaA proteins, showed the flagellin proteins of C. botulinum to be diverse, the presence of the flaA1/A2 gene in all strains examined facilitates single locus sequence typing of C. botulinum using the flagellin variable region.

  8. Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons

    PubMed Central

    Pagano, Johanna F.B.; Ensink, Wim A.; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P.; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J.; Dekker, Rob J.

    2017-01-01

    5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. PMID:28003516

  9. Probable Diagnosis of a Patient with Niemann-Pick Disease Type C: Managing Pitfalls of Exome Sequencing.

    PubMed

    Zeiger, William A; Jamal, Nasheed I; Scheuner, Maren T; Pittman, Patricia; Raymond, Kimiyo M; Morra, Massimo; Mishra, Shri K

    2018-02-17

    Here, we present a case of a 31-year-old man with progressive cognitive decline, ataxia, and dystonia. Extensive laboratory, radiographic, and targeted genetic studies over the course of several years failed to yield a diagnosis. Initial whole exome sequencing through a commercial laboratory identified several variants of uncertain significance; however, follow-up clinical examination and testing ruled each of these out. Eventually, repeat whole exome sequencing identified a known pathogenic intronic variant in the NPC1 gene (NM_000271.4, c.1554-1009G>A) and an additional heterozygous exonic variant of uncertain significance in the NPC1 gene (NM_000271.4, c.2524T>C). Follow-up biochemical testing was consistent with a diagnosis of probable Niemann-Pick disease Type C (NP-C). This case illustrates the potential of whole exome sequencing for diagnosing rare complex neurologic diseases. It also identifies several potential common pitfalls that must be navigated by clinicians when interpreting commercial whole exome sequencing results.

  10. Combining one-step Sanger sequencing with phasing probe hybridization for HLA class I typing yields rapid, G-group resolution predicting 99% of unique full length protein sequences.

    PubMed

    Tu, Bin; Masaberg, Carly; Hou, Lihua; Behm, Daniel; Brescia, Peter; Cha, Nuri; Kariyawasam, Kanthi; Lee, Jar How; Nong, Thoa; Sells, John; Tausch, Paul; Yang, Ruyan; Ng, Jennifer; Hurley, Carolyn Katovich

    2017-02-01

    Sanger-based DNA sequencing of exons 2+3 of HLA class I alleles from a heterozygote frequently results in two or more alternative genotypes. This study was undertaken to reduce the time and effort required to produce a single high resolution HLA genotype. Samples were typed in parallel by Sanger sequencing and oligonucleotide probe hybridization. This workflow, together with optimization of analysis software, was tested and refined during the typing of over 42,000 volunteers for an unrelated hematopoietic progenitor cell donor registry. Next generation DNA sequencing (NGS) was applied to over 1000 of these samples to identify the alleles present within the G group designations. Single genotypes at G level resolution were obtained for over 95% of the loci without additional assays. The vast majority of alleles identified (>99%) were the primary allele giving the G groups their name. Only 0.7% of the alleles identified encoded protein variants that were not detected by a focus on the antigen recognition domain (ARD)-encoding exons. Our combined method routinely provides biologically relevant typing resolution at the level of the ARD. It can be applied to both single samples or to large volume typing supporting either bone marrow or solid organ transplantation using technologies currently available in many HLA laboratories. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  11. Myelin protein zero gene sequencing diagnoses Charcot-Marie-Tooth Type 1B disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Su, Y.; Zhang, H.; Madrid, R.

    1994-09-01

    Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, affects about 1 in 2600 people in Norway and is found worldwide. CMT Type 1 (CMT1) has slow nerve conduction with demyelinated Schwann cells. Autosomal dominant CMT Type 1B (CMT1B) results from mutations in the myelin protein zero gene which directs the synthesis of more than half of all Schwann cell protein. This gene was mapped to the chromosome 1q22-1q23.1 borderline by fluorescence in situ hybridization. The first 7 of 7 reported CMT1B mutations are unique. Thus the most effective means to identify CMT1B mutations in at-risk family members and fetuses ismore » to sequence the entire coding sequence in dominant or sporadic CMT patients without the CMT1A duplication. Of the 19 primers used in 16 pars to uniquely amplify the entire MPZ coding sequence, 6 primer pairs were used to amplify and sequence the 6 exons. The DyeDeoxy Terminator cycle sequencing method used with four different color fluorescent lables was superior to manual sequencing because it sequences more bases unambiguously from extracted genomic DNA samples within 24 hours. This protocol was used to test 28 CMT and Dejerine-Sottas patients without CMT1A gene duplication. Sequencing MPZ gene-specific amplified fragments identified 9 polymorphic sites within the 6 exons that encode the 248 amino acid MPZ protein. The large number of major CMT1B mutations identified by single strand sequencing are being verified by reverse strand sequencing and when possible, by restriction enzyme analysis. This protocol can be used to distringuish CMT1B patients from othre CMT phenotypes and to determine the CMT1B status of relatives both presymptomatically and prenatally.« less

  12. Reads2Type: a web application for rapid microbial taxonomy identification.

    PubMed

    Saputra, Dhany; Rasmussen, Simon; Larsen, Mette V; Haddad, Nizar; Sperotto, Maria Maddalena; Aarestrup, Frank M; Lund, Ole; Sicheritz-Pontén, Thomas

    2015-11-25

    Identification of bacteria may be based on sequencing and molecular analysis of a specific locus such as 16S rRNA, or a set of loci such as in multilocus sequence typing. In the near future, healthcare institutions and routine diagnostic microbiology laboratories may need to sequence the entire genome of microbial isolates. Therefore we have developed Reads2Type, a web-based tool for taxonomy identification based on whole bacterial genome sequence data. Raw sequencing data provided by the user are mapped against a set of marker probes that are derived from currently available bacteria complete genomes. Using a dataset of 1003 whole genome sequenced bacteria from various sequencing platforms, Reads2Type was able to identify the species with 99.5 % accuracy and on the minutes time scale. In comparison with other tools, Reads2Type offers the advantage of not needing to transfer sequencing files, as the entire computational analysis is done on the computer of whom utilizes the web application. This also prevents data privacy issues to arise. The Reads2Type tool is available at http://www.cbs.dtu.dk/~dhany/reads2type.html.

  13. Typing of canine parvovirus isolates using mini-sequencing based single nucleotide polymorphism analysis.

    PubMed

    Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A

    2012-05-01

    The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.

  14. The 5S rDNA in two Abracris grasshoppers (Ommatolampidinae: Acrididae): molecular and chromosomal organization.

    PubMed

    Bueno, Danilo; Palacios-Gimenez, Octavio Manuel; Martí, Dardo Andrea; Mariguela, Tatiane Casagrande; Cabral-de-Mello, Diogo Cavalcanti

    2016-08-01

    The 5S ribosomal DNA (rDNA) sequences are subject of dynamic evolution at chromosomal and molecular levels, evolving through concerted and/or birth-and-death fashion. Among grasshoppers, the chromosomal location for this sequence was established for some species, but little molecular information was obtained to infer evolutionary patterns. Here, we integrated data from chromosomal and nucleotide sequence analysis for 5S rDNA in two Abracris species aiming to identify evolutionary dynamics. For both species, two arrays were identified, a larger sequence (named type-I) that consisted of the entire 5S rDNA gene plus NTS (non-transcribed spacer) and a smaller (named type-II) with truncated 5S rDNA gene plus short NTS that was considered a pseudogene. For type-I sequences, the gene corresponding region contained the internal control region and poly-T motif and the NTS presented partial transposable elements. Between the species, nucleotide differences for type-I were noticed, while type-II was identical, suggesting pseudogenization in a common ancestor. At chromosomal point to view, the type-II was placed in one bivalent, while type-I occurred in multiple copies in distinct chromosomes. In Abracris, the evolution of 5S rDNA was apparently influenced by the chromosomal distribution of clusters (single or multiple location), resulting in a mixed mechanism integrating concerted and birth-and-death evolution depending on the unit.

  15. Novel DDR2 mutation identified by whole exome sequencing in a Moroccan patient with spondylo-meta-epiphyseal dysplasia, short limb-abnormal calcification type.

    PubMed

    Mansouri, Maria; Kayserili, Hülya; Elalaoui, Siham Chafai; Nishimura, Gen; Iida, Aritoshi; Lyahyai, Jaber; Miyake, Noriko; Matsumoto, Naomichi; Sefiani, Abdelaziz; Ikegawa, Shiro

    2016-02-01

    Spondylo-meta-epiphyseal dysplasia (SMED), short limb-abnormal calcification type (SMED, SL-AC), is a very rare autosomal recessive disorder with various skeletal changes characterized by premature calcification leading to severe disproportionate short stature. Twenty-two patients have been reported until now, but only five mutations (four missense and one splice-site) in the conserved sequence encoding the tyrosine kinase domain of the DDR2 gene has been identified. We report here a novel DDR2 missense mutation, c.370C > T (p.Arg124Trp) in a Moroccan girl with SMED, SL-AC, identified by whole exome sequencing. Our study has expanded the mutational spectrum of this rare disease and it has shown that exome sequencing is a powerful and cost-effective tool for the diagnosis of clinically heterogeneous disorders such as SMED. © 2015 Wiley Periodicals, Inc.

  16. Real-Time PCR Typing of Escherichia coli Based on Multiple Single Nucleotide Polymorphisms--a Convenient and Rapid Method.

    PubMed

    Lager, Malin; Mernelius, Sara; Löfgren, Sture; Söderman, Jan

    2016-01-01

    Healthcare-associated infections caused by Escherichia coli and antibiotic resistance due to extended-spectrum beta-lactamase (ESBL) production constitute a threat against patient safety. To identify, track, and control outbreaks and to detect emerging virulent clones, typing tools of sufficient discriminatory power that generate reproducible and unambiguous data are needed. A probe based real-time PCR method targeting multiple single nucleotide polymorphisms (SNP) was developed. The method was based on the multi locus sequence typing scheme of Institute Pasteur and by adaptation of previously described typing assays. An 8 SNP-panel that reached a Simpson's diversity index of 0.95 was established, based on analysis of sporadic E. coli cases (ESBL n = 27 and non-ESBL n = 53). This multi-SNP assay was used to identify the sequence type 131 (ST131) complex according to the Achtman's multi locus sequence typing scheme. However, it did not fully discriminate within the complex but provided a diagnostic signature that outperformed a previously described detection assay. Pulsed-field gel electrophoresis typing of isolates from a presumed outbreak (n = 22) identified two outbreaks (ST127 and ST131) and three different non-outbreak-related isolates. Multi-SNP typing generated congruent data except for one non-outbreak-related ST131 isolate. We consider multi-SNP real-time PCR typing an accessible primary generic E. coli typing tool for rapid and uniform type identification.

  17. Fetal eye movements on magnetic resonance imaging.

    PubMed

    Woitek, Ramona; Kasprian, Gregor; Lindner, Christian; Stuhr, Fritz; Weber, Michael; Schöpf, Veronika; Brugger, Peter C; Asenbaum, Ulrika; Furtner, Julia; Bettelheim, Dieter; Seidl, Rainer; Prayer, Daniela

    2013-01-01

    Eye movements are the physical expression of upper fetal brainstem function. Our aim was to identify and differentiate specific types of fetal eye movement patterns using dynamic MRI sequences. Their occurrence as well as the presence of conjugated eyeball motion and consistently parallel eyeball position was systematically analyzed. Dynamic SSFP sequences were acquired in 72 singleton fetuses (17-40 GW, three age groups [17-23 GW, 24-32 GW, 33-40 GW]). Fetal eye movements were evaluated according to a modified classification originally published by Birnholz (1981): Type 0: no eye movements; Type I: single transient deviations; Type Ia: fast deviation, slower reposition; Type Ib: fast deviation, fast reposition; Type II: single prolonged eye movements; Type III: complex sequences; and Type IV: nystagmoid. In 95.8% of fetuses, the evaluation of eye movements was possible using MRI, with a mean acquisition time of 70 seconds. Due to head motion, 4.2% of the fetuses and 20.1% of all dynamic SSFP sequences were excluded. Eye movements were observed in 45 fetuses (65.2%). Significant differences between the age groups were found for Type I (p = 0.03), Type Ia (p = 0.031), and Type IV eye movements (p = 0.033). Consistently parallel bulbs were found in 27.3-45%. In human fetuses, different eye movement patterns can be identified and described by MRI in utero. In addition to the originally classified eye movement patterns, a novel subtype has been observed, which apparently characterizes an important step in fetal brainstem development. We evaluated, for the first time, eyeball position in fetuses. Ultimately, the assessment of fetal eye movements by MRI yields the potential to identify early signs of brainstem dysfunction, as encountered in brain malformations such as Chiari II or molar tooth malformations.

  18. Identification of GATC- and CCGG- recognizing Type II REases and their putative specificity-determining positions using Scan2S—a novel motif scan algorithm with optional secondary structure constraints

    PubMed Central

    Niv, Masha Y.; Skrabanek, Lucy; Roberts, Richard J.; Scheraga, Harold A.; Weinstein, Harel

    2008-01-01

    Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering. PMID:17972284

  19. Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S--a novel motif scan algorithm with optional secondary structure constraints.

    PubMed

    Niv, Masha Y; Skrabanek, Lucy; Roberts, Richard J; Scheraga, Harold A; Weinstein, Harel

    2008-05-01

    Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering.

  20. Genetic diversity of Flavobacterium psychrophilum isolates from three Oncorhynchus spp. in the United States, as revealed by multilocus sequence typing

    USDA-ARS?s Scientific Manuscript database

    Flavobacterium psychrophilum is an important pathogen of salmonids worldwide. Multilocus sequence typing (MLST) has identified a recombinogenic population structure from which emerged a few epidemic clonal complexes particularly threatening for salmonid aquaculture. To date, MLST genotypes for this ...

  1. Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex.

    PubMed

    Pollen, Alex A; Nowakowski, Tomasz J; Shuga, Joe; Wang, Xiaohui; Leyrat, Anne A; Lui, Jan H; Li, Nianzhen; Szpankowski, Lukasz; Fowler, Brian; Chen, Peilin; Ramalingam, Naveen; Sun, Gang; Thu, Myo; Norris, Michael; Lebofsky, Ronald; Toppani, Dominique; Kemp, Darnell W; Wong, Michael; Clerkson, Barry; Jones, Brittnee N; Wu, Shiquan; Knutsson, Lawrence; Alvarado, Beatriz; Wang, Jing; Weaver, Lesley S; May, Andrew P; Jones, Robert C; Unger, Marc A; Kriegstein, Arnold R; West, Jay A A

    2014-10-01

    Large-scale surveys of single-cell gene expression have the potential to reveal rare cell populations and lineage relationships but require efficient methods for cell capture and mRNA sequencing. Although cellular barcoding strategies allow parallel sequencing of single cells at ultra-low depths, the limitations of shallow sequencing have not been investigated directly. By capturing 301 single cells from 11 populations using microfluidics and analyzing single-cell transcriptomes across downsampled sequencing depths, we demonstrate that shallow single-cell mRNA sequencing (~50,000 reads per cell) is sufficient for unbiased cell-type classification and biomarker identification. In the developing cortex, we identify diverse cell types, including multiple progenitor and neuronal subtypes, and we identify EGR1 and FOS as previously unreported candidate targets of Notch signaling in human but not mouse radial glia. Our strategy establishes an efficient method for unbiased analysis and comparison of cell populations from heterogeneous tissue by microfluidic single-cell capture and low-coverage sequencing of many cells.

  2. Molecular epidemiology over an 11-year period (2000 to 2010) of extended-spectrum β-lactamase-producing Escherichia coli causing bacteremia in a centralized Canadian region.

    PubMed

    Peirano, Gisele; van der Bij, Akke K; Gregson, Daniel B; Pitout, Johann D D

    2012-02-01

    A study was designed to assess the importance of sequence types among extended-spectrum β-lactamase (ESBL)-producing Escherichia coli isolates causing bacteremia over an 11-year period (2000 to 2010) in a centralized Canadian region. A total of 197 patients with incident infections were identified; the majority presented with community-onset urosepsis, with a significant increase in the prevalence of ESBL-producing E. coli during the later part of the study. The majority of E. coli isolates produced either CTX-M-15 or CTX-M-14. We identified 7 different major sequence types among 91% of isolates (i.e., the ST10 clonal complex, ST38, ST131, ST315, ST393, ST405, and ST648) and provided insight into their clinical and molecular characteristics. ST38 was the most antimicrobial-susceptible sequence type and predominated during 2000 to 2004 but disappeared after 2008. ST131 was the most antimicrobial-resistant sequence type, and the influx of a single pulsotype of this sequence type was responsible for the significant increase of ESBL-producing E. coli strains since 2007. During 2010, 49/63 (78%) of the ESBL-producing E. coli isolates belonged to ST131, and this sequence type had established itself as a major drug-resistant pathogen in Calgary, Alberta, Canada, posing an important new public health threat within our region. We urgently need well-designed epidemiological and molecular studies to understand the dynamics of transmission, risk factors, and reservoirs for E. coli ST131. This will provide insight into the emergence and spread of this multiresistant sequence type.

  3. Whole Genome Sequencing for Genomics-Guided Investigations of Escherichia coli O157:H7 Outbreaks.

    PubMed

    Rusconi, Brigida; Sanjar, Fatemeh; Koenig, Sara S K; Mammel, Mark K; Tarr, Phillip I; Eppinger, Mark

    2016-01-01

    Multi isolate whole genome sequencing (WGS) and typing for outbreak investigations has become a reality in the post-genomics era. We applied this technology to strains from Escherichia coli O157:H7 outbreaks. These include isolates from seven North America outbreaks, as well as multiple isolates from the same patient and from different infected individuals in the same household. Customized high-resolution bioinformatics sequence typing strategies were developed to assess the core genome and mobilome plasticity. Sequence typing was performed using an in-house single nucleotide polymorphism (SNP) discovery and validation pipeline. Discriminatory power becomes of particular importance for the investigation of isolates from outbreaks in which macrogenomic techniques such as pulse-field gel electrophoresis or multiple locus variable number tandem repeat analysis do not differentiate closely related organisms. We also characterized differences in the phage inventory, allowing us to identify plasticity among outbreak strains that is not detectable at the core genome level. Our comprehensive analysis of the mobilome identified multiple plasmids that have not previously been associated with this lineage. Applied phylogenomics approaches provide strong molecular evidence for exceptionally little heterogeneity of strains within outbreaks and demonstrate the value of intra-cluster comparisons, rather than basing the analysis on archetypal reference strains. Next generation sequencing and whole genome typing strategies provide the technological foundation for genomic epidemiology outbreak investigation utilizing its significantly higher sample throughput, cost efficiency, and phylogenetic relatedness accuracy. These phylogenomics approaches have major public health relevance in translating information from the sequence-based survey to support timely and informed countermeasures. Polymorphisms identified in this work offer robust phylogenetic signals that index both short- and long-term evolution and can complement currently employed typing schemes for outbreak ex- and inclusion, diagnostics, surveillance, and forensic studies.

  4. Whole Genome Sequencing for Genomics-Guided Investigations of Escherichia coli O157:H7 Outbreaks

    PubMed Central

    Rusconi, Brigida; Sanjar, Fatemeh; Koenig, Sara S. K.; Mammel, Mark K.; Tarr, Phillip I.; Eppinger, Mark

    2016-01-01

    Multi isolate whole genome sequencing (WGS) and typing for outbreak investigations has become a reality in the post-genomics era. We applied this technology to strains from Escherichia coli O157:H7 outbreaks. These include isolates from seven North America outbreaks, as well as multiple isolates from the same patient and from different infected individuals in the same household. Customized high-resolution bioinformatics sequence typing strategies were developed to assess the core genome and mobilome plasticity. Sequence typing was performed using an in-house single nucleotide polymorphism (SNP) discovery and validation pipeline. Discriminatory power becomes of particular importance for the investigation of isolates from outbreaks in which macrogenomic techniques such as pulse-field gel electrophoresis or multiple locus variable number tandem repeat analysis do not differentiate closely related organisms. We also characterized differences in the phage inventory, allowing us to identify plasticity among outbreak strains that is not detectable at the core genome level. Our comprehensive analysis of the mobilome identified multiple plasmids that have not previously been associated with this lineage. Applied phylogenomics approaches provide strong molecular evidence for exceptionally little heterogeneity of strains within outbreaks and demonstrate the value of intra-cluster comparisons, rather than basing the analysis on archetypal reference strains. Next generation sequencing and whole genome typing strategies provide the technological foundation for genomic epidemiology outbreak investigation utilizing its significantly higher sample throughput, cost efficiency, and phylogenetic relatedness accuracy. These phylogenomics approaches have major public health relevance in translating information from the sequence-based survey to support timely and informed countermeasures. Polymorphisms identified in this work offer robust phylogenetic signals that index both short- and long-term evolution and can complement currently employed typing schemes for outbreak ex- and inclusion, diagnostics, surveillance, and forensic studies. PMID:27446025

  5. Sequence Typing Confirms that a Predominant Listeria monocytogenes Clone Caused Human Listeriosis Cases and Outbreaks in Canada from 1988 to 2010

    PubMed Central

    Reimer, Aleisha; Verghese, Bindhu; Lok, Mei; Ziegler, Jennifer; Farber, Jeffrey; Pagotto, Franco; Graham, Morag; Nadon, Celine A.

    2012-01-01

    Human listeriosis outbreaks in Canada have been predominantly caused by serotype 1/2a isolates with highly similar pulsed-field gel electrophoresis (PFGE) patterns. Multilocus sequence typing (MLST) and multi-virulence-locus sequence typing (MVLST) each identified a diverse population of Listeria monocytogenes isolates, and within that, both methods had congruent subtypes that substantiated a predominant clone (clonal complex 8; virulence type 59; proposed epidemic clone 5 [ECV]) that has been causing human illness across Canada for more than 2 decades. PMID:22337989

  6. Use of Whole-Genus Genome Sequence Data To Develop a Multilocus Sequence Typing Tool That Accurately Identifies Yersinia Isolates to the Species and Subspecies Levels

    PubMed Central

    Hall, Miquette; Chattaway, Marie A.; Reuter, Sandra; Savin, Cyril; Strauch, Eckhard; Carniel, Elisabeth; Connor, Thomas; Van Damme, Inge; Rajakaruna, Lakshani; Rajendram, Dunstan; Jenkins, Claire; Thomson, Nicholas R.

    2014-01-01

    The genus Yersinia is a large and diverse bacterial genus consisting of human-pathogenic species, a fish-pathogenic species, and a large number of environmental species. Recently, the phylogenetic and population structure of the entire genus was elucidated through the genome sequence data of 241 strains encompassing every known species in the genus. Here we report the mining of this enormous data set to create a multilocus sequence typing-based scheme that can identify Yersinia strains to the species level to a level of resolution equal to that for whole-genome sequencing. Our assay is designed to be able to accurately subtype the important human-pathogenic species Yersinia enterocolitica to whole-genome resolution levels. We also report the validation of the scheme on 386 strains from reference laboratory collections across Europe. We propose that the scheme is an important molecular typing system to allow accurate and reproducible identification of Yersinia isolates to the species level, a process often inconsistent in nonspecialist laboratories. Additionally, our assay is the most phylogenetically informative typing scheme available for Y. enterocolitica. PMID:25339391

  7. Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons.

    PubMed

    Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M

    2017-04-01

    5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  8. Characteristics and molecular phylogeny of Fasciola flukes from Bangladesh, determined based on spermatogenesis and nuclear and mitochondrial DNA analyses.

    PubMed

    Mohanta, Uday Kumar; Ichikawa-Seki, Madoka; Shoriki, Takuya; Katakura, Ken; Itagaki, Tadashi

    2014-07-01

    This study aimed to precisely discriminate Fasciola spp. based on DNA sequences of nuclear internal transcribed spacer 1 (ITS1) and mitochondrial nicotinamide adenine dinucleotide (NADH) dehydrogenase subunit 1 (nad1) gene. We collected 150 adult flukes from the bile ducts of cattle, buffaloes, sheep, and goats from six different regions of Bangladesh. Spermatogenic status was determined by analyzing stained seminal vesicles. The ITS1 types were analyzed using the polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) method. The nad1 haplotypes were identified based on PCR and direct sequencing and analyzed phylogenetically by comparing with nad1 haplotypes of Fasciola spp. from other Asian countries. Of the 127 aspermic flukes, 98 were identified as Fg type in ITS1, whereas 29 were identified as Fh/Fg type, indicating a combination of ITS1 sequences of Fasciola hepatica and Fasciola gigantica. All the 127 aspermic flukes showed Fsp-NDI-Bd11 in nad1 haplotype with nucleotide sequences identical to aspermic Fasciola sp. from Asian countries. Further, 20 spermic flukes were identified as F. gigantica based on their spermatogenic status and Fg type in ITS1. F. gigantica population was thought to be introduced into Bangladesh considerably earlier than the aspermic Fasciola sp. because 11 haplotypes with high haplotype diversity were detected from the F. gigantica population. However, three flukes from Bangladesh could not be precisely identified, because their spermatogenic status, ITS1 types, and nad1 haplotypes were ambiguous. Therefore, developing a robust method to distinguish aspermic Fasciola sp. from other Fasciola species is necessary in the future.

  9. Genetic analysis of a Chinese family with members affected with Usher syndrome type II and Waardenburg syndrome type IV.

    PubMed

    Wang, Xueling; Lin, Xiao-Jiang; Tang, Xiangrong; Chai, Yong-Chuan; Yu, De-Hong; Chen, Dong-Ye; Wu, Hao

    2017-11-01

    The purpose of this study was to identify the genetic causes of a family presenting with multiple symptoms overlapping Usher syndrome type II (USH2) and Waardenburg syndrome type IV (WS4). Targeted next-generation sequencing including the exon and flanking intron sequences of 79 deafness genes was performed on the proband. Co-segregation of the disease phenotype and the detected variants were confirmed in all family members by PCR amplification and Sanger sequencing. The affected members of this family had two different recessive disorders, USH2 and WS4. By targeted next-generation sequencing, we identified that USH2 was caused by a novel missense mutation, p.V4907D in GPR98; whereas WS4 due to p.V185M in EDNRB. This is the first report of homozygous p.V185M mutation in EDNRB in patient with WS4. This study reported a Chinese family with multiple independent and overlapping phenotypes. In condition, molecular level analysis was efficient to identify the causative variant p.V4907D in GPR98 and p.V185M in EDNRB, also was helpful to confirm the clinical diagnosis of USH2 and WS4. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. Methyltransferases acquired by lactococcal 936-type phage provide protection against restriction endonuclease activity.

    PubMed

    Murphy, James; Klumpp, Jochen; Mahony, Jennifer; O'Connell-Motherway, Mary; Nauta, Arjen; van Sinderen, Douwe

    2014-10-01

    So-called 936-type phages are among the most frequently isolated phages in dairy facilities utilising Lactococcus lactis starter cultures. Despite extensive efforts to control phage proliferation and decades of research, these phages continue to negatively impact cheese production in terms of the final product quality and consequently, monetary return. Whole genome sequencing and in silico analysis of three 936-type phage genomes identified several putative (orphan) methyltransferase (MTase)-encoding genes located within the packaging and replication regions of the genome. Utilising SMRT sequencing, methylome analysis was performed on all three phages, allowing the identification of adenine modifications consistent with N-6 methyladenine sequence methylation, which in some cases could be attributed to these phage-encoded MTases. Heterologous gene expression revealed that M.Phi145I/M.Phi93I and M.Phi93DAM, encoded by genes located within the packaging module, provide protection against the restriction enzymes HphI and DpnII, respectively, representing the first functional MTases identified in members of 936-type phages. SMRT sequencing technology enabled the identification of the target motifs of MTases encoded by the genomes of three lytic 936-type phages and these MTases represent the first functional MTases identified in this species of phage. The presence of these MTase-encoding genes on 936-type phage genomes is assumed to represent an adaptive response to circumvent host encoded restriction-modification systems thereby increasing the fitness of the phages in a dynamic dairy environment.

  11. Molecular characterization and epidemiology of cefoxitin resistance among Enterobacteriaceae lacking inducible chromosomal ampC genes from hospitalized and non-hospitalized patients in Algeria: description of new sequence type in Klebsiella pneumoniae isolates.

    PubMed

    Gharout-Sait, Alima; Touati, Abdelaziz; Guillard, Thomas; Brasme, Lucien; de Champs, Christophe

    2015-01-01

    In this study, 922 consecutive non-duplicate clinical isolates of Enterobacteriaceae obtained from hospitalized and non-hospitalized patients at Bejaia, Algeria were analyzed for AmpC-type β-lactamases production. The ampC genes and their genetic environment were characterized using polymerase chain reaction (PCR) and sequencing. Plasmid incompatibility groups were determined by using PCR-based replicon typing. Phylogenetic grouping and multilocus sequence typing were determined for molecular typing of the plasmid-mediated AmpC (pAmpC) isolates. Of the isolates, 15 (1.6%) were identified as AmpC producers including 14 CMY-4-producing isolates and one DHA-1-producing Klebsiella pneumoniae. All AmpC-producing isolates co-expressed the broad-spectrum TEM-1 β-lactamase and three of them co-produced CTX-M and/or SHV-12 ESBL. Phylogenetic grouping and virulence genotyping of the E. coli isolates revealed that most of them belonged to groups D and B1. Multilocus sequence typing analysis of K. pneumoniae isolates identified four different sequence types (STs) with two new sequences: ST1617 and ST1618. Plasmid replicon typing indicates that blaCMY-4 gene was located on broad host range A/C plasmid, while LVPK replicon was associated with blaDHA-1. All isolates carrying blaCMY-4 displayed the transposon-like structures ISEcp1/ΔISEcp1-blaCMY-blc-sugE. Our study showed that CMY-4 was the main pAmpC in the Enterobacteriaceae isolates in Algeria. Copyright © 2015 Elsevier Editora Ltda. All rights reserved.

  12. Discovery of a bovine enterovirus in alpaca.

    PubMed

    McClenahan, Shasta D; Scherba, Gail; Borst, Luke; Fredrickson, Richard L; Krause, Philip R; Uhlenhaut, Christine

    2013-01-01

    A cytopathic virus was isolated using Madin-Darby bovine kidney (MDBK) cells from lung tissue of alpaca that died of a severe respiratory infection. To identify the virus, the infected cell culture supernatant was enriched for virus particles and a generic, PCR-based method was used to amplify potential viral sequences. Genomic sequence data of the alpaca isolate was obtained and compared with sequences of known viruses. The new alpaca virus sequence was most similar to recently designated Enterovirus species F, previously bovine enterovirus (BEVs), viruses that are globally prevalent in cattle, although they appear not to cause significant disease. Because bovine enteroviruses have not been previously reported in U.S. alpaca, we suspect that this type of infection is fairly rare, and in this case appeared not to spread beyond the original outbreak. The capsid sequence of the detected virus had greatest homology to Enterovirus F type 1 (indicating that the virus should be considered a member of serotype 1), but the virus had greater homology in 2A protease sequence to type 3, suggesting that it may have been a recombinant. Identifying pathogens that infect a new host species for the first time can be challenging. As the disease in a new host species may be quite different from that in the original or natural host, the pathogen may not be suspected based on the clinical presentation, delaying diagnosis. Although this virus replicated in MDBK cells, existing standard culture and molecular methods could not identify it. In this case, a highly sensitive generic PCR-based pathogen-detection method was used to identify this pathogen.

  13. Discovery of a Bovine Enterovirus in Alpaca

    PubMed Central

    McClenahan, Shasta D.; Scherba, Gail; Borst, Luke; Fredrickson, Richard L.; Krause, Philip R.; Uhlenhaut, Christine

    2013-01-01

    A cytopathic virus was isolated using Madin-Darby bovine kidney (MDBK) cells from lung tissue of alpaca that died of a severe respiratory infection. To identify the virus, the infected cell culture supernatant was enriched for virus particles and a generic, PCR-based method was used to amplify potential viral sequences. Genomic sequence data of the alpaca isolate was obtained and compared with sequences of known viruses. The new alpaca virus sequence was most similar to recently designated Enterovirus species F, previously bovine enterovirus (BEVs), viruses that are globally prevalent in cattle, although they appear not to cause significant disease. Because bovine enteroviruses have not been previously reported in U.S. alpaca, we suspect that this type of infection is fairly rare, and in this case appeared not to spread beyond the original outbreak. The capsid sequence of the detected virus had greatest homology to Enterovirus F type 1 (indicating that the virus should be considered a member of serotype 1), but the virus had greater homology in 2A protease sequence to type 3, suggesting that it may have been a recombinant. Identifying pathogens that infect a new host species for the first time can be challenging. As the disease in a new host species may be quite different from that in the original or natural host, the pathogen may not be suspected based on the clinical presentation, delaying diagnosis. Although this virus replicated in MDBK cells, existing standard culture and molecular methods could not identify it. In this case, a highly sensitive generic PCR-based pathogen-detection method was used to identify this pathogen. PMID:23950875

  14. Pancreatic islet enhancer clusters enriched in type 2 diabetes risk-associated variants.

    PubMed

    Pasquali, Lorenzo; Gaulton, Kyle J; Rodríguez-Seguí, Santiago A; Mularoni, Loris; Miguel-Escalada, Irene; Akerman, İldem; Tena, Juan J; Morán, Ignasi; Gómez-Marín, Carlos; van de Bunt, Martijn; Ponsa-Cobas, Joan; Castro, Natalia; Nammo, Takao; Cebola, Inês; García-Hurtado, Javier; Maestro, Miguel Angel; Pattou, François; Piemonti, Lorenzo; Berney, Thierry; Gloyn, Anna L; Ravassard, Philippe; Skarmeta, José Luis Gómez; Müller, Ferenc; McCarthy, Mark I; Ferrer, Jorge

    2014-02-01

    Type 2 diabetes affects over 300 million people, causing severe complications and premature death, yet the underlying molecular mechanisms are largely unknown. Pancreatic islet dysfunction is central in type 2 diabetes pathogenesis, and understanding islet genome regulation could therefore provide valuable mechanistic insights. We have now mapped and examined the function of human islet cis-regulatory networks. We identify genomic sequences that are targeted by islet transcription factors to drive islet-specific gene activity and show that most such sequences reside in clusters of enhancers that form physical three-dimensional chromatin domains. We find that sequence variants associated with type 2 diabetes and fasting glycemia are enriched in these clustered islet enhancers and identify trait-associated variants that disrupt DNA binding and islet enhancer activity. Our studies illustrate how islet transcription factors interact functionally with the epigenome and provide systematic evidence that the dysregulation of islet enhancers is relevant to the mechanisms underlying type 2 diabetes.

  15. Helicobacter spp. from captive bottlenose dolphins (Tursiops spp.) and polar bears (Ursus maritimus).

    PubMed

    Oxley, Andrew P A; Argo, Jeffrey A; McKay, David B

    2005-11-01

    The gastric fluid of six bottlenose dolphins and the faeces of four polar bears from the same oceanarium were examined for the presence of Helicobacter. As detected by PCR, all dolphins and 8/12 samples collected from polar bears were positive for Helicobacter. Novel sequence types were identified in samples collected from these animals of which several were unique to either the dolphins or the polar bears. At least one sequence type was, however, detected in both animal taxa. In addition, a sequence type from a dolphin shared a 98.2-100% identity to sequences from other Helicobacter species from harp seals, sea otters and sea lions. This study reports on the occurrence of novel Helicobacter sequence types in polar bears and dolphins and demonstrates the broad-host range of some species within these animals.

  16. Statistical theory for protein combinatorial libraries. Packing interactions, backbone flexibility, and the sequence variability of a main-chain structure.

    PubMed

    Kono, H; Saven, J G

    2001-02-23

    Combinatorial experiments provide new ways to probe the determinants of protein folding and to identify novel folding amino acid sequences. These types of experiments, however, are complicated both by enormous conformational complexity and by large numbers of possible sequences. Therefore, a quantitative computational theory would be helpful in designing and interpreting these types of experiment. Here, we present and apply a statistically based, computational approach for identifying the properties of sequences compatible with a given main-chain structure. Protein side-chain conformations are included in an atom-based fashion. Calculations are performed for a variety of similar backbone structures to identify sequence properties that are robust with respect to minor changes in main-chain structure. Rather than specific sequences, the method yields the likelihood of each of the amino acids at preselected positions in a given protein structure. The theory may be used to quantify the characteristics of sequence space for a chosen structure without explicitly tabulating sequences. To account for hydrophobic effects, we introduce an environmental energy that it is consistent with other simple hydrophobicity scales and show that it is effective for side-chain modeling. We apply the method to calculate the identity probabilities of selected positions of the immunoglobulin light chain-binding domain of protein L, for which many variant folding sequences are available. The calculations compare favorably with the experimentally observed identity probabilities.

  17. Proposals for the classification of human rhinovirus species A, B and C into genotypically assigned types

    PubMed Central

    McIntyre, Chloe L.; Knowles, Nick J.

    2013-01-01

    Human rhinoviruses (HRVs) frequently cause mild upper respiratory tract infections and more severe disease manifestations such as bronchiolitis and asthma exacerbations. HRV is classified into three species within the genus Enterovirus of the family Picornaviridae. HRV species A and B contain 75 and 25 serotypes identified by cross-neutralization assays, although the use of such assays for routine HRV typing is hampered by the large number of serotypes, replacement of virus isolation by molecular methods in HRV diagnosis and the poor or absent replication of HRV species C in cell culture. To address these problems, we propose an alternative, genotypic classification of HRV-based genetic relatedness analogous to that used for enteroviruses. Nucleotide distances between 384 complete VP1 sequences of currently assigned HRV (sero)types identified divergence thresholds of 13, 12 and 13 % for species A, B and C, respectively, that divided inter- and intra-type comparisons. These were paralleled by 10, 9.5 and 10 % thresholds in the larger dataset of >3800 VP4 region sequences. Assignments based on VP1 sequences led to minor revisions of existing type designations (such as the reclassification of serotype pairs, e.g. A8/A95 and A29/A44, as single serotypes) and the designation of new HRV types A101–106, B101–103 and C34–C51. A protocol for assignment and numbering of new HRV types using VP1 sequences and the restriction of VP4 sequence comparisons to type identification and provisional type assignments is proposed. Genotypic assignment and identification of HRV types will be of considerable value in the future investigation of type-associated differences in disease outcomes, transmission and epidemiology. PMID:23677786

  18. Massively Parallel DNA Sequencing Facilitates Diagnosis of Patients with Usher Syndrome Type 1

    PubMed Central

    Yoshimura, Hidekane; Iwasaki, Satoshi; Nishio, Shin-ya; Kumakawa, Kozo; Tono, Tetsuya; Kobayashi, Yumiko; Sato, Hiroaki; Nagai, Kyoko; Ishikawa, Kotaro; Ikezono, Tetsuo; Naito, Yasushi; Fukushima, Kunihiro; Oshikawa, Chie; Kimitsuki, Takashi; Nakanishi, Hiroshi; Usami, Shin-ichi

    2014-01-01

    Usher syndrome is an autosomal recessive disorder manifesting hearing loss, retinitis pigmentosa and vestibular dysfunction, and having three clinical subtypes. Usher syndrome type 1 is the most severe subtype due to its profound hearing loss, lack of vestibular responses, and retinitis pigmentosa that appears in prepuberty. Six of the corresponding genes have been identified, making early diagnosis through DNA testing possible, with many immediate and several long-term advantages for patients and their families. However, the conventional genetic techniques, such as direct sequence analysis, are both time-consuming and expensive. Targeted exon sequencing of selected genes using the massively parallel DNA sequencing technology will potentially enable us to systematically tackle previously intractable monogenic disorders and improve molecular diagnosis. Using this technique combined with direct sequence analysis, we screened 17 unrelated Usher syndrome type 1 patients and detected probable pathogenic variants in the 16 of them (94.1%) who carried at least one mutation. Seven patients had the MYO7A mutation (41.2%), which is the most common type in Japanese. Most of the mutations were detected by only the massively parallel DNA sequencing. We report here four patients, who had probable pathogenic mutations in two different Usher syndrome type 1 genes, and one case of MYO7A/PCDH15 digenic inheritance. This is the first report of Usher syndrome mutation analysis using massively parallel DNA sequencing and the frequency of Usher syndrome type 1 genes in Japanese. Mutation screening using this technique has the power to quickly identify mutations of many causative genes while maintaining cost-benefit performance. In addition, the simultaneous mutation analysis of large numbers of genes is useful for detecting mutations in different genes that are possibly disease modifiers or of digenic inheritance. PMID:24618850

  19. Massively parallel DNA sequencing facilitates diagnosis of patients with Usher syndrome type 1.

    PubMed

    Yoshimura, Hidekane; Iwasaki, Satoshi; Nishio, Shin-Ya; Kumakawa, Kozo; Tono, Tetsuya; Kobayashi, Yumiko; Sato, Hiroaki; Nagai, Kyoko; Ishikawa, Kotaro; Ikezono, Tetsuo; Naito, Yasushi; Fukushima, Kunihiro; Oshikawa, Chie; Kimitsuki, Takashi; Nakanishi, Hiroshi; Usami, Shin-Ichi

    2014-01-01

    Usher syndrome is an autosomal recessive disorder manifesting hearing loss, retinitis pigmentosa and vestibular dysfunction, and having three clinical subtypes. Usher syndrome type 1 is the most severe subtype due to its profound hearing loss, lack of vestibular responses, and retinitis pigmentosa that appears in prepuberty. Six of the corresponding genes have been identified, making early diagnosis through DNA testing possible, with many immediate and several long-term advantages for patients and their families. However, the conventional genetic techniques, such as direct sequence analysis, are both time-consuming and expensive. Targeted exon sequencing of selected genes using the massively parallel DNA sequencing technology will potentially enable us to systematically tackle previously intractable monogenic disorders and improve molecular diagnosis. Using this technique combined with direct sequence analysis, we screened 17 unrelated Usher syndrome type 1 patients and detected probable pathogenic variants in the 16 of them (94.1%) who carried at least one mutation. Seven patients had the MYO7A mutation (41.2%), which is the most common type in Japanese. Most of the mutations were detected by only the massively parallel DNA sequencing. We report here four patients, who had probable pathogenic mutations in two different Usher syndrome type 1 genes, and one case of MYO7A/PCDH15 digenic inheritance. This is the first report of Usher syndrome mutation analysis using massively parallel DNA sequencing and the frequency of Usher syndrome type 1 genes in Japanese. Mutation screening using this technique has the power to quickly identify mutations of many causative genes while maintaining cost-benefit performance. In addition, the simultaneous mutation analysis of large numbers of genes is useful for detecting mutations in different genes that are possibly disease modifiers or of digenic inheritance.

  20. Molecular genetics of cystinuria: Identification of four new mutations and seven polymorphisms, and evidence for genetic heterogeneity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gasparini, P.; Bisceglia, L.; Notarangelo, A.

    A cystinuria disease gene (rBAT) has been recently identified, and some mutations causing the disease have been described. The frequency of these mutations has been investigated in a large sample of 51 Italian and Spanish cystinuric patients. In addition, to identify new mutated alleles, genomic DNA has been analyzed by an accurate and sensitive method able to detect nucleotide changes. Because of the lack of information available on the genomic structure of rBAT gene, the study was carried out using the sequence data so far obtained by us. More than 70% of the entire coding sequence and 8 intron-exon boundariesmore » have been analyzed. Four new mutations and seven intragenic polymorphisms have been detected. All mutations so far identified in rBAT belong only to cystinuria type I alleles, accounting for {approximately} 44% of all type I cystinuric chromosomes. Mutation M467T is the most common mutated allele in the Italian and Spanish populations. After analysis of 70% of the rBAT coding region, we have detected normal sequences in cystinuria type II and type III chromosomes. The presence of rBAT mutated alleles only in type I chromosomes of homozygous (type I/I) and heterozygous (type I/III) patients provides evidence for genetic heterogeneity where rBAT would be responsible only for type I cystinuria and suggests a complementation mechanism to explain the intermediate type I/type III phenotype. 25 refs., 1 fig., 3 tabs.« less

  1. Flagellin Diversity in Clostridium botulinum Groups I and II: a New Strategy for Strain Identification▿

    PubMed Central

    Paul, Catherine J.; Twine, Susan M.; Tam, Kevin J.; Mullen, James A.; Kelly, John F.; Austin, John W.; Logan, Susan M.

    2007-01-01

    Strains of Clostridium botulinum are traditionally identified by botulinum neurotoxin type; however, identification of an additional target for typing would improve differentiation. Isolation of flagellar filaments and analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) showed that C. botulinum produced multiple flagellin proteins. Nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS) analysis of in-gel tryptic digests identified peptides in all flagellin bands that matched two homologous tandem flagellin genes identified in the C. botulinum Hall A genome. Designated flaA1 and flaA2, these open reading frames encode the major structural flagellins of C. botulinum. Colony PCR and sequencing of flaA1/A2 variable regions classified 80 environmental and clinical strains into group I or group II and clustered isolates into 12 flagellar types. Flagellar type was distinct from neurotoxin type, and epidemiologically related isolates clustered together. Sequencing a larger PCR product, obtained during amplification of flaA1/A2 from type E strain Bennett identified a second flagellin gene, flaB. LC-MS analysis confirmed that flaB encoded a large type E-specific flagellin protein, and the predicted molecular mass for FlaB matched that observed by SDS-PAGE. In contrast, the molecular mass of FlaA was 2 to 12 kDa larger than the mass predicted by the flaA1/A2 sequence of a given strain, suggesting that FlaA is posttranslationally modified. While identification of FlaB, and the observation by SDS-PAGE of different masses of the FlaA proteins, showed the flagellin proteins of C. botulinum to be diverse, the presence of the flaA1/A2 gene in all strains examined facilitates single locus sequence typing of C. botulinum using the flagellin variable region. PMID:17351097

  2. Phenotypic H-Antigen Typing by Mass Spectrometry Combined with Genetic Typing of H Antigens, O Antigens, and Toxins by Whole-Genome Sequencing Enhances Identification of Escherichia coli Isolates.

    PubMed

    Cheng, Keding; Chui, Huixia; Domish, Larissa; Sloan, Angela; Hernandez, Drexler; McCorrister, Stuart; Robinson, Alyssia; Walker, Matthew; Peterson, Lorea A M; Majcher, Miles; Ratnam, Sam; Haldane, David J M; Bekal, Sadjia; Wylie, John; Chui, Linda; Tyler, Shaun; Xu, Bianli; Reimer, Aleisha; Nadon, Celine; Knox, J David; Wang, Gehua

    2016-08-01

    Mass spectrometry-based phenotypic H-antigen typing (MS-H) combined with whole-genome-sequencing-based genetic identification of H antigens, O antigens, and toxins (WGS-HOT) was used to type 60 clinical Escherichia coli isolates, 43 of which were previously identified as nonmotile, H type undetermined, or O rough by serotyping or having shown discordant MS-H and serotyping results. Whole-genome sequencing confirmed that MS-H was able to provide more accurate data regarding H antigen expression than serotyping. Further, enhanced and more confident O antigen identification resulted from gene cluster based typing in combination with conventional typing based on the gene pair comprising wzx and wzy and that comprising wzm and wzt The O antigen was identified in 94.6% of the isolates when the two genetic O typing approaches (gene pair and gene cluster) were used in conjunction, in comparison to 78.6% when the gene pair database was used alone. In addition, 98.2% of the isolates showed the existence of genes for various toxins and/or virulence factors, among which verotoxins (Shiga toxin 1 and/or Shiga toxin 2) were 100% concordant with conventional PCR based testing results. With more applications of mass spectrometry and whole-genome sequencing in clinical microbiology laboratories, this combined phenotypic and genetic typing platform (MS-H plus WGS-HOT) should be ideal for pathogenic E. coli typing. Copyright © 2016 Cheng et al.

  3. Molecular sequence typing reveals genotypic diversity among Escherichia coli isolates recovered from a cantaloupe packinghouse in Northwestern Mexico

    USDA-ARS?s Scientific Manuscript database

    The increase in the consumption of fresh produce in the United States has correlated with a rise in the number of reported foodborne illnesses. To identify potential risk factors associated with post-harvest practices, the present study employed multilocus sequence typing (MLST) for the genotypic c...

  4. Loci and candidate genes conferring resistance to soybean cyst nematode HG type 2.5.7.

    PubMed

    Zhao, Xue; Teng, Weili; Li, Yinghui; Liu, Dongyuan; Cao, Guanglu; Li, Dongmei; Qiu, Lijuan; Zheng, Hongkun; Han, Yingpeng; Li, Wenbin

    2017-06-14

    Soybean (Glycine max L. Merr.) cyst nematode (SCN, Heterodera glycines I,) is a major pest of soybean worldwide. The most effective strategy to control this pest involves the use of resistant cultivars. The aim of the present study was to investigate the genome-wide genetic architecture of resistance to SCN HG Type 2.5.7 (race 1) in landrace and elite cultivated soybeans. A total of 200 diverse soybean accessions were screened for resistance to SCN HG Type 2.5.7 and genotyped through sequencing using the Specific Locus Amplified Fragment Sequencing (SLAF-seq) approach with a 6.14-fold average sequencing depth. A total of 33,194 SNPs were identified with minor allele frequencies (MAF) over 4%, covering 97% of all the genotypes. Genome-wide association mapping (GWAS) revealed thirteen SNPs associated with resistance to SCN HG Type 2.5.7. These SNPs were distributed on five chromosomes (Chr), including Chr7, 8, 14, 15 and 18. Four SNPs were novel resistance loci and nine SNPs were located near known QTL. A total of 30 genes were identified as candidate genes underlying SCN resistance. A total of sixteen novel soybean accessions were identified with significant resistance to HG Type 2.5.7. The beneficial alleles and candidate genes identified by GWAS might be valuable for improving marker-assisted breeding efficiency and exploring the molecular mechanisms underlying SCN resistance.

  5. Screening for duplications, deletions and a common intronic mutation detects 35% of second mutations in patients with USH2A monoallelic mutations on Sanger sequencing.

    PubMed

    Steele-Stallard, Heather B; Le Quesne Stabej, Polona; Lenassi, Eva; Luxon, Linda M; Claustres, Mireille; Roux, Anne-Francoise; Webster, Andrew R; Bitner-Glindzicz, Maria

    2013-08-08

    Usher Syndrome is the leading cause of inherited deaf-blindness. It is divided into three subtypes, of which the most common is Usher type 2, and the USH2A gene accounts for 75-80% of cases. Despite recent sequencing strategies, in our cohort a significant proportion of individuals with Usher type 2 have just one heterozygous disease-causing mutation in USH2A, or no convincing disease-causing mutations across nine Usher genes. The purpose of this study was to improve the molecular diagnosis in these families by screening USH2A for duplications, heterozygous deletions and a common pathogenic deep intronic variant USH2A: c.7595-2144A>G. Forty-nine Usher type 2 or atypical Usher families who had missing mutations (mono-allelic USH2A or no mutations following Sanger sequencing of nine Usher genes) were screened for duplications/deletions using the USH2A SALSA MLPA reagent kit (MRC-Holland). Identification of USH2A: c.7595-2144A>G was achieved by Sanger sequencing. Mutations were confirmed by a combination of reverse transcription PCR using RNA extracted from nasal epithelial cells or fibroblasts, and by array comparative genomic hybridisation with sequencing across the genomic breakpoints. Eight mutations were identified in 23 Usher type 2 families (35%) with one previously identified heterozygous disease-causing mutation in USH2A. These consisted of five heterozygous deletions, one duplication, and two heterozygous instances of the pathogenic variant USH2A: c.7595-2144A>G. No variants were found in the 15 Usher type 2 families with no previously identified disease-causing mutations. In 11 atypical families, none of whom had any previously identified convincing disease-causing mutations, the mutation USH2A: c.7595-2144A>G was identified in a heterozygous state in one family. All five deletions and the heterozygous duplication we report here are novel. This is the first time that a duplication in USH2A has been reported as a cause of Usher syndrome. We found that 8 of 23 (35%) of 'missing' mutations in Usher type 2 probands with only a single heterozygous USH2A mutation detected with Sanger sequencing could be attributed to deletions, duplications or a pathogenic deep intronic variant. Future mutation detection strategies and genetic counselling will need to take into account the prevalence of these types of mutations in order to provide a more comprehensive diagnostic service.

  6. An efficient, versatile and scalable pattern growth approach to mine frequent patterns in unaligned protein sequences.

    PubMed

    Ye, Kai; Kosters, Walter A; Ijzerman, Adriaan P

    2007-03-15

    Pattern discovery in protein sequences is often based on multiple sequence alignments (MSA). The procedure can be computationally intensive and often requires manual adjustment, which may be particularly difficult for a set of deviating sequences. In contrast, two algorithms, PRATT2 (http//www.ebi.ac.uk/pratt/) and TEIRESIAS (http://cbcsrv.watson.ibm.com/) are used to directly identify frequent patterns from unaligned biological sequences without an attempt to align them. Here we propose a new algorithm with more efficiency and more functionality than both PRATT2 and TEIRESIAS, and discuss some of its applications to G protein-coupled receptors, a protein family of important drug targets. In this study, we designed and implemented six algorithms to mine three different pattern types from either one or two datasets using a pattern growth approach. We compared our approach to PRATT2 and TEIRESIAS in efficiency, completeness and the diversity of pattern types. Compared to PRATT2, our approach is faster, capable of processing large datasets and able to identify the so-called type III patterns. Our approach is comparable to TEIRESIAS in the discovery of the so-called type I patterns but has additional functionality such as mining the so-called type II and type III patterns and finding discriminating patterns between two datasets. The source code for pattern growth algorithms and their pseudo-code are available at http://www.liacs.nl/home/kosters/pg/.

  7. Occurrence of ascaridoid nematodes in selected edible fish from the Persian Gulf and description of Hysterothylacium larval type XV and Hysterothylacium persicum n. sp. (Nematoda: Raphidascarididae).

    PubMed

    Shamsi, Shokoofeh; Ghadam, Masoumeh; Suthar, Jaydipbhai; Ebrahimzadeh Mousavi, Hoseinali; Soltani, Mehdi; Mirzargar, Saeed

    2016-11-07

    Despite several reports on the presence of the potentially zoonotic nematodes among edible fishes in the Persian Gulf, there is still no study on the specific identification of these parasites or their genetic characterisation. In the present study, a total of 600 fish belonging to five popular species of fish in the region, including Otolithes ruber, Psettodes erumei, Saurida tumbil, Scomberomorus commerson and Sphyraena jello were examined for infection with nematode parasites. Detailed microscopy of nematodes found in the present study followed by characterisation of the first and second internal transcribed spacers (ITS-1 and ITS-2, respectively) showed that they belong to five distinct taxa that could be potentially zoonotic. Anisakis type I was found in four species of fish, had identical ITS sequences as Anisakis typica previously reported in Australian waters and was different from those reported in the Nearctic. Hysterothylacium type VI in the present study was morphologically similar to those previously described from Australasian waters and ITS sequences were identical among Australian specimens and those found in the present study. Another Hysterothylacium larval type was also found in the present study which had identical ITS sequences and similar morphology to those previously reported and identified as H. amoyense in China Sea. Since no ITS sequence data from a well identified adult H. amoyense with an identifiable museum voucher number is yet available and due to some other issues discussed in the article we suggest assignment of this larval type from the China Sea and the Persian Gulf to H. amoyense is doubtful until future studies on a well identified male specimen of H. amoyense or other species reveals the specific identity of this larval type. We propose to refer to this larval type as Hysterothylacium larval type XV. In the present study we also describe a new species, Hysterothylacium persicum and discuss how to differentiate it from closely related species. We also found some adult females with distinct morphology and ITS sequence but due to lack of male specimens they have been referred as Hysterothylacium sp. in this paper. They had the same ITS sequence data as Hysterothylacium larval type VI. This study shows the presence of a relatively broad diversity of potentially zoonotic nematodes in edible fish of the Persian Gulf. Therefore educational campaigns for public and local health practitioners are suggested to protect consumers from becoming infected with these parasites. Copyright © 2016 Elsevier B.V. All rights reserved.

  8. Multilocus sequence typing of total-genome-sequenced bacteria.

    PubMed

    Larsen, Mette V; Cosentino, Salvatore; Rasmussen, Simon; Friis, Carsten; Hasman, Henrik; Marvig, Rasmus Lykke; Jelsbak, Lars; Sicheritz-Pontén, Thomas; Ussery, David W; Aarestrup, Frank M; Lund, Ole

    2012-04-01

    Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the "gold standard" of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.

  9. Four new type I restriction enzymes identified in Escherichia coli clinical isolates

    PubMed Central

    Kasarjian, Julie K. A.; Kodama, Yoshiaki; Iida, Masatake; Matsuda, Katsura; Ryu, Junichi

    2005-01-01

    Using a plasmid transformation method and the RM search computer program, four type I restriction enzymes with new recognition sites and two isoschizomers (EcoBI and Eco377I) were identified in a collection of clinical Escherichia coli isolates. These new enzymes were designated Eco394I, Eco826I, Eco851I and Eco912I. Their recognition sequences were determined to be GAC(5N)RTAAY, GCA(6N)CTGA, GTCA(6N)TGAY and CAC(5N)TGGC, respectively. A methylation sensitivity assay, using various synthetic oligonucleotides, was used to identify the adenines that prevent cleavage when methylated (underlined). These results suggest that type I enzymes are abundant in E.coli and many other bacteria, as has been inferred from bacterial genome sequencing projects. PMID:16040596

  10. Isolation of Propionibacterium acnes among the microbiota of primary endodontic infections with and without intraoral communication.

    PubMed

    Niazi, Sadia Ambreen; Al Kharusi, Hana Suleiman; Patel, Shanon; Bruce, Kenneth; Beighton, David; Foschi, Federico; Mannocci, Francesco

    2016-11-01

    The presence of opportunistic pathogens such as Propionibacterium acnes (P. acnes) may contribute to the endodontic pathology. The presence of P. acnes may be influenced by different endodontic conditions. The aims of the study were firstly, to identify P. acnes within the whole cultivable microbiota of primary endodontic infections, to investigate which P. acnes phylotypes predominate in such infections and secondly to determine if the presence of an "open" communication (e.g. a sinus) can be associated with the isolation of P. acnes from the root canal. The predominant cultivable microbiota of 15 primary endodontic lesions (7 without communication with the oral environment and 8 with an open communication) were identified using partial 16S ribosomal RNA (rRNA) gene sequence analysis. The identification of the organism was determined by interrogating the Human Oral Microbiome Database. The P. acnes isolates were typed on the basis of the recA gene sequence comparison. A neighbor-joining tree was constructed using MEGA 4.1 with the inclusion of known recA sequences. There was no difference in the number of species identified from lesions without communication (5.86 ± 3.7) and those with communication (5.37 ± 3.6) (P > 0.05). PCR-based 16S rRNA gene sequencing revealed P. acnes as the most prevalent isolate recovered from lesions with communication. recA gene sequencing revealed two phylogenetic lineages present in lesion with communication, with mainly type I (further split into type IA and type IB) and type II. The presence of P. acnes as opportunistic pathogens has been confirmed and may sustain the traits observed in specific clinical presentations. Clinical management of open lesions may require further disinfection to eliminate opportunistic bacteria.

  11. Epidemiological characterization of a nosocomial outbreak of extended spectrum β-lactamase Escherichia coli ST-131 confirms the clinical value of core genome multilocus sequence typing.

    PubMed

    Woksepp, Hanna; Ryberg, Anna; Berglind, Linda; Schön, Thomas; Söderman, Jan

    2017-12-01

    Enhanced precision of epidemiological typing in clinically suspected nosocomial outbreaks is crucial. Our aim was to investigate whether single nucleotide polymorphism (SNP) analysis and core genome (cg) multilocus sequence typing (MLST) of whole genome sequencing (WGS) data would more reliably identify a nosocomial outbreak, compared to earlier molecular typing methods. Sixteen isolates from a nosocomial outbreak of ESBL E. coli ST-131 in southeastern Sweden and three control strains were subjected to WGS. Sequences were explored by SNP analysis and cgMLST. cgMLST clearly differentiated between the outbreak isolates and the control isolates (>1400 differences). All clinically identified outbreak isolates showed close clustering (≥2 allele differences), except for two isolates (>50 allele differences). These data confirmed that the isolates with >50 differing genes did not belong to the nosocomial outbreak. The number of SNPs within the outbreak was ≤7, whereas the two discrepant isolates had >700 SNPs. Two of the ESBL E. coli ST-131 isolates did not belong to the clinically identified outbreak. Our results illustrate the power of WGS in terms of resolution, which may avoid overestimation of patients belonging to outbreaks as judged from epidemiological data and previously employed molecular methods with lower discriminatory ability. © 2017 APMIS. Published by John Wiley & Sons Ltd.

  12. Differential correlation for sequencing data.

    PubMed

    Siska, Charlotte; Kechris, Katerina

    2017-01-19

    Several methods have been developed to identify differential correlation (DC) between pairs of molecular features from -omics studies. Most DC methods have only been tested with microarrays and other platforms producing continuous and Gaussian-like data. Sequencing data is in the form of counts, often modeled with a negative binomial distribution making it difficult to apply standard correlation metrics. We have developed an R package for identifying DC called Discordant which uses mixture models for correlations between features and the Expectation Maximization (EM) algorithm for fitting parameters of the mixture model. Several correlation metrics for sequencing data are provided and tested using simulations. Other extensions in the Discordant package include additional modeling for different types of differential correlation, and faster implementation, using a subsampling routine to reduce run-time and address the assumption of independence between molecular feature pairs. With simulations and breast cancer miRNA-Seq and RNA-Seq data, we find that Spearman's correlation has the best performance among the tested correlation methods for identifying differential correlation. Application of Spearman's correlation in the Discordant method demonstrated the most power in ROC curves and sensitivity/specificity plots, and improved ability to identify experimentally validated breast cancer miRNA. We also considered including additional types of differential correlation, which showed a slight reduction in power due to the additional parameters that need to be estimated, but more versatility in applications. Finally, subsampling within the EM algorithm considerably decreased run-time with negligible effect on performance. A new method and R package called Discordant is presented for identifying differential correlation with sequencing data. Based on comparisons with different correlation metrics, this study suggests Spearman's correlation is appropriate for sequencing data, but other correlation metrics are available to the user depending on the application and data type. The Discordant method can also be extended to investigate additional DC types and subsampling with the EM algorithm is now available for reduced run-time. These extensions to the R package make Discordant more robust and versatile for multiple -omics studies.

  13. Fetal Eye Movements on Magnetic Resonance Imaging

    PubMed Central

    Woitek, Ramona; Kasprian, Gregor; Lindner, Christian; Stuhr, Fritz; Weber, Michael; Schöpf, Veronika; Brugger, Peter C.; Asenbaum, Ulrika; Furtner, Julia; Bettelheim, Dieter; Seidl, Rainer; Prayer, Daniela

    2013-01-01

    Objectives Eye movements are the physical expression of upper fetal brainstem function. Our aim was to identify and differentiate specific types of fetal eye movement patterns using dynamic MRI sequences. Their occurrence as well as the presence of conjugated eyeball motion and consistently parallel eyeball position was systematically analyzed. Methods Dynamic SSFP sequences were acquired in 72 singleton fetuses (17–40 GW, three age groups [17–23 GW, 24–32 GW, 33–40 GW]). Fetal eye movements were evaluated according to a modified classification originally published by Birnholz (1981): Type 0: no eye movements; Type I: single transient deviations; Type Ia: fast deviation, slower reposition; Type Ib: fast deviation, fast reposition; Type II: single prolonged eye movements; Type III: complex sequences; and Type IV: nystagmoid. Results In 95.8% of fetuses, the evaluation of eye movements was possible using MRI, with a mean acquisition time of 70 seconds. Due to head motion, 4.2% of the fetuses and 20.1% of all dynamic SSFP sequences were excluded. Eye movements were observed in 45 fetuses (65.2%). Significant differences between the age groups were found for Type I (p = 0.03), Type Ia (p = 0.031), and Type IV eye movements (p = 0.033). Consistently parallel bulbs were found in 27.3–45%. Conclusions In human fetuses, different eye movement patterns can be identified and described by MRI in utero. In addition to the originally classified eye movement patterns, a novel subtype has been observed, which apparently characterizes an important step in fetal brainstem development. We evaluated, for the first time, eyeball position in fetuses. Ultimately, the assessment of fetal eye movements by MRI yields the potential to identify early signs of brainstem dysfunction, as encountered in brain malformations such as Chiari II or molar tooth malformations. PMID:24194885

  14. High-Resolution Melting Analysis for Rapid Detection of Sequence Type 131 Escherichia coli.

    PubMed

    Harrison, Lucas B; Hanson, Nancy D

    2017-06-01

    Escherichia coli isolates belonging to the sequence type 131 (ST131) clonal complex have been associated with the global distribution of fluoroquinolone and β-lactam resistance. Whole-genome sequencing and multilocus sequence typing identify sequence type but are expensive when evaluating large numbers of samples. This study was designed to develop a cost-effective screening tool using high-resolution melting (HRM) analysis to differentiate ST131 from non-ST131 E. coli in large sample populations in the absence of sequence analysis. The method was optimized using DNA from 12 E. coli isolates. Singleplex PCR was performed using 10 ng of DNA, Type-it HRM buffer, and multilocus sequence typing primers and was followed by multiplex PCR. The amplicon sizes ranged from 630 to 737 bp. Melt temperature peaks were determined by performing HRM analysis at 0.1°C resolution from 50 to 95°C on a Rotor-Gene Q 5-plex HRM system. Derivative melt curves were compared between sequence types and analyzed by principal component analysis. A blinded study of 191 E. coli isolates of ST131 and unknown sequence types validated this methodology. This methodology returned 99.2% specificity (124 true negatives and 1 false positive) and 100% sensitivity (66 true positives and 0 false negatives). This HRM methodology distinguishes ST131 from non-ST131 E. coli without sequence analysis. The analysis can be accomplished in about 3 h in any laboratory with an HRM-capable instrument and principal component analysis software. Therefore, this assay is a fast and cost-effective alternative to sequencing-based ST131 identification. Copyright © 2017 Harrison and Hanson.

  15. Multilocus Sequence Typing Compared to Pulsed-Field Gel Electrophoresis for Molecular Typing of Pseudomonas aeruginosa▿

    PubMed Central

    Johnson, Jennifer K.; Arduino, Sonia M.; Stine, O. Colin; Johnson, Judith A.; Harris, Anthony D.

    2007-01-01

    For hospital epidemiologists, determining a system of typing that is discriminatory is essential for measuring the effectiveness of infection control measures. In situations in which the incidence of resistant Pseudomonas aeruginosa is increasing, the ability to discern whether it is due to patient-to-patient transmission versus an increase in patient endogenous strains is often made on the basis of molecular typing. The present study compared the discriminatory abilities of pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST) for 90 P. aeruginosa isolates obtained from cultures of perirectal surveillance swabs from patients in an intensive care unit. PFGE identified 85 distinct types and 76 distinct groups when similarity cutoffs of 100% and 87%, respectively, were used. By comparison, MLST identified 60 sequence types that could be clustered into 11 clonal complexes and 32 singletons. By using the Simpson index of diversity (D), PFGE had a greater discriminatory ability than MLST for P. aeruginosa isolates (D values, 0.999 versus 0.975, respectively). Thus, while MLST was better for detecting genetic relatedness, we determined that PFGE was more discriminatory than MLST for determining genetic differences in P. aeruginosa. PMID:17881548

  16. Comparison of Three Different Hepatitis C Virus Genotyping Methods: 5'NCR PCR-RFLP, Core Type-Specific PCR, and NS5b Sequencing in a Tertiary Care Hospital in South India.

    PubMed

    Daniel, Hubert D-J; David, Joel; Raghuraman, Sukanya; Gnanamony, Manu; Chandy, George M; Sridharan, Gopalan; Abraham, Priya

    2017-05-01

    Based on genetic heterogeneity, hepatitis C virus (HCV) is classified into seven major genotypes and 64 subtypes. In spite of the sequence heterogeneity, all genotypes share an identical complement of colinear genes within the large open reading frame. The genetic interrelationships between these genes are consistent among genotypes. Due to this property, complete sequencing of the HCV genome is not required. HCV genotypes along with subtypes are critical for planning antiviral therapy. Certain genotypes are also associated with higher progression to liver cirrhosis. In this study, 100 blood samples were collected from individuals who came for routine HCV genotype identification. These samples were used for the comparison of two different genotyping methods (5'NCR PCR-RFLP and HCV core type-specific PCR) with NS5b sequencing. Of the 100 samples genotyped using 5'NCR PCR-RFLP and HCV core type-specific PCR, 90% (κ = 0.913, P < 0.00) and 96% (κ = 0.794, P < 0.00) correlated with NS5b sequencing, respectively. Sixty percent and 75% of discordant samples by 5'NCR PCR-RFLP and HCV core type-specific PCR, respectively, belonged to genotype 6. All the HCV genotype 1 subtypes were classified accurately by both the methods. This study shows that the 5'NCR-based PCR-RFLP and the HCV core type-specific PCR-based assays correctly identified HCV genotypes except genotype 6 from this region. Direct sequencing of the HCV core region was able to identify all the genotype 6 from this region and serves as an alternative to NS5b sequencing. © 2016 Wiley Periodicals, Inc.

  17. Identification and functional characterization of a novel bipartite nuclear localization sequence in ARID1A.

    PubMed

    Bateman, Nicholas W; Shoji, Yutaka; Conrads, Kelly A; Stroop, Kevin D; Hamilton, Chad A; Darcy, Kathleen M; Maxwell, George L; Risinger, John I; Conrads, Thomas P

    2016-01-01

    AT-rich interactive domain-containing protein 1A (ARID1A) is a recently identified nuclear tumor suppressor frequently altered in solid tumor malignancies. We have identified a bipartite-like nuclear localization sequence (NLS) that contributes to nuclear import of ARID1A not previously described. We functionally confirm activity using GFP constructs fused with wild-type or mutant NLS sequences. We further show that cyto-nuclear localized, bipartite NLS mutant ARID1A exhibits greater stability than nuclear-localized, wild-type ARID1A. Identification of this undescribed functional NLS within ARID1A contributes vital insights to rationalize the impact of ARID1A missense mutations observed in patient tumors. Copyright © 2015 Elsevier Inc. All rights reserved.

  18. MULTILOCUS SEQUENCE TYPING OF BRUCELLA ISOLATES FROM THAILAND.

    PubMed

    Chawjiraphan, Wireeya; Sonthayanon, Piengchan; Chanket, Phanita; Benjathummarak, Surachet; Kerdsin, Anusak; Kalambhaheti, Thareerat

    2016-11-01

    Although brucellosis outbreaks in Thailand are rare, they cause abortions and infertility in animals, resulting in significant economic loss. Because Brucella spp display > 90% DNA homology, multilocus sequence typing (MLST) was employed to categorize local Brucella isolates into sequence types (STs) and to determine their genetic relatedness. Brucella samples were isolated from vaginal secretion of cows and goats, and from blood cultures of infected individuals. Brucella species were determined by multiplex PCR of eight loci, in addition to MLST based on partial DNA sequences of nine house-keeping genes. MLST analysis of 36 isolates revealed 78 distinct novel allele types and 34 novel STs, while two isolates possessed the known ST8. Sequence alignments identified polymorphic sites in each allele, ranging from 2-6%, while overall genetic diversity was 3.6%. MLST analysis of the 36 Brucella isolates classified them into three species, namely, B. melitensis, B. abortus and B. suis, in agreement with multiplex PCR results. Genetic relatedness among ST members of B. melitensis and B. abortus determined by eBURST program revealed ST2 as founder of B. abortus isolates and ST8 the founder of B. melitensis isolates. ST 36, 41 and 50 of Thai Brucella isolates were identified as single locus variants of clonal cluster (CC) 8, while the majority of STs were diverse. The genetic diversity and relatedness identified using MLST revealed hitherto unexpected diversity among Thai Brucella isolates. Genetic classification of isolates could reveal the route of brucellosis transmission among humans and farm animals and also reveal their relationship with other isolates in the region and other parts of the world.

  19. Population Structure in Nontypeable Haemophilus influenzae

    PubMed Central

    LaCross, Nathan C.; Marrs, Carl F.; Gilsdorf, Janet R.

    2013-01-01

    Nontypeable Haemophilus influenzae (NTHi) frequently colonize the human pharynx asymptomatically, and are an important cause of otitis media in children. Past studies have identified typeable H. influenzae as being clonal, but the population structure of NTHi has not been extensively characterized. The research presented here investigated the diversity and population structure in a well-characterized collection of NTHi isolated from the middle ears of children with otitis media or the pharynges of healthy children in three disparate geographic regions. Multilocus sequence typing identified 109 unique sequence types among 170 commensal and otitis media-associated NTHi isolates from Finland, Israel, and the US. The largest clonal complex contained only five sequence types, indicating a high level of genetic diversity. The eBURST v3, ClonalFrame 1.1, and structure 2.3.3 programs were used to further characterize diversity and population structure from the sequence typing data. Little clustering was apparent by either disease state (otitis media or commensalism) or geography in the ClonalFrame phylogeny. Population structure was clearly evident, with support for eight populations when all 170 isolates were analyzed. Interestingly, one population contained only commensal isolates, while two others consisted solely of otitis media isolates, suggesting associations between population structure and disease. PMID:23266487

  20. Improving taxonomic accuracy for fungi in public sequence databases: applying ‘one name one species’ in well-defined genera with Trichoderma/Hypocrea as a test case

    PubMed Central

    Strope, Pooja K; Chaverri, Priscila; Gazis, Romina; Ciufo, Stacy; Domrachev, Michael; Schoch, Conrad L

    2017-01-01

    Abstract The ITS (nuclear ribosomal internal transcribed spacer) RefSeq database at the National Center for Biotechnology Information (NCBI) is dedicated to the clear association between name, specimen and sequence data. This database is focused on sequences obtained from type material stored in public collections. While the initial ITS sequence curation effort together with numerous fungal taxonomy experts attempted to cover as many orders as possible, we extended our latest focus to the family and genus ranks. We focused on Trichoderma for several reasons, mainly because the asexual and sexual synonyms were well documented, and a list of proposed names and type material were recently proposed and published. In this case study the recent taxonomic information was applied to do a complete taxonomic audit for the genus Trichoderma in the NCBI Taxonomy database. A name status report is available here: https://www.ncbi.nlm.nih.gov/Taxonomy/TaxIdentifier/tax_identifier.cgi. As a result, the ITS RefSeq Targeted Loci database at NCBI has been augmented with more sequences from type and verified material from Trichoderma species. Additionally, to aid in the cross referencing of data from single loci and genomes we have collected a list of quality records of the RPB2 gene obtained from type material in GenBank that could help validate future submissions. During the process of curation misidentified genomes were discovered, and sequence records from type material were found hidden under previous classifications. Source metadata curation, although more cumbersome, proved to be useful as confirmation of the type material designation. Database URL: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA177353 PMID:29220466

  1. Molecular characterization of a distinct monopartite begomovirus associated with betasatellites and alphasatellites infecting Pisum sativum in Nepal.

    PubMed

    Shahid, M S; Pudashini, B J; Khatri-Chhetri, G B; Briddon, R W; Natsuaki, K T

    2017-04-01

    Pea (Pisum sativum) plants exhibiting leaf distortion, yellowing, stunted growth and reduction in leaf size from Rampur, Nepal were shown to be infected by a begomovirus in association with betasatellites and alphasatellites. The begomovirus associated with the disease showed only low levels of nucleotide sequence identity (<91%) to previously characterized begomoviruses. This finding indicates that the pea samples were infected with an as yet undescribed begomovirus for which the name Pea leaf distortion virus (PLDV) is proposed. Two species of betasatellite were identified in association with PLDV. One group of sequences had high (>78%) nucleotide sequence identity to isolates of Ludwigia leaf distortion betasatellite (LuLDB), and the second group had less than 78% to all other betasatellite sequences. This showed PLDV to be associated with either LuLDB or a previously undescribed betasatellite for which the name Pea leaf distortion betasatellite is proposed. Two types of alphasatellites were identified in the PLDV-infected pea plants. The first type showed high levels of sequence identity to Ageratum yellow vein alphasatellite, and the second type showed high levels of identity to isolates of Sida yellow vein China alphasatellite. These are the first begomovirus, betasatellites and alphasatellites isolated from pea.

  2. High-Throughput Single-Cell RNA Sequencing and Data Analysis.

    PubMed

    Sagar; Herman, Josip Stefan; Pospisilik, John Andrew; Grün, Dominic

    2018-01-01

    Understanding biological systems at a single cell resolution may reveal several novel insights which remain masked by the conventional population-based techniques providing an average readout of the behavior of cells. Single-cell transcriptome sequencing holds the potential to identify novel cell types and characterize the cellular composition of any organ or tissue in health and disease. Here, we describe a customized high-throughput protocol for single-cell RNA-sequencing (scRNA-seq) combining flow cytometry and a nanoliter-scale robotic system. Since scRNA-seq requires amplification of a low amount of endogenous cellular RNA, leading to substantial technical noise in the dataset, downstream data filtering and analysis require special care. Therefore, we also briefly describe in-house state-of-the-art data analysis algorithms developed to identify cellular subpopulations including rare cell types as well as to derive lineage trees by ordering the identified subpopulations of cells along the inferred differentiation trajectories.

  3. Burkholderia pseudomallei Isolates from Sarawak, Malaysian Borneo, Are Predominantly Susceptible to Aminoglycosides and Macrolides

    PubMed Central

    Podin, Yuwana; Sarovich, Derek S.; Price, Erin P.; Kaestli, Mirjam; Mayo, Mark; Hii, KingChing; Ngian, HieUng; Wong, SeeChang; Wong, IngTien; Wong, JinShyan; Mohan, Anand; Ooi, MongHow; Fam, TemLom; Wong, Jack; Tuanyok, Apichai; Keim, Paul; Giffard, Philip M.

    2014-01-01

    Melioidosis is a potentially fatal disease caused by the saprophytic bacterium Burkholderia pseudomallei. Resistance to gentamicin is generally a hallmark of B. pseudomallei, and gentamicin is a selective agent in media used for diagnosis of melioidosis. In this study, we determined the prevalence and mechanism of gentamicin susceptibility found in B. pseudomallei isolates from Sarawak, Malaysian Borneo. We performed multilocus sequence typing and antibiotic susceptibility testing on 44 B. pseudomallei clinical isolates from melioidosis patients in Sarawak district hospitals. Whole-genome sequencing was used to identify the mechanism of gentamicin susceptibility. A novel allelic-specific PCR was designed to differentiate gentamicin-sensitive isolates from wild-type B. pseudomallei. A reversion assay was performed to confirm the involvement of this mechanism in gentamicin susceptibility. A substantial proportion (86%) of B. pseudomallei clinical isolates in Sarawak, Malaysian Borneo, were found to be susceptible to the aminoglycoside gentamicin, a rare occurrence in other regions where B. pseudomallei is endemic. Gentamicin sensitivity was restricted to genetically related strains belonging to sequence type 881 or its single-locus variant, sequence type 997. Whole-genome sequencing identified a novel nonsynonymous mutation within amrB, encoding an essential component of the AmrAB-OprA multidrug efflux pump. We confirmed the role of this mutation in conferring aminoglycoside and macrolide sensitivity by reversion of this mutation to the wild-type sequence. Our study demonstrates that alternative B. pseudomallei selective media without gentamicin are needed for accurate melioidosis laboratory diagnosis in Sarawak. This finding may also have implications for environmental sampling of other locations to test for B. pseudomallei endemicity. PMID:24145517

  4. Burkholderia pseudomallei isolates from Sarawak, Malaysian Borneo, are predominantly susceptible to aminoglycosides and macrolides.

    PubMed

    Podin, Yuwana; Sarovich, Derek S; Price, Erin P; Kaestli, Mirjam; Mayo, Mark; Hii, KingChing; Ngian, Hieung; Wong, SeeChang; Wong, IngTien; Wong, JinShyan; Mohan, Anand; Ooi, MongHow; Fam, TemLom; Wong, Jack; Tuanyok, Apichai; Keim, Paul; Giffard, Philip M; Currie, Bart J

    2014-01-01

    Melioidosis is a potentially fatal disease caused by the saprophytic bacterium Burkholderia pseudomallei. Resistance to gentamicin is generally a hallmark of B. pseudomallei, and gentamicin is a selective agent in media used for diagnosis of melioidosis. In this study, we determined the prevalence and mechanism of gentamicin susceptibility found in B. pseudomallei isolates from Sarawak, Malaysian Borneo. We performed multilocus sequence typing and antibiotic susceptibility testing on 44 B. pseudomallei clinical isolates from melioidosis patients in Sarawak district hospitals. Whole-genome sequencing was used to identify the mechanism of gentamicin susceptibility. A novel allelic-specific PCR was designed to differentiate gentamicin-sensitive isolates from wild-type B. pseudomallei. A reversion assay was performed to confirm the involvement of this mechanism in gentamicin susceptibility. A substantial proportion (86%) of B. pseudomallei clinical isolates in Sarawak, Malaysian Borneo, were found to be susceptible to the aminoglycoside gentamicin, a rare occurrence in other regions where B. pseudomallei is endemic. Gentamicin sensitivity was restricted to genetically related strains belonging to sequence type 881 or its single-locus variant, sequence type 997. Whole-genome sequencing identified a novel nonsynonymous mutation within amrB, encoding an essential component of the AmrAB-OprA multidrug efflux pump. We confirmed the role of this mutation in conferring aminoglycoside and macrolide sensitivity by reversion of this mutation to the wild-type sequence. Our study demonstrates that alternative B. pseudomallei selective media without gentamicin are needed for accurate melioidosis laboratory diagnosis in Sarawak. This finding may also have implications for environmental sampling of other locations to test for B. pseudomallei endemicity.

  5. Molecular Typing and Epidemiology of Human Listeriosis Cases, Denmark, 2002-2012.

    PubMed

    Jensen, Anne Kvistholm; Björkman, Jonas T; Ethelberg, Steen; Kiil, Kristoffer; Kemp, Michael; Nielsen, Eva Møller

    2016-04-01

    Denmark has a high incidence of invasive listeriosis (0.9 cases/100,000 population in 2012). We analyzed patient data, clinical outcome, and trends in pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST) of Listeria monocytogenes strains isolated in Denmark during 2002-2012. We performed 2-enzyme PFGE and serotyping on 559 isolates and MLST on 92 isolates and identified some correlation between molecular type and clinical outcome and patient characteristics. We found 178 different PFGE types, but isolates from 122 cases belonged to just 2 closely related PFGE types, clonal complex 8 and sequence type 8. These 2 types were the main cause of a peak in incidence of invasive listeriosis during 2005-2009, possibly representing an outbreak or the presence of a highly prevalent clone. However, current typing methods could not fully confirm these possibilities, highlighting the need for more refined discriminatory typing methods to identify outbreaks within frequently occurring L. monocytogenes PFGE types.

  6. Molecular Typing and Epidemiology of Human Listeriosis Cases, Denmark, 2002–20121

    PubMed Central

    Björkman, Jonas T.; Ethelberg, Steen; Kiil, Kristoffer; Kemp, Michael; Nielsen, Eva Møller

    2016-01-01

    Denmark has a high incidence of invasive listeriosis (0.9 cases/100,000 population in 2012). We analyzed patient data, clinical outcome, and trends in pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST) of Listeria monocytogenes strains isolated in Denmark during 2002–2012. We performed 2-enzyme PFGE and serotyping on 559 isolates and MLST on 92 isolates and identified some correlation between molecular type and clinical outcome and patient characteristics. We found 178 different PFGE types, but isolates from 122 cases belonged to just 2 closely related PFGE types, clonal complex 8 and sequence type 8. These 2 types were the main cause of a peak in incidence of invasive listeriosis during 2005–2009, possibly representing an outbreak or the presence of a highly prevalent clone. However, current typing methods could not fully confirm these possibilities, highlighting the need for more refined discriminatory typing methods to identify outbreaks within frequently occurring L. monocytogenes PFGE types. PMID:26982714

  7. Development of Mycoplasma synoviae (MS) core genome multilocus sequence typing (cgMLST) scheme.

    PubMed

    Ghanem, Mostafa; El-Gazzar, Mohamed

    2018-05-01

    Mycoplasma synoviae (MS) is a poultry pathogen with reported increased prevalence and virulence in recent years. MS strain identification is essential for prevention, control efforts and epidemiological outbreak investigations. Multiple multilocus based sequence typing schemes have been developed for MS, yet the resolution of these schemes could be limited for outbreak investigation. The cost of whole genome sequencing became close to that of sequencing the seven MLST targets; however, there is no standardized method for typing MS strains based on whole genome sequences. In this paper, we propose a core genome multilocus sequence typing (cgMLST) scheme as a standardized and reproducible method for typing MS based whole genome sequences. A diverse set of 25 MS whole genome sequences were used to identify 302 core genome genes as cgMLST targets (35.5% of MS genome) and 44 whole genome sequences of MS isolates from six countries in four continents were used for typing applying this scheme. cgMLST based phylogenetic trees displayed a high degree of agreement with core genome SNP based analysis and available epidemiological information. cgMLST allowed evaluation of two conventional MLST schemes of MS. The high discriminatory power of cgMLST allowed differentiation between samples of the same conventional MLST type. cgMLST represents a standardized, accurate, highly discriminatory, and reproducible method for differentiation between MS isolates. Like conventional MLST, it provides stable and expandable nomenclature, allowing for comparing and sharing the typing results between different laboratories worldwide. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

  8. Routine HLA-B genotyping with PCR-sequence-specific oligonucleotides detects a B*52 variant (B*5206).

    PubMed

    Hoelsch, K; Lenggeler, I; Pfannes, W; Knabe, H; Klein, H-G; Woelpl, A

    2005-05-01

    A new human leukocyte antigen (HLA)-B allele was found during routine typing of samples for a German unrelated bone marrow donor registry, the "Aktion Knochenmarkspende Bayern". After first interpretation of data of two independent low-resolution sequence-specific oligonucleotide typing tests, a B*51 variant was suggested. Further analysis via sequence-based typing identified the sequence as new B*52 allele. This new allele officially assigned as B*5206 differs from HLA-B*520102 by one nucleotide exchange in exon 2. The mutation is located at nucleotide position 274, at which a cytosine is substituted by a thymine leading to an amino acid change at protein position 67 from serine (TCC) to phenylalanine (TTC).

  9. Crash sequence based risk matrix for motorcycle crashes.

    PubMed

    Wu, Kun-Feng; Sasidharan, Lekshmi; Thor, Craig P; Chen, Sheng-Yin

    2018-04-05

    Considerable research has been conducted related to motorcycle and other powered-two-wheeler (PTW) crashes; however, it always has been controversial among practitioners concerning with types of crashes should be first targeted and how to prioritize resources for the implementation of mitigating actions. Therefore, there is a need to identify types of motorcycle crashes that constitute the greatest safety risk to riders - most frequent and most severe crashes. This pilot study seeks exhibit the efficacy of a new approach for prioritizing PTW crash causation sequences as they relate to injury severity to better inform the application of mitigating countermeasures. To accomplish this, the present study constructed a crash sequence-based risk matrix to identify most frequent and most severe motorcycle crashes in an attempt to better connect causes and countermeasures of PTW crashes. Although the frequency of each crash sequence can be computed from crash data, a crash severity model is needed to compare the levels of crash severity among different crash sequences, while controlling for other factors that also have effects on crash severity such drivers' age, use of helmet, etc. The construction of risk matrix based on crash sequences involve two tasks: formulation of crash sequence and the estimation of a mixed-effects (ME) model to adjust the levels of severities for each crash sequence to account for other crash contributing factors that would have an effect on the maximum level of crash severity in a crash. Three data elements from the National Automotive Sampling System - General Estimating System (NASS-GES) data were utilized to form a crash sequence: critical event, crash types, and sequence of events. A mixed-effects model was constructed to model the severity levels for each crash sequence while accounting for the effects of those crash contributing factors on crash severity. A total of 8039 crashes involving 8208 motorcycles occurred during 2011 and 2013 were included in this study, weighted to represent 338,655 motorcyclists involved in traffic crashes in three years (2011-2013)(NHTSA, 2013). The top five most frequent and severe types of crash sequences were identified, accounting for 23 percent of all the motorcycle crashes included in the study, and they are (1) run-off-road crashes on the right, and hitting roadside objects, (2) cross-median crashes, and rollover, (3) left-turn oncoming crashes, and head-on, (4) crossing over (passing through) or turning into opposite direction at intersections, and (5) side-impacted. In addition to crash sequences, several other factors were also identified to have effects on crash severity: use of helmet, presence of horizontal curves, alcohol consumption, road surface condition, roadway functional class, and nighttime condition. Copyright © 2018 Elsevier Ltd. All rights reserved.

  10. Novel cis-acting replication element in the adeno-associated virus type 2 genome is involved in amplification of integrated rep-cap sequences.

    PubMed

    Nony, P; Tessier, J; Chadeuf, G; Ward, P; Giraud, A; Dugast, M; Linden, R M; Moullier, P; Salvetti, A

    2001-10-01

    This study identifies a region of the adeno-associated virus type 2 (AAV-2) rep gene (nucleotides 190 to 540 of wild-type AAV-2) as a cis-acting Rep-dependent element able to promote the replication of transiently transfected plasmids. This viral element is also shown to be involved in the amplification of integrated sequences in the presence of adenovirus and Rep proteins.

  11. Mining co-occurrence and sequence patterns from cancer diagnoses in New York State.

    PubMed

    Wang, Yu; Hou, Wei; Wang, Fusheng

    2018-01-01

    The goal of this study is to discover disease co-occurrence and sequence patterns from large scale cancer diagnosis histories in New York State. In particular, we want to identify disparities among different patient groups. Our study will provide essential knowledge for clinical researchers to further investigate comorbidities and disease progression for improving the management of multiple diseases. We used inpatient discharge and outpatient visit records from the New York State Statewide Planning and Research Cooperative System (SPARCS) from 2011-2015. We grouped each patient's visit history to generate diagnosis sequences for seven most popular cancer types. We performed frequent disease co-occurrence mining using the Apriori algorithm, and frequent disease sequence patterns discovery using the cSPADE algorithm. Different types of cancer demonstrated distinct patterns. Disparities of both disease co-occurrence and sequence patterns were observed from patients within different age groups. There were also considerable disparities in disease co-occurrence patterns with respect to different claim types (i.e., inpatient, outpatient, emergency department and ambulatory surgery). Disparities regarding genders were mostly found where the cancer types were gender specific. Supports of most patterns were usually higher for males than for females. Compared with secondary diagnosis codes, primary diagnosis codes can convey more stable results. Two disease sequences consisting of the same diagnoses but in different orders were usually with different supports. Our results suggest that the methods adopted can generate potentially interesting and clinically meaningful disease co-occurrence and sequence patterns, and identify disparities among various patient groups. These patterns could imply comorbidities and disease progressions.

  12. Draft genome sequence of CTX-M-type β-lactamase-producing Klebsiella quasipneumoniae subsp. similipneumoniae isolated from a Box turtle.

    PubMed

    Li, Chien-Feng; Tang, Hui-Ling; Chiou, Chien-Shun; Tung, Kwong-Chung; Lu, Min-Chi; Lai, Yi-Chyi

    2018-03-01

    Klebsiella spp. are regarded as major pathogens causing infections in humans and various animals. Here we report the draft genome sequence of a CTX-M-type β-lactamase-producing Klebsiella quasipneumoniae subsp. similipneumoniae strain CHKP0062 isolated from a Yellow-margined Box turtle. An Illumina-Solexa platform was used to sequence the genome of CHKP0062. Qualified reads were assembled de novo using Velvet. The draft genome was annotated by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). The resistome and virulome of the strain were investigated. A total of 5423 protein-coding sequences, 87 tRNAs, 24 rRNAs and 12 ncRNAs were identified in the 5 699 275-bp genome. CHKP0062 was assigned to sequence type ST2131 with the K-loci type as KL67. No virulence-associated genes were identified. However, numerous antimicrobial resistance genes were present in this strain. Plasmid contigs were assembled and revealed homology to the multidrug resistance plasmids pC15-K, pCTX-M3 and pKF3-94, with the carriage of the class A β-lactamase genes bla TEM-1b and bla CTX-M-3 . The genome sequence reported in this study will be useful for comparative genomic analysis regarding the dissemination of clinically important antibiotic resistance genes among Klebsiella spp. isolated from humans and animals. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.

  13. RNAcentral: A comprehensive database of non-coding RNA sequences

    DOE PAGES

    Williams, Kelly Porter; Lau, Britney Yan

    2016-10-28

    RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides a single entry point for accessing ncRNA sequences of all ncRNA types from all organisms. Since its launch in 2014, RNAcentral has integrated twelve new resources, taking the total number of collaborating database to 22, and began importing new types of data, such as modified nucleotides from MODOMICS and PDB. We created new species-specific identifiers that refer to unique RNA sequences within a context of single species. Furthermore, the website has been subject to continuous improvements focusing on text and sequence similaritymore » searches as well as genome browsing functionality.« less

  14. RNAcentral: A comprehensive database of non-coding RNA sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Williams, Kelly Porter; Lau, Britney Yan

    RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides a single entry point for accessing ncRNA sequences of all ncRNA types from all organisms. Since its launch in 2014, RNAcentral has integrated twelve new resources, taking the total number of collaborating database to 22, and began importing new types of data, such as modified nucleotides from MODOMICS and PDB. We created new species-specific identifiers that refer to unique RNA sequences within a context of single species. Furthermore, the website has been subject to continuous improvements focusing on text and sequence similaritymore » searches as well as genome browsing functionality.« less

  15. Application of MLST and Pilus Gene Sequence Comparisons to Investigate the Population Structures of Actinomyces naeslundii and Actinomyces oris

    PubMed Central

    Henssge, Uta; Do, Thuy; Gilbert, Steven C.; Cox, Steven; Clark, Douglas; Wickström, Claes; Ligtenberg, A. J. M.; Radford, David R.; Beighton, David

    2011-01-01

    Actinomyces naeslundii and Actinomyces oris are members of the oral biofilm. Their identification using 16S rRNA sequencing is problematic and better achieved by comparison of metG partial sequences. A. oris is more abundant and more frequently isolated than A. naeslundii. We used a multi-locus sequence typing approach to investigate the genotypic diversity of these species and assigned A. naeslundii (n = 37) and A. oris (n = 68) isolates to 32 and 68 sequence types (ST), respectively. Neighbor-joining and ClonalFrame dendrograms derived from the concatenated partial sequences of 7 house-keeping genes identified at least 4 significant subclusters within A. oris and 3 within A. naeslundii. The strain collection we had investigated was an under-representation of the total population since at least 3 STs composed of single strains may represent discrete clusters of strains not well represented in the collection. The integrity of these sub-clusters was supported by the sequence analysis of fimP and fimA, genes coding for the type 1 and 2 fimbriae, respectively. An A. naeslundii subcluster was identified with both fimA and fimP genes and these strains were able to bind to MUC7 and statherin while all other A. naeslundii strains possessed only fimA and did not bind to statherin. An A. oris subcluster harboured a fimA gene similar to that of Actinomyces odontolyticus but no detectable fimP failed to bind significantly to either MUC7 or statherin. These data are evidence of extensive genotypic and phenotypic diversity within the species A. oris and A. naeslundii but the status of the subclusters identified here will require genome comparisons before their phylogenic position can be unequivocally established. PMID:21738661

  16. Application of MLST and pilus gene sequence comparisons to investigate the population structures of Actinomyces naeslundii and Actinomyces oris.

    PubMed

    Henssge, Uta; Do, Thuy; Gilbert, Steven C; Cox, Steven; Clark, Douglas; Wickström, Claes; Ligtenberg, A J M; Radford, David R; Beighton, David

    2011-01-01

    Actinomyces naeslundii and Actinomyces oris are members of the oral biofilm. Their identification using 16S rRNA sequencing is problematic and better achieved by comparison of metG partial sequences. A. oris is more abundant and more frequently isolated than A. naeslundii. We used a multi-locus sequence typing approach to investigate the genotypic diversity of these species and assigned A. naeslundii (n = 37) and A. oris (n = 68) isolates to 32 and 68 sequence types (ST), respectively. Neighbor-joining and ClonalFrame dendrograms derived from the concatenated partial sequences of 7 house-keeping genes identified at least 4 significant subclusters within A. oris and 3 within A. naeslundii. The strain collection we had investigated was an under-representation of the total population since at least 3 STs composed of single strains may represent discrete clusters of strains not well represented in the collection. The integrity of these sub-clusters was supported by the sequence analysis of fimP and fimA, genes coding for the type 1 and 2 fimbriae, respectively. An A. naeslundii subcluster was identified with both fimA and fimP genes and these strains were able to bind to MUC7 and statherin while all other A. naeslundii strains possessed only fimA and did not bind to statherin. An A. oris subcluster harboured a fimA gene similar to that of Actinomyces odontolyticus but no detectable fimP failed to bind significantly to either MUC7 or statherin. These data are evidence of extensive genotypic and phenotypic diversity within the species A. oris and A. naeslundii but the status of the subclusters identified here will require genome comparisons before their phylogenic position can be unequivocally established.

  17. Core genome conservation of Staphylococcus haemolyticus limits sequence based population structure analysis.

    PubMed

    Cavanagh, Jorunn Pauline; Klingenberg, Claus; Hanssen, Anne-Merethe; Fredheim, Elizabeth Aarag; Francois, Patrice; Schrenzel, Jacques; Flægstad, Trond; Sollid, Johanna Ericson

    2012-06-01

    The notoriously multi-resistant Staphylococcus haemolyticus is an emerging pathogen causing serious infections in immunocompromised patients. Defining the population structure is important to detect outbreaks and spread of antimicrobial resistant clones. Currently, the standard typing technique is pulsed-field gel electrophoresis (PFGE). In this study we describe novel molecular typing schemes for S. haemolyticus using multi locus sequence typing (MLST) and multi locus variable number of tandem repeats (VNTR) analysis. Seven housekeeping genes (MLST) and five VNTR loci (MLVF) were selected for the novel typing schemes. A panel of 45 human and veterinary S. haemolyticus isolates was investigated. The collection had diverse PFGE patterns (38 PFGE types) and was sampled over a 20 year-period from eight countries. MLST resolved 17 sequence types (Simpsons index of diversity [SID]=0.877) and MLVF resolved 14 repeat types (SID=0.831). We found a low sequence diversity. Phylogenetic analysis clustered the isolates in three (MLST) and one (MLVF) clonal complexes, respectively. Taken together, neither the MLST nor the MLVF scheme was suitable to resolve the population structure of this S. haemolyticus collection. Future MLVF and MLST schemes will benefit from addition of more variable core genome sequences identified by comparing different fully sequenced S. haemolyticus genomes. Copyright © 2012 Elsevier B.V. All rights reserved.

  18. Occurrence of Diverse AbGRI1-Type Genomic Islands in Acinetobacter baumannii Global Clone 2 Isolates from South Korea.

    PubMed

    Kim, Dae Hun; Jung, Sook-In; Kwon, Ki Tae; Ko, Kwan Soo

    2017-02-01

    In this study, we analyzed the frequency of the AbGRI1-type genomic island (GI) and its association with genotypes. We obtained 130 Acinetobacter baumannii isolates causing bloodstream infections from patients in South Korea. Antimicrobial susceptibility testing and multilocus sequence typing were performed. The presence of AbGRI1-type GIs and their structures were determined by sequential PCR and sequencing. Ninety-eight isolates (75.3%) representing 14 sequence types (STs) belonged to clonal complex 208 (CC208), corresponding to global clone 2 (GC2). AbGRI1-type GIs interrupted the comM gene in 107 isolates (82.4%). Four types of GIs were identified: Tn6022 (50 isolates; 46.7%), AbaR4 (23 isolates; 21.5%), Tn6166 (10 isolates; 9.3%), and Tn6166/Tn2006 (24 isolates; 22.4%). In the 50 isolates with Tn6022, Tn2006 or Tn2008B, both containing ISAba1-bla OXA-23 , was present in sites other than GIs in 3 or 28 isolates, respectively. In the 10 isolates with Tn6166, Tn2008B was identified in one isolate. AbGRI1-type GIs were identified nearly exclusively in CC208 isolates, with the exception of nine non-CC208 isolates (AbaR4 in eight ST229 isolates and Tn6022 in one ST1244 isolate). Within CC208 isolates, there was evidence of frequent recombination events, in both housekeeping genes and AbGRI1-type GIs, contributing to genotype diversification and the emergence of carbapenem resistance. Copyright © 2017 American Society for Microbiology.

  19. The recognition and modification sites for the bacterial type I restriction systems KpnAI, StySEAI, StySENI and StySGI

    PubMed Central

    Kasarjian, Julie K. A.; Hidaka, Masumi; Horiuchi, Takashi; Iida, Masatake; Ryu, Junichi

    2004-01-01

    Using an in vivo plasmid transformation method, we have determined the DNA sequences recognized by the KpnAI, StySEAI, StySENI and StySGI R-M systems from Klebsiella oxytoca strain M5a1, Salmonella eastbourne, Salmonella enteritidis and Salmonella gelsenkirchen, respectively. These type I restriction-modification systems were originally identified using traditional phage assay, and described here is the plasmid transformation test and computer program used to determine their DNA recognition sequences. For this test, we constructed two sets of plasmids, pL and pE, that contain phage lambda and Escherichia coli K-12 chromosomal DNA fragments, respectively. Further, using the methylation sensitivities of various known type II restriction enzymes, we identified the target adenines for methylation (listed in bold italics below as A or T in case of the complementary strand). The recognition sequence and methylation sites are GAA(6N)TGCC (KpnAI), ACA(6N)TYCA (StySEAI), CGA(6N)TACC (StySENI) and TAAC(7N)RTCG (StySGI). These DNA recognition sequences all have a typical type I bipartite pattern and represent three novel specificities and one isoschizomer (StySENI). For confirmation, oligonucleotides containing each of the predicted sequences were synthesized, cloned into plasmid pMECA and transformed into each strain, resulting in a large reduction in efficiency of transformation (EOT). PMID:15199175

  20. Serotypes, antibiotic susceptibilities, and multi-locus sequence type profiles of Streptococcus agalactiae isolates circulating in Beijing, China.

    PubMed

    Wang, Ping; Tong, Jing-jing; Ma, Xiu-hua; Song, Feng-li; Fan, Ling; Guo, Cui-mei; Shi, Wei; Yu, Sang-jie; Yao, Kai-hu; Yang, Yong-hong

    2015-01-01

    To investigate the serotypes, antibiotic susceptibilities, and multi-locus sequence type (MLST) profiles of Streptococcus agalactiae (S. agalactiae) in Beijing to provide references for the prevention and treatment of S. agalactiae infections. All isolates were identified using the CAMP test and the latex-agglutination assay and serotyped using a Strep-B-Latex kit, after which they were assessed for antibiotic susceptibility, macrolide-resistance genes, and MLST profiles. In total, 56 S. agalactiae isolates were identified in 863 pregnant women (6.5%). Serotypes Ia, Ib, II, III, and V were identified, among which types III (32.1%), Ia (17.9%), Ib (16.1%), and V (14.3%) were the predominant serotypes. All isolates were susceptible to penicillin and ceftriaxone. The nonsusceptiblity rates measured for erythromycin, clarithromycin, azithromycin, telithromycin, clindamycin, tetracycline, and levofloxacin were 85.7%, 92.9%, 98.2%, 30.4%, 73.2%, 91%, and 39.3%, respectively. We identified 14 sequence types (STs) for the 56 isolates, among which ST19 (30.4%) was predominant. The rate of fluoroquinolone resistance was higher in serotype III than in the other serotypes. Among the 44 erythromycin-resistant isolates, 32 (72.7%) carried ermB. S. agalactiae isolates of the serotypes Ia, Ib, III, and V are common in Beijing. Among the S. agalactiae isolates, the macrolide and clindamycin resistance rates are extremely high. Most of the erythromycin-resistant isolates carry ermB.

  1. An outbreak of dengue virus (DENV) type 2 Cosmopolitan genotype in Israeli travellers returning from the Seychelles, April 2017.

    PubMed

    Lustig, Yaniv; Wolf, Dana; Halutz, Ora; Schwartz, Eli

    2017-06-29

    Dengue virus infection was diagnosed in six Israeli travellers returning from the Seychelles in April 2017. Phylogenetic analysis identified identical sequences belonging to the Cosmopolitan genotype of dengue virus type 2 in all samples sequenced, thus providing evidence for a probable dengue type 2 outbreak in the Seychelles. This report further demonstrates the role of travellers as sentinels for arboviral infections, especially in countries with limited diagnostic capabilities. This article is copyright of The Authors, 2017.

  2. Analysis of whole genome sequencing for the Escherichia coli O157:H7 typing phages.

    PubMed

    Cowley, Lauren A; Beckett, Stephen J; Chase-Topping, Margo; Perry, Neil; Dallman, Tim J; Gally, David L; Jenkins, Claire

    2015-04-08

    Shiga toxin producing Escherichia coli O157 can cause severe bloody diarrhea and haemolytic uraemic syndrome. Phage typing of E. coli O157 facilitates public health surveillance and outbreak investigations, certain phage types are more likely to occupy specific niches and are associated with specific age groups and disease severity. The aim of this study was to analyse the genome sequences of 16 (fourteen T4 and two T7) E. coli O157 typing phages and to determine the genes responsible for the subtle differences in phage type profiles. The typing phages were sequenced using paired-end Illumina sequencing at The Genome Analysis Centre and the Animal Health and Veterinary Laboratories Agency and bioinformatics programs including Velvet, Brig and Easyfig were used to analyse them. A two-way Euclidian cluster analysis highlighted the associations between groups of phage types and typing phages. The analysis showed that the T7 typing phages (9 and 10) differed by only three genes and that the T4 typing phages formed three distinct groups of similar genomic sequences: Group 1 (1, 8, 11, 12 and 15, 16), Group 2 (3, 6, 7 and 13) and Group 3 (2, 4, 5 and 14). The E. coli O157 phage typing scheme exhibited a significantly modular network linked to the genetic similarity of each group showing that these groups are specialised to infect a subset of phage types. Sequencing the typing phage has enabled us to identify the variable genes within each group and to determine how this corresponds to changes in phage type.

  3. Draft genome sequences of 14 swine associated LA-MRSA ST398 isolates from the U.S.

    USDA-ARS?s Scientific Manuscript database

    Livestock associated methicillin resistant Staphylococcus aureus (LA-MRSA) is part of the normal microbiota of swine. The initial and predominant swine associated LA-MRSA sequence type (ST) identified is ST398. Here, we present 14 draft genome sequence from LA-MRSA ST398 isolates found in the US....

  4. Genome sequence of an aflatoxigenic pathogen of Argentinian peanut, Aspergillus arachidicola

    USDA-ARS?s Scientific Manuscript database

    In this study we sequenced the genome of the A. arachidicola Type strain (CBS 117610) and found its genome size to be 38.9 Mb, and its number of predicted genes to be 12,091, which are values comparable to those in other sequenced Aspergilli. Of its predicted genes, 691 were identified as unique to ...

  5. Diversity of 16S rRNA genes of new Ehrlichia strains isolated from horses with clinical signs of Potomac horse fever.

    PubMed

    Wen, B; Rikihisa, Y; Fuerst, P A; Chaichanasiriwithaya, W

    1995-04-01

    Ehrlichia risticii is the causative agent of Potomac horse fever. Variations among the major antigens of different local E. risticii strains have been detected previously. To further assess genetic variability in this species or species complex, the sequences of the 16S rRNA genes of several isolates obtained from sick horses diagnosed as having Potomac horse fever were determined. The sequences of six isolates obtained from Ohio and three isolates obtained from Kentucky were amplified by PCR. Three groups of sequences were identified. The sequences of five of the Ohio isolates were identical to the sequence of the type strain of E. risticii, the Illinois strain. The sequence of one Ohio isolate, isolate 081, was unique; this sequence differed in 10 nucleotides from the sequence of the type strain (level of similarity, 99.3%). The sequences of the three Kentucky isolates were identical to each other, but differed by five bases from the sequence of the type strain (level of similarity, 99.6%). The levels of sequence similarity of isolate 081, the Kentucky isolates, and the type strain to the next most closely related Ehrlichia sp., Ehrlichia sennetsu, were 99.3, 99.2, and 99.2%, respectively. On the basis of the distinct antigenic profiles and the levels of 16S rRNA sequence divergence, isolate 081 is as divergent from the type strain of E. risticii as E. sennetsu is. Therefore, we suggest that strain 081 and the Kentucky isolates may represent two new distinct Ehrlichia species.

  6. Comparison of Sanger and next generation sequencing performance for genotyping Cryptosporidium isolates at the 18S rRNA and actin loci.

    PubMed

    Paparini, Andrea; Gofton, Alexander; Yang, Rongchang; White, Nicole; Bunce, Michael; Ryan, Una M

    2015-01-01

    Cryptosporidium is an important enteric pathogen that infects a wide range of humans and animals. Rapid and reliable detection and characterisation methods are essential for understanding the transmission dynamics of the parasite. Sanger sequencing, and high-throughput sequencing (HTS) on an Ion Torrent platform, were compared with each other for their sensitivity and accuracy in detecting and characterising 25 Cryptosporidium-positive human and animal faecal samples. Ion Torrent reads (n = 123,857) were obtained at both 18S rRNA and actin loci for 21 of the 25 samples. Of these, one isolate at the actin locus (Cattle 05) and three at the 18S rRNA locus (HTS 10, HTS 11 and HTS 12), suffered PCR drop-out (i.e. PCR failures) when using fusion-tagged PCR. Sanger sequences were obtained for both loci for 23 of the 25 samples and showed good agreement with Ion Torrent-based genotyping. Two samples both from pythons (SK 02 and SK 05) produced mixed 18S and actin chromatograms by Sanger sequencing but were clearly identified by Ion Torrent sequencing as C. muris. One isolate (SK 03) was typed as C. muris by Sanger sequencing but was identified as a mixed C. muris and C. tyzzeri infection by HTS. 18S rRNA Type B sequences were identified in 4/6 C. parvum isolates when deep sequenced but were undetected in Sanger sequencing. Sanger was cheaper than Ion Torrent when sequencing a small numbers of samples, but when larger numbers of samples are considered (n = 60), the costs were comparative. Fusion-tagged amplicon based approaches are a powerful way of approaching mixtures, the only draw-back being the loss of PCR efficiency on low-template samples when using primers coupled to MID tags and adaptors. Taken together these data show that HTS has excellent potential for revealing the "true" composition of species/types in a Cryptosporidium infection, but that HTS workflows need to be carefully developed to ensure sensitivity, accuracy and contamination are controlled. Copyright © 2015 Elsevier Inc. All rights reserved.

  7. Experimental and statistical post-validation of positive example EST sequences carrying peroxisome targeting signals type 1 (PTS1)

    PubMed Central

    Lingner, Thomas; Kataya, Amr R. A.; Reumann, Sigrun

    2012-01-01

    We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences.1 As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity.” Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals. PMID:22415050

  8. Experimental and statistical post-validation of positive example EST sequences carrying peroxisome targeting signals type 1 (PTS1).

    PubMed

    Lingner, Thomas; Kataya, Amr R A; Reumann, Sigrun

    2012-02-01

    We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity." Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals.

  9. Multilocus sequence typing (MLST) analysis of Propionibacterium acnes isolates from radical prostatectomy specimens.

    PubMed

    Mak, Tim N; Yu, Shu-Han; De Marzo, Angelo M; Brüggemann, Holger; Sfanos, Karen S

    2013-05-01

    Inflammation is commonly observed in radical prostatectomy specimens, and evidence suggests that inflammation may contribute to prostate carcinogenesis. Multiple microorganisms have been implicated in serving as a stimulus for prostatic inflammation. The pro-inflammatory anaerobe, Propionibacterium acnes, is ubiquitously found on human skin and is associated with the skin disease acne vulgaris. Recent studies have shown that P. acnes can be detected in prostatectomy specimens by bacterial culture or by culture-independent molecular techniques. Radical prostatectomy tissue samples were obtained from 30 prostate cancer patients and subject to both aerobic and anaerobic culture. Cultured species were identified by 16S rDNA gene sequencing. Propionibacterium acnes isolates were typed using multilocus sequence typing (MLST). Our study confirmed that P. acnes can be readily cultured from prostatectomy tissues (7 of 30 cases, 23%). In some cases, multiple isolates of P. acnes were cultured as well as other Propionibacterium species, such as P. granulosum and P. avidum. Overall, 9 of 30 cases (30%) were positive for Propionibacterium spp. MLST analyses identified eight different sequence types (STs) among prostate-derived P. acnes isolates. These STs belong to two clonal complexes, namely CC36 (type I-2) and CC53/60 (type II), or are CC53/60-related singletons. MLST typing results indicated that prostate-derived P. acnes isolates do not fall within the typical skin/acne STs, but rather are characteristic of STs associated with opportunistic infections and/or urethral flora. The MLST typing results argue against the likelihood that prostatectomy-derived P. acnes isolates represent contamination from skin flora. Copyright © 2012 Wiley Periodicals, Inc.

  10. Multilocus Sequence Typing of Cronobacter Strains Isolated from Retail Foods and Environmental Samples.

    PubMed

    Killer, Jiří; Skřivanová, Eva; Hochel, Igor; Marounek, Milan

    2015-06-01

    Cronobacter spp. are bacterial pathogens that affect children and immunocompromised adults. In this study, we used multilocus sequence typing (MLST) to determine sequence types (STs) in 11 Cronobacter spp. strains isolated from retail foods, 29 strains from dust samples obtained from vacuum cleaners, and 4 clinical isolates. Using biochemical tests, species-specific polymerase chain reaction, and MLST analysis, 36 strains were identified as Cronobacter sakazakii, and 6 were identified as Cronobacter malonaticus. In addition, one strain that originated from retail food and one from a dust sample from a vacuum cleaner were identified on the basis of MLST analysis as Cronobacter dublinensis and Cronobacter turicensis, respectively. Cronobacter spp. strains isolated from the retail foods were assigned to eight different MLST sequence types, seven of which were newly identified. The strains isolated from the dust samples were assigned to 7 known STs and 14 unknown STs. Three clinical isolates and one household dust isolate were assigned to ST4, which is the predominant ST associated with neonatal meningitis. One clinical isolate was classified based on MLST analysis as Cronobacter malonaticus and belonged to an as-yet-unknown ST. Three strains isolated from the household dust samples were assigned to ST1, which is another clinically significant ST. It can be concluded that Cronobacter spp. strains of different origin are genetically quite variable. The recovery of C. sakazakii strains belonging to ST1 and ST4 from the dust samples suggests the possibility that contamination could occur during food preparation. All of the novel STs and alleles for C. sakazakii, C. malonaticus, C. dublinensis, and C. turicensis determined in this study were deposited in the Cronobacter MLST database available online ( http://pubmlst.org/cronobacter/).

  11. Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections.

    PubMed

    van der Weele, Pascal; Meijer, Chris J L M; King, Audrey J

    2017-10-01

    Human papillomavirus (HPV) is a strongly conserved DNA virus, high-risk types of which can cause cervical cancer in persistent infections. The most common type found in HPV-attributable cancer is HPV16, which can be subdivided into four lineages (A to D) with different carcinogenic properties. Studies have shown HPV16 sequence diversity in different geographical areas, but only limited information is available regarding HPV16 diversity within a population, especially at the whole-genome level. We analyzed HPV16 major variant diversity and conservation in persistent infections and performed a single nucleotide polymorphism (SNP) comparison between persistent and clearing infections. Materials were obtained in the Netherlands from a cohort study with longitudinal follow-up for up to 3 years. Our analysis shows a remarkably large variant diversity in the population. Whole-genome sequences were obtained for 57 persistent and 59 clearing HPV16 infections, resulting in 109 unique variants. Interestingly, persistent infections were completely conserved through time. One reinfection event was identified where the initial and follow-up samples clustered differently. Non-A1/A2 variants seemed to clear preferentially ( P = 0.02). Our analysis shows that population-wide HPV16 sequence diversity is very large. In persistent infections, the HPV16 sequence was fully conserved. Sequencing can identify HPV16 reinfections, although occurrence is rare. SNP comparison identified no strongly acting effect of the viral genome affecting HPV16 infection clearance or persistence in up to 3 years of follow-up. These findings suggest the progression of an early HPV16 infection could be host related. IMPORTANCE Human papillomavirus 16 (HPV16) is the predominant type found in cervical cancer. Progression of initial infection to cervical cancer has been linked to sequence properties; however, knowledge of variants circulating in European populations, especially with longitudinal follow-up, is limited. By sequencing a number of infections with known follow-up for up to 3 years, we gained initial insights into the genetic diversity of HPV16 and the effects of the viral genome on the persistence of infections. A SNP comparison between sequences obtained from clearing and persistent infections did not identify strongly acting DNA variations responsible for these infection outcomes. In addition, we identified an HPV16 reinfection event where sequencing of initial and follow-up samples showed different HPV16 variants. Based on conventional genotyping, this infection would incorrectly be considered a persistent HPV16 infection. In the context of vaccine efficacy and monitoring studies, such infections could potentially cause reduced reported efficacy or efficiency. Copyright © 2017 van der Weele et al.

  12. Identification of Fasciola species based on mitochondrial and nuclear DNA reveals the co-existence of intermediate Fasciola and Fasciola gigantica in Thailand.

    PubMed

    Wannasan, Anchalee; Khositharattanakool, Pathamet; Chaiwong, Prasong; Piangjai, Somsak; Uparanukraw, Pichart; Morakote, Nimit

    2014-11-01

    Molecular techniques were used to identify Fasciola species collected from Chiang Mai Thailand. Morphometrically, 65 stained and 45 fresh worms collected from cattle suggested the possible occurrence of both F. gigantica and F. hepatica. Twenty-two worms comprising 15 from cattle and 7 from human patients, were identified subsequently based on three genetic markers: mitochondrial nicotinamide adenine dinucleotide dehydrogenase subunit 1 (nad1), mitochondrial cytochrome c oxidase subunit 1 (cox1) and nuclear ribosomal internal transcribed spacer 2 (ITS2). All of them presented the F. gigantica type in maternally inherited mitochondrial sequences (nad1 and cox1), with six types in each sequence (FgNDI-CM1 to FgNDI-CM6 and FgCOI-CM1 to FgCOI-CM6, respectively). Remarkably, the predominant nad1 type, FgNDI-CM6, was identical to that of aspermic Fasciola sp. formerly reported from Thailand, Japan, Korea, China, Vietnam, and Myanmar. ITS2 sequences were analyzed successfully in 20 worms. Fifteen worms showed the F. gigantica type and five (including one worm from a patient) had mixed ITS2 sequences of both F. gigantica and F. hepatica in the same worms, with additional heterogeneity within both ITS2 types. This study revealed the intermediate form of Fasciola coexisting with F. gigantica for the first time in Thailand.

  13. Legionella oakridgensis ATCC 33761 genome sequence and phenotypic characterization reveals its replication capacity in amoebae.

    PubMed

    Brzuszkiewicz, Elzbieta; Schulz, Tino; Rydzewski, Kerstin; Daniel, Rolf; Gillmaier, Nadine; Dittmann, Christine; Holland, Gudrun; Schunder, Eva; Lautner, Monika; Eisenreich, Wolfgang; Lück, Christian; Heuner, Klaus

    2013-12-01

    Legionella oakridgensis is able to cause Legionnaires' disease, but is less virulent compared to L. pneumophila strains and very rarely associated with human disease. L. oakridgensis is the only species of the family legionellae which is able to grow on media without additional cysteine. In contrast to earlier publications, we found that L. oakridgensis is able to multiply in amoebae. We sequenced the genome of L. oakridgensis type strain OR-10 (ATCC 33761). The genome is smaller than the other yet sequenced Legionella genomes and has a higher G+C-content of 40.9%. L. oakridgensis lacks a flagellum and it also lacks all genes of the flagellar regulon except of the alternative sigma-28 factor FliA and the anti-sigma-28 factor FlgM. Genes encoding structural components of type I, type II, type IV Lvh and type IV Dot/Icm, Sec- and Tat-secretion systems could be identified. Only a limited set of Dot/Icm effector proteins have been recognized within the genome sequence of L. oakridgensis. Like in L. pneumophila strains, various proteins with eukaryotic motifs and eukaryote-like proteins were detected. We could demonstrate that the Dot/Icm system is essential for intracellular replication of L. oakridgensis. Furthermore, we identified new putative virulence factors of Legionella. Copyright © 2013 Elsevier GmbH. All rights reserved.

  14. Methicillin-Resistant and -Susceptible Staphylococcus aureus Sequence Type 398 in Pigs and Humans

    PubMed Central

    van Belkum, Alex; Peeters, Justine K.; van Leeuwen, Willem B.; van Duijkeren, Engeline; Huijsdens, Xander W.; Spalburg, Emile; de Neeling, Albert J.; Verbrugh, Henri A.

    2008-01-01

    Methicillin-resistant Staphylococcus aureus sequence type 398 (ST398 MRSA) was identified in Dutch pigs and pig farmers. ST398 methicillin-susceptible S. aureus circulates among humans at low frequency (0.2%) but was isolated in 3 human cases of bacteremia (2.1%; p = 0.026). Although its natural host is probably porcine, ST398 MRSA likely causes infections in humans. PMID:18325267

  15. Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population.

    PubMed

    Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao

    2018-05-01

    Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.

  16. Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety 'Amrapali' (Mangifera indica L.).

    PubMed

    Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

    2016-01-01

    Mango (Mangifera indica L.) is called "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango.

  17. Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety ‘Amrapali’ (Mangifera indica L.)

    PubMed Central

    Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

    2016-01-01

    Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango. PMID:27736892

  18. Evaluation of an automated repetitive sequence-based PCR system for subtyping Enterobacter sakazakii.

    PubMed

    Healy, B; Mullane, N; Collin, V; Mailler, S; Iversen, C; Chatellier, S; Storrs, M; Fanning, S

    2008-07-01

    Enterobacter sakazakii is regarded as a ubiquitous organism that can be isolated from a wide range of foods and environments. Infection in at-risk infants has been epidemiologically linked to the consumption of contaminated powdered infant formula. Preventing the dissemination of this pathogen in a powdered infant formula manufacturing facility is an important step in ensuring consumer confidence in a given brand together with the protection of the health status of a vulnerable population. In this study we report the application of a repetitive sequence-based PCR typing method to subtype a previously well-characterized collection of E. sakazakii isolates of diverse origin. While both methods successfully discriminated between the collection of isolates, repetitive sequence-based PCR identified 65 types, whereas pulsed-field gel electrophoresis identified 110 types showing > or =95% similarity. The method was quick and easy to perform, and our data demonstrated the utility and value of this approach to monitor in-process contamination, which could potentially contribute to a reduction in the transmission of E. sakazakii.

  19. Development and evaluation of a multi-locus sequence typing scheme for Mycoplasma synoviae.

    PubMed

    Dijkman, R; Feberwee, A; Landman, W J M

    2016-08-01

    Reproducible molecular Mycoplasma synoviae typing techniques with sufficient discriminatory power may help to expand knowledge on its epidemiology and contribute to the improvement of control and eradication programmes of this mycoplasma species. The present study describes the development and validation of a novel multi-locus sequence typing (MLST) scheme for M. synoviae. Thirteen M. synoviae isolates originating from different poultry categories, farms and lesions, were subjected to whole genome sequencing. Their sequences were compared to that of M. synoviae reference strain MS53. A high number of single nucleotide polymorphisms (SNPs) indicating considerable genetic diversity were identified. SNPs were present in over 40 putative target genes for MLST of which five target genes were selected (nanA, uvrA, lepA, ruvB and ugpA) for the MLST scheme. This scheme was evaluated analysing 209 M. synoviae samples from different countries, categories of poultry, farms and lesions. Eleven clonal clusters and 76 different sequence types (STs) were obtained. Clustering occurred following geographical origin, supporting the hypothesis of regional population evolution. M. synoviae samples obtained from epidemiologically linked outbreaks often harboured the same ST. In contrast, multiple M. synoviae lineages were found in samples originating from swollen joints or oviducts from hens that produce eggs with eggshell apex abnormalities indicating that further research is needed to identify the genetic factors of M. synoviae that may explain its variations in tissue tropism and disease inducing potential. Furthermore, MLST proved to have a higher discriminatory power compared to variable lipoprotein and haemagglutinin A typing, which generated 50 different genotypes on the same database.

  20. KpnBI is the prototype of a new family (IE) of bacterial type I restriction-modification system

    PubMed Central

    Chin, V.; Valinluck, V.; Magaki, S.; Ryu, J.

    2004-01-01

    KpnBI is a restriction-modification (R-M) system recognized in the GM236 strain of Klebsiella pneumoniae. Here, the KpnBI modification genes were cloned into a plasmid using a modification expression screening method. The modification genes that consist of both hsdM (2631 bp) and hsdS (1344 bp) genes were identified on an 8.2 kb EcoRI chromosomal fragment. These two genes overlap by one base and share the same promoter located upstream of the hsdM gene. Using recently developed plasmid R-M tests and a computer program RM Search, the DNA recognition sequence for the KpnBI enzymes was identified as a new 8 nt sequence containing one degenerate base with a 6 nt spacer, CAAANNNNNNRTCA. From Dam methylation and HindIII sensitivity tests, the methylation loci were predicted to be the italicized third adenine in the 5′ specific region and the adenine opposite the italicized thymine in the 3′ specific region. Combined with previous sequence data for hsdR, we concluded that the KpnBI system is a typical type I R-M system. The deduced amino acid sequences of the three subunits of the KpnBI system show only limited homologies (25 to 33% identity) at best, to the four previously categorized type I families (IA, IB, IC, and ID). Furthermore, their identity scores to other uncharacterized putative genome type I sequences were 53% at maximum. Therefore, we propose that KpnBI is the prototype of a new ‘type IE’ family. PMID:15475385

  1. Identification and functional activity of a staphylocoagulase type XI variant originating from staphylococcal food poisoning isolates.

    PubMed

    Suzuki, Y; Matsushita, S; Kubota, H; Kobayashi, M; Murauchi, K; Higuchi, Y; Kato, R; Hirai, A; Sadamasu, K

    2016-09-01

    Staphylocoagulase, an extracellular protein secreted by Staphylococcus aureus, has been used as an epidemiological marker. At least 12 serotypes and 24 genotypes subdivided on the basis of nucleotide sequence have been reported to date. In this study, we identified a novel staphylocoagulase nucleotide sequence, coa310, from staphylococcal food poisoning isolates that had the ability to coagulate plasma, but could not be typed using the conventional method. The protein encoded by coa310 contained the six fundamental conserved domains of staphylocoagulase. The full-length nucleotide sequence of coa310 shared the highest similarity (77·5%) with that of staphylocoagulase-type (SCT) XIa. The sequence of the D1 region, which would be responsible for the determination of SCT, shared the highest similarity (91·8%) with that of SCT XIa. These results suggest that coa310 is a novel variant of SCT XI. Moreover, we demonstrated that coa310 encodes a functioning coagulase, by confirming the coagulating activity of the recombinant protein expressed from coa310. This is the first study to directly demonstrate that Coa310, a putative SCT XI, has coagulating activity. These findings may be useful for the improvement of the staphylocoagulase-typing method, including serotyping and genotyping. This is the first study to identify a novel variant of staphylocoagulase type XI based on its nucleotide sequence and to demonstrate coagulating activity in the variant using a recombinant protein. Elucidation of the variety of staphylocoagulases will provide suggestions for further improvement of the staphylocoagulase-typing method and contribute to our understanding of the epidemiologic characterization of Staphylococcus aureus. © 2016 The Society for Applied Microbiology.

  2. Identification of four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, from an East African population by high-resolution sequence-based typing.

    PubMed

    Luo, M; Mao, X; Plummer, F A

    2005-02-01

    We report here four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, identified from an East African population during sequence-based HLA-B typing. The novel alleles were confirmed by sequencing two separate polymerase chain reaction products, and by molecular cloning and sequencing multiple clones. B*1590 is identical to B*1510 at exon 2 and exon 3, except for a difference (GCCGTC) at codon 158. Sequence differences at codon 152 (GAGGTG) and codon 167 (TGGTCG) differentiate B*1591 from B*1503 at exon 3. B*2726 is identical to B*2708 at exon 2 and exon 3, except for a difference (AAGCAG) at codon 70. B*4705 was identified in three Kenyan women. The allele is identical to B*47010101/02 at exon 2 and exon 3, except for differences at codon 97 (AGGAAT) and codon 99 (TTTTAT). These new alleles have been named by the WHO Nomenclature Committee. Identification of these novel HLA-B alleles reflects the genetic diversity of this East African population.

  3. Analyzing Somatic Genome Rearrangements in Human Cancers by Using Whole-Exome Sequencing | Office of Cancer Genomics

    Cancer.gov

    Although exome sequencing data are generated primarily to detect single-nucleotide variants and indels, they can also be used to identify a subset of genomic rearrangements whose breakpoints are located in or near exons. Using >4,600 tumor and normal pairs across 15 cancer types, we identified over 9,000 high confidence somatic rearrangements, including a large number of gene fusions.

  4. Genetic Analyses of the Internal Transcribed Spacer Sequences Suggest Introgression and Duplication in the Medicinal Mushroom Agaricus subrufescens

    PubMed Central

    Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D.; Callac, Philippe

    2016-01-01

    The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated. PMID:27228131

  5. The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.

    PubMed

    Motamayor, Juan C; Mockaitis, Keithanne; Schmutz, Jeremy; Haiminen, Niina; Livingstone, Donald; Cornejo, Omar; Findley, Seth D; Zheng, Ping; Utro, Filippo; Royaert, Stefan; Saski, Christopher; Jenkins, Jerry; Podicheti, Ram; Zhao, Meixia; Scheffler, Brian E; Stack, Joseph C; Feltus, Frank A; Mustiga, Guiliana M; Amores, Freddy; Phillips, Wilbert; Marelli, Jean Philippe; May, Gregory D; Shapiro, Howard; Ma, Jianxin; Bustamante, Carlos D; Schnell, Raymond J; Main, Dorrie; Gilbert, Don; Parida, Laxmi; Kuhn, David N

    2013-06-03

    Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. We describe the sequencing and assembly of the genome of Theobroma cacao L. cultivar Matina 1-6. The genome of the Matina 1-6 cultivar is 445 Mbp, which is significantly larger than a sequenced Criollo cultivar, and more typical of other cultivars. The chromosome-scale assembly, version 1.1, contains 711 scaffolds covering 346.0 Mbp, with a contig N50 of 84.4 kbp, a scaffold N50 of 34.4 Mbp, and an evidence-based gene set of 29,408 loci. Version 1.1 has 10x the scaffold N50 and 4x the contig N50 as Criollo, and includes 111 Mb more anchored sequence. The version 1.1 assembly has 4.4% gap sequence, while Criollo has 10.9%. Through a combination of haplotype, association mapping and gene expression analyses, we leverage this robust reference genome to identify a promising candidate gene responsible for pod color variation. We demonstrate that green/red pod color in cacao is likely regulated by the R2R3 MYB transcription factor TcMYB113, homologs of which determine pigmentation in Rosaceae, Solanaceae, and Brassicaceae. One SNP within the target site for a highly conserved trans-acting siRNA in dicots, found within TcMYB113, seems to affect transcript levels of this gene and therefore pod color variation. We report a high-quality sequence and annotation of Theobroma cacao L. and demonstrate its utility in identifying candidate genes regulating traits.

  6. The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color

    PubMed Central

    2013-01-01

    Background Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. Results We describe the sequencing and assembly of the genome of Theobroma cacao L. cultivar Matina 1-6. The genome of the Matina 1-6 cultivar is 445 Mbp, which is significantly larger than a sequenced Criollo cultivar, and more typical of other cultivars. The chromosome-scale assembly, version 1.1, contains 711 scaffolds covering 346.0 Mbp, with a contig N50 of 84.4 kbp, a scaffold N50 of 34.4 Mbp, and an evidence-based gene set of 29,408 loci. Version 1.1 has 10x the scaffold N50 and 4x the contig N50 as Criollo, and includes 111 Mb more anchored sequence. The version 1.1 assembly has 4.4% gap sequence, while Criollo has 10.9%. Through a combination of haplotype, association mapping and gene expression analyses, we leverage this robust reference genome to identify a promising candidate gene responsible for pod color variation. We demonstrate that green/red pod color in cacao is likely regulated by the R2R3 MYB transcription factor TcMYB113, homologs of which determine pigmentation in Rosaceae, Solanaceae, and Brassicaceae. One SNP within the target site for a highly conserved trans-acting siRNA in dicots, found within TcMYB113, seems to affect transcript levels of this gene and therefore pod color variation. Conclusions We report a high-quality sequence and annotation of Theobroma cacao L. and demonstrate its utility in identifying candidate genes regulating traits. PMID:23731509

  7. 5' diversity of human hepatic PXR (NR1I2) transcripts and identification of the major transcription initiation site.

    PubMed

    Kurose, Kouichi; Koyano, Satoru; Ikeda, Shinobu; Tohkin, Masahiro; Hasegawa, Ryuichi; Sawada, Jun-Ichi

    2005-05-01

    The human pregnane X receptor (PXR) is a crucial regulator of the genes encoding several major cytochrome P450 enzymes and transporters, such as CYP3A4 and MDR1, but its own transcriptional regulation remains unclear. To elucidate the transcriptional mechanisms of human PXR gene, we first endeavored to identify the transcription initiation site of human PXR using 5'-RACE. Five types of 5'-variable transcripts (a, b, c, d, and e) with common exon 2 sequence were found, and comparison of these sequences with the genomic sequence suggested that their 5' diversity is derived from initiation by alternative promoters and alternative splicing. None of the exons found in our study contain any new in-frame coding regions. Newly identified introns IVS-a and IVS-b were found to have CT-AC splice sites that do not follow the GT-AG rule of conventional donor and acceptor splice sites. Of the five types of 5' variable transcripts identified, RT-PCR showed that type-a was the major transcript type. Four transcription initiation sites (A-D) for type-a transcript were identified by 5'-RACE using GeneRacer RACE Ready cDNA (human liver) constructed by the oligo-capping method. Putative TATA boxes were located approximately 30 bp upstream from the transcriptional start sites of the major transcript (C) and the longest minor transcript (A) expressed in the human liver. These results indicate that the initiation of transcription of human PXR is more complex than previously reported.

  8. TaALMT1 promoter sequence compositions, acid tolerance, and Al tolerance in wheat cultivars and landraces from Sichuan in China.

    PubMed

    Han, C; Dai, S F; Liu, D C; Pu, Z J; Wei, Y M; Zheng, Y L; Wen, D J; Zhao, L; Yan, Z H

    2013-11-18

    Previous genetic studies on wheat from various sources have indicated that aluminum (Al) tolerance may have originated independently in USA, Brazil, and China. Here, TaALMT1 promoter sequences of 92 landraces and cultivars from Sichuan, China, were sequenced. Five promoter types (I', II, III, IV, and V) were observed in 39 cultivars, and only three promoter types (I, II, and III) were observed in 53 landraces. Among the wheat collections worldwide, only the Chinese Spring (CS) landrace native to Sichuan, China, carried the TaALMT1 promoter type III. Besides CS, two other Sichuan-bred landraces and six cultivars with TaALMT1 promoter type III were identified in this study. In the phylogenetic tree constructed based on the TaALMT1 promoter sequences, type III formed a separate branch, which was supported by a high bootstrap value. It is likely that TaALMT1 promoter type III originated from Sichuan-bred wheat landraces of China. In addition, the landraces with promoter type I showed the lowest Al tolerance among all landraces and cultivars. Furthermore, the cultivars with promoter type IV showed better Al tolerance than landraces with promoter type II. A comparison of acid tolerance and Al tolerance between cultivars and landraces showed that the landraces had better acid tolerance than the cultivars, whereas the cultivars showed better Al tolerance than the landraces. Moreover, significant difference in Al tolerance was also observed between the cultivars raised by the National Ministry of Agriculture and by Sichuan Province. Among the landraces from different regions, those from the East showed better acid tolerance and Al tolerance than those from the South and West of Sichuan. Additional Al-tolerant and acid-tolerant wheat lines were also identified.

  9. Cloning and Sequencing of Defective Particles Derived from the Autonomous Parvovirus Minute Virus of Mice for the Construction of Vectors with Minimal cis-Acting Sequences

    PubMed Central

    Clément, Nathalie; Avalosse, Bernard; El Bakkouri, Karim; Velu, Thierry; Brandenburger, Annick

    2001-01-01

    The production of wild-type-free stocks of recombinant parvovirus minute virus of mice [MVM(p)] is difficult due to the presence of homologous sequences in vector and helper genomes that cannot easily be eliminated from the overlapping coding sequences. We have therefore cloned and sequenced spontaneously occurring defective particles of MVM(p) with very small genomes to identify the minimal cis-acting sequences required for DNA amplification and virus production. One of them has lost all capsid-coding sequences but is still able to replicate in permissive cells when nonstructural proteins are provided in trans by a helper plasmid. Vectors derived from this particle produce stocks with no detectable wild-type MVM after cotransfection with new, matched, helper plasmids that present no homology downstream from the transgene. PMID:11152501

  10. Serotypes, Antibiotic Susceptibilities, and Multi-Locus Sequence Type Profiles of Streptococcus agalactiae Isolates Circulating in Beijing, China

    PubMed Central

    Ma, Xiu-hua; Song, Feng-li; Fan, Ling; Guo, Cui-mei; Shi, Wei; Yu, Sang-jie; Yao, Kai-hu; Yang, Yong-hong

    2015-01-01

    Background To investigate the serotypes, antibiotic susceptibilities, and multi-locus sequence type (MLST) profiles of Streptococcus agalactiae (S. agalactiae) in Beijing to provide references for the prevention and treatment of S. agalactiae infections. Methods All isolates were identified using the CAMP test and the latex-agglutination assay and serotyped using a Strep-B-Latex kit, after which they were assessed for antibiotic susceptibility, macrolide-resistance genes, and MLST profiles. Results In total, 56 S. agalactiae isolates were identified in 863 pregnant women (6.5%). Serotypes Ia, Ib, II, III, and V were identified, among which types III (32.1%), Ia (17.9%), Ib (16.1%), and V (14.3%) were the predominant serotypes. All isolates were susceptible to penicillin and ceftriaxone. The nonsusceptiblity rates measured for erythromycin, clarithromycin, azithromycin, telithromycin, clindamycin, tetracycline, and levofloxacin were 85.7%, 92.9%, 98.2%, 30.4%, 73.2%, 91%, and 39.3%, respectively. We identified 14 sequence types (STs) for the 56 isolates, among which ST19 (30.4%) was predominant. The rate of fluoroquinolone resistance was higher in serotype III than in the other serotypes. Among the 44 erythromycin-resistant isolates, 32 (72.7%) carried ermB. Conclusion S. agalactiae isolates of the serotypes Ia, Ib, III, and V are common in Beijing. Among the S. agalactiae isolates, the macrolide and clindamycin resistance rates are extremely high. Most of the erythromycin-resistant isolates carry ermB. PMID:25781346

  11. Use of Whole-Genome Phylogeny and Comparisons for Development of a Multiplex PCR Assay To Identify Sequence Type 36 Vibrio parahaemolyticus.

    PubMed

    Whistler, Cheryl A; Hall, Jeffrey A; Xu, Feng; Ilyas, Saba; Siwakoti, Puskar; Cooper, Vaughn S; Jones, Stephen H

    2015-06-01

    Vibrio parahaemolyticus sequence type 36 (ST36) strains that are native to the Pacific Ocean have recently caused multistate outbreaks of gastroenteritis linked to shellfish harvested from the Atlantic Ocean. Whole-genome comparisons of 295 genomes of V. parahaemolyticus, including several traced to northeastern U.S. sources, were used to identify diagnostic loci, one putatively encoding an endonuclease (prp), and two others potentially conferring O-antigenic properties (cps and flp). The combination of all three loci was present in only one clade of closely related strains of ST36, ST59, and one additional unknown sequence type. However, each locus was also identified outside this clade, with prp and flp occurring in only two nonclade isolates and cps in four. Based on the distribution of these loci in sequenced genomes, prp identified clade strains with >99% accuracy, but the addition of one more locus increased accuracy to 100%. Oligonucleotide primers targeting prp and cps were combined in a multiplex PCR method that defines species using the tlh locus and determines the presence of both the tdh and trh hemolysin-encoding genes, which are also present in ST36. Application of the method in vitro to a collection of 94 clinical isolates collected over a 4-year period in three northeastern U.S. states and 87 environmental isolates revealed that the prp and cps amplicons were detected only in clinical isolates identified as belonging to the ST36 clade and in no environmental isolates from the region. The assay should improve detection and surveillance, thereby reducing infections. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  12. Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.

    PubMed

    Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

    2012-02-01

    The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.

  13. Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes

    PubMed Central

    Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

    2012-01-01

    The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information. PMID:22384404

  14. Barcode Identifiers as a Practical Tool for Reliable Species Assignment of Medically Important Black Yeast Species

    PubMed Central

    Heinrichs, Guido; de Hoog, G. Sybren

    2012-01-01

    Herpotrichiellaceous black yeasts and relatives comprise severe pathogens flanked by nonpathogenic environmental siblings. Reliable identification by conventional methods is notoriously difficult. Molecular identification is hampered by the sequence variability in the internal transcribed spacer (ITS) domain caused by difficult-to-sequence homopolymeric regions and by poor taxonomic attribution of sequences deposited in GenBank. Here, we present a potential solution using short barcode identifiers (27 to 50 bp) based on ITS2 ribosomal DNA (rDNA), which allows unambiguous definition of species-specific fragments. Starting from proven sequences of ex-type and authentic strains, we were able to describe 103 identifiers. Multiple BLAST searches of these proposed barcode identifiers in GenBank revealed uniqueness for 100 taxonomic entities, whereas the three remaining identifiers each matched with two entities, but the species of these identifiers could easily be discriminated by differences in the remaining ITS regions. Using the proposed barcode identifiers, a 4.1-fold increase of 100% matches in GenBank was achieved in comparison to the classical approach using the complete ITS sequences. The proposed barcode identifiers will be made accessible for the diagnostic laboratory in a permanently updated online database, thereby providing a highly practical, reliable, and cost-effective tool for identification of clinically important black yeasts and relatives. PMID:22785187

  15. Sequence Segmentation with changeptGUI.

    PubMed

    Tasker, Edward; Keith, Jonathan M

    2017-01-01

    Many biological sequences have a segmental structure that can provide valuable clues to their content, structure, and function. The program changept is a tool for investigating the segmental structure of a sequence, and can also be applied to multiple sequences in parallel to identify a common segmental structure, thus providing a method for integrating multiple data types to identify functional elements in genomes. In the previous edition of this book, a command line interface for changept is described. Here we present a graphical user interface for this package, called changeptGUI. This interface also includes tools for pre- and post-processing of data and results to facilitate investigation of the number and characteristics of segment classes.

  16. Structure and function of neonatal social communication in a genetic mouse model of autism.

    PubMed

    Takahashi, T; Okabe, S; Broin, P Ó; Nishi, A; Ye, K; Beckert, M V; Izumi, T; Machida, A; Kang, G; Abe, S; Pena, J L; Golden, A; Kikusui, T; Hiroi, N

    2016-09-01

    A critical step toward understanding autism spectrum disorder (ASD) is to identify both genetic and environmental risk factors. A number of rare copy number variants (CNVs) have emerged as robust genetic risk factors for ASD, but not all CNV carriers exhibit ASD and the severity of ASD symptoms varies among CNV carriers. Although evidence exists that various environmental factors modulate symptomatic severity, the precise mechanisms by which these factors determine the ultimate severity of ASD are still poorly understood. Here, using a mouse heterozygous for Tbx1 (a gene encoded in 22q11.2 CNV), we demonstrate that a genetically triggered neonatal phenotype in vocalization generates a negative environmental loop in pup-mother social communication. Wild-type pups used individually diverse sequences of simple and complicated call types, but heterozygous pups used individually invariable call sequences with less complicated call types. When played back, representative wild-type call sequences elicited maternal approach, but heterozygous call sequences were ineffective. When the representative wild-type call sequences were randomized, they were ineffective in eliciting vigorous maternal approach behavior. These data demonstrate that an ASD risk gene alters the neonatal call sequence of its carriers and this pup phenotype in turn diminishes maternal care through atypical social communication. Thus, an ASD risk gene induces, through atypical neonatal call sequences, less than optimal maternal care as a negative neonatal environmental factor.

  17. Structure and function of neonatal social communication in a genetic mouse model of autism

    PubMed Central

    Takahashi, Tomohisa; Okabe, Shota; Ó Broin, Pilib; Nishi, Akira; Ye, Kenny; Beckert, Michael V.; Izumi, Takeshi; Machida, Akihiro; Kang, Gina; Abe, Seiji; Pena, Jose L.; Golden, Aaron; Kikusui, Takefumi; Hiroi, Noboru

    2015-01-01

    A critical step toward understanding autism spectrum disorder (ASD) is to identify both genetic and environmental risk factors. A number of rare copy number variants (CNVs) have emerged as robust genetic risk factors for ASD, but not all CNV carriers exhibit ASD and the severity of ASD symptoms varies among CNV carriers. Although evidence exists that various environmental factors modulate symptomatic severity, the precise mechanisms by which these factors determine the ultimate severity of ASD are still poorly understood. Here, using a mouse heterozygous for Tbx1 (a gene encoded in 22q11.2 CNV), we demonstrate that a genetically-triggered neonatal phenotype in vocalization generates a negative environmental loop in pup-mother social communication. Wild-type pups used individually diverse sequences of simple and complicated call types, but heterozygous pups used individually invariable call sequences with less complicated call types. When played back, representative wild-type call sequences elicited maternal approach, but heterozygous call sequences were ineffective. When the representative wild-type call sequences were randomized, they were ineffective in eliciting vigorous maternal approach behavior. These data demonstrate that an ASD risk gene alters the neonatal call sequence of its carriers and this pup phenotype in turn diminishes maternal care through atypical social communication. Thus, an ASD risk gene induces, through atypical neonatal call sequences, less than optimal maternal care as a negative neonatal environmental factor. PMID:26666205

  18. A RESTful application programming interface for the PubMLST molecular typing and genome databases

    PubMed Central

    Bray, James E.; Maiden, Martin C. J.

    2017-01-01

    Abstract Molecular typing is used to differentiate microorganisms at the subspecies or strain level for epidemiological investigations, infection control, public health and environmental sampling. DNA sequence-based typing methods require authoritative databases that link sequence variants to nomenclature in order to facilitate communication and comparison of identified types in national or global settings. The PubMLST website (https://pubmlst.org/) fulfils this role for over a hundred microorganisms for which it hosts curated molecular sequence typing data, providing sequence and allelic profile definitions for multi-locus sequence typing (MLST) and single-gene typing approaches. In recent years, these have expanded to cover the whole genome with schemes such as core genome MLST (cgMLST) and whole genome MLST (wgMLST) which catalogue the allelic diversity found in hundreds to thousands of genes. These approaches provide a common nomenclature for high-resolution strain characterization and comparison. Molecular typing information is linked to isolate provenance, phenotype, and increasingly genome assemblies, providing a resource for outbreak investigation and research in to population structure, gene association, global epidemiology and vaccine coverage. A Representational State Transfer (REST) Application Programming Interface (API) has been developed for the PubMLST website to make these large quantities of structured molecular typing and whole genome sequence data available for programmatic access by any third party application. The API is an integral component of the Bacterial Isolate Genome Sequence Database (BIGSdb) platform that is used to host PubMLST resources, and exposes all public data within the site. In addition to data browsing, searching and download, the API supports authentication and submission of new data to curator queues. Database URL: http://rest.pubmlst.org/ PMID:29220452

  19. Diagnostic Applications of Next Generation Sequencing in Immunogenetics and Molecular Oncology

    PubMed Central

    Grumbt, Barbara; Eck, Sebastian H.; Hinrichsen, Tanja; Hirv, Kaimo

    2013-01-01

    Summary With the introduction of the next generation sequencing (NGS) technologies, remarkable new diagnostic applications have been established in daily routine. Implementation of NGS is challenging in clinical diagnostics, but definite advantages and new diagnostic possibilities make the switch to the technology inevitable. In addition to the higher sequencing capacity, clonal sequencing of single molecules, multiplexing of samples, higher diagnostic sensitivity, workflow miniaturization, and cost benefits are some of the valuable features of the technology. After the recent advances, NGS emerged as a proven alternative for classical Sanger sequencing in the typing of human leukocyte antigens (HLA). By virtue of the clonal amplification of single DNA molecules ambiguous typing results can be avoided. Simultaneously, a higher sample throughput can be achieved by tagging of DNA molecules with multiplex identifiers and pooling of PCR products before sequencing. In our experience, up to 380 samples can be typed for HLA-A, -B, and -DRB1 in high-resolution during every sequencing run. In molecular oncology, NGS shows a markedly increased sensitivity in comparison to the conventional Sanger sequencing and is developing to the standard diagnostic tool in detection of somatic mutations in cancer cells with great impact on personalized treatment of patients. PMID:23922545

  20. AgdbNet – antigen sequence database software for bacterial typing

    PubMed Central

    Jolley, Keith A; Maiden, Martin CJ

    2006-01-01

    Background Bacterial typing schemes based on the sequences of genes encoding surface antigens require databases that provide a uniform, curated, and widely accepted nomenclature of the variants identified. Due to the differences in typing schemes, imposed by the diversity of genes targeted, creating these databases has typically required the writing of one-off code to link the database to a web interface. Here we describe agdbNet, widely applicable web database software that facilitates simultaneous BLAST querying of multiple loci using either nucleotide or peptide sequences. Results Databases are described by XML files that are parsed by a Perl CGI script. Each database can have any number of loci, which may be defined by nucleotide and/or peptide sequences. The software is currently in use on at least five public databases for the typing of Neisseria meningitidis, Campylobacter jejuni and Streptococcus equi and can be set up to query internal isolate tables or suitably-configured external isolate databases, such as those used for multilocus sequence typing. The style of the resulting website can be fully configured by modifying stylesheets and through the use of customised header and footer files that surround the output of the script. Conclusion The software provides a rapid means of setting up customised Internet antigen sequence databases. The flexible configuration options enable typing schemes with differing requirements to be accommodated. PMID:16790057

  1. Multiplex detection of respiratory pathogens

    DOEpatents

    McBride, Mary [Brentwood, CA; Slezak, Thomas [Livermore, CA; Birch, James M [Albany, CA

    2012-07-31

    Described are kits and methods useful for detection of respiratory pathogens (influenza A (including subtyping capability for H1, H3, H5 and H7 subtypes) influenza B, parainfluenza (type 2), respiratory syncytial virus, and adenovirus) in a sample. Genomic sequence information from the respiratory pathogens was analyzed to identify signature sequences, e.g., polynucleotide sequences useful for confirming the presence or absence of a pathogen in a sample. Primer and probe sets were designed and optimized for use in a PCR based, multiplexed Luminex assay to successfully identify the presence or absence of pathogens in a sample.

  2. Single-Cell RNA Sequencing of the Bronchial Epithelium in Smokers With Lung Cancer

    DTIC Science & Technology

    2015-07-01

    AWARD NUMBER: W81XWH-14-1-0234 TITLE: Single-Cell RNA Sequencing of the Bronchial Epithelium in Smokers With Lung Cancer PRINCIPAL INVESTIGATOR...TITLE AND SUBTITLE Single-Cell RNA Sequencing of the Bronchial Epithelium in Smokers With Lung Cancer 5a. CONTRACT NUMBER 5b. GRANT NUMBER W81XWH...single cell RNA sequencing on airway epithelial cells obtained from smokers with and without lung cancer to identify cell-type dependent gene expression

  3. Geoseq: a tool for dissecting deep-sequencing datasets.

    PubMed

    Gurtowski, James; Cancio, Anthony; Shah, Hardik; Levovitz, Chaya; George, Ajish; Homann, Robert; Sachidanandam, Ravi

    2010-10-12

    Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO), Sequence Read Archive (SRA) hosted by the NCBI, or the DNA Data Bank of Japan (ddbj). Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Geoseq http://geoseq.mssm.edu provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to, a) identify differential isoform expression in mRNA-seq datasets, b) identify miRNAs (microRNAs) in libraries, and identify mature and star sequences in miRNAS and c) to identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.

  4. Evaluation of GenoType NTM-DR Assay for Identification of Mycobacterium chimaera.

    PubMed

    Mok, Simone; Rogers, Thomas R; Fitzgibbon, Margaret

    2017-06-01

    Identification of species within the Mycobacterium avium complex (MAC) is difficult, and most current diagnostic laboratory tests cannot distinguish between species included in the complex. Differentiation of species within the MAC is important, as Mycobacterium chimaera has recently emerged as a major cause of invasive cardiovascular infections following open heart surgery. A new commercial diagnostic assay, GenoType NTM-DR ver. 1.0, is intended to differentiate between three species within the MAC, namely, Mycobacterium avium , Mycobacterium intracellulare , and Mycobacterium chimaera In this study, we investigated an archival collection of 173 MAC isolates using 16S rRNA and 16S-23S internal transcribed spacer (ITS) gene sequencing, and GenoType NTM-DR was evaluated for identifying M. chimaera and other species belonging to the MAC. Species identification of 157/173 (91%) isolates with the GenoType NTM-DR assay was in agreement with 16S rRNA and 16S-23S ITS gene sequencing results. Misidentification occurred with 16 isolates which belonged to four species included in the MAC that are rarely encountered in clinical specimens. Despite some limitations of this assay, GenoType NTM-DR had 100% specificity for identifying M. chimaera This novel assay will enable diagnostic laboratories to differentiate species belonging to the Mycobacterium avium complex and to accurately identify M. chimaera It can produce rapid results and is also more cost efficient than gene sequencing methods. Copyright © 2017 American Society for Microbiology.

  5. Wide spread of OXA-23-producing carbapenem-resistant Acinetobacter baumannii belonging to clonal complex II in different hospitals in Lebanon.

    PubMed

    Al Atrouni, Ahmad; Hamze, Monzer; Jisr, Tamima; Lemarié, Carole; Eveillard, Matthieu; Joly-Guillou, Marie-Laure; Kempf, Marie

    2016-11-01

    To investigate the molecular epidemiology of Acinetobacter baumannii strains isolated from different hospitals in Lebanon. A total of 119 non-duplicate Acinetobacter strains were identified using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) and partial rpoB gene sequencing. Antibiotic susceptibility testing was performed by disc diffusion method and all identified carbapenem-resistant isolates were investigated by PCR assays for the presence of the carbapenemase-encoding genes. Multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE) were used for molecular typing. Of the 119 A. baumannii isolates, 76.5% were resistant to carbapenems. The most common carbapenemase was the OXA-23-type, found in 82 isolates. The study of population structure using MLST revealed the presence of 30 sequence types (STs) including 18 new ones, with ST2 being the most commonly detected, accounting for 61% of the isolates typed. PFGE performed on all strains of ST2 identified a major cluster of 53 isolates, in addition to three other minor clusters and ten unique profiles. This study highlights the wide dissemination of highly related OXA-23-producing carbapenem-resistant A. baumannii belonging to the international clone II in Lebanon. Thus, appropriate infection control measures are recommended in order to control the geographical spread of this clone in this country. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  6. Validation of Minim typing for fast and accurate discrimination of extended-spectrum, beta-lactamase-producing Klebsiella pneumoniae isolates in tertiary care hospital.

    PubMed

    Brhelova, Eva; Kocmanova, Iva; Racil, Zdenek; Hanslianova, Marketa; Antonova, Mariya; Mayer, Jiri; Lengerova, Martina

    2016-09-01

    Minim typing is derived from the multi-locus sequence typing (MLST). It targets the same genes, but sequencing is replaced by high resolution melt analysis. Typing can be performed by analysing six loci (6MelT), four loci (4MelT) or using data from four loci plus sequencing the tonB gene (HybridMelT). The aim of this study was to evaluate Minim typing to discriminate extended-spectrum beta-lactamase producing Klebsiella pneumoniae (ESBL-KLPN) isolates at our hospital. In total, 380 isolates were analyzed. The obtained alleles were assigned according to both the 6MelT and 4MelT typing scheme. In 97 isolates, the tonB gene was sequenced to enable HybridMelT typing. We found that the presented method is suitable to quickly monitor isolates of ESBL-KLPN; results are obtained in less than 2 hours and at a lower cost than MLST. We identified a local ESBL-KLPN outbreak and a comparison of colonizing and invasive isolates revealed a long term colonization of patients with the same strain. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Detection of porcine circovirus type 2 in pigs imported from Indonesia.

    PubMed

    Manokaran, Gayathri; Lin, Yueh-Nuo; Soh, Moi-Lien; Lim, Elizabeth Ai-Sim; Lim, Chee-Wee; Tan, Boon-Huan

    2008-11-25

    We have detected the presence of porcine circovirus (PCV) type 2 in Indonesian pigs imported to Singapore for food consumption. A total of three viral isolates were identified, and to genetically characterise them further, their full genomes were sequenced. Each genome showed a typical organization of PCV type 2, with the three isolates sharing similar genome lengths of 1767 nucleotide (nt) at high nt identities of 99.8-100%, further indicating that the viral isolates were quite homogeneous. Sequence analysis further revealed that the ORF2 genes contain the nt sequence CCCCGC (from nt position 262 to 267) that was previously reported to be associated with PCV type 2, group 1C. The phylogenetic tree was constructed for the ORF2 genes, and the PCV type 2 isolates distributed into two distinctive groups. The Indonesian PCV type 2 clustered tightly with one China isolate, accession number AY035820, as a sub-cluster in group 1C. The sequence and phylogenetic analyses both confirmed that the three Indonesian PCV type 2 isolates belong to group 1C, and that the genetic changes for the three Indonesian isolates were very stable, possibly due to the low-scale evolution.

  8. [Sequence-based typing of enviromental Legionella pneumophila isolates in Guangzhou].

    PubMed

    Zhang, Ying; Qu, Pinghua; Zhang, Jian; Chen, Shouyi

    2011-03-01

    To characterize the genes of Legionella pneumophila isolated from different water source in Guangzhou from 2006 to 2009. To genotype the strains by using sequence-based typing (SBT) scheme. In total 44 L. pneumophila strains were identified by SBT with 7 diversifying genes of flaA, asd, mip, pilE, mompS, proA and neuA. Analysis of the amplicons sequence was taken in the European Working Group for Legionella Infections (EWGLI) international SBT database to obtain the allelic profiles and sequence types (STs). Serogroups were typed by latex agglutination test. Data from SBT revealed a high diversity among the strains and ST01 accounts for 30% (13/ 44). Fifteen new STs were discovered from 20 STs and 2 of them were newly assigned (ST887 and ST888) by EWGLI. SBT Phylogenetic tree was generated by SplitsTree and BURST programs. High diversity and specificity were observed of the L. pneumophila strains in Guangzhou. SBT is useful for L. pneumophila genomic study and epidemiological surveillance.

  9. Microbiological Features of KPC-Producing Enterobacter Isolates Identified in a U.S. Hospital System

    PubMed Central

    Ahn, Chulsoo; Syed, Alveena; Hu, Fupin; O’Hara, Jessica A.; Rivera, Jesabel I.; Doi, Yohei

    2014-01-01

    Microbiological data regarding KPC-producing Enterobacter spp. are scarce. In this study, 11 unique KPC-producing Enterobacter isolates were identified among 44 ertapenem-non-susceptible Enterobacter isolates collected between 2009 and 2013 at a hospital system in Western Pennsylvania. All cases were healthcare-associated and occurred in medically complex patients. While pulsed-field gel electrophoresis (PFGE) showed diverse restriction patterns overall, multilocus sequence typing (MLST) identified Enterobacter cloacae isolates with sequence types (STs) 93 and 171 from two hospitals each. The levels of carbapenem minimum inhibitory concentrations were highly variable. All isolates remained susceptible to colistin, tigecycline, and the majority to amikacin and doxycycline. A blaKPC-carrying IncN plasmid conferring trimethoprim-sulfamethoxazole resistance was identified in three of the isolates. Spread of blaKPC in Enterobacter spp. appears to be due to a combination of plasmid-mediated and clonal processes. PMID:25053203

  10. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  11. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  12. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  13. PHASTpep: Analysis Software for Discovery of Cell-Selective Peptides via Phage Display and Next-Generation Sequencing

    PubMed Central

    Dasa, Siva Sai Krishna; Kelly, Kimberly A.

    2016-01-01

    Next-generation sequencing has enhanced the phage display process, allowing for the quantification of millions of sequences resulting from the biopanning process. In response, many valuable analysis programs focused on specificity and finding targeted motifs or consensus sequences were developed. For targeted drug delivery and molecular imaging, it is also necessary to find peptides that are selective—targeting only the cell type or tissue of interest. We present a new analysis strategy and accompanying software, PHage Analysis for Selective Targeted PEPtides (PHASTpep), which identifies highly specific and selective peptides. Using this process, we discovered and validated, both in vitro and in vivo in mice, two sequences (HTTIPKV and APPIMSV) targeted to pancreatic cancer-associated fibroblasts that escaped identification using previously existing software. Our selectivity analysis makes it possible to discover peptides that target a specific cell type and avoid other cell types, enhancing clinical translatability by circumventing complications with systemic use. PMID:27186887

  14. Prevalence of Diverse Clones of Vancomycin-Resistant Enterococcus faecium ST78 in a Chinese Hospital.

    PubMed

    Yang, Jiyong; Jiang, Yufeng; Guo, Ling; Ye, LIyan; Ma, Yanning; Luo, Yanping

    2016-06-01

    Vancomycin-resistant Enterococcus (VRE) has been identified in China. However, little is known about the spread of VRE isolates. The genetic relatedness of vancomycin-resistant Enterococcus faecium (VREfm) isolates was analyzed by pulsed-field gel electrophoresis (PFGE), their antimicrobial susceptibilities were analyzed by E-test and the VITEK 2 AST-GP67 test Kit, and their sequence types (STs) were investigated by multilocus sequence typing (MLST). S1-PFGE was used for plasmid profiling, and PCR and subsequent sequencing were performed to identify the virulence genes. A total of 96 nonduplicated VREfm isolates were obtained and categorized into 38 PFGE types (type 1-38). The predominant MLST type was ST78, while ST17, ST341, and ST342 were also sporadically identified. All types of clinical VREfm strains harbored the vanA gene; however, they carried plasmids of different sizes. While 92.1%, 71.1%, and 60.5% of VREfm strains carried hyl, scm, and ecbA genes, respectively, all of them were positive for esp, acm, sgrA, pilA, and pilB genes. Clonal VREfm spread was observed, and nonplasmid-mediated horizontal transfer of vancomycin-resistant gene might have conveyed resistance to some vancomycin-susceptible E. faecium strains. E. faecium ST78 carrying vanA gene was the most prevalent clone in this study. The high prevalence of virulence genes, including esp, hyl, acm, scm, ecbA, sgrA, pilA, and pilB, confirmed their important roles in the emergence of VREfm ST78 in nosocomial infections.

  15. Colonisation with toxigenic Corynebacterium diphtheriae in a Scottish burns patient, June 2015.

    PubMed

    Deshpande, Ashutosh; Inkster, Teresa; Hamilton, Kate; Litt, David; Fry, Norman; Kennedy, Iain T R; Shookhye-Dickson, Jacqueline; Hill, Robert L R

    2015-01-01

    On 12 June 2015, Corynebacterium diphtheriae was identified in a skin swab from a burns patient in Scotland. The isolate was confirmed to be genotypically and phenotypically toxigenic. Multilocus sequence typing of three patient isolates yielded sequence type ST 125. The patient was clinically well. We summarise findings of this case, and results of close contact identification and screening: 12 family and close contacts and 32 hospital staff have been found negative for C. diphtheriae.

  16. Distribution of Bartonella henselae Variants in Patients, Reservoir Hosts and Vectors in Spain

    PubMed Central

    Gil, Horacio; Escudero, Raquel; Pons, Inmaculada; Rodríguez-Vargas, Manuela; García-Esteban, Coral; Rodríguez-Moreno, Isabel; García-Amil, Cristina; Lobo, Bruno; Valcárcel, Félix; Pérez, Azucena; Jiménez, Santos; Jado, Isabel; Juste, Ramón; Segura, Ferrán; Anda, Pedro

    2013-01-01

    We have studied the diversity of B. henselae circulating in patients, reservoir hosts and vectors in Spain. In total, we have fully characterized 53 clinical samples from 46 patients, as well as 78 B. henselae isolates obtained from 35 cats from La Rioja and Catalonia (northeastern Spain), four positive cat blood samples from which no isolates were obtained, and three positive fleas by Multiple Locus Sequence Typing and Multiple Locus Variable Number Tandem Repeats Analysis. This study represents the largest series of human cases characterized with these methods, with 10 different sequence types and 41 MLVA profiles. Two of the sequence types and 35 of the profiles were not described previously. Most of the B. henselae variants belonged to ST5. Also, we have identified a common profile (72) which is well distributed in Spain and was found to persist over time. Indeed, this profile seems to be the origin from which most of the variants identified in this study have been generated. In addition, ST5, ST6 and ST9 were found associated with felines, whereas ST1, ST5 and ST8 were the most frequent sequence types found infecting humans. Interestingly, some of the feline associated variants never found on patients were located in a separate clade, which could represent a group of strains less pathogenic for humans. PMID:23874563

  17. Phylogenetic analysis of Mycobacterium massiliense strains having recombinant rpoB gene laterally transferred from Mycobacterium abscessus.

    PubMed

    Kim, Byoung-Jun; Kim, Ga-Na; Kim, Bo-Ram; Shim, Tae-Sun; Kook, Yoon-Hoh; Kim, Bum-Joon

    2017-01-01

    Recent multi locus sequence typing (MLST) and genome based studies indicate that lateral gene transfer (LGT) events in the rpoB gene are prevalent between Mycobacterium abscessus complex strains. To check the prevalence of the M. massiliense strains subject to rpoB LGT (Rec-mas), we applied rpoB typing (711 bp) to 106 Korean strains of M. massiliense infection that had already been identified by hsp65 sequence analysis (603 bp). The analysis indicated 6 smooth strains in M. massiliense Type I (10.0%, 6/60) genotypes but no strains in M. massiliense Type II genotypes (0%, 0/46), showing a discrepancy between the 2 typing methods. Further MLST analysis based on the partial sequencing of seven housekeeping genes, argH, cya, glpK, gnd, murC, pta and purH, as well as erm(41) PCR proved that these 6 Rec-mas strains consisted of two distinct genotypes belonging to M. massiliense and not M. abscessus. The complete rpoB sequencing analysis showed that these 6 Rec-mas strains have an identical hybrid rpoB gene, of which a 478 bp partial rpoB fragment may be laterally transferred from M. abscessus. Notably, five of the 6 Rec-mas strains showed complete identical sequences in a total of nine genes, including the seven MLST genes, hsp65, and rpoB, suggesting their clonal propagation in South Korea. In conclusion, we identified 6 M. massiliense smooth strains of 2 phylogenetically distinct genotypes with a specific hybrid rpoB gene laterally transferred from M. abscessus from Korean patients. Their clinical relevance and bacteriological traits remain to be elucidated.

  18. Phylogenetic analysis of Mycobacterium massiliense strains having recombinant rpoB gene laterally transferred from Mycobacterium abscessus

    PubMed Central

    Kim, Byoung-Jun; Kim, Ga-Na; Kim, Bo-Ram; Shim, Tae-Sun; Kook, Yoon-Hoh

    2017-01-01

    Recent multi locus sequence typing (MLST) and genome based studies indicate that lateral gene transfer (LGT) events in the rpoB gene are prevalent between Mycobacterium abscessus complex strains. To check the prevalence of the M. massiliense strains subject to rpoB LGT (Rec-mas), we applied rpoB typing (711 bp) to 106 Korean strains of M. massiliense infection that had already been identified by hsp65 sequence analysis (603 bp). The analysis indicated 6 smooth strains in M. massiliense Type I (10.0%, 6/60) genotypes but no strains in M. massiliense Type II genotypes (0%, 0/46), showing a discrepancy between the 2 typing methods. Further MLST analysis based on the partial sequencing of seven housekeeping genes, argH, cya, glpK, gnd, murC, pta and purH, as well as erm(41) PCR proved that these 6 Rec-mas strains consisted of two distinct genotypes belonging to M. massiliense and not M. abscessus. The complete rpoB sequencing analysis showed that these 6 Rec-mas strains have an identical hybrid rpoB gene, of which a 478 bp partial rpoB fragment may be laterally transferred from M. abscessus. Notably, five of the 6 Rec-mas strains showed complete identical sequences in a total of nine genes, including the seven MLST genes, hsp65, and rpoB, suggesting their clonal propagation in South Korea. In conclusion, we identified 6 M. massiliense smooth strains of 2 phylogenetically distinct genotypes with a specific hybrid rpoB gene laterally transferred from M. abscessus from Korean patients. Their clinical relevance and bacteriological traits remain to be elucidated. PMID:28604829

  19. Comparison of methods available for identification of Mycobacterium chimaera.

    PubMed

    Lecorche, E; Haenn, S; Mougari, F; Kumanski, S; Veziris, N; Benmansour, H; Raskine, L; Moulin, L; Cambau, E

    2018-04-01

    Mycobacterium chimaera is a recently described nontuberculous mycobacterium belonging to the Mycobacterium avium complex (MAC). Because this species is implicated in a worldwide outbreak due to contaminated heater-cooler unit water tanks during open-heart surgery, it has become mandatory for clinical microbiology laboratories to be able to differentiate M. chimaera from the other MAC species, especially M. intracellulare. Such identification has so far been restricted to specialized laboratories because it required the analysis of several gene sequences. The aim of this study was to evaluate commercial methods for identifying M. chimaera with regard to the reference gene sequencing ITS, the internal transcribed spacer 16-23S. Forty-seven clinical and environmental isolates including 41 MAC were identified by (a) PCR sequencing of the ITS and hsp65 genes, (b) three molecular biology kits (INNO-LiPA Mycobacteria, GenoType Mycobacterium CM and GenoType NTM-DR) and (c) matrix-assisted desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) using Microflex LT. There was a high concordance for species determination between the reference ITS sequencing and the GenoType NTM-DR test (39/41, 95%), the INNO-LiPA Mycobacteria test (38/41, 93%) and the hsp65 sequencing (38/41, 93%). The GenoType Mycobacterium CM test did not distinguish M. chimaera from M. intracellulare. MALDI-TOF MS distinguished two M. chimaera-M. intracellulare groups separated from M. avium and from the other mycobacterial species on a score-oriented dendrogram, but it also failed to differentiate the two species. INNO-LiPA Mycobacteria and GenoType NTM-DR are efficient assays for M. chimaera identification in clinical microbiology laboratories. Copyright © 2017 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.

  20. The alpha-fetoprotein third domain receptor binding fragment: in search of scavenger and associated receptor targets.

    PubMed

    Mizejewski, G J

    2015-01-01

    Recent studies have demonstrated that the carboxyterminal third domain of alpha-fetoprotein (AFP-CD) binds with various ligands and receptors. Reports within the last decade have established that AFP-CD contains a large fragment of amino acids that interact with several different receptor types. Using computer software specifically designed to identify protein-to-protein interaction at amino acid sequence docking sites, the computer searches identified several types of scavenger-associated receptors and their amino acid sequence locations on the AFP-CD polypeptide chain. The scavenger receptors (SRs) identified were CD36, CD163, Stabilin, SSC5D, SRB1 and SREC; the SR-associated receptors included the mannose, low-density lipoprotein receptors, the asialoglycoprotein receptor, and the receptor for advanced glycation endproducts (RAGE). Interestingly, some SR interaction sites were localized on the AFP-derived Growth Inhibitory Peptide (GIP) segment at amino acids #480-500. Following the detection studies, a structural subdomain analysis of both the receptor and the AFP-CD revealed the presence of epidermal growth factor (EGF) repeats, extracellular matrix-like protein regions, amino acid-rich motifs and dimerization subdomains. For the first time, it was reported that EGF-like sequence repeats were identified on each of the three domains of AFP. Thereafter, the localization of receptors on specific cell types were reviewed and their functions were discussed.

  1. Does typing of Chlamydia trachomatis using housekeeping multilocus sequence typing reveal different sexual networks among heterosexuals and men who have sex with men?

    PubMed

    Versteeg, Bart; Bruisten, Sylvia M; van der Ende, Arie; Pannekoek, Yvonne

    2016-04-18

    Chlamydia trachomatis infections remain the most common bacterial sexually transmitted infection worldwide. To gain more insight into the epidemiology and transmission of C. trachomatis, several schemes of multilocus sequence typing (MLST) have been developed. We investigated the clustering of C. trachomatis strains derived from men who have sex with men (MSM) and heterosexuals using the MLST scheme based on 7 housekeeping genes (MLST-7) adapted for clinical specimens and a high-resolution MLST scheme based on 6 polymorphic genes, including ompA (hr-MLST-6). Specimens from 100 C. trachomatis infected men who have sex with men (MSM) and 100 heterosexual women were randomly selected from previous studies and sequenced. We adapted the MLST-7 scheme to a nested assay to be suitable for direct typing of clinical specimens. All selected specimens were typed using both the adapted MLST-7 scheme and the hr-MLST-6 scheme. Clustering of C. trachomatis strains derived from MSM and heterosexuals was assessed using minimum spanning tree analysis. Sufficient chlamydial DNA was present in 188 of the 200 (94 %) selected samples. Using the adapted MLST-7 scheme, full MLST profiles were obtained for 187 of 188 tested specimens resulting in a high success rate of 99.5 %. Of these 187 specimens, 91 (48.7 %) were from MSM and 96 (51.3 %) from heterosexuals. We detected 21 sequence types (STs) using the adapted MLST-7 and 79 STs using the hr-MLST-6 scheme. Minimum spanning tree analyses was used to examine the clustering of MLST-7 data, which showed no reflection of separate transmission in MSM and heterosexual hosts. Moreover, typing using the hr-MLST-6 scheme identified genetically related clusters within each of clusters that were identified by using the MLST-7 scheme. No distinct transmission of C. trachomatis could be observed in MSM and heterosexuals using the adapted MLST-7 scheme in contrast to using the hr-MLST-6. In addition, we compared clustering of both MLST schemes and demonstrated that typing using the hr-MLST-6 scheme is able to identify genetically related clusters of C. trachomatis strains within each of the clusters that were identified by using the MLST-7 scheme.

  2. Determining Clostridium difficile intra-taxa diversity by mining multilocus sequence typing databases.

    PubMed

    Muñoz, Marina; Ríos-Chaparro, Dora Inés; Patarroyo, Manuel Alfonso; Ramírez, Juan David

    2017-03-14

    Multilocus sequence typing (MLST) is a highly discriminatory typing strategy; it is reproducible and scalable. There is a MLST scheme for Clostridium difficile (CD), a gram positive bacillus causing different pathologies of the gastrointestinal tract. This work was aimed at describing the frequency of sequence types (STs) and Clades (C) reported and evalute the intra-taxa diversity in the CD MLST database (CD-MLST-db) using an MLSA approach. Analysis of 1778 available isolates showed that clade 1 (C1) was the most frequent worldwide (57.7%), followed by C2 (29.1%). Regarding sequence types (STs), it was found that ST-1, belonging to C2, was the most frequent. The isolates analysed came from 17 countries, mostly from the United Kingdom (UK) (1541 STs, 87.0%). The diversity of the seven housekeeping genes in the MLST scheme was evaluated, and alleles from the profiles (STs), for identifying CD population structure. It was found that adk and atpA are conserved genes allowing a limited amount of clusters to be discriminated; however, different genes such as drx, glyA and particularly sodA showed high diversity indexes and grouped CD populations in many clusters, suggesting that these genes' contribution to CD typing should be revised. It was identified that CD STs reported to date have a mostly clonal population structure with foreseen events of recombination; however, one group of STs was not assigned to a clade being highly different containing at least nine well-supported clusters, suggesting a greater amount of clades for CD. This study shows the usefulness of CD-MLST-db as a tool for studying CD distribution and population structure, identifying the need for reviewing the usefulness of sodA as housekeeping gene within the MLST scheme and suggesting the existence of a greater amount of CD clades. The study also shows the plausible exchange of genetic material between STs, contributing towards intra-taxa genetic diversity.

  3. Depositional architecture and sequence stratigraphy of the Upper Jurassic Hanifa Formation, central Saudi Arabia

    NASA Astrophysics Data System (ADS)

    El-Sorogy, Abdelbaset; Al-Kahtany, Khaled; Almadani, Sattam; Tawfik, Mohamed

    2018-03-01

    To document the depositional architecture and sequence stratigraphy of the Upper Jurassic Hanifa Formation in central Saudi Arabia, three composite sections were examined, measured and thin section analysed at Al-Abakkayn, Sadous and Maashabah mountains. Fourteen microfacies types were identified, from wackestones to boundstones and which permits the recognition of five lithofacies associations in a carbonate platform. Lithofacies associations range from low energy, sponges, foraminifers and bioclastic burrowed offshoal deposits to moderate lithoclstic, peloidal and bioclastic foreshoal deposits in the lower part of the Hanifa while the upper part is dominated by corals, ooidal and peloidal high energy shoal deposits to moderate to low energy peloidal, stromatoporoids and other bioclastics back shoal deposits. The studied Hanifa Formation exhibits an obvious cyclicity, distinguishing from vertical variations in lithofacies types. These microfacies types are arranged in two third order sequences, the first sequence is equivalent to the lower part of the Hanifa Formation (Hawtah member) while the second one is equivalent to the upper part (Ulayyah member). Within these two sequences, there are three to six fourth-order high frequency sequences respectively in the studied sections.

  4. Molecular and phylogenetic characterizations of an Eimeria krijgsmanni Yakimoff & Gouseff, 1938 (Apicomplexa: Eimeriidae) mouse intestinal protozoan parasite by partial 18S ribosomal RNA gene sequence analysis.

    PubMed

    Takeo, Toshinori; Tanaka, Tetsuya; Matsubayashi, Makoto; Maeda, Hiroki; Kusakisako, Kodai; Matsui, Toshihiro; Mochizuki, Masami; Matsuo, Tomohide

    2014-08-01

    Previously, we characterized an undocumented strain of Eimeria krijgsmanni by morphological and biological features. Here, we present a detailed molecular phylogenetic analysis of this organism. Namely, 18S ribosomal RNA gene (rDNA) sequences of E. krijgsmanni were analyzed to incorporate this species into a comprehensive Eimeria phylogeny. As a result, partial 18S rDNA sequence from E. krijgsmanni was successfully determined, and two different types, Type A and Type B, that differed by 1 base pair were identified. E. krijgsmanni was originally isolated from a single oocyst, and thus the result show that the two types might have allelic sequence heterogeneity in the 18S rDNA. Based on phylogenetic analyses, the two types of E. krijgsmanni 18S rDNA formed one of two clades among murine Eimeria spp.; these Eimeria clades reflected morphological similarity among the Eimeria spp. This is the third molecular phylogenetic characterization of a murine Eimeria spp. in addition to E. falciformis and E. papillata. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  5. Single-cell sequencing in stem cell biology.

    PubMed

    Wen, Lu; Tang, Fuchou

    2016-04-15

    Cell-to-cell variation and heterogeneity are fundamental and intrinsic characteristics of stem cell populations, but these differences are masked when bulk cells are used for omic analysis. Single-cell sequencing technologies serve as powerful tools to dissect cellular heterogeneity comprehensively and to identify distinct phenotypic cell types, even within a 'homogeneous' stem cell population. These technologies, including single-cell genome, epigenome, and transcriptome sequencing technologies, have been developing rapidly in recent years. The application of these methods to different types of stem cells, including pluripotent stem cells and tissue-specific stem cells, has led to exciting new findings in the stem cell field. In this review, we discuss the recent progress as well as future perspectives in the methodologies and applications of single-cell omic sequencing technologies.

  6. groHMM: a computational tool for identifying unannotated and cell type-specific transcription units from global run-on sequencing data.

    PubMed

    Chae, Minho; Danko, Charles G; Kraus, W Lee

    2015-07-16

    Global run-on coupled with deep sequencing (GRO-seq) provides extensive information on the location and function of coding and non-coding transcripts, including primary microRNAs (miRNAs), long non-coding RNAs (lncRNAs), and enhancer RNAs (eRNAs), as well as yet undiscovered classes of transcripts. However, few computational tools tailored toward this new type of sequencing data are available, limiting the applicability of GRO-seq data for identifying novel transcription units. Here, we present groHMM, a computational tool in R, which defines the boundaries of transcription units de novo using a two state hidden-Markov model (HMM). A systematic comparison of the performance between groHMM and two existing peak-calling methods tuned to identify broad regions (SICER and HOMER) favorably supports our approach on existing GRO-seq data from MCF-7 breast cancer cells. To demonstrate the broader utility of our approach, we have used groHMM to annotate a diverse array of transcription units (i.e., primary transcripts) from four GRO-seq data sets derived from cells representing a variety of different human tissue types, including non-transformed cells (cardiomyocytes and lung fibroblasts) and transformed cells (LNCaP and MCF-7 cancer cells), as well as non-mammalian cells (from flies and worms). As an example of the utility of groHMM and its application to questions about the transcriptome, we show how groHMM can be used to analyze cell type-specific enhancers as defined by newly annotated enhancer transcripts. Our results show that groHMM can reveal new insights into cell type-specific transcription by identifying novel transcription units, and serve as a complete and useful tool for evaluating functional genomic elements in cells.

  7. Prevalence, genetic diversity and recombination of species G enteroviruses infecting pigs in Vietnam

    PubMed Central

    Van Dung, Nguyen; Anh, Pham Hong; Van Cuong, Nguyen; Hoa, Ngo Thi; Carrique-Mas, Juan; Hien, Vo Be; Campbell, James; Baker, Stephen; Farrar, Jeremy; Woolhouse, Mark E.; Bryant, Juliet E.

    2014-01-01

    Picornaviruses infecting pigs, described for many years as ‘porcine enteroviruses’, have recently been recognized as distinct viruses within three distinct genera (Teschovirus, Sapelovirus and Enterovirus). To better characterize the epidemiology and genetic diversity of members of the Enterovirus genus, faecal samples from pigs from four provinces in Vietnam were screened by PCR using conserved enterovirus (EV)-specific primers from the 5′ untranslated region (5′ UTR). High rates of infection were recorded in pigs on all farms, with detection frequencies of approximately 90 % in recently weaned pigs but declining to 40 % in those aged over 1 year. No differences in EV detection rates were observed between pigs with and without diarrhoea [74 % (n = 70) compared with 72 % (n = 128)]. Genetic analysis of consensus VP4/VP2 and VP1 sequences amplified from a subset of EV-infected pigs identified species G EVs in all samples. Among these, VP1 sequence comparisons identified six type 1 and seven type 6 variants, while four further VP1 sequences failed to group with any previously identified EV-G types. These have now been formally assigned as EV-G types 8–11 by the Picornavirus Study Group. Comparison of VP1, VP4/VP2, 3Dpol and 5′ UTRs of study samples and those available on public databases showed frequent, bootstrap-supported differences in their phylogenies indicative of extensive within-species recombination between genome regions. In summary, we identified extremely high frequencies of infection with EV-G in pigs in Vietnam, substantial genetic diversity and recombination within the species, and evidence for a much larger number of circulating EV-G types than currently described. PMID:24323635

  8. Evaluation of the class II region of the major histocompatibility complex of the greyhound with the genomic matching technique and sequence-based typing.

    PubMed

    Fliegner, R A; Holloway, S A; Lester, S; McLure, C A; Dawkins, R L

    2008-08-01

    The class II region of the major histocompatibility complex was evaluated in 25 greyhounds by sequence-based typing and the genomic matching technique (GMT). Two new DLA-DRB1 alleles were identified. Twenty-four dogs carried the DLA-DRB1*01201/DQA1*00401/DQB1*01303/DQB1*01701 haplotype, which carries two DQB1 alleles. One haplotype was identified from which DQB1 and DQA1 appeared to be deleted. The GMT enabled detection of DQB1 copy number, discrimination of the different class II haplotypes and the identification of new, possibly biologically relevant polymorphisms.

  9. Generalized lessons about sequence learning from the study of the serial reaction time task

    PubMed Central

    Schwarb, Hillary; Schumacher, Eric H.

    2012-01-01

    Over the last 20 years researchers have used the serial reaction time (SRT) task to investigate the nature of spatial sequence learning. They have used the task to identify the locus of spatial sequence learning, identify situations that enhance and those that impair learning, and identify the important cognitive processes that facilitate this type of learning. Although controversies remain, the SRT task has been integral in enhancing our understanding of implicit sequence learning. It is important, however, to ask what, if anything, the discoveries made using the SRT task tell us about implicit learning more generally. This review analyzes the state of the current spatial SRT sequence learning literature highlighting the stimulus-response rule hypothesis of sequence learning which we believe provides a unifying account of discrepant SRT data. It also challenges researchers to use the vast body of knowledge acquired with the SRT task to understand other implicit learning literatures too often ignored in the context of this particular task. This broad perspective will make it possible to identify congruences among data acquired using various different tasks that will allow us to generalize about the nature of implicit learning. PMID:22723815

  10. [Applylication of new type combined fragments: nrDNA ITS+ nad 1-intron 2 for identification of Dendrobium species of Fengdous].

    PubMed

    Geng, Li-xia; Zheng, Rui; Ren, Jie; Niu, Zhi-tao; Sun, Yu-long; Xue, Qing-yun; Liu, Wei; Ding, Xiao-yu

    2015-08-01

    In this study, 17 kinds of Dendrobium species of Fengdous including 39 individuals were collected from 4 provinces. Mitochondrial gene sequences co I, nad 5, nad 1-intron 2 and chloroplast gene sequences rbcL, matK amd psbA-trnH were amplified from these materials, as well as nrDNA ITS. Furthermore, suitable sequences for identification of Dendrobium species of Fengdous were screened by K-2-P and P-distance. The results showed that during the mentioned 7 sequences, nrDNA ITS, nad 1-intron 2 and psbA-trnH which had a high degree of variability could be used to identify Dendrobium species of Fengdous. However, single fragment could not be used to distinguish D. moniliforme and D. huoshanense. Moreover, compared to other combined fragments, new type combined fragments nrDNA ITS+nad 1-intron 2 was more effective in identifying the original plants of Dendrobium species and could be used to identify D. huoshanense and D. moniliforme. Besides, according to the UPGMA tree constructed with nrDNA ITS+nad 1-intron 2, 3 inspected Dendrobium plants were identified as D. huoshanense, D. moniliforme and D. officinale, respectively. This study identified Dendrobium species of Fengdous by combined fragments nrDNA ITS+nad 1-intron 2 for the first time, which provided a more effective basis for identification of Dendrobium species. And this study will be helpful for regulating the market of Fengdous.

  11. Diversity of Group I and II Clostridium botulinum Strains from France Including Recently Identified Subtypes

    PubMed Central

    Mazuet, Christelle; Legeay, Christine; Sautereau, Jean; Ma, Laurence; Bouchier, Christiane; Bouvet, Philippe; Popoff, Michel R.

    2016-01-01

    In France, human botulism is mainly food-borne intoxication, whereas infant botulism is rare. A total of 99 group I and II Clostridium botulinum strains including 59 type A (12 historical isolates [1947–1961], 43 from France [1986–2013], 3 from other countries, and 1 collection strain), 31 type B (3 historical, 23 recent isolates, 4 from other countries, and 1 collection strain), and 9 type E (5 historical, 3 isolates, and 1 collection strain) were investigated by botulinum locus gene sequencing and multilocus sequence typing analysis. Historical C. botulinum A strains mainly belonged to subtype A1 and sequence type (ST) 1, whereas recent strains exhibited a wide genetic diversity: subtype A1 in orfX or ha locus, A1(B), A1(F), A2, A2b2, A5(B2′) A5(B3′), as well as the recently identified A7 and A8 subtypes, and were distributed into 25 STs. Clostridium botulinum A1(B) was the most frequent subtype from food-borne botulism and food. Group I C. botulinum type B in France were mainly subtype B2 (14 out of 20 historical and recent strains) and were divided into 19 STs. Food-borne botulism resulting from ham consumption during the recent period was due to group II C. botulinum B4. Type E botulism is rare in France, 5 historical and 1 recent strains were subtype E3. A subtype E12 was recently identified from an unusual ham contamination. Clostridium botulinum strains from human botulism in France showed a wide genetic diversity and seems to result not from a single evolutionary lineage but from multiple and independent genetic rearrangements. PMID:27189984

  12. De Novo Sequencing and Analysis of Lemongrass Transcriptome Provide First Insights into the Essential Oil Biosynthesis of Aromatic Grasses.

    PubMed

    Meena, Seema; Kumar, Sarma R; Venkata Rao, D K; Dwivedi, Varun; Shilpashree, H B; Rastogi, Shubhra; Shasany, Ajit K; Nagegowda, Dinesh A

    2016-01-01

    Aromatic grasses of the genus Cymbopogon (Poaceae family) represent unique group of plants that produce diverse composition of monoterpene rich essential oils, which have great value in flavor, fragrance, cosmetic, and aromatherapy industries. Despite the commercial importance of these natural aromatic oils, their biosynthesis at the molecular level remains unexplored. As the first step toward understanding the essential oil biosynthesis, we performed de novo transcriptome assembly and analysis of C. flexuosus (lemongrass) by employing Illumina sequencing. Mining of transcriptome data and subsequent phylogenetic analysis led to identification of terpene synthases, pyrophosphatases, alcohol dehydrogenases, aldo-keto reductases, carotenoid cleavage dioxygenases, alcohol acetyltransferases, and aldehyde dehydrogenases, which are potentially involved in essential oil biosynthesis. Comparative essential oil profiling and mRNA expression analysis in three Cymbopogon species (C. flexuosus, aldehyde type; C. martinii, alcohol type; and C. winterianus, intermediate type) with varying essential oil composition indicated the involvement of identified candidate genes in the formation of alcohols, aldehydes, and acetates. Molecular modeling and docking further supported the role of identified protein sequences in aroma formation in Cymbopogon. Also, simple sequence repeats were found in the transcriptome with many linked to terpene pathway genes including the genes potentially involved in aroma biosynthesis. This work provides the first insights into the essential oil biosynthesis of aromatic grasses, and the identified candidate genes and markers can be a great resource for biotechnological and molecular breeding approaches to modulate the essential oil composition.

  13. De Novo Sequencing and Analysis of Lemongrass Transcriptome Provide First Insights into the Essential Oil Biosynthesis of Aromatic Grasses

    PubMed Central

    Meena, Seema; Kumar, Sarma R.; Venkata Rao, D. K.; Dwivedi, Varun; Shilpashree, H. B.; Rastogi, Shubhra; Shasany, Ajit K.; Nagegowda, Dinesh A.

    2016-01-01

    Aromatic grasses of the genus Cymbopogon (Poaceae family) represent unique group of plants that produce diverse composition of monoterpene rich essential oils, which have great value in flavor, fragrance, cosmetic, and aromatherapy industries. Despite the commercial importance of these natural aromatic oils, their biosynthesis at the molecular level remains unexplored. As the first step toward understanding the essential oil biosynthesis, we performed de novo transcriptome assembly and analysis of C. flexuosus (lemongrass) by employing Illumina sequencing. Mining of transcriptome data and subsequent phylogenetic analysis led to identification of terpene synthases, pyrophosphatases, alcohol dehydrogenases, aldo-keto reductases, carotenoid cleavage dioxygenases, alcohol acetyltransferases, and aldehyde dehydrogenases, which are potentially involved in essential oil biosynthesis. Comparative essential oil profiling and mRNA expression analysis in three Cymbopogon species (C. flexuosus, aldehyde type; C. martinii, alcohol type; and C. winterianus, intermediate type) with varying essential oil composition indicated the involvement of identified candidate genes in the formation of alcohols, aldehydes, and acetates. Molecular modeling and docking further supported the role of identified protein sequences in aroma formation in Cymbopogon. Also, simple sequence repeats were found in the transcriptome with many linked to terpene pathway genes including the genes potentially involved in aroma biosynthesis. This work provides the first insights into the essential oil biosynthesis of aromatic grasses, and the identified candidate genes and markers can be a great resource for biotechnological and molecular breeding approaches to modulate the essential oil composition. PMID:27516768

  14. Analysis of the genome-wide variations among multiple strains of the plant pathogenic bacterium Xylella fastidiosa

    PubMed Central

    Doddapaneni, Harshavardhan; Yao, Jiqiang; Lin, Hong; Walker, M Andrew; Civerolo, Edwin L

    2006-01-01

    Background The Gram-negative, xylem-limited phytopathogenic bacterium Xylella fastidiosa is responsible for causing economically important diseases in grapevine, citrus and many other plant species. Despite its economic impact, relatively little is known about the genomic variations among strains isolated from different hosts and their influence on the population genetics of this pathogen. With the availability of genome sequence information for four strains, it is now possible to perform genome-wide analyses to identify and categorize such DNA variations and to understand their influence on strain functional divergence. Results There are 1,579 genes and 194 non-coding homologous sequences present in the genomes of all four strains, representing a 76. 2% conservation of the sequenced genome. About 60% of the X. fastidiosa unique sequences exist as tandem gene clusters of 6 or more genes. Multiple alignments identified 12,754 SNPs and 14,449 INDELs in the 1528 common genes and 20,779 SNPs and 10,075 INDELs in the 194 non-coding sequences. The average SNP frequency was 1.08 × 10-2 per base pair of DNA and the average INDEL frequency was 2.06 × 10-2 per base pair of DNA. On an average, 60.33% of the SNPs were synonymous type while 39.67% were non-synonymous type. The mutation frequency, primarily in the form of external INDELs was the main type of sequence variation. The relative similarity between the strains was discussed according to the INDEL and SNP differences. The number of genes unique to each strain were 60 (9a5c), 54 (Dixon), 83 (Ann1) and 9 (Temecula-1). A sub-set of the strain specific genes showed significant differences in terms of their codon usage and GC composition from the native genes suggesting their xenologous origin. Tandem repeat analysis of the genomic sequences of the four strains identified associations of repeat sequences with hypothetical and phage related functions. Conclusion INDELs and strain specific genes have been identified as the main source of variations among strains, with individual strains showing different rates of genome evolution. Based on these genome comparisons, it appears that the Pierce's disease strain Temecula-1 genome represents the ancestral genome of the X. fastidiosa. Results of this analysis are publicly available in the form of a web database. PMID:16948851

  15. Single-Molecule Electrical Random Resequencing of DNA and RNA

    NASA Astrophysics Data System (ADS)

    Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji

    2012-07-01

    Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.

  16. Signatures from Tissue-specific MPSS Libraries Identify Transcripts Preferentially Expressed in the Mouse Inner Ear

    PubMed Central

    Peters, Linda M.; Belyantseva, Inna A.; Lagziel, Ayala; Battey, James F.; Friedman, Thomas B.; Morell, Robert J.

    2007-01-01

    Specialization in cell function and morphology is influenced by the differential expression of mRNAs, many of which are expressed at low abundance and restricted to certain cell types. Detecting such transcripts in cDNA libraries may require sequencing millions of clones. Massively parallel signature sequencing (MPSS) is well-suited for identifying transcripts that are expressed in discrete cell types and in low abundance. We have made MPSS libraries from microdissections of three inner ear tissues. By comparing these MPSS libraries to those of 87 other tissues included in the Mouse Reference Transcriptome (MRT) online resource, we have identified genes that are highly enriched in, or specific to, the inner ear. We show by RT-PCR and in situ hybridization that signatures unique to the inner ear libraries identify transcripts with highly specific cell-type localizations. These transcripts serve to illustrate the utility of a resource that is available to the research community. Utilization of these resources will increase the number of known transcription units and expand our knowledge of the tissue-specific regulation of the transcriptome. PMID:17049805

  17. Review and International Recommendation of Methods for Typing Neisseria gonorrhoeae Isolates and Their Implications for Improved Knowledge of Gonococcal Epidemiology, Treatment, and Biology

    PubMed Central

    Unemo, Magnus; Dillon, Jo-Anne R.

    2011-01-01

    Summary: Gonorrhea, which may become untreatable due to multiple resistance to available antibiotics, remains a public health problem worldwide. Precise methods for typing Neisseria gonorrhoeae, together with epidemiological information, are crucial for an enhanced understanding regarding issues involving epidemiology, test of cure and contact tracing, identifying core groups and risk behaviors, and recommending effective antimicrobial treatment, control, and preventive measures. This review evaluates methods for typing N. gonorrhoeae isolates and recommends various methods for different situations. Phenotypic typing methods, as well as some now-outdated DNA-based methods, have limited usefulness in differentiating between strains of N. gonorrhoeae. Genotypic methods based on DNA sequencing are preferred, and the selection of the appropriate genotypic method should be guided by its performance characteristics and whether short-term epidemiology (microepidemiology) or long-term and/or global epidemiology (macroepidemiology) matters are being investigated. Currently, for microepidemiological questions, the best methods for fast, objective, portable, highly discriminatory, reproducible, typeable, and high-throughput characterization are N. gonorrhoeae multiantigen sequence typing (NG-MAST) or full- or extended-length porB gene sequencing. However, pulsed-field gel electrophoresis (PFGE) and Opa typing can be valuable in specific situations, i.e., extreme microepidemiology, despite their limitations. For macroepidemiological studies and phylogenetic studies, DNA sequencing of chromosomal housekeeping genes, such as multilocus sequence typing (MLST), provides a more nuanced understanding. PMID:21734242

  18. Rapid Detection of Rare Deleterious Variants by Next Generation Sequencing with Optional Microarray SNP Genotype Data

    PubMed Central

    Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.

    2015-01-01

    ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133

  19. Metallo-β-lactamase-producing Pseudomonas aeruginosa in the Netherlands: the nationwide emergence of a single sequence type.

    PubMed

    Van der Bij, A K; Van der Zwan, D; Peirano, G; Severin, J A; Pitout, J D D; Van Westreenen, M; Goessens, W H F

    2012-09-01

    Recently, the first outbreak of clonally related VIM-2 metallo-β-lactamase (MBL)-producing Pseudomonas aeruginosa in a Dutch tertiary-care centre was described. Subsequently, a nationwide surveillance study was performed in 2010-2011, which identified the presence of VIM-2 MBL-producing P. aeruginosa in 11 different hospitals. Genotyping by multiple-locus variable-number tandem-repeat analysis (MLVA) showed that the majority of the 82 MBL-producing isolates found belonged to a single MLVA type (n = 70, 85%), identified as ST111 by multilocus sequence typing (MLST). As MBL-producing isolates cause serious infections that are difficult to treat, the presence of clonally related isolates in various hospitals throughout the Netherlands is of nationwide concern. © 2012 The Authors. Clinical Microbiology and Infection © 2012 European Society of Clinical Microbiology and Infectious Diseases.

  20. First Description of Two Sequence Type 2 Acinetobacter baumannii Isolates Carrying OXA-23 Carbapenemase in Pagellus acarne Fished from the Mediterranean Sea near Bejaia, Algeria

    PubMed Central

    Brahmi, Soumia; Touati, Abdelaziz; Cadière, Axelle; Djahmi, Nassima; Pantel, Alix; Sotto, Albert; Dunyach-Remy, Catherine

    2016-01-01

    To determine the occurrence of carbapenem-resistant Acinetobacter baumannii in fish fished from the Mediterranean Sea near the Bejaia coast (Algeria), we studied 300 gills and gut samples that had been randomly and prospectively collected during 1 year. After screening on selective agar media, using PCR arrays and whole-genome sequencing, we identified for the first time two OXA-23-producing A. baumannii strains belonging to the widespread sequence type 2 (ST2)/international clone II and harboring aminoglycoside-modifying enzymes [aac(6′)-Ib and aac(3′)-I genes]. PMID:26787693

  1. Occurrence of Stolbur Phytoplasma Disease in Spreading Type Petunia hybrida Cultivars in Korea

    PubMed Central

    Chung, Bong Nam; Jeong, Myeong Il; Choi, Seung Kook; Joa, Jae Ho; Choi, Kyeong San; Choi, In Myeong

    2013-01-01

    In January 2012, spreading type petunia cv. Wave Pink plants showing an abnormal growth habit of sprouting unusual multiple plantlets from the lateral buds were collected from a greenhouse in Gwacheon, Gyeonggi Province, Korea. The presence of phytoplasma was investigated using PCR with the primer pairs P1/P6, and R16F1/R1 for nested-PCR. In the nested PCR, 1,096 bp PCR products were obtained, and through sequencing 12 Pet-Stol isolates were identified. Comparison of the nucleotide sequences of 16S rRNA gene of the 12 Pet-Stol isolates with other phytoplasmas belonging to aster yellows or Stolbur showed that Pet-Stol isolates were members of Stolbur. The presence of phytoplasma in petunia was also confirmed by microscopic observation of the pathogens. In this study, Stolbur phytoplasma was identified from spreading type petunia cultivars by sequence analysis of 16S rRNA gene of phytoplasma and microscopic observation of phytoplasma bodies. This is the first report of Stolbur phytoplasma in commercial Petunia hybrida cultivars. PMID:25288978

  2. Exome sequencing-driven discovery of coding polymorphisms associated with common metabolic phenotypes.

    PubMed

    Albrechtsen, A; Grarup, N; Li, Y; Sparsø, T; Tian, G; Cao, H; Jiang, T; Kim, S Y; Korneliussen, T; Li, Q; Nie, C; Wu, R; Skotte, L; Morris, A P; Ladenvall, C; Cauchi, S; Stančáková, A; Andersen, G; Astrup, A; Banasik, K; Bennett, A J; Bolund, L; Charpentier, G; Chen, Y; Dekker, J M; Doney, A S F; Dorkhan, M; Forsen, T; Frayling, T M; Groves, C J; Gui, Y; Hallmans, G; Hattersley, A T; He, K; Hitman, G A; Holmkvist, J; Huang, S; Jiang, H; Jin, X; Justesen, J M; Kristiansen, K; Kuusisto, J; Lajer, M; Lantieri, O; Li, W; Liang, H; Liao, Q; Liu, X; Ma, T; Ma, X; Manijak, M P; Marre, M; Mokrosiński, J; Morris, A D; Mu, B; Nielsen, A A; Nijpels, G; Nilsson, P; Palmer, C N A; Rayner, N W; Renström, F; Ribel-Madsen, R; Robertson, N; Rolandsson, O; Rossing, P; Schwartz, T W; Slagboom, P E; Sterner, M; Tang, M; Tarnow, L; Tuomi, T; van't Riet, E; van Leeuwen, N; Varga, T V; Vestmar, M A; Walker, M; Wang, B; Wang, Y; Wu, H; Xi, F; Yengo, L; Yu, C; Zhang, X; Zhang, J; Zhang, Q; Zhang, W; Zheng, H; Zhou, Y; Altshuler, D; 't Hart, L M; Franks, P W; Balkau, B; Froguel, P; McCarthy, M I; Laakso, M; Groop, L; Christensen, C; Brandslund, I; Lauritzen, T; Witte, D R; Linneberg, A; Jørgensen, T; Hansen, T; Wang, J; Nielsen, R; Pedersen, O

    2013-02-01

    Human complex metabolic traits are in part regulated by genetic determinants. Here we applied exome sequencing to identify novel associations of coding polymorphisms at minor allele frequencies (MAFs) >1% with common metabolic phenotypes. The study comprised three stages. We performed medium-depth (8×) whole exome sequencing in 1,000 cases with type 2 diabetes, BMI >27.5 kg/m(2) and hypertension and in 1,000 controls (stage 1). We selected 16,192 polymorphisms nominally associated (p < 0.05) with case-control status, from four selected annotation categories or from loci reported to associate with metabolic traits. These variants were genotyped in 15,989 Danes to search for association with 12 metabolic phenotypes (stage 2). In stage 3, polymorphisms showing potential associations were genotyped in a further 63,896 Europeans. Exome sequencing identified 70,182 polymorphisms with MAF >1%. In stage 2 we identified 51 potential associations with one or more of eight metabolic phenotypes covered by 45 unique polymorphisms. In meta-analyses of stage 2 and stage 3 results, we demonstrated robust associations for coding polymorphisms in CD300LG (fasting HDL-cholesterol: MAF 3.5%, p = 8.5 × 10(-14)), COBLL1 (type 2 diabetes: MAF 12.5%, OR 0.88, p = 1.2 × 10(-11)) and MACF1 (type 2 diabetes: MAF 23.4%, OR 1.10, p = 8.2 × 10(-10)). We applied exome sequencing as a basis for finding genetic determinants of metabolic traits and show the existence of low-frequency and common coding polymorphisms with impact on common metabolic traits. Based on our study, coding polymorphisms with MAF above 1% do not seem to have particularly high effect sizes on the measured metabolic traits.

  3. Genome sequencing and analysis of a type A Clostridium perfringens isolate from a case of bovine clostridial abomasitis.

    PubMed

    Nowell, Victoria J; Kropinski, Andrew M; Songer, J Glenn; MacInnes, Janet I; Parreira, Valeria R; Prescott, John F

    2012-01-01

    Clostridium perfringens is a common inhabitant of the avian and mammalian gastrointestinal tracts and can behave commensally or pathogenically. Some enteric diseases caused by type A C. perfringens, including bovine clostridial abomasitis, remain poorly understood. To investigate the potential basis of virulence in strains causing this disease, we sequenced the genome of a type A C. perfringens isolate (strain F262) from a case of bovine clostridial abomasitis. The ∼3.34 Mbp chromosome of C. perfringens F262 is predicted to contain 3163 protein-coding genes, 76 tRNA genes, and an integrated plasmid sequence, Cfrag (∼18 kb). In addition, sequences of two complete circular plasmids, pF262C (4.8 kb) and pF262D (9.1 kb), and two incomplete plasmid fragments, pF262A (48.5 kb) and pF262B (50.0 kb), were identified. Comparison of the chromosome sequence of C. perfringens F262 to complete C. perfringens chromosomes, plasmids and phages revealed 261 unique genes. No novel toxin genes related to previously described clostridial toxins were identified: 60% of the 261 unique genes were hypothetical proteins. There was a two base pair deletion in virS, a gene reported to encode the main sensor kinase involved in virulence gene activation. Despite this frameshift mutation, C. perfringens F262 expressed perfringolysin O, alpha-toxin and the beta2-toxin, suggesting that another regulation system might contribute to the pathogenicity of this strain. Two complete plasmids, pF262C (4.8 kb) and pF262D (9.1 kb), unique to this strain of C. perfringens were identified.

  4. Genome Sequencing and Analysis of a Type A Clostridium perfringens Isolate from a Case of Bovine Clostridial Abomasitis

    PubMed Central

    Nowell, Victoria J.; Kropinski, Andrew M.; Songer, J. Glenn; MacInnes, Janet I.; Parreira, Valeria R.; Prescott, John F.

    2012-01-01

    Clostridium perfringens is a common inhabitant of the avian and mammalian gastrointestinal tracts and can behave commensally or pathogenically. Some enteric diseases caused by type A C. perfringens, including bovine clostridial abomasitis, remain poorly understood. To investigate the potential basis of virulence in strains causing this disease, we sequenced the genome of a type A C. perfringens isolate (strain F262) from a case of bovine clostridial abomasitis. The ∼3.34 Mbp chromosome of C. perfringens F262 is predicted to contain 3163 protein-coding genes, 76 tRNA genes, and an integrated plasmid sequence, Cfrag (∼18 kb). In addition, sequences of two complete circular plasmids, pF262C (4.8 kb) and pF262D (9.1 kb), and two incomplete plasmid fragments, pF262A (48.5 kb) and pF262B (50.0 kb), were identified. Comparison of the chromosome sequence of C. perfringens F262 to complete C. perfringens chromosomes, plasmids and phages revealed 261 unique genes. No novel toxin genes related to previously described clostridial toxins were identified: 60% of the 261 unique genes were hypothetical proteins. There was a two base pair deletion in virS, a gene reported to encode the main sensor kinase involved in virulence gene activation. Despite this frameshift mutation, C. perfringens F262 expressed perfringolysin O, alpha-toxin and the beta2-toxin, suggesting that another regulation system might contribute to the pathogenicity of this strain. Two complete plasmids, pF262C (4.8 kb) and pF262D (9.1 kb), unique to this strain of C. perfringens were identified. PMID:22412860

  5. Bifidobacterium animalis subsp. lactis ATCC 27673 Is a Genomically Unique Strain within Its Conserved Subspecies

    PubMed Central

    Loquasto, Joseph R.; Barrangou, Rodolphe; Dudley, Edward G.; Stahl, Buffy; Chen, Chun

    2013-01-01

    Many strains of Bifidobacterium animalis subsp. lactis are considered health-promoting probiotic microorganisms and are commonly formulated into fermented dairy foods. Analyses of previously sequenced genomes of B. animalis subsp. lactis have revealed little genetic diversity, suggesting that it is a monomorphic subspecies. However, during a multilocus sequence typing survey of Bifidobacterium, it was revealed that B. animalis subsp. lactis ATCC 27673 gave a profile distinct from that of the other strains of the subspecies. As part of an ongoing study designed to understand the genetic diversity of this subspecies, the genome of this strain was sequenced and compared to other sequenced genomes of B. animalis subsp. lactis and B. animalis subsp. animalis. The complete genome of ATCC 27673 was 1,963,012 bp, contained 1,616 genes and 4 rRNA operons, and had a G+C content of 61.55%. Comparative analyses revealed that the genome of ATCC 27673 contained six distinct genomic islands encoding 83 open reading frames not found in other strains of the same subspecies. In four islands, either phage or mobile genetic elements were identified. In island 6, a novel clustered regularly interspaced short palindromic repeat (CRISPR) locus which contained 81 unique spacers was identified. This type I-E CRISPR-cas system differs from the type I-C systems previously identified in this subspecies, representing the first identification of a different system in B. animalis subsp. lactis. This study revealed that ATCC 27673 is a strain of B. animalis subsp. lactis with novel genetic content and suggests that the lack of genetic variability observed is likely due to the repeated sequencing of a limited number of widely distributed commercial strains. PMID:23995933

  6. Accurate prediction of secreted substrates and identification of a conserved putative secretion signal for type III secretion systems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Samudrala, Ram; Heffron, Fred; McDermott, Jason E.

    2009-04-24

    The type III secretion system is an essential component for virulence in many Gram-negative bacteria. Though components of the secretion system apparatus are conserved, its substrates, effector proteins, are not. We have used a machine learning approach to identify new secreted effectors. The method integrates evolutionary measures, such as the pattern of homologs in a range of other organisms, and sequence-based features, such as G+C content, amino acid composition and the N-terminal 30 residues of the protein sequence. The method was trained on known effectors from Salmonella typhimurium and validated on a corresponding set of effectors from Pseudomonas syringae, aftermore » eliminating effectors with detectable sequence similarity. The method was able to identify all of the known effectors in P. syringae with a specificity of 84% and sensitivity of 82%. The reciprocal validation, training on P. syringae and validating on S. typhimurium, gave similar results with a specificity of 86% when the sensitivity level was 87%. These results show that type III effectors in disparate organisms share common features. We found that maximal performance is attained by including an N-terminal sequence of only 30 residues, which agrees with previous studies indicating that this region contains the secretion signal. We then used the method to define the most important residues in this putative secretion signal. Finally, we present novel predictions of secreted effectors in S. typhimurium, some of which have been experimentally validated, and apply the method to predict secreted effectors in the genetically intractable human pathogen Chlamydia trachomatis. This approach is a novel and effective way to identify secreted effectors in a broad range of pathogenic bacteria for further experimental characterization and provides insight into the nature of the type III secretion signal.« less

  7. Categorizing accident sequences in the external radiotherapy for risk analysis

    PubMed Central

    2013-01-01

    Purpose This study identifies accident sequences from the past accidents in order to help the risk analysis application to the external radiotherapy. Materials and Methods This study reviews 59 accidental cases in two retrospective safety analyses that have collected the incidents in the external radiotherapy extensively. Two accident analysis reports that accumulated past incidents are investigated to identify accident sequences including initiating events, failure of safety measures, and consequences. This study classifies the accidents by the treatments stages and sources of errors for initiating events, types of failures in the safety measures, and types of undesirable consequences and the number of affected patients. Then, the accident sequences are grouped into several categories on the basis of similarity of progression. As a result, these cases can be categorized into 14 groups of accident sequence. Results The result indicates that risk analysis needs to pay attention to not only the planning stage, but also the calibration stage that is committed prior to the main treatment process. It also shows that human error is the largest contributor to initiating events as well as to the failure of safety measures. This study also illustrates an event tree analysis for an accident sequence initiated in the calibration. Conclusion This study is expected to provide sights into the accident sequences for the prospective risk analysis through the review of experiences. PMID:23865005

  8. Multiple alignment-free sequence comparison

    PubMed Central

    Ren, Jie; Song, Kai; Sun, Fengzhu; Deng, Minghua; Reinert, Gesine

    2013-01-01

    Motivation: Recently, a range of new statistics have become available for the alignment-free comparison of two sequences based on k-tuple word content. Here, we extend these statistics to the simultaneous comparison of more than two sequences. Our suite of statistics contains, first, and , extensions of statistics for pairwise comparison of the joint k-tuple content of all the sequences, and second, , and , averages of sums of pairwise comparison statistics. The two tasks we consider are, first, to identify sequences that are similar to a set of target sequences, and, second, to measure the similarity within a set of sequences. Results: Our investigation uses both simulated data as well as cis-regulatory module data where the task is to identify cis-regulatory modules with similar transcription factor binding sites. We find that although for real data, all of our statistics show a similar performance, on simulated data the Shepp-type statistics are in some instances outperformed by star-type statistics. The multiple alignment-free statistics are more sensitive to contamination in the data than the pairwise average statistics. Availability: Our implementation of the five statistics is available as R package named ‘multiAlignFree’ at be http://www-rcf.usc.edu/∼fsun/Programs/multiAlignFree/multiAlignFreemain.html. Contact: reinert@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23990418

  9. Genotyping of Indian antigenic, vaccine, and field Brucella spp. using multilocus sequence typing.

    PubMed

    Shome, Rajeswari; Krithiga, Natesan; Shankaranarayana, Padmashree B; Jegadesan, Sankarasubramanian; Udayakumar S, Vishnu; Shome, Bibek Ranjan; Saikia, Girin Kumar; Sharma, Narendra Kumar; Chauhan, Harshad; Chandel, Bharat Singh; Jeyaprakash, Rajendhran; Rahman, Habibur

    2016-03-31

    Brucellosis is one of the most important zoonotic diseases that affects multiple livestock species and causes great economic losses. The highly conserved genomes of Brucella, with > 90% homology among species, makes it important to study the genetic diversity circulating in the country. A total of 26 Brucella spp. (4 reference strains and 22 field isolates) and 1 B. melitensis draft genome sequence from India (B. melitensis Bm IND1) were included for sequence typing. The field isolates were identified by biochemical tests and confirmed by both conventional and quantitative polymerase chain reaction (qPCR) targeting bcsp 31Brucella genus-specific marker. Brucella speciation and biotyping was done by Bruce ladder, probe qPCR, and AMOS PCRs, respectively, and genotyping was done by multilocus sequence typing (MLST). The MLST typing of 27 Brucella spp. revealed five distinct sequence types (STs); the B. abortus S99 reference strain and 21 B. abortus field isolates belonged to ST1. On the other hand, the vaccine strain B. abortus S19 was genotyped as ST5. Similarly, B. melitensis 16M reference strain and one B. melitensis field isolate were grouped into ST7. Another B. melitensis field isolate belonged to ST8 (draft genome sequence from India), and only B. suis 1330 reference strain was found to be ST14. The sequences revealed genetic similarity of the Indian strains to the global reference and field strains. The study highlights the usefulness of MLST for typing of field isolates and validation of reference strains used for diagnosis and vaccination against brucellosis.

  10. Dispersion of Multidrug-Resistant Enterococcus faecium Isolates Belonging to Major Clonal Complexes in Different Portuguese Settings▿

    PubMed Central

    Freitas, Ana R.; Novais, Carla; Ruiz-Garbajosa, Patricia; Coque, Teresa M.; Peixe, Luísa

    2009-01-01

    The population structure of 56 Enterococcus faecium isolates selected from a collection of enterococci from humans, animals, and the environment in Portugal (1997 to 2007) was analyzed by multilocus sequence typing. We identified 41 sequence types clustering into CC17, CC5, CC9, CC22 and CC94, all clonal lineages comprising isolates from different hosts. Our findings highlight the role of community-associated hosts as reservoirs of enterococci able to cause human infections. PMID:19447948

  11. Highly conserved intragenic HSV-2 sequences: Results from next-generation sequencing of HSV-2 UL and US regions from genital swabs collected from 3 continents.

    PubMed

    Johnston, Christine; Magaret, Amalia; Roychoudhury, Pavitra; Greninger, Alexander L; Cheng, Anqi; Diem, Kurt; Fitzgibbon, Matthew P; Huang, Meei-Li; Selke, Stacy; Lingappa, Jairam R; Celum, Connie; Jerome, Keith R; Wald, Anna; Koelle, David M

    2017-10-01

    Understanding the variability in circulating herpes simplex virus type 2 (HSV-2) genomic sequences is critical to the development of HSV-2 vaccines. Genital lesion swabs containing ≥ 10 7 log 10 copies HSV DNA collected from Africa, the USA, and South America underwent next-generation sequencing, followed by K-mer based filtering and de novo genomic assembly. Sites of heterogeneity within coding regions in unique long and unique short (U L _U S ) regions were identified. Phylogenetic trees were created using maximum likelihood reconstruction. Among 46 samples from 38 persons, 1468 intragenic base-pair substitutions were identified. The maximum nucleotide distance between strains for concatenated U L_ U S segments was 0.4%. Phylogeny did not reveal geographic clustering. The most variable proteins had non-synonymous mutations in < 3% of amino acids. Unenriched HSV-2 DNA can undergo next-generation sequencing to identify intragenic variability. The use of clinical swabs for sequencing expands the information that can be gathered directly from these specimens. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. Detection of cystic fibrosis mutations in a GeneChip{trademark} assay format

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Miyada, C.G.; Cronin, M.T.; Kim, S.M.

    1994-09-01

    We are developing assays for the detection of cystic fibrosis mutations based on DNA hybridization. A DNA sample is amplified by PCR, labeled by incorporating a fluorescein-tagged dNTP, enzymatically treated to produce smaller fragments and hybridized to a series of short (13-16 bases) oligonucleotides synthesized on a glass surface via photolithography. The hybrids are detected by eqifluorescence and mutations are identified by the specific pattern of hybridization. In a GeneChip assay, the chip surface is composed of a series of subarrays, each being specific for a particular mutation. Each subarray is further subdivided into a series of probes (40 total),more » half based on the mutant sequence and the remainder based on the wild-type sequence. For each of the subarrays, there is a redundancy in the number of probes that should hybridize to either a wild-type or a mutant target. The multiple probe strategy provides sequence information for a short five base region overlapping the mutation site. In addition, homozygous wild-type and mutant as well as heterozygous samples are each identified by a specific pattern of hybridization. The small size of each probe feature (250 x 250 {mu}m{sup 2}) permits the inclusion of additional probes required to generate sequence information by hybridization.« less

  13. Genetic diversity of Clostridium perfringens type A isolates from animals, food poisoning outbreaks and sludge

    PubMed Central

    Johansson, Anders; Aspan, Anna; Bagge, Elisabeth; Båverud, Viveca; Engström, Björn E; Johansson, Karl-Erik

    2006-01-01

    Background Clostridium perfringens, a serious pathogen, causes enteric diseases in domestic animals and food poisoning in humans. The epidemiological relationship between C. perfringens isolates from the same source has previously been investigated chiefly by pulsed-field gel electrophoresis (PFGE). In this study the genetic diversity of C. perfringens isolated from various animals, from food poisoning outbreaks and from sludge was investigated. Results We used PFGE to examine the genetic diversity of 95 C. perfringens type A isolates from eight different sources. The isolates were also examined for the presence of the beta2 toxin gene (cpb2) and the enterotoxin gene (cpe). The cpb2 gene from the 28 cpb2-positive isolates was also partially sequenced (519 bp, corresponding to positions 188 to 706 in the consensus cpb2 sequence). The results of PFGE revealed a wide genetic diversity among the C. perfringens type A isolates. The genetic relatedness of the isolates ranged from 58 to 100% and 56 distinct PFGE types were identified. Almost all clusters with similar patterns comprised isolates with a known epidemiological correlation. Most of the isolates from pig, horse and sheep carried the cpb2 gene. All isolates originating from food poisoning outbreaks carried the cpe gene and three of these also carried cpb2. Two evolutionary different populations were identified by sequence analysis of the partially sequenced cpb2 genes from our study and cpb2 sequences previously deposited in GenBank. Conclusion As revealed by PFGE, there was a wide genetic diversity among C. perfringens isolates from different sources. Epidemiologically related isolates showed a high genetic similarity, as expected, while isolates with no obvious epidemiological relationship expressed a lesser degree of genetic similarity. The wide diversity revealed by PFGE was not reflected in the 16S rRNA sequences, which had a considerable degree of sequence similarity. Sequence comparison of the partially sequenced cpb2 gene revealed two genetically different populations. This is to our knowledge the first study in which the genetic diversity of C. perfringens isolates both from different animals species, from food poisoning outbreaks and from sludge has been investigated. PMID:16737528

  14. De novo transcriptome sequencing reveals a considerable bias in the incidence of simple sequence repeats towards the downstream of 'Pre-miRNAs' of black pepper.

    PubMed

    Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

    2013-01-01

    Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of '43 pre-miRNA candidates bearing different types of SSR motifs'. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted 'pre-miRNA candidates bearing SSRs'. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted 'pre-miRNA candidates'. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of 'tandem repeats' in miRNAs.

  15. De novo Transcriptome Sequencing Reveals a Considerable Bias in the Incidence of Simple Sequence Repeats towards the Downstream of ‘Pre-miRNAs’ of Black Pepper

    PubMed Central

    Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

    2013-01-01

    Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of ‘43 pre-miRNA candidates bearing different types of SSR motifs’. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted ‘pre-miRNA candidates bearing SSRs’. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted ‘pre-miRNA candidates’. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of ‘tandem repeats’ in miRNAs. PMID:23469176

  16. Multilocus Sequence Typing Has Better Discriminatory Ability for Typing Vibrio cholerae than Does Pulsed-Field Gel Electrophoresis and Provides a Measure of Phylogenetic Relatedness

    PubMed Central

    Kotetishvili, Mamuka; Stine, O. Colin; Chen, Yuansha; Kreger, Arnold; Sulakvelidze, Alexander; Sozhamannan, Shanmuga; Morris, Jr., J. Glenn

    2003-01-01

    Twenty-two Vibrio cholerae isolates, including some from “epidemic” (O1 and O139) and “nonepidemic” serogroups, were characterized by pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST) by using three housekeeping genes, gyrB, pgm, and recA; sequence data were also obtained for the virulence-associated genes tcpA, ctxA, and ctxB. Even with the small number of loci used, MLST had better discriminatory ability than did PFGE. On MLST analysis, there was clear clustering of epidemic serogroups; much greater diversity was seen among tcpA- and ctxAB-positive V. cholerae strains from other, nonepidemic serogroups, with a number of tcpA and ctxAB alleles identified. PMID:12734277

  17. Outbreak of Vibrio parahaemolyticus Sequence Type 120, Peru, 2009.

    PubMed

    Gonzalez-Escalona, Narjol; Gavilan, Ronnie G; Toro, Magaly; Zamudio, Maria L; Martinez-Urtaza, Jaime

    2016-07-01

    In 2009, an outbreak of Vibrio parahaemolyticus occurred in Piura, Cajamarca, Lambayeque, and Lima, Peru. Whole-genome sequencing of clinical and environmental samples from the outbreak revealed a new V. parahaemolyticus clone. All the isolates identified belonged to a single clonal complex described exclusively in Asia before its emergence in Peru.

  18. Outbreak of Vibrio parahaemolyticus Sequence Type 120, Peru, 2009

    PubMed Central

    Gonzalez-Escalona, Narjol; Gavilan, Ronnie G.; Toro, Magaly; Zamudio, Maria L.

    2016-01-01

    In 2009, an outbreak of Vibrio parahaemolyticus occurred in Piura, Cajamarca, Lambayeque, and Lima, Peru. Whole-genome sequencing of clinical and environmental samples from the outbreak revealed a new V. parahaemolyticus clone. All the isolates identified belonged to a single clonal complex described exclusively in Asia before its emergence in Peru. PMID:27315090

  19. Instructional Models in Methods Courses. Occasional Paper No. 7.

    ERIC Educational Resources Information Center

    Clubok, Arthur, Ed.

    Instructional models are distinct sets of sequenced teaching actions created to promote student achievement of selected learning outcomes. They identify: (1) the type of information to be presented to students; (2) the sequence in which it should be presented; (3) the teaching tactics that stimulate necessary cognitive learning processes; and (4)…

  20. Gene sequence analyses and other DNA-based methods for yeast species recognition

    USDA-ARS?s Scientific Manuscript database

    DNA sequence analyses, as well as other DNA-based methodologies, have transformed the way in which yeasts are identified. The focus of this chapter will be on the resolution of species using various types of DNA comparisons. In other chapters in this book, Rozpedowska, Piškur and Wolfe discuss mul...

  1. Expressed sequences tags of the anther smut fungus, Microbotryum violaceum, identify mating and pathogenicity genes

    PubMed Central

    Yockteng, Roxana; Marthey, Sylvain; Chiapello, Hélène; Gendrault, Annie; Hood, Michael E; Rodolphe, François; Devier, Benjamin; Wincker, Patrick; Dossat, Carole; Giraud, Tatiana

    2007-01-01

    Background The basidiomycete fungus Microbotryum violaceum is responsible for the anther-smut disease in many plants of the Caryophyllaceae family and is a model in genetics and evolutionary biology. Infection is initiated by dikaryotic hyphae produced after the conjugation of two haploid sporidia of opposite mating type. This study describes M. violaceum ESTs corresponding to nuclear genes expressed during conjugation and early hyphal production. Results A normalized cDNA library generated 24,128 sequences, which were assembled into 7,765 unique genes; 25.2% of them displayed significant similarity to annotated proteins from other organisms, 74.3% a weak similarity to the same set of known proteins, and 0.5% were orphans. We identified putative pheromone receptors and genes that in other fungi are involved in the mating process. We also identified many sequences similar to genes known to be involved in pathogenicity in other fungi. The M. violaceum EST database, MICROBASE, is available on the Web and provides access to the sequences, assembled contigs, annotations and programs to compare similarities against MICROBASE. Conclusion This study provides a basis for cloning the mating type locus, for further investigation of pathogenicity genes in the anther smut fungi, and for comparative genomics. PMID:17692127

  2. Transcriptomic analysis of Siberian ginseng (Eleutherococcus senticosus) to discover genes involved in saponin biosynthesis.

    PubMed

    Hwang, Hwan-Su; Lee, Hyoshin; Choi, Yong Eui

    2015-03-14

    Eleutherococcus senticosus, Siberian ginseng, is a highly valued woody medicinal plant belonging to the family Araliaceae. E. senticosus produces a rich variety of saponins such as oleanane-type, noroleanane-type, 29-hydroxyoleanan-type, and lupane-type saponins. Genomic or transcriptomic approaches have not been used to investigate the saponin biosynthetic pathway in this plant. In this study, de novo sequencing was performed to select candidate genes involved in the saponin biosynthetic pathway. A half-plate 454 pyrosequencing run produced 627,923 high-quality reads with an average sequence length of 422 bases. De novo assembly generated 72,811 unique sequences, including 15,217 contigs and 57,594 singletons. Approximately 48,300 (66.3%) unique sequences were annotated using BLAST similarity searches. All of the mevalonate pathway genes for saponin biosynthesis starting from acetyl-CoA were isolated. Moreover, 206 reads of cytochrome P450 (CYP) and 145 reads of uridine diphosphate glycosyltransferase (UGT) sequences were isolated. Based on methyl jasmonate (MeJA) treatment and real-time PCR (qPCR) analysis, 3 CYPs and 3 UGTs were finally selected as candidate genes involved in the saponin biosynthetic pathway. The identified sequences associated with saponin biosynthesis will facilitate the study of the functional genomics of saponin biosynthesis and genetic engineering of E. senticosus.

  3. Multilocus sequence typing and pulsed-field gel electrophoresis analysis of Oenococcus oeni from different wine-producing regions of China.

    PubMed

    Wang, Tao; Li, Hua; Wang, Hua; Su, Jing

    2015-04-16

    The present study established a typing method with NotI-based pulsed-field gel electrophoresis (PFGE) and stress response gene schemed multilocus sequence typing (MLST) for 55 Oenococcus oeni strains isolated from six individual regions in China and two model strains PSU-1 (CP000411) and ATCC BAA-1163 (AAUV00000000). Seven stress response genes, cfa, clpL, clpP, ctsR, mleA, mleP and omrA, were selected for MLST testing, and positive selective pressure was detected for these genes. Furthermore, both methods separated the strains into two clusters. The PFGE clusters are correlated with the region, whereas the sequence types (STs) formed by the MLST confirm the two clusters identified by PFGE. In addition, the population structure was a mixture of evolutionary pathways, and the strains exhibited both clonal and panmictic characteristics. Copyright © 2015 Elsevier B.V. All rights reserved.

  4. CRISPRTarget

    PubMed Central

    Biswas, Ambarish; Gagnon, Joshua N.; Brouns, Stan J.J.; Fineran, Peter C.; Brown, Chris M.

    2013-01-01

    The bacterial and archaeal CRISPR/Cas adaptive immune system targets specific protospacer nucleotide sequences in invading organisms. This requires base pairing between processed CRISPR RNA and the target protospacer. For type I and II CRISPR/Cas systems, protospacer adjacent motifs (PAM) are essential for target recognition, and for type III, mismatches in the flanking sequences are important in the antiviral response. In this study, we examine the properties of each class of CRISPR. We use this information to provide a tool (CRISPRTarget) that predicts the most likely targets of CRISPR RNAs (http://bioanalysis.otago.ac.nz/CRISPRTarget). This can be used to discover targets in newly sequenced genomic or metagenomic data. To test its utility, we discover features and targets of well-characterized Streptococcus thermophilus and Sulfolobus solfataricus type II and III CRISPR/Cas systems. Finally, in Pectobacterium species, we identify new CRISPR targets and propose a model of temperate phage exposure and subsequent inhibition by the type I CRISPR/Cas systems. PMID:23492433

  5. The genetic structure of the A mating-type locus of Lentinula edodes.

    PubMed

    Au, Chun Hang; Wong, Man Chun; Bao, Dapeng; Zhang, Meiyan; Song, Chunyan; Song, Wenhua; Law, Patrick Tik Wan; Kües, Ursula; Kwan, Hoi Shan

    2014-02-10

    The Shiitake mushroom, Lentinula edodes (Berk.) Pegler is a tetrapolar basidiomycete with two unlinked mating-type loci, commonly called the A and B loci. Identifying the mating-types in shiitake is important for enhancing the breeding and cultivation of this economically-important edible mushroom. Here, we identified the A mating-type locus from the first draft genome sequence of L. edodes and characterized multiple alleles from different monokaryotic strains. Two intron-length polymorphism markers were developed to facilitate rapid molecular determination of A mating-type. L. edodes sequences were compared with those of known tetrapolar and bipolar basidiomycete species. The A mating-type genes are conserved at the homeodomain region across the order Agaricales. However, we observed unique genomic organization of the locus in L. edodes which exhibits atypical gene order and multiple repetitive elements around its A locus. To our knowledge, this is the first known exception among Homobasidiomycetes, in which the mitochondrial intermediate peptidase (mip) gene is not closely linked to A locus. Copyright © 2013 Elsevier B.V. All rights reserved.

  6. Genetic analysis of Fasciola isolates from cattle in Korea based on second internal transcribed spacer (ITS-2) sequence of nuclear ribosomal DNA.

    PubMed

    Choe, Se-Eun; Nguyen, Thuy Thi-Dieu; Kang, Tae-Gyu; Kweon, Chang-Hee; Kang, Seung-Won

    2011-09-01

    Nuclear ribosomal DNA sequence of the second internal transcribed spacer (ITS-2) has been used efficiently to identify the liver fluke species collected from different hosts and various geographic regions. ITS-2 sequences of 19 Fasciola samples collected from Korean native cattle were determined and compared. Sequence comparison including ITS-2 sequences of isolates from this study and reference sequences from Fasciola hepatica and Fasciola gigantica and intermediate Fasciola in Genbank revealed seven identical variable sites of investigated isolates. Among 19 samples, 12 individuals had ITS-2 sequences completely identical to that of pure F. hepatica, five possessed the sequences identical to F. gigantica type, whereas two shared the sequence of both F. hepatica and F. gigantica. No variations in length and nucleotide composition of ITS-2 sequence were observed within isolates that belonged to F. hepatica or F. gigantica. At the position of 218, five Fasciola containing a single-base substitution (C>T) formed a distinct branch inside the F. gigantica-type group which was similar to those of Asian-origin isolates. The phylogenetic tree of the Fasciola spp. based on complete ITS-2 sequences from this study and other representative isolates in different locations clearly showed that pure F. hepatica, F. gigantica type and intermediate Fasciola were observed. The result also provided additional genetic evidence for the existence of three forms of Fasciola isolated from native cattle in Korea by genetic approach using ITS-2 sequence.

  7. BETASEQ: a powerful novel method to control type-I error inflation in partially sequenced data for rare variant association testing.

    PubMed

    Yan, Song; Li, Yun

    2014-02-15

    Despite its great capability to detect rare variant associations, next-generation sequencing is still prohibitively expensive when applied to large samples. In case-control studies, it is thus appealing to sequence only a subset of cases to discover variants and genotype the identified variants in controls and the remaining cases under the reasonable assumption that causal variants are usually enriched among cases. However, this approach leads to inflated type-I error if analyzed naively for rare variant association. Several methods have been proposed in recent literature to control type-I error at the cost of either excluding some sequenced cases or correcting the genotypes of discovered rare variants. All of these approaches thus suffer from certain extent of information loss and thus are underpowered. We propose a novel method (BETASEQ), which corrects inflation of type-I error by supplementing pseudo-variants while keeps the original sequence and genotype data intact. Extensive simulations and real data analysis demonstrate that, in most practical situations, BETASEQ leads to higher testing powers than existing approaches with guaranteed (controlled or conservative) type-I error. BETASEQ and associated R files, including documentation, examples, are available at http://www.unc.edu/~yunmli/betaseq

  8. The spa typing of methicillin-resistant Staphylococcus aureus isolates by High Resolution Melting (HRM) analysis.

    PubMed

    Fasihi, Yasser; Fooladi, Saba; Mohammadi, Mohammad Ali; Emaneini, Mohammad; Kalantar-Neyestanaki, Davood

    2017-09-06

    Molecular typing is an important tool for control and prevention of infection. A suitable molecular typing method for epidemiological investigation must be easy to perform, highly reproducible, inexpensive, rapid and easy to interpret. In this study, two molecular typing methods including the conventional PCR-sequencing method and high resolution melting (HRM) analysis were used for staphylococcal protein A (spa) typing of 30 Methicillin-resistant Staphylococcus aureus (MRSA) isolates recovered from clinical samples. Based on PCR-sequencing method results, 16 different spa types were identified among the 30 MRSA isolates. Among the 16 different spa types, 14 spa types separated by HRM method. Two spa types including t4718 and t2894 were not separated from each other. According to our results, spa typing based on HRM analysis method is very rapid, easy to perform and cost-effective, but this method must be standardized for different regions, spa types, and real-time machinery.

  9. Gender Identification in Date Palm Using Molecular Markers.

    PubMed

    Awan, Faisal Saeed; Maryam; Jaskani, Muhammad J; Sadia, Bushra

    2017-01-01

    Breeding of date palm is complicated because of its long life cycle and heterozygous nature. Sexual propagation of date palm does not produce true-to-type plants. Sex of date palms cannot be identified until the first flowering stage. Molecular markers such as random amplified polymorphic DNA (RAPD), sequence-characterized amplified regions (SCAR), and simple sequence repeats (SSR) have successfully been used to identify the sex-linked loci in the plant genome and to isolate the corresponding genes. This chapter highlights the use of three molecular markers including RAPD, SCAR, and SSR to identify the gender of date palm seedlings.

  10. Effects of canine parvovirus strain variations on diagnostic test results and clinical management of enteritis in dogs.

    PubMed

    Markovich, Jessica E; Stucker, Karla M; Carr, Alaina H; Harbison, Carole E; Scarlett, Janet M; Parrish, Colin R

    2012-07-01

    To estimate the prevalence of canine parvovirus (CPV) strains among dogs with enteritis admitted to a referral hospital in the southwestern United States during an 11-month period and to compare diagnostic test results, disease severity, and patient outcome among CPV strains. Prospective observational study. 72 dogs with histories and clinical signs of parvoviral enteritis. For each dog, a fecal sample or rectal swab specimen was evaluated for CPV antigen via an ELISA. Subsequently, fecal samples (n = 42 dogs) and pharyngeal swab specimens (16) were obtained and tested for CPV antigen via an ELISA and CPV DNA via a PCR assay. For specimens with CPV-positive results via PCR assay, genetic sequencing was performed to identify the CPV strain. 56 dogs tested positive for CPV via ELISA or PCR assay. For 42 fecal samples tested via both ELISA and PCR assay, 27 had positive results via both assays, whereas 6 had positive PCR assay results only. Ten pharyngeal swab specimens yielded positive PCR assay results. Genetic sequencing was performed on 34 fecal or pharyngeal swab specimens that had CPV-positive PCR assay results; 25 (73.5%) were identified as containing CPV type-2c, and 9 (26.5%) were identified as containing CPV type-2b. No association was found between CPV strain and disease severity or clinical outcome. CPV type-2b and CPV type-2c posed similar health risks for dogs; therefore, genetic sequencing of CPV does not appear necessary for clinical management of infected patients. The diagnostic tests used could detect CPV type-2c.

  11. Rapid identification of fungal pathogens in BacT/ALERT, BACTEC, and BBL MGIT media using polymerase chain reaction and DNA sequencing of the internal transcribed spacer regions.

    PubMed

    Pryce, Todd M; Palladino, Silvano; Price, Diane M; Gardam, Dianne J; Campbell, Peter B; Christiansen, Keryn J; Murray, Ronan J

    2006-04-01

    We report a direct polymerase chain reaction/sequence (d-PCRS)-based method for the rapid identification of clinically significant fungi from 5 different types of commercial broth enrichment media inoculated with clinical specimens. Media including BacT/ALERT FA (BioMérieux, Marcy l'Etoile, France) (n = 87), BACTEC Plus Aerobic/F (Becton Dickinson, Microbiology Systems, Sparks, MD) (n = 16), BACTEC Peds Plus/F (Becton Dickinson) (n = 15), BACTEC Lytic/10 Anaerobic/F (Becton Dickinson) (n = 11) bottles, and BBL MGIT (Becton Dickinson) (n = 11) were inoculated with specimens from 138 patients. A universal DNA extraction method was used combining a novel pretreatment step to remove PCR inhibitors with a column-based DNA extraction kit. Target sequences in the noncoding internal transcribed spacer regions of the rRNA gene were amplified by PCR and sequenced using a rapid (24 h) automated capillary electrophoresis system. Using sequence alignment software, fungi were identified by sequence similarity with sequences derived from isolates identified by upper-level reference laboratories or isolates defined as ex-type strains. We identified Candida albicans (n = 14), Candida parapsilosis (n = 8), Candida glabrata (n = 7), Candida krusei (n = 2), Scedosporium prolificans (n = 4), and 1 each of Candida orthopsilosis, Candida dubliniensis, Candida kefyr, Candida tropicalis, Candida guilliermondii, Saccharomyces cerevisiae, Cryptococcus neoformans, Aspergillus fumigatus, Histoplasma capsulatum, and Malassezia pachydermatis by d-PCRS analysis. All d-PCRS identifications from positive broths were in agreement with the final species identification of the isolates grown from subculture. Earlier identification of fungi using d-PCRS may facilitate prompt and more appropriate antifungal therapy.

  12. Comparative Analysis of the Orphan CRISPR2 Locus in 242 Enterococcus faecalis Strains

    PubMed Central

    Hullahalli, Karthik; Rodrigues, Marinelle; Schmidt, Brendan D.; Li, Xiang; Bhardwaj, Pooja; Palmer, Kelli L.

    2015-01-01

    Clustered, Regularly Interspaced Short Palindromic Repeats and their associated Cas proteins (CRISPR-Cas) provide prokaryotes with a mechanism for defense against mobile genetic elements (MGEs). A CRISPR locus is a molecular memory of MGE encounters. It contains an array of short sequences, called spacers, that generally have sequence identity to MGEs. Three different CRISPR loci have been identified among strains of the opportunistic pathogen Enterococcus faecalis. CRISPR1 and CRISPR3 are associated with the cas genes necessary for blocking MGEs, but these loci are present in only a subset of E. faecalis strains. The orphan CRISPR2 lacks cas genes and is ubiquitous in E. faecalis, although its spacer content varies from strain to strain. Because CRISPR2 is a variable locus occurring in all E. faecalis, comparative analysis of CRISPR2 sequences may provide information about the clonality of E. faecalis strains. We examined CRISPR2 sequences from 228 E. faecalis genomes in relationship to subspecies phylogenetic lineages (sequence types; STs) determined by multilocus sequence typing (MLST), and to a genome phylogeny generated for a representative 71 genomes. We found that specific CRISPR2 sequences are associated with specific STs and with specific branches on the genome tree. To explore possible applications of CRISPR2 analysis, we evaluated 14 E. faecalis bloodstream isolates using CRISPR2 analysis and MLST. CRISPR2 analysis identified two groups of clonal strains among the 14 isolates, an assessment that was confirmed by MLST. CRISPR2 analysis was also used to accurately predict the ST of a subset of isolates. We conclude that CRISPR2 analysis, while not a replacement for MLST, is an inexpensive method to assess clonality among E. faecalis isolates, and can be used in conjunction with MLST to identify recombination events occurring between STs. PMID:26398194

  13. Identification and characterization of unrecognized viruses in stool samples of non-polio acute flaccid paralysis children by simplified VIDISCA.

    PubMed

    Shaukat, Shahzad; Angez, Mehar; Alam, Muhammad Masroor; Jebbink, Maarten F; Deijs, Martin; Canuti, Marta; Sharif, Salmaan; de Vries, Michel; Khurshid, Adnan; Mahmood, Tariq; van der Hoek, Lia; Zaidi, Syed Sohail Zahoor

    2014-08-12

    The use of sequence independent methods combined with next generation sequencing for identification purposes in clinical samples appears promising and exciting results have been achieved to understand unexplained infections. One sequence independent method, Virus Discovery based on cDNA Amplified Fragment Length Polymorphism (VIDISCA) is capable of identifying viruses that would have remained unidentified in standard diagnostics or cell cultures. VIDISCA is normally combined with next generation sequencing, however, we set up a simplified VIDISCA which can be used in case next generation sequencing is not possible. Stool samples of 10 patients with unexplained acute flaccid paralysis showing cytopathic effect in rhabdomyosarcoma cells and/or mouse cells were used to test the efficiency of this method. To further characterize the viruses, VIDISCA-positive samples were amplified and sequenced with gene specific primers. Simplified VIDISCA detected seven viruses (70%) and the proportion of eukaryotic viral sequences from each sample ranged from 8.3 to 45.8%. Human enterovirus EV-B97, EV-B100, echovirus-9 and echovirus-21, human parechovirus type-3, human astrovirus probably a type-3/5 recombinant, and tetnovirus-1 were identified. Phylogenetic analysis based on the VP1 region demonstrated that the human enteroviruses are more divergent isolates circulating in the community. Our data support that a simplified VIDISCA protocol can efficiently identify unrecognized viruses grown in cell culture with low cost, limited time without need of advanced technical expertise. Also complex data interpretation is avoided thus the method can be used as a powerful diagnostic tool in limited resources. Redesigning the routine diagnostics might lead to additional detection of previously undiagnosed viruses in clinical samples of patients.

  14. Novel type of VanB2 teicoplanin-resistant hospital-associated Enterococcus faecium.

    PubMed

    Santona, Antonella; Paglietti, Bianca; Al-Qahtani, Ahmed A; Bohol, Marie Fe F; Senok, Abiola; Deligios, Massimo; Rubino, Salvatore; Al-Ahdal, Mohammed N

    2014-08-01

    Seven high-risk clones of vancomycin-resistant Enterococcus faecium (VREF) belonging to clonal complex 17 were identified using multilocus sequence typing (MLST) among clinical isolates from Saudi Arabia. Among these isolates, a new hospital-associated sequence type (ST795), VanB(2)-type teicoplanin-resistant strain was detected. Its unusual phenotype resulted from a new combination of mutations in the ddl, vanS and vanW genes, which confirmed the trend of evolution in VanB-type resistance. Furthermore, characteristics of adaptation and persistence in the hospital environment of ST795 were emphasised by the presence of genes and clusters recognised to be specific for hospital-associated VREF. Copyright © 2014 Elsevier B.V. and the International Society of Chemotherapy. All rights reserved.

  15. Cell type discovery using single-cell transcriptomics: implications for ontological representation.

    PubMed

    Aevermann, Brian D; Novotny, Mark; Bakken, Trygve; Miller, Jeremy A; Diehl, Alexander D; Osumi-Sutherland, David; Lasken, Roger S; Lein, Ed S; Scheuermann, Richard H

    2018-05-01

    Cells are fundamental function units of multicellular organisms, with different cell types playing distinct physiological roles in the body. The recent advent of single-cell transcriptional profiling using RNA sequencing is producing 'big data', enabling the identification of novel human cell types at an unprecedented rate. In this review, we summarize recent work characterizing cell types in the human central nervous and immune systems using single-cell and single-nuclei RNA sequencing, and discuss the implications that these discoveries are having on the representation of cell types in the reference Cell Ontology (CL). We propose a method, based on random forest machine learning, for identifying sets of necessary and sufficient marker genes, which can be used to assemble consistent and reproducible cell type definitions for incorporation into the CL. The representation of defined cell type classes and their relationships in the CL using this strategy will make the cell type classes being identified by high-throughput/high-content technologies findable, accessible, interoperable and reusable (FAIR), allowing the CL to serve as a reference knowledgebase of information about the role that distinct cellular phenotypes play in human health and disease.

  16. Labeled nucleotide phosphate (NP) probes

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

    2009-02-03

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  17. G to A substitution in 5{prime} donor splice site of introns 18 and 48 of COL1A1 gene of type I collagen results in different splicing alternatives in osteogenesis imperfecta type I cell strains

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Willing, M.; Deschenes, S.

    We have identified a G to A substitution in the 5{prime} donor splice site of intron 18 of one COL1A1 allele in two unrelated families with osteogenesis imperfecta (OI) type I. A third OI type I family has a G to A substitution at the identical position in intron 48 of one COL1A1 allele. Both mutations abolish normal splicing and lead to reduced steady-state levels of mRNA from the mutant COL1A1 allele. The intron 18 mutation leads to both exon 18 skipping in the mRNA and to utilization of a single alternative splice site near the 3{prime} end of exonmore » 18. The latter results in deletion of the last 8 nucleotides of exon 18 from the mRNA, a shift in the translational reading-frame, and the creation of a premature termination codon in exon 19. Of the potential alternative 5{prime} splice sites in exon 18 and intron 18, the one utilized has a surrounding nucleotide sequence which most closely resembles that of the natural splice site. Although a G to A mutation was detected at the identical position in intron 48 of one COL1A1 allele in another OI type I family, nine complex alternative splicing patterns were identified by sequence analysis of cDNA clones derived from fibroblast mRNA from this cell strain. All result in partial or complete skipping of exon 48, with in-frame deletions of portions of exons 47 and/or 49. The different patterns of RNA splicing were not explained by their sequence homology with naturally occuring 5{prime} splice sites, but rather by recombination between highly homologous exon sequences, suggesting that we may not have identified the major splicing alternative(s) in this cell strain. Both G to A mutations result in decreased production of type I collagen, the common biochemical correlate of OI type I.« less

  18. Identifying User Interaction Patterns in E-Textbooks

    PubMed Central

    Saarinen, Santeri; Turunen, Markku; Mikkilä-Erdmann, Mirjamaija; Erdmann, Norbert; Yrjänäinen, Sari; Keskinen, Tuuli

    2015-01-01

    We introduce a new architecture for e-textbooks which contains two navigational aids: an index and a concept map. We report results from an evaluation in a university setting with 99 students. The interaction sequences of the users were captured during the user study. We found several clusters of user interaction types in our data. Three separate user types were identified based on the interaction sequences: passive user, term clicker, and concept map user. We also discovered that with the concept map interface users started to interact with the application significantly sooner than with the index interface. Overall, our findings suggest that analysis of interaction patterns allows deeper insights into the use of e-textbooks than is afforded by summative evaluation. PMID:26605377

  19. Identifying User Interaction Patterns in E-Textbooks.

    PubMed

    Saarinen, Santeri; Heimonen, Tomi; Turunen, Markku; Mikkilä-Erdmann, Mirjamaija; Raisamo, Roope; Erdmann, Norbert; Yrjänäinen, Sari; Keskinen, Tuuli

    2015-01-01

    We introduce a new architecture for e-textbooks which contains two navigational aids: an index and a concept map. We report results from an evaluation in a university setting with 99 students. The interaction sequences of the users were captured during the user study. We found several clusters of user interaction types in our data. Three separate user types were identified based on the interaction sequences: passive user, term clicker, and concept map user. We also discovered that with the concept map interface users started to interact with the application significantly sooner than with the index interface. Overall, our findings suggest that analysis of interaction patterns allows deeper insights into the use of e-textbooks than is afforded by summative evaluation.

  20. Application of artificial neural networks to identify equilibration in computer simulations

    NASA Astrophysics Data System (ADS)

    Leibowitz, Mitchell H.; Miller, Evan D.; Henry, Michael M.; Jankowski, Eric

    2017-11-01

    Determining which microstates generated by a thermodynamic simulation are representative of the ensemble for which sampling is desired is a ubiquitous, underspecified problem. Artificial neural networks are one type of machine learning algorithm that can provide a reproducible way to apply pattern recognition heuristics to underspecified problems. Here we use the open-source TensorFlow machine learning library and apply it to the problem of identifying which hypothetical observation sequences from a computer simulation are “equilibrated” and which are not. We generate training populations and test populations of observation sequences with embedded linear and exponential correlations. We train a two-neuron artificial network to distinguish the correlated and uncorrelated sequences. We find that this simple network is good enough for > 98% accuracy in identifying exponentially-decaying energy trajectories from molecular simulations.

  1. Insertion sequence ISRP10 inactivation of the oprD gene in imipenem-resistant Pseudomonas aeruginosa clinical isolates.

    PubMed

    Sun, Qinghui; Ba, Zhaofen; Wu, Guoying; Wang, Wei; Lin, Shuxiang; Yang, Hongjiang

    2016-05-01

    Carbapenem resistance mechanisms were investigated in 32 imipenem-resistant Pseudomonas aeruginosa clinical isolates recovered from hospitalised children. Sequence analysis revealed that 31 of the isolates had an insertion sequence element ISRP10 disrupting the porin gene oprD, demonstrating that ISRP10 inactivation of oprD conferred imipenem resistance in the majority of the isolates. Multilocus sequence typing (MLST) was used to discriminate the isolates. In total, 11 sequence types (STs) were identified including 3 novel STs, and 68.3% (28/41) of the tested strains were characterised as clone ST253. In combination with random amplified polymorphic DNA (RAPD) analysis, the imipenem-resistant isolates displayed a relatively high degree of genetic variability and were unlikely associated with nosocomial infections. Copyright © 2016 Elsevier B.V. and the International Society of Chemotherapy. All rights reserved.

  2. Identification of a preferred substrate peptide for transglutaminase 3 and detection of in situ activity in skin and hair follicles.

    PubMed

    Yamane, Asaka; Fukui, Mina; Sugimura, Yoshiaki; Itoh, Miho; Alea, Mileidys Perez; Thomas, Vincent; El Alaoui, Said; Akiyama, Masashi; Hitomi, Kiyotaka

    2010-09-01

    Transglutaminases (TGases) are a family of enzymes that catalyze cross-linking reactions between proteins. During epidermal differentiation, these enzymatic reactions are essential for formation of the cornified envelope, which consists of cross-linked structural proteins. Two main transglutaminases isoforms, epidermal-type (TGase 3) and keratinocyte-type (TGase 1), are cooperatively involved in this process of differentiating keratinocytes. Information regarding their substrate preference is of great importance to determine the functional role of these isozymes and clarify their possible co-operative action. Thus far, we have identified highly reactive peptide sequences specifically recognized by TGases isozymes such as TGase 1, TGase 2 (tissue-type isozyme) and the blood coagulation isozyme, Factor XIII. In this study, several substrate peptide sequences for human TGase 3 were screened from a phage-displayed peptide library. The preferred substrate sequences for TGase 3 were selected and evaluated as fusion proteins with mutated glutathione S-transferase. From these studies, a highly reactive and isozyme-specific sequence (E51) was identified. Furthermore, this sequence was found to be a prominent substrate in the peptide form and was suitable for detection of in situ TGase 3 activity in the mouse epidermis. TGase 3 enzymatic activity was detected in the layers of differentiating keratinocytes and hair follicles with patterns distinct from those of TGase 1. Our findings provide new information on the specific distribution of TGase 3 and constitute a useful tool to clarify its functional role in the epidermis.

  3. Population Structure and Antimicrobial Resistance Profiles of Streptococcus suis Serotype 2 Sequence Type 25 Strains.

    PubMed

    Athey, Taryn B T; Teatero, Sarah; Takamatsu, Daisuke; Wasserscheid, Jessica; Dewar, Ken; Gottschalk, Marcelo; Fittipaldi, Nahuel

    2016-01-01

    Strains of serotype 2 Streptococcus suis are responsible for swine and human infections. Different serotype 2 genetic backgrounds have been defined using multilocus sequence typing (MLST). However, little is known about the genetic diversity within each MLST sequence type (ST). Here, we used whole-genome sequencing to test the hypothesis that S. suis serotype 2 strains of the ST25 lineage are genetically heterogeneous. We evaluated 51 serotype 2 ST25 S. suis strains isolated from diseased pigs and humans in Canada, the United States of America, and Thailand. Whole-genome sequencing revealed numerous large-scale rearrangements in the ST25 genome, compared to the genomes of ST1 and ST28 S. suis strains, which result, among other changes, in disruption of a pilus island locus. We report that recombination and lateral gene transfer contribute to ST25 genetic diversity. Phylogenetic analysis identified two main and distinct Thai and North American clades grouping most strains investigated. These clades also possessed distinct patterns of antimicrobial resistance genes, which correlated with acquisition of different integrative and conjugative elements (ICEs). Some of these ICEs were found to be integrated at a recombination hot spot, previously identified as the site of integration of the 89K pathogenicity island in serotype 2 ST7 S. suis strains. Our results highlight the limitations of MLST for phylogenetic analysis of S. suis, and the importance of lateral gene transfer and recombination as drivers of diversity in this swine pathogen and zoonotic agent.

  4. Population Structure and Antimicrobial Resistance Profiles of Streptococcus suis Serotype 2 Sequence Type 25 Strains

    PubMed Central

    Athey, Taryn B. T.; Teatero, Sarah; Takamatsu, Daisuke; Wasserscheid, Jessica; Dewar, Ken; Gottschalk, Marcelo; Fittipaldi, Nahuel

    2016-01-01

    Strains of serotype 2 Streptococcus suis are responsible for swine and human infections. Different serotype 2 genetic backgrounds have been defined using multilocus sequence typing (MLST). However, little is known about the genetic diversity within each MLST sequence type (ST). Here, we used whole-genome sequencing to test the hypothesis that S. suis serotype 2 strains of the ST25 lineage are genetically heterogeneous. We evaluated 51 serotype 2 ST25 S. suis strains isolated from diseased pigs and humans in Canada, the United States of America, and Thailand. Whole-genome sequencing revealed numerous large-scale rearrangements in the ST25 genome, compared to the genomes of ST1 and ST28 S. suis strains, which result, among other changes, in disruption of a pilus island locus. We report that recombination and lateral gene transfer contribute to ST25 genetic diversity. Phylogenetic analysis identified two main and distinct Thai and North American clades grouping most strains investigated. These clades also possessed distinct patterns of antimicrobial resistance genes, which correlated with acquisition of different integrative and conjugative elements (ICEs). Some of these ICEs were found to be integrated at a recombination hot spot, previously identified as the site of integration of the 89K pathogenicity island in serotype 2 ST7 S. suis strains. Our results highlight the limitations of MLST for phylogenetic analysis of S. suis, and the importance of lateral gene transfer and recombination as drivers of diversity in this swine pathogen and zoonotic agent. PMID:26954687

  5. Gene Discovery through Genomic Sequencing of Brucella abortus

    PubMed Central

    Sánchez, Daniel O.; Zandomeni, Ruben O.; Cravero, Silvio; Verdún, Ramiro E.; Pierrou, Ester; Faccio, Paula; Diaz, Gabriela; Lanzavecchia, Silvia; Agüero, Fernán; Frasch, Alberto C. C.; Andersson, Siv G. E.; Rossetti, Osvaldo L.; Grau, Oscar; Ugalde, Rodolfo A.

    2001-01-01

    Brucella abortus is the etiological agent of brucellosis, a disease that affects bovines and human. We generated DNA random sequences from the genome of B. abortus strain 2308 in order to characterize molecular targets that might be useful for developing immunological or chemotherapeutic strategies against this pathogen. The partial sequencing of 1,899 clones allowed the identification of 1,199 genomic sequence surveys (GSSs) with high homology (BLAST expect value < 10−5) to sequences deposited in the GenBank databases. Among them, 925 represent putative novel genes for the Brucella genus. Out of 925 nonredundant GSSs, 470 were classified in 15 categories based on cellular function. Seven hundred GSSs showed no significant database matches and remain available for further studies in order to identify their function. A high number of GSSs with homology to Agrobacterium tumefaciens and Rhizobium meliloti proteins were observed, thus confirming their close phylogenetic relationship. Among them, several GSSs showed high similarity with genes related to nodule nitrogen fixation, synthesis of nod factors, nodulation protein symbiotic plasmid, and nodule bacteroid differentiation. We have also identified several B. abortus homologs of virulence and pathogenesis genes from other pathogens, including a homolog to both the Shda gene from Salmonella enterica serovar Typhimurium and the AidA-1 gene from Escherichia coli. Other GSSs displayed significant homologies to genes encoding components of the type III and type IV secretion machineries, suggesting that Brucella might also have an active type III secretion machinery. PMID:11159979

  6. Relationships between functional genes in Lactobacillus delbrueckii ssp. bulgaricus isolates and phenotypic characteristics associated with fermentation time and flavor production in yogurt elucidated using multilocus sequence typing.

    PubMed

    Liu, Wenjun; Yu, Jie; Sun, Zhihong; Song, Yuqin; Wang, Xueni; Wang, Hongmei; Wuren, Tuoya; Zha, Musu; Menghe, Bilige; Heping, Zhang

    2016-01-01

    Lactobacillus delbrueckii ssp. bulgaricus (L. bulgaricus) is well known for its worldwide application in yogurt production. Flavor production and acid producing are considered as the most important characteristics for starter culture screening. To our knowledge this is the first study applying functional gene sequence multilocus sequence typing technology to predict the fermentation and flavor-producing characteristics of yogurt-producing bacteria. In the present study, phenotypic characteristics of 35 L. bulgaricus strains were quantified during the fermentation of milk to yogurt and during its subsequent storage; these included fermentation time, acidification rate, pH, titratable acidity, and flavor characteristics (acetaldehyde concentration). Furthermore, multilocus sequence typing analysis of 7 functional genes associated with fermentation time, acid production, and flavor formation was done to elucidate the phylogeny and genetic evolution of the same L. bulgaricus isolates. The results showed that strains significantly differed in fermentation time, acidification rate, and acetaldehyde production. Combining functional gene sequence analysis with phenotypic characteristics demonstrated that groups of strains established using genotype data were consistent with groups identified based on their phenotypic traits. This study has established an efficient and rapid molecular genotyping method to identify strains with good fermentation traits; this has the potential to replace time-consuming conventional methods based on direct measurement of phenotypic traits. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  7. rpoB-Based Identification of Nonpigmented and Late-Pigmenting Rapidly Growing Mycobacteria

    PubMed Central

    Adékambi, Toïdi; Colson, Philippe; Drancourt, Michel

    2003-01-01

    Nonpigmented and late-pigmenting rapidly growing mycobacteria (RGM) are increasingly isolated in clinical microbiology laboratories. Their accurate identification remains problematic because classification is labor intensive work and because new taxa are not often incorporated into classification databases. Also, 16S rRNA gene sequence analysis underestimates RGM diversity and does not distinguish between all taxa. We determined the complete nucleotide sequence of the rpoB gene, which encodes the bacterial β subunit of the RNA polymerase, for 20 RGM type strains. After using in-house software which analyzes and graphically represents variability stretches of 60 bp along the nucleotide sequence, our analysis focused on a 723-bp variable region exhibiting 83.9 to 97% interspecies similarity and 0 to 1.7% intraspecific divergence. Primer pair Myco-F-Myco-R was designed as a tool for both PCR amplification and sequencing of this region for molecular identification of RGM. This tool was used for identification of 63 RGM clinical isolates previously identified at the species level on the basis of phenotypic characteristics and by 16S rRNA gene sequence analysis. Of 63 clinical isolates, 59 (94%) exhibited <2% partial rpoB gene sequence divergence from 1 of 20 species under study and were regarded as correctly identified at the species level. Mycobacterium abscessus and Mycobacterium mucogenicum isolates were clearly distinguished from Mycobacterium chelonae; Mycobacterium mageritense isolates were clearly distinguished from “Mycobacterium houstonense.” Four isolates were not identified at the species level because they exhibited >3% partial rpoB gene sequence divergence from the corresponding type strain; they belonged to three taxa related to M. mucogenicum, Mycobacterium smegmatis, and Mycobacterium porcinum. For M. abscessus and M. mucogenicum, this partial sequence yielded a high genetic heterogeneity within the clinical isolates. We conclude that molecular identification by analysis of the 723-bp rpoB sequence is a rapid and accurate tool for identification of RGM. PMID:14662964

  8. Class 1 integrons characterization and multilocus sequence typing of Salmonella spp. from swine production chains in Chiang Mai and Lamphun provinces, Thailand.

    PubMed

    Boonkhot, Phacharaporn; Tadee, Pakpoom; Yamsakul, Panuwat; Pocharoen, Chairoj; Chokesajjawatee, Nipa; Patchanee, Prapas

    2015-05-01

    Pigs and pork products are well known as an important source of Salmonella, one of the major zoonotic foodborne pathogens. The emergence and spread of antimicrobial resistance is becoming a major public health concern worldwide. Integrons are genetic elements known to have a role in the acquisition and expression of genes conferring antibiotic resistance. This study focuses on the prevalence of class 1 integrons-carrying Salmonella, the genetic diversity of strains of those organisms obtained from swine production chains in Chiang Mai and Lamphun provinces, Thailand, using multilocus sequence typing (MLST) and comparison of genetic diversity of sequence types of Salmonella from this study with pulsotypes identified in previous study. In 175 Salmonella strains, the overall prevalence of class 1 integrons-carrying-Salmonella was 14%. The gene cassettes array pattern "dfrA12-orfF-aadA2" was the most frequently observed. Most of the antimicrobial resistance identified was not associated with related gene cassettes harbored by Salmonella. Six sequence types were generated from 30 randomly selected strains detected by MLST. Salmonella at the human-animal-environment interface was confirmed. Linkages both in the farm to slaughterhouse contamination route and the horizontal transmission of resistance genes were demonstrated. To reduce this problem, the use of antimicrobials in livestock should be controlled by veterinarians. Education and training of food handlers as well as promotion of safe methods of food consumption are important avenues for helping prevent foodborne illness.

  9. Generation of a novel next-generation sequencing-based method for the isolation of new human papillomavirus types.

    PubMed

    Brancaccio, Rosario N; Robitaille, Alexis; Dutta, Sankhadeep; Cuenin, Cyrille; Santare, Daiga; Skenders, Girts; Leja, Marcis; Fischer, Nicole; Giuliano, Anna R; Rollison, Dana E; Grundhoff, Adam; Tommasino, Massimo; Gheit, Tarik

    2018-05-07

    With the advent of new molecular tools, the discovery of new papillomaviruses (PVs) has accelerated during the past decade, enabling the expansion of knowledge about the viral populations that inhabit the human body. Human PVs (HPVs) are etiologically linked to benign or malignant lesions of the skin and mucosa. The detection of HPV types can vary widely, depending mainly on the methodology and the quality of the biological sample. Next-generation sequencing is one of the most powerful tools, enabling the discovery of novel viruses in a wide range of biological material. Here, we report a novel protocol for the detection of known and unknown HPV types in human skin and oral gargle samples using improved PCR protocols combined with next-generation sequencing. We identified 105 putative new PV types in addition to 296 known types, thus providing important information about the viral distribution in the oral cavity and skin. Copyright © 2018. Published by Elsevier Inc.

  10. Nearing saturation of cancer driver gene discovery.

    PubMed

    Hsiehchen, David; Hsieh, Antony

    2018-06-15

    Extensive sequencing efforts of cancer genomes such as The Cancer Genome Atlas (TCGA) have been undertaken to uncover bona fide cancer driver genes which has enhanced our understanding of cancer and revealed therapeutic targets. However, the number of driver gene mutations is bounded, indicating that there must be a point when further sequencing efforts will be excessive. We found that there was a significant positive correlation between sample size and identified driver gene mutations across 33 cancers sequenced by the TCGA, which is expected if additional sequencing is still leading to the identification of more driver genes. However, the rate of new cancer driver genes being discovered with larger samples is declining rapidly. Our analysis provides a general guide for determining which cancer types would likely benefit from additional sequencing efforts, particularly those with relatively high rates of cancer driver gene discovery. Our results argue that past strategies of indiscriminately sequencing as many specimens as possible for all cancer types is becoming inefficient. In addition, without significant investments into applying our knowledge of cancer genomes, we risk sequencing more cancer genomes for the sake of sequencing rather than meaningful patient benefit.

  11. Single-cell RNA sequencing identifies diverse roles of epithelial cells in idiopathic pulmonary fibrosis

    PubMed Central

    Mizuno, Takako; Sridharan, Anusha; Du, Yina; Guo, Minzhe; Wikenheiser-Brokamp, Kathryn A.; Perl, Anne-Karina T.; Funari, Vincent A.; Gokey, Jason J.; Stripp, Barry R.; Whitsett, Jeffrey A.

    2016-01-01

    Idiopathic pulmonary fibrosis (IPF) is a lethal interstitial lung disease characterized by airway remodeling, inflammation, alveolar destruction, and fibrosis. We utilized single-cell RNA sequencing (scRNA-seq) to identify epithelial cell types and associated biological processes involved in the pathogenesis of IPF. Transcriptomic analysis of normal human lung epithelial cells defined gene expression patterns associated with highly differentiated alveolar type 2 (AT2) cells, indicated by enrichment of RNAs critical for surfactant homeostasis. In contrast, scRNA-seq of IPF cells identified 3 distinct subsets of epithelial cell types with characteristics of conducting airway basal and goblet cells and an additional atypical transitional cell that contributes to pathological processes in IPF. Individual IPF cells frequently coexpressed alveolar type 1 (AT1), AT2, and conducting airway selective markers, demonstrating “indeterminate” states of differentiation not seen in normal lung development. Pathway analysis predicted aberrant activation of canonical signaling via TGF-β, HIPPO/YAP, P53, WNT, and AKT/PI3K. Immunofluorescence confocal microscopy identified the disruption of alveolar structure and loss of the normal proximal-peripheral differentiation of pulmonary epithelial cells. scRNA-seq analyses identified loss of normal epithelial cell identities and unique contributions of epithelial cells to the pathogenesis of IPF. The present study provides a rich data source to further explore lung health and disease. PMID:27942595

  12. The complete genome sequence of human adenovirus 84, a highly recombinant new Human mastadenovirus D type with a unique fiber gene.

    PubMed

    Kaján, Győző L; Kajon, Adriana E; Pinto, Alexis Castillo; Bartha, Dániel; Arnberg, Niklas

    2017-10-15

    A novel human adenovirus was isolated from a pediatric case of acute respiratory disease in Panama City, Panama in 2011. The clinical isolate was initially identified as an intertypic recombinant based on hexon and fiber gene sequencing. Based on the analysis of its complete genome sequence, the novel complex recombinant Human mastadenovirus D (HAdV-D) strain was classified into a new HAdV type: HAdV-84, and it was designated Adenovirus D human/PAN/P309886/2011/84[P43H17F84]. HAdV-D types possess usually an ocular or gastrointestinal tropism, and respiratory association is scarcely reported. The virus has a novel fiber type, most closely related to, but still clearly distant from that of HAdV-36. The predicted fiber is hypothesised to bind sialic acid with lower affinity compared to HAdV-37. Bioinformatic analysis of the complete genomic sequence of HAdV-84 revealed multiple homologous recombination events and provided deeper insight into HAdV evolution. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. DNA methylation assessment from human slow- and fast-twitch skeletal muscle fibers

    PubMed Central

    Begue, Gwénaëlle; Raue, Ulrika; Jemiolo, Bozena

    2017-01-01

    A new application of the reduced representation bisulfite sequencing method was developed using low-DNA input to investigate the epigenetic profile of human slow- and fast-twitch skeletal muscle fibers. Successful library construction was completed with as little as 15 ng of DNA, and high-quality sequencing data were obtained with 32 ng of DNA. Analysis identified 143,160 differentially methylated CpG sites across 14,046 genes. In both fiber types, selected genes predominantly expressed in slow or fast fibers were hypomethylated, which was supported by the RNA-sequencing analysis. These are the first fiber type-specific methylation data from human skeletal muscle and provide a unique platform for future research. NEW & NOTEWORTHY This study validates a low-DNA input reduced representation bisulfite sequencing method for human muscle biopsy samples to investigate the methylation patterns at a fiber type-specific level. These are the first fiber type-specific methylation data reported from human skeletal muscle and thus provide initial insight into basal state differences in myosin heavy chain I and IIa muscle fibers among young, healthy men. PMID:28057818

  14. Analysis of Pre-Analytic Factors Affecting the Success of Clinical Next-Generation Sequencing of Solid Organ Malignancies.

    PubMed

    Chen, Hui; Luthra, Rajyalakshmi; Goswami, Rashmi S; Singh, Rajesh R; Roy-Chowdhuri, Sinchita

    2015-08-28

    Application of next-generation sequencing (NGS) technology to routine clinical practice has enabled characterization of personalized cancer genomes to identify patients likely to have a response to targeted therapy. The proper selection of tumor sample for downstream NGS based mutational analysis is critical to generate accurate results and to guide therapeutic intervention. However, multiple pre-analytic factors come into play in determining the success of NGS testing. In this review, we discuss pre-analytic requirements for AmpliSeq PCR-based sequencing using Ion Torrent Personal Genome Machine (PGM) (Life Technologies), a NGS sequencing platform that is often used by clinical laboratories for sequencing solid tumors because of its low input DNA requirement from formalin fixed and paraffin embedded tissue. The success of NGS mutational analysis is affected not only by the input DNA quantity but also by several other factors, including the specimen type, the DNA quality, and the tumor cellularity. Here, we review tissue requirements for solid tumor NGS based mutational analysis, including procedure types, tissue types, tumor volume and fraction, decalcification, and treatment effects.

  15. Are Escherichia coli Pathotypes Still Relevant in the Era of Whole-Genome Sequencing?

    PubMed Central

    Robins-Browne, Roy M.; Holt, Kathryn E.; Ingle, Danielle J.; Hocking, Dianna M.; Yang, Ji; Tauschek, Marija

    2016-01-01

    The empirical and pragmatic nature of diagnostic microbiology has given rise to several different schemes to subtype E.coli, including biotyping, serotyping, and pathotyping. These schemes have proved invaluable in identifying and tracking outbreaks, and for prognostication in individual cases of infection, but they are imprecise and potentially misleading due to the malleability and continuous evolution of E. coli. Whole genome sequencing can be used to accurately determine E. coli subtypes that are based on allelic variation or differences in gene content, such as serotyping and pathotyping. Whole genome sequencing also provides information about single nucleotide polymorphisms in the core genome of E. coli, which form the basis of sequence typing, and is more reliable than other systems for tracking the evolution and spread of individual strains. A typing scheme for E. coli based on genome sequences that includes elements of both the core and accessory genomes, should reduce typing anomalies and promote understanding of how different varieties of E. coli spread and cause disease. Such a scheme could also define pathotypes more precisely than current methods. PMID:27917373

  16. Are Escherichia coli Pathotypes Still Relevant in the Era of Whole-Genome Sequencing?

    PubMed

    Robins-Browne, Roy M; Holt, Kathryn E; Ingle, Danielle J; Hocking, Dianna M; Yang, Ji; Tauschek, Marija

    2016-01-01

    The empirical and pragmatic nature of diagnostic microbiology has given rise to several different schemes to subtype E .coli, including biotyping, serotyping, and pathotyping. These schemes have proved invaluable in identifying and tracking outbreaks, and for prognostication in individual cases of infection, but they are imprecise and potentially misleading due to the malleability and continuous evolution of E. coli . Whole genome sequencing can be used to accurately determine E. coli subtypes that are based on allelic variation or differences in gene content, such as serotyping and pathotyping. Whole genome sequencing also provides information about single nucleotide polymorphisms in the core genome of E. coli , which form the basis of sequence typing, and is more reliable than other systems for tracking the evolution and spread of individual strains. A typing scheme for E. coli based on genome sequences that includes elements of both the core and accessory genomes, should reduce typing anomalies and promote understanding of how different varieties of E. coli spread and cause disease. Such a scheme could also define pathotypes more precisely than current methods.

  17. Multilocus sequence type system for the plant pathogen Xylella fastidiosa and relative contributions of recombination and point mutation to clonal diversity.

    PubMed

    Scally, Mark; Schuenzel, Erin L; Stouthamer, Richard; Nunney, Leonard

    2005-12-01

    Multilocus sequence typing (MLST) identifies and groups bacterial strains based on DNA sequence data from (typically) seven housekeeping genes. MLST has also been employed to estimate the relative contributions of recombination and point mutation to clonal divergence. We applied MLST to the plant pathogen Xylella fastidiosa using an initial set of sequences for 10 loci (9.3 kb) of 25 strains from five different host plants, grapevine (PD strains), oleander (OLS strains), oak (OAK strains), almond (ALS strains), and peach (PP strains). An eBURST analysis identified six clonal complexes using the grouping criterion that each member must be identical to at least one other member at 7 or more of the 10 loci. These clonal complexes corresponded to previously identified phylogenetic clades; clonal complex 1 (CC1) (all PD strains plus two ALS strains) and CC2 (OLS strains) defined the X. fastidiosa subsp. fastidiosa and X. fastidiosa subsp. sandyi clades, while CC3 (ALS strains), CC4 (OAK strains), and CC5 (PP strains) were subclades of X. fastidiosa subsp. multiplex. CC6 (ALS strains) identified an X. fastidiosa subsp. multiplex-like group characterized by a high frequency of intersubspecific recombination. Compared to the recombination rate in other bacterial species, the recombination rate in X. fastidiosa is relatively low. Recombination between different alleles was estimated to give rise to 76% of the nucleotide changes and 31% of the allelic changes observed. The housekeeping loci holC, nuoL, leuA, gltT, cysG, petC, and lacF were chosen to form the basis of a public database for typing X. fastidiosa (www.mlst.net). These loci identified the same six clonal complexes using the strain grouping criterion of identity at five or more loci with at least one other member.

  18. The multilocus sequence typing network: mlst.net.

    PubMed

    Aanensen, David M; Spratt, Brian G

    2005-07-01

    The unambiguous characterization of strains of a pathogen is crucial for addressing questions relating to its epidemiology, population and evolutionary biology. Multilocus sequence typing (MLST), which defines strains from the sequences at seven house-keeping loci, has become the method of choice for molecular typing of many bacterial and fungal pathogens (and non-pathogens), and MLST schemes and strain databases are available for a growing number of prokaryotic and eukaryotic organisms. Sequence data are ideal for strain characterization as they are unambiguous, meaning strains can readily be compared between laboratories via the Internet. Laboratories undertaking MLST can quickly progress from sequencing the seven gene fragments to characterizing their strains and relating them to those submitted by others and to the population as a whole. We provide the gateway to a number of MLST schemes, each of which contain a set of tools for the initial characterization of strains, and methods for relating query strains to other strains of the species, including clustering based on differences in allelic profiles, phylogenetic trees based on concatenated sequences, and a recently developed method (eBURST) for identifying clonal complexes within a species and displaying the overall structure of the population. This network of MLST websites is available at http://www.mlst.net.

  19. Influenza virus sequence feature variant type analysis: evidence of a role for NS1 in influenza virus host range restriction.

    PubMed

    Noronha, Jyothi M; Liu, Mengya; Squires, R Burke; Pickett, Brett E; Hale, Benjamin G; Air, Gillian M; Galloway, Summer E; Takimoto, Toru; Schmolke, Mirco; Hunt, Victoria; Klem, Edward; García-Sastre, Adolfo; McGee, Monnie; Scheuermann, Richard H

    2012-05-01

    Genetic drift of influenza virus genomic sequences occurs through the combined effects of sequence alterations introduced by a low-fidelity polymerase and the varying selective pressures experienced as the virus migrates through different host environments. While traditional phylogenetic analysis is useful in tracking the evolutionary heritage of these viruses, the specific genetic determinants that dictate important phenotypic characteristics are often difficult to discern within the complex genetic background arising through evolution. Here we describe a novel influenza virus sequence feature variant type (Flu-SFVT) approach, made available through the public Influenza Research Database resource (www.fludb.org), in which variant types (VTs) identified in defined influenza virus protein sequence features (SFs) are used for genotype-phenotype association studies. Since SFs have been defined for all influenza virus proteins based on known structural, functional, and immune epitope recognition properties, the Flu-SFVT approach allows the rapid identification of the molecular genetic determinants of important influenza virus characteristics and their connection to underlying biological functions. We demonstrate the use of the SFVT approach to obtain statistical evidence for effects of NS1 protein sequence variations in dictating influenza virus host range restriction.

  20. Subtyping of the Legionella pneumophila "Ulm" outbreak strain using the CRISPR-Cas system.

    PubMed

    Lück, Christian; Brzuszkiewicz, Elzbieta; Rydzewski, Kerstin; Koshkolda, Tetyana; Sarnow, Katharina; Essig, Andreas; Heuner, Klaus

    2015-12-01

    In 2009/2010 an outbreak of Legionnaires' disease with 64 cases including four fatalities took place in the city of Ulm/Neu-Ulm in Germany. L. pneumophila serogroup 1, mAb type Knoxville, sequence type (ST) 62 was identified as the epidemic strain. This strain was isolated from eight patients and from a cooling tower in the city of Ulm. Based on whole genome sequencing data from one patient strain, we identified an Lvh type IV secretion system containing a CRISPR-Cas system. The CRISPR sequence contains 38 spacer DNA sequences. We used these variable DNA spacers to further subtype the outbreak strain as well as six epidemiologically unrelated strains of CRISPR-Cas positive ST62 strains isolated at various regions in Germany. The first 12 spacer DNAs of eight patient isolates and three environmental isolates from the suspected source of infection were analyzed and found to be identical. Spacer DNAs were identified in further six epidemiologically unrelated patient isolates of L. pneumophila of ST62 in addition to the 12 "core" spacers. The presence of new spacer DNAs at the 5' site downstream of the first repeat indicates that these CRISPR-Cas systems seem to be functional. PCR analysis revealed that not all L. pneumophila sg1 ST62 strains investigated exhibited a CRISPR-Cas system. In addition, we could demonstrate that the CRISPR-Cas system is localized on a genomic island (LpuGI-Lvh) which can be excised from the chromosome and therefore may be transferable horizontally to other L. pneumophila strains. Copyright © 2015 Elsevier GmbH. All rights reserved.

  1. Characterization and Exploitation of CRISPR Loci in Bifidobacterium longum

    PubMed Central

    Hidalgo-Cantabrana, Claudio; Crawley, Alexandra B.; Sanchez, Borja; Barrangou, Rodolphe

    2017-01-01

    Diverse CRISPR-Cas systems provide adaptive immunity in many bacteria and most archaea, via a DNA-encoded, RNA-mediated, nucleic-acid targeting mechanism. Over time, CRISPR loci expand via iterative uptake of invasive DNA sequences into the CRISPR array during the adaptation process. These genetic vaccination cards thus provide insights into the exposure of strains to phages and plasmids in space and time, revealing the historical predatory exposure of a strain. These genetic loci thus constitute a unique basis for genotyping of strains, with potential of resolution at the strain-level. Here, we investigate the occurrence and diversity of CRISPR-Cas systems in the genomes of various Bifidobacterium longum strains across three sub-species. Specifically, we analyzed the genomic content of 66 genomes belonging to B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis, and identified 25 strains that carry 29 total CRISPR-Cas systems. We identify various Type I and Type II CRISPR-Cas systems that are widespread in this species, notably I-C, I-E, and II-C. Noteworthy, Type I-C systems showed extended CRISPR arrays, with extensive spacer diversity. We show how these hypervariable loci can be used to gain insights into strain origin, evolution and phylogeny, and can provide discriminatory sequences to distinguish even clonal isolates. By investigating CRISPR spacer sequences, we reveal their origin and implicate phages and prophages as drivers of CRISPR immunity expansion in this species, with redundant targeting of select prophages. Analysis of CRISPR spacer origin also revealed novel PAM sequences. Our results suggest that CRISPR-Cas immune systems are instrumental in mounting diversified viral resistance in B. longum, and show that these sequences are useful for typing across three subspecies. PMID:29033911

  2. Characterization and Exploitation of CRISPR Loci in Bifidobacterium longum.

    PubMed

    Hidalgo-Cantabrana, Claudio; Crawley, Alexandra B; Sanchez, Borja; Barrangou, Rodolphe

    2017-01-01

    Diverse CRISPR-Cas systems provide adaptive immunity in many bacteria and most archaea, via a DNA-encoded, RNA-mediated, nucleic-acid targeting mechanism. Over time, CRISPR loci expand via iterative uptake of invasive DNA sequences into the CRISPR array during the adaptation process. These genetic vaccination cards thus provide insights into the exposure of strains to phages and plasmids in space and time, revealing the historical predatory exposure of a strain. These genetic loci thus constitute a unique basis for genotyping of strains, with potential of resolution at the strain-level. Here, we investigate the occurrence and diversity of CRISPR-Cas systems in the genomes of various Bifidobacterium longum strains across three sub-species. Specifically, we analyzed the genomic content of 66 genomes belonging to B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis , and identified 25 strains that carry 29 total CRISPR-Cas systems. We identify various Type I and Type II CRISPR-Cas systems that are widespread in this species, notably I-C, I-E, and II-C. Noteworthy, Type I-C systems showed extended CRISPR arrays, with extensive spacer diversity. We show how these hypervariable loci can be used to gain insights into strain origin, evolution and phylogeny, and can provide discriminatory sequences to distinguish even clonal isolates. By investigating CRISPR spacer sequences, we reveal their origin and implicate phages and prophages as drivers of CRISPR immunity expansion in this species, with redundant targeting of select prophages. Analysis of CRISPR spacer origin also revealed novel PAM sequences. Our results suggest that CRISPR-Cas immune systems are instrumental in mounting diversified viral resistance in B. longum , and show that these sequences are useful for typing across three subspecies.

  3. Accurate Typing of Human Leukocyte Antigen Class I Genes by Oxford Nanopore Sequencing.

    PubMed

    Liu, Chang; Xiao, Fangzhou; Hoisington-Lopez, Jessica; Lang, Kathrin; Quenzel, Philipp; Duffy, Brian; Mitra, Robi David

    2018-04-03

    Oxford Nanopore Technologies' MinION has expanded the current DNA sequencing toolkit by delivering long read lengths and extreme portability. The MinION has the potential to enable expedited point-of-care human leukocyte antigen (HLA) typing, an assay routinely used to assess the immunologic compatibility between organ donors and recipients, but the platform's high error rate makes it challenging to type alleles with accuracy. We developed and validated accurate typing of HLA by Oxford nanopore (Athlon), a bioinformatic pipeline that i) maps nanopore reads to a database of known HLA alleles, ii) identifies candidate alleles with the highest read coverage at different resolution levels that are represented as branching nodes and leaves of a tree structure, iii) generates consensus sequences by remapping the reads to the candidate alleles, and iv) calls the final diploid genotype by blasting consensus sequences against the reference database. Using two independent data sets generated on the R9.4 flow cell chemistry, Athlon achieved a 100% accuracy in class I HLA typing at the two-field resolution. Copyright © 2018 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  4. 17{beta}-Hydroxysteroid dehydrogenase type 13 is a liver-specific lipid droplet-associated protein

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Horiguchi, Yuka; Araki, Makoto; Motojima, Kiyoto

    2008-05-30

    17{beta}-Hydroxysteroid dehydrogenase (17{beta}HSD) type 13 is identified as a new lipid droplet-associated protein. 17{beta}HSD type 13 has an N-terminal sequence similar to that of 17{beta}HSD type 11, and both sequences function as an endoplasmic reticulum and lipid droplet-targeting signal. Localization of native 17{beta}HSD type 13 on the lipid droplets was confirmed by subcellular fractionation and Western blotting. In contrast to 17{beta}HSD type 11, however, expression of 17{beta}HSD type 13 is largely restricted to the liver and is not enhanced by peroxisome proliferator-activated receptor {alpha} and its ligand. Instead the expression level of 17{beta}HSD type 13 in the receptor-null mice wasmore » increased several-fold. 17{beta}HSD type 13 may have a distinct physiological role as a lipid droplet-associated protein in the liver.« less

  5. Whole Genome Sequencing Demonstrates Limited Transmission within Identified Mycobacterium tuberculosis Clusters in New South Wales, Australia

    PubMed Central

    Gurjav, Ulziijargal; Outhred, Alexander C.; Jelfs, Peter; McCallum, Nadine; Wang, Qinning; Hill-Cawthorne, Grant A.; Marais, Ben J.; Sintchenko, Vitali

    2016-01-01

    Australia has a low tuberculosis incidence rate with most cases occurring among recent immigrants. Given suboptimal cluster resolution achieved with 24-locus mycobacterium interspersed repetitive unit (MIRU-24) genotyping, the added value of whole genome sequencing was explored. MIRU-24 profiles of all Mycobacterium tuberculosis culture-confirmed tuberculosis cases diagnosed between 2009 and 2013 in New South Wales (NSW), Australia, were examined and clusters identified. The relatedness of cases within the largest MIRU-24 clusters was assessed using whole genome sequencing and phylogenetic analyses. Of 1841 culture-confirmed TB cases, 91.9% (1692/1841) had complete demographic and genotyping data. East-African Indian (474; 28.0%) and Beijing (470; 27.8%) lineage strains predominated. The overall rate of MIRU-24 clustering was 20.1% (340/1692) and was highest among Beijing lineage strains (35.7%; 168/470). One Beijing and three East-African Indian (EAI) clonal complexes were responsible for the majority of observed clusters. Whole genome sequencing of the 4 largest clusters (30 isolates) demonstrated diverse single nucleotide polymorphisms (SNPs) within identified clusters. All sequenced EAI strains and 70% of Beijing lineage strains clustered by MIRU-24 typing demonstrated distinct SNP profiles. The superior resolution provided by whole genome sequencing demonstrated limited M. tuberculosis transmission within NSW, even within identified MIRU-24 clusters. Routine whole genome sequencing could provide valuable public health guidance in low burden settings. PMID:27737005

  6. Pyrosequencing analysis of the gyrB gene to differentiate bacteria responsible for diarrheal diseases.

    PubMed

    Hou, X-L; Cao, Q-Y; Jia, H-Y; Chen, Z

    2008-07-01

    Pathogens causing acute diarrhea include a large variety of species from Enterobacteriaceae and Vibrionaceae. A method based on pyrosequencing was used here to differentiate bacteria commonly associated with diarrhea in China; the method is targeted to a partial amplicon of the gyrB gene, which encodes the B subunit of DNA gyrase. Twenty-eight specific polymorphic positions were identified from sequence alignment of a large sequence dataset and targeted using 17 sequencing primers. Of 95 isolates tested, belonging to 13 species within 7 genera, most could be identified to the species level; O157 type could be differentiated from other E. coli types; Salmonella enterica subsp. enterica could be identified at the serotype level; the genus Shigella, except for S. boydii and S. dysenteriae, could also be identified. All these isolates were also subjected to conventional sequencing of a relatively long ( approximately1.2 kb) region of gyrB DNA; these results confirmed those with pyrosequencing. Twenty-two fecal samples were surveyed, the results of which were concordant with culture-based bacterial identification, and the pathogen detection limit with simulated stool specimens was 10(4) CFU/ml. DNA from different pathogens was also mixed to simulate a case of multibacterial infection, and the generated signals correlated well with the mix ratio. In summary, the gyrB-based pyrosequencing approach proved to have significant reliability and discriminatory power for enteropathogenic bacterial identification and provided a fast and effective method for clinical diagnosis.

  7. Identification of Trypanosoma cruzi Discrete Typing Units (DTUs) in Latin-American migrants in Barcelona (Spain).

    PubMed

    Abras, Alba; Gállego, Montserrat; Muñoz, Carmen; Juiz, Natalia A; Ramírez, Juan Carlos; Cura, Carolina I; Tebar, Silvia; Fernández-Arévalo, Anna; Pinazo, María-Jesús; de la Torre, Leonardo; Posada, Elizabeth; Navarro, Ferran; Espinal, Paula; Ballart, Cristina; Portús, Montserrat; Gascón, Joaquim; Schijman, Alejandro G

    2017-04-01

    Trypanosoma cruzi, the causative agent of Chagas disease, is divided into six Discrete Typing Units (DTUs): TcI-TcVI. We aimed to identify T. cruzi DTUs in Latin-American migrants in the Barcelona area (Spain) and to assess different molecular typing approaches for the characterization of T. cruzi genotypes. Seventy-five peripheral blood samples were analyzed by two real-time PCR methods (qPCR) based on satellite DNA (SatDNA) and kinetoplastid DNA (kDNA). The 20 samples testing positive in both methods, all belonging to Bolivian individuals, were submitted to DTU characterization using two PCR-based flowcharts: multiplex qPCR using TaqMan probes (MTq-PCR), and conventional PCR. These samples were also studied by sequencing the SatDNA and classified as type I (TcI/III), type II (TcII/IV) and type I/II hybrid (TcV/VI). Ten out of the 20 samples gave positive results in the flowcharts: TcV (5 samples), TcII/V/VI (3) and mixed infections by TcV plus TcII (1) and TcV plus TcII/VI (1). By SatDNA sequencing, we classified the 20 samples, 19 as type I/II and one as type I. The most frequent DTU identified by both flowcharts, and suggested by SatDNA sequencing in the remaining samples with low parasitic loads, TcV, is common in Bolivia and predominant in peripheral blood. The mixed infection by TcV-TcII was detected for the first time simultaneously in Bolivian migrants. PCR-based flowcharts are very useful to characterize DTUs during acute infection. SatDNA sequence analysis cannot discriminate T. cruzi populations at the level of a single DTU but it enabled us to increase the number of characterized cases in chronically infected patients. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  8. Genome sequence of Frateuria aurantia type strain (Kondô 67T), a xanthomonade isolated from Lilium auratium Lindl.

    PubMed Central

    Anderson, Iain; Teshima, Huzuki; Nolan, Matt; Lapidus, Alla; Tice, Hope; Del Rio, Tijana Glavina; Cheng, Jan-Fang; Han, Cliff; Tapia, Roxanne; Goodwin, Lynne A.; Pitluck, Sam; Liolios, Konstantinos; Mavromatis, Konstantinos; Pagani, Ioanna; Ivanova, Natalia; Mikhailova, Natalia; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Rohde, Manfred; Lang, Elke; Detter, John C.; Göker, Markus; Woyke, Tanja; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2013-01-01

    Frateuria aurantia (ex Kondô and Ameyama 1958) Swings et al. 1980 is a member of the bispecific genus Frateuria in the family Xanthomonadaceae, which is already heavily targeted for non-type strain genome sequencing. Strain Kondô 67T was initially (1958) identified as a member of ‘Acetobacter aurantius’, a name that was not considered for the approved list. Kondô 67T was therefore later designated as the type strain of the newly proposed acetogenic species Frateuria aurantia. The strain is of interest because of its triterpenoids (hopane family). F. aurantia Kondô 67T is the first member of the genus Frateura whose genome sequence has been deciphered, and here we describe the features of this organism, together with the complete genome sequence and annotation. The 3,603,458-bp long chromosome with its 3,200 protein-coding and 88 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project. PMID:24501647

  9. Genome sequence of Frateuria aurantia type strain (Kondô 67T), a xanthomonade isolated from Lilium auratium Lindl.

    DOE PAGES

    Anderson, Iain; Teshima, Huzuki; Nolan, Matt; ...

    2013-10-16

    Frateuria aurantia (ex Kondô and Ameyama 1958) Swings et al. 1980 is a member of the bispecific genus Frateuria in the family Xanthomonadaceae, which is already heavily targeted for non-type strain genome sequencing. Strain Kondô 67 T was initially (1958) identified as a member of ‘Acetobacter aurantius’, a name that was not considered for the approved list. Kondô 67 T was therefore later designated as the type strain of the newly proposed acetogenic species Frateuria aurantia. The strain is of interest because of its triterpenoids (hopane family). F. aurantia Kondô 67 T is the first member of the genus Frateuramore » whose genome sequence has been deciphered, and here we describe the features of this organism, together with the complete genome sequence and annotation. The 3,603,458-bp long chromosome with its 3,200 protein-coding and 88 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.« less

  10. Limitations of variable number of tandem repeat typing identified through whole genome sequencing of Mycobacterium avium subsp. paratuberculosis on a national and herd level.

    PubMed

    Ahlstrom, Christina; Barkema, Herman W; Stevenson, Karen; Zadoks, Ruth N; Biek, Roman; Kao, Rowland; Trewby, Hannah; Haupstein, Deb; Kelton, David F; Fecteau, Gilles; Labrecque, Olivia; Keefe, Greg P; McKenna, Shawn L B; De Buck, Jeroen

    2015-03-08

    Mycobacterium avium subsp. paratuberculosis (MAP), the causative bacterium of Johne's disease in dairy cattle, is widespread in the Canadian dairy industry and has significant economic and animal welfare implications. An understanding of the population dynamics of MAP can be used to identify introduction events, improve control efforts and target transmission pathways, although this requires an adequate understanding of MAP diversity and distribution between herds and across the country. Whole genome sequencing (WGS) offers a detailed assessment of the SNP-level diversity and genetic relationship of isolates, whereas several molecular typing techniques used to investigate the molecular epidemiology of MAP, such as variable number of tandem repeat (VNTR) typing, target relatively unstable repetitive elements in the genome that may be too unpredictable to draw accurate conclusions. The objective of this study was to evaluate the diversity of bovine MAP isolates in Canadian dairy herds using WGS and then determine if VNTR typing can distinguish truly related and unrelated isolates. Phylogenetic analysis based on 3,039 SNPs identified through WGS of 124 MAP isolates identified eight genetically distinct subtypes in dairy herds from seven Canadian provinces, with the dominant type including over 80% of MAP isolates. VNTR typing of 527 MAP isolates identified 12 types, including "bison type" isolates, from seven different herds. At a national level, MAP isolates differed from each other by 1-2 to 239-240 SNPs, regardless of whether they belonged to the same or different VNTR types. A herd-level analysis of MAP isolates demonstrated that VNTR typing may both over-estimate and under-estimate the relatedness of MAP isolates found within a single herd. The presence of multiple MAP subtypes in Canada suggests multiple introductions into the country including what has now become one dominant type, an important finding for Johne's disease control. VNTR typing often failed to identify closely and distantly related isolates, limiting the applicability of using this typing scheme to study the molecular epidemiology of MAP at a national and herd-level.

  11. mirVAFC: A Web Server for Prioritizations of Pathogenic Sequence Variants from Exome Sequencing Data via Classifications.

    PubMed

    Li, Zhongshan; Liu, Zhenwei; Jiang, Yi; Chen, Denghui; Ran, Xia; Sun, Zhong Sheng; Wu, Jinyu

    2017-01-01

    Exome sequencing has been widely used to identify the genetic variants underlying human genetic disorders for clinical diagnoses, but the identification of pathogenic sequence variants among the huge amounts of benign ones is complicated and challenging. Here, we describe a new Web server named mirVAFC for pathogenic sequence variants prioritizations from clinical exome sequencing (CES) variant data of single individual or family. The mirVAFC is able to comprehensively annotate sequence variants, filter out most irrelevant variants using custom criteria, classify variants into different categories as for estimated pathogenicity, and lastly provide pathogenic variants prioritizations based on classifications and mutation effects. Case studies using different types of datasets for different diseases from publication and our in-house data have revealed that mirVAFC can efficiently identify the right pathogenic candidates as in original work in each case. Overall, the Web server mirVAFC is specifically developed for pathogenic sequence variant identifications from family-based CES variants using classification-based prioritizations. The mirVAFC Web server is freely accessible at https://www.wzgenomics.cn/mirVAFC/. © 2016 WILEY PERIODICALS, INC.

  12. Identification of tissue-specific cell death using methylation patterns of circulating DNA

    PubMed Central

    Lehmann-Werman, Roni; Neiman, Daniel; Zemmour, Hai; Moss, Joshua; Magenheim, Judith; Vaknin-Dembinsky, Adi; Rubertsson, Sten; Nellgård, Bengt; Blennow, Kaj; Zetterberg, Henrik; Spalding, Kirsty; Haller, Michael J.; Wasserfall, Clive H.; Schatz, Desmond A.; Greenbaum, Carla J.; Dorrell, Craig; Grompe, Markus; Zick, Aviad; Hubert, Ayala; Maoz, Myriam; Fendrich, Volker; Bartsch, Detlef K.; Golan, Talia; Ben Sasson, Shmuel A.; Zamir, Gideon; Razin, Aharon; Cedar, Howard; Shapiro, A. M. James; Glaser, Benjamin; Shemer, Ruth; Dor, Yuval

    2016-01-01

    Minimally invasive detection of cell death could prove an invaluable resource in many physiologic and pathologic situations. Cell-free circulating DNA (cfDNA) released from dying cells is emerging as a diagnostic tool for monitoring cancer dynamics and graft failure. However, existing methods rely on differences in DNA sequences in source tissues, so that cell death cannot be identified in tissues with a normal genome. We developed a method of detecting tissue-specific cell death in humans based on tissue-specific methylation patterns in cfDNA. We interrogated tissue-specific methylome databases to identify cell type-specific DNA methylation signatures and developed a method to detect these signatures in mixed DNA samples. We isolated cfDNA from plasma or serum of donors, treated the cfDNA with bisulfite, PCR-amplified the cfDNA, and sequenced it to quantify cfDNA carrying the methylation markers of the cell type of interest. Pancreatic β-cell DNA was identified in the circulation of patients with recently diagnosed type-1 diabetes and islet-graft recipients; oligodendrocyte DNA was identified in patients with relapsing multiple sclerosis; neuronal/glial DNA was identified in patients after traumatic brain injury or cardiac arrest; and exocrine pancreas DNA was identified in patients with pancreatic cancer or pancreatitis. This proof-of-concept study demonstrates that the tissue origins of cfDNA and thus the rate of death of specific cell types can be determined in humans. The approach can be adapted to identify cfDNA derived from any cell type in the body, offering a minimally invasive window for diagnosing and monitoring a broad spectrum of human pathologies as well as providing a better understanding of normal tissue dynamics. PMID:26976580

  13. New FeFe-hydrogenase genes identified in a metagenomic fosmid library from a municipal wastewater treatment plant as revealed by high-throughput sequencing.

    PubMed

    Tomazetto, Geizecler; Wibberg, Daniel; Schlüter, Andreas; Oliveira, Valéria M

    2015-01-01

    A fosmid metagenomic library was constructed with total community DNA obtained from a municipal wastewater treatment plant (MWWTP), with the aim of identifying new FeFe-hydrogenase genes encoding the enzymes most important for hydrogen metabolism. The dataset generated by pyrosequencing of a fosmid library was mined to identify environmental gene tags (EGTs) assigned to FeFe-hydrogenase. The majority of EGTs representing FeFe-hydrogenase genes were affiliated with the class Clostridia, suggesting that this group is the main hydrogen producer in the MWWTP analyzed. Based on assembled sequences, three FeFe-hydrogenase genes were predicted based on detection of the L2 motif (MPCxxKxxE) in the encoded gene product, confirming true FeFe-hydrogenase sequences. These sequences were used to design specific primers to detect fosmids encoding FeFe-hydrogenase genes predicted from the dataset. Three identified fosmids were completely sequenced. The cloned genomic fragments within these fosmids are closely related to members of the Spirochaetaceae, Bacteroidales and Firmicutes, and their FeFe-hydrogenase sequences are characterized by the structure type M3, which is common to clostridial enzymes. FeFe-hydrogenase sequences found in this study represent hitherto undetected sequences, indicating the high genetic diversity regarding these enzymes in MWWTP. Results suggest that MWWTP have to be considered as reservoirs for new FeFe-hydrogenase genes. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  14. Chemical-biogeographic survey of secondary metabolism in soil.

    PubMed

    Charlop-Powers, Zachary; Owen, Jeremy G; Reddy, Boojala Vijay B; Ternei, Melinda A; Brady, Sean F

    2014-03-11

    In this study, we compare biosynthetic gene richness and diversity of 96 soil microbiomes from diverse environments found throughout the southwestern and northeastern regions of the United States. The 454-pyroseqencing of nonribosomal peptide adenylation (AD) and polyketide ketosynthase (KS) domain fragments amplified from these microbiomes provide a means to evaluate the variation of secondary metabolite biosynthetic diversity in different soil environments. Through soil composition and AD- and KS-amplicon richness analysis, we identify soil types with elevated biosynthetic potential. In general, arid soils show the richest observed biosynthetic diversity, whereas brackish sediments and pine forest soils show the least. By mapping individual environmental amplicon sequences to sequences derived from functionally characterized biosynthetic gene clusters, we identified conserved soil type-specific secondary metabolome enrichment patterns despite significant sample-to-sample sequence variation. These data are used to create chemical biogeographic distribution maps for biomedically valuable families of natural products in the environment that should prove useful for directing the discovery of bioactive natural products in the future.

  15. Complete Deletion of the Fucose Operon in Haemophilus influenzae Is Associated with a Cluster in Multilocus Sequence Analysis-Based Phylogenetic Group II Related to Haemophilus haemolyticus: Implications for Identification and Typing

    PubMed Central

    de Gier, Camilla; Kirkham, Lea-Ann S.

    2015-01-01

    Nonhemolytic variants of Haemophilus haemolyticus are difficult to differentiate from Haemophilus influenzae despite a wide difference in pathogenic potential. A previous investigation characterized a challenging set of 60 clinical strains using multiple PCRs for marker genes and described strains that could not be unequivocally identified as either species. We have analyzed the same set of strains by multilocus sequence analysis (MLSA) and near-full-length 16S rRNA gene sequencing. MLSA unambiguously allocated all study strains to either of the two species, while identification by 16S rRNA sequence was inconclusive for three strains. Notably, the two methods yielded conflicting identifications for two strains. Most of the “fuzzy species” strains were identified as H. influenzae that had undergone complete deletion of the fucose operon. Such strains, which are untypeable by the H. influenzae multilocus sequence type (MLST) scheme, have sporadically been reported and predominantly belong to a single branch of H. influenzae MLSA phylogenetic group II. We also found evidence of interspecies recombination between H. influenzae and H. haemolyticus within the 16S rRNA genes. Establishing an accurate method for rapid and inexpensive identification of H. influenzae is important for disease surveillance and treatment. PMID:26378279

  16. Rapid Identification of Cell-Specific, Internalizing RNA Aptamers with Bioinformatics Analyses of a Cell-Based Aptamer Selection

    PubMed Central

    Thiel, William H.; Bair, Thomas; Peek, Andrew S.; Liu, Xiuying; Dassie, Justin; Stockdale, Katie R.; Behlke, Mark A.; Miller, Francis J.; Giangrande, Paloma H.

    2012-01-01

    Background The broad applicability of RNA aptamers as cell-specific delivery tools for therapeutic reagents depends on the ability to identify aptamer sequences that selectively access the cytoplasm of distinct cell types. Towards this end, we have developed a novel approach that combines a cell-based selection method (cell-internalization SELEX) with high-throughput sequencing (HTS) and bioinformatics analyses to rapidly identify cell-specific, internalization-competent RNA aptamers. Methodology/Principal Findings We demonstrate the utility of this approach by enriching for RNA aptamers capable of selective internalization into vascular smooth muscle cells (VSMCs). Several rounds of positive (VSMCs) and negative (endothelial cells; ECs) selection were performed to enrich for aptamer sequences that preferentially internalize into VSMCs. To identify candidate RNA aptamer sequences, HTS data from each round of selection were analyzed using bioinformatics methods: (1) metrics of selection enrichment; and (2) pairwise comparisons of sequence and structural similarity, termed edit and tree distance, respectively. Correlation analyses of experimentally validated aptamers or rounds revealed that the best cell-specific, internalizing aptamers are enriched as a result of the negative selection step performed against ECs. Conclusions and Significance We describe a novel approach that combines cell-internalization SELEX with HTS and bioinformatics analysis to identify cell-specific, cell-internalizing RNA aptamers. Our data highlight the importance of performing a pre-clear step against a non-target cell in order to select for cell-specific aptamers. We expect the extended use of this approach to enable the identification of aptamers to a multitude of different cell types, thereby facilitating the broad development of targeted cell therapies. PMID:22962591

  17. Comparative analysis of the mating-type loci from Neurospora crassa and Sordaria macrospora: identification of novel transcribed ORFs.

    PubMed

    Pöggeler, S; Kück, U

    2000-03-01

    The mating-type locus controls mating and sexual development in filamentous ascomycetes. In the heterothallic ascomycete Neurospora crassa, the genes that confer mating behavior comprise dissimilar DNA sequences (idiomorphs) in the mat a and mat A mating partners. In the homothallic fungus Sordaria macrospora, sequences corresponding to both idiomorphs are located contiguously in the mating-type locus, which contains one chimeric gene, Smt A-3, that includes sequences which are similar to sequences found at the mat A and mat a mating-type idiomorphs in N. crassa. In this study, we describe the comparative transcriptional analysis of the chimeric mating-type region of S. macrospora and the corresponding region of the N. crassa mat a idiomorph. By means of RT-PCR experiments, we identified novel intervening sequences in the mating-type loci of both ascomycetes and, hence, concluded that an additional ORF, encoding a putative polypeptide of 79 amino acids, is present in the N. crassa mat a idiomorph. Furthermore, our analysis revealed co-transcription of the novel gene with the mat a-1 gene in N. crassa. The same mode of transcription was found in the corresponding mating-type region of S. macrospora, where the chimeric Smt A-3 gene is co-transcribed with the mat a-specific Smt a-1 gene. Analysis of a Smt A-3 cDNA revealed optional splicing of two introns. We believe that this is the first report of co-transcription of protein-encoding nuclear genes in filamentous fungi. Possible functions of the novel ORFs in regulating mating-type gene expression are discussed.

  18. A novel method for simultaneous Enterococcus species identification/typing and van genotyping by high resolution melt analysis.

    PubMed

    Gurtler, Volker; Grando, Danilla; Mayall, Barrie C; Wang, Jenny; Ghaly-Derias, Shahbano

    2012-09-01

    In order to develop a typing and identification method for van gene containing Enterococcus faecium, two multiplex PCR reactions were developed for use in HRM-PCR (High Resolution Melt-PCR): (i) vanA, vanB, vanC, vanC23 to detect van genes from different Enterococcus species; (ii) ISR (intergenic spacer region between the 16S and 23S rRNA genes) to detect all Enterococcus species and obtain species and isolate specific HRM curves. To test and validate the method three groups of isolates were tested: (i) 1672 Enterococcus species isolates from January 2009 to December 2009; (ii) 71 isolates previously identified and typed by PFGE (pulsed-field gel electrophoresis) and MLST (multi-locus sequence typing); and (iii) 18 of the isolates from (i) for which ISR sequencing was done. As well as successfully identifying 2 common genotypes by HRM from the Austin Hospital clinical isolates, this study analysed the sequences of all the vanB genes deposited in GenBank and developed a numerical classification scheme for the standardised naming of these vanB genotypes. The identification of Enterococcus faecalis from E. faecium was reliable and stable using ISR PCR. The typing of E. faecium by ISR PCR: (i) detected two variable peaks corresponding to different copy numbers of insertion sequences I and II corresponding to peak I and II respectively; (ii) produced 7 melt profiles for E. faecium with variable copy numbers of sequences I and II; (iii) demonstrated stability and instability of peak heights with equal frequency within the patient sample (36.4±4.5 days and 38.6±5.8 days respectively for 192 patients); (iv) detected ISR-HRM types with as much discrimination as PFGE and more than MLST; and (v) detected ISR-HRM types that differentiated some isolates that were identical by PFGE and MLST. In conjunction with the rapid and accurate van genotyping method described here, this ISR-HRM typing and identification method can be used as a stable identification and typing method with predictable instability based on recombination and concerted evolution of the rrn operon that will complement existing typing methods. Crown Copyright © 2012. Published by Elsevier B.V. All rights reserved.

  19. Late Miocene volcanic sequences in northern Victoria Land, Antarctica: products of glaciovolcanic eruptions under different thermal regimes

    NASA Astrophysics Data System (ADS)

    Smellie, J. L.; Rocchi, S.; Armienti, P.

    2011-01-01

    Late Miocene (c. 13-5 Ma) volcanic sequences of the Hallett Volcanic Province (HVP) crop out along >250 km of western Ross Sea coast in northern Victoria Land. Eight primary volcanic and six sedimentary lithofacies have been identified, and they are organised into at least five different sequence architectures as a consequence of different combinations of eruptive and/or depositional conditions. The volcanoes were erupted in association with a Miocene glacial cover and the sequences are overwhelmingly glaciovolcanic. The commonest and most representative are products of mafic aa lava-fed deltas, a type of glaciovolcanic sequence that has not been described before. It is distinguished by (1) a subaerially emplaced relatively thin caprock of aa lavas lying on and passing down-dip into (2) a thicker association of chaotic to crudely bedded hyaloclastite breccias, water-chilled lava sheets and irregular lava masses, collectively called lobe-hyaloclastite. A second distinctive sequence type present is characterised by water-cooled lavas and associated sedimentary lithofacies (diamictite (probably glacigenic) and fluvial sands and gravels) similar to some mafic glaciovolcanic sheet-like sequences (see Smellie, Earth-Science Reviews, 74, 241-268, 2008), but including (for the first time) examples of likely sheet-like sequences with felsic compositions. Other sequence types in the HVP are minor and include tuff cones, cinder cones and a single ice-marginal lacustrine sequence. The glacial thermal regime varied from polar, characterised by sequences lacking glacial erosion, glacigenic sediments or evidence for free water, to temperate or sub-polar for sequences in which all of these features are conspicuously developed.

  20. Identifying metabolic enzymes with multiple types of association evidence

    PubMed Central

    Kharchenko, Peter; Chen, Lifeng; Freund, Yoav; Vitkup, Dennis; Church, George M

    2006-01-01

    Background Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. Results We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. Conclusion We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. PMID:16571130

  1. Whole Transcriptome Sequencing Enables Discovery and Analysis of Viruses in Archived Primary Central Nervous System Lymphomas

    PubMed Central

    DeBoever, Christopher; Reid, Erin G.; Smith, Erin N.; Wang, Xiaoyun; Dumaop, Wilmar; Harismendy, Olivier; Carson, Dennis; Richman, Douglas; Masliah, Eliezer; Frazer, Kelly A.

    2013-01-01

    Primary central nervous system lymphomas (PCNSL) have a dramatically increased prevalence among persons living with AIDS and are known to be associated with human Epstein Barr virus (EBV) infection. Previous work suggests that in some cases, co-infection with other viruses may be important for PCNSL pathogenesis. Viral transcription in tumor samples can be measured using next generation transcriptome sequencing. We demonstrate the ability of transcriptome sequencing to identify viruses, characterize viral expression, and identify viral variants by sequencing four archived AIDS-related PCNSL tissue samples and analyzing raw sequencing reads. EBV was detected in all four PCNSL samples and cytomegalovirus (CMV), JC polyomavirus (JCV), and HIV were also discovered, consistent with clinical diagnoses. CMV was found to express three long non-coding RNAs recently reported as expressed during active infection. Single nucleotide variants were observed in each of the viruses observed and three indels were found in CMV. No viruses were found in several control tumor types including 32 diffuse large B-cell lymphoma samples. This study demonstrates the ability of next generation transcriptome sequencing to accurately identify viruses, including DNA viruses, in solid human cancer tissue samples. PMID:24023918

  2. [Typing and identification of non-polio enterovirus from acute flaccid paralysis cases in Ningxia, 1997-2011].

    PubMed

    Ma, Jiang-tao; Chen, Hui; Yuan, Fang; Ma, Xue-min; Guan, Guang-yu; Zhan, Jun

    2012-11-01

    To identify the serotype of 73 non-polio enterovirus (NPEV) strains from acute flaccid paralysis (AFP) cases in Ningxia province, during 1997 - 2011. Partial sequencing of the VP1 region was amplified by RT-PCR with degenerate primers and sequenced while sequences were compared with the database of GenBank by the BLAST algorithm. Evolution was analyzed by constructing phylogenetic tree using Mega 5.1. In this study, a total of 73 NPEVs were analyzed, including 4 strains un-typed, 69 strains typed by RT-PCR. A total of 27 serotypes were identified, including 8 serotypes of human enterovirus (HEV)-A, 19 serotypes of HEV-B. The HEV-B group (46/69, 66.7%) constituted the largest proportion of isolates, followed by HEV-A (23/69, 33.3%), but no strains were found that belonged to HEV-C or HEV-D group. In the 69 strains, enterovirus 71 was the most frequently seen isolates, followed by coxsackie-virus A4, 16, 9 and echovirus 24, 6. HEV-B was the most predominant (46/69, 66.7%) serotype of NPEV in Ningxia during the AFP surveillance, in 1997 - 2011.

  3. Characterization of a restriction modification system from the commensal Escherichia coli strain A0 34/86 (O83:K24:H31).

    PubMed

    Weiserová, Marie; Ryu, Junichi

    2008-06-27

    Type I restriction-modification (R-M) systems are the most complex restriction enzymes discovered to date. Recent years have witnessed a renaissance of interest in R-M enzymes Type I. The massive ongoing sequencing programmes leading to discovery of, so far, more than 1 000 putative enzymes in a broad range of microorganisms including pathogenic bacteria, revealed that these enzymes are widely represented in nature. The aim of this study was characterisation of a putative R-M system EcoA0ORF42P identified in the commensal Escherichia coli A0 34/86 (O83: K24: H31) strain, which is efficiently used at Czech paediatric clinics for prophylaxis and treatment of nosocomial infections and diarrhoea of preterm and newborn infants. We have characterised a restriction-modification system EcoA0ORF42P of the commensal Escherichia coli strain A0 34/86 (O83: K24: H31). This system, designated as EcoAO83I, is a new functional member of the Type IB family, whose specificity differs from those of known Type IB enzymes, as was demonstrated by an immunological cross-reactivity and a complementation assay. Using the plasmid transformation method and the RM search computer program, we identified the DNA recognition sequence of the EcoAO83I as GGA(8N)ATGC. In consistence with the amino acids alignment data, the 3' TRD component of the recognition sequence is identical to the sequence recognized by the EcoEI enzyme. The A-T (modified adenine) distance is identical to that in the EcoAI and EcoEI recognition sites, which also indicates that this system is a Type IB member. Interestingly, the recognition sequence we determined here is identical to the previously reported prototype sequence for Eco377I and its isoschizomers. Putative restriction-modification system EcoA0ORF42P in the commensal Escherichia coli strain A0 34/86 (O83: K24: H31) was found to be a member of the Type IB family and was designated as EcoAO83I. Combination of the classical biochemical and bacterial genetics approaches with comparative genomics might contribute effectively to further classification of many other putative Type-I enzymes, especially in clinical samples.

  4. Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.

    PubMed

    Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru

    2015-01-01

    The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.

  5. Functionally conserved cis-regulatory elements of COL18A1 identified through zebrafish transgenesis.

    PubMed

    Kague, Erika; Bessling, Seneca L; Lee, Josephine; Hu, Gui; Passos-Bueno, Maria Rita; Fisher, Shannon

    2010-01-15

    Type XVIII collagen is a component of basement membranes, and expressed prominently in the eye, blood vessels, liver, and the central nervous system. Homozygous mutations in COL18A1 lead to Knobloch Syndrome, characterized by ocular defects and occipital encephalocele. However, relatively little has been described on the role of type XVIII collagen in development, and nothing is known about the regulation of its tissue-specific expression pattern. We have used zebrafish transgenesis to identify and characterize cis-regulatory sequences controlling expression of the human gene. Candidate enhancers were selected from non-coding sequence associated with COL18A1 based on sequence conservation among mammals. Although these displayed no overt conservation with orthologous zebrafish sequences, four regions nonetheless acted as tissue-specific transcriptional enhancers in the zebrafish embryo, and together recapitulated the major aspects of col18a1 expression. Additional post-hoc computational analysis on positive enhancer sequences revealed alignments between mammalian and teleost sequences, which we hypothesize predict the corresponding zebrafish enhancers; for one of these, we demonstrate functional overlap with the orthologous human enhancer sequence. Our results provide important insight into the biological function and regulation of COL18A1, and point to additional sequences that may contribute to complex diseases involving COL18A1. More generally, we show that combining functional data with targeted analyses for phylogenetic conservation can reveal conserved cis-regulatory elements in the large number of cases where computational alignment alone falls short. Copyright 2009 Elsevier Inc. All rights reserved.

  6. Quasispecies Analyses of the HIV-1 Near-full-length Genome With Illumina MiSeq

    PubMed Central

    Ode, Hirotaka; Matsuda, Masakazu; Matsuoka, Kazuhiro; Hachiya, Atsuko; Hattori, Junko; Kito, Yumiko; Yokomaku, Yoshiyuki; Iwatani, Yasumasa; Sugiura, Wataru

    2015-01-01

    Human immunodeficiency virus type-1 (HIV-1) exhibits high between-host genetic diversity and within-host heterogeneity, recognized as quasispecies. Because HIV-1 quasispecies fluctuate in terms of multiple factors, such as antiretroviral exposure and host immunity, analyzing the HIV-1 genome is critical for selecting effective antiretroviral therapy and understanding within-host viral coevolution mechanisms. Here, to obtain HIV-1 genome sequence information that includes minority variants, we sought to develop a method for evaluating quasispecies throughout the HIV-1 near-full-length genome using the Illumina MiSeq benchtop deep sequencer. To ensure the reliability of minority mutation detection, we applied an analysis method of sequence read mapping onto a consensus sequence derived from de novo assembly followed by iterative mapping and subsequent unique error correction. Deep sequencing analyses of aHIV-1 clone showed that the analysis method reduced erroneous base prevalence below 1% in each sequence position and discarded only < 1% of all collected nucleotides, maximizing the usage of the collected genome sequences. Further, we designed primer sets to amplify the HIV-1 near-full-length genome from clinical plasma samples. Deep sequencing of 92 samples in combination with the primer sets and our analysis method provided sufficient coverage to identify >1%-frequency sequences throughout the genome. When we evaluated sequences of pol genes from 18 treatment-naïve patients' samples, the deep sequencing results were in agreement with Sanger sequencing and identified numerous additional minority mutations. The results suggest that our deep sequencing method would be suitable for identifying within-host viral population dynamics throughout the genome. PMID:26617593

  7. SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read

    PubMed Central

    2010-01-01

    Background High-throughput automated sequencing has enabled an exponential growth rate of sequencing data. This requires increasing sequence quality and reliability in order to avoid database contamination with artefactual sequences. The arrival of pyrosequencing enhances this problem and necessitates customisable pre-processing algorithms. Results SeqTrim has been implemented both as a Web and as a standalone command line application. Already-published and newly-designed algorithms have been included to identify sequence inserts, to remove low quality, vector, adaptor, low complexity and contaminant sequences, and to detect chimeric reads. The availability of several input and output formats allows its inclusion in sequence processing workflows. Due to its specific algorithms, SeqTrim outperforms other pre-processors implemented as Web services or standalone applications. It performs equally well with sequences from EST libraries, SSH libraries, genomic DNA libraries and pyrosequencing reads and does not lead to over-trimming. Conclusions SeqTrim is an efficient pipeline designed for pre-processing of any type of sequence read, including next-generation sequencing. It is easily configurable and provides a friendly interface that allows users to know what happened with sequences at every pre-processing stage, and to verify pre-processing of an individual sequence if desired. The recommended pipeline reveals more information about each sequence than previously described pre-processors and can discard more sequencing or experimental artefacts. PMID:20089148

  8. The evolution and population structure of Lactobacillus fermentum from different naturally fermented products as determined by multilocus sequence typing (MLST).

    PubMed

    Dan, Tong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Menghe, Bilige; Zhang, Heping; Sun, Zhihong

    2015-05-20

    Lactobacillus fermentum is economically important in the production and preservation of fermented foods. A repeatable and discriminative typing method was devised to characterize L. fermentum at the molecular level. The multilocus sequence typing (MLST) scheme developed was based on analysis of the internal sequence of 11 housekeeping gene fragments (clpX, dnaA, dnaK, groEL, murC, murE, pepX, pyrG, recA, rpoB, and uvrC). MLST analysis of 203 isolates of L. fermentum from Mongolia and seven provinces/ autonomous regions in China identified 57 sequence types (ST), 27 of which were represented by only a single isolate, indicating high genetic diversity. Phylogenetic analyses based on the sequence of the 11 housekeeping gene fragments indicated that the L. fermentum isolates analyzed belonged to two major groups. A standardized index of association (I A (S)) indicated a weak clonal population structure in L. fermentum. Split decomposition analysis indicated that recombination played an important role in generating the genetic diversity observed in L. fermentum. The results from the minimum spanning tree strongly suggested that evolution of L. fermentum STs was not correlated with geography or food-type. The MLST scheme developed will be valuable for further studies on the evolution and population structure of L. fermentum isolates used in food products.

  9. Vibrio cholerae typing phage N4: genome sequence and its relatedness to T7 viral supergroup.

    PubMed

    Das, Mayukh; Nandy, R K; Bhowmick, Tushar Suvra; Yamasaki, S; Ghosh, A; Nair, G B; Sarkar, B L

    2012-01-01

    In countries where cholera is endemic, Vibrio cholerae O1 bacteriophages have been detected in sewage water. These have been used to serve not only as strain markers, but also for the typing of V. cholerae strains. Vibriophage N4 (ATCC 51352-B1) occupies a unique position in the new phage-typing scheme and can infect a larger number of V. cholerae O1 biotype El Tor strains. Here we characterized the complete genome sequence of this typing vibriophage. The complete DNA sequence of the N4 genome was determined by using a shotgun sequencing approach. Complete genome sequence explored that phage N4 is comprised of one circular, double-stranded chromosome of 38,497 bp with an overall GC content of 42.8%. A total of 47 open reading frames were identified and functions could be assigned to 30 of them. Further, a close relationship with another vibriophage, VP4, and the enterobacteriophage T7 could be established. DNA-DNA hybridization among V. cholerae O1 and O139 phages revealed homology among O1 vibriophages at their genomic level. This study indicates two evolutionary distinctive branches of the possible phylogenetic origin of O1 and O139 vibriophages and provides an unveiled collection of information on viral gene products of typing vibriophages. Copyright © 2011 S. Karger AG, Basel.

  10. Identification and expression analysis of a novel R-type lectin from the coleopteran beetle, Tenebrio molitor.

    PubMed

    Kim, Dong Hyun; Patnaik, Bharat Bhusan; Seo, Gi Won; Kang, Seong Min; Lee, Yong Seok; Lee, Bok Luel; Han, Yeon Soo

    2013-11-01

    We have identified novel ricin-type (R-type) lectin by sequencing of random clones from cDNA library of the coleopteran beetle, Tenebrio molitor. The cDNA sequence is comprised of 495 bp encoding a protein of 164 amino acid residues and shows 49% identity with galectin of Tribolium castaneum. Bioinformatics analysis shows that the amino acid residues from 35 to 162 belong to ricin-type beta-trefoil structure. The transcript was significantly upregulated after early hours of injection with peptidoglycans derived from Gram (+) and Gram (-) bacteria, beta-1, 3 glucan from fungi and an intracellular pathogen, Listeria monocytogenes suggesting putative function in innate immunity. Copyright © 2013 Elsevier Inc. All rights reserved.

  11. Integrating and mining the chromatin landscape of cell-type specificity using self-organizing maps.

    PubMed

    Mortazavi, Ali; Pepke, Shirley; Jansen, Camden; Marinov, Georgi K; Ernst, Jason; Kellis, Manolis; Hardison, Ross C; Myers, Richard M; Wold, Barbara J

    2013-12-01

    We tested whether self-organizing maps (SOMs) could be used to effectively integrate, visualize, and mine diverse genomics data types, including complex chromatin signatures. A fine-grained SOM was trained on 72 ChIP-seq histone modifications and DNase-seq data sets from six biologically diverse cell lines studied by The ENCODE Project Consortium. We mined the resulting SOM to identify chromatin signatures related to sequence-specific transcription factor occupancy, sequence motif enrichment, and biological functions. To highlight clusters enriched for specific functions such as transcriptional promoters or enhancers, we overlaid onto the map additional data sets not used during training, such as ChIP-seq, RNA-seq, CAGE, and information on cis-acting regulatory modules from the literature. We used the SOM to parse known transcriptional enhancers according to the cell-type-specific chromatin signature, and we further corroborated this pattern on the map by EP300 (also known as p300) occupancy. New candidate cell-type-specific enhancers were identified for multiple ENCODE cell types in this way, along with new candidates for ubiquitous enhancer activity. An interactive web interface was developed to allow users to visualize and custom-mine the ENCODE SOM. We conclude that large SOMs trained on chromatin data from multiple cell types provide a powerful way to identify complex relationships in genomic data at user-selected levels of granularity.

  12. Integrating and mining the chromatin landscape of cell-type specificity using self-organizing maps

    PubMed Central

    Mortazavi, Ali; Pepke, Shirley; Jansen, Camden; Marinov, Georgi K.; Ernst, Jason; Kellis, Manolis; Hardison, Ross C.; Myers, Richard M.; Wold, Barbara J.

    2013-01-01

    We tested whether self-organizing maps (SOMs) could be used to effectively integrate, visualize, and mine diverse genomics data types, including complex chromatin signatures. A fine-grained SOM was trained on 72 ChIP-seq histone modifications and DNase-seq data sets from six biologically diverse cell lines studied by The ENCODE Project Consortium. We mined the resulting SOM to identify chromatin signatures related to sequence-specific transcription factor occupancy, sequence motif enrichment, and biological functions. To highlight clusters enriched for specific functions such as transcriptional promoters or enhancers, we overlaid onto the map additional data sets not used during training, such as ChIP-seq, RNA-seq, CAGE, and information on cis-acting regulatory modules from the literature. We used the SOM to parse known transcriptional enhancers according to the cell-type-specific chromatin signature, and we further corroborated this pattern on the map by EP300 (also known as p300) occupancy. New candidate cell-type-specific enhancers were identified for multiple ENCODE cell types in this way, along with new candidates for ubiquitous enhancer activity. An interactive web interface was developed to allow users to visualize and custom-mine the ENCODE SOM. We conclude that large SOMs trained on chromatin data from multiple cell types provide a powerful way to identify complex relationships in genomic data at user-selected levels of granularity. PMID:24170599

  13. Molecular Epidemiology and Clinical Impact of Acinetobacter calcoaceticus-baumannii Complex in a Belgian Burn Wound Center

    PubMed Central

    Bilocq, Florence; Jennes, Serge; Verbeken, Gilbert; Rose, Thomas; Keersebilck, Elkana; Bosmans, Petra; Pieters, Thierry; Hing, Mony; Heuninckx, Walter; De Pauw, Frank; Soentjens, Patrick; Merabishvili, Maia; Deschaght, Pieter; Vaneechoutte, Mario; Bogaerts, Pierre; Glupczynski, Youri; Pot, Bruno; van der Reijden, Tanny J.; Dijkshoorn, Lenie

    2016-01-01

    Multidrug resistant Acinetobacter baumannii and its closely related species A. pittii and A. nosocomialis, all members of the Acinetobacter calcoaceticus-baumannii (Acb) complex, are a major cause of hospital acquired infection. In the burn wound center of the Queen Astrid military hospital in Brussels, 48 patients were colonized or infected with Acb complex over a 52-month period. We report the molecular epidemiology of these organisms, their clinical impact and infection control measures taken. A representative set of 157 Acb complex isolates was analyzed using repetitive sequence-based PCR (rep-PCR) (DiversiLab) and a multiplex PCR targeting OXA-51-like and OXA-23-like genes. We identified 31 rep-PCR genotypes (strains). Representatives of each rep-type were identified to species by rpoB sequence analysis: 13 types to A. baumannii, 10 to A. pittii, and 3 to A. nosocomialis. It was assumed that isolates that belonged to the same rep-type also belonged to the same species. Thus, 83.4% of all isolates were identified to A. baumannii, 9.6% to A. pittii and 4.5% to A. nosocomialis. We observed 12 extensively drug resistant Acb strains (10 A. baumannii and 2 A. nosocomialis), all carbapenem-non-susceptible/colistin-susceptible and imported into the burn wound center through patients injured in North Africa. The two most prevalent rep-types 12 and 13 harbored an OXA-23-like gene. Multilocus sequence typing allocated them to clonal complex 1 corresponding to EU (international) clone I. Both strains caused consecutive outbreaks, interspersed with periods of apparent eradication. Patients infected with carbapenem resistant A. baumannii were successfully treated with colistin/rifampicin. Extensive infection control measures were required to eradicate the organisms. Acinetobacter infection and colonization was not associated with increased attributable mortality. PMID:27223476

  14. Recent Evolutionary Radiation and Host Plant Specialization in the Xylella fastidiosa Subspecies Native to the United States

    PubMed Central

    Vickerman, Danel B.; Bromley, Robin E.; Russell, Stephanie A.; Hartman, John R.; Morano, Lisa D.; Stouthamer, Richard

    2013-01-01

    The bacterial pathogen, Xylella fastidiosa, infects many plant species in the Americas, making it a good model for investigating the genetics of host adaptation. We used multilocus sequence typing (MLST) to identify isolates of the native U.S. subsp. multiplex that were largely unaffected by intersubspecific homologous recombination (IHR) and to investigate how their evolutionary history influences plant host specialization. We identified 110 “non-IHR” isolates, 2 minimally recombinant “intermediate” ones (including the subspecific type), and 31 with extensive IHR. The non-IHR and intermediate isolates defined 23 sequence types (STs) which we used to identify 22 plant hosts (73% trees) characteristic of the subspecies. Except for almond, subsp. multiplex showed no host overlap with the introduced subspecies (subspecies fastidiosa and sandyi). MLST sequences revealed that subsp. multiplex underwent recent radiation (<25% of subspecies age) which included only limited intrasubspecific recombination (ρ/θ = 0.02); only one isolated lineage (ST50 from ash) was older. A total of 20 of the STs grouped into three loose phylogenetic clusters distinguished by nonoverlapping hosts (excepting purple leaf plum): “almond,” “peach,” and “oak” types. These host differences were not geographical, since all three types also occurred in California. ST designation was a good indicator of host specialization. ST09, widespread in the southeastern United States, only infected oak species, and all peach isolates were ST10 (from California, Florida, and Georgia). Only ST23 had a broad host range. Hosts of related genotypes were sometimes related, but often host groupings crossed plant family or even order, suggesting that phylogenetically plastic features of hosts affect bacterial pathogenicity. PMID:23354698

  15. Recent evolutionary radiation and host plant specialization in the Xylella fastidiosa subspecies native to the United States.

    PubMed

    Nunney, Leonard; Vickerman, Danel B; Bromley, Robin E; Russell, Stephanie A; Hartman, John R; Morano, Lisa D; Stouthamer, Richard

    2013-04-01

    The bacterial pathogen, Xylella fastidiosa, infects many plant species in the Americas, making it a good model for investigating the genetics of host adaptation. We used multilocus sequence typing (MLST) to identify isolates of the native U.S. subsp. multiplex that were largely unaffected by intersubspecific homologous recombination (IHR) and to investigate how their evolutionary history influences plant host specialization. We identified 110 "non-IHR" isolates, 2 minimally recombinant "intermediate" ones (including the subspecific type), and 31 with extensive IHR. The non-IHR and intermediate isolates defined 23 sequence types (STs) which we used to identify 22 plant hosts (73% trees) characteristic of the subspecies. Except for almond, subsp. multiplex showed no host overlap with the introduced subspecies (subspecies fastidiosa and sandyi). MLST sequences revealed that subsp. multiplex underwent recent radiation (<25% of subspecies age) which included only limited intrasubspecific recombination (ρ/θ = 0.02); only one isolated lineage (ST50 from ash) was older. A total of 20 of the STs grouped into three loose phylogenetic clusters distinguished by nonoverlapping hosts (excepting purple leaf plum): "almond," "peach," and "oak" types. These host differences were not geographical, since all three types also occurred in California. ST designation was a good indicator of host specialization. ST09, widespread in the southeastern United States, only infected oak species, and all peach isolates were ST10 (from California, Florida, and Georgia). Only ST23 had a broad host range. Hosts of related genotypes were sometimes related, but often host groupings crossed plant family or even order, suggesting that phylogenetically plastic features of hosts affect bacterial pathogenicity.

  16. Molecular Epidemiology and Clinical Impact of Acinetobacter calcoaceticus-baumannii Complex in a Belgian Burn Wound Center.

    PubMed

    De Vos, Daniel; Pirnay, Jean-Paul; Bilocq, Florence; Jennes, Serge; Verbeken, Gilbert; Rose, Thomas; Keersebilck, Elkana; Bosmans, Petra; Pieters, Thierry; Hing, Mony; Heuninckx, Walter; De Pauw, Frank; Soentjens, Patrick; Merabishvili, Maia; Deschaght, Pieter; Vaneechoutte, Mario; Bogaerts, Pierre; Glupczynski, Youri; Pot, Bruno; van der Reijden, Tanny J; Dijkshoorn, Lenie

    2016-01-01

    Multidrug resistant Acinetobacter baumannii and its closely related species A. pittii and A. nosocomialis, all members of the Acinetobacter calcoaceticus-baumannii (Acb) complex, are a major cause of hospital acquired infection. In the burn wound center of the Queen Astrid military hospital in Brussels, 48 patients were colonized or infected with Acb complex over a 52-month period. We report the molecular epidemiology of these organisms, their clinical impact and infection control measures taken. A representative set of 157 Acb complex isolates was analyzed using repetitive sequence-based PCR (rep-PCR) (DiversiLab) and a multiplex PCR targeting OXA-51-like and OXA-23-like genes. We identified 31 rep-PCR genotypes (strains). Representatives of each rep-type were identified to species by rpoB sequence analysis: 13 types to A. baumannii, 10 to A. pittii, and 3 to A. nosocomialis. It was assumed that isolates that belonged to the same rep-type also belonged to the same species. Thus, 83.4% of all isolates were identified to A. baumannii, 9.6% to A. pittii and 4.5% to A. nosocomialis. We observed 12 extensively drug resistant Acb strains (10 A. baumannii and 2 A. nosocomialis), all carbapenem-non-susceptible/colistin-susceptible and imported into the burn wound center through patients injured in North Africa. The two most prevalent rep-types 12 and 13 harbored an OXA-23-like gene. Multilocus sequence typing allocated them to clonal complex 1 corresponding to EU (international) clone I. Both strains caused consecutive outbreaks, interspersed with periods of apparent eradication. Patients infected with carbapenem resistant A. baumannii were successfully treated with colistin/rifampicin. Extensive infection control measures were required to eradicate the organisms. Acinetobacter infection and colonization was not associated with increased attributable mortality.

  17. Emergence of Pseudomonas aeruginosa with class 1 integron carrying blaVIM-2 and blaVIM-4 in the University Clinical Hospital of Bialystok (northeastern Poland).

    PubMed

    Michalska-Falkowska, Anna; Sacha, Paweł Tomasz; Grześ, Henryk; Hauschild, Tomasz; Wieczorek, Piotr; Ojdana, Dominika; Tryniszewska, Elżbieta Anna

    2017-07-11

    The effectiveness of carbapenems, considered as last-resort antimicrobials in severe infections, becomes compromised by bacterial resistance. The production of metallo-β-lactamases (MBLs) is the most significant threat to carbapenems activity among Pseudomonas aeruginosa. The aim of this study was to assess the presence and type of MBLs genes in carbapenem-resistant P. aeruginosa clinical strains, to identify the location of MBLs genes and to determine genetic relatedness between MBL-producers using pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST). The first identified MBL-positive (with blaVIM genes) P. aeruginosa strains were isolated from patients hospitalized in the University Clinical Hospital of Bialystok in the period from September 2012 to December 2013. Variants of MBLs genes and variable integron regions were characterized by PCR and sequencing. PFGE was performed after digesting of bacterial genomes by XbaI enzyme. By MLST seven housekeeping genes were analyzed for the determination of sequence type (ST). Three strains carried the blaVIM-2 gene and one harbored the blaVIM-4 gene. The blaVIM genes resided within class 1 integrons. PCR mapping of integrons revealed the presence of four different cassette arrays. Genetic relatedness analysis by PFGE classified VIM-positive strains into four unrelated pulsotypes (A-D). MLST demonstrated the presence of four (ST 111, ST27, and ST17) different sequence type including one previously undescribed new type of ST 2342. Antimicrobial susceptibility testing showed that VIM-positive strains were resistant to carbapenems, cephalosporins, aminoglycosides, and quinolones, intermediate to aztreonam, and susceptible only to colistin. Integrons mapping, PFGE, and MLST results may point to different origin of these strains and independent introduction into hospitalized patients.

  18. Nucleic acid analysis using terminal-phosphate-labeled nucleotides

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

    2008-04-22

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  19. Development and application of a multilocus sequence analysis method for the identification of genotypes within genus Bradyrhizobium and for establishing nodule occupancy of soybean (Glycine max L. Merr)

    USDA-ARS?s Scientific Manuscript database

    A Multilocus Sequence Typing (MLST) method based on allelic variation of 7 chromosomal loci was developed for characterizing genotypes within the genus Bradyrhizobium. With the method 29 distinct multilocus genotypes (GTs) were identified among 191 culture collection soybean strains. The occupancy ...

  20. Making sense of deep sequencing

    PubMed Central

    Goldman, D.; Domschke, K.

    2016-01-01

    This review, the first of an occasional series, tries to make sense of the concepts and uses of deep sequencing of polynucleic acids (DNA and RNA). Deep sequencing, synonymous with next-generation sequencing, high-throughput sequencing and massively parallel sequencing, includes whole genome sequencing but is more often and diversely applied to specific parts of the genome captured in different ways, for example the highly expressed portion of the genome known as the exome and portions of the genome that are epigenetically marked either by DNA methylation, the binding of proteins including histones, or that are in different configurations and thus more or less accessible to enzymes that cleave DNA. Deep sequencing of RNA (RNASeq) reverse-transcribed to complementary DNA is invaluable for measuring RNA expression and detecting changes in RNA structure. Important concepts in deep sequencing include the length and depth of sequence reads, mapping and assembly of reads, sequencing error, haplotypes, and the propensity of deep sequencing, as with other types of ‘big data’, to generate large numbers of errors, requiring monitoring for methodologic biases and strategies for replication and validation. Deep sequencing yields a unique genetic fingerprint that can be used to identify a person, and a trove of predictors of genetic medical diseases. Deep sequencing to identify epigenetic events including changes in DNA methylation and RNA expression can reveal the history and impact of environmental exposures. Because of the power of sequencing to identify and deliver biomedically significant information about a person and their blood relatives, it creates ethical dilemmas and practical challenges in research and clinical care, for example the decision and procedures to report incidental findings that will increasingly and frequently be discovered. PMID:24925306

  1. Haemophilus influenzae Type b Carriage and Novel Bacterial Population Structure among Children in Urban Kathmandu, Nepal▿

    PubMed Central

    Williams, E. J.; Lewis, J.; John, T.; Hoe, J. C.; Yu, L.; Dongol, S.; Kelly, D. F.; Griffiths, D. T.; Shah, A.; Limbu, B.; Pradhan, R.; Mawas, F.; Shrestha, S.; Thorson, S.; Werno, A. M.; Murdoch, D. R.; Adhikari, N.; Pollard, A. J.

    2011-01-01

    Haemophilus influenzae type b (Hib) is a major cause of invasive bacterial infection in children that can be prevented by a vaccine, but there is still uncertainty about its relative importance in Asia. This study investigated the age-specific prevalence of Hib carriage and its molecular epidemiology in carriage and disease in Nepal. Oropharyngeal swabs were collected from children in Kathmandu, Nepal, from 3 different settings: a hospital outpatient department (OPD), schools, and children's homes. Hib was isolated using Hib antiserum agar plates, and serotyping was performed with latex agglutination. Hib isolates from children with invasive disease were obtained during active microbiological surveillance at Patan Hospital, Kathmandu, Nepal. Genotyping of disease and carriage isolates was undertaken using multilocus sequence typing (MLST). Swabs were taken from 2,195 children, including 1,311 children at an OPD, 647 children attending schools, and 237 children in homes. Overall, Hib was identified in 5.0% (110/2,195; 95% confidence interval [95% CI], 3.9% to 6.4%). MLST was performed on 108 Hib isolates from children carrying Hib isolates and 15 isolates from children with invasive disease. Thirty-one sequence types (STs) were identified, and 20 of these were novel STs. The most common ST isolates were sequence type 6 (ST6) and the novel ST722. There was marked heterogeneity among the STs from children with disease and children carrying Hib. STs identified from invasive infections were those commonly identified in carriage. This study provides evidence of Hib carriage among children in urban Nepal with genetically diverse strains prior to introduction of universal vaccination. The Hib carriage rate in Nepal was similar to the rates observed in other populations with documented high disease rates prior to vaccination, supporting implementation of Hib vaccine in Nepal in 2009. PMID:21270225

  2. Systematics of Cladophora spp. (Chlorophyta) from North Carolina, USA, based upon morphology and DNA sequence data with a description of Cladophora subtilissima sp. nov.

    PubMed

    Taylor, Robin L; Bailey, Jeffrey Craig; Freshwater, David Wilson

    2017-06-01

    Identification of Cladophora species is challenging due to conservation of gross morphology, few discrete autapomorphies, and environmental influences on morphology. Twelve species of marine Cladophora were reported from North Carolina waters. Cladophora specimens were collected from inshore and offshore marine waters for DNA sequence and morphological analyses. The nuclear-encoded rRNA internal transcribed spacer regions (ITS) were sequenced for 105 specimens and used in molecular assisted identification. The ITS1 and ITS2 region was highly variable, and sequences were sorted into ITS Sets of Alignable Sequences (SASs). Sequencing of short hyper-variable ITS1 sections from Cladophora type specimens was used to positively identify species represented by SASs when the types were made available. Secondary structures for the ITS1 locus were also predicted for each specimen and compared to predicted structures from Cladophora sequences available in GenBank. Nine ITS SASs were identified and representative specimens chosen for phylogenetic analyses of 18S and 28S rRNA gene sequences to reveal relationships with other Cladophora species. Phylogenetic analyses indicated that marine Cladophorales were polyphyletic and separated into two clades, the Cladophora clade and the "Siphonocladales" clade. Morphological analyses were performed to assess the consistency of character states within species, and complement the DNA sequence analyses. These analyses revealed intra- and interspecific character state variation, and that combined molecular and morphological analyses were required for the identification of species. One new report, Cladophora dotyana, and one new species Cladophora subtilissima sp. nov., were revealed, and increased the biodiversity of North Carolina marine Cladophora to 14 species. © 2017 Phycological Society of America.

  3. Neisseria gonorrhoeae Sequence Typing for Antimicrobial Resistance, a Novel Antimicrobial Resistance Multilocus Typing Scheme for Tracking Global Dissemination of N. gonorrhoeae Strains.

    PubMed

    Demczuk, W; Sidhu, S; Unemo, M; Whiley, D M; Allen, V G; Dillon, J R; Cole, M; Seah, C; Trembizki, E; Trees, D L; Kersh, E N; Abrams, A J; de Vries, H J C; van Dam, A P; Medina, I; Bharat, A; Mulvey, M R; Van Domselaar, G; Martin, I

    2017-05-01

    A curated Web-based user-friendly sequence typing tool based on antimicrobial resistance determinants in Neisseria gonorrhoeae was developed and is publicly accessible (https://ngstar.canada.ca). The N. gonorrhoeae Sequence Typing for Antimicrobial Resistance (NG-STAR) molecular typing scheme uses the DNA sequences of 7 genes ( penA , mtrR , porB , ponA , gyrA , parC , and 23S rRNA) associated with resistance to β-lactam antimicrobials, macrolides, or fluoroquinolones. NG-STAR uses the entire penA sequence, combining the historical nomenclature for penA types I to XXXVIII with novel nucleotide sequence designations; the full mtrR sequence and a portion of its promoter region; portions of ponA , porB , gyrA , and parC ; and 23S rRNA sequences. NG-STAR grouped 768 isolates into 139 sequence types (STs) ( n = 660) consisting of 29 clonal complexes (CCs) having a maximum of a single-locus variation, and 76 NG-STAR STs ( n = 109) were identified as unrelated singletons. NG-STAR had a high Simpson's diversity index value of 96.5% (95% confidence interval [CI] = 0.959 to 0.969). The most common STs were NG-STAR ST-90 ( n = 100; 13.0%), ST-42 and ST-91 ( n = 45; 5.9%), ST-64 ( n = 44; 5.72%), and ST-139 ( n = 42; 5.5%). Decreased susceptibility to azithromycin was associated with NG-STAR ST-58, ST-61, ST-64, ST-79, ST-91, and ST-139 ( n = 156; 92.3%); decreased susceptibility to cephalosporins was associated with NG-STAR ST-90, ST-91, and ST-97 ( n = 162; 94.2%); and ciprofloxacin resistance was associated with NG-STAR ST-26, ST-90, ST-91, ST-97, ST-150, and ST-158 ( n = 196; 98.0%). All isolates of NG-STAR ST-42, ST-43, ST-63, ST-81, and ST-160 ( n = 106) were susceptible to all four antimicrobials. The standardization of nomenclature associated with antimicrobial resistance determinants through an internationally available database will facilitate the monitoring of the global dissemination of antimicrobial-resistant N. gonorrhoeae strains. © Crown copyright 2017.

  4. Unconventional P-35S sequence identified in genetically modified maize

    PubMed Central

    Al-Hmoud, Nisreen; Al-Husseini, Nawar; Ibrahim-Alobaide, Mohammed A; Kübler, Eric; Farfoura, Mahmoud; Alobydi, Hytham; Al-Rousan, Hiyam

    2014-01-01

    The Cauliflower Mosaic Virus 35S promoter sequence, CaMV P-35S, is one of several commonly used genetic targets to detect genetically modified maize and is found in most GMOs. In this research we report the finding of an alternative P-35S sequence and its incidence in GM maize marketed in Jordan. The primer pair normally used to amplify a 123 bp DNA fragment of the CaMV P-35S promoter in GMOs also amplified a previously undetected alternative sequence of CaMV P-35S in GM maize samples which we term V3. The amplified V3 sequence comprises 386 base pairs and was not found in the standard wild-type maize, MON810 and MON 863 GM maize. The identified GM maize samples carrying the V3 sequence were found free of CaMV when compared with CaMV infected brown mustard sample. The data of sequence alignment analysis of the V3 genetic element showed 90% similarity with the matching P-35S sequence of the cauliflower mosaic virus isolate CabbB-JI and 99% similarity with matching P-35S sequences found in several binary plant vectors, of which the binary vector locus JQ693018 is one example. The current study showed an increase of 44% in the incidence of the identified 386 bp sequence in GM maize sold in Jordan’s markets during the period 2009 and 2012. PMID:24495911

  5. Diversity of Group I and II Clostridium botulinum Strains from France Including Recently Identified Subtypes.

    PubMed

    Mazuet, Christelle; Legeay, Christine; Sautereau, Jean; Ma, Laurence; Bouchier, Christiane; Bouvet, Philippe; Popoff, Michel R

    2016-06-13

    In France, human botulism is mainly food-borne intoxication, whereas infant botulism is rare. A total of 99 group I and II Clostridium botulinum strains including 59 type A (12 historical isolates [1947-1961], 43 from France [1986-2013], 3 from other countries, and 1 collection strain), 31 type B (3 historical, 23 recent isolates, 4 from other countries, and 1 collection strain), and 9 type E (5 historical, 3 isolates, and 1 collection strain) were investigated by botulinum locus gene sequencing and multilocus sequence typing analysis. Historical C. botulinum A strains mainly belonged to subtype A1 and sequence type (ST) 1, whereas recent strains exhibited a wide genetic diversity: subtype A1 in orfX or ha locus, A1(B), A1(F), A2, A2b2, A5(B2') A5(B3'), as well as the recently identified A7 and A8 subtypes, and were distributed into 25 STs. Clostridium botulinum A1(B) was the most frequent subtype from food-borne botulism and food. Group I C. botulinum type B in France were mainly subtype B2 (14 out of 20 historical and recent strains) and were divided into 19 STs. Food-borne botulism resulting from ham consumption during the recent period was due to group II C. botulinum B4. Type E botulism is rare in France, 5 historical and 1 recent strains were subtype E3. A subtype E12 was recently identified from an unusual ham contamination. Clostridium botulinum strains from human botulism in France showed a wide genetic diversity and seems to result not from a single evolutionary lineage but from multiple and independent genetic rearrangements. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  6. MLST-Based Population Genetic Analysis in a Global Context Reveals Clonality amongst Cryptococcus neoformans var. grubii VNI Isolates from HIV Patients in Southeastern Brazil

    PubMed Central

    Ferreira-Paim, Kennio; Andrade-Silva, Leonardo; Fonseca, Fernanda M.; Ferreira, Thatiana B.; Mora, Delio J.; Andrade-Silva, Juliana; Khan, Aziza; Dao, Aiken; Reis, Eduardo C.; Almeida, Margarete T. G.; Maltos, Andre; Junior, Virmondes R.; Trilles, Luciana; Rickerts, Volker; Chindamporn, Ariya; Sykes, Jane E.; Cogliati, Massimo; Nielsen, Kirsten; Boekhout, Teun; Fisher, Matthew; Kwon-Chung, June; Engelthaler, David M.; Lazéra, Marcia; Meyer, Wieland; Silva-Vergara, Mario L.

    2017-01-01

    Cryptococcosis is an important fungal infection in immunocompromised individuals, especially those infected with HIV. In Brazil, despite the free availability of antiretroviral therapy (ART) in the public health system, the mortality rate due to Cryptococcus neoformans meningitis is still high. To obtain a more detailed picture of the population genetic structure of this species in southeast Brazil, we studied 108 clinical isolates from 101 patients and 35 environmental isolates. Among the patients, 59% had a fatal outcome mainly in HIV-positive male patients. All the isolates were found to be C. neoformans var. grubii major molecular type VNI and mating type locus alpha. Twelve were identified as diploid by flow cytometry, being homozygous (AαAα) for the mating type and by PCR screening of the STE20, GPA1, and PAK1 genes. Using the ISHAM consensus multilocus sequence typing (MLST) scheme, 13 sequence types (ST) were identified, with one being newly described. ST93 was identified from 81 (75%) of the clinical isolates, while ST77 and ST93 were identified from 19 (54%) and 10 (29%) environmental isolates, respectively. The southeastern Brazilian isolates had an overwhelming clonal population structure. When compared with populations from different continents based on data extracted from the ISHAM-MLST database (mlst.mycologylab.org) they showed less genetic variability. Two main clusters within C. neoformans var. grubii VNI were identified that diverged from VNB around 0.58 to 4.8 million years ago. PMID:28099434

  7. Genetic diversity and virulence profiles of Listeria monocytogenes recovered from bulk tank milk, milk filters, and milking equipment from dairies in the United States (2002 to 2014).

    PubMed

    Kim, Seon Woo; Haendiges, Julie; Keller, Eric N; Myers, Robert; Kim, Alexander; Lombard, Jason E; Karns, Jeffrey S; Van Kessel, Jo Ann S; Haley, Bradd J

    2018-01-01

    Unpasteurized dairy products are known to occasionally harbor Listeria monocytogenes and have been implicated in recent listeriosis outbreaks and numerous sporadic cases of listeriosis. However, the diversity and virulence profiles of L. monocytogenes isolates recovered from these products have not been fully described. Here we report a genomic analysis of 121 L. monocytogenes isolates recovered from milk, milk filters, and milking equipment collected from bovine dairy farms in 19 states over a 12-year period. In a multi-virulence-locus sequence typing (MVLST) analysis, 59 Virulence Types (VT) were identified, of which 25% were Epidemic Clones I, II, V, VI, VII, VIII, IX, or X, and 31 were novel VT. In a multi-locus sequence typing (MLST) analysis, 60 Sequence Types (ST) of 56 Clonal Complexes (CC) were identified. Within lineage I, CC5 and CC1 were among the most abundant, and within lineage II, CC7 and CC37 were the most abundant. Multiple CCs previously associated with central nervous system and maternal-neonatal infections were identified. A genomic analysis identified variable distribution of virulence markers, Listeria pathogenicity islands (LIPI) -1, -3, and -4, and stress survival island-1 (SSI-1). Of these, 14 virulence markers, including LIPI-3 and -4 were more frequently detected in one lineage (I or II) than the other. LIPI-3 and LIPI-4 were identified in 68% and 28% of lineage I CCs, respectively. Results of this analysis indicate that there is a high level of genetic diversity among the L. monocytogenes present in bulk tank milk in the United States with some strains being more frequently detected than others, and some being similar to those that have been isolated from previous non-dairy related outbreaks. Results of this study also demonstrate significant number of strains isolated from dairy farms encode virulence markers associated with severe human disease.

  8. Clostridium difficile: Investigating Transmission Patterns between Infected and Colonized Patients using whole Genome Sequencing.

    PubMed

    Kong, L Y; Eyre, D W; Corbeil, J; Raymond, F; Walker, A S; Wilcox, M H; Crook, D W; Michaud, S; Toye, B; Frost, E; Dendukuri, N; Schiller, I; Bourgault, A M; Dascal, A; Oughton, M; Longtin, Y; Poirier, L; Brassard, P; Turgeon, N; Gilca, R; Loo, V G

    2018-05-28

    Whole genome sequencing (WGS) studies can enhance our understanding of the role of patients with asymptomatic Clostridium difficile colonization in transmission. Isolates obtained from patients with Clostridium difficile infection (CDI) and colonization identified in a study conducted during 2006 - 2007 at six Canadian hospitals underwent typing by pulsed-field gel electrophoresis, multilocus sequence typing, and WGS. Isolates from incident CDI cases not in the initial study were also sequenced where possible. Ward movement and typing data were combined to identify plausible donors for each CDI case, as defined by shared time and space within predefined limits. Proportions of plausible donors for CDI cases that were colonized, infected, or both were examined. Five hundred and fifty-four isolates were sequenced successfully, 353 from colonized and 201 from CDI cases. The NAP1/027/ST1 strain was the most common strain, found in 124 (62%) of infected and 92 (26%) of colonized patients. A donor with a plausible ward link was found for 81 CDI cases (40%) using WGS with a threshold of ≤2 single nucleotide variants to determine relatedness. Sixty-five (32%) CDI cases could be linked to both infected and colonized donors. Exclusive linkages to infected and colonized donors were found for 28 (14%) and 12 (6%) CDI cases, respectively. Colonized patients contribute to transmission, but CDI cases are more likely linked to other infected patients than colonized patients in this cohort with high rates of NAP1/027/ST1 strain, highlighting the importance of local prevalence of virulent strains in determining transmission dynamics.

  9. Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases

    PubMed Central

    Schadt, Eric E.; Banerjee, Onureena; Fang, Gang; Feng, Zhixing; Wong, Wing H.; Zhang, Xuegong; Kislyuk, Andrey; Clark, Tyson A.; Luong, Khai; Keren-Paz, Alona; Chess, Andrew; Kumar, Vipin; Chen-Plotkin, Alice; Sondheimer, Neal; Korlach, Jonas; Kasarskis, Andrew

    2013-01-01

    Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types. PMID:23093720

  10. Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases.

    PubMed

    Schadt, Eric E; Banerjee, Onureena; Fang, Gang; Feng, Zhixing; Wong, Wing H; Zhang, Xuegong; Kislyuk, Andrey; Clark, Tyson A; Luong, Khai; Keren-Paz, Alona; Chess, Andrew; Kumar, Vipin; Chen-Plotkin, Alice; Sondheimer, Neal; Korlach, Jonas; Kasarskis, Andrew

    2013-01-01

    Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types.

  11. Fast neutron induced structural rearrangements at a soybean NAP1 locus result in gnarled trichomes

    USDA-ARS?s Scientific Manuscript database

    A soybean (Glycine max (L.) Merr.) gnarled trichome mutant, exhibiting stunted trichomes compared to wild-type, was identified in a fast neutron mutant population. Genetic mapping using whole genome sequence-based bulked segregant analysis identified a 26.6 megabase interval on chromosome 20 that ...

  12. Targeted therapy according to next generation sequencing-based panel sequencing.

    PubMed

    Saito, Motonobu; Momma, Tomoyuki; Kono, Koji

    2018-04-17

    Targeted therapy against actionable gene mutations shows a significantly higher response rate as well as longer survival compared to conventional chemotherapy, and has become a standard therapy for many cancers. Recent progress in next-generation sequencing (NGS) has enabled to identify huge number of genetic aberrations. Based on sequencing results, patients recommend to undergo targeted therapy or immunotherapy. In cases where there are no available approved drugs for the genetic mutations detected in the patients, it is recommended to be facilitate the registration for the clinical trials. For that purpose, a NGS-based sequencing panel that can simultaneously target multiple genes in a single investigation has been used in daily clinical practice. To date, various types of sequencing panels have been developed to investigate genetic aberrations with tumor somatic genome variants (gain-of-function or loss-of-function mutations, high-level copy number alterations, and gene fusions) through comprehensive bioinformatics. Because sequencing panels are efficient and cost-effective, they are quickly being adopted outside the lab, in hospitals and clinics, in order to identify personal targeted therapy for individual cancer patients.

  13. Complete genome sequence of the filamentous gliding predatory bacterium Herpetosiphon aurantiacus type strain (114-95T)

    PubMed Central

    Kiss, Hajnalka; Nett, Markus; Domin, Nicole; Martin, Karin; Maresca, Julia A.; Copeland, Alex; Lapidus, Alla; Lucas, Susan; Berry, Kerrie W.; Glavina Del Rio, Tijana; Dalin, Eileen; Tice, Hope; Pitluck, Sam; Richardson, Paul; Bruce, David; Goodwin, Lynne; Han, Cliff; Detter, John C.; Schmutz, Jeremy; Brettin, Thomas; Land, Miriam; Hauser, Loren; Kyrpides, Nikos C.; Ivanova, Natalia; Göker, Markus; Woyke, Tanja; Klenk, Hans-Peter; Bryant, Donald A.

    2011-01-01

    Herpetosiphon aurantiacus Holt and Lewin 1968 is the type species of the genus Herpetosiphon, which in turn is the type genus of the family Herpetosiphonaceae, type family of the order Herpetosiphonales in the phylum Chloroflexi. H. aurantiacus cells are organized in filaments which can rapidly glide. The species is of interest not only because of its rather isolated position in the tree of life, but also because Herpetosiphon ssp. were identified as predators capable of facultative predation by a wolf pack strategy and of degrading the prey organisms by excreted hydrolytic enzymes. The genome of H. aurantiacus strain 114-95T is the first completely sequenced genome of a member of the family Herpetosiphonaceae. The 6,346,587 bp long chromosome and the two 339,639 bp and 99,204 bp long plasmids with a total of 5,577 protein-coding and 77 RNA genes was sequenced as part of the DOE Joint Genome Institute Program DOEM 2005. PMID:22675585

  14. mec-associated dru typing in the epidemiological analysis of ST239 MRSA in Malaysia.

    PubMed

    Ghaznavi-Rad, E; Goering, R V; Nor Shamsudin, M; Weng, P L; Sekawi, Z; Tavakol, M; van Belkum, A; Neela, V

    2011-11-01

    The usefulness of mec-associated dru typing in the epidemiological analysis of methicillin-resistant Staphylococcus aureus (MRSA) isolated in Malaysia was investigated and compared with pulsed-field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and spa and SCCmec typing. The isolates studied included all MRSA types in Malaysia. Multilocus sequence type ST188 and ST1 isolates were highly clonal by all typing methods. However, the dru typing of ST239 isolates produced the clearest discrimination between SCCmec IIIa and III isolates, yielding more subtypes than any other method. Evaluation of the discriminatory power for each method identified dru typing and PFGE as the most discriminatory, with Simpson's index of diversity (SID) values over 89%, including an isolate which was non-typeable by spa, but dru-typed as dt13j. The discriminatory ability of dru typing, especially with closely related MRSA ST239 strains (e.g., Brazilian and Hungarian), underscores its utility as a tool for the epidemiological investigation of MRSA.

  15. Novel ZBTB24 Mutation Associated with Immunodeficiency, Centromere Instability, and Facial Anomalies Type-2 Syndrome Identified in a Patient with Very Early Onset Inflammatory Bowel Disease.

    PubMed

    Conrad, Máire A; Dawany, Noor; Sullivan, Kathleen E; Devoto, Marcella; Kelsen, Judith R

    2017-12-01

    Very early onset inflammatory bowel disease, diagnosed in children ≤5 years old, can be the initial presentation of some primary immunodeficiencies. In this study, we describe a 17-month-old boy with recurrent infections, growth failure, facial anomalies, and inflammatory bowel disease. Immune evaluation, whole-exome sequencing, karyotyping, and methylation array were performed to evaluate the child's constellation of symptoms and examination findings. Whole-exome sequencing revealed that the child was homozygous for a novel variant in ZBTB24, the gene associated with immunodeficiency, centromere instability, and facial anomalies type-2 syndrome. This describes the first case of inflammatory bowel disease associated with immunodeficiency, centromere instability, and facial anomalies type-2 syndrome in a child with a novel disease-causing mutation in ZBTB24 found on whole-exome sequencing.

  16. [Mutational frequencies in usherin(USH2A gene) in 26 Colombian individuals with Usher syndrome type II].

    PubMed

    López, Greizy; Gelvez, Nancy Yaneth; Tamayo, Martalucía

    2011-03-01

    Usher syndrome is a disorder characterized by progressive retinitis pigmentosa, prelingual sensory hearing loss and vestibular dysfunction. It is the most frequent cause of deaf-blindness in humans. Three clinical types and twelve genetic subtypes have been characterized. Type II is the most common, and among these cases, nearly 80% have mutations in the USH2A gene. The aim of the study was to establish the mutational frequencies for the short isoform of USH2A gene in Usher syndrome type II. Twenty-six Colombian individuals with Usher syndrome type II were included. SSCP analysis for 20 exons of the short isoform was performed and abnormal patterns were sequenced. Sequencing of exon 13 of the USH2A gene was performed for all the individuals because the most frequent mutation is located in this exon. The most frequent mutation was c.2299delG, identified in the 27% (n=8) of the sample. The second mutation, p.R334W, showed a frequency of 15%. A new variant identified in the 5’UTR region, g.129G>T, was present in 1 individual (4%). Four polymorphisms were identified; one of them is a new deletion in exon 20, first reported in this study. Mutations in the usherin short isoform were identified in 38% of a sample of 26 USH2 cases. Molecular diagnosis was established in 7 of the 26.

  17. VWF mutations and new sequence variations identified in healthy controls are more frequent in the African-American population.

    PubMed

    Bellissimo, Daniel B; Christopherson, Pamela A; Flood, Veronica H; Gill, Joan Cox; Friedman, Kenneth D; Haberichter, Sandra L; Shapiro, Amy D; Abshire, Thomas C; Leissinger, Cindy; Hoots, W Keith; Lusher, Jeanne M; Ragni, Margaret V; Montgomery, Robert R

    2012-03-01

    Diagnosis and classification of VWD is aided by molecular analysis of the VWF gene. Because VWF polymorphisms have not been fully characterized, we performed VWF laboratory testing and gene sequencing of 184 healthy controls with a negative bleeding history. The controls included 66 (35.9%) African Americans (AAs). We identified 21 new sequence variations, 13 (62%) of which occurred exclusively in AAs and 2 (G967D, T2666M) that were found in 10%-15% of the AA samples, suggesting they are polymorphisms. We identified 14 sequence variations reported previously as VWF mutations, the majority of which were type 1 mutations. These controls had VWF Ag levels within the normal range, suggesting that these sequence variations might not always reduce plasma VWF levels. Eleven mutations were found in AAs, and the frequency of M740I, H817Q, and R2185Q was 15%-18%. Ten AA controls had the 2N mutation H817Q; 1 was homozygous. The average factor VIII level in this group was 99 IU/dL, suggesting that this variation may confer little or no clinical symptoms. This study emphasizes the importance of sequencing healthy controls to understand ethnic-specific sequence variations so that asymptomatic sequence variations are not misidentified as mutations in other ethnic or racial groups.

  18. Applications of alignment-free methods in epigenomics.

    PubMed

    Pinello, Luca; Lo Bosco, Giosuè; Yuan, Guo-Cheng

    2014-05-01

    Epigenetic mechanisms play an important role in the regulation of cell type-specific gene activities, yet how epigenetic patterns are established and maintained remains poorly understood. Recent studies have supported a role of DNA sequences in recruitment of epigenetic regulators. Alignment-free methods have been applied to identify distinct sequence features that are associated with epigenetic patterns and to predict epigenomic profiles. Here, we review recent advances in such applications, including the methods to map DNA sequence to feature space, sequence comparison and prediction models. Computational studies using these methods have provided important insights into the epigenetic regulatory mechanisms.

  19. Genotype and biotype of invasive Anopheles stephensi in Mannar Island of Sri Lanka.

    PubMed

    Surendran, Sinnathamby N; Sivabalakrishnan, Kokila; Gajapathy, Kanapathy; Arthiyan, Sivasingham; Jayadas, Tibutius T P; Karvannan, Kalingarajah; Raveendran, Selvarajah; Parakrama Karunaratne, S H P; Ramasamy, Ranjan

    2018-01-03

    Anopheles stephensi, the major vector of urban malaria in India, was recently detected for the first time in Sri Lanka in Mannar Island on the northwestern coast. Since there are different biotypes of An. stephensi with different vector capacities in India, a study was undertaken to further characterise the genotype and biotype of An. stephensi in Mannar Island. Mosquito larvae were collected in Pesalai village in Mannar and maintained in the insectary until adulthood. Adult An. stephensi were identified morphologically using published keys. Identified adult An. stephensi were molecularly characterized using two mitochondrial (cox1 and cytb) and one nuclear (ITS2) markers. Their PCR-amplified target fragments were sequenced and checked against available sequences in GenBank for phylogenetic analysis. The average spiracular and thoracic lengths and the spiracular index were determined to identify biotypes based on corresponding indices for Indian An. stephensi. All DNA sequences for the Mannar samples matched reported sequences for An. stephensi from the Middle East and India. However, a single nucleotide variation in the cox1 sequence suggested an amino acid change from valine to methionine in the cox1 protein in Sri Lankan An. stephensi. Morphological data was consistent with the presence of the Indian urban vector An. stephensi type-form in Sri Lanka. The present study provides a more detailed molecular characterization of An. stephensi and suggests the presence of the type-form of the vector for the first time in Sri Lanka. The single mutation in the cox1 gene may be indicative of a founder effect causing the initial diversification of An. stephensi in Sri Lanka from the Indian form. The distribution of the potent urban vector An. stephensi type-form needs to be established by studies throughout the island as its spread adds to the challenge of maintaining the country's malaria-free status.

  20. Clonal Transmission of Gram-Negative Bacteria with Carbapenemases NDM-1, VIM-1, and OXA-23/72 in a Bulgarian Hospital.

    PubMed

    Pfeifer, Yvonne; Trifonova, Angelina; Pietsch, Michael; Brunner, Magdalena; Todorova, Iva; Gergova, Ivanka; Wilharm, Gottfried; Werner, Guido; Savov, Encho

    2017-04-01

    We characterized 72 isolates with reduced susceptibility to carbapenems (50 Acinetobacter spp., 13 Proteus mirabilis, five Escherichia coli, one Morganella morganii, one Enterobacter cloacae, one Providencia rettgeri, and one Pseudomonas aeruginosa) from a hospital in Sofia, Bulgaria. Different β-lactamase genes were identified by polymerase chain reaction and sequencing. Bacterial strain typing was performed by enzymatic macrorestriction and pulsed-field gel electrophoresis (PFGE) typing as well as multilocus sequence typing for selected isolates. The majority of Acinetobacter baumannii (46/50) and one Acinetobacter pittii isolate harbored carbapenemase genes bla OXA-23 or bla OXA-72 ; two A. baumannii contained both genes. PFGE typing of all A. baumannii showed the presence of nine different clones belonging to eight sequence types ST350, ST208, ST436, ST437, ST449, ST231, ST502, and ST579. Molecular characterization of the remaining isolates confirmed the presence of one NDM-1-producing E. coli-ST101 clone (five isolates) and one P. mirabilis clone (13 isolates) with VIM-1 and CMY-99. Furthermore, NDM-1 was identified in P. rettgeri and M. morganii and VIM-2 in the P. aeruginosa isolate. The permanent introduction of OXA-23/72 carbapenemase-producing A. baumannii clones into the hospital and the repeated occurrence of one VIM-1-producing P. mirabilis and one NDM-1-producing E. coli-ST101 clone over a period of more than 1 year is of concern and requires intensified investigations.

  1. Genotyping of Chromobacterium violaceum isolates by recA PCR-RFLP analysis.

    PubMed

    Scholz, Holger Christian; Witte, Angela; Tomaso, Herbert; Al Dahouk, Sascha; Neubauer, Heinrich

    2005-03-15

    Intraspecies variation of Chromobacterium violaceum was examined by comparative sequence - and by restriction fragment length polymorphism analysis of the recombinase A gene (recA-PCR-RFLP). Primers deduced from the known recA gene sequence of the type strain C. violaceum ATCC 12472(T) allowed the specific amplification of a 1040bp recA fragment from each of the 13 C. violaceum strains investigated, whereas other closely related organisms tested negative. HindII-PstI-recA RFLP analysis generated from 13 representative C. violaceum strains enabled us to identify at least three different genospecies. In conclusion, analysis of the recA gene provides a rapid and robust nucleotide sequence-based approach to specifically identify and classify C. violaceum on genospecies level.

  2. [Multilocus sequence-typing for characterization of Moscow strains of Haemophilus influenzae type b].

    PubMed

    Platonov, A E; Mironov, K O; Iatsyshina, S B; Koroleva, I S; Platonova, O V; Gushchin, A E; Shipulin, G A

    2003-01-01

    Haemophilius influenzae, type b (Hib) bacteria, were genotyped by multilocus sequence typing (MLST) using 5 loci (adk, fucK, mdh, pgi, recA). 42 Moscow Hib strains (including 38 isolates form cerebrospinal fluid of children, who had purulent meningitis in 1999-2001, and 4 strains isolated from healthy carriers of Hib), as well as 2 strains from Yekaterinburg were studied. In MLST a strain is characterized, by alleles and their combinations (an allele profile) referred to also as sequence-type (ST). 9 Sts were identified within the Russian Hib bacteria: ST-1 was found in 25 strains (57%), ST-12 was found in 8 strains (18%), ST-11 was found in 4 strains (9%) and ST-15 was found in 2 strains (4.5%); all other STs strains (13, 14, 16, 17, 51) were found in isolated cases (2.3%). A comparison of allelic profiles and of nucleotide sequences showed that 93% of Russian isolates, i.e. strain with ST-1, 11, 12, 13, 15 and 17, belong to one and the same clonal complex. 2 isolates from Norway and Sweden from among 7 foreign Hib strains studied up to now can be described as belonging to the same clonal complex; 5 Hib strains were different from the Russian ones.

  3. DNA-based differentiation of the Ecuadorian cocoa types CCN-51 and Arriba based on sequence differences in the chloroplast genome.

    PubMed

    Herrmann, Luise; Haase, Ilka; Blauhut, Maike; Barz, Nadine; Fischer, Markus

    2014-12-17

    Two cocoa types, Arriba and CCN-51, are being cultivated in Ecuador. With regard to the unique aroma, Arriba is considered a fine cocoa type, while CCN-51 is a bulk cocoa because of its weaker aroma. Because it is being assumed that Arriba is mixed with CCN-51, there is an interest in the analytical differentiation of the two types. Two methods to identify CCN-51 adulterations in Arriba cocoa were developed on the basis of differences in the chloroplast DNA. On the one hand, a different repeat of the sequence TAAAG in the inverted repeat region results in a different length of amplicons for the two cocoa types, which can be detected by agarose gel electrophoresis, capillary gel electrophoresis, and denaturing high-performance liquid chromatography. On the other hand, single nucleotide polymorphisms (SNPs) between the CCN-51 and Arriba sequences represent restriction sites, which can be used for restriction fragment length polymorphism analysis. A semi-quantitative analysis based on these SNPs is feasible. A method for an exact quantitation based on these results is not realizable. These sequence variations were confirmed for a comprehensive cultivar collection of Arriba and CCN-51, for both bean and leaf samples.

  4. [Analysis of 4 clustered high risk acute flaccid paralysis cases in Shanxi Province in 2006].

    PubMed

    Yan, Dong-mei; Zhang, Yong; Wang, Dong-yan

    2010-04-01

    Analysis of epidemiology of 4 clustered high risk acute flaccid paralysis(AFP) cases reported by Shanxi province in 2006 and VP1 gene characteristic for type III poliovirus isolated from the four AFP cases. Virus isolation and identification were conducted according to the 4th edition of WHO polio laboratory manual. The sequence of VP1 region were amplified and sequenced. The phylogenetic trees based on VP1 region were constructed. Three of four high risk AFP cases were suspected as vaccine associated paralysis poliomyelitis (VAPP), the onset date of them were close. VP1 sequencing of the four type III isolates revealed that the identity were 99.7%, 99.9%, 99.4% and 99.9% respectively compared with vaccine reference strain-BJOPV3. According to WHO criteria, the four isolates were identified as type III vaccine-related poliovirus. Phylogenetic analysis based on VP1 coding sequence showed that the four type III poliovirus were not related significantly. The type III poliovirus isolated from 3 suspected VAPP cases shared one nucleotide mutation at 2637 (C-->U), which result in the amino acid mutation from Val into Ala. The improvement of laboratory surveillance for clustered high risk AFP cases should be strengthened so as to detect and prevent poliovirus circulation timely.

  5. Statistical Features of the 2010 Beni-Ilmane, Algeria, Aftershock Sequence

    NASA Astrophysics Data System (ADS)

    Hamdache, M.; Peláez, J. A.; Gospodinov, D.; Henares, J.

    2018-03-01

    The aftershock sequence of the 2010 Beni-Ilmane ( M W 5.5) earthquake is studied in depth to analyze the spatial and temporal variability of seismicity parameters of the relationships modeling the sequence. The b value of the frequency-magnitude distribution is examined rigorously. A threshold magnitude of completeness equal to 2.1, using the maximum curvature procedure or the changing point algorithm, and a b value equal to 0.96 ± 0.03 have been obtained for the entire sequence. Two clusters have been identified and characterized by their faulting type, exhibiting b values equal to 0.99 ± 0.05 and 1.04 ± 0.05. Additionally, the temporal decay of the aftershock sequence was examined using a stochastic point process. The analysis was done through the restricted epidemic-type aftershock sequence (RETAS) stochastic model, which allows the possibility to recognize the prevailing clustering pattern of the relaxation process in the examined area. The analysis selected the epidemic-type aftershock sequence (ETAS) model to offer the most appropriate description of the temporal distribution, which presumes that all events in the sequence can cause secondary aftershocks. Finally, the fractal dimensions are estimated using the integral correlation. The obtained D 2 values are 2.15 ± 0.01, 2.23 ± 0.01 and 2.17 ± 0.02 for the entire sequence, and for the first and second cluster, respectively. An analysis of the temporal evolution of the fractal dimensions D -2, D 0, D 2 and the spectral slope has been also performed to derive and characterize the different clusters included in the sequence.

  6. Application of High-Throughput Next-Generation Sequencing for HLA Typing on Buccal Extracted DNA: Results from over 10,000 Donor Recruitment Samples

    PubMed Central

    Nguyen, David; Valenzuela, Nicole; Takemura, Ping; Bolon, Yung-Tsi; Springer, Brianna; Saito, Katsuyuki; Zheng, Ying; Hague, Tim; Pasztor, Agnes; Horvath, Gyorgy; Rigo, Krisztina; Reed, Elaine F.; Zhang, Qiuheng

    2016-01-01

    Background Unambiguous HLA typing is important in hematopoietic stem cell transplantation (HSCT), HLA disease association studies, and solid organ transplantation. However, current molecular typing methods only interrogate the antigen recognition site (ARS) of HLA genes, resulting in many cis-trans ambiguities that require additional typing methods to resolve. Here we report high-resolution HLA typing of 10,063 National Marrow Donor Program (NMDP) registry donors using long-range PCR by next generation sequencing (NGS) approach on buccal swab DNA. Methods Multiplex long-range PCR primers amplified the full-length of HLA class I genes (A, B, C) from promotor to 3’ UTR. Class II genes (DRB1, DQB1) were amplified from exon 2 through part of exon 4. PCR amplicons were pooled and sheared using Covaris fragmentation. Library preparation was performed using the Illumina TruSeq Nano kit on the Beckman FX automated platform. Each sample was tagged with a unique barcode, followed by 2×250 bp paired-end sequencing on the Illumina MiSeq. HLA typing was assigned using Omixon Twin software that combines two independent computational algorithms to ensure high confidence in allele calling. Consensus sequence and typing results were reported in Histoimmunogenetics Markup Language (HML) format. All homozygous alleles were confirmed by Luminex SSO typing and exon novelties were confirmed by Sanger sequencing. Results Using this automated workflow, over 10,063 NMDP registry donors were successfully typed under high-resolution by NGS. Despite known challenges of nucleic acid degradation and low DNA concentration commonly associated with buccal-based specimens, 97.8% of samples were successfully amplified using long-range PCR. Among these, 98.2% were successfully reported by NGS, with an accuracy rate of 99.84% in an independent blind Quality Control audit performed by the NDMP. In this study, NGS-HLA typing identified 23 null alleles (0.023%), 92 rare alleles (0.091%) and 42 exon novelties (0.042%). Conclusion Long-range, unambiguous HLA genotyping is achievable on clinical buccal swab-extracted DNA. Importantly, full-length gene sequencing and the ability to curate full sequence data will permit future interrogation of the impact of introns, expanded exons, and other gene regulatory sequences on clinical outcomes in transplantation. PMID:27798706

  7. Application of High-Throughput Next-Generation Sequencing for HLA Typing on Buccal Extracted DNA: Results from over 10,000 Donor Recruitment Samples.

    PubMed

    Yin, Yuxin; Lan, James H; Nguyen, David; Valenzuela, Nicole; Takemura, Ping; Bolon, Yung-Tsi; Springer, Brianna; Saito, Katsuyuki; Zheng, Ying; Hague, Tim; Pasztor, Agnes; Horvath, Gyorgy; Rigo, Krisztina; Reed, Elaine F; Zhang, Qiuheng

    2016-01-01

    Unambiguous HLA typing is important in hematopoietic stem cell transplantation (HSCT), HLA disease association studies, and solid organ transplantation. However, current molecular typing methods only interrogate the antigen recognition site (ARS) of HLA genes, resulting in many cis-trans ambiguities that require additional typing methods to resolve. Here we report high-resolution HLA typing of 10,063 National Marrow Donor Program (NMDP) registry donors using long-range PCR by next generation sequencing (NGS) approach on buccal swab DNA. Multiplex long-range PCR primers amplified the full-length of HLA class I genes (A, B, C) from promotor to 3' UTR. Class II genes (DRB1, DQB1) were amplified from exon 2 through part of exon 4. PCR amplicons were pooled and sheared using Covaris fragmentation. Library preparation was performed using the Illumina TruSeq Nano kit on the Beckman FX automated platform. Each sample was tagged with a unique barcode, followed by 2×250 bp paired-end sequencing on the Illumina MiSeq. HLA typing was assigned using Omixon Twin software that combines two independent computational algorithms to ensure high confidence in allele calling. Consensus sequence and typing results were reported in Histoimmunogenetics Markup Language (HML) format. All homozygous alleles were confirmed by Luminex SSO typing and exon novelties were confirmed by Sanger sequencing. Using this automated workflow, over 10,063 NMDP registry donors were successfully typed under high-resolution by NGS. Despite known challenges of nucleic acid degradation and low DNA concentration commonly associated with buccal-based specimens, 97.8% of samples were successfully amplified using long-range PCR. Among these, 98.2% were successfully reported by NGS, with an accuracy rate of 99.84% in an independent blind Quality Control audit performed by the NDMP. In this study, NGS-HLA typing identified 23 null alleles (0.023%), 92 rare alleles (0.091%) and 42 exon novelties (0.042%). Long-range, unambiguous HLA genotyping is achievable on clinical buccal swab-extracted DNA. Importantly, full-length gene sequencing and the ability to curate full sequence data will permit future interrogation of the impact of introns, expanded exons, and other gene regulatory sequences on clinical outcomes in transplantation.

  8. Characterization of OXA-48-like-producing Enterobacteriaceae isolated from river water in Algeria.

    PubMed

    Tafoukt, Rima; Touati, Abdelaziz; Leangapichart, Thongpan; Bakour, Sofiane; Rolain, Jean-Marc

    2017-09-01

    The spread of carbapenemase-producing Enterobacteriaceae (CPE) is a significant problem for healthcare worldwide. The prevalence of carbapenem-resistant Enterobacteriaceae (CPE) in water environments in Algeria are unknown. The aim of this study was to screen for the presence of CPE isolates in the Soummam River in Bejaia, Algeria. Isolates of Enterobacteriaceae recovered from twelve samples of river water and showing reduced susceptibility to carbapenems were included in this study. The isolates were identified by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS). Isolates were subjected to antimicrobial susceptibility testing and the modified Carba NP test. Carbapenemase and extended-spectrum β-lactamase (ESBL) determinants were studied by PCR amplification and sequencing. The clonal relatedness between isolates was studied by Multilocus Sequence Typing (MLST) method. A total of 20 carbapenem-resistant Enterobacteriaceae strains were included in this study, identified as Escherichia coli (n = 12), Klebsiella pneumoniae (n = 3), Raoultella ornithinolytica (n = 3), Citrobacter freundii (n = 1) and Citrobacter braakii (n = 1). Carbapenemase genes identified in this study included bla OXA-48 , observed in 17 isolates (9 E. coli, 3 K. pneumoniae, 3 R. ornithinolytica, 1 C. freundii and 1 C. braakii), and bla OXA-244 , a variant of bla OXA-48 , was found in three E. coli isolates. MLST showed that 12 E. coli strains belonged to six different sequence types (ST559, ST38, ST212, ST3541, 1972 and ST2142), and we identified three different STs in K. pneumoniae isolates, including ST133, ST2055, and a new sequence type: ST2192. This study showed the presence of OXA-48-like-producing Enterobacteriaceae in water environments and highlighted the potential role of aquatic environments as reservoirs of clinically relevant antimicrobial-resistant bacteria, with the potential to spread throughout the community. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. Molecular analysis of varicella vaccines and varicella-zoster virus from vaccine-related skin lesions.

    PubMed

    Thiele, Sonja; Borschewski, Aljona; Küchler, Judit; Bieberbach, Marc; Voigt, Sebastian; Ehlers, Bernhard

    2011-07-01

    To prevent complications that might follow an infection with varicella-zoster virus (VZV), the live attenuated Oka strain (V-Oka) is administered to children in many developed countries. Three vaccine brands (Varivax from Sanofi Pasteur MSD; Varilrix and Priorix-Tetra, both from Glaxo-Smith-Kline) are licensed in Germany and have been associated with both different degrees of vaccine effectiveness and adverse effects. To identify genetic variants in the vaccines that might contribute to rash-associated syndromes, single nucleotide polymorphism (SNP) profiles of variants from the three vaccines and rash-associated vaccine-type VZV from German vaccinees were quantitatively compared by PCR-based pyrosequencing (PSQ). The Varivax vaccine contained an estimated 3-fold higher diversity of VZV variants, with 20% more wild-type (wt) SNPs than Varilrix and Priorix-Tetra. These minor VZV variants in the vaccines were identified by analyzing cloned full-length open reading frame (ORF) orf62 sequences by chain termination sequencing and PSQ. Some of these sequences amplified from vaccine VZV were very similar or identical to those of the rash-associated vaccine-type VZV from vaccinees and were almost exclusively detected in Varivax. Therefore, minorities of rash-associated VZV variants are present in varicella vaccine formulations, and it can be concluded that the analysis of a core set of four SNPs is required as a minimum for a firm diagnostic differentiation of vaccine-type VZV from wt VZV.

  10. A novel mutation of the MITF gene in a family with Waardenburg syndrome type 2: A case report

    PubMed Central

    SHI, YUNFANG; LI, XIAOZHOU; JU, DUAN; LI, YAN; ZHANG, XIULING; ZHANG, YING

    2016-01-01

    Waardenburg syndrome (WS) is an autosomal dominant disorder with varying degrees of sensorineural hearing loss, and accumulation of pigmentation in hair, skin and iris. There are four types of WS (WS1–4) with differing characteristics. Mutations in six genes [paired box gene 3 (PAX3), microphthalmia-associated transcription factor (MITF), endothelin 3 (END3), endothelin receptor type B (EDNRB), SRY (sex determining region Y)-box 10 (SOX10) and snail homolog 2 (SNAI2)] have been identified to be associated with the various types. This case report describes the investigation of genetic mutations in three patients with WS2 from a single family. Genomic DNA was extracted, and the six WS-related genes were sequenced using next-generation sequencing technology. In addition to mutations in PAX3, EDNRB and SOX10, a novel heterozygous MITF mutation, p.Δ315Arg (c.944_946delGAA) on exon 8 was identified. This is predicted to be a candidate disease-causing mutation that may affect the structure and function of the enzyme. PMID:27073475

  11. A novel mutation of the MITF gene in a family with Waardenburg syndrome type 2: A case report.

    PubMed

    Shi, Yunfang; Li, Xiaozhou; Ju, Duan; Li, Yan; Zhang, Xiuling; Zhang, Ying

    2016-04-01

    Waardenburg syndrome (WS) is an autosomal dominant disorder with varying degrees of sensorineural hearing loss, and accumulation of pigmentation in hair, skin and iris. There are four types of WS (WS1-4) with differing characteristics. Mutations in six genes [paired box gene 3 ( PAX3 ), microphthalmia-associated transcription factor ( MITF ), endothelin 3 ( END3 ), endothelin receptor type B ( EDNRB ), SRY (sex determining region Y)-box 10 ( SOX10 ) and snail homolog 2 ( SNAI2 )] have been identified to be associated with the various types. This case report describes the investigation of genetic mutations in three patients with WS2 from a single family. Genomic DNA was extracted, and the six WS-related genes were sequenced using next-generation sequencing technology. In addition to mutations in PAX3, EDNRB and SOX10, a novel heterozygous MITF mutation, p.Δ315Arg (c.944_946delGAA) on exon 8 was identified. This is predicted to be a candidate disease-causing mutation that may affect the structure and function of the enzyme.

  12. Clavibacter michiganensis subsp. phaseoli subsp. nov., pathogenic in bean.

    PubMed

    González, Ana J; Trapiello, Estefanía

    2014-05-01

    A yellow Gram-reaction-positive bacterium isolated from bean seeds (Phaseolus vulgaris L.) was identified as Clavibacter michiganensis by 16S rRNA gene sequencing. Molecular methods were employed in order to identify the subspecies. Such methods included the amplification of specific sequences by PCR, 16S amplified rDNA restriction analysis (ARDRA), RFLP and multilocus sequence analysis as well as the analysis of biochemical and phenotypic traits including API 50CH and API ZYM results. The results showed that strain LPPA 982T did not represent any known subspecies of C. michiganensis. Pathogenicity tests revealed that the strain is a bean pathogen causing a newly identified bacterial disease that we name bacterial bean leaf yellowing. On the basis of these results, strain LPPA 982T is regarded as representing a novel subspecies for which the name Clavibacter michiganensis subsp. phaseoli subsp. nov. is proposed. The type strain is LPPA 982T (=CECT 8144T=LMG 27667T).

  13. Exome sequencing identifies SUCO mutations in mesial temporal lobe epilepsy.

    PubMed

    Sha, Zhiqiang; Sha, Longze; Li, Wenting; Dou, Wanchen; Shen, Yan; Wu, Liwen; Xu, Qi

    2015-03-30

    Mesial temporal lobe epilepsy (mTLE) is the main type and most common medically intractable form of epilepsy. Severity of disease-based stratified samples may help identify new disease-associated mutant genes. We analyzed mRNA expression profiles from patient hippocampal tissue. Three of the seven patients had severe mTLE with generalized-onset convulsions and consciousness loss that occurred over many years. We found that compared with other groups, patients with severe mTLE were classified into a distinct group. Whole-exome sequencing and Sanger sequencing validation in all seven patients identified three novel SUN domain-containing ossification factor (SUCO) mutations in severely affected patients. Furthermore, SUCO knock down significantly reduced dendritic length in vitro. Our results indicate that mTLE defects may affect neuronal development, and suggest that neurons have abnormal development due to lack of SUCO, which may be a generalized-onset epilepsy-related gene. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  14. Multiplex PCR identification of Taenia spp. in rodents and carnivores.

    PubMed

    Al-Sabi, Mohammad N S; Kapel, Christian M O

    2011-11-01

    The genus Taenia includes several species of veterinary and public health importance, but diagnosis of the etiological agent in definitive and intermediate hosts often relies on labor intensive and few specific morphometric criteria, especially in immature worms and underdeveloped metacestodes. In the present study, a multiplex PCR, based on five primers targeting the 18S rDNA and ITS2 sequences, produced a species-specific banding patterns for a range of Taenia spp. Species typing by the multiplex PCR was compared to morphological identification and sequencing of cox1 and/or 12S rDNA genes. As compared to sequencing, the multiplex PCR identified 31 of 32 Taenia metacestodes from rodents, whereas only 14 cysts were specifically identified by morphology. Likewise, the multiplex PCR identified 108 of 130 adult worms, while only 57 were identified to species by morphology. The tested multiplex PCR system may potentially be used for studies of Taenia spp. transmitted between rodents and carnivores.

  15. Comparative evaluation of an automated repetitive-sequence-based PCR instrument versus pulsed-field gel electrophoresis in the setting of a Serratia marcescens nosocomial infection outbreak.

    PubMed

    Ligozzi, Marco; Fontana, Roberta; Aldegheri, Marco; Scalet, Giovanna; Lo Cascio, Giuliana

    2010-05-01

    A semiautomated, repetitive-sequence-based PCR (rep-PCR) instrument (DiversiLab system) was evaluated in comparison with pulsed-field gel electrophoresis (PFGE) to investigate an outbreak of Serratia marcescens infections in a neonatal intensive care unit (NICU). A selection of 36 epidemiologically related and 8 epidemiologically unrelated isolates was analyzed. Among the epidemiologically related isolates, PFGE identified five genetically unrelated patterns. Thirty-two isolates from patients and wet nurses showed the same PFGE profile (pattern A). Genetically unrelated PFGE patterns were found in one patient (pattern B), in two wet nurses (patterns C and D), and in an environmental isolate from the NICU (pattern G). Rep-PCR identified seven different patterns, three of which included the 32 isolates of PFGE type A. One or two band differences in isolates of these three types allowed isolates to be categorized as similar and included in a unique cluster. Isolates of different PFGE types were also of unrelated rep-PCR types. All of the epidemiologically unrelated isolates were of different PFGE and rep-PCR types. The level of discrimination exhibited by rep-PCR with the DiversiLab system allowed us to conclude that this method was able to identify genetic similarity in a spatio-temporal cluster of S. marcescens isolates.

  16. Analysis of the full genome of human group C rotaviruses reveals lineage diversification and reassortment.

    PubMed

    Medici, Maria Cristina; Tummolo, Fabio; Martella, Vito; Arcangeletti, Maria Cristina; De Conto, Flora; Chezzi, Carlo; Fehér, Enikő; Marton, Szilvia; Calderaro, Adriana; Bányai, Krisztián

    2016-08-01

    Group C rotaviruses (RVC) are enteric pathogens of humans and animals. Whole-genome sequences are available only for few RVCs, leaving gaps in our knowledge about their genetic diversity. We determined the full-length genome sequence of two human RVCs (PR2593/2004 and PR713/2012), detected in Italy from hospital-based surveillance for rotavirus infection in 2004 and 2012. In the 11 RNA genomic segments, the two Italian RVCs segregated within separate intra-genotypic lineages showed variation ranging from 1.9 % (VP6) to 15.9 % (VP3) at the nucleotide level. Comprehensive analysis of human RVC sequences available in the databases allowed us to reveal the existence of at least two major genome configurations, defined as type I and type II. Human RVCs of type I were all associated with the M3 VP3 genotype, including the Italian strain PR2593/2004. Conversely, human RVCs of type II were all associated with the M2 VP3 genotype, including the Italian strain PR713/2012. Reassortant RVC strains between these major genome configurations were identified. Although only a few full-genome sequences of human RVCs, mostly of Asian origin, are available, the analysis of human RVC sequences retrieved from the databases indicates that at least two intra-genotypic RVC lineages circulate in European countries. Gathering more sequence data is necessary to develop a standardized genotype and intra-genotypic lineage classification system useful for epidemiological investigations and avoiding confusion in the literature.

  17. Great expectations: patient perspectives and anticipated utility of non-diagnostic genomic-sequencing results.

    PubMed

    Hylind, Robyn; Smith, Maureen; Rasmussen-Torvik, Laura; Aufox, Sharon

    2018-01-01

    The management of secondary findings is a challenge to health-care providers relaying clinical genomic-sequencing results to patients. Understanding patients' expectations from non-diagnostic genomic sequencing could help guide this management. This study interviewed 14 individuals enrolled in the eMERGE (Electronic Medical Records and Genomics) study. Participants in eMERGE consent to undergo non-diagnostic genomic sequencing, receive results, and have results returned to their physicians. The interviews assessed expectations and intended use of results. The majority of interviewees were male (64%) and 43% identified as non-Caucasian. A unique theme identified was that many participants expressed uncertainty about the type of diseases they expected to receive results on, what results they wanted to learn about, and how they intended to use results. Participant uncertainty highlights the complex nature of deciding to undergo genomic testing and a deficiency in genomic knowledge. These results could help improve how genomic sequencing and secondary findings are discussed with patients.

  18. Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA.

    PubMed

    Xu, Weijia; Ozer, Stuart; Gutell, Robin R

    2009-01-01

    With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure.

  19. Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA

    PubMed Central

    Xu, Weijia; Ozer, Stuart; Gutell, Robin R.

    2010-01-01

    With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure. PMID:20502534

  20. Whole genome sequencing of Oryza sativa L. cv. Seeragasamba identifies a new fragrance allele in rice

    PubMed Central

    Bindusree, Ganigara; Natarajan, Purushothaman; Kalva, Sukesh

    2017-01-01

    Fragrance of rice is an important trait that confers a large economic benefit to the farmers who cultivate aromatic rice varieties. Several aromatic rice varieties have limited geographic distribution, and are endowed with variety-specific unique fragrances. BADH2 was identified as a fragrance gene in 2005, and it is essential to identify the fragrance alleles from diverse geographical locations and genetic backgrounds. Seeragasamba is a short-grain aromatic rice variety of the indica type, which is cultivated in a limited area in India. Whole genome sequencing of this variety identified a new badh2 allele (badh2-p) with an 8 bp insertion in the promoter region of the BADH2 gene. When the whole genome sequences of 76 aromatic varieties in the 3000 rice genome project were analyzed, the badh2-p allele was present in 13 varieties (approximately 17%) of both indica and japonica types. In addition, the badh2-p allele was present in 17 varieties that already had the loss-of-function allele, badh2-E7. Taken together, the frequency of badh2-p allele (approximately 40%) was found to be greater than that of the badh2-E7 allele (approximately 34%) among the aromatic rice varieties. Therefore, it is suggested to include badh2-p as a predominant allele when screening for fragrance alleles in aromatic rice varieties. PMID:29190814

  1. Candida mesorugosa sp. nov., a novel yeast species similar to Candida rugosa, isolated from a tertiary hospital in Brazil.

    PubMed

    Chaves, Guilherme M; Terçarioli, Gisela R; Padovan, Ana Carolina B; Rosas, Robert C; Ferreira, Renata C; Melo, Analy S A; Colombo, Arnaldo L

    2013-04-01

    Candida rugosa is a yeast species that is emerging as a causative agent of invasive infection, particularly in Latin America. Recently, C. pseudorugosa was proposed as a new species closely related to C. rugosa. We evaluated in this investigation the genetic heterogeneity within the C. rugosa species complex. All clinical isolates used in this study were identified phenotypically as C. rugosa but were genotypically different from the C. rugosa type, ATCC 10571. RAPD marker analysis revealed less than 83% similarity between our clinical isolates and the C. rugosa type strain. The D1/D2 region sequences of our clinical isolates showed 98% identity with C. rugosa but only 94-95% identity with C. pseudorugosa. The ITS rDNA sequences of the Brazilian isolates showed 91% identity with the C. rugosa ATCC 10571 ITS sequence. Network and Bayesian analyses of ITS and housekeeping gene sequences separated our clinical isolates into different branches from C. rugosa type strain. These differences are sufficient to reassign our isolates to a distinct species, named C. mesorugosa.

  2. Understanding the molecular epidemiology and global relationships of Brachyspira hyodysenteriae from swine herds in the United States: a multi-locus sequence typing approach.

    PubMed

    Mirajkar, Nandita S; Gebhart, Connie J

    2014-01-01

    Outbreaks of mucohemorrhagic diarrhea in pigs caused by Brachyspira hyodysenteriae in the late 2000s indicated the re-emergence of Swine Dysentery (SD) in the U.S. Although the clinical disease was absent in the U.S. since the early 1990s, it continued to cause significant economic losses to other swine rearing countries worldwide. This study aims to fill the gap in knowledge pertaining to the re-emergence and epidemiology of B. hyodysenteriae in the U.S. and its global relationships using a multi-locus sequence typing (MLST) approach. Fifty-nine post re-emergent isolates originating from a variety of sources in the U.S. were characterized by MLST, analyzed for epidemiological relationships (within and between multiple sites of swine systems), and were compared with pre re-emergent isolates from the U.S. Information for an additional 272 global isolates from the MLST database was utilized for international comparisons. Thirteen nucleotide sequence types (STs) including a predominant genotype (ST93) were identified in the post re-emergent U.S. isolates; some of which showed genetic similarity to the pre re-emergent STs thereby suggesting its likely role in the re-emergence of SD. In the U.S., in general, no more than one ST was found on a site; multiple sites of a common system shared a ST; and STs found in the U.S. were distinct from those identified globally. Of the 110 STs characterized from ten countries, only two were found in more than one country. The U.S. and global populations, identified as clonal and heterogeneous based on STs, showed close relatedness based on amino acid types (AATs). One predicted founder type (AAT9) and multiple predicted subgroup founder types identified for both the U.S. and the global population indicate the potential microevolution of this pathogen. This study elucidates the strain diversity and microevolution of B. hyodysenteriae, and highlights the utility of MLST for epidemiological and surveillance studies.

  3. Identification of a 'Candidatus Phytoplasma hispanicum'-related strain, associated with yellows-type diseases, in smoke-tree sharpshooter (Homalodisca liturata Ball).

    PubMed

    Servín-Villegas, Rosalía; Caamal-Chan, Maria Goretty; Chavez-Medina, Alicia; Loera-Muro, Abraham; Barraza, Aarón; Medina-Hernández, Diana; Holguín-Peña, Ramón Jaime

    2018-04-11

    The 16SrXIII group from phytoplasma bacteria were identified in salivary glands from Homalodisca liturata, which were collected in El Comitán on the Baja California peninsula in Mexico. We were able to positively identify 15 16S rRNA gene sequences with the corresponding signature sequence of 'CandidatusPhytoplasma' (CAAGAYBATKATGTKTAGCYGGDCT) and in silico restriction fragment length polymorphism (RFLP) profiles (F value estimations) coupled with a phylogenetic analysis to confirm their relatedness to 'CandidatusPhytoplasma hispanicum', which in turn belongs to the 16SrXIII group. A restriction analysis was carried out with AluI and EcoRI to confirm that the five sequences belongs to subgroup D. The rest of the sequences did not exhibit any known RFLP profile related to a subgroup reported in the 16SrXIII group.

  4. Mapping Base Modifications in DNA by Transverse-Current Sequencing

    NASA Astrophysics Data System (ADS)

    Alvarez, Jose R.; Skachkov, Dmitry; Massey, Steven E.; Kalitsov, Alan; Velev, Julian P.

    2018-02-01

    Sequencing DNA modifications and lesions, such as methylation of cytosine and oxidation of guanine, is even more important and challenging than sequencing the genome itself. The traditional methods for detecting DNA modifications are either insensitive to these modifications or require additional processing steps to identify a particular type of modification. Transverse-current sequencing in nanopores can potentially identify the canonical bases and base modifications in the same run. In this work, we demonstrate that the most common DNA epigenetic modifications and lesions can be detected with any predefined accuracy based on their tunneling current signature. Our results are based on simulations of the nanopore tunneling current through DNA molecules, calculated using nonequilibrium electron-transport methodology within an effective multiorbital model derived from first-principles calculations, followed by a base-calling algorithm accounting for neighbor current-current correlations. This methodology can be integrated with existing experimental techniques to improve base-calling fidelity.

  5. Targeted exome sequencing identifies novel compound heterozygous mutations in P3H1 in a fetus with osteogenesis imperfecta type VIII.

    PubMed

    Huang, Yanru; Mei, Libin; Lv, Weigang; Li, Haoxian; Zhang, Rui; Pan, Qian; Tan, Hu; Guo, Jing; Luo, Xiaomei; Chen, Chen; Liang, Desheng; Wu, Lingqian

    2017-01-01

    Osteogenesis imperfecta (OI) is a highly clinically and genetically heterogeneous group of disorders. It is difficult to identify severe OI in the perinatal period. Here, a Chinese woman with a suspected history of fetal OI was referred to our institution at 19weeks of gestation, due to ultrasound inspection during antenatal screening, which revealed bulbous metaphyses, short humeri, and short thick bent femora in the fetus. Using targeted exome sequencing of 248 genes known to be involved in skeletal system diseases, we identified novel compound heterozygous mutation in the P3H1 gene in the fetus with OI type VIII: c.105_120del (p.D36Rfs*16) and c.2164C>T (p.Q722*). These two mutations were inherited from the father and mother, respectively. The mRNA level of P3H1 wasn't changed suggested that mRNA with this mutation escaped from nonsense-mediated RNA decay. Besides, the level of P3H1 was absence while the CRTAP was mildly decreased. In conclusion, our findings imply this novel compound heterozygous mutation as the molecular pathogenetic in a Chinese fetus with OI type VIII, and demonstrate that targeted next-generation sequencing (NGS) is an accurate, rapid, and cost-effective method in the genetic diagnosis of fetal skeletal dysplasia with genetic and clinical heterogeneity, especially for autosomal recessive skeletal disorders. Copyright © 2016 Elsevier B.V. All rights reserved.

  6. Dissemination of blaNDM-5 gene via an IncX3-type plasmid among non-clonal Escherichia coli in China.

    PubMed

    Li, Xi; Fu, Ying; Shen, Mengyuan; Huang, Danyan; Du, Xiaoxing; Hu, Qingfeng; Zhou, Yonglie; Wang, Dairong; Yu, Yunsong

    2018-01-01

    The emergence and spread of New Delhi metallo-β-lactamase-producing Enterobacteriaceae has been a serious challenge to manage in the clinic due to its rapid dissemination of multi-drug resistance worldwide. As one main type of carbapenemases, New Delhi metallo-β-lactamase (NDM)is able to confer resistance to almost all β-lactams, including carbapenems, in Enterobacteriaceae . Recently, New Delhi metallo-β-lactamase-5 attracted extensive attention because of increased resistance to carbapenems and widespread dissemination. However, the dissemination mechanism of bla NDM-5 gene remains unclear. A total of 224 carbapenem-resistant Enterobacteriaceae isolates (CRE) were collected from different hospitals in Zhejiang province. NDM-5-positive isolates were identified and subjected to genotyping, susceptibility testing, and clinical data analysis. We established the genetic location of bla NDM-5 with southern blot hybridisation, and analysed plasmids containing bla NDM-5 with filter mating and DNA sequencing. Eleven New Delhi metallo-β-lactamase-5 (NDM-5)-producing strains were identified, including 9 Escherichia coli strains, 1 Klebsiella pneumoniae strain, and 1 Citrobacter freundii strain. No epidemiological links for E. coli isolates were identified by multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE). S1-PFGE and southern blot suggested that the bla NDM-5 gene was located on a 46-kb IncX3-type plasmid in all isolates. Nine of the 11 isolates (81.8%) tested could successfully transfer their carbapenem-resistant phenotype to E. coli strain C600. Moreover, sequence analysis further showed that this plasmid possessed high sequence similarity to most of previously reported bla NDM-5 -habouring plasmids in China. The present data in this study showed the IncX3 type plasmid played an important role in the dissemination of bla NDM-5 in Enterobacteriaceae . In addition, to the best of our knowledge, this report is the first to isolate both E. coli and C. freundii strains carrying bla NDM-5 from one single patient, which further indicated the possibility of bla NDM-5 transmission among diverse species. Close surveillance is urgently needed to monitor the further dissemination of NDM-5-producing isolates.

  7. Population genomic data reveal genes related to important traits of quail.

    PubMed

    Wu, Yan; Zhang, Yaolei; Hou, Zhuocheng; Fan, Guangyi; Pi, Jinsong; Sun, Shuai; Chen, Jiang; Liu, Huaqiao; Du, Xiao; Shen, Jie; Hu, Gang; Chen, Wenbin; Pan, Ailuan; Yin, Pingping; Chen, Xiaoli; Pu, Yuejin; Zhang, He; Liang, Zhenhua; Jian, Jianbo; Zhang, Hao; Wu, Bin; Sun, Jing; Chen, Jianwei; Tao, Hu; Yang, Ting; Xiao, Hongwei; Yang, Huan; Zheng, Chuanwei; Bai, Mingzhou; Fang, Xiaodong; Burt, David W; Wang, Wen; Li, Qingyi; Xu, Xun; Li, Chengfeng; Yang, Huanming; Wang, Jian; Yang, Ning; Liu, Xin; Du, Jinping

    2018-05-01

    Japanese quail (Coturnix japonica), a recently domesticated poultry species, is important not only as an agricultural product, but also as a model bird species for genetic research. However, most of the biological questions concerning genomics, phylogenetics, and genetics of some important economic traits have not been answered. It is thus necessary to complete a high-quality genome sequence as well as a series of comparative genomics, evolution, and functional studies. Here, we present a quail genome assembly spanning 1.04 Gb with 86.63% of sequences anchored to 30 chromosomes (28 autosomes and 2 sex chromosomes Z/W). Our genomic data have resolved the long-term debate of phylogeny among Perdicinae (Japanese quail), Meleagridinae (turkey), and Phasianinae (chicken). Comparative genomics and functional genomic data found that four candidate genes involved in early maturation had experienced positive selection, and one of them encodes follicle stimulating hormone beta (FSHβ), which is correlated with different FSHβ levels in quail and chicken. We re-sequenced 31 quails (10 wild, 11 egg-type, and 10 meat-type) and identified 18 and 26 candidate selective sweep regions in the egg-type and meat-type lines, respectively. That only one of them is shared between egg-type and meat-type lines suggests that they were subject to an independent selection. We also detected a haplotype on chromosome Z, which was closely linked with maroon/yellow plumage in quail using population resequencing and a genome-wide association study. This haplotype block will be useful for quail breeding programs. This study provided a high-quality quail reference genome, identified quail-specific genes, and resolved quail phylogeny. We have identified genes related to quail early maturation and a marker for plumage color, which is significant for quail breeding. These results will facilitate biological discovery in quails and help us elucidate the evolutionary processes within the Phasianidae family.

  8. Isolation of Canine parvovirus with a view to identify the prevalent serotype on the basis of partial sequence analysis.

    PubMed

    Kaur, Gurpreet; Chandra, Mudit; Dwivedi, P N; Sharma, N S

    2015-01-01

    The aim of this study was to isolate Canine parvovirus (CPV) from suspected dogs on madin darby canine kidney (MDCK) cell line and its confirmation by polymerase chain reaction (PCR) and nested PCR (NPCR). Further, VP2 gene of the CPV isolates was amplified and sequenced to determine prevailing antigenic type. A total of 60 rectal swabs were collected from dogs showing signs of gastroenteritis, processed and subjected to isolation in MDCK cell line. The samples showing cytopathic effects (CPE) were confirmed by PCR and NPCR. These samples were subjected to PCR for amplification of VP2 gene of CPV, sequenced and analyzed to study the prevailing antigenic types of CPV. Out of the 60 samples subjected to isolation in MDCK cell line five samples showed CPE in the form of rounding of cells, clumping of cells and finally detachment of the cells. When these samples and the two commercially available vaccines were subjected to PCR for amplification of VP2 gene, a 1710 bp product was amplified. The sequence analysis revealed that the vaccines belonged to the CPV-2 type and the samples were of CPV-2b type. It can be concluded from the present study that out of a total of 60 samples 5 samples exhibited CPE as observed in MDCK cell line. Sequence analysis of the VP2 gene among the samples and vaccine strains revealed that samples belonged to CPV-2b type and vaccines belonging to CPV-2.

  9. Membership and Coronal Activity in the NGC 2232 and Cr 140 Open Clusters

    NASA Technical Reports Server (NTRS)

    Oliversen, Ronald J. (Technical Monitor); Patten, Brian M.

    2004-01-01

    Making use of eight archival ROSAT HRI images in the regions of the NGC 2232 and Cr 140, this project's primary focus is to identify X-ray sources and to extract net source counts for these sources in these two open clusters. These X-ray data would be combined with ground-based photometry and spectroscopy in order to identify G, K, and early-M type cluster members. Such membership data are important because, at present, no members later than spectral type approx. F5 are currently known for either cluster. With ages estimated to be approx. 25 Myr and at distances of just approx. 350 pc, the combined late-type membership of the NGC 2232 and Cr 140 clusters would yield an almost unique sample of solar-type stars in the post-T Tauri/pre-main sequence phase of evolution. These stars could be used to assess the level and dispersion of coronal activity levels, as a part of a probe of the importance of magnetic braking and the level of magnetic dynamo activity, for solar-type stars just before they reach the zero-age main sequence.

  10. A de novo mutation in the AGXT gene causing primary hyperoxaluria type 1.

    PubMed

    Williams, Emma L; Kemper, Markus J; Rumsby, Gill

    2006-09-01

    Primary hyperoxaluria type 1 is caused by mutations in the alanine-glyoxylate aminotransferase (AGXT) gene. In cases in which no mutation was identified, linkage analysis can be used to confirm or exclude the diagnosis in other siblings. We present a family in which a sibling of the index case predicted to have primary hyperoxaluria type 1 by means of linkage analysis failed to show hyperoxaluria during the following 7 years, putting the diagnosis into question. Whole-gene sequence analysis identified 2 causative mutations in the index case, of which only 1, c.646A (Gly216Arg), was inherited. The other sequence change, c.33_34insC, was a de novo mutation occurring on the paternal allele. This particular mutation is a relatively common cause of primary hyperoxaluria type 1. It occurs in a run of 8 cytosines and therefore potentially is susceptible to polymerase slippage. This case illustrates 2 important points. First, biochemical confirmation of a genetic diagnosis should always be made in siblings diagnosed by using genetic tests. Second, de novo mutations should be considered as a potential, albeit rare, cause of primary hyperoxaluria type 1.

  11. Genetic Mapping of Glutaric Aciduria, Type 3, to Chromosome 7 and Identification of Mutations in C7orf10

    PubMed Central

    Sherman, Eric A.; Strauss, Kevin A.; Tortorelli, Silvia; Bennett, Michael J.; Knerr, Ina; Morton, D. Holmes; Puffenberger, Erik G.

    2008-01-01

    While screening Old Order Amish children for glutaric aciduria type 1 (GA1) between 1989 and 1993, we found three healthy children who excreted abnormal quantities of glutaric acid but low 3-hydroxyglutaric acid, a pattern consistent with glutaric aciduria type 3 (GA3). None of these children had the GCDH c.1262C→T mutation that causes GA1 among the Amish. Using single-nucleotide polymorphism (SNP) genotypes, we identified a shared homozygous 4.7 Mb region on chromosome 7. This region contained 25 genes including C7orf10, an open reading frame with a putative mitochondrial targeting sequence and coenzyme-A transferase domain. Direct sequencing of C7orf10 revealed that the three Amish individuals were homozygous for a nonsynonymous sequence variant (c.895C→T, Arg299Trp). We then sequenced three non-Amish children with GA3 and discovered two nonsense mutations (c.322C→T, Arg108Ter, and c.424C→T, Arg142Ter) in addition to the Amish mutation. Two pathogenic alleles were identified in each of the six patients. There was no consistent clinical phenotype associated with GA3. In affected individuals, urine molar ratios of glutarate to its derivatives (3-hydroxyglutarate, glutarylcarnitine, and glutarylglycine) were elevated, suggesting impaired formation of glutaryl-CoA. These observations refine our understanding of the lysine-tryptophan degradation pathway and have important implications for the pathophysiology of GA1. PMID:18926513

  12. Characterization of six mutations in Exon 37 of neurofibromatosis type 1 gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Upadhyaya, M.; Osborn, M.; Maynard, J.

    Neurofibromatosis type 1 (NF1) is one of the most common inherited disorders, with an incidence of 1 in 3,000. We screened a total of 320 unrelated NF1 patients for mutations in exon 37 of the NF1 gene. Six independent mutations were identified, of which three are novel, and these include a recurrent nonsense mutation identified in 2 unrelated patients at codon 2281 (G2281X), a 1-bp insertion (6791 ins A) resulting in a change of TAG (tyrosine) to a TAA (stop codon), and a 3-bp deletion (6839 del TAC) which generated a frameshift. Another recurrent nonsense mutation, Y2264X, which was detectedmore » in 2 unrelated patients in this study, was also previously reported in 2 NF1 individuals. All the mutations were identified within a contiguous 49-bp sequence. Further studies are warranted to support the notion that this region of the gene contains highly mutable sequences. 17 refs., 2 figs., 1 tab.« less

  13. Exome sequencing for prenatal diagnosis of fetuses with sonographic abnormalities.

    PubMed

    Drury, Suzanne; Williams, Hywel; Trump, Natalie; Boustred, Christopher; Lench, Nicholas; Scott, Richard H; Chitty, Lyn S

    2015-10-01

    In the absence of aneuploidy or other pathogenic cytogenetic abnormality, fetuses with increased nuchal translucency (NT ≥ 3.5 mm) and/or other sonographic abnormalities have a greater incidence of genetic syndromes, but defining the underlying pathology can be challenging. Here, we investigate the value of whole exome sequencing in fetuses with sonographic abnormalities but normal microarray analysis. Whole exome sequencing was performed on DNA extracted from chorionic villi or amniocytes in 24 fetuses with unexplained ultrasound findings. In the first 14 cases sequencing was initially performed on fetal DNA only. For the remaining 10, the trio of fetus, mother and father was sequenced simultaneously. In 21% (5/24) cases, exome sequencing provided definitive diagnoses (Milroy disease, hypophosphatasia, achondrogenesis type 2, Freeman-Sheldon syndrome and Baraitser-Winter Syndrome). In a further case, a plausible diagnosis of orofaciodigital syndrome type 6 was made. In two others, a single mutation in an autosomal recessive gene was identified, but incomplete sequencing coverage precluded exclusion of the presence of a second mutation. Whole exome sequencing improves prenatal diagnosis in euploid fetuses with abnormal ultrasound scans. In order to expedite interpretation of results, trio sequencing should be employed, but interpretation can still be compromised by incomplete coverage of relevant genes. © 2015 John Wiley & Sons, Ltd.

  14. A Complete Developmental Sequence of a Drosophila Neuronal Lineage as Revealed by Twin-Spot MARCM

    PubMed Central

    He, Yisheng; Ding, Peng; Kao, Jui-Chun; Lee, Tzumin

    2010-01-01

    Drosophila brains contain numerous neurons that form complex circuits. These neurons are derived in stereotyped patterns from a fixed number of progenitors, called neuroblasts, and identifying individual neurons made by a neuroblast facilitates the reconstruction of neural circuits. An improved MARCM (mosaic analysis with a repressible cell marker) technique, called twin-spot MARCM, allows one to label the sister clones derived from a common progenitor simultaneously in different colors. It enables identification of every single neuron in an extended neuronal lineage based on the order of neuron birth. Here we report the first example, to our knowledge, of complete lineage analysis among neurons derived from a common neuroblast that relay olfactory information from the antennal lobe (AL) to higher brain centers. By identifying the sequentially derived neurons, we found that the neuroblast serially makes 40 types of AL projection neurons (PNs). During embryogenesis, one PN with multi-glomerular innervation and 18 uniglomerular PNs targeting 17 glomeruli of the adult AL are born. Many more PNs of 22 additional types, including four types of polyglomerular PNs, derive after the neuroblast resumes dividing in early larvae. Although different offspring are generated in a rather arbitrary sequence, the birth order strictly dictates the fate of each post-mitotic neuron, including the fate of programmed cell death. Notably, the embryonic progenitor has an altered temporal identity following each self-renewing asymmetric cell division. After larval hatching, the same progenitor produces multiple neurons for each cell type, but the number of neurons for each type is tightly regulated. These observations substantiate the origin-dependent specification of neuron types. Sequencing neuronal lineages will not only unravel how a complex brain develops but also permit systematic identification of neuron types for detailed structure and function analysis of the brain. PMID:20808769

  15. Characterization of nasopharyngeal isolates of type b Haemophilus influenzae from Delhi

    PubMed Central

    Saikia, Kandarpa K.; Das, Bimal K.; Bewal, Ramesh K.; Kapil, Arti; Arora, N.K.; Sood, Seema

    2012-01-01

    Background & objectives: Haemophilus influenzae is an important cause of mortality and morbidity among young children in developing countries. Increasing incidence of antibiotic resistance especially production of extended spectrum beta lactamase (ESBL) has made treatment and management of H. influenzae infection more difficult. Nasopharyngeal H. influenzae isolates are excellent surrogate for determination of antibiotic resistance prevalent among invasive H. influenzae isolates. In this study, we characterized nasopharyngeal H. influenzae isolates obtained from healthy school going children in Delhi. Methods: Nasopharyngeal H. influenzae isolates were collected from healthy school going children and subjected to serotyping, fimbrial typing and antibiogram profiling. ESBL production was recorded using phenotypic as well as molecular methods. Multi locus sequence typing (MLST) of 13 representative nasopharyngeal H. influenzae isolates was performed as per guidelines. Results: A significant proportion (26 of 80, 32.5%) of nasopharyngeal isolates of H. influenzae were identified as serotype b. Fimbrial gene (hifA) was detected in 23 (28.75%) isolates. Resistance against commonly prescribed antibiotics (Amp, Tet, Chloro, Septran, Cephalexin) were observed to be high among the nasopharyngeal commensal H. influenzae. Extended spectrum beta lactamase (ESBL) production was observed in a five (6.25%) isolates by both double disk diffusion and molecular typing. MLST identified several novel alleles as well as novel sequence types. Interpretation & conclusions: Our findings showed high resistance against common antibiotics and detection of ESBL in nasopharyngeal H. influenzae isolates collected from normal healthy school going children in Delhi. Detection of H. influenzae type b capsular gene and the presence of fimbrial gene (hif A) suggest virulence potential of these isolates. Discovery of novel alleles and presence of new sequence types (STs) among nasopharyngeal H. influenzae isolates may suggest wider genetic diversity. PMID:23287135

  16. The Human Microbiome and Understanding the 16S rRNA Gene in Translational Nursing Science.

    PubMed

    Ames, Nancy J; Ranucci, Alexandra; Moriyama, Brad; Wallen, Gwenyth R

    As more is understood regarding the human microbiome, it is increasingly important for nurse scientists and healthcare practitioners to analyze these microbial communities and their role in health and disease. 16S rRNA sequencing is a key methodology in identifying these bacterial populations that has recently transitioned from use primarily in research to having increased utility in clinical settings. The objectives of this review are to (a) describe 16S rRNA sequencing and its role in answering research questions important to nursing science; (b) provide an overview of the oral, lung, and gut microbiomes and relevant research; and (c) identify future implications for microbiome research and 16S sequencing in translational nursing science. Sequencing using the 16S rRNA gene has revolutionized research and allowed scientists to easily and reliably characterize complex bacterial communities. This type of research has recently entered the clinical setting, one of the best examples involving the use of 16S sequencing to identify resistant pathogens, thereby improving the accuracy of bacterial identification in infection control. Clinical microbiota research and related requisite methods are of particular relevance to nurse scientists-individuals uniquely positioned to utilize these techniques in future studies in clinical settings.

  17. Longitudinal Metagenomic Analysis of Hospital Air Identifies Clinically Relevant Microbes.

    PubMed

    King, Paula; Pham, Long K; Waltz, Shannon; Sphar, Dan; Yamamoto, Robert T; Conrad, Douglas; Taplitz, Randy; Torriani, Francesca; Forsyth, R Allyn

    2016-01-01

    We describe the sampling of sixty-three uncultured hospital air samples collected over a six-month period and analysis using shotgun metagenomic sequencing. Our primary goals were to determine the longitudinal metagenomic variability of this environment, identify and characterize genomes of potential pathogens and determine whether they are atypical to the hospital airborne metagenome. Air samples were collected from eight locations which included patient wards, the main lobby and outside. The resulting DNA libraries produced 972 million sequences representing 51 gigabases. Hierarchical clustering of samples by the most abundant 50 microbial orders generated three major nodes which primarily clustered by type of location. Because the indoor locations were longitudinally consistent, episodic relative increases in microbial genomic signatures related to the opportunistic pathogens Aspergillus, Penicillium and Stenotrophomonas were identified as outliers at specific locations. Further analysis of microbial reads specific for Stenotrophomonas maltophilia indicated homology to a sequenced multi-drug resistant clinical strain and we observed broad sequence coverage of resistance genes. We demonstrate that a shotgun metagenomic sequencing approach can be used to characterize the resistance determinants of pathogen genomes that are uncharacteristic for an otherwise consistent hospital air microbial metagenomic profile.

  18. Prevalence of the F-type lectin domain.

    PubMed

    Bishnoi, Ritika; Khatri, Indu; Subramanian, Srikrishna; Ramya, T N C

    2015-08-01

    F-type lectins are fucolectins with characteristic fucose and calcium-binding sequence motifs and a unique lectin fold (the "F-type" fold). F-type lectins are phylogenetically widespread with selective distribution. Several eukaryotic F-type lectins have been biochemically and structurally characterized, and the F-type lectin domain (FLD) has also been studied in the bacterial proteins, Streptococcus mitis lectinolysin and Streptococcus pneumoniae SP2159. However, there is little knowledge about the extent of occurrence of FLDs and their domain organization, especially, in bacteria. We have now mined the extensive genomic sequence information available in the public databases with sensitive sequence search techniques in order to exhaustively survey prokaryotic and eukaryotic FLDs. We report 437 FLD sequence clusters (clustered at 80% sequence identity) from eukaryotic, eubacterial and viral proteins. Domain architectures are diverse but mostly conserved in closely related organisms, and domain organizations of bacterial FLD-containing proteins are very different from their eukaryotic counterparts, suggesting unique specialization of FLDs to suit different requirements. Several atypical phylogenetic associations hint at lateral transfer. Among eukaryotes, we observe an expansion of FLDs in terms of occurrence and domain organization diversity in the taxa Mollusca, Hemichordata and Branchiostomi, perhaps coinciding with greater emphasis on innate immune strategies in these organisms. The naturally occurring FLDs with diverse domain organizations that we have identified here will be useful for future studies aimed at creating designer molecular platforms for directing desired biological activities to fucosylated glycoconjugates in target niches. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  19. Evaluation of microbial community in hydrothermal field by direct DNA sequencing

    NASA Astrophysics Data System (ADS)

    Kawarabayasi, Y.; Maruyama, A.

    2002-12-01

    Many extremophiles have been discovered from terrestrial and marine hydrothermal fields. Some thermophiles can grow beyond 90°C in culture, while direct microscopic analysis occasionally indicates that microbes may survive in much hotter hydrothermal fluids. However, it is very difficult to isolate and cultivate such microbes from the environments, i.e., over 99% of total microbes remains undiscovered. Based on experiences of entire microbial genome analysis (Y.K.) and microbial community analysis (A.M.), we started to find out unique microbes/genes in hydrothermal fields through direct sequencing of environmental DNA fragments. At first, shotgun plasmid libraries were directly constructed with the DNA molecules prepared from mixed microbes collected by an in situ filtration system from low-temperature fluids at RM24 in the Southern East Pacific Rise (S-EPR). A gene amplification (PCR) technique was not used for preventing mutation in the process. The nucleotide sequences of 285 clones indicated that no sequence had identical data in public databases. Among 27 clones determined entire sequences, no ORF was identified on 14 clones like intron in Eukaryote. On four clones, tetra-nucleotide-long multiple tandem repetitive sequences were identified. This type of sequence was identified in some familiar disease in human. The result indicates that living/dead materials with eukaryotic features may exist in this low temperature field. Secondly, shotgun plasmid libraries were constructed from the environmental DNA prepared from Beppu hot springs. In randomly-selected 143 clones used for sequencing, no known sequence was identified. Unlike the clones in S-EPR library, clear ORFs were identified on all nine clones determined the entire sequence. It was found that one clone, H4052, contained the complete Aspartyl-tRNA synthetase. Phylogenetic analysis using amino acid sequences of this gene indicated that this gene was separated from other Euryarchaea before the differentiation of species. Thus, some novel archaeal species are expected to be in this field. The present direct cloning and sequencing technique is now opening a window to the new world in hydrothermal microbial community analysis.

  20. High-resolution typing of Chlamydia trachomatis: epidemiological and clinical uses.

    PubMed

    de Vries, Henry J C; Schim van der Loeff, Maarten F; Bruisten, Sylvia M

    2015-02-01

    A state-of-the-art overview of molecular Chlamydia trachomatis typing methods that are used for routine diagnostics and scientific studies. Molecular epidemiology uses high-resolution typing techniques such as multilocus sequence typing, multilocus variable number of tandem repeats analysis, and whole-genome sequencing to identify strains based on their DNA sequence. These data can be used for cluster, network and phylogenetic analyses, and are used to unveil transmission networks, risk groups, and evolutionary pathways. High-resolution typing of C. trachomatis strains is applied to monitor treatment efficacy and re-infections, and to study the recent emergence of lymphogranuloma venereum (LGV) amongst men who have sex with men in high-income countries. Chlamydia strain typing has clinical relevance in disease management, as LGV needs longer treatment than non-LGV C. trachomatis. It has also led to the discovery of a new variant Chlamydia strain in Sweden, which was not detected by some commercial C. trachomatis diagnostic platforms. After a brief history and comparison of the various Chlamydia typing methods, the applications of the current techniques are described and future endeavors to extend scientific understanding are formulated. High-resolution typing will likely help to further unravel the pathophysiological mechanisms behind the wide clinical spectrum of chlamydial disease.

  1. Solution structure of the chick TGFbeta type II receptor ligand-binding domain.

    PubMed

    Marlow, Michael S; Brown, Christopher B; Barnett, Joey V; Krezel, Andrzej M

    2003-02-28

    The transforming growth factor beta (TGFbeta) signaling pathway influences cell proliferation, immune responses, and extracellular matrix reorganization throughout the vertebrate life cycle. The signaling cascade is initiated by ligand-binding to its cognate type II receptor. Here, we present the structure of the chick type II TGFbeta receptor determined by solution NMR methods. Distance and angular constraints were derived from 15N and 13C edited NMR experiments. Torsion angle dynamics was used throughout the structure calculations and refinement. The 20 final structures were energy minimized using the generalized Born solvent model. For these 20 structures, the average backbone root-mean-square distance from the average structure is below 0.6A. The overall fold of this 109-residue domain is conserved within the superfamily of these receptors. Chick receptors fully recognize and respond to human TGFbeta ligands despite only 60% identity at the sequence level. Comparison with the human TGFbeta receptor determined by X-ray crystallography reveals different conformations in several regions. Sequence divergence and crystal packing interactions under low pH conditions are likely causes. This solution structure identifies regions were structural changes, however subtle, may occur upon ligand-binding. We also identified two very well conserved molecular surfaces. One was found to bind ligand in the crystallized human TGFbeta3:TGFbeta type II receptor complex. The other, newly identified area can be the interaction site with type I and/or type III receptors of the TGFbeta signaling complex.

  2. Transcriptogenomics identification and characterization of RNA editing sites in human primary monocytes using high-depth next generation sequencing data.

    PubMed

    Leong, Wai-Mun; Ripen, Adiratna Mat; Mirsafian, Hoda; Mohamad, Saharuddin Bin; Merican, Amir Feisal

    2018-06-07

    High-depth next generation sequencing data provide valuable insights into the number and distribution of RNA editing events. Here, we report the RNA editing events at cellular level of human primary monocyte using high-depth whole genomic and transcriptomic sequencing data. We identified over a ten thousand putative RNA editing sites and 69% of the sites were A-to-I editing sites. The sites enriched in repetitive sequences and intronic regions. High-depth sequencing datasets revealed that 90% of the canonical sites were edited at lower frequencies (<0.7). Single and multiple human monocytes and brain tissues samples were analyzed through genome sequence independent approach. The later approach was observed to identify more editing sites. Monocytes was observed to contain more C-to-U editing sites compared to brain tissues. Our results establish comparable pipeline that can address current limitations as well as demonstrate the potential for highly sensitive detection of RNA editing events in single cell type. Copyright © 2018 Elsevier Inc. All rights reserved.

  3. Natural and Unanticipated Modifiers of RNAi Activity in Caenorhabditis elegans

    PubMed Central

    Asad, Nadeem; Aw, Wen Yih; Timmons, Lisa

    2012-01-01

    Organisms used as model genomics systems are maintained as isogenic strains, yet evidence of sequence differences between independently maintained wild-type stocks has been substantiated by whole-genome resequencing data and strain-specific phenotypes. Sequence differences may arise from replication errors, transposon mobilization, meiotic gene conversion, or environmental or chemical assault on the genome. Low frequency alleles or mutations with modest effects on phenotypes can contribute to natural variation, and it has proven possible for such sequences to become fixed by adapted evolutionary enrichment and identified by resequencing. Our objective was to identify and analyze single locus genetic defects leading to RNAi resistance in isogenic strains of Caenorhabditis elegans. In so doing, we uncovered a mutation that arose de novo in an existing strain, which initially frustrated our phenotypic analysis. We also report experimental, environmental, and genetic conditions that can complicate phenotypic analysis of RNAi pathway defects. These observations highlight the potential for unanticipated mutations, coupled with genetic and environmental phenomena, to enhance or suppress the effects of known mutations and cause variation between wild-type strains. PMID:23209671

  4. Human germline and pan-cancer variomes and their distinct functional profiles

    PubMed Central

    Pan, Yang; Karagiannis, Konstantinos; Zhang, Haichen; Dingerdissen, Hayley; Shamsaddini, Amirhossein; Wan, Quan; Simonyan, Vahan; Mazumder, Raja

    2014-01-01

    Identification of non-synonymous single nucleotide variations (nsSNVs) has exponentially increased due to advances in Next-Generation Sequencing technologies. The functional impacts of these variations have been difficult to ascertain because the corresponding knowledge about sequence functional sites is quite fragmented. It is clear that mapping of variations to sequence functional features can help us better understand the pathophysiological role of variations. In this study, we investigated the effect of nsSNVs on more than 17 common types of post-translational modification (PTM) sites, active sites and binding sites. Out of 1 705 285 distinct nsSNVs on 259 216 functional sites we identified 38 549 variations that significantly affect 10 major functional sites. Furthermore, we found distinct patterns of site disruptions due to germline and somatic nsSNVs. Pan-cancer analysis across 12 different cancer types led to the identification of 51 genes with 106 nsSNV affected functional sites found in 3 or more cancer types. 13 of the 51 genes overlap with previously identified Significantly Mutated Genes (Nature. 2013 Oct 17;502(7471)). 62 mutations in these 13 genes affecting functional sites such as DNA, ATP binding and various PTM sites occur across several cancers and can be prioritized for additional validation and investigations. PMID:25232094

  5. Evolutionarily conserved regions and hydrophobic contacts at the superfamily level: The case of the fold-type I, pyridoxal-5′-phosphate-dependent enzymes

    PubMed Central

    Paiardini, Alessandro; Bossa, Francesco; Pascarella, Stefano

    2004-01-01

    The wealth of biological information provided by structural and genomic projects opens new prospects of understanding life and evolution at the molecular level. In this work, it is shown how computational approaches can be exploited to pinpoint protein structural features that remain invariant upon long evolutionary periods in the fold-type I, PLP-dependent enzymes. A nonredundant set of 23 superposed crystallographic structures belonging to this superfamily was built. Members of this family typically display high-structural conservation despite low-sequence identity. For each structure, a multiple-sequence alignment of orthologous sequences was obtained, and the 23 alignments were merged using the structural information to obtain a comprehensive multiple alignment of 921 sequences of fold-type I enzymes. The structurally conserved regions (SCRs), the evolutionarily conserved residues, and the conserved hydrophobic contacts (CHCs) were extracted from this data set, using both sequence and structural information. The results of this study identified a structural pattern of hydrophobic contacts shared by all of the superfamily members of fold-type I enzymes and involved in native interactions. This profile highlights the presence of a nucleus for this fold, in which residues participating in the most conserved native interactions exhibit preferential evolutionary conservation, that correlates significantly (r = 0.70) with the extent of mean hydrophobic contact value of their apolar fraction. PMID:15498941

  6. Multilocus sequence typing scheme for the Mycobacterium abscessus complex.

    PubMed

    Macheras, Edouard; Konjek, Julie; Roux, Anne-Laure; Thiberge, Jean-Michel; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby E; Bodmer, Thomas; Jarlier, Vincent; Cambau, Emmanuelle; Brisse, Sylvain; Caro, Valérie; Rastogi, Nalin; Gaillard, Jean-Louis; Heym, Beate

    2014-01-01

    We developed a multilocus sequence typing (MLST) scheme for Mycobacterium abscessus sensu lato, based on the partial sequencing of seven housekeeping genes: argH, cya, glpK, gnd, murC, pta and purH. This scheme was used to characterize a collection of 227 isolates recovered between 1994 and 2010 in France, Germany, Switzerland and Brazil. We identified 100 different sequence types (STs), which were distributed into three groups on the tree obtained by concatenating the sequences of the seven housekeeping gene fragments (3576bp): the M. abscessus sensu stricto group (44 STs), the "M. massiliense" group (31 STs) and the "M. bolletii" group (25 STs). SplitTree analysis showed a degree of intergroup lateral transfers. There was also evidence of lateral transfer events involving rpoB. The most prevalent STs in our collection were ST1 (CC5; 20 isolates) and ST23 (CC3; 31 isolates). Both STs were found in Europe and Brazil, and the latter was implicated in a large post-surgical procedure outbreak in Brazil. Respiratory isolates from patients with cystic fibrosis belonged to a large variety of STs; however, ST2 was predominant in this group of patients. Our MLST scheme, publicly available at www.pasteur.fr/mlst, offers investigators a valuable typing tool for M. abscessus sensu lato in future epidemiological studies throughout the world. Copyright © 2013 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  7. Emergence of new types of Theileria orientalis in Australian cattle and possible cause of theileriosis outbreaks

    PubMed Central

    2011-01-01

    Theileria parasites cause a benign infection of cattle in parts of Australia where they are endemic, but have, in recent years, been suspected of being responsible for a number of outbreaks of disease in cattle near the coast of New South Wales. The objective of this study was to identify and characterize the species of Theileria in cattle on six farms in New South Wales where disease outbreaks have occurred, and compare with Theileria from three disease-free farms in Queensland that is endemic for Theileria. Special reference was made to sub-typing of T. orientalis by type-specific PCR and sequencing of the small subunit (SSU) rRNA gene, and sequence analysis of the gene encoding a polymorphic merozoite/piroplasm surface protein (MPSP) that may be under immune selection. Nucleotide sequencing of SSU rRNA and MPSP genes revealed the presence of four Theileria genotypes: T. orientalis (buffeli), T. orientalis (ikeda), T. orientalis (chitose) and T. orientalis type 4 (MPSP) or type C (SSU rRNA). The majority of animals showed mixed infections while a few showed single infection. When MPSP nucleotide sequences were translated into amino acids, base transition did not change amino acid composition of the protein product, suggesting possible silent polymorphism. The occurrence of ikeda and type 4 (type C) previously not reported to occur and silent mutation is thought to have enhanced parasite evasion of the host immune response causing the outbreak. PMID:21338493

  8. Prevalence of Staphylococcus aureus and Methicillin-Resistant Staphylococcus aureus in Retail Ready-to-Eat Foods in China

    PubMed Central

    Yang, Xiaojuan; Zhang, Jumei; Yu, Shubo; Wu, Qingping; Guo, Weipeng; Huang, Jiahui; Cai, Shuzhen

    2016-01-01

    Staphylococcus aureus, particularly methicillin-resistant S.aureus (MRSA), is a life-threatening pathogen in humans, and its presence in food is a public health concern. MRSA has been identified in foods in China, but little information is available regarding MRSA in ready-to-eat (RTE) foods. We aimed to investigate the prevalence of S. aureus and MRSA in Chinese retail RTE foods. All isolated S. aureus were tested for antimicrobial susceptibility, and MRSA isolates were further characterized by multilocus sequence typing (MLST) and staphylococcal cassette chromosome mec (SCCmec) typing. Of the 550 RTE foods collected from 2011 to 2014, 69 (12.5%) were positive for S. aureus. Contamination levels were mostly in the range of 0.3–10 most probable number (MPN)/g, with five samples exceeding 10 MPN/g. Of the 69 S. aureus isolates, seven were identified as MRSA by cefoxitin disc diffusion test. Six isolates were mecA-positive, while no mecC-positive isolates were identified. In total, 75.8% (47/62) of the methicillin-susceptible S. aureus isolates and all of the MRSA isolates were resistant to three or more antibiotics. Amongst the MRSA isolates, four were identified as community-acquired strains (ST59-MRSA-IVa (n = 2), ST338-MRSA-V, ST1-MRSA-V), while one was a livestock-associated strain (ST9, harboring an unreported SCCmec type 2C2). One novel sequence type was identified (ST3239), the SCCmec gene of which could not be typed. Overall, our findings showed that Chinese retail RTE foods are likely vehicles for transmission of multidrug-resistant S. aureus and MRSA lineages. This is a serious public health risk and highlights the need to implement good hygiene practices. PMID:27375562

  9. Rapid Threat Organism Recognition Pipeline

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Williams, Kelly P.; Solberg, Owen D.; Schoeniger, Joseph S.

    2013-05-07

    The RAPTOR computational pipeline identifies microbial nucleic acid sequences present in sequence data from clinical samples. It takes as input raw short-read genomic sequence data (in particular, the type generated by the Illumina sequencing platforms) and outputs taxonomic evaluation of detected microbes in various human-readable formats. This software was designed to assist in the diagnosis or characterization of infectious disease, by detecting pathogen sequences in nucleic acid sequence data from clinical samples. It has also been applied in the detection of algal pathogens, when algal biofuel ponds became unproductive. RAPTOR first trims and filters genomic sequence reads based on qualitymore » and related considerations, then performs a quick alignment to the human (or other host) genome to filter out host sequences, then performs a deeper search against microbial genomes. Alignment to a protein sequence database is optional. Alignment results are summarized and placed in a taxonomic framework using the Lowest Common Ancestor algorithm.« less

  10. Novel division level bacterial diversity in a Yellowstone hot spring.

    PubMed

    Hugenholtz, P; Pitulle, C; Hershberger, K L; Pace, N R

    1998-01-01

    A culture-independent molecular phylogenetic survey was carried out for the bacterial community in Obsidian Pool (OP), a Yellowstone National Park hot spring previously shown to contain remarkable archaeal diversity (S. M. Barns, R. E. Fundyga, M. W. Jeffries, and N. R. Page, Proc. Natl. Acad. Sci. USA 91:1609-1613, 1994). Small-subunit rRNA genes (rDNA) were amplified directly from OP sediment DNA by PCR with universally conserved or Bacteria-specific rDNA primers and cloned. Unique rDNA types among > 300 clones were identified by restriction fragment length polymorphism, and 122 representative rDNA sequences were determined. These were found to represent 54 distinct bacterial sequence types or clusters (> or = 98% identity) of sequences. A majority (70%) of the sequence types were affiliated with 14 previously recognized bacterial divisions (main phyla; kingdoms); 30% were unaffiliated with recognized bacterial divisions. The unaffiliated sequence types (represented by 38 sequences) nominally comprise 12 novel, division level lineages termed candidate divisions. Several OP sequences were nearly identical to those of cultivated chemolithotrophic thermophiles, including the hydrogen-oxidizing Calderobacterium and the sulfate reducers Thermodesulfovibrio and Thermodesulfobacterium, or belonged to monophyletic assemblages recognized for a particular type of metabolism, such as the hydrogen-oxidizing Aquificales and the sulfate-reducing delta-Proteobacteria. The occurrence of such organisms is consistent with the chemical composition of OP (high in reduced iron and sulfur) and suggests a lithotrophic base for primary productivity in this hot spring, through hydrogen oxidation and sulfate reduction. Unexpectedly, no archaeal sequences were encountered in OP clone libraries made with universal primers. Hybridization analysis of amplified OP DNA with domain-specific probes confirmed that the analyzed community rDNA from OP sediment was predominantly bacterial. These results expand substantially our knowledge of the extent of bacterial diversity and call into question the commonly held notion that Archaea dominate hydrothermal environments. Finally, the currently known extent of division level bacterial phylogenetic diversity is collated and summarized.

  11. Complete coding regions of the prototypes enterovirus B93 and C95: phylogenetic analyses of the P1 and P3 regions of EV-B and EV-C strains.

    PubMed

    Junttila, N; Lévêque, N; Magnius, L O; Kabue, J P; Muyembe-Tamfum, J J; Maslin, J; Lina, B; Norder, H

    2015-03-01

    Complete coding regions were sequenced for two new enterovirus genomes: EV-B93 previously identified by VP1 sequencing, derived from a child with acute flaccid paralysis in the Democratic Republic of Congo; and EV-C95 from a French soldier with acute gastroenteritis in Djibouti. The EV-B93 P1 had more than 30% nucleotide divergence from other EV-B types, with highest similarity to E-15 and EV-B80. The P1 nucleotide sequence of EV-C95 was most similar, 71%, to CV-A21. Complete coding regions for the new enteroviruses were compared with those of 135 EV-B and 176 EV-C strains representing all types available in GenBank. When strains from the same outbreak or strains isolated during the same year in the same geographical region were excluded, 27 of the 58 EV-B, and 16 of the 23 EV-C types were represented by more than one sequence. However, for EV-B the P3 sequences formed three clades mainly according to origin or time of isolation, irrespective of type, while for EV-C the P3 sequences segregated mainly according to disease manifestation, with most strains causing paralysis, including polioviruses, forming one clade, and strains causing respiratory illness forming another. There was no intermixing of types between these two clades, apart from two EV-C96 strains. The EV-B P3 sequences had lower inter-clade and higher intra-clade variability as compared to the EV-C sequences, which may explain why inter-clade recombinations are more frequent in EV-B. Further analysis of more isolates may shed light on the role of recombinations in the evolution of EV-B in geographical context. © 2014 Wiley Periodicals, Inc.

  12. Structural and Sequence Stratigraphic Analysis of the Onshore Nile Delta, Egypt.

    NASA Astrophysics Data System (ADS)

    Barakat, Moataz; Dominik, Wilhelm

    2010-05-01

    The Nile Delta is considered the earliest known delta in the world. It was already described by Herodotus in the 5th Century AC. Nowadays; the Nile Delta is an emerging giant gas province in the Middle East with proven gas reserves which have more than doubled in size in the last years. The Nile Delta basin contains a thick sedimentary sequence inferred to extend from Jurassic to recent time. Structural styles and depositional environments varied during this period. Facies architecture and sequence stratigraphy of the Nile Delta are resolved using seismic stratigraphy based on (2D seismic lines) including synthetic seismograms and tying in well log data. Synthetic seismograms were constructed using sonic and density logs. The combination of structural interpretation and sequence stratigraphy of the development of the basin was resolved. Seven chrono-stratigraphic boundaries have been identified and correlated on seismic and well log data. Several unconformity boundaries also identified on seismic lines range from angular to disconformity type. Furthermore, time structure maps, velocity maps, depth structure maps as well as Isopach maps were constructed using seismic lines and log data. Several structural features were identified: normal faults, growth faults, listric faults, secondary antithetic faults and large rotated fault blocks of manly Miocene age. In some cases minor rollover structures could be identified. Sedimentary features such as paleo-channels were distinctively recognized. Typical Sequence stratigraphic features such as incised valley, clinoforms, topsets, offlaps and onlaps are identified and traced on the seismic lines allowing a good insight into sequence stratigraphic history of the Nile Delta most especially in the Miocene to Pliocene clastic sedimentary succession.

  13. The Organelle Genomes of Hassawi Rice (Oryza sativa L.) and Its Hybrid in Saudi Arabia: Genome Variation, Rearrangement, and Origins

    PubMed Central

    Zhang, Tongwu; Hu, Songnian; Zhang, Guangyu; Pan, Linlin; Zhang, Xiaowei; Al-Mssallem, Ibrahim S.; Yu, Jun

    2012-01-01

    Hassawi rice (Oryza sativa L.) is a landrace adapted to the climate of Saudi Arabia, characterized by its strong resistance to soil salinity and drought. Using high quality sequencing reads extracted from raw data of a whole genome sequencing project, we assembled both chloroplast (cp) and mitochondrial (mt) genomes of the wild-type Hassawi rice (Hassawi-1) and its dwarf hybrid (Hassawi-2). We discovered 16 InDels (insertions and deletions) but no SNP (single nucleotide polymorphism) is present between the two Hassawi cp genomes. We identified 48 InDels and 26 SNPs in the two Hassawi mt genomes and a new type of sequence variation, termed reverse complementary variation (RCV) in the rice cp genomes. There are two and four RCVs identified in Hassawi-1 when compared to 93–11 (indica) and Nipponbare (japonica), respectively. Microsatellite sequence analysis showed there are more SSRs in the genic regions of both cp and mt genomes in the Hassawi rice than in the other rice varieties. There are also large repeats in the Hassawi mt genomes, with the longest length of 96,168 bp and 96,165 bp in Hassawi-1 and Hassawi-2, respectively. We believe that frequent DNA rearrangement in the Hassawi mt and cp genomes indicate ongoing dynamic processes to reach genetic stability under strong environmental pressures. Based on sequence variation analysis and the breeding history, we suggest that both Hassawi-1 and Hassawi-2 originated from the Indonesian variety Peta since genetic diversity between the two Hassawi cultivars is very low albeit an unknown historic origin of the wild-type Hassawi rice. PMID:22870184

  14. Identification and Characterization of 30 K Protein Genes Found in Bombyx mori (Lepidoptera: Bombycidae) Transcriptome

    PubMed Central

    Shi, Xiao-Feng; Li, Yi-Nü; Yi, Yong-Zhu; Xiao, Xing-Guo; Zhang, Zhi-Fang

    2015-01-01

    The 30 K proteins, the major group of hemolymph proteins in the silkworm, Bombyx mori (Lepidoptera: Bombycidae), are structurally related with molecular masses of ∼30 kDa and are involved in various physiological processes, e.g., energy storage, embryonic development, and immune responses. For this report, known 30 K protein gene sequences were used as Blastn queries against sequences in the B. mori transcriptome (SilkTransDB). Twenty-nine cDNAs (Bm30K-1–29) were retrieved, including four being previously unidentified in the Lipoprotein_11 family. The genomic structures of the 29 genes were analyzed and they were mapped to their corresponding chromosomes. Furthermore, phylogenetic analysis revealed that the 29 genes encode three types of 30 K proteins. The members increased in each type is mainly a result of gene duplication with the appearance of each type preceding the differentiation of each species included in the tree. Real-Time Quantitative Polymerase Chain Reaction (Q-PCR) confirmed that the genes could be expressed, and that the three types have different temporal expression patterns. Proteins from the hemolymph was separated by SDS-PAGE, and those with molecular mass of ∼30 kDa were isolated and identified by mass spectrometry sequencing in combination with searches of various databases containing B. mori 30K protein sequences. Of the 34 proteins identified, 13 are members of the 30 K protein family, with one that had not been found in the SilkTransDB, although it had been found in the B. mori genome. Taken together, our results indicate that the 30 K protein family contains many members with various functions. Other methods will be required to find more members of the family. PMID:26078299

  15. How Primary Care Providers Talk to Patients about Genome Sequencing Results: Risk, Rationale, and Recommendation.

    PubMed

    Vassy, Jason L; Davis, J Kelly; Kirby, Christine; Richardson, Ian J; Green, Robert C; McGuire, Amy L; Ubel, Peter A

    2018-06-01

    Genomics will play an increasingly prominent role in clinical medicine. To describe how primary care physicians (PCPs) discuss and make clinical recommendations about genome sequencing results. Qualitative analysis. PCPs and their generally healthy patients undergoing genome sequencing. Patients received clinical genome reports that included four categories of results: monogenic disease risk variants (if present), carrier status, five pharmacogenetics results, and polygenic risk estimates for eight cardiometabolic traits. Patients' office visits with their PCPs were audio-recorded, and summative content analysis was used to describe how PCPs discussed genomic results. For each genomic result discussed in 48 PCP-patient visits, we identified a "take-home" message (recommendation), categorized as continuing current management, further treatment, further evaluation, behavior change, remembering for future care, or sharing with family members. We analyzed how PCPs came to each recommendation by identifying 1) how they described the risk or importance of the given result and 2) the rationale they gave for translating that risk into a specific recommendation. Quantitative analysis showed that continuing current management was the most commonly coded recommendation across results overall (492/749, 66%) and for each individual result type except monogenic disease risk results. Pharmacogenetics was the most common result type to prompt a recommendation to remember for future care (94/119, 79%); carrier status was the most common type prompting a recommendation to share with family members (45/54, 83%); and polygenic results were the most common type prompting a behavior change recommendation (55/58, 95%). One-fifth of recommendation codes associated with monogenic results were for further evaluation (6/24, 25%). Rationales for these recommendations included patient context, family context, and scientific/clinical limitations of sequencing. PCPs distinguish substantive differences among categories of genome sequencing results and use clinical judgment to justify continuing current management in generally healthy patients with genomic results.

  16. Development of an ELA-DRA gene typing method based on pyrosequencing technology.

    PubMed

    Díaz, S; Echeverría, M G; It, V; Posik, D M; Rogberg-Muñoz, A; Pena, N L; Peral-García, P; Vega-Pla, J L; Giovambattista, G

    2008-11-01

    The polymorphism of equine lymphocyte antigen (ELA) class II DRA gene had been detected by polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and reference strand-mediated conformation analysis. These methodologies allowed to identify 11 ELA-DRA exon 2 sequences, three of which are widely distributed among domestic horse breeds. Herein, we describe the development of a pyrosequencing-based method applicable to ELA-DRA typing, by screening samples from eight different horse breeds previously typed by PCR-SSCP. This sequence-based method would be useful in high-throughput genotyping of major histocompatibility complex genes in horses and other animal species, making this system interesting as a rapid screening method for animal genotyping of immune-related genes.

  17. Cytogenetic Analysis of Populus trichocarpa - Ribosomal DNA, Telomere Repeat Sequence, and Marker-selected BACs

    Treesearch

    M.N. lslam-Faridi; C.D. Nelson; S.P. DiFazio; L.E. Gunter; G.A. Tuskan

    2009-01-01

    The 185-285 rDNA and 55 rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 185-285 rDNA sites and one 55 rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis-type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones...

  18. Occurrence and molecular analysis of Balantidium coli in mountain gorilla (Gorilla beringei beringei) in the Volcanoes National Park, Rwanda.

    PubMed

    Hassell, James M; Blake, Damer P; Cranfield, Michael R; Ramer, Jan; Hogan, Jennifer N; Noheli, Jean Bosco; Waters, Michael; Hermosilla, Carlos

    2013-10-01

    Cysts morphologically resembling Balantidium coli were identified in the feces of a mountain gorilla (Gorilla beringei beringei). Confirmatory PCR and sequencing revealed two distinct B. coli-specific sequences (ITS-1, sub-types A0 and B1). This represents the first report of B. coli in this species, raising the possibility of infection from a reservoir host.

  19. Occurrence of Carbapenemase-Producing Enterobacteriaceae Isolates in the Wildlife: First Report of OXA-48 in Wild Boars in Algeria.

    PubMed

    Bachiri, Taous; Bakour, Sofiane; Lalaoui, Rym; Belkebla, Nadia; Allouache, Meriem; Rolain, Jean Marc; Touati, Abdelaziz

    2018-04-01

    The aim of the present study was to screen for the presence of carbapenemase-producing Enterobacteriaceae (CPE) isolates from wild boars and Barbary macaques in Algeria. Fecal samples were collected from wild boars (n = 168) and Barbary macaques (n = 212), in Bejaia, Algeria, between September 2014 and April 2016. The isolates were identified and antimicrobial susceptibility was determined. Carbapenem resistance determinants were studied using PCR and sequencing, while clonal relatedness was performed using multilocus sequence typing (MLST). PCR was used to investigate certain virulence genes. Three CPE isolates from three different samples (1.8%) recovered from wild boars were identified as Escherichia coli (two isolates) and Klebsiella pneumoniae (one isolate). These isolates were resistant to amoxicillin, amoxicillin-clavulanate, tobramycin, ertapenem, and meropenem. The results of PCR and sequencing analysis showed that all three isolates produced the OXA-48 enzyme. The MLST showed that the two E. coli isolates were assigned to the same sequence type, ST635, and belonged to phylogroup A, whereas K. pneumoniae strain belonged to ST13. The K. pneumoniae strain was positive for multiple virulence factors, whereas no virulence determinants were found in E. coli isolates. This is the first report of OXA-48-producing Enterobacteriaceae in wild animals from Algeria and Africa.

  20. Detailed Investigation of the Role of Common and Low-Frequency WFS1 Variants in Type 2 Diabetes Risk

    PubMed Central

    Fawcett, Katherine A.; Wheeler, Eleanor; Morris, Andrew P.; Ricketts, Sally L.; Hallmans, Göran; Rolandsson, Olov; Daly, Allan; Wasson, Jon; Permutt, Alan; Hattersley, Andrew T.; Glaser, Benjamin; Franks, Paul W.; McCarthy, Mark I.; Wareham, Nicholas J.; Sandhu, Manjinder S.; Barroso, Inês

    2010-01-01

    OBJECTIVE Wolfram syndrome 1 (WFS1) single nucleotide polymorphisms (SNPs) are associated with risk of type 2 diabetes. In this study we aimed to refine this association and investigate the role of low-frequency WFS1 variants in type 2 diabetes risk. RESEARCH DESIGN AND METHODS For fine-mapping, we sequenced WFS1 exons, splice junctions, and conserved noncoding sequences in samples from 24 type 2 diabetic case and 68 control subjects, selected tagging SNPs, and genotyped these in 959 U.K. type 2 diabetic case and 1,386 control subjects. The same genomic regions were sequenced in samples from 1,235 type 2 diabetic case and 1,668 control subjects to compare the frequency of rarer variants between case and control subjects. RESULTS Of 31 tagging SNPs, the strongest associated was the previously untested 3′ untranslated region rs1046320 (P = 0.008); odds ratio 0.84 and P = 6.59 × 10−7 on further replication in 3,753 case and 4,198 control subjects. High correlation between rs1046320 and the original strongest SNP (rs10010131) (r2 = 0.92) meant that we could not differentiate between their effects in our samples. There was no difference in the cumulative frequency of 82 rare (minor allele frequency [MAF] <0.01) nonsynonymous variants between type 2 diabetic case and control subjects (P = 0.79). Two intermediate frequency (MAF 0.01–0.05) nonsynonymous changes also showed no statistical association with type 2 diabetes. CONCLUSIONS We identified six highly correlated SNPs that show strong and comparable associations with risk of type 2 diabetes, but further refinement of these associations will require large sample sizes (>100,000) or studies in ethnically diverse populations. Low frequency variants in WFS1 are unlikely to have a large impact on type 2 diabetes risk in white U.K. populations, highlighting the complexities of undertaking association studies with low-frequency variants identified by resequencing. PMID:20028947

  1. Prophage-Mediated Dynamics of ‘Candidatus Liberibacter asiaticus’ Populations, the Destructive Bacterial Pathogens of Citrus Huanglongbing

    PubMed Central

    Zhou, Lijuan; Powell, Charles A.; Li, Wenbin; Irey, Mike; Duan, Yongping

    2013-01-01

    Prophages are highly dynamic components in the bacterial genome and play an important role in intraspecies variations. There are at least two prophages in the chromosomes of Candidatus Liberibacter asiaticus’ (Las) Floridian isolates. Las is both unculturable and the most prevalent species of Liberibacter pathogens that cause huanglongbing (HLB), a worldwide destructive disease of citrus. In this study, seven new prophage variants resulting from two hyper-variable regions were identified by screening clone libraries of infected citrus, periwinkle and psyllids. Among them, Types A and B share highly conserved sequences and localize within the two prophages, FP1 and FP2, respectively. Although Types B and C were abundant in all three libraries, Type A was much more abundant in the libraries from the Las-infected psyllids than from the Las-infected plants, and Type D was only identified in libraries from the infected host plants but not from the infected psyllids. Sequence analysis of these variants revealed that the variations may result from recombination and rearrangement events. Conventional PCR results using type-specific molecular markers indicated that A, B, C and D are the four most abundant types in Las-infected citrus and periwinkle. However, only three types, A, B and C are abundant in Las-infected psyllids. Typing results for Las-infected citrus field samples indicated that mixed populations of Las bacteria present in Floridian isolates, but only the Type D population was correlated with the blotchy mottle symptom. Extended cloning and sequencing of the Type D region revealed a third prophage/phage in the Las genome, which may derive from the recombination of FP1 and FP2. Dramatic variations in these prophage regions were also found among the global Las isolates. These results are the first to demonstrate the prophage/phage-mediated dynamics of Las populations in plant and insect hosts, and their correlation with insect transmission and disease development. PMID:24349235

  2. Prophage-mediated dynamics of 'Candidatus Liberibacter asiaticus' populations, the destructive bacterial pathogens of citrus huanglongbing.

    PubMed

    Zhou, Lijuan; Powell, Charles A; Li, Wenbin; Irey, Mike; Duan, Yongping

    2013-01-01

    Prophages are highly dynamic components in the bacterial genome and play an important role in intraspecies variations. There are at least two prophages in the chromosomes of Candidatus Liberibacter asiaticus' (Las) Floridian isolates. Las is both unculturable and the most prevalent species of Liberibacter pathogens that cause huanglongbing (HLB), a worldwide destructive disease of citrus. In this study, seven new prophage variants resulting from two hyper-variable regions were identified by screening clone libraries of infected citrus, periwinkle and psyllids. Among them, Types A and B share highly conserved sequences and localize within the two prophages, FP1 and FP2, respectively. Although Types B and C were abundant in all three libraries, Type A was much more abundant in the libraries from the Las-infected psyllids than from the Las-infected plants, and Type D was only identified in libraries from the infected host plants but not from the infected psyllids. Sequence analysis of these variants revealed that the variations may result from recombination and rearrangement events. Conventional PCR results using type-specific molecular markers indicated that A, B, C and D are the four most abundant types in Las-infected citrus and periwinkle. However, only three types, A, B and C are abundant in Las-infected psyllids. Typing results for Las-infected citrus field samples indicated that mixed populations of Las bacteria present in Floridian isolates, but only the Type D population was correlated with the blotchy mottle symptom. Extended cloning and sequencing of the Type D region revealed a third prophage/phage in the Las genome, which may derive from the recombination of FP1 and FP2. Dramatic variations in these prophage regions were also found among the global Las isolates. These results are the first to demonstrate the prophage/phage-mediated dynamics of Las populations in plant and insect hosts, and their correlation with insect transmission and disease development.

  3. Analysis and functional annotation of expressed sequence tags from in vitro cell lines of elasmobranchs: spiny dogfish shark (Squalus acanthias) and little skate (Leucoraja erinacea)

    PubMed Central

    Parton, Angela; Bayne, Christopher J.; Barnes, David W.

    2010-01-01

    Elasmobranchs are the most commonly used experimental models among the jawed, cartilaginous fish (Chondrichthyes). Previously we developed cell lines from embryos of two elasmobranchs, Squalus acanthias the spiny dogfish shark (SAE line), and Leucoraja erinacea the little skate (LEE-1 line). From these lines cDNA libraries were derived and expressed sequence tags (ESTs) generated. From the SAE cell line 4303 unique transcripts were identified, with 1848 of these representing unknown sequences (showing no BLASTX identification). From the LEE-1 cell line, 3660 unique transcripts were identified, and unknown, unique sequences totaled 1333. Gene Ontology (GO) annotation showed that GO assignments for the two cell lines were in general similar. These results suggest that the procedures used to derive the cell lines led to isolation of cell types of the same general embryonic origin from both species. The LEE-1 transcripts included GO categories “envelope” and “oxidoreductase activity” but the SAE transcripts did not. GO analysis of SAE transcripts identified the category “anatomical structure formation” that was not present in LEE-1 cells. Increased organelle compartments may exist within LEE-1 cells compared to SAE cells, and the higher oxidoreductase activity in LEE-1 cells may indicate a role for these cells in responses associated with innate immunity or in steroidogenesis. These EST libraries from elasmobranch cell lines provide information for assembly of genomic sequences and are useful in revealing gene diversity, new genes and molecular markers, as well as in providing means for elucidation of full-length cDNAs and probes for gene array analyses. This is the first study of this type with members of the Chondrichthyes. PMID:20471924

  4. Analysis and functional annotation of expressed sequence tags from in vitro cell lines of elasmobranchs: Spiny dogfish shark (Squalus acanthias) and little skate (Leucoraja erinacea).

    PubMed

    Parton, Angela; Bayne, Christopher J; Barnes, David W

    2010-09-01

    Elasmobranchs are the most commonly used experimental models among the jawed, cartilaginous fish (Chondrichthyes). Previously we developed cell lines from embryos of two elasmobranchs, Squalus acanthias the spiny dogfish shark (SAE line), and Leucoraja erinacea the little skate (LEE-1 line). From these lines cDNA libraries were derived and expressed sequence tags (ESTs) generated. From the SAE cell line 4303 unique transcripts were identified, with 1848 of these representing unknown sequences (showing no BLASTX identification). From the LEE-1 cell line, 3660 unique transcripts were identified, and unknown, unique sequences totaled 1333. Gene Ontology (GO) annotation showed that GO assignments for the two cell lines were in general similar. These results suggest that the procedures used to derive the cell lines led to isolation of cell types of the same general embryonic origin from both species. The LEE-1 transcripts included GO categories "envelope" and "oxidoreductase activity" but the SAE transcripts did not. GO analysis of SAE transcripts identified the category "anatomical structure formation" that was not present in LEE-1 cells. Increased organelle compartments may exist within LEE-1 cells compared to SAE cells, and the higher oxidoreductase activity in LEE-1 cells may indicate a role for these cells in responses associated with innate immunity or in steroidogenesis. These EST libraries from elasmobranch cell lines provide information for assembly of genomic sequences and are useful in revealing gene diversity, new genes and molecular markers, as well as in providing means for elucidation of full-length cDNAs and probes for gene array analyses. This is the first study of this type with members of the Chondrichthyes. Copyright 2010 Elsevier Inc. All rights reserved.

  5. Molecular characterization and phylogenetic relationships among microsporidian isolates infecting silkworm, Bombyx mori using small subunit rRNA (SSU-rRNA) gene sequence analysis.

    PubMed

    Nath, B Surendra; Gupta, S K; Bajpai, A K

    2012-12-01

    The life cycle, spore morphology, pathogenicity, tissue specificity, mode of transmission and small subunit rRNA (SSU-rRNA) gene sequence analysis of the five new microsporidian isolates viz., NIWB-11bp, NIWB-12n, NIWB-13md, NIWB-14b and NIWB-15mb identified from the silkworm, Bombyx mori have been studied along with type species, NIK-1s_mys. The life cycle of the microsporidians identified exhibited the sequential developmental cycles that are similar to the general developmental cycle of the genus, Nosema. The spores showed considerable variations in their shape, length and width. The pathogenicity observed was dose-dependent and differed from each of the microsporidian isolates; the NIWB-15mb was found to be more virulent than other isolates. All of the microsporidians were found to infect most of the tissues examined and showed gonadal infection and transovarial transmission in the infected silkworms. SSU-rRNA sequence based phylogenetic tree placed NIWB-14b, NIWB-12n and NIWB-11bp in a separate branch along with other Nosema species and Nosema bombycis; while NIWB-15mb and NIWB-13md together formed another cluster along with other Nosema species. NIK-1s_mys revealed a signature sequence similar to standard type species, N. bombycis, indicating that NIK-1s_mys is similar to N. bombycis. Based on phylogenetic relationships, branch length information based on genetic distance and nucleotide differences, we conclude that the microsporidian isolates identified are distinctly different from the other known species and belonging to the genus, Nosema. This SSU-rRNA gene sequence analysis method is found to be more useful approach in detecting different and closely related microsporidians of this economically important domestic insect.

  6. Dissecting genetic and environmental mutation signatures with model organisms.

    PubMed

    Segovia, Romulo; Tam, Annie S; Stirling, Peter C

    2015-08-01

    Deep sequencing has impacted on cancer research by enabling routine sequencing of genomes and exomes to identify genetic changes associated with carcinogenesis. Researchers can now use the frequency, type, and context of all mutations in tumor genomes to extract mutation signatures that reflect the driving mutational processes. Identifying mutation signatures, however, may not immediately suggest a mechanism. Consequently, several recent studies have employed deep sequencing of model organisms exposed to discrete genetic or environmental perturbations. These studies exploit the simpler genomes and availability of powerful genetic tools in model organisms to analyze mutation signatures under controlled conditions, forging mechanistic links between mutational processes and signatures. We discuss the power of this approach and suggest that many such studies may be on the horizon. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Next-generation transcriptome sequencing, SNP discovery and validation in four market classes of peanut, Arachis hypogaea L.

    PubMed

    Chopra, Ratan; Burow, Gloria; Farmer, Andrew; Mudge, Joann; Simpson, Charles E; Wilkins, Thea A; Baring, Michael R; Puppala, Naveen; Chamberlin, Kelly D; Burow, Mark D

    2015-06-01

    Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker-trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. CopyDNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, compromising 43,108 contigs; 263,840 SNP and indel variants were identified among four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.

  8. Characterisation of IS153, an IS3-family insertion sequence isolated from Lactobacillus sanfranciscensis and its use for strain differentiation.

    PubMed

    Ehrmann, M A; Vogel, R E

    2001-11-01

    An insertion sequence has been identified in the genome of Lactobacillus sanfranciscensis DSM 20451T as segment of 1351 nucleotides containing 37-bp imperfect terminal inverted repeats. The sequence of this element encodes two out of phase, overlapping open reading frames, orfA and orfB, from which three putative proteins are produced. OrfAB is a transframe protein produced by -1 translational frame shifting between orf A and orf B that is presumed to be the transposase. The large orfAB of this element encodes a 342 amino acid protein that displays similarities with transposases encoded by bacterial insertion sequences belonging to the IS3 family. In L. sanfranciscensis type strain DSM 20451T multiple truncated IS elements were identified. Inverse PCR was used to analyze target sites of four of these elements, but except of their highly AT rich character not any sequence specificity was identified so far. Moreover, no flanking direct repeats were identified. Multiple copies of IS153 were detected by hybridization in other strains of L. sanfranciscensis. Resulting hybridization patterns were shown to differentiate between organisms at strain level rather than a probe targeted against the 16S rDNA. With a PCR based approach IS153 or highly similar sequences were detected in L. acidophilus, L. casei, L. malefermentans, L. plantarum, L. hilgardii, L. collinoides L. farciminis L. sakei and L. salivarius, L. reuteri as well as in Enterococcus faecium, Pediococcus acidilactici and P. pentosaceus.

  9. A seminested PCR assay for detection and typing of human papillomavirus based on E1 gene sequences.

    PubMed

    Cavalcante, Gustavo Henrique O; de Araújo, Josélio M G; Fernandes, José Veríssimo; Lanza, Daniel C F

    2018-05-01

    HPV infection is considered one of the leading causes of cervical cancer in the world. To date, more than 180 types of HPV have been described and viral typing is critical for defining the prognosis of cancer. In this work, a seminested PCR which allow fast and inexpensively detection and typing of HPV is presented. The system is based on the amplification of a variable length region within the viral gene E1, using three primers that potentially anneal in all HPV genomes. The amplicons produced in the first step can be identified by high resolution electrophoresis or direct sequencing. The seminested step includes nine specific primers which can be used in multiplex or individual reactions to discriminate the main types of HPV by amplicon size differentiation using agarose electrophoresis, reducing the time spent and cost per analysis. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Automated typing of red blood cell and platelet antigens: a whole-genome sequencing study.

    PubMed

    Lane, William J; Westhoff, Connie M; Gleadall, Nicholas S; Aguad, Maria; Smeland-Wagman, Robin; Vege, Sunitha; Simmons, Daimon P; Mah, Helen H; Lebo, Matthew S; Walter, Klaudia; Soranzo, Nicole; Di Angelantonio, Emanuele; Danesh, John; Roberts, David J; Watkins, Nick A; Ouwehand, Willem H; Butterworth, Adam S; Kaufman, Richard M; Rehm, Heidi L; Silberstein, Leslie E; Green, Robert C

    2018-06-01

    There are more than 300 known red blood cell (RBC) antigens and 33 platelet antigens that differ between individuals. Sensitisation to antigens is a serious complication that can occur in prenatal medicine and after blood transfusion, particularly for patients who require multiple transfusions. Although pre-transfusion compatibility testing largely relies on serological methods, reagents are not available for many antigens. Methods based on single-nucleotide polymorphism (SNP) arrays have been used, but typing for ABO and Rh-the most important blood groups-cannot be done with SNP typing alone. We aimed to develop a novel method based on whole-genome sequencing to identify RBC and platelet antigens. This whole-genome sequencing study is a subanalysis of data from patients in the whole-genome sequencing arm of the MedSeq Project randomised controlled trial (NCT01736566) with no measured patient outcomes. We created a database of molecular changes in RBC and platelet antigens and developed an automated antigen-typing algorithm based on whole-genome sequencing (bloodTyper). This algorithm was iteratively improved to address cis-trans haplotype ambiguities and homologous gene alignments. Whole-genome sequencing data from 110 MedSeq participants (30 × depth) were used to initially validate bloodTyper through comparison with conventional serology and SNP methods for typing of 38 RBC antigens in 12 blood-group systems and 22 human platelet antigens. bloodTyper was further validated with whole-genome sequencing data from 200 INTERVAL trial participants (15 × depth) with serological comparisons. We iteratively improved bloodTyper by comparing its typing results with conventional serological and SNP typing in three rounds of testing. The initial whole-genome sequencing typing algorithm was 99·5% concordant across the first 20 MedSeq genomes. Addressing discordances led to development of an improved algorithm that was 99·8% concordant for the remaining 90 MedSeq genomes. Additional modifications led to the final algorithm, which was 99·2% concordant across 200 INTERVAL genomes (or 99·9% after adjustment for the lower depth of coverage). By enabling more precise antigen-matching of patients with blood donors, antigen typing based on whole-genome sequencing provides a novel approach to improve transfusion outcomes with the potential to transform the practice of transfusion medicine. National Human Genome Research Institute, Doris Duke Charitable Foundation, National Health Service Blood and Transplant, National Institute for Health Research, and Wellcome Trust. Copyright © 2018 Elsevier Ltd. All rights reserved.

  11. Strain/species identification in metagenomes using genome-specific markers

    PubMed Central

    Tu, Qichao; He, Zhili; Zhou, Jizhong

    2014-01-01

    Shotgun metagenome sequencing has become a fast, cheap and high-throughput technology for characterizing microbial communities in complex environments and human body sites. However, accurate identification of microorganisms at the strain/species level remains extremely challenging. We present a novel k-mer-based approach, termed GSMer, that identifies genome-specific markers (GSMs) from currently sequenced microbial genomes, which were then used for strain/species-level identification in metagenomes. Using 5390 sequenced microbial genomes, 8 770 321 50-mer strain-specific and 11 736 360 species-specific GSMs were identified for 4088 strains and 2005 species (4933 strains), respectively. The GSMs were first evaluated against mock community metagenomes, recently sequenced genomes and real metagenomes from different body sites, suggesting that the identified GSMs were specific to their targeting genomes. Sensitivity evaluation against synthetic metagenomes with different coverage suggested that 50 GSMs per strain were sufficient to identify most microbial strains with ≥0.25× coverage, and 10% of selected GSMs in a database should be detected for confident positive callings. Application of GSMs identified 45 and 74 microbial strains/species significantly associated with type 2 diabetes patients and obese/lean individuals from corresponding gastrointestinal tract metagenomes, respectively. Our result agreed with previous studies but provided strain-level information. The approach can be directly applied to identify microbial strains/species from raw metagenomes, without the effort of complex data pre-processing. PMID:24523352

  12. Sleep-stage sequencing of sleep-onset REM periods in MSLT predicts treatment response in patients with narcolepsy.

    PubMed

    Drakatos, Panagis; Patel, Kishankumar; Thakrar, Chiraag; Williams, Adrian J; Kent, Brian D; Leschziner, Guy D

    2016-04-01

    Current treatment recommendations for narcolepsy suggest that modafinil should be used as a first-line treatment ahead of conventional stimulants or sodium oxybate. In this study, performed in a tertiary sleep disorders centre, treatment responses were examined following these recommendations, and the ability of sleep-stage sequencing of sleep-onset rapid eye movement periods in the multiple sleep latency test to predict treatment response. Over a 3.5-year period, 255 patients were retrospectively identified in the authors' database as patients diagnosed with narcolepsy, type 1 (with cataplexy) or type 2 (without) using clinical and polysomnographic criteria. Eligible patients were examined in detail, sleep study data were abstracted and sleep-stage sequencing of sleep-onset rapid eye movement periods were analysed. Response to treatment was graded utilizing an internally developed scale. Seventy-five patients were included (39% males). Forty (53%) were diagnosed with type 1 narcolepsy with a mean follow-up of 2.37 ± 1.35 years. Ninety-seven percent of the patients were initially started on modafinil, and overall 59% reported complete response on the last follow-up. Twenty-nine patients (39%) had the sequence of sleep stage 1 or wake to rapid eye movement in all of their sleep-onset rapid eye movement periods, with most of these diagnosed as narcolepsy type 1 (72%). The presence of this specific sleep-stage sequence in all sleep-onset rapid eye movement periods was associated with worse treatment response (P = 0.0023). Sleep-stage sequence analysis of sleep-onset rapid eye movement periods in the multiple sleep latency test may aid the prediction of treatment response in narcoleptics and provide a useful prognostic tool in clinical practice, above and beyond their classification as narcolepsy type 1 or 2. © 2015 European Sleep Research Society.

  13. Molecular epidemiology of Pseudomonas aeruginosa clinical isolates from Korea producing β-lactamases with extended-spectrum activity.

    PubMed

    Bae, Il Kwon; Suh, Borum; Jeong, Seok Hoon; Wang, Kang-Kyun; Kim, Yong-Rok; Yong, Dongeun; Lee, Kyungwon

    2014-07-01

    This study was performed to investigate the prevalence and molecular epidemiology of Pseudomonas aeruginosa isolates from Korea that produce enzymes with extended-spectrum (ES) activity to β-lactams. A total of 205 non-duplicate P. aeruginosa clinical isolates were collected from 18 university hospitals in Korea. PCR and sequencing experiments were performed to identify genes encoding β-lactamases. PCR mapping and sequencing of the regions surrounding the β-lactamase genes were performed. Multilocus sequence typing experiments were performed. The most common sequence type (ST) was ST235 (n = 96), and 2 single-locus variants of ST235, ST1015 (n = 1) and ST1162 (n = 1), were also identified. These 3 STs were grouped as a clonal complex (CC), CC235. The remaining 107 isolates were identified as 59 different STs. Isolates belonging to CC235 showed higher rates of non-susceptibility to imipenem (85.4% versus 47.7%) and meropenem (92.7% versus 52.3%) compared to non-CC235 isolates. All the metallo-β-lactamase (MBL)-producing isolates were identified as CC235, except for 1 ST591. Genes encoding OXA-17 and OXA-142 were detected in 1 isolate and 4 isolates of CC235, respectively; while the bla(SHV-12) gene was detected in 4 non-CC235 isolates. Class A and D β-lactamases with ES activity play a role in acquiring ceftazidime resistance in P. aeruginosa in Korea. Production of IMP-6 and VIM-2 MBLs is the main mechanisms in acquiring resistance to ceftazidime and carbapenems in P. aeruginosa isolates in Korea. Clonal spread of P. aeruginosa CC235 may be an important conduit for the dissemination of MBL genes in Korea. Copyright © 2014 Elsevier Inc. All rights reserved.

  14. Molecular Epidemiologic Comparison of 2 Unusual Clusters of Group A Streptococcal Necrotizing Fasciitis in Hawaii

    PubMed Central

    Erdem, Guliz; Ford, Jacqueline M.; Kanenaka, Rebecca Y.; Abe, Lucienne; Yamaga, Karen; Effler, Paul V.

    2006-01-01

    Two clusters of necrotizing fasciitis (NF) due to group A streptococcus (GAS) were identified on the Hawaiian islands of Kauai and Maui during 1997 and 2002, respectively. The emm gene sequence types and the pulsed-field gel electrophoresis patterns were determined for 6 isolates recovered from patients with NF and for 116 isolates recovered from patients with temporally associated community-acquired GAS infection. No predominant emm type was identified, and the emm types of 64 (52.5%) of the isolates were considered to be uncommon in the continental United States. These findings suggest that unusual emm types might be responsible for invasive GAS infections in patients from Hawaii. PMID:15909276

  15. Viral expression associated with gastrointestinal adenocarcinomas in TCGA high-throughput sequencing data

    PubMed Central

    2013-01-01

    Background Up to 20% of cancers worldwide are thought to be associated with microbial pathogens, including bacteria and viruses. The widely used methods of viral infection detection are usually limited to a few a priori suspected viruses in one cancer type. To our knowledge, there have not been many broad screening approaches to address this problem more comprehensively. Methods In this study, we performed a comprehensive screening for viruses in nine common cancers using a multistep computational approach. Tumor transcriptome and genome sequencing data were available from The Cancer Genome Atlas (TCGA). Nine hundred fifty eight primary tumors in nine common cancers with poor prognosis were screened against a non-redundant database of virus sequences. DNA sequences from normal matched tissue specimens were used as controls to test whether each virus is associated with tumors. Results We identified human papilloma virus type 18 (HPV-18) and four human herpes viruses (HHV) types 4, 5, 6B, and 8, also known as EBV, CMV, roseola virus, and KSHV, in colon, rectal, and stomach adenocarcinomas. In total, 59% of screened gastrointestinal adenocarcinomas (GIA) were positive for at least one virus: 26% for EBV, 21% for CMV, 7% for HHV-6B, and 20% for HPV-18. Over 20% of tumors were co-infected with multiple viruses. Two viruses (EBV and CMV) were statistically significantly associated with colorectal cancers when compared to the matched healthy tissues from the same individuals (p = 0.02 and 0.03, respectively). HPV-18 was not detected in DNA, and thus, no association testing was possible. Nevertheless, HPV-18 expression patterns suggest viral integration in the host genome, consistent with the potentially oncogenic nature of HPV-18 in colorectal adenocarcinomas. The estimated counts of viral copies were below one per cell for all identified viruses and approached the detection limit. Conclusions Our comprehensive screening for viruses in multiple cancer types using next-generation sequencing data clearly demonstrates the presence of viral sequences in GIA. EBV, CMV, and HPV-18 are potentially causal for GIA, although their oncogenic role is yet to be established. PMID:24279398

  16. Single-Cell Sequencing of the Healthy and Diseased Heart Reveals Ckap4 as a New Modulator of Fibroblasts Activation.

    PubMed

    Gladka, Monika M; Molenaar, Bas; de Ruiter, Hesther; van der Elst, Stefan; Tsui, Hoyee; Versteeg, Danielle; Lacraz, Grègory P A; Huibers, Manon M H; van Oudenaarden, Alexander; van Rooij, Eva

    2018-01-31

    Background -Genome-wide transcriptome analysis has greatly advanced our understanding of the regulatory networks underlying basic cardiac biology and mechanisms driving disease. However, so far, the resolution of studying gene expression patterns in the adult heart has been limited to the level of extracts from whole tissues. The use of tissue homogenates inherently causes the loss of any information on cellular origin or cell type-specific changes in gene expression. Recent developments in RNA amplification strategies provide a unique opportunity to use small amounts of input RNA for genome-wide sequencing of single cells. Methods -Here, we present a method to obtain high quality RNA from digested cardiac tissue from adult mice for automated single-cell sequencing of both the healthy and diseased heart. Results -After optimization, we were able to perform single-cell sequencing on adult cardiac tissue under both homeostatic conditions and after ischemic injury. Clustering analysis based on differential gene expression unveiled known and novel markers of all main cardiac cell types. Based on differential gene expression we were also able to identify multiple subpopulations within a certain cell type. Furthermore, applying single-cell sequencing on both the healthy and the injured heart indicated the presence of disease-specific cell subpopulations. As such, we identified cytoskeleton associated protein 4 ( Ckap4 ) as a novel marker for activated fibroblasts that positively correlates with known myofibroblast markers in both mouse and human cardiac tissue. Ckap4 inhibition in activated fibroblasts treated with TGFβ triggered a greater increase in the expression of genes related to activated fibroblasts compared to control, suggesting a role of Ckap4 in modulating fibroblast activation in the injured heart. Conclusions -Single-cell sequencing on both the healthy and diseased adult heart allows us to study transcriptomic differences between cardiac cells, as well as cell type-specific changes in gene expression during cardiac disease. This new approach provides a wealth of novel insights into molecular changes that underlie the cellular processes relevant for cardiac biology and pathophysiology. Applying this technology could lead to the discovery of new therapeutic targets relevant for heart disease.

  17. Evaluation of a Method Using Three Genomic Guided Escherichia coli Markers for Phylogenetic Typing of E. coli Isolates of Various Genetic Backgrounds

    PubMed Central

    Hamamoto, Kouta; Ueda, Shuhei; Yamamoto, Yoshimasa

    2015-01-01

    Genotyping and characterization of bacterial isolates are essential steps in the identification and control of antibiotic-resistant bacterial infections. Recently, one novel genotyping method using three genomic guided Escherichia coli markers (GIG-EM), dinG, tonB, and dipeptide permease (DPP), was reported. Because GIG-EM has not been fully evaluated using clinical isolates, we assessed this typing method with 72 E. coli collection of reference (ECOR) environmental E. coli reference strains and 63 E. coli isolates of various genetic backgrounds. In this study, we designated 768 bp of dinG, 745 bp of tonB, and 655 bp of DPP target sequences for use in the typing method. Concatenations of the processed marker sequences were used to draw GIG-EM phylogenetic trees. E. coli isolates with identical sequence types as identified by the conventional multilocus sequence typing (MLST) method were localized to the same branch of the GIG-EM phylogenetic tree. Sixteen clinical E. coli isolates were utilized as test isolates without prior characterization by conventional MLST and phylogenetic grouping before GIG-EM typing. Of these, 14 clinical isolates were assigned to a branch including only isolates of a pandemic clone, E. coli B2-ST131-O25b, and these results were confirmed by conventional typing methods. Our results suggested that the GIG-EM typing method and its application to phylogenetic trees might be useful tools for the molecular characterization and determination of the genetic relationships among E. coli isolates. PMID:25809972

  18. Evaluation of a Method Using Three Genomic Guided Escherichia coli Markers for Phylogenetic Typing of E. coli Isolates of Various Genetic Backgrounds.

    PubMed

    Hamamoto, Kouta; Ueda, Shuhei; Yamamoto, Yoshimasa; Hirai, Itaru

    2015-06-01

    Genotyping and characterization of bacterial isolates are essential steps in the identification and control of antibiotic-resistant bacterial infections. Recently, one novel genotyping method using three genomic guided Escherichia coli markers (GIG-EM), dinG, tonB, and dipeptide permease (DPP), was reported. Because GIG-EM has not been fully evaluated using clinical isolates, we assessed this typing method with 72 E. coli collection of reference (ECOR) environmental E. coli reference strains and 63 E. coli isolates of various genetic backgrounds. In this study, we designated 768 bp of dinG, 745 bp of tonB, and 655 bp of DPP target sequences for use in the typing method. Concatenations of the processed marker sequences were used to draw GIG-EM phylogenetic trees. E. coli isolates with identical sequence types as identified by the conventional multilocus sequence typing (MLST) method were localized to the same branch of the GIG-EM phylogenetic tree. Sixteen clinical E. coli isolates were utilized as test isolates without prior characterization by conventional MLST and phylogenetic grouping before GIG-EM typing. Of these, 14 clinical isolates were assigned to a branch including only isolates of a pandemic clone, E. coli B2-ST131-O25b, and these results were confirmed by conventional typing methods. Our results suggested that the GIG-EM typing method and its application to phylogenetic trees might be useful tools for the molecular characterization and determination of the genetic relationships among E. coli isolates. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  19. Tertiary structure prediction and identification of druggable pocket in the cancer biomarker – Osteopontin-c

    PubMed Central

    2014-01-01

    Background Osteopontin (Eta, secreted sialoprotein 1, opn) is secreted from different cell types including cancer cells. Three splice variant forms namely osteopontin-a, osteopontin-b and osteopontin-c have been identified. The main astonishing feature is that osteopontin-c is found to be elevated in almost all types of cancer cells. This was the vital point to consider it for sequence analysis and structure predictions which provide ample chances for prognostic, therapeutic and preventive cancer research. Methods Osteopontin-c gene sequence was determined from Breast Cancer sample and was translated to protein sequence. It was then analyzed using various software and web tools for binding pockets, docking and druggability analysis. Due to the lack of homological templates, tertiary structure was predicted using ab-initio method server – I-TASSER and was evaluated after refinement using web tools. Refined structure was compared with known bone sialoprotein electron microscopic structure and docked with CD44 for binding analysis and binding pockets were identified for drug designing. Results Signal sequence of about sixteen amino acid residues was identified using signal sequence prediction servers. Due to the absence of known structures of similar proteins, three dimensional structure of osteopontin-c was predicted using I-TASSER server. The predicted structure was refined with the help of SUMMA server and was validated using SAVES server. Molecular dynamic analysis was carried out using GROMACS software. The final model was built and was used for docking with CD44. Druggable pockets were identified using pocket energies. Conclusions The tertiary structure of osteopontin-c was predicted successfully using the ab-initio method and the predictions showed that osteopontin-c is of fibrous nature comparable to firbronectin. Docking studies showed the significant similarities of QSAET motif in the interaction of CD44 and osteopontins between the normal and splice variant forms of osteopontins and binding pockets analyses revealed several pockets which paved the way to the identification of a druggable pocket. PMID:24401206

  20. Mining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS) Data in Plants.

    PubMed

    Taheri, Sima; Lee Abdullah, Thohirah; Yusop, Mohd Rafii; Hanafi, Mohamed Musa; Sahebi, Mahbod; Azizi, Parisa; Shamshiri, Redmond Ramin

    2018-02-13

    Microsatellites, or simple sequence repeats (SSRs), are one of the most informative and multi-purpose genetic markers exploited in plant functional genomics. However, the discovery of SSRs and development using traditional methods are laborious, time-consuming, and costly. Recently, the availability of high-throughput sequencing technologies has enabled researchers to identify a substantial number of microsatellites at less cost and effort than traditional approaches. Illumina is a noteworthy transcriptome sequencing technology that is currently used in SSR marker development. Although 454 pyrosequencing datasets can be used for SSR development, this type of sequencing is no longer supported. This review aims to present an overview of the next generation sequencing, with a focus on the efficient use of de novo transcriptome sequencing (RNA-Seq) and related tools for mining and development of microsatellites in plants.

  1. The complete genome sequence of the acarbose producer Actinoplanes sp. SE50/110

    PubMed Central

    2012-01-01

    Background Actinoplanes sp. SE50/110 is known as the wild type producer of the alpha-glucosidase inhibitor acarbose, a potent drug used worldwide in the treatment of type-2 diabetes mellitus. As the incidence of diabetes is rapidly rising worldwide, an ever increasing demand for diabetes drugs, such as acarbose, needs to be anticipated. Consequently, derived Actinoplanes strains with increased acarbose yields are being used in large scale industrial batch fermentation since 1990 and were continuously optimized by conventional mutagenesis and screening experiments. This strategy reached its limits and is generally superseded by modern genetic engineering approaches. As a prerequisite for targeted genetic modifications, the complete genome sequence of the organism has to be known. Results Here, we present the complete genome sequence of Actinoplanes sp. SE50/110 [GenBank:CP003170], the first publicly available genome of the genus Actinoplanes, comprising various producers of pharmaceutically and economically important secondary metabolites. The genome features a high mean G + C content of 71.32% and consists of one circular chromosome with a size of 9,239,851 bp hosting 8,270 predicted protein coding sequences. Phylogenetic analysis of the core genome revealed a rather distant relation to other sequenced species of the family Micromonosporaceae whereas Actinoplanes utahensis was found to be the closest species based on 16S rRNA gene sequence comparison. Besides the already published acarbose biosynthetic gene cluster sequence, several new non-ribosomal peptide synthetase-, polyketide synthase- and hybrid-clusters were identified on the Actinoplanes genome. Another key feature of the genome represents the discovery of a functional actinomycete integrative and conjugative element. Conclusions The complete genome sequence of Actinoplanes sp. SE50/110 marks an important step towards the rational genetic optimization of the acarbose production. In this regard, the identified actinomycete integrative and conjugative element could play a central role by providing the basis for the development of a genetic transformation system for Actinoplanes sp. SE50/110 and other Actinoplanes spp. Furthermore, the identified non-ribosomal peptide synthetase- and polyketide synthase-clusters potentially encode new antibiotics and/or other bioactive compounds, which might be of pharmacologic interest. PMID:22443545

  2. The complete genome sequence of the acarbose producer Actinoplanes sp. SE50/110.

    PubMed

    Schwientek, Patrick; Szczepanowski, Rafael; Rückert, Christian; Kalinowski, Jörn; Klein, Andreas; Selber, Klaus; Wehmeier, Udo F; Stoye, Jens; Pühler, Alfred

    2012-03-23

    Actinoplanes sp. SE50/110 is known as the wild type producer of the alpha-glucosidase inhibitor acarbose, a potent drug used worldwide in the treatment of type-2 diabetes mellitus. As the incidence of diabetes is rapidly rising worldwide, an ever increasing demand for diabetes drugs, such as acarbose, needs to be anticipated. Consequently, derived Actinoplanes strains with increased acarbose yields are being used in large scale industrial batch fermentation since 1990 and were continuously optimized by conventional mutagenesis and screening experiments. This strategy reached its limits and is generally superseded by modern genetic engineering approaches. As a prerequisite for targeted genetic modifications, the complete genome sequence of the organism has to be known. Here, we present the complete genome sequence of Actinoplanes sp. SE50/110 [GenBank:CP003170], the first publicly available genome of the genus Actinoplanes, comprising various producers of pharmaceutically and economically important secondary metabolites. The genome features a high mean G + C content of 71.32% and consists of one circular chromosome with a size of 9,239,851 bp hosting 8,270 predicted protein coding sequences. Phylogenetic analysis of the core genome revealed a rather distant relation to other sequenced species of the family Micromonosporaceae whereas Actinoplanes utahensis was found to be the closest species based on 16S rRNA gene sequence comparison. Besides the already published acarbose biosynthetic gene cluster sequence, several new non-ribosomal peptide synthetase-, polyketide synthase- and hybrid-clusters were identified on the Actinoplanes genome. Another key feature of the genome represents the discovery of a functional actinomycete integrative and conjugative element. The complete genome sequence of Actinoplanes sp. SE50/110 marks an important step towards the rational genetic optimization of the acarbose production. In this regard, the identified actinomycete integrative and conjugative element could play a central role by providing the basis for the development of a genetic transformation system for Actinoplanes sp. SE50/110 and other Actinoplanes spp. Furthermore, the identified non-ribosomal peptide synthetase- and polyketide synthase-clusters potentially encode new antibiotics and/or other bioactive compounds, which might be of pharmacologic interest.

  3. Bypassing bacterial infection in phage display by sequencing DNA released from phage particles.

    PubMed

    Villequey, Camille; Kong, Xu-Dong; Heinis, Christian

    2017-11-01

    Phage display relies on a bacterial infection step in which the phage particles are replicated to perform multiple affinity selection rounds and to enable the identification of isolated clones by DNA sequencing. While this process is efficient for wild-type phage, the bacterial infection rate of phage with mutant or chemically modified coat proteins can be low. For example, a phage mutant with a disulfide-free p3 coat protein, used for the selection of bicyclic peptides, has a more than 100-fold reduced infection rate compared to the wild-type. A potential strategy for bypassing the bacterial infection step is to directly sequence DNA extracted from phage particles after a single round of phage panning using high-throughput sequencing. In this work, we have quantified the fraction of phage clones that can be identified by directly sequencing DNA from phage particles. The results show that the DNA of essentially all of the phage particles can be 'decoded', and that the sequence coverage for mutants equals that of amplified DNA extracted from cells infected with wild-type phage. This procedure is particularly attractive for selections with phage that have a compromised infection capacity, and it may allow phage display to be performed with particles that are not infective at all. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  4. Kernel based machine learning algorithm for the efficient prediction of type III polyketide synthase family of proteins.

    PubMed

    Mallika, V; Sivakumar, K C; Jaichand, S; Soniya, E V

    2010-07-13

    Type III Polyketide synthases (PKS) are family of proteins considered to have significant roles in the biosynthesis of various polyketides in plants, fungi and bacteria. As these proteins shows positive effects to human health, more researches are going on regarding this particular protein. Developing a tool to identify the probability of sequence being a type III polyketide synthase will minimize the time consumption and manpower efforts. In this approach, we have designed and implemented PKSIIIpred, a high performance prediction server for type III PKS where the classifier is Support Vector Machines (SVMs). Based on the limited training dataset, the tool efficiently predicts the type III PKS superfamily of proteins with high sensitivity and specificity. The PKSIIIpred is available at http://type3pks.in/prediction/. We expect that this tool may serve as a useful resource for type III PKS researchers. Currently work is being progressed for further betterment of prediction accuracy by including more sequence features in the training dataset.

  5. Typing of the rabies virus in Chile, 2002-2008.

    PubMed

    Yung, V; Favi, M; Fernandez, J

    2012-12-01

    In Chile, dog rabies has been controlled and insectivorous bats have been identified as the main rabies reservoir. This study aimed to determine the rabies virus (RABV) variants circulating in the country between 2002 and 2008. A total of 612 RABV isolates were tested using a panel with eight monoclonal antibodies against the viral nucleoprotein (N-mAbs) for antigenic typing, and a product of 320-bp of the nucleoprotein gene was sequenced from 99 isolates. Typing of the isolates revealed six different antigenic variants but phylogenetic analysis identified four clusters associated with four different bat species. Tadarida brasiliensis bats were confirmed as the main reservoir. This methodology identified several independent rabies enzootics maintained by different species of insectivorous bats in Chile.

  6. Identification of a type-D feruloyl esterase from Neurospora crassa.

    PubMed

    Crepin, V F; Faulds, C B; Connerton, I F

    2004-02-01

    Feruloyl esterases constitute an interesting group of enzymes that have the potential for use over a broad range of applications in the agri-food industries. In order to expand the range of available enzymes, we have examined the presence of feruoyl esterase genes present in the genome sequence of the filamentous fungus Neurospora crassa. We have identified an orphan gene (contig 3.544), the translation of which shows sequence identity with known feruloyl esterases. This gene was cloned and the corresponding recombinant protein expressed in Pichia pastoris to confirm that the enzyme (NcFaeD-3.544) exhibits feruloyl esterase activity. Unusually the enzyme was capable of p-coumaric acid release from untreated crude plant cell wall materials. The substrate utilisation preferences of the recombinant enzyme place it in the recently recognised type-D sub-class of feruloyl esterase.

  7. A user's guide to quantitative and comparative analysis of metagenomic datasets.

    PubMed

    Luo, Chengwei; Rodriguez-R, Luis M; Konstantinidis, Konstantinos T

    2013-01-01

    Metagenomics has revolutionized microbiological studies during the past decade and provided new insights into the diversity, dynamics, and metabolic potential of natural microbial communities. However, metagenomics still represents a field in development, and standardized tools and approaches to handle and compare metagenomes have not been established yet. An important reason accounting for the latter is the continuous changes in the type of sequencing data available, for example, long versus short sequencing reads. Here, we provide a guide to bioinformatic pipelines developed to accomplish the following tasks, focusing primarily on those developed by our team: (i) assemble a metagenomic dataset; (ii) determine the level of sequence coverage obtained and the amount of sequencing required to obtain complete coverage; (iii) identify the taxonomic affiliation of a metagenomic read or assembled contig; and (iv) determine differentially abundant genes, pathways, and species between different datasets. Most of these pipelines do not depend on the type of sequences available or can be easily adjusted to fit different types of sequences, and are freely available (for instance, through our lab Web site: http://www.enve-omics.gatech.edu/). The limitations of current approaches, as well as the computational aspects that can be further improved, will also be briefly discussed. The work presented here provides practical guidelines on how to perform metagenomic analysis of microbial communities characterized by varied levels of diversity and establishes approaches to handle the resulting data, independent of the sequencing platform employed. © 2013 Elsevier Inc. All rights reserved.

  8. Multilocus sequence typing of Pseudomonas syringae sensu lato confirms previously described genomospecies and permits rapid identification of P. syringae pv. coriandricola and P. syringae pv. apii causing bacterial leaf spot on parsley.

    PubMed

    Bull, Carolee T; Clarke, Christopher R; Cai, Rongman; Vinatzer, Boris A; Jardini, Teresa M; Koike, Steven T

    2011-07-01

    Since 2002, severe leaf spotting on parsley (Petroselinum crispum) has occurred in Monterey County, CA. Either of two different pathovars of Pseudomonas syringae sensu lato were isolated from diseased leaves from eight distinct outbreaks and once from the same outbreak. Fragment analysis of DNA amplified between repetitive sequence polymerase chain reaction; 16S rDNA sequence analysis; and biochemical, physiological, and host range tests identified the pathogens as Pseudomonas syringae pv. apii and P. syringae pv. coriandricola. Koch's postulates were completed for the isolates from parsley, and host range tests with parsley isolates and pathotype strains demonstrated that P. syringae pv. apii and P. syringae pv. coriandricola cause leaf spot diseases on parsley, celery, and coriander or cilantro. In a multilocus sequence typing (MLST) approach, four housekeeping gene fragments were sequenced from 10 strains isolated from parsley and 56 pathotype strains of P. syringae. Allele sequences were uploaded to the Plant-Associated Microbes Database and a phylogenetic tree was built based on concatenated sequences. Tree topology directly corresponded to P. syringae genomospecies and P. syringae pv. apii was allocated appropriately to genomospecies 3. This is the first demonstration that MLST can accurately allocate new pathogens directly to P. syringae sensu lato genomospecies. According to MLST, P. syringae pv. coriandricola is a member of genomospecies 9, P. cannabina. In a blind test, both P. syringae pv. coriandricola and P. syringae pv. apii isolates from parsley were correctly identified to pathovar. In both cases, MLST described diversity within each pathovar that was previously unknown.

  9. Vertical transmission of highly similar blaCTX-M-1-harboring IncI1 plasmids in Escherichia coli with different MLST types in the poultry production pyramid

    PubMed Central

    Zurfluh, Katrin; Wang, Juan; Klumpp, Jochen; Nüesch-Inderbinen, Magdalena; Fanning, Séamus; Stephan, Roger

    2014-01-01

    Objectives: The purpose of this study was to characterize sets of extended-spectrum β-lactamases (ESBL)-producing Enterobacteriaceae collected longitudinally from different flocks of broiler breeders, meconium of 1-day-old broilers from theses breeder flocks, as well as from these broiler flocks before slaughter. Methods: Five sets of ESBL-producing Escherichia coli were studied by multi-locus sequence typing (MLST), phylogenetic grouping, PCR-based replicon typing and resistance profiling. The blaCTX-M-1-harboring plasmids of one set (pHV295.1, pHV114.1, and pHV292.1) were fully sequenced and subjected to comparative analysis. Results: Eleven different MLST sequence types (ST) were identified with ST1056 the predominant one, isolated in all five sets either on the broiler breeder or meconium level. Plasmid sequencing revealed that blaCTX-M-1 was carried by highly similar IncI1/ST3 plasmids that were 105 076 bp, 110 997 bp, and 117 269 bp in size, respectively. Conclusions: The fact that genetically similar IncI1/ST3 plasmids were found in ESBL-producing E. coli of different MLST types isolated at the different levels in the broiler production pyramid provides strong evidence for a vertical transmission of these plasmids from a common source (nucleus poultry flocks). PMID:25324838

  10. Vertical transmission of highly similar bla CTX-M-1-harboring IncI1 plasmids in Escherichia coli with different MLST types in the poultry production pyramid.

    PubMed

    Zurfluh, Katrin; Wang, Juan; Klumpp, Jochen; Nüesch-Inderbinen, Magdalena; Fanning, Séamus; Stephan, Roger

    2014-01-01

    The purpose of this study was to characterize sets of extended-spectrum β-lactamases (ESBL)-producing Enterobacteriaceae collected longitudinally from different flocks of broiler breeders, meconium of 1-day-old broilers from theses breeder flocks, as well as from these broiler flocks before slaughter. Five sets of ESBL-producing Escherichia coli were studied by multi-locus sequence typing (MLST), phylogenetic grouping, PCR-based replicon typing and resistance profiling. The bla CTX-M-1-harboring plasmids of one set (pHV295.1, pHV114.1, and pHV292.1) were fully sequenced and subjected to comparative analysis. Eleven different MLST sequence types (ST) were identified with ST1056 the predominant one, isolated in all five sets either on the broiler breeder or meconium level. Plasmid sequencing revealed that bla CTX-M-1 was carried by highly similar IncI1/ST3 plasmids that were 105 076 bp, 110 997 bp, and 117 269 bp in size, respectively. The fact that genetically similar IncI1/ST3 plasmids were found in ESBL-producing E. coli of different MLST types isolated at the different levels in the broiler production pyramid provides strong evidence for a vertical transmission of these plasmids from a common source (nucleus poultry flocks).

  11. Immune Selection In Vitro Reveals Human Immunodeficiency Virus Type 1 Nef Sequence Motifs Important for Its Immune Evasion Function In Vivo

    PubMed Central

    Lee, Patricia; Ng, Hwee L.; Yang, Otto O.

    2012-01-01

    Human immunodeficiency virus type 1 (HIV-1) Nef downregulates major histocompatibility complex class I (MHC-I), impairing the clearance of infected cells by CD8+ cytotoxic T lymphocytes (CTLs). While sequence motifs mediating this function have been determined by in vitro mutagenesis studies of laboratory-adapted HIV-1 molecular clones, it is unclear whether the highly variable Nef sequences of primary isolates in vivo rely on the same sequence motifs. To address this issue, nef quasispecies from nine chronically HIV-1-infected persons were examined for sequence evolution and altered MHC-I downregulatory function under Gag-specific CTL immune pressure in vitro. This selection resulted in decreased nef diversity and strong purifying selection. Site-by-site analysis identified 13 codons undergoing purifying selection and 1 undergoing positive selection. Of the former, only 6 have been reported to have roles in Nef function, including 4 associated with MHC-I downregulation. Functional testing of naturally occurring in vivo polymorphisms at the 7 sites with no previously known functional role revealed 3 mutations (A84D, Y135F, and G140R) that ablated MHC-I downregulation and 3 (N52A, S169I, and V180E) that partially impaired MHC-I downregulation. Globally, the CTL pressure in vitro selected functional Nef from the in vivo quasispecies mixtures that predominately lacked MHC-I downregulatory function at the baseline. Overall, these data demonstrate that CTL pressure exerts a strong purifying selective pressure for MHC-I downregulation and identifies novel functional motifs present in Nef sequences in vivo. PMID:22553319

  12. Leptospira interrogans serovars Bratislava and Muenchen animal infections: Implications for epidemiology and control.

    PubMed

    Arent, Z; Frizzell, C; Gilmore, C; Allen, A; Ellis, W A

    2016-07-15

    Strains of Leptospira interrogans belonging to two very closely related serovars - Bratislava and Muenchen - have been associated with disease in domestic animals, in particular pigs, but also in horses and dogs. Similar strains have also been recovered from various wildlife species. Their epidemiology is poorly understood. Two hundred and forty seven such isolates, from UK domestic animal and wildlife species, were examined by restriction endonuclease analysis in an attempt to elucidate their epidemiology. A representative sub-sample of 65 of these isolates was further examined by multiple-locus variable-number tandem repeat analysis and 22 by secY sequencing. Ten restriction pattern types were identified. The majority of isolates fell into one of three restriction endonuclease analysis pattern types designated B2a, B2b and M2a. B2a was ubiquitous and was isolated from 10 species and represented the majority of the horse and all dog isolates. B2b was very different, being isolated only from pigs, indicating that this type was maintained by pigs. The pattern M2a was reported for the majority of isolates from pigs but also was common in small rodents isolates. Five restriction pattern types were found only in wildlife suggesting that they are unlikely to pose a disease threat to domestic animals. Multiple-locus variable-number tandem repeat analysis identified six clusters. The REA types B2a and B2b were all found in one MLVA cluster while the majority of the M2a strains examined occurred in another cluster. The secY sequencing detected only one sequence type, clustered with other serovars of Leptospira interrogans. Copyright © 2016 Elsevier B.V. All rights reserved.

  13. Species identification and molecular typing of human Brucella isolates from Kuwait.

    PubMed

    Mustafa, Abu S; Habibi, Nazima; Osman, Amr; Shaheed, Faraz; Khan, Mohd W

    2017-01-01

    Brucellosis is a zoonotic disease of major concern in Kuwait and the Middle East. Human brucellosis can be caused by several Brucella species with varying degree of pathogenesis, and relapses are common after apparently successful therapy. The classical biochemical methods for identification of Brucella are time-consuming, cumbersome, and provide information limited to the species level only. In contrast, molecular methods are rapid and provide differentiation at intra-species level. In this study, four molecular methods [16S rRNA gene sequencing, real-time PCR, enterobacterial repetitive intergenic consensus (ERIC)-PCR and multilocus variable-number tandem-repeat analysis (MLVA)-8, MLVA-11 and MLVA-16 were evaluated for the identification and typing of 75 strains of Brucella isolated in Kuwait. 16S rRNA gene sequencing of all isolates showed 90-99% sequence identity with B. melitensis and real-time PCR with genus- and species- specific primers identified all isolates as B. melitensis. The results of ERIC-PCR suggested the existence of 75 ERIC genotypes of B. melitensis with a discriminatory index of 0.997. Cluster classification of these genotypes divided them into two clusters, A and B, diverging at ~25%. The maximum number of genotypes (n = 51) were found in cluster B5. MLVA-8 analysis identified all isolates as B. melitensis, and MLVA-8, MLVA-11 and MLVA-16 typing divided the isolates into 10, 32 and 71 MLVA types, respectively. Furthermore, the combined minimum spanning tree analysis demonstrated that, compared to MLVA types discovered all over the world, the Kuwaiti isolates were a distinct group of MLVA-11 and MLVA-16 types in the East Mediterranean Region.

  14. Species identification and molecular typing of human Brucella isolates from Kuwait

    PubMed Central

    Osman, Amr; Shaheed, Faraz; Khan, Mohd W.

    2017-01-01

    Brucellosis is a zoonotic disease of major concern in Kuwait and the Middle East. Human brucellosis can be caused by several Brucella species with varying degree of pathogenesis, and relapses are common after apparently successful therapy. The classical biochemical methods for identification of Brucella are time-consuming, cumbersome, and provide information limited to the species level only. In contrast, molecular methods are rapid and provide differentiation at intra-species level. In this study, four molecular methods [16S rRNA gene sequencing, real-time PCR, enterobacterial repetitive intergenic consensus (ERIC)-PCR and multilocus variable-number tandem-repeat analysis (MLVA)-8, MLVA-11 and MLVA-16 were evaluated for the identification and typing of 75 strains of Brucella isolated in Kuwait. 16S rRNA gene sequencing of all isolates showed 90–99% sequence identity with B. melitensis and real-time PCR with genus- and species- specific primers identified all isolates as B. melitensis. The results of ERIC-PCR suggested the existence of 75 ERIC genotypes of B. melitensis with a discriminatory index of 0.997. Cluster classification of these genotypes divided them into two clusters, A and B, diverging at ~25%. The maximum number of genotypes (n = 51) were found in cluster B5. MLVA-8 analysis identified all isolates as B. melitensis, and MLVA-8, MLVA-11 and MLVA-16 typing divided the isolates into 10, 32 and 71 MLVA types, respectively. Furthermore, the combined minimum spanning tree analysis demonstrated that, compared to MLVA types discovered all over the world, the Kuwaiti isolates were a distinct group of MLVA-11 and MLVA-16 types in the East Mediterranean Region. PMID:28800594

  15. Maturity onset diabetes of youth (MODY) in Turkish children: sequence analysis of 11 causative genes by next generation sequencing.

    PubMed

    Ağladıoğlu, Sebahat Yılmaz; Aycan, Zehra; Çetinkaya, Semra; Baş, Veysel Nijat; Önder, Aşan; Peltek Kendirci, Havva Nur; Doğan, Haldun; Ceylaner, Serdar

    2016-04-01

    Maturity-onset diabetes of the youth (MODY), is a genetically and clinically heterogeneous group of diseasesand is often misdiagnosed as type 1 or type 2 diabetes. The aim of this study is to investigate both novel and proven mutations of 11 MODY genes in Turkish children by using targeted next generation sequencing. A panel of 11 MODY genes were screened in 43 children with MODY diagnosed by clinical criterias. Studies of index cases was done with MISEQ-ILLUMINA, and family screenings and confirmation studies of mutations was done by Sanger sequencing. We identified 28 (65%) point mutations among 43 patients. Eighteen patients have GCK mutations, four have HNF1A, one has HNF4A, one has HNF1B, two have NEUROD1, one has PDX1 gene variations and one patient has both HNF1A and HNF4A heterozygote mutations. This is the first study including molecular studies of 11 MODY genes in Turkish children. GCK is the most frequent type of MODY in our study population. Very high frequency of novel mutations (42%) in our study population, supports that in heterogenous disorders like MODY sequence analysis provides rapid, cost effective and accurate genetic diagnosis.

  16. Dynamics of actin evolution in dinoflagellates.

    PubMed

    Kim, Sunju; Bachvaroff, Tsvetan R; Handy, Sara M; Delwiche, Charles F

    2011-04-01

    Dinoflagellates have unique nuclei and intriguing genome characteristics with very high DNA content making complete genome sequencing difficult. In dinoflagellates, many genes are found in multicopy gene families, but the processes involved in the establishment and maintenance of these gene families are poorly understood. Understanding the dynamics of gene family evolution in dinoflagellates requires comparisons at different evolutionary scales. Studies of closely related species provide fine-scale information relative to species divergence, whereas comparisons of more distantly related species provides broad context. We selected the actin gene family as a highly expressed conserved gene previously studied in dinoflagellates. Of the 142 sequences determined in this study, 103 were from the two closely related species, Dinophysis acuminata and D. caudata, including full length and partial cDNA sequences as well as partial genomic amplicons. For these two Dinophysis species, at least three types of sequences could be identified. Most copies (79%) were relatively similar and in nucleotide trees, the sequences formed two bushy clades corresponding to the two species. In comparisons within species, only eight to ten nucleotide differences were found between these copies. The two remaining types formed clades containing sequences from both species. One type included the most similar sequences in between-species comparisons with as few as 12 nucleotide differences between species. The second type included the most divergent sequences in comparisons between and within species with up to 93 nucleotide differences between sequences. In all the sequences, most variation occurred in synonymous sites or the 5' UnTranslated Region (UTR), although there was still limited amino acid variation between most sequences. Several potential pseudogenes were found (approximately 10% of all sequences depending on species) with incomplete open reading frames due to frameshifts or early stop codons. Overall, variation in the actin gene family fits best with the "birth and death" model of evolution based on recent duplications, pseudogenes, and incomplete lineage sorting. Divergence between species was similar to variation within species, so that actin may be too conserved to be useful for phylogenetic estimation of closely related species.

  17. Gene encoding the group B streptococcal protein R4, its presence in clinical reference laboratory isolates & R4 protein pepsin sensitivity.

    PubMed

    Smith, B L; Flores, A; Dechaine, J; Krepela, J; Bergdall, A; Ferrieri, P

    2004-05-01

    R proteins were first identified by Lancefield in group B Streptococcus (GBS) as resistant to trypsin at pH8 and sensitive to pepsin at pH2. The R4 protein found predominantly in type III and some type II and V invasive isolates conforms to these criteria. The Rib protein, although structurally and epidemiologically similar to R4, was reported as resistant to both proteases. We report here the gene encoding the R4 protein from a type III group B streptococcal isolate (76-043) well characterized in our laboratory. Trypsin extracted GBS proteins were assayed for protease sensitivities by double-diffusion Ouchterlony using varying conditions for the enzyme pepsin. Standard haemoglobin assay was used to examine pepsin enzymatic activity. Thirty clinical isolates of varying protein profiles identified by double-diffusion from our reference strain laboratory were screened by PCR and Southern technique. SDS-PAGE gel purified R4 amino acid sequences were determined and used to design oligonucleotide primers for screening a 76-043 genomic library. R4 was sensitive to pepsin at pH2 but appeared resistant at pH4, the reported pH used for Rib. By standard haemoglobin assay and trypsin extract studies of R4 protein, pepsin was shown to be active at pH2, yet easily inactivated; assays of GBS surface proteins are critical at pH2. Of the amino acids initially sequenced from R4, 88 per cent (61/69) showed identity to Rib; the r4 nucleotide sequence was identical to that of rib. All isolates with strong positive protein reactions for R4 were positive in both PCR and Southern technique, whereas isolates expressing alpha, beta, R1/R4, and R5 (BPS) protein profiles were not. Sequenced PCR products aligned with identity to the R4 and Rib nucleotide sequences and confirmed the identity of these proteins and their molecular sequences.

  18. Characterization of replication and conjugation of plasmid pWTY27 from a widely distributed Streptomyces species

    PubMed Central

    2012-01-01

    Background Streptomyces species are widely distributed in natural habitats, such as soils, lakes, plants and some extreme environments. Replication loci of several Streptomyces theta-type plasmids have been reported, but are not characterized in details. Conjugation loci of some Streptomyces rolling-circle-type plasmids are identified and mechanism of conjugal transferring are described. Results We report the detection of a widely distributed Streptomyces strain Y27 and its indigenous plasmid pWTY27 from fourteen plants and four soil samples cross China by both culturing and nonculturing methods. The complete nucleotide sequence of pWTY27 consisted of 14,288 bp. A basic locus for plasmid replication comprised repAB genes and an adjacent iteron sequence, to a long inverted-repeat (ca. 105 bp) of which the RepA protein bound specifically in vitro, suggesting that RepA may recognize a second structure (e.g. a long stem-loop) of the iteron DNA. A plasmid containing the locus propagated in linear mode when the telomeres of a linear plasmid were attached, indicating a bi-directional replication mode for pWTY27. As for rolling-circle plasmids, a single traA gene and a clt sequence (covering 16 bp within traA and its adjacent 159 bp) on pWTY27 were required for plasmid transfer. TraA recognized and bound specifically to the two regions of the clt sequence, one containing all the four DC1 of 7 bp (TGACACC) and one DC2 (CCCGCCC) and most of IC1, and another covering two DC2 and part of IC1, suggesting formation of a high-ordered DNA-protein complex. Conclusions This work (i) isolates a widespread Streptomyces strain Y27 and sequences its indigenous theta-type plasmid pWTY27; (ii) identifies the replication and conjugation loci of pWTY27 and; (iii) characterizes the binding sequences of the RepA and TraA proteins. PMID:23134842

  19. Sequencing the extrachromosomal circular mobilome reveals retrotransposon activity in plants

    PubMed Central

    Llauro, Christel; Jobet, Edouard; Robakowska-Hyzorek, Dagmara; Lasserre, Eric; Ghesquière, Alain; Panaud, Olivier

    2017-01-01

    Retrotransposons are mobile genetic elements abundant in plant and animal genomes. While efficiently silenced by the epigenetic machinery, they can be reactivated upon stress or during development. Their level of transcription not reflecting their transposition ability, it is thus difficult to evaluate their contribution to the active mobilome. Here we applied a simple methodology based on the high throughput sequencing of extrachromosomal circular DNA (eccDNA) forms of active retrotransposons to characterize the repertoire of mobile retrotransposons in plants. This method successfully identified known active retrotransposons in both Arabidopsis and rice material where the epigenome is destabilized. When applying mobilome-seq to developmental stages in wild type rice, we identified PopRice as a highly active retrotransposon producing eccDNA forms in the wild type endosperm. The mobilome-seq strategy opens new routes for the characterization of a yet unexplored fraction of plant genomes. PMID:28212378

  20. Sequencing the extrachromosomal circular mobilome reveals retrotransposon activity in plants.

    PubMed

    Lanciano, Sophie; Carpentier, Marie-Christine; Llauro, Christel; Jobet, Edouard; Robakowska-Hyzorek, Dagmara; Lasserre, Eric; Ghesquière, Alain; Panaud, Olivier; Mirouze, Marie

    2017-02-01

    Retrotransposons are mobile genetic elements abundant in plant and animal genomes. While efficiently silenced by the epigenetic machinery, they can be reactivated upon stress or during development. Their level of transcription not reflecting their transposition ability, it is thus difficult to evaluate their contribution to the active mobilome. Here we applied a simple methodology based on the high throughput sequencing of extrachromosomal circular DNA (eccDNA) forms of active retrotransposons to characterize the repertoire of mobile retrotransposons in plants. This method successfully identified known active retrotransposons in both Arabidopsis and rice material where the epigenome is destabilized. When applying mobilome-seq to developmental stages in wild type rice, we identified PopRice as a highly active retrotransposon producing eccDNA forms in the wild type endosperm. The mobilome-seq strategy opens new routes for the characterization of a yet unexplored fraction of plant genomes.

  1. Clinical and molecular epidemiology of Escherichia coli sequence type 131 among hospitalized patients colonized intestinally with fluoroquinolone-resistant E. coli.

    PubMed

    Han, Jennifer H; Johnston, Brian; Nachamkin, Irving; Tolomeo, Pam; Bilker, Warren B; Mao, Xiangqun; Clabots, Connie; Lautenbach, Ebbing; Johnson, James R

    2014-11-01

    This study examined molecular and epidemiologic factors associated with Escherichia coli sequence type 131 (ST131) among hospitalized patients colonized intestinally with fluoroquinolone (FQ)-resistant E. coli between 2002 and 2004. Among 86 patients, 21 (24%) were colonized with ST131. The proportion of ST131 isolates among colonizing isolates increased significantly over time, from 8% in 2002 to 50% in 2004 (P = 0.003). Furthermore, all 19 clonally related isolates were ST131. Future studies should identify potential transmissibility differences between ST131 and non-ST131 strains. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  2. Deciphering the biodiversity of Listeria monocytogenes lineage III strains by polyphasic approaches.

    PubMed

    Zhao, Hanxin; Chen, Jianshun; Fang, Chun; Xia, Ye; Cheng, Changyong; Jiang, Lingli; Fang, Weihuan

    2011-10-01

    Listeria monocytogenes is a foodborne pathogen of humans and animals. The majority of human listeriosis cases are caused by strains of lineages I and II, while lineage III strains are rare and seldom implicated in human listeriosis. We revealed by 16S rRNA sequencing the special evolutionary status of L. monocytogenes lineage III, which falls between lineages I and II strains of L. monocytogenes and the non-pathogenic species L. innocua and L. marthii in the dendrogram. Thirteen lineage III strains were then characterized by polyphasic approaches. Biochemical reactions demonstrated 8 biotypes, internalin profiling identified 10 internal-in types clustered in 4 groups, and multilocus sequence typing differentiated 12 sequence types. These typing schemes show that lineage III strains represent the most diverse population of L. monocytogenes, and comprise at least four subpopulations IIIA-1, IIIA-2, HIB, and IIIC. The in vitro and in vivo virulence assessments showed that two lineage IIIA-2 strains had reduced pathogenicity, while the other lineage III strains had comparable virulence to lineages I and II. The HIB strains are phylogenetically distinct from other sub-populations, providing additional evidence that this sublineage represents a novel lineage. The two biochemical reactions L-rhamnose and L-lactate alkalinization, and 10 internalins were identified as potential markers for lineage III subpopulations. This study provides new insights into the biodiversity and population structure of lineage III strains, which are important for understanding the evolution of the L. mono-cytogenes-L. innocua clade.

  3. Syndrome of Hepatic Cirrhosis, Dystonia, Polycythemia, and Hypermanganesemia Caused by Mutations in SLC30A10, a Manganese Transporter in Man

    PubMed Central

    Tuschl, Karin; Clayton, Peter T.; Gospe, Sidney M.; Gulab, Shamshad; Ibrahim, Shahnaz; Singhi, Pratibha; Aulakh, Roosy; Ribeiro, Reinaldo T.; Barsottini, Orlando G.; Zaki, Maha S.; Del Rosario, Maria Luz; Dyack, Sarah; Price, Victoria; Rideout, Andrea; Gordon, Kevin; Wevers, Ron A.; “Kling” Chong, W.K.; Mills, Philippa B.

    2012-01-01

    Environmental manganese (Mn) toxicity causes an extrapyramidal, parkinsonian-type movement disorder with characteristic magnetic resonance images of Mn accumulation in the basal ganglia. We have recently reported a suspected autosomal recessively inherited syndrome of hepatic cirrhosis, dystonia, polycythemia, and hypermanganesemia in cases without environmental Mn exposure. Whole-genome mapping of two consanguineous families identified SLC30A10 as the affected gene in this inherited type of hypermanganesemia. This gene was subsequently sequenced in eight families, and homozygous sequence changes were identified in all affected individuals. The function of the wild-type protein and the effect of sequence changes were studied in the manganese-sensitive yeast strain Δpmr1. Expressing human wild-type SLC30A10 in the Δpmr1 yeast strain rescued growth in high Mn conditions, confirming its role in Mn transport. The presence of missense (c.266T>C [p.Leu89Pro]) and nonsense (c.585del [p.Thr196Profs∗17]) mutations in SLC30A10 failed to restore Mn resistance. Previously, SLC30A10 had been presumed to be a zinc transporter. However, this work has confirmed that SLC30A10 functions as a Mn transporter in humans that, when defective, causes Mn accumulation in liver and brain. This is an important step toward understanding Mn transport and its role in neurodegenerative processes. PMID:22341972

  4. Type-Specific Detection of 30 Oncogenic Human Papillomaviruses by Genotyping both E6 and L1 Genes

    PubMed Central

    Peng, Junping; Gao, Lei; Guo, Junhua; Wang, Ting; Wang, Ling; Yao, Qing; Zhu, Haijun

    2013-01-01

    Human papillomavirus (HPV) is the principal cause of invasive cervical cancer and benign genital lesions. There are currently 30 HPV types linked to cervical cancer. HPV infection also leads to other types of cancer. We developed a 61-plex analysis of these 30 HPV types by examining two genes, E6 and L1, using MassARRAY matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) (PCR-MS). Two hundred samples from homosexual males (HM) were screened by PCR-MS and MY09/MY11 primer set-mediated PCR (MY-PCR) followed by sequencing. One hundred thirty-five formalin-fixed, paraffin-embedded (FFPE) cervical cancer samples were also analyzed by PCR-MS, and results were compared to those of the commercially available GenoArray (GA) assay. One or more HPV types were identified in 64.5% (129/200) of the samples from HM. Comprising all 30 HPV types, PCR-MS detected 51.9% (67/129) of samples with multiple HPV types, whereas MY-PCR detected only one single HPV type in these samples. All PCR-MS results were confirmed by MY-PCR. In the cervical cancer samples, PCR-MS and GA detected 97% (131/135) and 90.4% (122/135) of HPV-positive samples, respectively. PCR-MS and GA results were fully concordant for 122 positive and 4 negative samples. The sequencing results for the 9 samples that tested negative by GA were completely concordant with the positive PCR-MS results. Multiple HPV types were identified in 25.2% (34/135) and 55.6% (75/135) of the cervical cancer samples by GA and PCR-MS, respectively, and results were confirmed by sequencing. The new assay allows the genotyping of >1,000 samples per day. It provides a good alternative to current methods, especially for large-scale investigations of multiple HPV infections and degraded FFPE samples. PMID:23152557

  5. Detection of MEF-1 laboratory reference strain of poliovirus type 2 in children with poliomyelitis in India in 2002 & 2003.

    PubMed

    Deshpande, J M; Nadkarni, S S; Siddiqui, Z A

    2003-12-01

    Significant progress has been made towards eradication of poliomyelitis in India. Surveillance for acute flaccid paralysis (AFP) has reached high standards. Among the 3 types of polioviruses, type 2 had been eliminated in India and eradicated globally as of October 1999. However, we isolated wild poliovirus type 2 from a small number of polio cases in northern India in 2000 and again during December 2002 to February 2003. Using molecular tools the origin, of the wild type 2 poliovirus was investigated. Polioviruses isolated from stool samples collected from patients with AFP were differentiated as wild virus or Sabin vaccine-like by ELISA and probe hybridization assays. Complete VP1 gene nucleotide sequences of the wild type 2 poliovirus isolates were determined by reverse transcriptase polymerase chain reaction (RT-PCR), followed by cycle sequencing. VP1 nucleotide sequences were compared with those of wild type 2 polioviruses that were indigenous in India in the past as well as prototype/laboratory strains and the GenBank database. Wild poliovirus type 2 was detected in stool samples from 6 patients with AFP in western Uttar Pradesh and 1 in Gujarat. In addition, the virus was isolated from one healthy contact child and from environmental sewage sample in Moradabad where three of these patients were reported. These isolates were identified as genetically closely related to laboratory reference strain MEF-1. Molecular characterization of the isolates confirmed that there was no evidence of extensive person-to-person transmission of the virus in the community. Laboratory reference strain (MEF-1) of poliovirus type 2 caused paralytic poliomyelitis in 10 patients in September 2000 and November 2002 to February 2003. The origin of the virus was some laboratory as yet not identified. This episode highlights the urgent need for stringent containment of wild poliovirus containing materials in the laboratories across the country in order to prevent recurrence of such incidents.

  6. A typing scheme for the honeybee pathogen Melissococcus plutonius allows detection of disease transmission events and a study of the distribution of variants.

    PubMed

    Haynes, Edward; Helgason, Thorunn; Young, J Peter W; Thwaites, Richard; Budge, Giles E

    2013-08-01

    Melissococcus plutonius is the bacterial pathogen that causes European Foulbrood of honeybees, a globally important honeybee brood disease. We have used next-generation sequencing to identify highly polymorphic regions in an otherwise genetically homogenous organism, and used these loci to create a modified MLST scheme. This synthesis of a proven typing scheme format with next-generation sequencing combines reliability and low costs with insights only available from high-throughput sequencing technologies. Using this scheme we show that the global distribution of M.plutonius variants is not uniform. We use the scheme in epidemiological studies to trace movements of infective material around England, insights that would have been impossible to confirm without the typing scheme. We also demonstrate the persistence of local variants over time. © 2013 Crown copyright. Reproduced with the permission of the Controller of Her Majesty's Stationary Office/Queen’s Printer for Scotland and Food and Environment Research Agency.

  7. Theory of winds in late-type evolved and pre-main-sequence stars

    NASA Technical Reports Server (NTRS)

    Macgregor, K. B.

    1983-01-01

    Recent observational results confirm that many of the physical processes which are known to occur in the Sun also occur among late-type stars in general. One such process is the continuous loss of mass from a star in the form of a wind. There now exists an abundance of either direct or circumstantial evidence which suggests that most (if not all) stars in the cool portion of the HR diagram possess winds. An attempt is made to assess the current state of theoretical understanding of mass loss from two distinctly different classes of late-type stars: the post-main-sequence giant/supergiant stars and the pre-main-sequence T Tauri stars. Toward this end, the observationally inferred properties of the wind associated with each of the two stellar classes under consideration are summarized and compared against the predictions of existing theoretical models. Although considerable progress has been made in attempting to identify the mechanisms responsible for mass loss from cool stars, many fundamental problems remain to be solved.

  8. Copious amounts of hot and cold dust orbiting the main sequence a-type stars HD 131488 and HD 121191

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Melis, Carl; Zuckerman, B.; Rhee, Joseph H.

    2013-11-20

    We report two new dramatically dusty main sequence stars: HD 131488 (A1 V) and HD 121191 (A8 V). HD 131488 is found to have substantial amounts of dust in its terrestrial planet zone (L {sub IR}/L {sub bol} ≈ 4 × 10{sup –3}), cooler dust farther out in its planetary system, and an unusual mid-infrared spectral feature. HD 121191 shows terrestrial planet zone dust (L {sub IR}/L {sub bol} ≈ 2.3 × 10{sup –3}), hints of cooler dust, and shares the unusual mid-infrared spectral shape identified in HD 131488. These two stars belong to sub-groups of the Scorpius-Centaurus OB associationmore » and have ages of ∼10 Myr. HD 131488 and HD 121191 are the dustiest main sequence A-type stars currently known. Early-type stars that host substantial inner planetary system dust are thus far found only within the age range of 5-20 Myr.« less

  9. Genome sequence of Frateuria aurantia type strain (Kondo 67(T)), a xanthomonade isolated from Lilium auratium Lindl.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Anderson, Iain; Teshima, Hazuki; Nolan, Matt

    2013-01-01

    rateuria aurantia (ex Kondo and Ameyama 1958) Swings et al. 1980 is a member of the bispecific genus Frateuria in the family Xanthomonadaceae, which is already heavily targeted for non-type strain genome sequencing. Strain Kondo 67(T) was initially (1958) identified as a member of 'Acetobacter aurantius', a name that was not considered for the approved list. Kondo 67(T) was therefore later designated as the type strain of the newly proposed acetogenic species Frateuria aurantia. The strain is of interest because of its triterpenoids (hopane family). F. aurantia Kondo 67(T) is the first member of the genus Frateura whose genome sequencemore » has been deciphered, and here we describe the features of this organism, together with the complete genome sequence and annotation. The 3,603,458-bp long chromosome with its 3,200 protein-coding and 88 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.« less

  10. Investigating the long-term course of schizophrenia by sequence analysis.

    PubMed

    An der Heiden, Wolfram; Häfner, Heinz

    2015-08-30

    In the present study we set out to explore the long-term clinical course of schizophrenia in a holistic manner by adopting sequence analysis. Our aim was to identify course types of illness by means of cluster analysis. The study was based on course and outcome data for 107 patients followed up over 134 months after first admission in the ABC Schizophrenia Study. Focusing on the main syndromes (positive, negative, depressive and unspecific symptoms) and their combinations we looked for similarities in individual illness courses using the 'optimal matching' method. A cluster analysis performed on the resulting similarity matrix yielded two main groups (a 'improving' and a 'chronic' group), which comprised a total of six different types of illness course. The course types differed in both quantitative (frequency of syndromes and syndrome combinations) and qualitative terms (clinical presentation, sequence of syndromes). Cluster membership was only rarely, but clearly associated with sociodemographic characteristics, treatment data and other illness variables. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  11. High-Accuracy HLA Type Inference from Whole-Genome Sequencing Data Using Population Reference Graphs.

    PubMed

    Dilthey, Alexander T; Gourraud, Pierre-Antoine; Mentzer, Alexander J; Cereb, Nezih; Iqbal, Zamin; McVean, Gil

    2016-10-01

    Genetic variation at the Human Leucocyte Antigen (HLA) genes is associated with many autoimmune and infectious disease phenotypes, is an important element of the immunological distinction between self and non-self, and shapes immune epitope repertoires. Determining the allelic state of the HLA genes (HLA typing) as a by-product of standard whole-genome sequencing data would therefore be highly desirable and enable the immunogenetic characterization of samples in currently ongoing population sequencing projects. Extensive hyperpolymorphism and sequence similarity between the HLA genes, however, pose problems for accurate read mapping and make HLA type inference from whole-genome sequencing data a challenging problem. We describe how to address these challenges in a Population Reference Graph (PRG) framework. First, we construct a PRG for 46 (mostly HLA) genes and pseudogenes, their genomic context and their characterized sequence variants, integrating a database of over 10,000 known allele sequences. Second, we present a sequence-to-PRG paired-end read mapping algorithm that enables accurate read mapping for the HLA genes. Third, we infer the most likely pair of underlying alleles at G group resolution from the IMGT/HLA database at each locus, employing a simple likelihood framework. We show that HLA*PRG, our algorithm, outperforms existing methods by a wide margin. We evaluate HLA*PRG on six classical class I and class II HLA genes (HLA-A, -B, -C, -DQA1, -DQB1, -DRB1) and on a set of 14 samples (3 samples with 2 x 100bp, 11 samples with 2 x 250bp Illumina HiSeq data). Of 158 alleles tested, we correctly infer 157 alleles (99.4%). We also identify and re-type two erroneous alleles in the original validation data. We conclude that HLA*PRG for the first time achieves accuracies comparable to gold-standard reference methods from standard whole-genome sequencing data, though high computational demands (currently ~30-250 CPU hours per sample) remain a significant challenge to practical application.

  12. High-Accuracy HLA Type Inference from Whole-Genome Sequencing Data Using Population Reference Graphs

    PubMed Central

    Dilthey, Alexander T.; Gourraud, Pierre-Antoine; McVean, Gil

    2016-01-01

    Genetic variation at the Human Leucocyte Antigen (HLA) genes is associated with many autoimmune and infectious disease phenotypes, is an important element of the immunological distinction between self and non-self, and shapes immune epitope repertoires. Determining the allelic state of the HLA genes (HLA typing) as a by-product of standard whole-genome sequencing data would therefore be highly desirable and enable the immunogenetic characterization of samples in currently ongoing population sequencing projects. Extensive hyperpolymorphism and sequence similarity between the HLA genes, however, pose problems for accurate read mapping and make HLA type inference from whole-genome sequencing data a challenging problem. We describe how to address these challenges in a Population Reference Graph (PRG) framework. First, we construct a PRG for 46 (mostly HLA) genes and pseudogenes, their genomic context and their characterized sequence variants, integrating a database of over 10,000 known allele sequences. Second, we present a sequence-to-PRG paired-end read mapping algorithm that enables accurate read mapping for the HLA genes. Third, we infer the most likely pair of underlying alleles at G group resolution from the IMGT/HLA database at each locus, employing a simple likelihood framework. We show that HLA*PRG, our algorithm, outperforms existing methods by a wide margin. We evaluate HLA*PRG on six classical class I and class II HLA genes (HLA-A, -B, -C, -DQA1, -DQB1, -DRB1) and on a set of 14 samples (3 samples with 2 x 100bp, 11 samples with 2 x 250bp Illumina HiSeq data). Of 158 alleles tested, we correctly infer 157 alleles (99.4%). We also identify and re-type two erroneous alleles in the original validation data. We conclude that HLA*PRG for the first time achieves accuracies comparable to gold-standard reference methods from standard whole-genome sequencing data, though high computational demands (currently ~30–250 CPU hours per sample) remain a significant challenge to practical application. PMID:27792722

  13. A PCR technique based on the Hip1 interspersed repetitive sequence distinguishes cyanobacterial species and strains.

    PubMed

    Smith, J K; Parry, J D; Day, J G; Smith, R J

    1998-10-01

    The use of primers based on the Hip1 sequence as a typing technique for cyanobacteria has been investigated. The discovery of short repetitive sequence structures in bacterial DNA during the last decade has led to the development of PCR-based methods for typing, i.e., distinguishing and identifying, bacterial species and strains. An octameric palindromic sequence known as Hip1 has been shown to be present in the chromosomal DNA of many species of cyanobacteria as a highly repetitious interspersed sequence. PCR primers were constructed that extended the Hip1 sequence at the 3' end by two bases. Five of the 16 possible extended primers were tested. Each of the five primers produced a different set of products when used to prime PCR from cyanobacterial genomic DNA. Each primer produced a distinct set of products for each of the 15 cyanobacterial species tested. The ability of Hip1-based PCR to resolve taxonomic differences was assessed by analysis of independent isolates of Anabaena flos-aquae and Nostoc ellipsosporum obtained from the CCAP (Culture Collection of Algae and Protozoa, IFE, Cumbria, UK). A PCR-based RFLP analysis of products amplified from the 23S-16S rDNA intergenic region was used to characterize the isolates and to compare with the Hip1 typing data. The RFLP and Hip1 typing yielded similar results and both techniques were able to distinguish different strains. On the basis of these results it is suggested that the Hip1 PCR technique may assist in distinguishing cyanobacterial species and strains.

  14. Orthogonal typing methods identify genetic diversity among Belgian Campylobacter jejuni strains isolated over a decade from poultry and cases of sporadic human illness.

    PubMed

    Elhadidy, Mohamed; Arguello, Hector; Álvarez-Ordóñez, Avelino; Miller, William G; Duarte, Alexandra; Martiny, Delphine; Hallin, Marie; Vandenberg, Olivier; Dierick, Katelijne; Botteldoorn, Nadine

    2018-06-20

    Campylobacter jejuni is a zoonotic pathogen commonly associated with human gastroenteritis. Retail poultry meat is a major food-related transmission source of C. jejuni to humans. The present study investigated the genetic diversity, clonal relationship, and strain risk-analysis of 403 representative C. jejuni isolates from chicken broilers (n = 204) and sporadic cases of human diarrhea (n = 199) over a decade (2006-2015) in Belgium, using multilocus sequence typing (MLST), PCR binary typing (P-BIT), and identification of lipooligosaccharide (LOS) biosynthesis locus classes. A total of 123 distinct sequence types (STs), clustered in 28 clonal complexes (CCs) were assigned, including ten novel sequence types that were not previously documented in the international database. Sequence types ST-48, ST-21, ST-50, ST-45, ST-464, ST-2274, ST-572, ST-19, ST-257 and ST-42 were the most prevalent. Clonal complex 21 was the main clonal complex in isolates from humans and chickens. Among observed STs, a total of 35 STs that represent 72.2% (291/403) of the isolates were identified in both chicken and human isolates confirming considerable epidemiological relatedness; these 35 STs also clustered together in the most prevalent CCs. A majority of the isolates harbored sialylated LOS loci associated with potential neuropathic outcomes in humans. Although the concordance between MLST and P-BIT, determined by the adjusted Rand and Wallace coefficients, showed low congruence between both typing methods. The discriminatory power of P-BIT and MLST was similar, with Simpson's diversity indexes of 0.978 and 0.975, respectively. Furthermore, P-BIT could provide additional epidemiological information that would provide further insights regarding the potential association to human health from each strain. In addition, certain clones could be linked to specific clinical symptoms. Indeed, LOS class E was associated with less severe infections. Moreover, ST-572 was significantly associated with clinical infections occurring after travelling abroad. Ultimately, the data generated from this study will help to better understand the molecular epidemiology of C. jejuni infection. Copyright © 2018. Published by Elsevier B.V.

  15. Personalized genomic analyses for cancer mutation discovery and interpretation

    PubMed Central

    Jones, Siân; Anagnostou, Valsamo; Lytle, Karli; Parpart-Li, Sonya; Nesselbush, Monica; Riley, David R.; Shukla, Manish; Chesnick, Bryan; Kadan, Maura; Papp, Eniko; Galens, Kevin G.; Murphy, Derek; Zhang, Theresa; Kann, Lisa; Sausen, Mark; Angiuoli, Samuel V.; Diaz, Luis A.; Velculescu, Victor E.

    2015-01-01

    Massively parallel sequencing approaches are beginning to be used clinically to characterize individual patient tumors and to select therapies based on the identified mutations. A major question in these analyses is the extent to which these methods identify clinically actionable alterations and whether the examination of the tumor tissue alone is sufficient or whether matched normal DNA should also be analyzed to accurately identify tumor-specific (somatic) alterations. To address these issues, we comprehensively evaluated 815 tumor-normal paired samples from patients of 15 tumor types. We identified genomic alterations using next-generation sequencing of whole exomes or 111 targeted genes that were validated with sensitivities >95% and >99%, respectively, and specificities >99.99%. These analyses revealed an average of 140 and 4.3 somatic mutations per exome and targeted analysis, respectively. More than 75% of cases had somatic alterations in genes associated with known therapies or current clinical trials. Analyses of matched normal DNA identified germline alterations in cancer-predisposing genes in 3% of patients with apparently sporadic cancers. In contrast, a tumor-only sequencing approach could not definitively identify germline changes in cancer-predisposing genes and led to additional false-positive findings comprising 31% and 65% of alterations identified in targeted and exome analyses, respectively, including in potentially actionable genes. These data suggest that matched tumor-normal sequencing analyses are essential for precise identification and interpretation of somatic and germline alterations and have important implications for the diagnostic and therapeutic management of cancer patients. PMID:25877891

  16. Clinical Epidemiology and Molecular Analysis of Extended-Spectrum-β-Lactamase-Producing Escherichia coli in Nepal: Characteristics of Sequence Types 131 and 648

    PubMed Central

    Sherchan, Jatan Bahadur; Miyoshi-Akiyama, Tohru; Ohmagari, Norio; Kirikae, Teruo; Nagamatsu, Maki; Tojo, Masayoshi; Ohara, Hiroshi; Sherchand, Jeevan B.; Tandukar, Sarmila

    2015-01-01

    Recently, CTX-M-type extended-spectrum-β-lactamase (ESBL)-producing Escherichia coli strains have emerged worldwide. In particular, E. coli with O antigen type 25 (O25) and sequence type 131 (ST131), which is often associated with the CTX-M-15 ESBL, has been increasingly reported globally; however, epidemiology reports on ESBL-producing E. coli in Asia are limited. Patients with clinical isolates of ESBL-producing E. coli in the Tribhuvan University teaching hospital in Kathmandu, Nepal, were included in this study. Whole-genome sequencing of the isolates was conducted to analyze multilocus sequence types, phylotypes, virulence genotypes, O25b-ST131 clones, and distribution of acquired drug resistance genes. During the study period, 105 patients with ESBL-producing E. coli isolation were identified, and the majority (90%) of these isolates were CTX-M-15 positive. The most dominant ST was ST131 (n = 54; 51.4%), followed by ST648 (n = 15; 14.3%). All ST131 isolates were identified as O25b-ST131 clones, subclone H30-Rx. Three ST groups (ST131, ST648, and non-ST131/648) were compared in further analyses. ST648 isolates had a proportionally higher resistance to non-β-lactam antibiotics and featured drug-resistant genes more frequently than ST131 or non-ST131/648 isolates. ST131 possessed the most virulence genes, followed by ST648. The clinical characteristics were similar among groups. More than 38% of ESBL-producing E. coli isolates were from the outpatient clinic, and pregnant patients comprised 24% of ESBL-producing E. coli cases. We revealed that the high resistance of ESBL-producing E. coli to multiple classes of antibiotics in Nepal is driven mainly by CTX-M-producing ST131 and ST648. Their immense prevalence in the communities is a matter of great concern. PMID:25824221

  17. Molecular analysis of the anaerobic rumen fungus Orpinomyces - insights into an AT-rich genome.

    PubMed

    Nicholson, Matthew J; Theodorou, Michael K; Brookman, Jayne L

    2005-01-01

    The anaerobic gut fungi occupy a unique niche in the intestinal tract of large herbivorous animals and are thought to act as primary colonizers of plant material during digestion. They are the only known obligately anaerobic fungi but molecular analysis of this group has been hampered by difficulties in their culture and manipulation, and by their extremely high A+T nucleotide content. This study begins to answer some of the fundamental questions about the structure and organization of the anaerobic gut fungal genome. Directed plasmid libraries using genomic DNA digested with highly or moderately rich AT-specific restriction enzymes (VspI and EcoRI) were prepared from a polycentric Orpinomyces isolate. Clones were sequenced from these libraries and the breadth of genomic inserts, both genic and intergenic, was characterized. Genes encoding numerous functions not previously characterized for these fungi were identified, including cytoskeletal, secretory pathway and transporter genes. A peptidase gene with no introns and having sequence similarity to a gene encoding a bacterial peptidase was also identified, extending the range of metabolic enzymes resulting from apparent trans-kingdom transfer from bacteria to fungi, as previously characterized largely for genes encoding plant-degrading enzymes. This paper presents the first thorough analysis of the genic, intergenic and rDNA regions of a variety of genomic segments from an anaerobic gut fungus and provides observations on rules governing intron boundaries, the codon biases observed with different types of genes, and the sequence of only the second anaerobic gut fungal promoter reported. Large numbers of retrotransposon sequences of different types were found and the authors speculate on the possible consequences of any such transposon activity in the genome. The coding sequences identified included several orphan gene sequences, including one with regions strongly suggestive of structural proteins such as collagens and lampirin. This gene was present as a single copy in Orpinomyces, was expressed during vegetative growth and was also detected in genomes from another gut fungal genus, Neocallimastix.

  18. Isolation of Canine parvovirus with a view to identify the prevalent serotype on the basis of partial sequence analysis

    PubMed Central

    Kaur, Gurpreet; Chandra, Mudit; Dwivedi, P. N.; Sharma, N. S.

    2015-01-01

    Aim: The aim of this study was to isolate Canine parvovirus (CPV) from suspected dogs on madin darby canine kidney (MDCK) cell line and its confirmation by polymerase chain reaction (PCR) and nested PCR (NPCR). Further, VP2 gene of the CPV isolates was amplified and sequenced to determine prevailing antigenic type. Materials and Methods: A total of 60 rectal swabs were collected from dogs showing signs of gastroenteritis, processed and subjected to isolation in MDCK cell line. The samples showing cytopathic effects (CPE) were confirmed by PCR and NPCR. These samples were subjected to PCR for amplification of VP2 gene of CPV, sequenced and analyzed to study the prevailing antigenic types of CPV. Results: Out of the 60 samples subjected to isolation in MDCK cell line five samples showed CPE in the form of rounding of cells, clumping of cells and finally detachment of the cells. When these samples and the two commercially available vaccines were subjected to PCR for amplification of VP2 gene, a 1710 bp product was amplified. The sequence analysis revealed that the vaccines belonged to the CPV-2 type and the samples were of CPV-2b type. Conclusion: It can be concluded from the present study that out of a total of 60 samples 5 samples exhibited CPE as observed in MDCK cell line. Sequence analysis of the VP2 gene among the samples and vaccine strains revealed that samples belonged to CPV-2b type and vaccines belonging to CPV-2. PMID:27046996

  19. Mining new crystal protein genes from Bacillus thuringiensis on the basis of mixed plasmid-enriched genome sequencing and a computational pipeline.

    PubMed

    Ye, Weixing; Zhu, Lei; Liu, Yingying; Crickmore, Neil; Peng, Donghai; Ruan, Lifang; Sun, Ming

    2012-07-01

    We have designed a high-throughput system for the identification of novel crystal protein genes (cry) from Bacillus thuringiensis strains. The system was developed with two goals: (i) to acquire the mixed plasmid-enriched genomic sequence of B. thuringiensis using next-generation sequencing biotechnology, and (ii) to identify cry genes with a computational pipeline (using BtToxin_scanner). In our pipeline method, we employed three different kinds of well-developed prediction methods, BLAST, hidden Markov model (HMM), and support vector machine (SVM), to predict the presence of Cry toxin genes. The pipeline proved to be fast (average speed, 1.02 Mb/min for proteins and open reading frames [ORFs] and 1.80 Mb/min for nucleotide sequences), sensitive (it detected 40% more protein toxin genes than a keyword extraction method using genomic sequences downloaded from GenBank), and highly specific. Twenty-one strains from our laboratory's collection were selected based on their plasmid pattern and/or crystal morphology. The plasmid-enriched genomic DNA was extracted from these strains and mixed for Illumina sequencing. The sequencing data were de novo assembled, and a total of 113 candidate cry sequences were identified using the computational pipeline. Twenty-seven candidate sequences were selected on the basis of their low level of sequence identity to known cry genes, and eight full-length genes were obtained with PCR. Finally, three new cry-type genes (primary ranks) and five cry holotypes, which were designated cry8Ac1, cry7Ha1, cry21Ca1, cry32Fa1, and cry21Da1 by the B. thuringiensis Toxin Nomenclature Committee, were identified. The system described here is both efficient and cost-effective and can greatly accelerate the discovery of novel cry genes.

  20. Germline sequence variants in TGM3 and RGS22 confer risk of basal cell carcinoma

    PubMed Central

    Stacey, Simon N.; Sulem, Patrick; Gudbjartsson, Daniel F.; Jonasdottir, Aslaug; Thorleifsson, Gudmar; Gudjonsson, Sigurjon A.; Masson, Gisli; Gudmundsson, Julius; Sigurgeirsson, Bardur; Benediktsdottir, Kristrun R.; Thorisdottir, Kristin; Ragnarsson, Rafn; Fuentelsaz, Victoria; Corredera, Cristina; Grasa, Matilde; Planelles, Dolores; Sanmartin, Onofre; Rudnai, Peter; Gurzau, Eugene; Koppova, Kvetoslava; Hemminki, Kari; Nexø, Bjørn A; Tjønneland, Anne; Overvad, Kim; Johannsdottir, Hrefna; Helgadottir, Hafdis T.; Thorsteinsdottir, Unnur; Kong, Augustine; Vogel, Ulla; Kumar, Rajiv; Nagore, Eduardo; Mayordomo, José I.; Rafnar, Thorunn; Olafsson, Jon H.; Stefansson, Kari

    2014-01-01

    To search for new sequence variants that confer risk of cutaneous basal cell carcinoma (BCC), we conducted a genome-wide association study of 38.5 million single nucleotide polymorphisms (SNPs) and small indels identified through whole-genome sequencing of 2230 Icelanders. We imputed genotypes for 4208 BCC patients and 109 408 controls using Illumina SNP chip typing data, carried out association tests and replicated the findings in independent population samples. We found new BCC susceptibility loci at TGM3 (rs214782[G], P = 5.5 × 10−17, OR = 1.29) and RGS22 (rs7006527[C], P = 8.7 × 10−13, OR = 0.77). TGM3 encodes transglutaminase type 3, which plays a key role in production of the cornified envelope during epidermal differentiation. PMID:24403052

  1. Defining Differential Genetic Signatures in CXCR4- and the CCR5-Utilizing HIV-1 Co-Linear Sequences

    PubMed Central

    Aiamkitsumrit, Benjamas; Dampier, Will; Martin-Garcia, Julio; Nonnemacher, Michael R.; Pirrone, Vanessa; Ivanova, Tatyana; Zhong, Wen; Kilareski, Evelyn; Aldigun, Hazeez; Frantz, Brian; Rimbey, Matthew; Wojno, Adam; Passic, Shendra; Williams, Jean W.; Shah, Sonia; Blakey, Brandon; Parikh, Nirzari; Jacobson, Jeffrey M.; Moldover, Brian; Wigdahl, Brian

    2014-01-01

    The adaptation of human immunodeficiency virus type-1 (HIV-1) to an array of physiologic niches is advantaged by the plasticity of the viral genome, encoded proteins, and promoter. CXCR4-utilizing (X4) viruses preferentially, but not universally, infect CD4+ T cells, generating high levels of virus within activated HIV-1-infected T cells that can be detected in regional lymph nodes and peripheral blood. By comparison, the CCR5-utilizing (R5) viruses have a greater preference for cells of the monocyte-macrophage lineage; however, while R5 viruses also display a propensity to enter and replicate in T cells, they infect a smaller percentage of CD4+ T cells in comparison to X4 viruses. Additionally, R5 viruses have been associated with viral transmission and CNS disease and are also more prevalent during HIV-1 disease. Specific adaptive changes associated with X4 and R5 viruses were identified in co-linear viral sequences beyond the Env-V3. The in silico position-specific scoring matrix (PSSM) algorithm was used to define distinct groups of X4 and R5 sequences based solely on sequences in Env-V3. Bioinformatic tools were used to identify genetic signatures involving specific protein domains or long terminal repeat (LTR) transcription factor sites within co-linear viral protein R (Vpr), trans-activator of transcription (Tat), or LTR sequences that were preferentially associated with X4 or R5 Env-V3 sequences. A number of differential amino acid and nucleotide changes were identified across the co-linear Vpr, Tat, and LTR sequences, suggesting the presence of specific genetic signatures that preferentially associate with X4 or R5 viruses. Investigation of the genetic relatedness between X4 and R5 viruses utilizing phylogenetic analyses of complete sequences could not be used to definitively and uniquely identify groups of R5 or X4 sequences; in contrast, differences in the genetic diversities between X4 and R5 were readily identified within these co-linear sequences in HIV-1-infected patients. PMID:25265194

  2. Comparative genomic analysis of Acinetobacter strains isolated from murine colonic crypts.

    PubMed

    Saffarian, Azadeh; Touchon, Marie; Mulet, Céline; Tournebize, Régis; Passet, Virginie; Brisse, Sylvain; Rocha, Eduardo P C; Sansonetti, Philippe J; Pédron, Thierry

    2017-07-11

    A restricted set of aerobic bacteria dominated by the Acinetobacter genus was identified in murine intestinal colonic crypts. The vicinity of such bacteria with intestinal stem cells could indicate that they protect the crypt against cytotoxic and genotoxic signals. Genome analyses of these bacteria were performed to better appreciate their biodegradative capacities. Two taxonomically different clusters of Acinetobacter were isolated from murine proximal colonic crypts, one was identified as A. modestus and the other as A. radioresistens. Their identification was performed through biochemical parameters and housekeeping gene sequencing. After selection of one strain of each cluster (A. modestus CM11G and A. radioresistens CM38.2), comparative genomic analysis was performed on whole-genome sequencing data. The antibiotic resistance pattern of these two strains is different, in line with the many genes involved in resistance to heavy metals identified in both genomes. Moreover whereas the operon benABCDE involved in benzoate metabolism is encoded by the two genomes, the operon antABC encoding the anthranilate dioxygenase, and the phenol hydroxylase gene cluster are absent in the A. modestus genomic sequence, indicating that the two strains have different capacities to metabolize xenobiotics. A common feature of the two strains is the presence of a type IV pili system, and the presence of genes encoding proteins pertaining to secretion systems such as Type I and Type II secretion systems. Our comparative genomic analysis revealed that different Acinetobacter isolated from the same biological niche, even if they share a large majority of genes, possess unique features that could play a specific role in the protection of the intestinal crypt.

  3. Hepatitis C virus genotypes in Singapore and Indonesia.

    PubMed

    Ng, W C; Guan, R; Tan, M F; Seet, B L; Lim, C A; Ngiam, C M; Sjaifoellah Noer, H M; Lesmana, L

    1995-01-01

    5' untranslated and partial core (C) region sequence of hepatitis C virus (HCV) in 21 Singaporean and 15 Indonesian isolates were amplified by reverse-transcription polymerase chain reaction and sequenced with the use of conserved primer sequences deduced from HCV genomes identified in other geographical regions. The HCV genotypes are predominantly that of Simmonds type 1 and less of type 2 and 3 with the latter genotype currently not detected in Indonesia. The 5' untranslated sequences are related to HCV-1. DK-7 (Denmark), US-11 (United States of America), HCV-J4, SA-10 (South Africa), T-3 (Taiwan), HCV-J6, HCV-J8, Eb-1 and Eb-8. When compared with the prototype HCV-1, insertions are found within the 5' untranslated region of Singaporean isolates and not in the Indonesians. There are Singaporean and Indonesian isolates that have sequences within the 5' untranslated region that differ slightly from each other. Microheterogeneity is observed in the core region of two Singaporeans and one Indonesian isolate. Finally, not all HCV isolates can be amplified with the conserved core sequence primers when compared with the ease with which these isolates can be amplified with 5' untranslated region conserved primers.

  4. Random oligonucleotide mutagenesis: application to a large protein coding sequence of a major histocompatibility complex class I gene, H-2DP.

    PubMed Central

    Murray, R; Pederson, K; Prosser, H; Muller, D; Hutchison, C A; Frelinger, J A

    1988-01-01

    We have used random oligonucleotide mutagenesis (or saturation mutagenesis) to create a library of point mutations in the alpha 1 protein domain of a Major Histocompatibility Complex (MHC) molecule. This protein domain is critical for T cell and B cell recognition. We altered the MHC class I H-2DP gene sequence such that synthetic mutant alpha 1 exons (270 bp of coding sequence), which contain mutations identified by sequence analysis, can replace the wild type alpha 1 exon. The synthetic exons were constructed from twelve overlapping oligonucleotides which contained an average of 1.3 random point mutations per intact exon. DNA sequence analysis of mutant alpha 1 exons has shown a point mutant distribution that fits a Poisson distribution, and thus emphasizes the utility of this mutagenesis technique to "scan" a large protein sequence for important mutations. We report our use of saturation mutagenesis to scan an entire exon of the H-2DP gene, a cassette strategy to replace the wild type alpha 1 exon with individual mutant alpha 1 exons, and analysis of mutant molecules expressed on the surface of transfected mouse L cells. Images PMID:2903482

  5. [A new human leukocyte antigen class I allele, HLA- B*52:11].

    PubMed

    Li, Xiao-feng; Zhang, Xu; Zhang, Kun-lian; Chen, Yang; Liu, Xian-zhi; Li, Jian-ping

    2011-12-01

    To identify and confirm a novel HLA allele. A new human leukocyte antigen class I allele was found during routine HLA genotyping by polymerase chain reaction-sequence specific oligonucleotide probes (PCR-SSOP) and sequencing-based typing (SBT). The novel HLA-B*52 allele was identical to B*52:01:01 with an exception of one base substitution at position 583 of exon 3 where a C was changed to T resulting in codon 195 changed from CAC(H) to TAC(Y). A new HLA class I allele, B*52:11, is identified, and is named officially by the WHO Nomenclature Committee.

  6. The use of whole genome sequencing in the investigation of a nosocomial influenza virus outbreak.

    PubMed

    Houlihan, Catherine; Frampton, Dan; Ferns, R Bridget; Raffle, Jade; Grant, Paul; Reidy, Myriam; Hail, Leila; Thomson, Kirsty; Mattes, Frank; Kozlakidis, Zisis; Pillay, Deenan; Hayward, Andrew; Nastouli, Eleni

    2018-06-05

    Traditional epidemiological investigation of nosocomial transmission of influenza involves the identification of patients who have the same influenza virus type and who have overlapped in time and place. This method may miss-identify transmission where it has not occurred or miss transmission when it has. We applied influenza virus whole genome sequencing (WGS) to an outbreak of influenza A in a haematology/oncology ward and identified two separate introductions; one which resulted in 5 additional infections and 79 bed-days lost. Results from WGS are becoming rapidly available and may supplement traditional infection control procedures in the investigation and management of nosocomial outbreaks.

  7. Identification and profiling of novel microRNAs in the Brassica rapa genome based on small RNA deep sequencing

    PubMed Central

    2012-01-01

    Background MicroRNAs (miRNAs) are one of the functional non-coding small RNAs involved in the epigenetic control of the plant genome. Although plants contain both evolutionary conserved miRNAs and species-specific miRNAs within their genomes, computational methods often only identify evolutionary conserved miRNAs. The recent sequencing of the Brassica rapa genome enables us to identify miRNAs and their putative target genes. In this study, we sought to provide a more comprehensive prediction of B. rapa miRNAs based on high throughput small RNA deep sequencing. Results We sequenced small RNAs from five types of tissue: seedlings, roots, petioles, leaves, and flowers. By analyzing 2.75 million unique reads that mapped to the B. rapa genome, we identified 216 novel and 196 conserved miRNAs that were predicted to target approximately 20% of the genome’s protein coding genes. Quantitative analysis of miRNAs from the five types of tissue revealed that novel miRNAs were expressed in diverse tissues but their expression levels were lower than those of the conserved miRNAs. Comparative analysis of the miRNAs between the B. rapa and Arabidopsis thaliana genomes demonstrated that redundant copies of conserved miRNAs in the B. rapa genome may have been deleted after whole genome triplication. Novel miRNA members seemed to have spontaneously arisen from the B. rapa and A. thaliana genomes, suggesting the species-specific expansion of miRNAs. We have made this data publicly available in a miRNA database of B. rapa called BraMRs. The database allows the user to retrieve miRNA sequences, their expression profiles, and a description of their target genes from the five tissue types investigated here. Conclusions This is the first report to identify novel miRNAs from Brassica crops using genome-wide high throughput techniques. The combination of computational methods and small RNA deep sequencing provides robust predictions of miRNAs in the genome. The finding of numerous novel miRNAs, many with few target genes and low expression levels, suggests the rapid evolution of miRNA genes. The development of a miRNA database, BraMRs, enables us to integrate miRNA identification, target prediction, and functional annotation of target genes. BraMRs will represent a valuable public resource with which to study the epigenetic control of B. rapa and other closely related Brassica species. The database is available at the following link: http://bramrs.rna.kr [1]. PMID:23163954

  8. Genotypic and Phenotypic Characterization of “Streptococcus milleri” Group Isolates from a Veterans Administration Hospital Population

    PubMed Central

    Clarridge, Jill E.; Osting, Cheryl; Jalali, Mehri; Osborne, Janet; Waddington, Michael

    1999-01-01

    Because identification of the species within the “Streptococcus milleri” group is difficult for the clinical laboratory as the species share overlapping phenotypic characteristics, we wished to confirm biochemical identification with identification by 16S rRNA gene sequence analysis. Ninety-four clinical isolates previously identified as the “Streptococcus milleri” group were reclassified as S. anginosus, S. constellatus, or S. intermedius with the API 20 Strep system (bioMerieux Vikek, Hazelton, Mo.) and the Fluo-card (Key Scientific, Round Rock, Tex.). In addition, we determined the Lancefield group, hemolysis, colony size, colony texture, repetitive extragenic palindromic PCR (rep-PCR) pattern, and cellular fatty acid (CFA) profile (MIDI, Newark, Del.). 16S rRNA gene sequence analysis with 40 selected representative strains showed three distinct groups, with S. constellatus and S. intermedius found to be more closely related to each other than to S. anginosus, and further distinguished a biochemically distinct group of urogenital isolates within the S. anginosus group of isolates. Except for strains unreactive with the Fluo-card (8%), all S. anginosus and S. intermedius strains identified by sequencing were similarly identified by biochemical testing. However, 23% of the selected S. constellatus isolates identified by sequencing (9% of all S. constellatus isolates) would have been identified as S. anginosus or S. intermedius by biochemical tests. Although most S. anginosus strains formed one unique cluster by CFA analysis and most S. constellatus strains showed similar rep-PCR patterns, neither method was sufficiently dependable for identification. Whereas Lancefield group or lactose fermentation did not correspond to sequence or biochemical type, S. constellatus was most likely to be beta-hemolytic and S. intermedius was most likely to have a dry colony type. The most frequent isolate in our population was S. constellatus, followed by S. anginosus. There was an association of S. anginosus with a gastrointestinal or urogenital source, and there was an association of S. constellatus and S. intermedius with both the respiratory tract and upper-body abscesses. PMID:10523574

  9. Development of a genotyping microarray for Usher syndrome.

    PubMed

    Cremers, Frans P M; Kimberling, William J; Külm, Maigi; de Brouwer, Arjan P; van Wijk, Erwin; te Brinke, Heleen; Cremers, Cor W R J; Hoefsloot, Lies H; Banfi, Sandro; Simonelli, Francesca; Fleischhauer, Johannes C; Berger, Wolfgang; Kelley, Phil M; Haralambous, Elene; Bitner-Glindzicz, Maria; Webster, Andrew R; Saihan, Zubin; De Baere, Elfride; Leroy, Bart P; Silvestri, Giuliana; McKay, Gareth J; Koenekoop, Robert K; Millan, Jose M; Rosenberg, Thomas; Joensuu, Tarja; Sankila, Eeva-Marja; Weil, Dominique; Weston, Mike D; Wissinger, Bernd; Kremer, Hannie

    2007-02-01

    Usher syndrome, a combination of retinitis pigmentosa (RP) and sensorineural hearing loss with or without vestibular dysfunction, displays a high degree of clinical and genetic heterogeneity. Three clinical subtypes can be distinguished, based on the age of onset and severity of the hearing impairment, and the presence or absence of vestibular abnormalities. Thus far, eight genes have been implicated in the syndrome, together comprising 347 protein-coding exons. To improve DNA diagnostics for patients with Usher syndrome, we developed a genotyping microarray based on the arrayed primer extension (APEX) method. Allele-specific oligonucleotides corresponding to all 298 Usher syndrome-associated sequence variants known to date, 76 of which are novel, were arrayed. Approximately half of these variants were validated using original patient DNAs, which yielded an accuracy of >98%. The efficiency of the Usher genotyping microarray was tested using DNAs from 370 unrelated European and American patients with Usher syndrome. Sequence variants were identified in 64/140 (46%) patients with Usher syndrome type I, 45/189 (24%) patients with Usher syndrome type II, 6/21 (29%) patients with Usher syndrome type III and 6/20 (30%) patients with atypical Usher syndrome. The chip also identified two novel sequence variants, c.400C>T (p.R134X) in PCDH15 and c.1606T>C (p.C536S) in USH2A. The Usher genotyping microarray is a versatile and affordable screening tool for Usher syndrome. Its efficiency will improve with the addition of novel sequence variants with minimal extra costs, making it a very useful first-pass screening tool.

  10. Development of a genotyping microarray for Usher syndrome

    PubMed Central

    Cremers, Frans P M; Kimberling, William J; Külm, Maigi; de Brouwer, Arjan P; van Wijk, Erwin; te Brinke, Heleen; Cremers, Cor W R J; Hoefsloot, Lies H; Banfi, Sandro; Simonelli, Francesca; Fleischhauer, Johannes C; Berger, Wolfgang; Kelley, Phil M; Haralambous, Elene; Bitner‐Glindzicz, Maria; Webster, Andrew R; Saihan, Zubin; De Baere, Elfride; Leroy, Bart P; Silvestri, Giuliana; McKay, Gareth J; Koenekoop, Robert K; Millan, Jose M; Rosenberg, Thomas; Joensuu, Tarja; Sankila, Eeva‐Marja; Weil, Dominique; Weston, Mike D; Wissinger, Bernd; Kremer, Hannie

    2007-01-01

    Background Usher syndrome, a combination of retinitis pigmentosa (RP) and sensorineural hearing loss with or without vestibular dysfunction, displays a high degree of clinical and genetic heterogeneity. Three clinical subtypes can be distinguished, based on the age of onset and severity of the hearing impairment, and the presence or absence of vestibular abnormalities. Thus far, eight genes have been implicated in the syndrome, together comprising 347 protein‐coding exons. Methods: To improve DNA diagnostics for patients with Usher syndrome, we developed a genotyping microarray based on the arrayed primer extension (APEX) method. Allele‐specific oligonucleotides corresponding to all 298 Usher syndrome‐associated sequence variants known to date, 76 of which are novel, were arrayed. Results Approximately half of these variants were validated using original patient DNAs, which yielded an accuracy of >98%. The efficiency of the Usher genotyping microarray was tested using DNAs from 370 unrelated European and American patients with Usher syndrome. Sequence variants were identified in 64/140 (46%) patients with Usher syndrome type I, 45/189 (24%) patients with Usher syndrome type II, 6/21 (29%) patients with Usher syndrome type III and 6/20 (30%) patients with atypical Usher syndrome. The chip also identified two novel sequence variants, c.400C>T (p.R134X) in PCDH15 and c.1606T>C (p.C536S) in USH2A. Conclusion The Usher genotyping microarray is a versatile and affordable screening tool for Usher syndrome. Its efficiency will improve with the addition of novel sequence variants with minimal extra costs, making it a very useful first‐pass screening tool. PMID:16963483

  11. Sequence-Based Genotyping of Expressed Swine Leukocyte Antigen Class I Alleles by Next-Generation Sequencing Reveal Novel Swine Leukocyte Antigen Class I Haplotypes and Alleles in Belgian, Danish, and Kenyan Fattening Pigs and Göttingen Minipigs.

    PubMed

    Sørensen, Maria Rathmann; Ilsøe, Mette; Strube, Mikael Lenz; Bishop, Richard; Erbs, Gitte; Hartmann, Sofie Bruun; Jungersen, Gregers

    2017-01-01

    The need for typing of the swine leukocyte antigen (SLA) is increasing with the expanded use of pigs as models for human diseases and organ-transplantation experiments, their use in infection studies, and for design of veterinary vaccines. Knowledge of SLA sequences is furthermore a prerequisite for the prediction of epitope binding in pigs. The low number of known SLA class I alleles and the limited knowledge of their prevalence in different pig breeds emphasizes the need for efficient SLA typing methods. This study utilizes an SLA class I-typing method based on next-generation sequencing of barcoded PCR amplicons. The amplicons were generated with universal primers and predicted to resolve 68-88% of all known SLA class I alleles dependent on amplicon size. We analyzed the SLA profiles of 72 pigs from four different pig populations; Göttingen minipigs and Belgian, Kenyan, and Danish fattening pigs. We identified 67 alleles, nine previously described haplotypes and 15 novel haplotypes. The highest variation in SLA class I profiles was observed in the Danish pigs and the lowest among the Göttingen minipig population, which also have the highest percentage of homozygote individuals. Highlighting the fact that there are still numerous unknown SLA class I alleles to be discovered, a total of 12 novel SLA class I alleles were identified. Overall, we present new information about known and novel alleles and haplotypes and their prevalence in the tested pig populations.

  12. Genetic diversity among sea otter isolates of Toxoplasma gondii

    USGS Publications Warehouse

    Sundar, N.; Cole, Rebecca A.; Thomas, N.J.; Majumdar, D.; Dubey, J.P.; Su, C.

    2008-01-01

    Sea otters (Enhydra lutris) have been reported to become infected with Toxoplasma gondiiand at times succumb to clinical disease. Here, we determined genotypes of 39 T. gondiiisolates from 37 sea otters in two geographically distant locations (25 from California and 12 from Washington). Six genotypes were identified using 10 PCR-RFLP genetic markers including SAG1, SAG2, SAG3, BTUB, GRA6, c22-8, c29-2, L358, PK1, and Apico, and by DNA sequencing of loci SAG1 and GRA6 in 13 isolates. Of these 39 isolates, 13 (33%) were clonal Type II which can be further divided into two groups at the locus Apico. Two of the 39 isolates had Type II alleles at all loci except a Type I allele at locus L358. One isolate had Type II alleles at all loci except the Type I alleles at loci L358 and Apico. One isolate had Type III alleles at all loci except Type II alleles at SAG2 and Apico. Two sea otter isolates had a mixed infection. Twenty-one (54%) isolates had an unique allele at SAG1 locus. Further genotyping or DNA sequence analysis for 18 of these 21 isolates at loci SAG1 and GRA6 revealed that there were two different genotypes, including the previously identified Type X (four isolates) and a new genotype named Type A (14 isolates). The results from this study suggest that the sea otter isolates are genetically diverse.

  13. Polynucleobacter bacteria in the brackish-water species Euplotes harpa (Ciliata Hypotrichia).

    PubMed

    Vannini, Claudia; Petroni, Giulio; Verni, Franco; Rosati, Giovanna

    2005-01-01

    We have found a Polynucleobacter bacterium in the cytoplasm of Euplotes harpa, a species living in a brackish-water habitat, with a cirral pattern not corresponding to that of the freshwater Euplotes species known to harbor this type of bacteria. The symbiont has been found in three strains of the species, obtained by clonal cultures from ciliates collected in different geographic regions. The 16S rRNA gene sequence of this bacterium identifies it as a member of the beta-proteobacterial genus Polynucleobacter. This sequence shares a high similarity value (98.4-98.5%) with P. necessarius, the type species of the genus, and is associated with 16S rRNA gene sequences of environmental clones and bacterial strains included in the Polynucleobacter cluster (>95%). An oligonucleotide probe was designed to corroborate the assignment of the retrieved sequence to the symbiont and to detect similar bacteria rapidly. Antibiotic experiments showed that the elimination of the bacteria stops the reproductive cycle in E. harpa, as has been shown for the freshwater Euplotes species.

  14. M dwarf spectra from 0.6 to 1.5 micron - A spectral sequence, model atmosphere fitting, and the temperature scale

    NASA Technical Reports Server (NTRS)

    Kirkpatrick, J. D.; Kelly, Douglas M.; Rieke, George H.; Liebert, James; Allard, France; Wehrse, Rainer

    1993-01-01

    Red/infrared (0.6-1.5 micron) spectra are presented for a sequence of well-studied M dwarfs ranging from M2 through M9. A variety of temperature-sensitive features useful for spectral classification are identified. Using these features, the spectral data are compared to recent theoretical models, from which a temperature scale is assigned. The red portion of the model spectra provide reasonably good fits for dwarfs earlier than M6. For layer types, the infrared region provides a more reliable fit to the observations. In each case, the wavelength region used includes the broad peak of the energy distribution. For a given spectral type, the derived temperature sequence assigns higher temperatures than have earlier studies - the difference becoming more pronounced at lower luminosities. The positions of M dwarfs on the H-R diagram are, as a result, in closer agreement with theoretical tracks of the lower main sequence.

  15. Genotypic and phenotypic evaluation of off-type grasses in hybrid Bermudagrass [Cynodon dactylon (L.) Pers. x C. transvaalensis Burtt-Davy] putting greens using genotyping-by-sequencing and morphological characterization.

    PubMed

    Reasor, Eric H; Brosnan, James T; Staton, Margaret E; Lane, Thomas; Trigiano, Robert N; Wadl, Phillip A; Conner, Joann A; Schwartz, Brian M

    2018-01-01

    Interspecific hybrid bermudagrass [ Cynodon dactylon (L.) Pers. x C. transvaalensis Burtt-Davy] is one of the most widely used grasses on golf courses, with cultivars derived from 'Tifgreen' or 'Tifdwarf' particularly used for putting greens. Many bermudagrass cultivars established for putting greens can be genetically unstable and lead to the occurrence of undesirable off-type grasses that vary in phenotype. The objective of this research was to genetically and phenotypically differentiate off-type grasses and hybrid cultivars. Beginning in 2013, off-type and desirable hybrid bermudagrass samples were collected from golf course putting greens in the southeastern United States and genetically and phenotypically characterized using genotyping-by-sequencing and morphology. Genotyping-by-sequencing determined that 11% (5) of off-type and desirable samples from putting greens were genetically divergent from standard cultivars such as Champion, MiniVerde, Tifdwarf, TifEagle, and Tifgreen. In addition, genotyping-by-sequencing was unable to genetically distinguish all standard cultivars from one another due to their similar origin and clonal propagation; however, over 90,000 potentially informative nucleotide variants were identified among the triploid hybrid cultivars. Although few genetic differences were found in this research, samples harvested from golf course putting greens had variable morphology and were clustered into three distinct phenotypic groups. The majority of off-type grasses in hybrid bermudagrass putting greens were genetically similar with variable morphological traits. Off-type grasses within golf course putting greens have the potential to compromise putting surface functionality and aesthetics.

  16. Genotyping of Coxiella burnetii from domestic ruminants and human in Hungary: indication of various genotypes.

    PubMed

    Sulyok, Kinga M; Kreizinger, Zsuzsa; Hornstra, Heidie M; Pearson, Talima; Szigeti, Alexandra; Dán, Ádám; Balla, Eszter; Keim, Paul S; Gyuranecz, Miklós

    2014-05-07

    Information about the genotypic characteristic of Coxiella burnetii from Hungary is lacking. The aim of this study is to describe the genetic diversity of C. burnetii in Hungary and compare genotypes with those found elsewhere. A total of 12 samples: (cattle, n = 6, sheep, n = 5 and human, n = 1) collected from across Hungary were studied by a 10-loci multispacer sequence typing (MST) and 6-loci multiple-locus variable-number of tandem repeat analysis (MLVA). Phylogenetic relationships among MST genotypes show how these Hungarian samples are related to others collected around the world. Three MST genotypes were identified: sequence type (ST) 20 has also been identified in ruminants from other European countries and the USA, ST28 was previously identified in Kazakhstan, and the proposed ST37 is novel. All MST genotypes yielded different MLVA genotypes and three different MLVA genotypes were identified within ST20 samples alone. Two novel MLVA types 0-9-5-5-6-2 (AG) and 0-8-4-5-6-2 (AF) (Ms23-Ms24-Ms27-Ms28-Ms33-Ms34) were defined in the ovine materials correlated with ST28 and ST37. Samples from different parts of the phylogenetic tree were associated with different hosts, suggesting host-specific adaptations. Even with the limited number of samples analysed, this study revealed high genetic diversity among C. burnetii in Hungary. Understanding the background genetic diversity will be essential in identifying and controlling outbreaks.

  17. Genome Re-Sequencing of Semi-Wild Soybean Reveals a Complex Soja Population Structure and Deep Introgression

    PubMed Central

    Wu, Sanling; Wang, Ying-Ying; Ye, Chu-Yu; Bai, Xuefei; Li, Zefeng; Yan, Chenghai; Wang, Weidi; Wang, Ziqiang; Shu, Qingyao; Xie, Jiahua; Lee, Suk-Ha; Fan, Longjiang

    2014-01-01

    Semi-wild soybean is a unique type of soybean that retains both wild and domesticated characteristics, which provides an important intermediate type for understanding the evolution of the subgenus Soja population in the Glycine genus. In this study, a semi-wild soybean line (Maliaodou) and a wild line (Lanxi 1) collected from the lower Yangtze regions were deeply sequenced while nine other semi-wild lines were sequenced to a 3-fold genome coverage. Sequence analysis revealed that (1) no independent phylogenetic branch covering all 10 semi-wild lines was observed in the Soja phylogenetic tree; (2) besides two distinct subpopulations of wild and cultivated soybean in the Soja population structure, all semi-wild lines were mixed with some wild lines into a subpopulation rather than an independent one or an intermediate transition type of soybean domestication; (3) high heterozygous rates (0.19–0.49) were observed in several semi-wild lines; and (4) over 100 putative selective regions were identified by selective sweep analysis, including those related to the development of seed size. Our results suggested a hybridization origin for the semi-wild soybean, which makes a complex Soja population structure. PMID:25265539

  18. The Human Microbiome and Understanding the 16S rRNA Gene in Translational Nursing Science

    PubMed Central

    Ames, Nancy J.; Ranucci, Alexandra; Moriyama, Brad; Wallen, Gwenyth R.

    2017-01-01

    Background As more is understood regarding the human microbiome, it is increasingly important for nurse scientists and health care practitioners to analyze these microbial communities and their role in health and disease.16S rRNA sequencing is a key methodology in identifying these bacterial populations that has recently transitioned from use primarily in research to having increased utility in clinical settings. Objectives The objectives of this review are to: (a) describe 16S rRNA sequencing and its role in answering research questions important to nursing science; (b) provide an overview of the oral, lung and gut microbiomes and relevant research; and (c) identify future implications for microbiome research and 16S sequencing in translational nursing science. Discussion Sequencing using the 16S rRNA gene has revolutionized research and allowed scientists to easily and reliably characterize complex bacterial communities. This type of research has recently entered the clinical setting, one of the best examples involving the use of 16S sequencing to identify resistant pathogens, thereby improving the accuracy of bacterial identification in infection control. Clinical microbiota research and related requisite methods are of particular relevance to nurse scientists—individuals uniquely positioned to utilize these techniques in future studies in clinical settings. PMID:28252578

  19. Mitogenomes from type specimens, a genotyping tool for morphologically simple species: ten genomes of agar-producing red algae.

    PubMed

    Boo, Ga Hun; Hughey, Jeffery R; Miller, Kathy Ann; Boo, Sung Min

    2016-10-14

    DNA sequences from type specimens provide independent, objective characters that enhance the value of type specimens and permit the correct application of species names to phylogenetic clades and specimens. We provide mitochondrial genomes (mitogenomes) from archival type specimens of ten species in agar-producing red algal genera Gelidium and Pterocladiella. The genomes contain 43-44 genes, ranging in size from 24,910 to 24,970 bp with highly conserved gene synteny. Low Ka/Ks ratios of apocytochrome b and cytochrome oxidase genes support their utility as markers. Phylogenies of mitogenomes and cox1+rbcL sequences clarified classification at the genus and species levels. Three species formerly in Gelidium and Pterocladia are transferred to Pterocladiella: P. media comb. nov., P. musciformis comb. nov., and P. luxurians comb. and stat. nov. Gelidium sinicola is merged with G. coulteri because they share identical cox1 and rbcL sequences. We describe a new species, Gelidium millariana sp. nov., previously identified as G. isabelae from Australia. We demonstrate that mitogenomes from type specimens provide a new tool for typifying species in the Gelidiales and that there is an urgent need for analyzing mitogenomes from type specimens of red algae and other morphologically simple organisms for insight into their nomenclature, taxonomy and evolution.

  20. Mitogenomes from type specimens, a genotyping tool for morphologically simple species: ten genomes of agar-producing red algae

    PubMed Central

    Boo, Ga Hun; Hughey, Jeffery R.; Miller, Kathy Ann; Boo, Sung Min

    2016-01-01

    DNA sequences from type specimens provide independent, objective characters that enhance the value of type specimens and permit the correct application of species names to phylogenetic clades and specimens. We provide mitochondrial genomes (mitogenomes) from archival type specimens of ten species in agar-producing red algal genera Gelidium and Pterocladiella. The genomes contain 43–44 genes, ranging in size from 24,910 to 24,970 bp with highly conserved gene synteny. Low Ka/Ks ratios of apocytochrome b and cytochrome oxidase genes support their utility as markers. Phylogenies of mitogenomes and cox1+rbcL sequences clarified classification at the genus and species levels. Three species formerly in Gelidium and Pterocladia are transferred to Pterocladiella: P. media comb. nov., P. musciformis comb. nov., and P. luxurians comb. and stat. nov. Gelidium sinicola is merged with G. coulteri because they share identical cox1 and rbcL sequences. We describe a new species, Gelidium millariana sp. nov., previously identified as G. isabelae from Australia. We demonstrate that mitogenomes from type specimens provide a new tool for typifying species in the Gelidiales and that there is an urgent need for analyzing mitogenomes from type specimens of red algae and other morphologically simple organisms for insight into their nomenclature, taxonomy and evolution. PMID:27739454

  1. Expansion of the 'Reticulosphere': Diversity of Novel Branching and Network-forming Amoebae Helps to Define Variosea (Amoebozoa).

    PubMed

    Berney, Cédric; Geisen, Stefan; Van Wichelen, Jeroen; Nitsche, Frank; Vanormelingen, Pieter; Bonkowski, Michael; Bass, David

    2015-05-01

    Amoebae able to form cytoplasmic networks or displaying a multiply branching morphology remain very poorly studied. We sequenced the small-subunit ribosomal RNA gene of 15 new amoeboid isolates, 14 of which are branching or network-forming amoebae (BNFA). Phylogenetic analyses showed that these isolates all group within the poorly-known and weakly-defined class Variosea (Amoebozoa). They are resolved into six lineages corresponding to distinct new morphotypes; we describe them as new genera Angulamoeba (type species Angulamoeba microcystivorans n. gen., n. sp.; and A. fungorum n. sp.), Arboramoeba (type species Arboramoeba reticulata n. gen., n. sp.), Darbyshirella (type species Darbyshirella terrestris n. gen., n. sp.), Dictyamoeba (type species Dictyamoeba vorax n. gen., n. sp.), Heliamoeba (type species Heliamoeba mirabilis n. gen., n. sp.), and Ischnamoeba (type species Ischnamoeba montana n. gen., n. sp.). We also isolated and sequenced four additional variosean strains, one belonging to Flamella, one related to Telaepolella tubasferens, and two members of the cavosteliid protosteloid lineage. We identified a further 104 putative variosean environmental clone sequences in GenBank, comprising up to 14 lineages that may prove to represent additional novel morphotypes. We show that BNFA are phylogenetically widespread in Variosea and morphologically very variable, both within and between lineages. Copyright © 2015 Elsevier GmbH. All rights reserved.

  2. Identification of Type A, B, E, and F Botulinum Neurotoxin Genes and of Botulinum Neurotoxigenic Clostridia by Denaturing High-Performance Liquid Chromatography

    PubMed Central

    Franciosa, Giovanna; Pourshaban, Manoocheher; De Luca, Alessandro; Buccino, Anna; Dallapiccola, Bruno; Aureli, Paolo

    2004-01-01

    Denaturing high-performance liquid chromatography (DHPLC) is a recently developed technique for rapid screening of nucleotide polymorphisms in PCR products. We used this technique for the identification of type A, B, E, and F botulinum neurotoxin genes. PCR products amplified from a conserved region of the type A, B, E, and F botulinum toxin genes from Clostridium botulinum, neurotoxigenic C. butyricum type E, and C. baratii type F strains were subjected to both DHPLC analysis and sequencing. Unique DHPLC peak profiles were obtained with each different type of botulinum toxin gene fragment, consistent with nucleotide differences observed in the related sequences. We then evaluated the ability of this technique to identify botulinal neurotoxigenic organisms at the genus and species level. A specific short region of the 16S rRNA gene which contains genus-specific and in some cases species-specific heterogeneity was amplified from botulinum neurotoxigenic clostridia and from different food-borne pathogens and subjected to DHPLC analysis. Different peak profiles were obtained for each genus and species, demonstrating that the technique could be a reliable alternative to sequencing for the rapid identification of food-borne pathogens, specifically of botulinal neurotoxigenic clostridia most frequently implicated in human botulism. PMID:15240298

  3. A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals.

    PubMed

    Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M

    2017-01-01

    Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.

  4. Utility of Whole-Genome Sequencing of Escherichia coli O157 for Outbreak Detection and Epidemiological Surveillance.

    PubMed

    Holmes, Anne; Allison, Lesley; Ward, Melissa; Dallman, Timothy J; Clark, Richard; Fawkes, Angie; Murphy, Lee; Hanson, Mary

    2015-11-01

    Detailed laboratory characterization of Escherichia coli O157 is essential to inform epidemiological investigations. This study assessed the utility of whole-genome sequencing (WGS) for outbreak detection and epidemiological surveillance of E. coli O157, and the data were used to identify discernible associations between genotypes and clinical outcomes. One hundred five E. coli O157 strains isolated over a 5-year period from human fecal samples in Lothian, Scotland, were sequenced with the Ion Torrent Personal Genome Machine. A total of 8,721 variable sites in the core genome were identified among the 105 isolates; 47% of the single nucleotide polymorphisms (SNPs) were attributable to six "atypical" E. coli O157 strains and included recombinant regions. Phylogenetic analyses showed that WGS correlated well with the epidemiological data. Epidemiological links existed between cases whose isolates differed by three or fewer SNPs. WGS also correlated well with multilocus variable-number tandem repeat analysis (MLVA) typing data, with only three discordant results observed, all among isolates from cases not known to be epidemiologically related. WGS produced a better-supported, higher-resolution phylogeny than MLVA, confirming that the method is more suitable for epidemiological surveillance of E. coli O157. A combination of in silico analyses (VirulenceFinder, ResFinder, and local BLAST searches) were used to determine stx subtypes, multilocus sequence types (15 loci), and the presence of virulence and acquired antimicrobial resistance genes. There was a high level of correlation between the WGS data and our routine typing methods, although some discordant results were observed, mostly related to the limitation of short sequence read assembly. The data were used to identify sublineages and clades of E. coli O157, and when they were correlated with the clinical outcome data, they showed that one clade, Ic3, was significantly associated with severe disease. Together, the results show that WGS data can provide higher resolution of the relationships between E. coli O157 isolates than that provided by MLVA. The method has the potential to streamline the laboratory workflow and provide detailed information for the clinical management of patients and public health interventions. Copyright © 2015, Holmes et al.

  5. Utility of Whole-Genome Sequencing of Escherichia coli O157 for Outbreak Detection and Epidemiological Surveillance

    PubMed Central

    Allison, Lesley; Ward, Melissa; Dallman, Timothy J.; Clark, Richard; Fawkes, Angie; Murphy, Lee; Hanson, Mary

    2015-01-01

    Detailed laboratory characterization of Escherichia coli O157 is essential to inform epidemiological investigations. This study assessed the utility of whole-genome sequencing (WGS) for outbreak detection and epidemiological surveillance of E. coli O157, and the data were used to identify discernible associations between genotypes and clinical outcomes. One hundred five E. coli O157 strains isolated over a 5-year period from human fecal samples in Lothian, Scotland, were sequenced with the Ion Torrent Personal Genome Machine. A total of 8,721 variable sites in the core genome were identified among the 105 isolates; 47% of the single nucleotide polymorphisms (SNPs) were attributable to six “atypical” E. coli O157 strains and included recombinant regions. Phylogenetic analyses showed that WGS correlated well with the epidemiological data. Epidemiological links existed between cases whose isolates differed by three or fewer SNPs. WGS also correlated well with multilocus variable-number tandem repeat analysis (MLVA) typing data, with only three discordant results observed, all among isolates from cases not known to be epidemiologically related. WGS produced a better-supported, higher-resolution phylogeny than MLVA, confirming that the method is more suitable for epidemiological surveillance of E. coli O157. A combination of in silico analyses (VirulenceFinder, ResFinder, and local BLAST searches) were used to determine stx subtypes, multilocus sequence types (15 loci), and the presence of virulence and acquired antimicrobial resistance genes. There was a high level of correlation between the WGS data and our routine typing methods, although some discordant results were observed, mostly related to the limitation of short sequence read assembly. The data were used to identify sublineages and clades of E. coli O157, and when they were correlated with the clinical outcome data, they showed that one clade, Ic3, was significantly associated with severe disease. Together, the results show that WGS data can provide higher resolution of the relationships between E. coli O157 isolates than that provided by MLVA. The method has the potential to streamline the laboratory workflow and provide detailed information for the clinical management of patients and public health interventions. PMID:26354815

  6. Four distinct types of E.C. 1.2.1.30 enzymes can catalyze the reduction of carboxylic acids to aldehydes.

    PubMed

    Stolterfoht, Holly; Schwendenwein, Daniel; Sensen, Christoph W; Rudroff, Florian; Winkler, Margit

    2017-09-10

    Increasing demand for chemicals from renewable resources calls for the development of new biotechnological methods for the reduction of oxidized bio-based compounds. Enzymatic carboxylate reduction is highly selective, both in terms of chemo- and product selectivity, but not many carboxylate reductase enzymes (CARs) have been identified on the sequence level to date. Thus far, their phylogeny is unexplored and very little is known about their structure-function-relationship. CARs minimally contain an adenylation domain, a phosphopantetheinylation domain and a reductase domain. We have recently identified new enzymes of fungal origin, using similarity searches against genomic sequences from organisms in which aldehydes were detected upon incubation with carboxylic acids. Analysis of sequences with known CAR functionality and CAR enzymes recently identified in our laboratory suggests that the three-domain architecture mentioned above is modular. The construction of a distance tree with a subsequent 1000-replicate bootstrap analysis showed that the CAR sequences included in our study fall into four distinct subgroups (one of bacterial origin and three of fungal origin, respectively), each with a bootstrap value of 100%. The multiple sequence alignment of all experimentally confirmed CAR protein sequences revealed fingerprint sequences of residues which are likely to be involved in substrate and co-substrate binding and one of the three catalytic substeps, respectively. The fingerprint sequences broaden our understanding of the amino acids that might be essential for the reduction of organic acids to the corresponding aldehydes in CAR proteins. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Sequence data and association statistics from 12,940 type 2 diabetes cases and controls.

    PubMed

    Flannick, Jason; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M; Agarwala, Vineeta; Gaulton, Kyle J; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Dennis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana Cn; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Altshuler, David; Burtt, Noël P; Florez, Jose C; Boehnke, Michael; McCarthy, Mark I

    2017-12-19

    To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.

  8. Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

    PubMed Central

    Jason, Flannick; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M.; Agarwala, Vineeta; Gaulton, Kyle J.; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J.; Rivas, Manuel A.; Perry, John R. B.; Sim, Xueling; Blackwell, Thomas W.; Robertson, Neil R.; Rayner, N William; Cingolani, Pablo; Locke, Adam E.; Tajes, Juan Fernandez; Highland, Heather M.; Dupuis, Josee; Chines, Peter S.; Lindgren, Cecilia M.; Hartl, Christopher; Jackson, Anne U.; Chen, Han; Huyghe, Jeroen R.; van de Bunt, Martijn; Pearson, Richard D.; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M.; Gamazon, Eric R.; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A.; Below, Jennifer E.; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L.; Pasko, Dorota; Parker, Stephen C. J.; Varga, Tibor V.; Green, Todd; Beer, Nicola L.; Day-Williams, Aaron G.; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J.; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P.; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F.; Han, Bok-Ghee; Jenkinson, Christopher P.; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C. Y.; Palmer, Nicholette D.; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E.; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D.; Neale, Benjamin M.; Purcell, Shaun; Butterworth, Adam S.; Howson, Joanna M. M.; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K. L.; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H. T.; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E.; Rybin, Dennis; Farook, Vidya S.; Fowler, Sharon P.; Freedman, Barry I.; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J.; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K.; Puppala, Sobha; Scott, William R.; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A.; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C.; Mangino, Massimo; Bonnycastle, Lori L.; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L.; Herder, Christian; Groves, Christopher J.; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A.; Doney, Alex S. F.; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J.; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E.; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H.; Stirrups, Kathleen; Wood, Andrew R.; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O.; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P.; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B.; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N. A.; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M.; Syvänen, Ann-Christine; Bergman, Richard N.; Bharadwaj, Dwaipayan; Bottinger, Erwin P.; Cho, Yoon Shin; Chandak, Giriraj R.; Chan, Juliana CN; Chia, Kee Seng; Daly, Mark J.; Ebrahim, Shah B.; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A.; Lehman, Donna M.; Jia, Weiping; Ma, Ronald C. W.; Pollin, Toni I.; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J. F.; Small, Kerrin S.; Ried, Janina S.; DeFronzo, Ralph A.; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J.; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W.; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R.; Gloyn, Anna L.; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D.; Hattersley, Andrew T.; Bowden, Donald W.; Collins, Francis S.; Atzmon, Gil; Chambers, John C.; Spector, Timothy D.; Laakso, Markku; Strom, Tim M.; Bell, Graeme I.; Blangero, John; Duggirala, Ravindranath; Tai, E. Shyong; McVean, Gilean; Hanis, Craig L.; Wilson, James G.; Seielstad, Mark; Frayling, Timothy M.; Meigs, James B.; Cox, Nancy J.; Sladek, Rob; Lander, Eric S.; Gabriel, Stacey; Mohlke, Karen L.; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J.; Morris, Andrew P.; Kang, Hyun Min; Altshuler, David; Burtt, Noël P.; Florez, Jose C.; Boehnke, Michael; McCarthy, Mark I.

    2017-01-01

    To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1–5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D. PMID:29257133

  9. Multiple Locus Variable-Number Tandem-Repeat and Single-Nucleotide Polymorphism-Based Brucella Typing Reveals Multiple Lineages in Brucella melitensis Currently Endemic in China.

    PubMed

    Sun, Mingjun; Jing, Zhigang; Di, Dongdong; Yan, Hao; Zhang, Zhicheng; Xu, Quangang; Zhang, Xiyue; Wang, Xun; Ni, Bo; Sun, Xiangxiang; Yan, Chengxu; Yang, Zhen; Tian, Lili; Li, Jinping; Fan, Weixing

    2017-01-01

    Brucellosis is a worldwide zoonotic disease caused by Brucella spp. In China, brucellosis is recognized as a reemerging disease mainly caused by Brucella melitensis specie. To better understand the currently endemic B. melitensis strains in China, three Brucella genotyping methods were applied to 110 B. melitensis strains obtained in past several years. By MLVA genotyping, five MLVA-8 genotypes were identified, among which genotypes 42 (1-5-3-13-2-2-3-2) was recognized as the predominant genotype, while genotype 63 (1-5-3-13-2-3-3-2) and a novel genotype of 1-5-3-13-2-4-3-2 were second frequently observed. MLVA-16 discerned a total of 57 MLVA-16 genotypes among these Brucella strains, with 41 genotypes being firstly detected and the other 16 genotypes being previously reported. By BruMLSA21 typing, six sequence types (STs) were identified, among them ST8 is the most frequently seen in China while the other five STs were firstly detected and designated as ST137, ST138, ST139, ST140, and ST141 by international multilocus sequence typing database. Whole-genome sequence (WGS)-single-nucleotide polymorphism (SNP)-based typing and phylogenetic analysis resolved Chinese B. melitensis strains into five clusters, reflecting the existence of multiple lineages among these Chinese B. melitensis strains. In phylogeny, Chinese lineages are more closely related to strains collected from East Mediterranean and Middle East countries, such as Turkey, Kuwait, and Iraq. In the next few years, MLVA typing will certainly remain an important epidemiological tool for Brucella infection analysis, as it displays a high discriminatory ability and achieves result largely in agreement with WGS-SNP-based typing. However, WGS-SNP-based typing is found to be the most powerful and reliable method in discerning Brucella strains and will be popular used in the future.

  10. Targeted binding of the M13 bacteriophage to thiamethoxam organic crystals.

    PubMed

    Cho, Whirang; Fowler, Jeffrey D; Furst, Eric M

    2012-04-10

    Phage display screening with a combinatorial library was used to identify M13-type bacteriophages that express peptides with selective binding to organic crystals of thiamethoxam. The six most strongly binding phages exhibit at least 1000 times the binding affinity of wild-type M13 and express heptapeptide sequences that are rich in hydrophobic, hydrogen-bonding amino acids and proline. Among the peptide sequences identified, M13 displaying the pIII domain heptapeptide ASTLPKA exhibits the strongest binding to thiamethoxam in competitive binding assays. Electron and confocal microscopy confirm the specific binding affinity of ASTLPKA to thiamethoxam. Using atomic force microscope (AFM) probes functionalized with ASTLPKA expressing phage, we found that the average adhesion force between the bacteriophage and a thiamethoxam surface is 1.47 ± 0.80 nN whereas the adhesion force of wild-type M13KE phage is 0.18 ± 0.07 nN. Such a strongly binding bacteriophage could be used to modify the surface chemistry of thiamethoxam crystals and other organic solids with a high degree of specificity. © 2012 American Chemical Society

  11. Genetic mutation analysis of human gastric adenocarcinomas using ion torrent sequencing platform.

    PubMed

    Xu, Zhi; Huo, Xinying; Ye, Hua; Tang, Chuanning; Nandakumar, Vijayalakshmi; Lou, Feng; Zhang, Dandan; Dong, Haichao; Sun, Hong; Jiang, Shouwen; Zhang, Guangchun; Liu, Zhiyuan; Dong, Zhishou; Guo, Baishuai; He, Yan; Yan, Chaowei; Wang, Lu; Su, Ziyi; Li, Yangyang; Gu, Dongying; Zhang, Xiaojing; Wu, Xiaomin; Wei, Xiaowei; Hong, Lingzhi; Zhang, Yangmei; Yang, Jinsong; Gong, Yonglin; Tang, Cuiju; Jones, Lindsey; Huang, Xue F; Chen, Si-Yi; Chen, Jinfei

    2014-01-01

    Gastric cancer is the one of the major causes of cancer-related death, especially in Asia. Gastric adenocarcinoma, the most common type of gastric cancer, is heterogeneous and its incidence and cause varies widely with geographical regions, gender, ethnicity, and diet. Since unique mutations have been observed in individual human cancer samples, identification and characterization of the molecular alterations underlying individual gastric adenocarcinomas is a critical step for developing more effective, personalized therapies. Until recently, identifying genetic mutations on an individual basis by DNA sequencing remained a daunting task. Recent advances in new next-generation DNA sequencing technologies, such as the semiconductor-based Ion Torrent sequencing platform, makes DNA sequencing cheaper, faster, and more reliable. In this study, we aim to identify genetic mutations in the genes which are targeted by drugs in clinical use or are under development in individual human gastric adenocarcinoma samples using Ion Torrent sequencing. We sequenced 737 loci from 45 cancer-related genes in 238 human gastric adenocarcinoma samples using the Ion Torrent Ampliseq Cancer Panel. The sequencing analysis revealed a high occurrence of mutations along the TP53 locus (9.7%) in our sample set. Thus, this study indicates the utility of a cost and time efficient tool such as Ion Torrent sequencing to screen cancer mutations for the development of personalized cancer therapy.

  12. Distribution and factors associated with Salmonella enterica genotypes in a diverse population of humans and animals in Qatar using multi-locus sequence typing (MLST).

    PubMed

    Chang, Yu C; Scaria, Joy; Ibraham, Mariamma; Doiphode, Sanjay; Chang, Yung-Fu; Sultan, Ali; Mohammed, Hussni O

    2016-01-01

    Salmonella enterica is one of the most commonly reported causes of bacterial foodborne illness around the world. Understanding the sources of this pathogen and the associated factors that exacerbate its risk to humans will help in developing risk mitigation strategies. The genetic relatedness among Salmonella isolates recovered from human gastroenteritis cases and food animals in Qatar were investigated in the hope of shedding light on these sources, their possible transmission routes, and any associated factors. A repeat cross-sectional study was conducted in which the samples and associated data were collected from both populations (gastroenteritis cases and animals). Salmonella isolates were initially analyzed using multi-locus sequence typing (MLST) to investigate the genetic diversity and clonality. The relatedness among the isolates was assessed using the minimum spanning tree (MST). Twenty-seven different sequence types (STs) were identified in this study; among them, seven were novel, including ST1695, ST1696, ST1697, ST1698, ST1699, ST1702, and ST1703. The pattern of overall ST distribution was diverse; in particular, it was revealed that ST11 and ST19 were the most common sequence types, presenting 29.5% and 11.5% within the whole population. In addition, 20 eBurst Groups (eBGs) were identified in our data, which indicates that ST11 and ST19 belonged to eBG4 and eBG1, respectively. In addition, the potential association between the putative risk factors and eBGs were evaluated. There was no significant clustering of these eBGs by season; however, a significant association was identified in terms of nationality in that Qataris were six times more likely to present with eBG1 compared to non-Qataris. In the MST analysis, four major clusters were presented, namely, ST11, ST19, ST16, and ST31. The linkages between the clusters alluded to a possible transmission route. The results of the study have provided insight into the ST distributions of S. enterica and their possible zoonotic associations in Qatar. Published by Elsevier Ltd.

  13. De novo assembly and analysis of the Artemisia argyi transcriptome and identification of genes involved in terpenoid biosynthesis.

    PubMed

    Liu, Miaomiao; Zhu, Jinhang; Wu, Shengbing; Wang, Chenkai; Guo, Xingyi; Wu, Jiawen; Zhou, Meiqi

    2018-04-11

    Artemisia argyi Lev. et Vant. (A. argyi) is widely utilized for moxibustion in Chinese medicine, and the mechanism underlying terpenoid biosynthesis in its leaves is suggested to play an important role in its medicinal use. However, the A. argyi transcriptome has not been sequenced. Herein, we performed RNA sequencing for A. argyi leaf, root and stem tissues to identify as many as possible of the transcribed genes. In total, 99,807 unigenes were assembled by analysing the expression profiles generated from the three tissue types, and 67,446 of those unigenes were annotated in public databases. We further performed differential gene expression analysis to compare leaf tissue with the other two tissue types and identified numerous genes that were specifically expressed or up-regulated in leaf tissue. Specifically, we identified multiple genes encoding significant enzymes or transcription factors related to terpenoid synthesis. This study serves as a valuable resource for transcriptome information, as many transcribed genes related to terpenoid biosynthesis were identified in the A. argyi transcriptome, providing a functional genomic basis for additional studies on molecular mechanisms underlying the medicinal use of A. argyi.

  14. IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses

    DOE PAGES

    Paez-Espino, David; Chen, I. -Min A.; Palaniappan, Krishna; ...

    2016-10-30

    Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from > 6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs aremore » grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparingwith external sequences, thus serving as an essential resource in the viral genomics community.« less

  15. IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Paez-Espino, David; Chen, I. -Min A.; Palaniappan, Krishna

    Viruses represent the most abundant life forms on the planet. Recent experimental and computational improvements have led to a dramatic increase in the number of viral genome sequences identified primarily from metagenomic samples. As a result of the expanding catalog of metagenomic viral sequences, there exists a need for a comprehensive computational platform integrating all these sequences with associated metadata and analytical tools. Here we present IMG/VR (https://img.jgi.doe.gov/vr/), the largest publicly available database of 3908 isolate reference DNA viruses with 264 413 computationally identified viral contigs from > 6000 ecologically diverse metagenomic samples. Approximately half of the viral contigs aremore » grouped into genetically distinct quasi-species clusters. Microbial hosts are predicted for 20 000 viral sequences, revealing nine microbial phyla previously unreported to be infected by viruses. Viral sequences can be queried using a variety of associated metadata, including habitat type and geographic location of the samples, or taxonomic classification according to hallmark viral genes. IMG/VR has a user-friendly interface that allows users to interrogate all integrated data and interact by comparingwith external sequences, thus serving as an essential resource in the viral genomics community.« less

  16. The diversity of Klebsiella pneumoniae surface polysaccharides.

    PubMed

    Follador, Rainer; Heinz, Eva; Wyres, Kelly L; Ellington, Matthew J; Kowarik, Michael; Holt, Kathryn E; Thomson, Nicholas R

    2016-08-01

    Klebsiella pneumoniae is considered an urgent health concern due to the emergence of multi-drug-resistant strains for which vaccination offers a potential remedy. Vaccines based on surface polysaccharides are highly promising but need to address the high diversity of surface-exposed polysaccharides, synthesized as O-antigens (lipopolysaccharide, LPS) and K-antigens (capsule polysaccharide, CPS), present in K. pneumoniae . We present a comprehensive and clinically relevant study of the diversity of O- and K-antigen biosynthesis gene clusters across a global collection of over 500 K. pneumoniae whole-genome sequences and the seroepidemiology of human isolates from different infection types. Our study defines the genetic diversity of O- and K-antigen biosynthesis cluster sequences across this collection, identifying sequences for known serotypes as well as identifying novel LPS and CPS gene clusters found in circulating contemporary isolates. Serotypes O1, O2 and O3 were most prevalent in our sample set, accounting for approximately 80 % of all infections. In contrast, K serotypes showed an order of magnitude higher diversity and differ among infection types. In addition we investigated a potential association of O or K serotypes with phylogenetic lineage, infection type and the presence of known virulence genes. K1 and K2 serotypes, which are associated with hypervirulent K. pneumoniae , were associated with a higher abundance of virulence genes and more diverse O serotypes compared to other common K serotypes.

  17. Mechanistic and Technical Challenges in Studying the Human Microbiome and Cancer Epidemiology.

    PubMed

    Verma, Mukesh

    2017-04-01

    This article reviews the significance of the microbiome in cancer epidemiology, mechanistic and technical challenges in the field, and characterization of the microbiome in different tumor types to identify biomarkers of risk, progression, and prognosis. Publications on the microbiome and cancer epidemiology were reviewed to analyze sample collection and processing, microbiome taxa characterization by 16S ribosomal RNA sequencing, and microbiome metabolite characterization (metabotyping) by nuclear magnetic resonance and mass spectrometry. The analysis identified methodology types, research design, sample types, and issues in integrating data from different platforms. Aerodigestive cancer epidemiology studies conducted by different groups demonstrated the significance of microbiome information in developing approaches to improve health. Challenges exist in sample preparation and processing (eg, standardization of methods for collection and analysis). These challenges relate to technology, data integration from "omics" studies, inherent bias in primer selection during 16S ribosomal RNA sequencing, the need for large consortia with well-characterized biospecimens, cause and effect issues, resilience of microbiota to exposure events (requires longitudinal studies), and expanding studies for fungal and viral diversity (most studies used bacterial 16S ribosomal RNA sequencing for microbiota characterization). Despite these challenges, microbiome and cancer epidemiology studies are significant and may facilitate cancer risk assessment, diagnosis, and prognosis. In the future, clinical trials likely will use microbiota modifications to improve the efficacy of existing treatments.

  18. Carriage and acquisition rates of Clostridium difficile in hospitalized horses, including molecular characterization, multilocus sequence typing and antimicrobial susceptibility of bacterial isolates.

    PubMed

    Rodriguez, C; Taminiau, B; Brévers, B; Avesani, V; Van Broeck, J; Leroux, A A; Amory, H; Delmée, M; Daube, G

    2014-08-06

    Clostridium difficile has been identified as a significant agent of diarrhoea and enterocolitis in both foals and adult horses. Hospitalization, antibiotic therapy or changes in diet may contribute to the development of C. difficile infection. Horses admitted to a care unit are therefore at greater risk of being colonized. The aim of this study was to investigate the carriage of C. difficile in hospitalized horses and the possible influence of some risk factors in colonization. During a seven-month period, faecal samples and data relating the clinical history of horses admitted to a veterinary teaching hospital were collected. C. difficile isolates were characterized through toxin profiles, cytotoxicity activity, PCR-ribotyping, antimicrobial resistance and multilocus sequence typing (MLST). Ten isolates were obtained with a total of seven different PCR-ribotypes, including PCR-ribotype 014. Five of them were identified as toxinogenic. A high resistance to gentamicin, clindamycin and ceftiofur was found. MLST revealed four different sequencing types (ST), which included ST11, ST26, ST2 and ST15, and phylogenetic analysis showed that most of the isolates clustered in the same lineage. Clinical history suggests that horses frequently harbour toxigenic and non-toxigenic C. difficile and that in most cases they are colonized regardless of the reason for hospitalization; the development of diarrhoea is more unusual. Copyright © 2014 Elsevier B.V. All rights reserved.

  19. The diversity of Klebsiella pneumoniae surface polysaccharides

    PubMed Central

    Heinz, Eva; Wyres, Kelly L.; Ellington, Matthew J.; Kowarik, Michael; Holt, Kathryn E.; Thomson, Nicholas R.

    2016-01-01

    Klebsiella pneumoniae is considered an urgent health concern due to the emergence of multi-drug-resistant strains for which vaccination offers a potential remedy. Vaccines based on surface polysaccharides are highly promising but need to address the high diversity of surface-exposed polysaccharides, synthesized as O-antigens (lipopolysaccharide, LPS) and K-antigens (capsule polysaccharide, CPS), present in K. pneumoniae. We present a comprehensive and clinically relevant study of the diversity of O- and K-antigen biosynthesis gene clusters across a global collection of over 500 K. pneumoniae whole-genome sequences and the seroepidemiology of human isolates from different infection types. Our study defines the genetic diversity of O- and K-antigen biosynthesis cluster sequences across this collection, identifying sequences for known serotypes as well as identifying novel LPS and CPS gene clusters found in circulating contemporary isolates. Serotypes O1, O2 and O3 were most prevalent in our sample set, accounting for approximately 80 % of all infections. In contrast, K serotypes showed an order of magnitude higher diversity and differ among infection types. In addition we investigated a potential association of O or K serotypes with phylogenetic lineage, infection type and the presence of known virulence genes. K1 and K2 serotypes, which are associated with hypervirulent K. pneumoniae, were associated with a higher abundance of virulence genes and more diverse O serotypes compared to other common K serotypes. PMID:28348868

  20. Detection of non-polio enteroviruses in Hungary 2000-2008 and molecular epidemiology of enterovirus 71, coxsackievirus A16, and echovirus 30.

    PubMed

    Kapusinszky, Beatrix; Szomor, Katalin N; Farkas, Agnes; Takács, Mária; Berencsi, György

    2010-04-01

    Human enteroviruses are associated with various clinical syndromes from minor febrile illness to severe, potentially fatal conditions like aseptic meningitis, paralysis, myocarditis, and neonatal enteroviral sepsis. Between June 2000 and August 2008 echovirus (E) type 2, 4, 6, 7, 9, 11, 13, 25, 30, coxsackievirus (CV) -A16, -A19, -B5, and enterovirus 71 (EV71) were reported in Hungary. In this study, 29 previously enterovirus positive samples from 28 patients diagnosed with hand, foot and mouth disease, meningitis and encephalitis, were molecularly typed. The genetic relationships of identified serotypes CV-A16, EV71, and E30 were assessed by direct sequencing of genomic region encoding the capsid protein VP1. The sequences were compared to each other and sequences from other geographical regions possessed in Genbank. The phylogenetic analysis of CV-A16 revealed that the viruses were mostly of Far-Eastern or Asia-Pacific origin. Typing of EV71 showed that one virus from 2000 belonged to genotype C1 and five viruses observed in 2004 and 2005 were identified as genotype C4. The 11 echovirus 30 strains showed homology with those of neighbor European countries. The molecular examination of E30 revealed that three separate lineages circulated in 2000, 2001, and 2004-2006 in Hungary.

  1. Mechanistic and Technical Challenges in Studying the Human Microbiome and Cancer Epidemiology

    PubMed Central

    2016-01-01

    This article reviews the significance of the microbiome in cancer epidemiology, mechanistic and technical challenges in the field, and characterization of the microbiome in different tumor types to identify biomarkers of risk, progression, and prognosis. Publications on the microbiome and cancer epidemiology were reviewed to analyze sample collection and processing, microbiome taxa characterization by 16S ribosomal RNA sequencing, and microbiome metabolite characterization (metabotyping) by nuclear magnetic resonance and mass spectrometry. The analysis identified methodology types, research design, sample types, and issues in integrating data from different platforms. Aerodigestive cancer epidemiology studies conducted by different groups demonstrated the significance of microbiome information in developing approaches to improve health. Challenges exist in sample preparation and processing (eg, standardization of methods for collection and analysis). These challenges relate to technology, data integration from “omics” studies, inherent bias in primer selection during 16S ribosomal RNA sequencing, the need for large consortia with well-characterized biospecimens, cause and effect issues, resilience of microbiota to exposure events (requires longitudinal studies), and expanding studies for fungal and viral diversity (most studies used bacterial 16S ribosomal RNA sequencing for microbiota characterization). Despite these challenges, microbiome and cancer epidemiology studies are significant and may facilitate cancer risk assessment, diagnosis, and prognosis. In the future, clinical trials likely will use microbiota modifications to improve the efficacy of existing treatments. PMID:27121074

  2. The complete Einstein Observatory X-ray survey of the Orion Nebula region.

    NASA Technical Reports Server (NTRS)

    Gagne, Marc; Caillault, Jean-Pierre

    1994-01-01

    We have analyzed archival Einstein Observatory images of a roughly 4.5 square degree region centered on the Orion Nebula. In all, 245 distinct X-ray sources have been detected in six High Resolution Imager (HRI) and 17 Imaging Proportional Counter (IPC) observations. An optical database of over 2700 stars has been assembled to search for candidate counterparts to the X-ray sources. Roughly half the X-ray sources are identified with a single Orion Nebula cluster member. The 10 main-sequence O6-B5 cluster stars detected in Orion have X-ray activity levels comparable to field O and B stars. X-ray emission has also been detected in the direction of four main-sequence late-B and early-A type stars. Since the mechanisms producing X-rays in late-type coronae and early-type winds cannot operate in the late-B and early-A type atmospheres, we argue that the observed X-rays, with L(sub X) approximately = 3 x 10(exp 30) ergs/s, are probably produced in the coronae of unseen late-type binary companions. Over 100 X-ray sources have been associated with late-type pre-main sequence stars. The upper envelope of X-ray activity rises sharply from mid-F to late-G, with L(sub x)/L(sub bol) in the range 10(exp -4) to 2 x 10(exp -3) for stars later than approximately G7. We have looked for variability of the late-type cluster members on timescales of a day to a year and find that 1/4 of the stars show significantly variable X-ray emission. A handful of the late-type stars have published rotational periods and spectroscopic rotational velocities; however, we see no correlation between X-ray activity and rotation. Thus, for this sample of pre-main-sequence stars, the large dispersion in X-ray activity does not appear to be caused by the dispersion in rotation, in contrast with results obtained for low-mass main-sequence stars in the Pleiades and pre-main-sequence stars in Taurus-Auriga.

  3. Use of Whole Genome Sequencing and Patient Interviews To Link a Case of Sporadic Listeriosis to Consumption of Prepackaged Lettuce

    PubMed Central

    Jackson, K. A.; Stroika, S.; Katz, L. S.; Beal, J.; Brandt, E.; Nadon, C.; Reimer, A.; Major, B.; Conrad, A.; Tarr, C.; Jackson, B. R.; Mody, R. K.

    2016-01-01

    We report on a case of listeriosis in a patient who probably consumed a prepackaged romaine lettuce–containing product recalled for Listeria monocytogenes contamination. Although definitive epidemiological information demonstrating exposure to the specific recalled product was lacking, the patient reported consumption of a prepackaged romaine lettuce–containing product of either the recalled brand or a different brand. A multinational investigation found that patient and food isolates from the recalled product were indistinguishable by pulsed-field gel electrophoresis and were highly related by whole genome sequencing, differing by four alleles by whole genome multilocus sequence typing and by five high-quality single nucleotide polymorphisms, suggesting a common source. To our knowledge, this is the first time prepackaged lettuce has been identified as a likely source for listeriosis. This investigation highlights the power of whole genome sequencing, as well as the continued need for timely and thorough epidemiological exposure data to identify sources of foodborne infections. PMID:27296429

  4. Integrative analysis workflow for the structural and functional classification of C-type lectins

    PubMed Central

    2011-01-01

    Background It is important to understand the roles of C-type lectins in the immune system due to their ubiquity and diverse range of functions in animal cells. It has been observed that currently confirmed C-type lectins share a highly conserved domain known as the C-type carbohydrate recognition domain (CRD). Using the sequence profile of the CRD, an increasing number of putative C-type lectins have been identified. Hence, it is highly needed to develop a systematic framework that enables us to elucidate their carbohydrate (glycan) recognition function, and discover their physiological and pathological roles. Results Presented herein is an integrated workflow for characterizing the sequence and structural features of novel C-type lectins. Our workflow utilizes web-based queries and available software suites to annotate features that can be found on the C-type lectin, given its amino acid sequence. At the same time, it incorporates modeling and analysis of glycans - a major class of ligands that interact with C-type lectins. Thereafter, the results are analyzed together with context-specific knowledge to filter off unlikely predictions. This allows researchers to design their subsequent experiments to confirm the functions of the C-type lectins in a systematic manner. Conclusions The efficacy and usefulness of our proposed immunoinformatics workflow was demonstrated by applying our integrated workflow to a novel C-type lectin -CLEC17A - and we report some of its possible functions that warrants further validation through wet-lab experiments. PMID:22372988

  5. In Search for Pheromone Receptors: Certain Members of the Odorant Receptor Family in the Desert Locust Schistocerca gregaria (Orthoptera: Acrididae) Are Co-expressed with SNMP1.

    PubMed

    Pregitzer, Pablo; Jiang, Xingcong; Grosse-Wilde, Ewald; Breer, Heinz; Krieger, Jürgen; Fleischer, Joerg

    2017-01-01

    Under given environmental conditions, the desert locust ( Schistocera gregaria ) forms destructive migratory swarms of billions of animals, leading to enormous crop losses in invaded regions. Swarm formation requires massive reproduction as well as aggregation of the animals. Pheromones that are detected via the olfactory system have been reported to control both reproductive and aggregation behavior. However, the molecular basis of pheromone detection in the antennae of Schistocerca gregaria is unknown. As an initial step to disclose pheromone receptors, we sequenced the antennal transcriptome of the desert locust. By subsequent bioinformatical approaches, 119 distinct nucleotide sequences encoding candidate odorant receptors (ORs) were identified. Phylogenetic analyses employing the identified ORs from Schistocerca gregaria (SgreORs) and OR sequences from the related species Locusta migratoria revealed a group of locust ORs positioned close to the root, i.e. at a basal site in a phylogenetic tree. Within this particular OR group (termed basal or b-OR group), the locust OR sequences were strictly orthologous, a trait reminiscent of pheromone receptors from lepidopteran species. In situ hybridization experiments with antennal tissue demonstrated expression of b-OR types from Schistocerca gregaria in olfactory sensory neurons (OSNs) of either sensilla trichodea or sensilla basiconica, both of which have been reported to respond to pheromonal substances. More importantly, two-color fluorescent in situ hybridization experiments showed that most b-OR types were expressed in cells co-expressing the "sensory neuron membrane protein 1" (SNMP1), a marker indicative of pheromone-sensitive OSNs in insects. Analyzing the expression of a larger number of SgreOR types outside the b-OR group revealed that only a few of them were co-expressed with SNMP1. In summary, we have identified several candidate pheromone receptors from Schistocerca gregaria that could mediate responses to pheromones implicated in controlling reproduction and aggregation behavior.

  6. HPV Genotyping of Modified General Primer-Amplicons Is More Analytically Sensitive and Specific by Sequencing than by Hybridization

    PubMed Central

    Meisal, Roger; Rounge, Trine Ballestad; Christiansen, Irene Kraus; Eieland, Alexander Kirkeby; Worren, Merete Molton; Molden, Tor Faksvaag; Kommedal, Øyvind; Hovig, Eivind; Leegaard, Truls Michael

    2017-01-01

    Sensitive and specific genotyping of human papillomaviruses (HPVs) is important for population-based surveillance of carcinogenic HPV types and for monitoring vaccine effectiveness. Here we compare HPV genotyping by Next Generation Sequencing (NGS) to an established DNA hybridization method. In DNA isolated from urine, the overall analytical sensitivity of NGS was found to be 22% higher than that of hybridization. NGS was also found to be the most specific method and expanded the detection repertoire beyond the 37 types of the DNA hybridization assay. Furthermore, NGS provided an increased resolution by identifying genetic variants of individual HPV types. The same Modified General Primers (MGP)-amplicon was used in both methods. The NGS method is described in detail to facilitate implementation in the clinical microbiology laboratory and includes suggestions for new standards for detection and calling of types and variants with improved resolution. PMID:28045981

  7. HPV Genotyping of Modified General Primer-Amplicons Is More Analytically Sensitive and Specific by Sequencing than by Hybridization.

    PubMed

    Meisal, Roger; Rounge, Trine Ballestad; Christiansen, Irene Kraus; Eieland, Alexander Kirkeby; Worren, Merete Molton; Molden, Tor Faksvaag; Kommedal, Øyvind; Hovig, Eivind; Leegaard, Truls Michael; Ambur, Ole Herman

    2017-01-01

    Sensitive and specific genotyping of human papillomaviruses (HPVs) is important for population-based surveillance of carcinogenic HPV types and for monitoring vaccine effectiveness. Here we compare HPV genotyping by Next Generation Sequencing (NGS) to an established DNA hybridization method. In DNA isolated from urine, the overall analytical sensitivity of NGS was found to be 22% higher than that of hybridization. NGS was also found to be the most specific method and expanded the detection repertoire beyond the 37 types of the DNA hybridization assay. Furthermore, NGS provided an increased resolution by identifying genetic variants of individual HPV types. The same Modified General Primers (MGP)-amplicon was used in both methods. The NGS method is described in detail to facilitate implementation in the clinical microbiology laboratory and includes suggestions for new standards for detection and calling of types and variants with improved resolution.

  8. Single-nucleus analysis of accessible chromatin in developing mouse forebrain reveals cell-type-specific transcriptional regulation.

    PubMed

    Preissl, Sebastian; Fang, Rongxin; Huang, Hui; Zhao, Yuan; Raviram, Ramya; Gorkin, David U; Zhang, Yanxiao; Sos, Brandon C; Afzal, Veena; Dickel, Diane E; Kuan, Samantha; Visel, Axel; Pennacchio, Len A; Zhang, Kun; Ren, Bing

    2018-03-01

    Analysis of chromatin accessibility can reveal transcriptional regulatory sequences, but heterogeneity of primary tissues poses a significant challenge in mapping the precise chromatin landscape in specific cell types. Here we report single-nucleus ATAC-seq, a combinatorial barcoding-assisted single-cell assay for transposase-accessible chromatin that is optimized for use on flash-frozen primary tissue samples. We apply this technique to the mouse forebrain through eight developmental stages. Through analysis of more than 15,000 nuclei, we identify 20 distinct cell populations corresponding to major neuronal and non-neuronal cell types. We further define cell-type-specific transcriptional regulatory sequences, infer potential master transcriptional regulators and delineate developmental changes in forebrain cellular composition. Our results provide insight into the molecular and cellular dynamics that underlie forebrain development in the mouse and establish technical and analytical frameworks that are broadly applicable to other heterogeneous tissues.

  9. Degradation of methyl bromide and methyl chloride in soil microcosms: Use of stable C isotope fractionation and stable isotope probing to identify reactions and the responsible microorganisms

    USGS Publications Warehouse

    Miller, L.G.; Warner, K.L.; Baesman, S.M.; Oremland, R.S.; McDonald, I.R.; Radajewski, S.; Murrell, J.C.

    2004-01-01

    Bacteria in soil microcosm experiments oxidized elevated levels of methyl chloride (MeCl) and methyl bromide (MeBr), the former compound more rapidly than the latter. MeBr was also removed by chemical reactions while MeCl was not. Chemical degradation dominated the early removal of MeBr and accounted for more than half of its total loss. Fractionation of stable carbon isotopes during chemical degradation of MeBr resulted in a kinetic isotope effect (KIE) of 59 ?? 7???. Soil bacterial oxidation dominated the later removal of MeBr and MeCl and was characterized by different KIEs for each compound. The KIE for MeBr oxidation was 69 ?? 9??? and the KIE for MeCl oxidation was 49 ?? 3???. Stable isotope probing revealed that different populations of soil bacteria assimilated added 13C-labeled MeBr and MeCl. The identity of the active MeBr and MeCl degrading bacteria in soil was determined by analysis of 16S rRNA gene sequences amplified from 13C-DNA fractions, which identified a number of sequences from organisms not previously thought to be involved in methyl halide degradation. These included Burkholderia , the major clone type in the 13C-MeBr fraction, and Rhodobacter, Lysobacter and Nocardioides the major clone types in the 13C-MeCl fraction. None of the 16S rRNA gene sequences for methyl halide oxidizing bacteria currently in culture (including Aminobacter strain IMB-1 isolated from fumigated soil) were identified. Functional gene clone types closely related to Aminobacter spp. were identified in libraries containing the sequences for the cmuA gene, which codes for the enzyme known to catalyze the initial step in the oxidation of MeBr and MeCl. The cmuA gene was limited to members of the alpha-Proteobacteria whereas the greater diversity demonstrated by the 16S rRNA gene may indicate that other enzymes catalyze methyl halide oxidation in different groups of bacteria. Copyright ?? 2004 Elsevier Ltd.

  10. Degradation of methyl bromide and methyl chloride in soil microcosms: Use of stable C isotope fractionation and stable isotope probing to identify reactions and the responsible microorganisms

    NASA Astrophysics Data System (ADS)

    Miller, Laurence G.; Warner, Karen L.; Baesman, Shaun M.; Oremland, Ronald S.; McDonald, Ian R.; Radajewski, Stefan; Murrell, J. Colin

    2004-08-01

    Bacteria in soil microcosm experiments oxidized elevated levels of methyl chloride (MeCl) and methyl bromide (MeBr), the former compound more rapidly than the latter. MeBr was also removed by chemical reactions while MeCl was not. Chemical degradation dominated the early removal of MeBr and accounted for more than half of its total loss. Fractionation of stable carbon isotopes during chemical degradation of MeBr resulted in a kinetic isotope effect (KIE) of 59 ± 7‰. Soil bacterial oxidation dominated the later removal of MeBr and MeCl and was characterized by different KIEs for each compound. The KIE for MeBr oxidation was 69 ± 9‰ and the KIE for MeCl oxidation was 49 ± 3‰. Stable isotope probing revealed that different populations of soil bacteria assimilated added 13C-labeled MeBr and MeCl. The identity of the active MeBr and MeCl degrading bacteria in soil was determined by analysis of 16S rRNA gene sequences amplified from 13C-DNA fractions, which identified a number of sequences from organisms not previously thought to be involved in methyl halide degradation. These included Burkholderia, the major clone type in the 13C-MeBr fraction, and Rhodobacter, Lysobacter and Nocardioides the major clone types in the 13C-MeCl fraction. None of the 16S rRNA gene sequences for methyl halide oxidizing bacteria currently in culture (including Aminobacter strain IMB-1 isolated from fumigated soil) were identified. Functional gene clone types closely related to Aminobacter spp. were identified in libraries containing the sequences for the cmuA gene, which codes for the enzyme known to catalyze the initial step in the oxidation of MeBr and MeCl. The cmuA gene was limited to members of the alpha-Proteobacteria whereas the greater diversity demonstrated by the 16S rRNA gene may indicate that other enzymes catalyze methyl halide oxidation in different groups of bacteria.

  11. First identification of porcine parvovirus 6 in Poland.

    PubMed

    Cui, Jin; Fan, Jinghui; Gerber, Priscilla F; Biernacka, Kinga; Stadejek, Tomasz; Xiao, Chao-Ting; Opriessnig, Tanja

    2017-02-01

    Porcine parvovirus type 1 is a major causative agent of swine reproductive failure. During the past decade, several new parvoviruses have been discovered in pigs. Porcine parvovirus type 6 (PPV6), recently identified, has been reported in pigs in China and in the USA while the PPV6 status in the European pig population remains undetermined. In the present study, PPV6 DNA was identified in serum samples collected from domestic pigs in Poland. In investigated herds, the prevalence of PPV6 was 14.9 % (15/101 samples). Sequencing was conducted, and 11 nearly complete PPV6 genomes were obtained. Phylogenetic analysis indicated that PPV6 sequences cluster into four distinct groups, and the Polish PPV6 strains from three individual farms were present in three of these four groups. In addition, the Polish PPV6 strain P15-1 was identified as a putative recombination of an ORF1 from US stains and an ORF2 from Chinese strains. This is the first identification of PPV6 in Europe, and this finding will encourage future epidemiological studies on parvoviruses in European pigs.

  12. Polymorphisms of cytochrome b gene in Leishmania parasites and their relation to types of cutaneous leishmaniasis lesions in Pakistan.

    PubMed

    Myint, Chomar Kaung; Asato, Yutaka; Yamamoto, Yu-ichi; Kato, Hirotomo; Bhutto, Abdul M; Soomro, Farooq R; Memon, Muhamad Z; Matsumoto, Jun; Marco, Jorge D; Oshiro, Minoru; Katakura, Ken; Hashiguchi, Yoshihisa; Uezato, Hiroshi

    2008-02-01

    The exact species and/or strains of Leishmania parasites involved strongly influence the clinical and epidemiological features of leishmaniasis, and current knowledge of those influences and relationships is inadequate. We report that cytochrome b (cyt b) gene sequencing identified causal Leishmania parasites of 69 cutaneous leishmaniasis cases in Pakistan over a 3-year period. Of 21 cases in highland areas (Quetta city, Balochistan province), 16 (76.2%) were identified as Leishmania (L.) tropica and five (23.8%) as Leishmania (L.) major. Of 48 cases from lowland areas, cities/villages in Indus valley in Sindh and Balochistan provinces, 47 (97.9%) were identified as L. (L.) major and one (2.1%) as L. (L.) tropica. Statistical analysis (Fisher's exact test) revealed a significant difference (P < 0.0001) in the distribution of the two species by altitude; L. (L.) major is predominant in lowland and L. (L.) tropica at highland areas. The present result enriched our earlier finding, based on the first year's cultured parasite data, that only L. (L.) tropica was found in highland areas and only L. (L.) major in lowland areas. Among Leishmania samples analyzed, three types of cyt b polymorphism of L. (L.) major were found, including 45 (86.5%) cases of type I, six (11.5%) of type II and one (2%) of type III. We report for the first time on the presence of polymorphisms in L. (L.) major (types I, II and III) based on species identification using cyt b gene sequencing from clinical samples. Moreover, we found no correlation between clinical presentation (wet-, dry- and/or mixed-types of cutaneous lesions) and causal Leishmania parasites.

  13. Identification of Bacillus spp. from Bikalga, fermented seeds of Hibiscus sabdariffa: phenotypic and genotypic characterization.

    PubMed

    Ouoba, L I I; Parkouda, C; Diawara, B; Scotti, C; Varnam, A H

    2008-01-01

    To identify Bacillus spp. responsible of the fermentation of Hibiscus sabdariffa for production of Bikalga, an alkaline fermented food used as a condiment in Burkina Faso. Seventy bacteria were isolated from Bikalga produced in different regions of Burkina Faso and identified by phenotyping and genotyping using PCR amplification of the 16S-23S rDNA intergenic transcribed spacer (ITS-PCR), repetitive sequence-based PCR (rep-PCR) and DNA sequencing. The isolates were characterized as motile, rod-shaped, endospore forming, catalase positive, Gram-positive bacteria. ITS-PCR allowed typing mainly at species level. Rep-PCR was more discriminative and allowed a typing at ssp. level. The DNA sequencing combined with the Blast search program and fermentation profiles using API 50CHB system allowed an identification of the bacteria as Bacillus subtilis, B. licheniformis, B. cereus, B. pumilus, B. badius, Brevibacillus bortelensis, B. sphaericus and B. fusiformis. B. subtilis were the predominant bacterium (42) followed by B. licheniformis (16). Various species and ssp. of Bacillus are involved in fermentation of H. sabdariffa for production of Bikalga. Selection of starter cultures of Bacillus for controlled production of Bikalga, selection of probiotic bacteria.

  14. Identification and correction of systematic error in high-throughput sequence data

    PubMed Central

    2011-01-01

    Background A feature common to all DNA sequencing technologies is the presence of base-call errors in the sequenced reads. The implications of such errors are application specific, ranging from minor informatics nuisances to major problems affecting biological inferences. Recently developed "next-gen" sequencing technologies have greatly reduced the cost of sequencing, but have been shown to be more error prone than previous technologies. Both position specific (depending on the location in the read) and sequence specific (depending on the sequence in the read) errors have been identified in Illumina and Life Technology sequencing platforms. We describe a new type of systematic error that manifests as statistically unlikely accumulations of errors at specific genome (or transcriptome) locations. Results We characterize and describe systematic errors using overlapping paired reads from high-coverage data. We show that such errors occur in approximately 1 in 1000 base pairs, and that they are highly replicable across experiments. We identify motifs that are frequent at systematic error sites, and describe a classifier that distinguishes heterozygous sites from systematic error. Our classifier is designed to accommodate data from experiments in which the allele frequencies at heterozygous sites are not necessarily 0.5 (such as in the case of RNA-Seq), and can be used with single-end datasets. Conclusions Systematic errors can easily be mistaken for heterozygous sites in individuals, or for SNPs in population analyses. Systematic errors are particularly problematic in low coverage experiments, or in estimates of allele-specific expression from RNA-Seq data. Our characterization of systematic error has allowed us to develop a program, called SysCall, for identifying and correcting such errors. We conclude that correction of systematic errors is important to consider in the design and interpretation of high-throughput sequencing experiments. PMID:22099972

  15. [A surveillance study on CRISPR/Cas molecular biomarker in Escherichia coli].

    PubMed

    Liang, W J; Zhang, R G; Duan, G C; Hong, L J; Zhang, B; Xi, Y L; Yang, H Y; Chen, S Y; Lou, T Y; Zhao, Y X

    2016-08-10

    A new method related to molecular biomarker with CRISPR/Cas (clustered regularly interspaced short palindromic repeats-cas) in Escherichia (E.) coli was developed and used for surveillance programs. CRISPR/Cas sequence that containing 135 strains with complete sequence and 203 strains with whole genome shotgun sequence of E. coli in GenBank by BLAST and 361 strains of E. coli (including 38 strains of E. coli O157∶H7) in laboratory were identified by PCR and analyzed with the CRISPR Finder. Spacers were compared with DANMAN and the phylogenetic trees of cas gene were constructed under Clustal Ⅹ and Mega 5.1. With new perspective, a descriptive method was developed targeting on the position of CRISPR/cas in E. coli. The CRISPR1 was detected in 77.04%, 100.00% and 75.62% and the CRISPR2 was detected in 74.81%, 100.00% and 92.24% and the CRISPR3 and CRISPR4 were detected in 11.85%, 0 and 1.39% for 135 strains with complete sequence, 203 strains with whole genome shotgun sequence and 361 strains in the laboratory, respectively. One strain downloaded in GenBank with whole genome sequencing and 2 strains in the our laboratory were identified that containing four CRISPR locus. The other E. coli strain was with insertion sequence in downstream of the non-cas CRISPR1. The unique CRISPR was found in 8 strains of O55∶H7, in 180 strains of O157∶H7, in 8 strains of O157∶HNM, in 40 strains of O104∶H4, in 4 strains of O145∶H28, in all the 699 E. coli strains. The phylogenetic tree could be divided into two groups-cas with type I-E or type I-F. CRISPR/Cas might be used as a valuable molecular biomarker in epidemiological surveillance studies to identify the high virulent strains or new strains of E. coli. Phage night be related to the missing or obtaining of spacers.

  16. Analyses of Sporocarps, Morphotyped Ectomycorrhizae, Environmental ITS and LSU Sequences Identify Common Genera that Occur at a Periglacial Site

    PubMed Central

    Jumpponen, Ari; Brown, Shawn P.; Trappe, James M.; Cázares, Efrén; Strömmer, Rauni

    2015-01-01

    Periglacial substrates exposed by retreating glaciers represent extreme and sensitive environments defined by a variety of abiotic stressors that challenge organismal establishment and survival. The simple communities often residing at these sites enable their analyses in depth. We utilized existing data and mined published sporocarp, morphotyped ectomycorrhizae (ECM), as well as environmental sequence data of internal transcribed spacer (ITS) and large subunit (LSU) regions of the ribosomal RNA gene to identify taxa that occur at a glacier forefront in the North Cascades Mountains in Washington State in the USA. The discrete data types consistently identified several common and widely distributed genera, perhaps best exemplified by Inocybe and Laccaria. Although we expected low diversity and richness, our environmental sequence data included 37 ITS and 26 LSU operational taxonomic units (OTUs) that likely form ECM. While environmental surveys of metabarcode markers detected large numbers of targeted ECM taxa, both the fruiting body and the morphotype datasets included genera that were undetected in either of the metabarcode datasets. These included hypogeous (Hymenogaster) and epigeous (Lactarius) taxa, some of which may produce large sporocarps but may possess small and/or spatially patchy genets. We highlight the importance of combining various data types to provide a comprehensive view of a fungal community, even in an environment assumed to host communities of low species richness and diversity. PMID:29376900

  17. Somatosensory neuron types identified by high-coverage single-cell RNA-sequencing and functional heterogeneity

    PubMed Central

    Li, Chang-Lin; Li, Kai-Cheng; Wu, Dan; Chen, Yan; Luo, Hao; Zhao, Jing-Rong; Wang, Sa-Shuang; Sun, Ming-Ming; Lu, Ying-Jin; Zhong, Yan-Qing; Hu, Xu-Ye; Hou, Rui; Zhou, Bei-Bei; Bao, Lan; Xiao, Hua-Sheng; Zhang, Xu

    2016-01-01

    Sensory neurons are distinguished by distinct signaling networks and receptive characteristics. Thus, sensory neuron types can be defined by linking transcriptome-based neuron typing with the sensory phenotypes. Here we classify somatosensory neurons of the mouse dorsal root ganglion (DRG) by high-coverage single-cell RNA-sequencing (10 950 ± 1 218 genes per neuron) and neuron size-based hierarchical clustering. Moreover, single DRG neurons responding to cutaneous stimuli are recorded using an in vivo whole-cell patch clamp technique and classified by neuron-type genetic markers. Small diameter DRG neurons are classified into one type of low-threshold mechanoreceptor and five types of mechanoheat nociceptors (MHNs). Each of the MHN types is further categorized into two subtypes. Large DRG neurons are categorized into four types, including neurexophilin 1-expressing MHNs and mechanical nociceptors (MNs) expressing BAI1-associated protein 2-like 1 (Baiap2l1). Mechanoreceptors expressing trafficking protein particle complex 3-like and Baiap2l1-marked MNs are subdivided into two subtypes each. These results provide a new system for cataloging somatosensory neurons and their transcriptome databases. PMID:26691752

  18. Genetic analysis of human immunodeficiency virus type 1 envelope V3 region isolates from mothers and infants after perinatal transmission.

    PubMed Central

    Ahmad, N; Baroudy, B M; Baker, R C; Chappey, C

    1995-01-01

    The human immunodeficiency virus type 1 (HIV-1) sequences from variable region 3 (V3) of the envelope gene were analyzed from seven infected mother-infant pairs following perinatal transmission. The V3 region sequences directly derived from the DNA of the uncultured peripheral blood mononuclear cells from infected mothers displayed a heterogeneous population. In contrast, the infants' sequences were less diverse than those of their mothers. In addition, the sequences from the younger infants' peripheral blood mononuclear cell DNA were more homogeneous than the older infants' sequences. All infants' sequences were different but displayed patterns similar to those seen in their mothers. In the mother-infant pair sequences analyzed, a minor genotype or subtype found in the mothers predominated in their infants. The conserved N-linked glycosylation site proximal to the first cysteine of the V3 loop was absent only in one infant's sequence set and in some variants of two other infants' sequences. Furthermore, the HIV-1 sequences of the epidemiologically linked mother-infant pairs were closer than the sequences of epidemiologically unlinked individuals, suggesting that the sequence comparison of mother-infant pairs done in order to identify genetic variants transmitted from mother to infant could be performed even in older infants. There was no evidence for transmission of a major genotype or multiple genotypes from mother to infant. In conclusion, a minor genotype of maternal virus is transmitted to the infants, and this finding could be useful in developing strategies to prevent maternal transmission of HIV-1 by means of perinatal interventions. PMID:7815476

  19. Venom proteomic and venomous glands transcriptomic analysis of the Egyptian scorpion Scorpio maurus palmatus (Arachnida: Scorpionidae).

    PubMed

    Abdel-Rahman, Mohamed A; Quintero-Hernandez, Veronica; Possani, Lourival D

    2013-11-01

    Proteomic analysis of the scorpion venom Scorpio maurus palmatus was performed using reverse-phase HPLC separation followed by mass spectrometry determination. Sixty five components were identified with molecular masses varying from 413 to 14,009 Da. The high percentage of peptides (41.5%) was from 3 to 5 KDa which may represent linear antimicrobial peptides and KScTxs. Also, 155 expressed sequence tags (ESTs) were analyzed through construction the cDNA library prepared from a pair of venomous gland. About 77% of the ESTs correspond to toxin-like peptides and proteins with definite open reading frames. The cDNA sequencing results also show the presence of sequences whose putative products have sequence similarity with antimicrobial peptides (24%), insecticidal toxins, β-NaScTxs, κ-KScTxs, α-KScTxs, calcines and La1-like peptides. Also, we have obtained 23 atypical types of venom molecules not recorded in other scorpion species. Moreover, 9% of the total ESTs revealed significant similarities with proteins involved in the cellular processes of these scorpion venomous glands. This is the first set of molecular masses and transcripts described from this species, in which various venom molecules have been identified. They belong to either known or unassigned types of scorpion venom peptides and proteins, and provide valuable information for evolutionary analysis and venomics. Copyright © 2013 Elsevier Ltd. All rights reserved.

  20. A sequential analysis of classroom discourse in Italian primary schools: the many faces of the IRF pattern.

    PubMed

    Molinari, Luisa; Mameli, Consuelo; Gnisci, Augusto

    2013-09-01

    A sequential analysis of classroom discourse is needed to investigate the conditions under which the triadic initiation-response-feedback (IRF) pattern may host different teaching orientations. The purpose of the study is twofold: first, to describe the characteristics of classroom discourse and, second, to identify and explore the different interactive sequences that can be captured with a sequential statistical analysis. Twelve whole-class activities were video recorded in three Italian primary schools. We observed classroom interaction as it occurs naturally on an everyday basis. In total, we collected 587 min of video recordings. Subsequently, 828 triadic IRF patterns were extracted from this material and analysed with the programme Generalized Sequential Query (GSEQ). The results indicate that classroom discourse may unfold in different ways. In particular, we identified and described four types of sequences. Dialogic sequences were triggered by authentic questions, and continued through further relaunches. Monologic sequences were directed to fulfil the teachers' pre-determined didactic purposes. Co-constructive sequences fostered deduction, reasoning, and thinking. Scaffolding sequences helped and sustained children with difficulties. The application of sequential analyses allowed us to show that interactive sequences may account for a variety of meanings, thus making a significant contribution to the literature and research practice in classroom discourse. © 2012 The British Psychological Society.

  1. A Pan-HIV Strategy for Complete Genome Sequencing

    PubMed Central

    Yamaguchi, Julie; Alessandri-Gradt, Elodie; Tell, Robert W.; Brennan, Catherine A.

    2015-01-01

    Molecular surveillance is essential to monitor HIV diversity and track emerging strains. We have developed a universal library preparation method (HIV-SMART [i.e., switching mechanism at 5′ end of RNA transcript]) for next-generation sequencing that harnesses the specificity of HIV-directed priming to enable full genome characterization of all HIV-1 groups (M, N, O, and P) and HIV-2. Broad application of the HIV-SMART approach was demonstrated using a panel of diverse cell-cultured virus isolates. HIV-1 non-subtype B-infected clinical specimens from Cameroon were then used to optimize the protocol to sequence directly from plasma. When multiplexing 8 or more libraries per MiSeq run, full genome coverage at a median ∼2,000× depth was routinely obtained for either sample type. The method reproducibly generated the same consensus sequence, consistently identified viral sequence heterogeneity present in specimens, and at viral loads of ≤4.5 log copies/ml yielded sufficient coverage to permit strain classification. HIV-SMART provides an unparalleled opportunity to identify diverse HIV strains in patient specimens and to determine phylogenetic classification based on the entire viral genome. Easily adapted to sequence any RNA virus, this technology illustrates the utility of next-generation sequencing (NGS) for viral characterization and surveillance. PMID:26699702

  2. The Distribution, Diversity, and Geobiology of Thermoproteales Populations in Yellowstone National Park

    NASA Astrophysics Data System (ADS)

    Jay, Z.; Beam, J.; Bailey, C.; Dohnalkova, A.; Planer-Friedrich, B.; Romine, M.; Inskeep, W. P.

    2012-12-01

    The order Thermoproteales (phylum Crenarchaeota) consists of thermophilic, rod-shaped organisms that are found globally in geothermal habitats ranging in pH from ~3-9. Nearly all isolated Thermoproteales couple the respiration of inorganic sulfur species (e.g. elemental sulfur, thiosulfate, sulfate) to the oxidation of hydrogen or complex organic carbon. Prior 16S rRNA and metagenome analysis revealed four prominent Thermoproteales-like populations in hypoxic, sulfidic hot springs In Yellowstone National Park (YNP), WY, USA (Monarch Geyser [80° C, pH 4], Cistern Spring [76° C, pH 5] and Joseph's Coat Hot Spring [JCHS; 80° C, pH 6]). The objectives of this study were to 1) characterize and compare the indigenous Thermoproteales-like de novo assemblies identified from metagenomic sequence data available for geothermal systems across YNP, 2) determine the metabolic potential of the Thermoproteales-like populations and evaluate their role in the geochemical cycling of organic and inorganic constituents, and 3) contrast both the sequenced genome and growth physiology of the first Thermoproteales isolated from YNP ("Pyrobaculum yellowstonensis" strain WP30), to the indigenous Thermoproteales-like de novo assemblies. Sequences related to either Caldivirga or Vulcanisaeta spp. (Type I Thermoproteales) were identified in both aerobic and anaerobic habitats ranging in pH ~3 - 6. Thermoproteus or Pyrobaculum spp. (Type-II Thermoproteales) were identified in anoxic habitats, but were constrained to pH values >4. Annotation of the de novo assemblies indicate that both Type-I and Type-II Thermoproteales populations are primarily heterotrophic, although key proteins of the autotrophic dicarboxylate/4-hydroxybutyrate cycle were also identified. Caldivirga/Vulcanisaeta-like populations appear to respire on elemental sulfur, sulfate, or molecular oxygen, while the Thermoproteus/Pyrobaculum-like population may also oxidize hydrogen and respire on elemental sulfur, thiosulfate, arsenate, or tetrathionate. One of the relevant Thermoproteales Type-II populations was isolated from JCHS and is an anaerobic heterotroph utilizing yeast extract as a carbon and energy source while respiring on elemental sulfur or arsenate, resulting in the production of sulfide or arsenite, respectively. The optimum growth temperature of strain WP30 (75° C) and pH range (4.5 - 7) corresponds well with characteristics of the sulfidic sediment used as the original inoculum. A draft genome of strain WP30 reveals that respiration may involve as many as four dimethylsulfoxide molybdopterin oxidoreductases including a putative sulfur reductase and an arsenate reductase. Sequences with high amino acid identity to these reductases were also identified in metagenome data sets from sites containin Type-II populations. Expression data of these terminal reductase genes during the growth of strain WP30 on either sulfur or arsenate were compared to expression results from field sites. These data provide insights regarding the diversity, distribution, and potential role of Thermoproteales-like populations in high-temperature environments of YNP.

  3. Sequencing of the variable region of rpsB to discriminate between Streptococcus pneumoniae and other streptococcal species.

    PubMed

    Wyllie, Anne L; Pannekoek, Yvonne; Bovenkerk, Sandra; van Engelsdorp Gastelaars, Jody; Ferwerda, Bart; van de Beek, Diederik; Sanders, Elisabeth A M; Trzciński, Krzysztof; van der Ende, Arie

    2017-09-01

    The vast majority of streptococci colonizing the human upper respiratory tract are commensals, only sporadically implicated in disease. Of these, the most pathogenic is Mitis group member, Streptococcus pneumoniae Phenotypic and genetic similarities between streptococci can cause difficulties in species identification. Using ribosomal S2-gene sequences extracted from whole-genome sequences published from 501 streptococci, we developed a method to identify streptococcal species. We validated this method on non-pneumococcal isolates cultured from cases of severe streptococcal disease ( n = 101) and from carriage ( n = 103), and on non-typeable pneumococci from asymptomatic individuals ( n = 17) and on whole-genome sequences of 1157 pneumococcal isolates from meningitis in the Netherlands. Following this, we tested 221 streptococcal isolates in molecular assays originally assumed specific for S. pneumoniae , targeting cpsA , lytA , piaB , ply , Spn9802, zmpC and capsule-type-specific genes. Cluster analysis of S2-sequences showed grouping according to species in line with published phylogenies of streptococcal core genomes. S2-typing convincingly distinguished pneumococci from non-pneumococcal species (99.2% sensitivity, 100% specificity). Molecular assays targeting regions of lytA and piaB were 100% specific for S. pneumoniae , whereas assays targeting cpsA , ply , Spn9802, zmpC and selected serotype-specific assays (but not capsular sequence typing) showed a lack of specificity. False positive results were over-represented in species associated with carriage, although no particular confounding signal was unique for carriage isolates. © 2017 The Authors.

  4. Sequencing of the variable region of rpsB to discriminate between Streptococcus pneumoniae and other streptococcal species

    PubMed Central

    Pannekoek, Yvonne; Bovenkerk, Sandra; van Engelsdorp Gastelaars, Jody; Ferwerda, Bart; van de Beek, Diederik; Sanders, Elisabeth A. M.; Trzciński, Krzysztof; van der Ende, Arie

    2017-01-01

    The vast majority of streptococci colonizing the human upper respiratory tract are commensals, only sporadically implicated in disease. Of these, the most pathogenic is Mitis group member, Streptococcus pneumoniae. Phenotypic and genetic similarities between streptococci can cause difficulties in species identification. Using ribosomal S2-gene sequences extracted from whole-genome sequences published from 501 streptococci, we developed a method to identify streptococcal species. We validated this method on non-pneumococcal isolates cultured from cases of severe streptococcal disease (n = 101) and from carriage (n = 103), and on non-typeable pneumococci from asymptomatic individuals (n = 17) and on whole-genome sequences of 1157 pneumococcal isolates from meningitis in the Netherlands. Following this, we tested 221 streptococcal isolates in molecular assays originally assumed specific for S. pneumoniae, targeting cpsA, lytA, piaB, ply, Spn9802, zmpC and capsule-type-specific genes. Cluster analysis of S2-sequences showed grouping according to species in line with published phylogenies of streptococcal core genomes. S2-typing convincingly distinguished pneumococci from non-pneumococcal species (99.2% sensitivity, 100% specificity). Molecular assays targeting regions of lytA and piaB were 100% specific for S. pneumoniae, whereas assays targeting cpsA, ply, Spn9802, zmpC and selected serotype-specific assays (but not capsular sequence typing) showed a lack of specificity. False positive results were over-represented in species associated with carriage, although no particular confounding signal was unique for carriage isolates. PMID:28931649

  5. Global molecular genetic analysis of porcine circovirus type 2 (PCV2) sequences confirms the presence of four main PCV2 genotypes and reveals a rapid increase of PCV2d.

    PubMed

    Xiao, Chao-Ting; Halbur, Patrick G; Opriessnig, Tanja

    2015-07-01

    The oldest porcine circovirus type 2 (PCV2) sequence dates back to 1962 and is among several hundreds of publicly available PCV2 sequences. Despite this resource, few studies have investigated the global genetic diversity of PCV2. To evaluate the phylogenetic relationship of PCV2 strains, 1680 PCV2 open reading frame 2 (ORF2) sequences were compared and analysed by methods of neighbour-joining, maximum-likelihood, Bayesian inference and network analysis. Four distinct clades were consistently identified and included PCV2a, PCV2b, PCV2c and PCV2d; the p-distance between PCV2d and PCV2b was 0.055±0.008, larger than the PCV2 genotype-definition cut-off of 0.035, supporting PCV2d as an independent genotype. Among the 1680 sequences, 278-285 (16.5-17 %) were classified as PCV2a, 1007-1058 (59.9-63 %) as PCV2b, three (0.2 %) as PCV2c and 322-323 (19.2 %) as PCV2d, with the remaining 12-78 sequences (0.7-4.6 %) classified as intermediate clades or strains by the various methods. Classification of strains to genotypes differed based on the number of sequences used for the analysis, indicating that sample size is important when determining classification and assessing PCV2 trends and shifts. PCV2d was initially identified in 1999 in samples collected in Switzerland, now appears to be widespread in China and has been present in North America since 2012. During 2012-2013, 37 % of all investigated PCV2 sequences from US pigs were classified as PCV2d and overall data analysis suggests an ongoing genotype shift from PCV2b towards PCV2d. The present analyses indicate that PCV2d emerged approximately 20 years ago.

  6. The origin of a methicillin-resistant Staphylococcus aureus isolate at a neonatal ward in Sweden-possible horizontal transfer of a staphylococcal cassette chromosome mec between methicillin-resistant Staphylococcus haemolyticus and Staphylococcus aureus.

    PubMed

    Berglund, C; Söderquist, B

    2008-11-01

    The first methicillin-resistant Staphylococcus aureus (MRSA) strain originated when a staphylococcal cassette chromosome mec (SCCmec) with the gene mecA was integrated into the chromosome of a susceptible S. aureus cell. The SCCmec elements are common among the coagulase-negative staphylococci, e.g. Staphylococcus haemolyticus, and these are considered to be potential SCCmec donors when new clones of MRSA arise. An outbreak of MRSA occurred at a neonatal intensive-care unit, and the isolates were all of sequence type (ST) 45, as characterized by multilocus sequence typing, but were not typeable with respect to SCCmec types I, II, III or IV. During the same time period, methicillin-resistant S. haemolyticus (MRSH) isolates identified in blood cultures at the same ward were found to be genotypically homogenous by pulsed-field gel electrophoresis, and did not carry a type I, II, III or IV SCCmec either. Thus, the hypothesis was raised that an SCCmec of MRSH had been transferred to a methicillin-susceptible S. aureus strain and thereby created a new clone of MRSA that caused the outbreak. This study showed that MRSA from the outbreak carried a ccrC and a class C mec complex that was also found among MRSH isolates. Partial sequencing of the mec complexes showed more than 99% homology, indicative of a common type V SCCmec. This finding may provide evidence for a recent horizontal transfer of an SCCmec from MRSH to an identified potential recipient, an ST45 methicillin-susceptible S. aureus strain, thereby creating a new clone of MRSA that caused the outbreak.

  7. Characterization of blaCTX-M IncFII plasmids and clones of Escherichia coli from pets in France.

    PubMed

    Dahmen, Safia; Haenni, Marisa; Châtre, Pierre; Madec, Jean-Yves

    2013-12-01

    To characterize bla(CTX-M) IncFII plasmids and clones of Escherichia coli from cats and dogs and to compare them with bla(CTX-M) IncFII plasmids reported in humans. From December 2006 to April 2010, 518 E. coli isolates from clinical infections in cats and dogs were screened for extended-spectrum β-lactamase (ESBL) production. Antimicrobial susceptibility was performed by disc diffusion and resistance genes were identified by PCR and sequencing. Plasmids were characterized using PCR-based replicon typing and sub-typing schemes, restriction fragment length polymorphism analysis, S1-PFGE and Southern hybridization. Isolates were characterized by PFGE, phylogenetic grouping, O25b typing and multilocus sequence typing. Nineteen E. coli isolates (3.7%) produced ESBLs, of which 14 (74%) carried bla(CTX-M) IncFII plasmids. The bla(CTX-M) gene was predominant and located on F31:A4:B1, F36:A4:B1 or F36:A1:B20 plasmids, abundantly reported in humans. The bla(CTX-M) F22:A1:B20 or F2:A2:B20 plasmids were also found. Different sequence types (STs) were identified, such as ST10, ST410, ST359, ST617 and ST224. Only one E. coli isolate belonged to the ST131 E. coli clone and carried a bla(CTX-M) F2:A2:B20 plasmid. This is the first known extensive study on ESBL-producing E. coli isolates from pets in France. The ST131 clone was rare. However, the predominance of human-like bla(CTX-M) IncFII plasmids suggests exchanges of these plasmids with the human reservoir.

  8. VizieR Online Data Catalog: Far-UV spectral atlas of O-type stars (Smith, 2012)

    NASA Astrophysics Data System (ADS)

    Smith, M. A.

    2012-10-01

    In this paper, we present a spectral atlas covering the wavelength interval 930-1188Å for O2-O9.5 stars using Far-Ultraviolet Spectroscopic Explorer archival data. The stars selected for the atlas were drawn from three populations: Galactic main-sequence (classes III-V) stars, supergiants, and main-sequence stars in the Magellanic Clouds, which have low metallicities. For several of these stars, we have prepared FITS files comprised of pairs of merged spectra for user access via the Multimission Archive at Space Telescope (MAST). We chose spectra from the first population with spectral types O4, O5, O6, O7, O8, and O9.5 and used them to compile tables and figures with identifications of all possible atmospheric and interstellar medium lines in the region 949-1188Å. Our identified line totals for these six representative spectra are 821 (500), 992 (663), 1077 (749), 1178 (847), 1359 (1001), and 1798 (1392) lines, respectively, where the numbers in parentheses are the totals of lines formed in the atmospheres, according to spectral synthesis models. The total number of unique atmospheric identifications for the six main-sequence O-star template spectra is 1792, whereas the number of atmospheric lines in common to these spectra is 300. The number of identified lines decreases toward earlier types (increasing effective temperature), while the percentages of "missed" features (unknown lines not predicted from our spectral syntheses) drop from a high of 8% at type B0.2, from our recently published B-star far-UV atlas (Cat. J/ApJS/186/175), to 1%-3% for type O spectra. The percentages of overpredicted lines are similar, despite their being much higher for B-star spectra. (4 data files).

  9. Identification and characterization of EBV genomes in spontaneously immortalized human peripheral blood B lymphocytes by NGS technology.

    PubMed

    Lei, Haiyan; Li, Tianwei; Hung, Guo-Chiuan; Li, Bingjie; Tsai, Shien; Lo, Shyh-Ching

    2013-11-19

    We conducted genomic sequencing to identify Epstein Barr Virus (EBV) genomes in 2 human peripheral blood B lymphocytes that underwent spontaneous immortalization promoted by mycoplasma infections in culture, using the high-throughput sequencing (HTS) Illumina MiSeq platform. The purpose of this study was to examine if rapid detection and characterization of a viral agent could be effectively achieved by HTS using a platform that has become readily available in general biology laboratories. Raw read sequences, averaging 175 bps in length, were mapped with DNA databases of human, bacteria, fungi and virus genomes using the CLC Genomics Workbench bioinformatics tool. Overall 37,757 out of 49,520,834 total reads in one lymphocyte line (# K4413-Mi) and 28,178 out of 45,335,960 reads in the other lymphocyte line (# K4123-Mi) were identified as EBV sequences. The two EBV genomes with estimated 35.22-fold and 31.06-fold sequence coverage respectively, designated K4413-Mi EBV and K4123-Mi EBV (GenBank accession number KC440852 and KC440851 respectively), are characteristic of type-1 EBV. Sequence comparison and phylogenetic analysis among K4413-Mi EBV, K4123-Mi EBV and the EBV genomes previously reported to GenBank as well as the NA12878 EBV genome assembled from database of the 1000 Genome Project showed that these 2 EBVs are most closely related to B95-8, an EBV previously isolated from a patient with infectious mononucleosis and WT-EBV. They are less similar to EBVs associated with nasopharyngeal carcinoma (NPC) from Hong Kong and China as well as the Akata strain of a case of Burkitt's lymphoma from Japan. They are most different from type 2 EBV found in Western African Burkitt's lymphoma.

  10. Human papillomavirus type 16 variants in cervical intraepithelial neoplasia and invasive carcinoma in San Luis Potosí City, Mexico

    PubMed Central

    López-Revilla, Rubén; Pineda, Marco A; Ortiz-Valdez, Julio; Sánchez-Garza, Mireya; Riego, Lina

    2009-01-01

    Background In San Luis Potosí City cervical infection by human papillomavirus type 16 (HPV16) associated to dysplastic lesions is more prevalent in younger women. In this work HPV16 subtypes and variants associated to low-grade intraepithelial lesions (LSIL), high-grade intraepithelial lesions (HSIL) and invasive cervical cancer (ICC) of 38 women residing in San Luis Potosí City were identified by comparing their E6 open reading frame sequences. Results Three European (E) variants (E-P, n = 27; E-T350G, n = 7; E-C188G, n = 2) and one AA-a variant (n = 2) were identified among the 38 HPV16 sequences analyzed. E-P variant sequences contained 23 single nucleotide changes, two of which (A334G, A404T) had not been described before and allowed the phylogenetic separation from the other variants. E-P A334G sequences were the most prevalent (22 cases, 57.9%), followed by the E-P Ref prototype (8 cases, 21.1%) and E-P A404T (1 case, 2.6%) sequences. The HSIL + ICC fraction was 0.21 for the E-P A334G variants and 0.00 for the E-P Ref variants. Conclusion We conclude that in the women included in this study the HPV16 E subtype is 19 times more frequent than the AA subtype; that the circulating E variants are E-P (71.1%) > E-T350G (18.4%) > E-C188G (5.3%); that 71.0% of the E-P sequences carry the A334G single nucleotide change and appear to correspond to a HPV16 variant characteristic of San Luis Potosi City more oncogenic than the E-P Ref prototype. PMID:19216802

  11. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schmidt, Edward G., E-mail: eschmidt1@unl.edu

    We have obtained VR photometry of 447 Cepheid variable star candidates with declinations north of -14 Degree-Sign 30', most of which were identified using the Northern Sky Variability Survey (NSVS) data archive. Periods and other photometric properties were derived from the combination of our data with the NSVS data. Atmospheric parameters were determined for 81 of these stars from low-resolution spectra. The identification of type II Cepheids based on the data presented in all four papers in this series is discussed. On the basis of spectra, 30 type II Cepheids were identified while 53 variables were identified as cool, mainmore » sequence stars and 283 as red giants following the definitions in Paper III. An additional 30 type II Cepheids were identified on the basis of light curves. The present classifications are compared with those from the Machine-learned All Sky Automated Survey Classification Catalog for 174 stars in common.« less

  12. A Wide Variety of Clostridium perfringens Type A Food-Borne Isolates That Carry a Chromosomal cpe Gene Belong to One Multilocus Sequence Typing Cluster

    PubMed Central

    Xiao, Yinghua; Wagendorp, Arjen; Moezelaar, Roy; Abee, Tjakko

    2012-01-01

    Of 98 suspected food-borne Clostridium perfringens isolates obtained from a nationwide survey by the Food and Consumer Product Safety Authority in The Netherlands, 59 strains were identified as C. perfringens type A. Using PCR-based techniques, the cpe gene encoding enterotoxin was detected in eight isolates, showing a chromosomal location for seven isolates and a plasmid location for one isolate. Further characterization of these strains by using (GTG)5 fingerprint repetitive sequence-based PCR analysis distinguished C. perfringens from other sulfite-reducing clostridia but did not allow for differentiation between various types of C. perfringens strains. To characterize the C. perfringens strains further, multilocus sequence typing (MLST) analysis was performed on eight housekeeping genes of both enterotoxic and non-cpe isolates, and the data were combined with a previous global survey covering strains associated with food poisoning, gas gangrene, and isolates from food or healthy individuals. This revealed that the chromosomal cpe strains (food strains and isolates from food poisoning cases) belong to a distinct cluster that is significantly distant from all the other cpe plasmid-carrying and cpe-negative strains. These results suggest that different groups of C. perfringens have undergone niche specialization and that a distinct group of food isolates has specific core genome sequences. Such findings have epidemiological and evolutionary significance. Better understanding of the origin and reservoir of enterotoxic C. perfringens may allow for improved control of this organism in foods. PMID:22865060

  13. Case reports of juvenile GM1 gangliosidosisis type II caused by mutation in GLB1 gene.

    PubMed

    Karimzadeh, Parvaneh; Naderi, Samaneh; Modarresi, Farzaneh; Dastsooz, Hassan; Nemati, Hamid; Farokhashtiani, Tayebeh; Shamsian, Bibi Shahin; Inaloo, Soroor; Faghihi, Mohammad Ali

    2017-07-17

    Type II or juvenile GM1-gangliosidosis is an autosomal recessive lysosomal storage disorder, which is clinically distinct from infantile form of the disease by the lack of characteristic cherry-red spot and hepatosplenomegaly. The disease is characterized by slowly progressive neurodegeneration and mild skeletal changes. Due to the later age of onset and uncharacteristic presentation, diagnosis is frequently puzzled with other ataxic and purely neurological disorders. Up to now, 3-4 types of GM1-gangliosidosis have been reported and among them type I is the most common phenotype with the age of onset around 6 months. Various forms of GM1-gangliosidosis are caused by GLB1 gene mutations but severity of the disease and age of onset are directly related to the position and the nature of deleterious mutations. However, due to its unique genetic cause and overlapping clinical features, some researchers believe that GM1 gangliosidosis represents an overlapped disease spectrum instead of four distinct types. Here, we report a less frequent type of autosomal recessive GM1 gangliosidosis with perplexing clinical presentation in three families in the southwest part of Iran, who are unrelated but all from "Lurs" ethnic background. To identify disease-causing mutations, Whole Exome Sequencing (WES) utilizing next generation sequencing was performed. Four patients from three families were investigated with the age of onset around 3 years old. Clinical presentations were ataxia, gate disturbances and dystonia leading to wheelchair-dependent disability, regression of intellectual abilities, and general developmental regression. They all were born in consanguineous families with no previous documented similar disease in their parents. A homozygote missense mutation in GLB1 gene (c. 601 G > A, p.R201C) was found in all patients. Using Sanger sequencing this identified mutation was confirmed in the proband, their parents, grandparents, and extended family members, confirming its autosomal recessive pattern of inheritance. Our study identified a rare pathogenic missense mutation in GLB1 gene in patients with complex neurodevelopmental findings, which can extend the list of differential diagnoses for childhood ataxia in Iranian patients.

  14. Analysis of resistance genes of clinical Pannonibacter phragmitetus strain 31801 by complete genome sequencing.

    PubMed

    Ming, De-Song; Chen, Qing-Qing; Chen, Xiao-Tin

    2018-05-14

    To clarify the resistance mechanisms of Pannonibacter phragmitetus 31801, isolated from the blood of a liver abscess patient, at the genomic level, we performed whole genomic sequencing using a PacBio RS II single-molecule real-time long-read sequencer. Bioinformatic analysis of the resulting sequence was then carried out to identify any possible resistance genes. Analyses included Basic Local Alignment Search Tool searches against the Antibiotic Resistance Genes Database, ResFinder analysis of the genome sequence, and Resistance Gene Identifier analysis within the Comprehensive Antibiotic Resistance Database. Prophages, clustered regularly interspaced short palindromic repeats (CRISPR), and other putative virulence factors were also identified using PHAST, CRISPRfinder, and the Virulence Factors Database, respectively. The circular chromosome and single plasmid of P. phragmitetus 31801 contained multiple antibiotic resistance genes, including those coding for three different types of β-lactamase [NPS β-lactamase (EC 3.5.2.6), β-lactamase class C, and a metal-dependent hydrolase of β-lactamase superfamily I]. In addition, genes coding for subunits of several multidrug-resistance efflux pumps were identified, including those targeting macrolides (adeJ, cmeB), tetracycline (acrB, adeAB), fluoroquinolones (acrF, ceoB), and aminoglycosides (acrD, amrB, ceoB, mexY, smeB). However, apart from the tripartite macrolide efflux pump macAB-tolC, the genome did not appear to contain the complete complement of subunit genes required for production of most of the major multidrug-resistance efflux pumps.

  15. Draft Genome Sequence of Mycobacterium chimaera Type ...

    EPA Pesticide Factsheets

    We report the draft genome sequence of the type strain Mycobacterium chimaera Fl-0169T, a member of the Mycobacterium avium complex (MAC). M. chimaera Fl-0169T was isolated from a patient in Italy and is highly similar to strains of M. chimaera isolated in Ireland, though Fl-0169T possesses unique virulence genes. Evidence suggests that M. avium, M. intracellulare, and M. chimaera are differently virulent and a comparative genomic analysis is critically needed to identify diagnostic targets that reliably differentiate species of MAC. With treatment costs for Mycobacterium infections estimated to be >$1.8 B annually in the U.S., correct species identification will result in improved treatment selection, lower costs, and improved patient outcomes.

  16. The Danish STR sequence database: duplicate typing of 363 Danes with the ForenSeq™ DNA Signature Prep Kit.

    PubMed

    Hussing, C; Bytyci, R; Huber, C; Morling, N; Børsting, C

    2018-05-24

    Some STR loci have internal sequence variations, which are not revealed by the standard STR typing methods used in forensic genetics (PCR and fragment length analysis by capillary electrophoresis (CE)). Typing of STRs with next-generation sequencing (NGS) uncovers the sequence variation in the repeat region and in the flanking regions. In this study, 363 Danish individuals were typed for 56 STRs (26 autosomal STRs, 24 Y-STRs, and 6 X-STRs) using the ForenSeq™ DNA Signature Prep Kit to establish a Danish STR sequence database. Increased allelic diversity was observed in 34 STRs by the PCR-NGS assay. The largest increases were found in DYS389II and D12S391, where the numbers of sequenced alleles were around four times larger than the numbers of alleles determined by repeat length alone. Thirteen SNPs and one InDel were identified in the flanking regions of 12 STRs. Furthermore, 36 single positions and five longer stretches in the STR flanking regions were found to have dubious genotyping quality. The combined match probability of the 26 autosomal STRs was 10,000 times larger using the PCR-NGS assay than by using PCR-CE. The typical paternity indices for trios and duos were 500 and 100 times larger, respectively, than those obtained with PCR-CE. The assay also amplified 94 SNPs selected for human identification. Eleven of these loci were not in Hardy-Weinberg equilibrium in the Danish population, most likely because the minimum threshold for allele calling (30 reads) in the ForenSeq™ Universal Analysis Software was too low and frequent allele dropouts were not detected.

  17. Unique Conformation in a Natural Interruption Sequence of Type XIX Collagen Revealed by Its High-Resolution Crystal Structure.

    PubMed

    Xu, Tingting; Zhou, Cong-Zhao; Xiao, Jianxi; Liu, Jinsong

    2018-02-20

    Naturally occurring interruptions in nonfibrillar collagen play key roles in molecular flexibility, collagen degradation, and ligand binding. The structural feature of the interruption sequences and the molecular basis for their functions have not been well studied. Here, we focused on a G5G type natural interruption sequence G-POALO-G from human type XIX collagen, a homotrimer collagen, as this sequence possesses distinct properties compared with those of a pathological similar Gly mutation sequence in collagen mimic peptides. We determined the crystal structures of the host-guest peptide (GPO) 3 -GPOALO-(GPO) 4 to 1.03 Å resolution in two crystal forms. In these structures, the interruption zone brings localized disruptions to the triple helix and introduces a light 6-8° bend with the same directional preference to the whole molecule, which may correspond structurally to the first physiological kink site in type XIX collagen. Furthermore, at the G5G interruption site, the presence of Ala and Leu residues, both with free N-H groups, allows the formation of more direct and water-mediated interchain hydrogen bonds than in the related Gly → Ala structure. These could partly explain the difference in thermal stability between the different interruptions. In addition, our structures provide a detailed view of the dynamic property of such an interrupted zone with respect to hydrogen bonding topology, torsion angles, and helical parameters. Our results, for the first time, also identified the binding of zinc to the end of the triple helix. These findings will shed light on how the interruption sequence influences the conformation of the collagen molecule and provide a structural basis for further functional studies.

  18. Sequence and structural characterization of Trx-Grx type of monothiol glutaredoxins from Ashbya gossypii.

    PubMed

    Yadav, Saurabh; Kumari, Pragati; Kushwaha, Hemant Ritturaj

    2013-01-01

    Glutaredoxins are enzymatic antioxidants which are small, ubiquitous, glutathione dependent and essentially classified under thioredoxin-fold superfamily. Glutaredoxins are classified into two types: dithiol and monothiol. Monothiol glutaredoxins which carry the signature "CGFS" as a redox active motif is known for its role in oxidative stress, inside the cell. In the present analysis, the 138 amino acid long monothiol glutaredoxin, AgGRX1 from Ashbya gossypii was identified and has been used for the analysis. The multiple sequence alignment of the AgGRX1 protein sequence revealed the characteristic motif of typical monothiol glutaredoxin as observed in various other organisms. The proposed structure of the AgGRX1 protein was used to analyze signature folds related to the thioredoxin superfamily. Further, the study highlighted the structural features pertaining to the complex mechanism of glutathione docking and interacting residues.

  19. Genome sequence of the pink–pigmented marine bacterium Loktanella hongkongensis type strain (UST950701–009PT), a representative of the Roseobacter group

    DOE PAGES

    Lau, Stanley CK; Riedel, Thomas; Fiebig, Anne; ...

    2015-08-11

    Loktanella hongkongensis UST950701-009PT is a Gram-negative, non-motile and rod-shaped bacterium isolated from a marine biofilm in the subtropical seawater of Hong Kong. When growing as a monospecies biofilm on polystyrene surfaces, this bacterium is able to induce larval settlement and metamorphosis of a ubiquitous polychaete tubeworm Hydroides elegans. The inductive cues are low-molecular weight compounds bound to the exopolymeric matrix of the bacterial cells. In the present study we describe the features of L. hongkongensis strain DSM 17492T together with its genome sequence and annotation and novel aspects of its phenotype. The 3,198,444 bp long genome sequence encodes 3104 protein-codingmore » genes and 57 RNA genes. Lastly, the two unambiguously identified extrachromosomal replicons contain replication modules of the RepB and the Rhodobacteraceae-specific DnaA-like type, respectively.« less

  20. Methyl-CpG island-associated genome signature tags

    DOEpatents

    Dunn, John J

    2014-05-20

    Disclosed is a method for analyzing the organismic complexity of a sample through analysis of the nucleic acid in the sample. In the disclosed method, through a series of steps, including digestion with a type II restriction enzyme, ligation of capture adapters and linkers and digestion with a type IIS restriction enzyme, genome signature tags are produced. The sequences of a statistically significant number of the signature tags are determined and the sequences are used to identify and quantify the organisms in the sample. Various embodiments of the invention described herein include methods for using single point genome signature tags to analyze the related families present in a sample, methods for analyzing sequences associated with hyper- and hypo-methylated CpG islands, methods for visualizing organismic complexity change in a sampling location over time and methods for generating the genome signature tag profile of a sample of fragmented DNA.

Top