El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R
2013-07-01
Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
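The within-type and between-type identity figures quoted above are means over pairwise sequence comparisons. As a minimal sketch of that arithmetic (not the authors' pipeline), the following Python assumes the sequences are already aligned to equal length and skips gap columns; the function names and the toy sequences are invented for illustration.

```python
from itertools import combinations, product


def identity(a: str, b: str) -> float:
    """Fraction of aligned, gap-free columns at which two sequences match."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            continue  # ignore columns with a gap in either sequence
        compared += 1
        matches += x == y
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return matches / compared


def mean_identity_within(group):
    """Mean identity over all unordered pairs inside one group."""
    pairs = list(combinations(group, 2))
    return sum(identity(a, b) for a, b in pairs) / len(pairs)


def mean_identity_between(group_1, group_2):
    """Mean identity over all pairs drawn one from each group."""
    pairs = list(product(group_1, group_2))
    return sum(identity(a, b) for a, b in pairs) / len(pairs)


# Toy aligned sequences (invented, not real 18S rDNA data).
type_a = ["ACGTACGTAC", "ACGTACGTAT"]
type_b = ["ACGAACGTGC", "ACGAACGTGC"]

within_a = mean_identity_within(type_a)
between = mean_identity_between(type_a, type_b)
divergence = 1.0 - between
```

Divergence is taken here simply as one minus mean identity, which is how an identity of 97.4% corresponds to a divergence of about 2.6%.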
Arisue, Nobuko; Sánchez, Lidya B.; Weiss, Louis M.; Müller, Miklós; Hashimoto, Tetsuo
2011-01-01
Genes encoding putative mitochondrial-type heat shock protein 70 (mit-hsp70) were isolated and sequenced from the amitochondriate protists Giardia intestinalis, Entamoeba histolytica, and two microsporidians, Encephalitozoon hellem and Glugea plecoglossi. The deduced mit-hsp70 sequences were analyzed by sequence alignments and phylogenetic reconstructions. The mit-hsp70 sequences of these four amitochondriate protists were divergent from the mit-hsp70 sequences of mitochondriate eukaryotes. However, all of these sequences were clearly located within a eukaryotic mitochondrial clade in a tree including various types of hsp70 sequences, supporting the emerging notion that none of these amitochondriate lineages is primitively amitochondrial; rather, they lost their mitochondria secondarily in their evolutionary past. PMID:11880223
An automated genotyping tool for enteroviruses and noroviruses.
Kroneman, A; Vennema, H; Deforche, K; v d Avoort, H; Peñaranda, S; Oberste, M S; Vinjé, J; Koopmans, M
2011-06-01
Molecular techniques are established as routine in virological laboratories, and virus typing through (partial) sequence analysis is increasingly common. Quality assurance for the use of typing data requires harmonization of genotype nomenclature, agreement on target genes (depending on the level of resolution required), and robust methods. The objective was to develop and validate web-based, open-access typing tools for enteroviruses and noroviruses. An automated web-based typing algorithm was developed, starting with BLAST analysis of the query sequence against a reference set of sequences from viruses in the family Picornaviridae or Caliciviridae. The second step is phylogenetic analysis of the query sequence and a subset of the reference sequences, to assign the enterovirus type or norovirus genotype and/or variant, with profile alignment, construction of phylogenetic trees and bootstrap validation. Typing is performed on VP1 sequences of Human enterovirus A to D, and ORF1 and ORF2 sequences of genogroup I and II noroviruses. For validation, we used the tools to automatically type sequences in the RIVM and CDC enterovirus databases and the FBVE norovirus database. Using the typing tools, 785 (99%) of 795 enterovirus VP1 sequences and 8154 (98.5%) of 8342 norovirus sequences were typed in accordance with previously used methods. Subtyping into variants was achieved for 4439 (78.4%) of 5838 NoV GII.4 sequences. The online typing tools reliably assign genotypes for enteroviruses and noroviruses. The use of phylogenetic methods makes these tools robust to ongoing evolution. This should facilitate standardized genotyping and nomenclature in clinical and public health laboratories, thus supporting inter-laboratory comparisons. Copyright © 2011 Elsevier B.V. All rights reserved.
Mining co-occurrence and sequence patterns from cancer diagnoses in New York State.
Wang, Yu; Hou, Wei; Wang, Fusheng
2018-01-01
The goal of this study is to discover disease co-occurrence and sequence patterns from large-scale cancer diagnosis histories in New York State. In particular, we want to identify disparities among different patient groups. Our study will provide essential knowledge for clinical researchers to further investigate comorbidities and disease progression for improving the management of multiple diseases. We used inpatient discharge and outpatient visit records from the New York State Statewide Planning and Research Cooperative System (SPARCS) from 2011-2015. We grouped each patient's visit history to generate diagnosis sequences for the seven most common cancer types. We performed frequent disease co-occurrence mining using the Apriori algorithm, and frequent disease sequence pattern discovery using the cSPADE algorithm. Different types of cancer demonstrated distinct patterns. Disparities in both disease co-occurrence and sequence patterns were observed among patients in different age groups. There were also considerable disparities in disease co-occurrence patterns with respect to different claim types (i.e., inpatient, outpatient, emergency department and ambulatory surgery). Disparities between genders were mostly found where the cancer types were gender specific. The support of most patterns was usually higher for males than for females. Compared with secondary diagnosis codes, primary diagnosis codes yielded more stable results. Two disease sequences consisting of the same diagnoses but in different orders usually had different support values. Our results suggest that the methods adopted can generate potentially interesting and clinically meaningful disease co-occurrence and sequence patterns, and identify disparities among various patient groups. These patterns could imply comorbidities and disease progressions.
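To make the notion of "support" in frequent co-occurrence mining concrete, here is a minimal, level-wise Apriori-style sketch in Python. It is an illustration under stated assumptions, not the study's implementation: each patient is reduced to a set of diagnosis labels, the labels and the 0.5 threshold are invented, and ordered sequence mining (cSPADE) is not covered.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return {itemset: support} for all itemsets meeting min_support.

    Support is the fraction of transactions containing the whole itemset.
    """
    sets = [frozenset(t) for t in transactions]
    n = len(sets)

    def support(itemset):
        return sum(itemset <= t for t in sets) / n

    items = sorted({i for t in sets for i in t})
    current = [frozenset([i]) for i in items]  # size-1 candidates
    result = {}
    k = 1
    while current:
        # Keep only candidates that are frequent at this level.
        kept = {c: s for c in current if (s := support(c)) >= min_support}
        result.update(kept)
        k += 1
        # Join frequent itemsets into candidates one item larger.
        candidates = {a | b for a, b in combinations(kept, 2) if len(a | b) == k}
        # Apriori pruning: every (k-1)-subset must itself be frequent.
        current = [
            c for c in candidates
            if all(frozenset(s) in kept for s in combinations(c, k - 1))
        ]
    return result


# Hypothetical per-patient diagnosis sets (invented labels, not SPARCS data).
patients = [
    {"breast_cancer", "hypertension", "diabetes"},
    {"breast_cancer", "hypertension"},
    {"breast_cancer", "anemia"},
    {"hypertension", "diabetes"},
]

patterns = frequent_itemsets(patients, min_support=0.5)
```

Because itemsets are unordered, this kind of mining cannot distinguish the order of diagnoses; that is the role of a sequence-pattern algorithm, where the same diagnoses in a different order are counted as a different pattern.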
Rodrigues, Thaís C S; Subramaniam, Kuttichantran; Cortés-Hinojosa, Galaxia; Wellehan, James F X; Ng, Terry Fei Fan; Delwart, Eric; McCulloch, Stephen D; Goldstein, Juli D; Schaefer, Adam M; Fair, Patricia A; Reif, John S; Bossart, Gregory D; Waltzek, Thomas B
2018-04-26
The genome sequence of a papillomavirus was determined from fecal samples collected from bottlenose dolphins in the Indian River Lagoon, FL. The genome was 7,772 bp and displayed a typical papillomavirus genome organization. Phylogenetic analysis supported the bottlenose dolphin papillomavirus as being a novel type of Omikronpapillomavirus 1. Copyright © 2018 Rodrigues et al.
Chen, Yen-Kuang; Li, Kuo-Bin
2013-02-07
The type information of un-annotated membrane proteins provides an important hint for their biological functions. The experimental determination of membrane protein types, despite being more accurate and reliable, is not always feasible due to the costly laboratory procedures, thereby creating a need for the development of bioinformatics methods. This article describes a novel computational classifier for the prediction of membrane protein types using protein sequences. The classifier, comprising a collection of one-versus-one support vector machines, makes use of the following sequence attributes: (1) the cationic patch sizes, the orientation, and the topology of transmembrane segments; (2) the amino acid physicochemical properties; (3) the presence of signal peptides or anchors; and (4) the specific protein motifs. A new voting scheme was implemented to cope with the multi-class prediction. Both the training and the testing sequences were collected from SwissProt. Homologous proteins were removed such that no pair of sequences left in the datasets has a sequence identity higher than 40%. The performance of the classifier was evaluated by jackknife cross-validation and independent testing experiments. Results show that the proposed classifier outperforms earlier predictors in prediction accuracy in seven of the eight membrane protein types. The overall accuracy was increased from 78.3% to 88.2%. Unlike earlier approaches, which largely depend on position-specific substitution matrices and amino acid compositions, most of the sequence attributes implemented in the proposed classifier are supported by evidence in the literature. The classifier has been deployed as a web server and can be accessed at http://bsaltools.ym.edu.tw/predmpt. Copyright © 2012 Elsevier Ltd. All rights reserved.
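A one-versus-one ensemble trains one binary classifier per pair of classes and combines their outputs by voting. The abstract does not describe its new voting scheme, so the Python sketch below shows only plain majority voting as an assumed baseline. The class names, the feature names (`tm_segments`, `signal_peptide`), and the rule-based stand-ins for trained SVMs are all hypothetical toy choices with no biological authority.

```python
from collections import Counter
from itertools import combinations


def one_vs_one_predict(sample, classes, pairwise):
    """Majority vote over one binary classifier per unordered class pair.

    `pairwise[(a, b)]` is a callable returning either a or b for a sample.
    Ties are broken by the order of `classes` (an arbitrary choice).
    """
    votes = Counter()
    for a, b in combinations(classes, 2):
        votes[pairwise[(a, b)](sample)] += 1
    best = max(votes.values())
    return next(c for c in classes if votes[c] == best)


# Hypothetical classes and toy decision rules standing in for trained SVMs.
classes = ["type_I", "type_II", "multipass"]
pairwise = {
    ("type_I", "type_II"):
        lambda s: "type_I" if s["signal_peptide"] else "type_II",
    ("type_I", "multipass"):
        lambda s: "multipass" if s["tm_segments"] > 1 else "type_I",
    ("type_II", "multipass"):
        lambda s: "multipass" if s["tm_segments"] > 1 else "type_II",
}

prediction_1 = one_vs_one_predict(
    {"tm_segments": 7, "signal_peptide": False}, classes, pairwise
)
prediction_2 = one_vs_one_predict(
    {"tm_segments": 1, "signal_peptide": True}, classes, pairwise
)
```

With eight classes, as in the article, this layout would need 28 pairwise classifiers; how ties and conflicting votes are resolved is exactly where a custom voting scheme would differ from this baseline.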
UCSC genome browser: deep support for molecular biomedical research.
Mangan, Mary E; Williams, Jennifer M; Lathe, Scott M; Karolchik, Donna; Lathe, Warren C
2008-01-01
The volume and complexity of genomic sequence data, and the additional experimental data required for annotation of the genomic context, pose a major challenge for display and access for biomedical researchers. Genome browsers organize these data and make them available in various ways to extract useful information to advance research projects. The UCSC Genome Browser is one of these resources. The official sequence data for a given species form the framework to display many other types of data such as expression, variation, cross-species comparisons, and more. Visual representations of the data are available for exploration. Data can be queried with sequences. Complex database queries are also easily achieved with the Table Browser interface. Associated tools permit additional query types or access to additional data sources such as images of in situ localizations. Support for solving researchers' issues is provided through active discussion mailing lists and updated training materials. The UCSC Genome Browser provides a source of deep support for a wide range of biomedical molecular research (http://genome.ucsc.edu).
Yoshida, Catherine E; Kruczkiewicz, Peter; Laing, Chad R; Lingohr, Erika J; Gannon, Victor P J; Nash, John H E; Taboada, Eduardo N
2016-01-01
For nearly 100 years serotyping has been the gold standard for the identification of Salmonella serovars. Despite the increasing adoption of DNA-based subtyping approaches, serotype information remains a cornerstone in food safety and public health activities aimed at reducing the burden of salmonellosis. At the same time, recent advances in whole-genome sequencing (WGS) promise to revolutionize our ability to perform advanced pathogen characterization in support of improved source attribution and outbreak analysis. We present the Salmonella In Silico Typing Resource (SISTR), a bioinformatics platform for rapidly performing simultaneous in silico analyses for several leading subtyping methods on draft Salmonella genome assemblies. In addition to performing serovar prediction by genoserotyping, this resource integrates sequence-based typing analyses for: Multi-Locus Sequence Typing (MLST), ribosomal MLST (rMLST), and core genome MLST (cgMLST). We show how phylogenetic context from cgMLST analysis can supplement the genoserotyping analysis and increase the accuracy of in silico serovar prediction to over 94.6% on a dataset comprised of 4,188 finished genomes and WGS draft assemblies. In addition to allowing analysis of user-uploaded whole-genome assemblies, the SISTR platform incorporates a database comprising over 4,000 publicly available genomes, allowing users to place their isolates in a broader phylogenetic and epidemiological context. The resource incorporates several metadata driven visualizations to examine the phylogenetic, geospatial and temporal distribution of genome-sequenced isolates. As sequencing of Salmonella isolates at public health laboratories around the world becomes increasingly common, rapid in silico analysis of minimally processed draft genome assemblies provides a powerful approach for molecular epidemiology in support of public health investigations. 
Moreover, this type of integrated analysis using multiple sequence-based methods of sub-typing allows for continuity with historical serotyping data as we transition towards the increasing adoption of genomic analyses in epidemiology. The SISTR platform is freely available on the web at https://lfz.corefacility.ca/sistr-app/.
Jones, Gary; Macken, Bill
2015-11-01
Traditional accounts of verbal short-term memory explain differences in performance for different types of verbal material by reference to inherent characteristics of the verbal items making up memory sequences. The role of previous experience with sequences of different types is ostensibly controlled for either by deliberate exclusion or by presenting multiple trials constructed from different random permutations. We cast doubt on this general approach in a detailed analysis of the basis for the robust finding that short-term memory for digit sequences is superior to that for other sequences of verbal material. Specifically, we show across four experiments that this advantage is not due to inherent characteristics of digits as verbal items, nor are individual digits within sequences better remembered than other types of individual verbal items. Rather, the advantage for digit sequences stems from the increased frequency, compared to other verbal material, with which digits appear in random sequences in natural language, and furthermore, relatively frequent digit sequences support better short-term serial recall than less frequent ones. We also provide corpus-based computational support for the argument that performance in a short-term memory setting is a function of basic associative learning processes operating on the linguistic experience of the rememberer. The experimental and computational results raise questions not only about the role played by measurement of digit span in cognition generally, but also about the way in which long-term memory processes impact on short-term memory functioning. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Kuhn, Jens H.; Andersen, Kristian G.; Bào, Yīmíng; Bavari, Sina; Becker, Stephan; Bennett, Richard S.; Bergman, Nicholas H.; Blinkova, Olga; Bradfute, Steven; Brister, J. Rodney; Bukreyev, Alexander; Chandran, Kartik; Chepurnov, Alexander A.; Davey, Robert A.; Dietzgen, Ralf G.; Doggett, Norman A.; Dolnik, Olga; Dye, John M.; Enterlein, Sven; Fenimore, Paul W.; Formenty, Pierre; Freiberg, Alexander N.; Garry, Robert F.; Garza, Nicole L.; Gire, Stephen K.; Gonzalez, Jean-Paul; Griffiths, Anthony; Happi, Christian T.; Hensley, Lisa E.; Herbert, Andrew S.; Hevey, Michael C.; Hoenen, Thomas; Honko, Anna N.; Ignatyev, Georgy M.; Jahrling, Peter B.; Johnson, Joshua C.; Johnson, Karl M.; Kindrachuk, Jason; Klenk, Hans-Dieter; Kobinger, Gary; Kochel, Tadeusz J.; Lackemeyer, Matthew G.; Lackner, Daniel F.; Leroy, Eric M.; Lever, Mark S.; Mühlberger, Elke; Netesov, Sergey V.; Olinger, Gene G.; Omilabu, Sunday A.; Palacios, Gustavo; Panchal, Rekha G.; Park, Daniel J.; Patterson, Jean L.; Paweska, Janusz T.; Peters, Clarence J.; Pettitt, James; Pitt, Louise; Radoshitzky, Sheli R.; Ryabchikova, Elena I.; Saphire, Erica Ollmann; Sabeti, Pardis C.; Sealfon, Rachel; Shestopalov, Aleksandr M.; Smither, Sophie J.; Sullivan, Nancy J.; Swanepoel, Robert; Takada, Ayato; Towner, Jonathan S.; van der Groen, Guido; Volchkov, Viktor E.; Volchkova, Valentina A.; Wahl-Jensen, Victoria; Warren, Travis K.; Warfield, Kelly L.; Weidmann, Manfred; Nichol, Stuart T.
2014-01-01
Sequence determination of complete or coding-complete genomes of viruses is becoming common practice for supporting the work of epidemiologists, ecologists, virologists, and taxonomists. Sequencing duration and costs are rapidly decreasing, sequencing hardware is under modification for use by non-experts, and software is constantly being improved to simplify sequence data management and analysis. Thus, analysis of virus disease outbreaks on the molecular level is now feasible, including characterization of the evolution of individual virus populations in single patients over time. The increasing accumulation of sequencing data creates a management problem for the curators of commonly used sequence databases and an entry retrieval problem for end users. Therefore, utilizing the data to their fullest potential will require setting nomenclature and annotation standards for virus isolates and associated genomic sequences. The National Center for Biotechnology Information’s (NCBI’s) RefSeq is a non-redundant, curated database for reference (or type) nucleotide sequence records that supplies source data to numerous other databases. Building on recently proposed templates for filovirus variant naming [
Kaphingst, Kimberly A; Ivanovich, Jennifer; Lyons, Sarah; Biesecker, Barbara; Dresser, Rebecca; Elrick, Ashley; Matsen, Cindy; Goodman, Melody
2018-01-29
The growing importance of genome sequencing means that patients will increasingly face decisions regarding what results they would like to learn. The present study examined psychological and clinical factors that might affect these preferences. 1,080 women diagnosed with breast cancer at age 40 or younger completed an online survey. We assessed their interest in learning various types of genome sequencing results: risk of preventable disease or unpreventable disease, cancer treatment response, uncertain meaning, risk to relatives' health, and ancestry/physical traits. Multivariable logistic regression was used to examine whether being "very" interested in each result type was associated with clinical factors: BRCA1/2 mutation status, prior genetic testing, family history of breast cancer, and psychological factors: cancer recurrence worry, genetic risk worry, future orientation, health information orientation, and genome sequencing knowledge. The proportion of respondents who were very interested in learning each type of result ranged from 16% to 77%. In all multivariable models, those who were very interested in learning a result type had significantly higher knowledge about sequencing benefits, greater genetic risks worry, and stronger health information orientation compared to those with less interest (p-values < .05). Our findings indicate that high interest in return of various types of genome sequencing results was more closely related to psychological factors. Shared decision-making approaches that increase knowledge about genome sequencing and incorporate patient preferences for health information and learning about genetic risks may help support patients' informed choices about learning different types of sequencing results. © Society of Behavioral Medicine 2018.
SPAR: small RNA-seq portal for analysis of sequencing experiments.
Kuksa, Pavel P; Amlie-Wolf, Alexandre; Katanic, Živadin; Valladares, Otto; Wang, Li-San; Leung, Yuk Yee
2018-05-04
The introduction of new high-throughput small RNA sequencing protocols that generate large-scale genomics datasets along with increasing evidence of the significant regulatory roles of small non-coding RNAs (sncRNAs) have highlighted the urgent need for tools to analyze and interpret large amounts of small RNA sequencing data. However, it remains challenging to systematically and comprehensively discover and characterize sncRNA genes and specifically-processed sncRNA products from these datasets. To fill this gap, we present Small RNA-seq Portal for Analysis of sequencing expeRiments (SPAR), a user-friendly web server for interactive processing, analysis, annotation and visualization of small RNA sequencing data. SPAR supports sequencing data generated from various experimental protocols, including smRNA-seq, short total RNA sequencing, microRNA-seq, and single-cell small RNA-seq. Additionally, SPAR includes publicly available reference sncRNA datasets from our DASHR database and from ENCODE across 185 human tissues and cell types to produce highly informative small RNA annotations across all major small RNA types and other features such as co-localization with various genomic features, precursor transcript cleavage patterns, and conservation. SPAR allows the user to compare the input experiment against reference ENCODE/DASHR datasets. SPAR currently supports analyses of human (hg19, hg38) and mouse (mm10) sequencing data. SPAR is freely available at https://www.lisanwanglab.org/SPAR.
Arneth, Borros
2012-10-01
As possible mechanisms to explain the emergence of autoimmune diseases, the current author has suggested in earlier papers two new pathways: the "protein localization hypothesis" and the "protein traffic hypothesis". The "protein localization hypothesis" states that an autoimmune disease develops if a protein accumulates in a compartment that did not previously contain that protein. Similarly, the "protein traffic hypothesis" states that a sudden error within the transport of a certain protein leads to the emergence of an autoimmune disease. The current article discusses the usefulness of the different commercially available transgenic murine models of diabetes mellitus type 1 to confirm the aforementioned hypotheses. This discussion shows that several transgenic murine models of diabetes mellitus type 1 are in line with, and confirm, the aforementioned hypotheses. Furthermore, these hypotheses are also in line with the occurrence of several newly discovered protein sequences, the so-called trepitope sequences. These sequences modulate the immune response to certain proteins. The current study analyzed to what extent the hypotheses are supported by the occurrence of these new sequences. The occurrence of the trepitope sequences thereby provides additional evidence supporting the aforementioned hypotheses. Both the "protein localization hypothesis" and the "protein traffic hypothesis" have the potential to lead to new causal therapy concepts. The "protein localization hypothesis" and the "protein traffic hypothesis" provide conceptual explanations for the diabetes mouse models as well as for the newly discovered trepitope sequences. Copyright © 2012 Elsevier Ltd. All rights reserved.
Forsythe, Stephen J; Dickins, Benjamin; Jolley, Keith A
2014-12-16
Following the association of Cronobacter spp. with several publicized fatal outbreaks of meningitis and necrotising enterocolitis in neonatal intensive care units, the World Health Organization (WHO) in 2004 requested the establishment of a molecular typing scheme to enable the international control of the organism. This paper presents the application of Next Generation Sequencing (NGS) to Cronobacter, which has led to the establishment of the Cronobacter PubMLST genome and sequence definition database (http://pubmlst.org/cronobacter/) containing over 1000 isolates with metadata, along with the recognition of specific clonal lineages linked to neonatal meningitis and adult infections. Whole genome sequencing and multilocus sequence typing (MLST) support the formal recognition of the genus Cronobacter, composed of seven species, to replace the former single species Enterobacter sakazakii. Applying the 7-loci MLST scheme to 1007 strains revealed 298 definable sequence types, yet only C. sakazakii clonal complex 4 (CC4) was principally associated with neonatal meningitis. This clonal lineage has been confirmed using ribosomal-MLST (51-loci) and whole genome-MLST (1865 loci) to analyse 107 whole genomes via the Cronobacter PubMLST database. This database has enabled the retrospective analysis of historic cases and outbreaks following re-identification of those strains. The Cronobacter PubMLST database offers a central, open access, reliable sequence-based repository for researchers. It has the capacity to create new analysis schemes 'on the fly', and to integrate metadata (source, geographic distribution, clinical presentation). It is also expandable and adaptable to changes in taxonomy, and able to support the development of reliable detection methods of use to industry and regulatory authorities. Therefore it meets the WHO (2004) request for the establishment of a typing scheme for this emergent bacterial pathogen.
Whole genome sequencing has additionally shown a range of potential virulence and environmental fitness traits which may account for the association of C. sakazakii CC4 pathogenicity, and propensity for neonatal CNS.
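In an MLST scheme, each isolate is summarised by the allele number observed at every locus, and each distinct combination (the allelic profile) maps to one sequence type. The following Python sketch illustrates that lookup for a seven-locus scheme. It is a simplified illustration, not the PubMLST implementation: the locus names are placeholders, and the allele numbers and sequence-type numbers are invented.

```python
from typing import Dict, Optional, Tuple

# Placeholder names for the seven loci (assumption, not the real scheme's names).
LOCI = ("locus1", "locus2", "locus3", "locus4", "locus5", "locus6", "locus7")

# Hypothetical profile table: allelic profile -> sequence type number.
PROFILES: Dict[Tuple[int, ...], int] = {
    (1, 1, 1, 1, 1, 1, 1): 1,
    (5, 1, 3, 3, 5, 5, 4): 2,
    (1, 1, 1, 1, 1, 1, 9): 3,
}


def assign_sequence_type(
    alleles: Dict[str, int],
    profiles: Dict[Tuple[int, ...], int] = PROFILES,
) -> Optional[int]:
    """Return the sequence type for an isolate, or None for an unseen profile."""
    missing = [locus for locus in LOCI if locus not in alleles]
    if missing:
        raise ValueError(f"missing allele calls for: {', '.join(missing)}")
    profile = tuple(alleles[locus] for locus in LOCI)
    return profiles.get(profile)


known_isolate = dict(zip(LOCI, (5, 1, 3, 3, 5, 5, 4)))
novel_isolate = dict(zip(LOCI, (5, 1, 3, 3, 5, 5, 7)))

known_st = assign_sequence_type(known_isolate)
novel_st = assign_sequence_type(novel_isolate)
```

A profile that differs at even one locus is a different sequence type, which is why an unseen combination returns `None` here; in a curated database such a profile would typically be submitted for assignment of a new number.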
The genetic architecture of type 2 diabetes.
Fuchsberger, Christian; Flannick, Jason; Teslovich, Tanya M; Mahajan, Anubha; Agarwala, Vineeta; Gaulton, Kyle J; Ma, Clement; Fontanillas, Pierre; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Denis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; Griswold, Michael; 
Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; van der Schouw, Yvonne T; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeriya; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana C N; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; 
Pollin, Toni I; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Burtt, Noël P; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Florez, Jose C; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Boehnke, Michael; Altshuler, David; McCarthy, Mark I
2016-08-04
The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of the heritability of this disease. Here, to test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole-genome sequencing in 2,657 European individuals with and without diabetes, and exome sequencing in 12,940 individuals from five ancestry groups. To increase statistical power, we expanded the sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support the idea that lower-frequency variants have a major role in predisposition to type 2 diabetes.
The genetic architecture of type 2 diabetes
Ma, Clement; Fontanillas, Pierre; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Denis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, 
Wei Yen; Liu, Jianjun; van der Schouw, Yvonne T; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeriya; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana C N; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, 
Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Burtt, Noël P; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Florez, Jose C; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Boehnke, Michael; Altshuler, David; McCarthy, Mark I
2016-01-01
The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of heritability. To test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole genome sequencing in 2,657 Europeans with and without diabetes, and exome sequencing in a total of 12,940 subjects from five ancestral groups. To increase statistical power, we expanded sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support a major role for lower-frequency variants in predisposition to type 2 diabetes. PMID:27398621
A RESTful application programming interface for the PubMLST molecular typing and genome databases
Bray, James E.; Maiden, Martin C. J.
2017-01-01
Molecular typing is used to differentiate microorganisms at the subspecies or strain level for epidemiological investigations, infection control, public health and environmental sampling. DNA sequence-based typing methods require authoritative databases that link sequence variants to nomenclature in order to facilitate communication and comparison of identified types in national or global settings. The PubMLST website (https://pubmlst.org/) fulfils this role for over a hundred microorganisms for which it hosts curated molecular sequence typing data, providing sequence and allelic profile definitions for multi-locus sequence typing (MLST) and single-gene typing approaches. In recent years, these have expanded to cover the whole genome with schemes such as core genome MLST (cgMLST) and whole genome MLST (wgMLST) which catalogue the allelic diversity found in hundreds to thousands of genes. These approaches provide a common nomenclature for high-resolution strain characterization and comparison. Molecular typing information is linked to isolate provenance, phenotype, and increasingly genome assemblies, providing a resource for outbreak investigation and research into population structure, gene association, global epidemiology and vaccine coverage. A Representational State Transfer (REST) Application Programming Interface (API) has been developed for the PubMLST website to make these large quantities of structured molecular typing and whole genome sequence data available for programmatic access by any third-party application. The API is an integral component of the Bacterial Isolate Genome Sequence Database (BIGSdb) platform that is used to host PubMLST resources, and exposes all public data within the site. In addition to data browsing, searching and download, the API supports authentication and submission of new data to curator queues. Database URL: http://rest.pubmlst.org/ PMID:29220452
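As a rough illustration of the programmatic access described in this abstract, the following Python sketch builds resource URLs under the database URL quoted above and fetches a JSON document. Only the root URL comes from the abstract; the helper names (build_url, fetch_json) and any path segments passed to them are assumptions made for illustration, and the actual routes should be taken from the API's own documentation.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Root URL as quoted in the abstract; everything below it is an assumption.
BASE_URL = "http://rest.pubmlst.org"


def build_url(*segments, base=BASE_URL):
    """Join path segments onto the API root, percent-encoding each one."""
    root = base.rstrip("/")
    if not segments:
        return root + "/"
    path = "/".join(quote(str(s), safe="") for s in segments)
    return f"{root}/{path}"


def fetch_json(url, timeout=30):
    """GET a resource and decode its JSON body (needs network access)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # "db" is a hypothetical path segment used only to show the call shape.
    print(build_url("db"))
```

Keeping URL construction separate from the network call means the path logic can be checked offline, while fetch_json is only invoked when a live connection is available.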
Detection of distorted frames in retinal video-sequences via machine learning
NASA Astrophysics Data System (ADS)
Kolar, Radim; Liberdova, Ivana; Odstrcilik, Jan; Hracho, Michal; Tornow, Ralf P.
2017-07-01
This paper describes the detection of distorted frames in retinal video sequences based on a set of global features extracted from each frame. The feature vector is subsequently used in a classification step, in which three types of classifiers are tested. The best classification accuracy, 96%, was achieved with the support vector machine approach.
NASA Technical Reports Server (NTRS)
1981-01-01
Developments in numerical solution of certain types of partial differential equations by rapidly converging sequences of operations on supporting grids that range from very fine to very coarse are presented.
Nyakatura, John A; Heymann, Eckhard W
2010-03-01
The adoption of a specific gait sequence pattern during symmetrical locomotion has been proposed to have been a key advantage for the exploitation of the fine branch niche in early primates. Diverse aspects of primate locomotion have been extensively studied in technically equipped laboratory settings, but evolutionary conclusions derived from these investigations have rarely been verified in wild primates. Bridging the gap from the lab to the field, we conducted an actual performance determination of symmetrical gaits in two free-ranging tamarin species (Saguinus mystax and Saguinus fuscicollis) of Amazonian Peru by analyzing high-speed video recordings of naturally occurring locomotor bouts. Tamarins arguably represent viable models for aspects of early primate locomotion. We tested three specific hypotheses derived from laboratory studies to test for the influence of support size and orientation and to gain further insight into the functional significance of primate gait sequence patterns: (1) The tamarins utilize symmetrical gaits at a higher rate on small supports than on larger ones. (2) During symmetrical locomotion on small supports, diagonal sequences are utilized at a higher rate than on larger supports. (3) On inclines, diagonal sequences are predominantly used and on declines, lateral sequences are predominantly used. Our results corroborated hypotheses 1 and 3. We found no clear support for hypothesis 2. In conclusion, our results add to the notion that primate gait plasticity, rather than uniform adoption of diagonal sequence gaits, enabled early primates to accommodate different support types and effectively exploit the small branch niche. Copyright 2009 Elsevier Ltd. All rights reserved.
New steroid 5alpha-reductase type I (SRD5A1) homologous sequences on human chromosomes 6 and 8.
Eminović, I; Liović, M; Prezelj, J; Kocijancic, A; Rozman, D; Komel, R
2001-01-01
To date, two genes encoding 5alpha-reductase isoenzymes are known (type I, type II), and one type I pseudogene. The divergent localization of these genes and the still not fully understood function of the encoded enzymes, as well as the perplexing results we obtained after sequencing PCR-amplified SRD5A1 gene fragments (out of genomic DNA), made us assume that, in addition to the known SRD5A1 gene, one or more different human 5alpha-reductase type I coding genes may exist. Our research provides the first evidence for the existence of two new SRD5A1-related, previously unidentified sequences in the human genome. These sequences, which were localized to chromosomes 6 and 8, are highly homologous (> 99%) to SRD5A1, and also do not contain any deletions or insertions that are otherwise a characteristic of the SRD5AP1 pseudogene. Our results imply that these sequences may be either coding parts of yet unknown, active SRD5A1 genes, and/or of previously unidentified pseudogenes. These findings additionally support data of Chen et al. who confirmed the existence of various SRD5A1 proteins in cultured human skin cells.
Diversity of the Cronobacter Genus as Revealed by Multilocus Sequence Typing
Joseph, S.; Sonbol, H.; Hariri, S.; Desai, P.; McClelland, M.
2012-01-01
Cronobacter (previously known as Enterobacter sakazakii) is a diverse bacterial genus consisting of seven species: C. sakazakii, C. malonaticus, C. turicensis, C. universalis, C. muytjensii, C. dublinensis, and C. condimenti. In this study, we have used a multilocus sequence typing (MLST) approach employing the alleles of 7 genes (atpD, fusA, glnS, gltB, gyrB, infB, and ppsA; total length, 3,036 bp) to investigate the phylogenetic relationship of 325 Cronobacter species isolates. Strains were chosen on the basis of their species, geographic and temporal distribution, source, and clinical outcome. The earliest strain was isolated from milk powder in 1950, and the earliest clinical strain was isolated in 1953. The existence of seven species was supported by MLST. Intraspecific variation ranged from low diversity in C. sakazakii to extensive diversity within some species, such as C. muytjensii and C. dublinensis, including evidence of gene conversion between species. The predominant species from clinical sources was found to be C. sakazakii. C. sakazakii sequence type 4 (ST4) was the predominant sequence type of cerebral spinal fluid isolates from cases of meningitis. PMID:22785185
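The seven-locus MLST scheme described in this abstract reduces each isolate to an allelic profile, one allele number per locus, and a sequence type (ST) is the label attached to a particular profile. A minimal Python sketch of that lookup follows. The locus names are taken from the abstract; the allele numbers and ST labels in the usage example are invented toy values, not real Cronobacter definitions, and the function names are hypothetical.

```python
# Locus order follows the seven genes named in the abstract.
LOCI = ("atpD", "fusA", "glnS", "gltB", "gyrB", "infB", "ppsA")


def profile_key(alleles):
    """Turn a {locus: allele number} mapping into an ordered profile tuple."""
    missing = [locus for locus in LOCI if locus not in alleles]
    if missing:
        raise ValueError(f"missing loci: {', '.join(missing)}")
    return tuple(int(alleles[locus]) for locus in LOCI)


def assign_st(alleles, scheme):
    """Return the ST for an exact profile match, or None if it is novel."""
    return scheme.get(profile_key(alleles))


if __name__ == "__main__":
    # Toy scheme: profiles and ST labels are made up for illustration only.
    toy_scheme = {
        (1, 1, 1, 1, 1, 1, 1): "ST-toy-1",
        (1, 2, 1, 3, 1, 1, 5): "ST-toy-2",
    }
    isolate = {"atpD": 1, "fusA": 2, "glnS": 1, "gltB": 3,
               "gyrB": 1, "infB": 1, "ppsA": 5}
    print(assign_st(isolate, toy_scheme))
```

An exact-match dictionary lookup mirrors how an ST is defined; a profile that returns None would, in practice, be a candidate for submission as a new type.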
DNA methylation assessment from human slow- and fast-twitch skeletal muscle fibers
Begue, Gwénaëlle; Raue, Ulrika; Jemiolo, Bozena
2017-01-01
A new application of the reduced representation bisulfite sequencing method was developed using low-DNA input to investigate the epigenetic profile of human slow- and fast-twitch skeletal muscle fibers. Successful library construction was completed with as little as 15 ng of DNA, and high-quality sequencing data were obtained with 32 ng of DNA. Analysis identified 143,160 differentially methylated CpG sites across 14,046 genes. In both fiber types, selected genes predominantly expressed in slow or fast fibers were hypomethylated, which was supported by the RNA-sequencing analysis. These are the first fiber type-specific methylation data from human skeletal muscle and provide a unique platform for future research. NEW & NOTEWORTHY This study validates a low-DNA input reduced representation bisulfite sequencing method for human muscle biopsy samples to investigate the methylation patterns at a fiber type-specific level. These are the first fiber type-specific methylation data reported from human skeletal muscle and thus provide initial insight into basal state differences in myosin heavy chain I and IIa muscle fibers among young, healthy men. PMID:28057818
Liu, Q; Yong, C B; Astell, C R
1994-06-01
Previous characterization of the terminal sequences of the minute virus of mice (MVM) genome demonstrated that the right hand palindrome contains two sequences, each the inverted complement of the other. However, the left hand palindrome was shown to exist as a unique sequence [Astell et al., J. Virol. 54: 179-185 (1985)]. The modified rolling hairpin (MRH) model for MVM replication provided an explanation of how the right hand palindrome could undergo hairpin transfer to generate two sequences, while the left end palindrome within the dimer bridge could undergo asymmetric resolution and retain the unique left end sequence. This report describes in vitro resolution of the wild-type dimer bridge sequence of MVM using recombinant (baculovirus) expressed NS-1 and a replication extract from LA9 cells. The resolution products are consistent with those predicted by the MRH model, providing support for this replication mechanism. In addition, mutant dimer bridge clones were constructed and used in the resolution assay. The mutant structures included removal of the asymmetry in the hairpin stem, inversion of the sequence at the initiating nick site, and a 2-bp deletion within one stem of the dimer bridge. In all cases, the mutant dimer bridge structures are resolved; however, the resolution pattern observed with the mutant dimer bridge compared with the wild-type dimer bridge is shifted toward symmetrical resolution. These results suggest that sequences within the left hand hairpin (and hence dimer bridge sequence) are responsible for asymmetric resolution and conservation of the unique sequence within the left hand palindrome of the MVM genome.
Campbell, Matthew A; Alfaro, Michael E; Belasco, Max; López, J Andrés
2017-01-01
Phylogenetic inference based on evidence from DNA sequences has led to significant strides in the development of a stable and robustly supported framework for the vertebrate tree of life. To date, the bulk of those advances have relied on sequence data from a small number of genome regions that have proven unable to produce satisfactory answers to consistently recalcitrant phylogenetic questions. Here, we re-examine phylogenetic relationships among early-branching euteleostean fish lineages classically grouped in the Protacanthopterygii using DNA sequence data surrounding ultraconserved elements. We report and examine a dataset of thirty-four OTUs with 17,957 aligned characters from fifty-three nuclear loci. Phylogenetic analysis is conducted in concatenated, joint gene trees and species tree estimation and summary coalescent frameworks. All analytical frameworks yield supporting evidence for existing hypotheses of relationship for the placement of Lepidogalaxias salamandroides, monophyly of the Stomiatii and the presence of an esociform + salmonid clade. Lepidogalaxias salamandroides and the Esociformes + Salmoniformes are successive sister lineages to all other euteleosts in the majority of analyses. The concatenated and joint gene trees and species tree analysis types produce high support values for this arrangement. However, inter-relationships of Argentiniformes, Stomiatii and Neoteleostei remain uncertain as they varied by analysis type while receiving strong and contradictory indices of support. Topological differences between analysis types are also apparent within the otomorph and the percomorph taxa in the data set. Our results identify concordant areas with strong support for relationships within and between early-branching euteleost lineages but they also reveal limitations in the ability of larger datasets to conclusively resolve other aspects of that phylogeny.
Alfaro, Michael E.; Belasco, Max; López, J. Andrés
2017-01-01
Phylogenetic inference based on evidence from DNA sequences has led to significant strides in the development of a stable and robustly supported framework for the vertebrate tree of life. To date, the bulk of those advances have relied on sequence data from a small number of genome regions that have proven unable to produce satisfactory answers to consistently recalcitrant phylogenetic questions. Here, we re-examine phylogenetic relationships among early-branching euteleostean fish lineages classically grouped in the Protacanthopterygii using DNA sequence data surrounding ultraconserved elements. We report and examine a dataset of thirty-four OTUs with 17,957 aligned characters from fifty-three nuclear loci. Phylogenetic analysis is conducted in concatenated, joint gene trees and species tree estimation and summary coalescent frameworks. All analytical frameworks yield supporting evidence for existing hypotheses of relationship for the placement of Lepidogalaxias salamandroides, monophyly of the Stomiatii and the presence of an esociform + salmonid clade. Lepidogalaxias salamandroides and the Esociformes + Salmoniformes are successive sister lineages to all other euteleosts in the majority of analyses. The concatenated and joint gene trees and species tree analysis types produce high support values for this arrangement. However, inter-relationships of Argentiniformes, Stomiatii and Neoteleostei remain uncertain as they varied by analysis type while receiving strong and contradictory indices of support. Topological differences between analysis types are also apparent within the otomorph and the percomorph taxa in the data set. Our results identify concordant areas with strong support for relationships within and between early-branching euteleost lineages but they also reveal limitations in the ability of larger datasets to conclusively resolve other aspects of that phylogeny. PMID:28929008
Genetic recombination of tick-borne flaviviruses among wild-type strains.
Norberg, Peter; Roth, Anette; Bergström, Tomas
2013-06-05
Genetic recombination has been suggested to occur in mosquito-borne flaviviruses. In contrast, tick-borne flaviviruses have been thought to evolve in a clonal manner, although recent studies suggest that recombination also occurs in these viruses. We re-analyzed the data and found that previous conclusions on wild-type recombination were probably falsely drawn due to misalignments of nucleotide sequences, ambiguities in GenBank sequences, or different laboratory culture histories suggestive of recombination events in the laboratory. To evaluate if reliable predictions of wild-type recombination of tick-borne flaviviruses can be made, we analyzed viral strains sequenced exclusively for this study, and other flavivirus sequences retrieved from GenBank. We detected genetic signals supporting recombination between viruses within the three clades of TBEV-Eu, TBEV-Sib and TBEV-Fe, respectively. Our results suggest that the tick-borne encephalitis viruses may undergo recombination under natural conditions, but that geographic barriers restrict most recombination events to involve only closely genetically related viruses. Copyright © 2013 Elsevier Inc. All rights reserved.
Cell type-specific termination of transcription by transposable element sequences.
Conley, Andrew B; Jordan, I King
2012-09-30
Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human genes across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. 
The extent of transcription termination by TEs seen here, along with the preference for sense-oriented TE insertions to provide TTS, is consistent with the observed antisense orientation bias of human TEs.
Horn, T; Chang, C A; Urdea, M S
1997-12-01
The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays.
Horn, T; Chang, C A; Urdea, M S
1997-01-01
The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays. PMID:9365265
Doroghazi, J. R.; Ju, K.-S.; Metcalf, W. W.
2014-01-01
In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T forms a cluster with five other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these other species, including Streptomyces almquistii NRRL B-1685T, Streptomyces flocculus NRRL B-2465T, Streptomyces gibsonii NRRL B-1335T and Streptomyces rangoonensis NRRL B-12378T are quite similar. This cluster is of particular taxonomic interest because Streptomyces albus is the type species of the genus Streptomyces. The related strains were subjected to multilocus sequence analysis (MLSA) utilizing partial sequences of the housekeeping genes atpD, gyrB, recA, rpoB and trpB and confirmation of previously reported phenotypic characteristics. The five strains formed a coherent cluster supported by a 100 % bootstrap value in phylogenetic trees generated from sequence alignments prepared by concatenating the sequences of the housekeeping genes, and identical tree topology was observed using various different tree-making algorithms. Moreover, all but one strain, S. flocculus NRRL B-2465T, exhibited identical sequences for all of the five housekeeping gene loci sequenced, but NRRL B-2465T still exhibited an MLSA evolutionary distance of 0.005 from the other strains, a value that is lower than the 0.007 MLSA evolutionary distance threshold proposed for species-level relatedness. These data support a proposal to reclassify S. almquistii, S. flocculus, S. gibsonii and S. rangoonensis as later heterotypic synonyms of S. albus with NRRL B-1811T as the type strain. The MLSA sequence database also demonstrated utility for quickly and conclusively confirming that numerous strains within the ARS Culture Collection had been previously misidentified as subspecies of S. albus and that Streptomyces albus subsp. 
pathocidicus should be redescribed as a novel species, Streptomyces pathocidini sp. nov., with the type strain NRRL B-24287T. PMID:24277863
Desoubeaux, Guillaume; Debourgogne, Anne; Wiederhold, Nathan P; Zaffino, Marie; Sutton, Deanna; Burns, Rachel E; Frasca, Salvatore; Hyatt, Michael W; Cray, Carolyn
2018-07-01
Fusarium spp. are saprobic moulds that are responsible for severe opportunistic infections in humans and animals. However, we need epidemiological tools to reliably trace the circulation of such fungal strains within medical or veterinary facilities, to recognize environmental contaminations that might lead to infection and to improve our understanding of factors responsible for the onset of outbreaks. In this study, we used molecular genotyping to investigate clustered cases of Fusarium solani species complex (FSSC) infection that occurred in eight Sphyrnidae sharks under managed care at a public aquarium. Genetic relationships between fungal strains were determined by multi-locus sequence typing (MLST) analysis based on DNA sequencing at five loci, followed by comparison with sequences of 50 epidemiologically unrelated FSSC strains. Our genotyping approach revealed that F. keratoplasticum and F. solani haplotype 9x were most commonly isolated. In one case, the infection proved to be with another rare Hypocrealian opportunistic pathogen, Metarhizium robertsii. Twice, sharks proved to be infected with FSSC strains with the same MLST sequence type, supporting the hypothesis that common environmental populations of fungi existed for these sharks and suggesting the long-term persistence of the two clonal strains within the environment, perhaps in holding pools and life support systems of the aquarium. This study highlights how molecular tools like MLST can be used to investigate outbreaks of microbiological disease. This work reinforces the need for regular controls of water quality to reduce microbiological contamination due to waterborne microorganisms.
Applications of alignment-free methods in epigenomics.
Pinello, Luca; Lo Bosco, Giosuè; Yuan, Guo-Cheng
2014-05-01
Epigenetic mechanisms play an important role in the regulation of cell type-specific gene activities, yet how epigenetic patterns are established and maintained remains poorly understood. Recent studies have supported a role of DNA sequences in recruitment of epigenetic regulators. Alignment-free methods have been applied to identify distinct sequence features that are associated with epigenetic patterns and to predict epigenomic profiles. Here, we review recent advances in such applications, including the methods to map DNA sequence to feature space, sequence comparison and prediction models. Computational studies using these methods have provided important insights into the epigenetic regulatory mechanisms.
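One common way to map a DNA sequence to feature space without alignment, as this review discusses in general terms, is to count k-mers and compare the resulting frequency vectors. The Python sketch below illustrates that idea only; the choice of k, the normalisation and the Euclidean distance are assumptions made for the example and are not claimed to be the specific methods covered by the review.

```python
import math
from collections import Counter
from itertools import product


def kmer_vector(seq, k=2):
    """Relative frequency of every A/C/G/T k-mer, in lexicographic order."""
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    # k-mers containing other symbols (e.g. N) are ignored in the total.
    total = sum(counts[m] for m in kmers)
    return [counts[m] / total if total else 0.0 for m in kmers]


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


if __name__ == "__main__":
    a = kmer_vector("ACGTACGTAC")
    b = kmer_vector("TTTTTTGGGG")
    print(round(euclidean(a, b), 3))
```

Because every sequence maps to a vector of the same length (4**k entries), sequences of different lengths can be compared directly or passed to a downstream prediction model.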
Kämpfer, Peter; Falsen, Enevold; Busse, Hans-Jürgen
2008-01-01
Pseudomonas mephitica CCUG 2513(T) has been reinvestigated to clarify its taxonomic position. 16S rRNA gene sequence comparisons demonstrated that this strain clusters phylogenetically closely with Janthinobacterium lividum (99.8% sequence similarity to the type strain). Investigation of fatty acid patterns, polar lipid profiles, polyamine patterns and quinone systems supported this delineation. Substrate utilization profiles and biochemical characteristics displayed no differences from the type strain of J. lividum, CCUG 2344(T). Therefore, the reclassification of Pseudomonas mephitica as a later heterotypic synonym of Janthinobacterium lividum is proposed, based upon the estimated phylogenetic position derived from 16S rRNA gene sequence data and chemotaxonomic and biochemical data.
Layer, Franziska; Sanchini, Andrea; Strommenger, Birgit; Cuny, Christiane; Breier, Ann-Christin; Proquitté, Hans; Bührer, Christoph; Schenkel, Karl; Bätzing-Feigenbaum, Jörg; Greutelaers, Benedikt; Nübel, Ulrich; Gastmeier, Petra; Eckmanns, Tim; Werner, Guido
2015-10-01
Outbreaks of Staphylococcus aureus are common in neonatal intensive care units (NICUs). Usually they are documented for methicillin-resistant strains, while reports involving methicillin-susceptible S. aureus (MSSA) strains are rare. In this study we report the epidemiological and molecular investigation of an MSSA outbreak in a NICU among preterm neonates. Infection control measures and interventions were commissioned by the Local Public Health Authority and supported by the Robert Koch Institute. To support epidemiological investigations, molecular typing was done by spa-typing and multilocus sequence typing; the relatedness of collected isolates was further elucidated by DNA SmaI-macrorestriction, microarray analysis and bacterial whole genome sequencing. A total of 213 neonates, 123 healthcare workers and 205 neonate parents were analyzed in the period November 2011 to November 2012. The outbreak strain was characterized as an MSSA spa-type t021, able to produce toxic shock syndrome toxin-1 and Enterotoxin A. We identified seventeen neonates (of which two died from toxic shock syndrome), four healthcare workers and three parents putatively involved in the outbreak. Whole-genome sequencing made it possible to exclude unrelated cases from the outbreak and to discuss the role of healthcare workers as a reservoir of S. aureus on the NICU. Genome comparisons also indicated the presence of the respective clone on the ward months before the first colonized/infected neonates were detected. Copyright © 2015 Elsevier GmbH. All rights reserved.
Anonymization of electronic medical records for validating genome-wide association studies
Loukides, Grigorios; Gkoulalas-Divanis, Aris; Malin, Bradley
2010-01-01
Genome-wide association studies (GWAS) facilitate the discovery of genotype–phenotype relations from population-based sequence databases, which is an integral facet of personalized medicine. The increasing adoption of electronic medical records allows large amounts of patients’ standardized clinical features to be combined with the genomic sequences of these patients and shared to support validation of GWAS findings and to enable novel discoveries. However, disseminating these data “as is” may lead to patient reidentification when genomic sequences are linked to resources that contain the corresponding patients’ identity information based on standardized clinical features. This work proposes an approach that provably prevents this type of data linkage and furnishes a result that helps support GWAS. Our approach automatically extracts potentially linkable clinical features and modifies them in a way that they can no longer be used to link a genomic sequence to a small number of patients, while preserving the associations between genomic sequences and specific sets of clinical features corresponding to GWAS-related diseases. Extensive experiments with real patient data derived from the Vanderbilt University Medical Center verify that our approach generates data that eliminate the threat of individual reidentification, while supporting GWAS validation and clinical case analysis tasks. PMID:20385806
Angiuoli, Samuel V.; White, James R.; Matalka, Malcolm; White, Owen; Fricke, W. Florian
2011-01-01
Background: The widespread popularity of genomic applications is threatened by the “bioinformatics bottleneck” resulting from uncertainty about the cost and infrastructure needed to meet increasing demands for next-generation sequence analysis. Cloud computing services have been discussed as potential new bioinformatics support systems but have not been evaluated thoroughly. Results: We present benchmark costs and runtimes for common microbial genomics applications, including 16S rRNA analysis, microbial whole-genome shotgun (WGS) sequence assembly and annotation, WGS metagenomics and large-scale BLAST. Sequence dataset types and sizes were selected to correspond to outputs typically generated by small- to midsize facilities equipped with 454 and Illumina platforms, except for WGS metagenomics where sampling of Illumina data was used. Automated analysis pipelines, as implemented in the CloVR virtual machine, were used in order to guarantee transparency, reproducibility and portability across different operating systems, including the commercial Amazon Elastic Compute Cloud (EC2), which was used to attach real dollar costs to each analysis type. We found considerable differences in computational requirements, runtimes and costs associated with different microbial genomics applications. While all 16S analyses completed on a single-CPU desktop in under three hours, microbial genome and metagenome analyses utilized multi-CPU support of up to 120 CPUs on Amazon EC2, where each analysis completed in under 24 hours for less than $60. Representative datasets were used to estimate maximum data throughput on different cluster sizes and to compare costs between EC2 and comparable local grid servers.
Conclusions: Although bioinformatics requirements for microbial genomics depend on dataset characteristics and the analysis protocols applied, our results suggest that smaller sequencing facilities (up to three Roche/454 or one Illumina GAIIx sequencer) invested in 16S rRNA amplicon sequencing, microbial single-genome and metagenomics WGS projects can achieve cost-efficient bioinformatics support using CloVR in combination with Amazon EC2 as an alternative to local computing centers. PMID:22028928
Hayat, Maqsood; Khan, Asifullah
2011-02-21
Membrane proteins are a vital type of protein that serve as channels, receptors, and energy transducers in a cell. Prediction of membrane protein types is an important research area in bioinformatics. Knowledge of membrane protein types provides valuable information for predicting novel examples of the membrane protein types. However, classification of membrane protein types can be both time consuming and susceptible to errors due to the inherent similarity of membrane protein types. In this paper, a neural network-based membrane protein type prediction system is proposed. Composite protein sequence representation (CPSR) is used to extract the features of a protein sequence, which includes seven feature sets: amino acid composition, sequence length, 2-gram exchange group frequency, hydrophobic group, electronic group, sum of hydrophobicity, and R-group. Principal component analysis is then employed to reduce the dimensionality of the feature vector. The probabilistic neural network (PNN), generalized regression neural network, and support vector machine (SVM) are used as classifiers. A high success rate of 86.01% is obtained using SVM for the jackknife test. In the case of the independent dataset test, PNN yields the highest accuracy of 95.73%. These classifiers exhibit improved performance using other performance measures such as sensitivity, specificity, Matthews correlation coefficient, and F-measure. The experimental results show that the prediction performance of the proposed scheme for classifying membrane protein types is the best reported so far. This performance improvement may largely be credited to the learning capabilities of neural networks and the composite feature extraction strategy, which exploits seven different properties of protein sequences. The proposed Mem-Predictor can be accessed at http://111.68.99.218/Mem-Predictor. Copyright © 2010 Elsevier Ltd. All rights reserved.
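The first two of the seven feature sets named in the abstract above (amino acid composition and sequence length) can be illustrated with a minimal Python sketch. This is not the authors' CPSR implementation: the function names, the 20-letter residue alphabet constant, and the choice to ignore non-standard residues are all assumptions made for illustration.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed alphabet).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Return the fraction of each standard residue, in AMINO_ACIDS order.

    Non-standard characters (e.g. X, B, Z) are ignored; this is an
    assumption, not something stated in the abstract.
    """
    seq = sequence.upper()
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [counts[aa] / total for aa in AMINO_ACIDS]


def feature_vector(sequence):
    """Hypothetical 21-dimensional vector: composition plus sequence length."""
    return amino_acid_composition(sequence) + [float(len(sequence))]
```

A vector built this way would then be passed through a dimensionality reduction step and a classifier, as the abstract describes; those stages are left out here because the abstract gives no parameters for them.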
Population Structure in Nontypeable Haemophilus influenzae
LaCross, Nathan C.; Marrs, Carl F.; Gilsdorf, Janet R.
2013-01-01
Nontypeable Haemophilus influenzae (NTHi) frequently colonize the human pharynx asymptomatically, and are an important cause of otitis media in children. Past studies have identified typeable H. influenzae as being clonal, but the population structure of NTHi has not been extensively characterized. The research presented here investigated the diversity and population structure in a well-characterized collection of NTHi isolated from the middle ears of children with otitis media or the pharynges of healthy children in three disparate geographic regions. Multilocus sequence typing identified 109 unique sequence types among 170 commensal and otitis media-associated NTHi isolates from Finland, Israel, and the US. The largest clonal complex contained only five sequence types, indicating a high level of genetic diversity. The eBURST v3, ClonalFrame 1.1, and structure 2.3.3 programs were used to further characterize diversity and population structure from the sequence typing data. Little clustering was apparent by either disease state (otitis media or commensalism) or geography in the ClonalFrame phylogeny. Population structure was clearly evident, with support for eight populations when all 170 isolates were analyzed. Interestingly, one population contained only commensal isolates, while two others consisted solely of otitis media isolates, suggesting associations between population structure and disease. PMID:23266487
Pinho, Marcos D; Erol, Erdal; Ribeiro-Gonçalves, Bruno; Mendes, Catarina I; Carriço, João A; Matos, Sandra C; Preziuso, Silvia; Luebke-Becker, Antina; Wieler, Lothar H; Melo-Cristino, Jose; Ramirez, Mario
2016-08-17
The pathogenic role of beta-hemolytic Streptococcus dysgalactiae in the equine host is increasingly recognized. A collection of 108 Lancefield group C (n = 96) or L (n = 12) horse isolates recovered in the United States and in three European countries presented multilocus sequence typing (MLST) alleles, sequence types and emm types (only 56% of the isolates could be emm typed) that were, with few exceptions, distinct from those previously found in human Streptococcus dysgalactiae subsp. equisimilis. Characterization of a subset of horse isolates by multilocus sequence analysis (MLSA) and 16S rRNA gene sequence showed that most equine isolates could also be differentiated from S. dysgalactiae strains from other animal species, supporting the existence of a horse-specific genomovar. Draft genome information confirms the distinctiveness of the horse genomovar and indicates the presence of potentially horse-specific virulence factors. While this genomovar represents most of the isolates recovered from horses, a smaller MLST- and MLSA-defined sub-population seems to be able to cause infections in horses, other animals and humans, indicating that transmission between hosts of strains belonging to this group may occur.
Phillips, Anastasia; Sotomayor, Cristina; Wang, Qinning; Holmes, Nadine; Furlong, Catriona; Ward, Kate; Howard, Peter; Octavia, Sophie; Lan, Ruiting; Sintchenko, Vitali
2016-09-15
Salmonella Typhimurium (STM) is an important cause of foodborne outbreaks worldwide. Subtyping of STM remains critical to outbreak investigation, yet current techniques (e.g. multilocus variable number tandem repeat analysis, MLVA) may provide insufficient discrimination. Whole genome sequencing (WGS) offers potentially greater discriminatory power to support infectious disease surveillance. We performed WGS on 62 STM isolates of a single, endemic MLVA type associated with two epidemiologically independent, food-borne outbreaks along with sporadic cases in New South Wales, Australia, during 2014. Genomes of case and environmental isolates were sequenced using HiSeq (Illumina) and the genetic distance between them was assessed by single nucleotide polymorphism (SNP) analysis. SNP analysis was compared to the epidemiological context. The WGS analysis supported epidemiological evidence and genomes of within-outbreak isolates were nearly identical. Sporadic cases differed from outbreak cases by a small number of SNPs, although their close relationship to outbreak cases may represent an unidentified common food source that may warrant further public health follow up. Previously unrecognised mini-clusters were detected. WGS of STM can discriminate foodborne community outbreaks within a single endemic MLVA clone. Our findings support the translation of WGS into public health laboratory surveillance of salmonellosis.
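The genetic distance by SNP analysis mentioned in the abstract above can be illustrated with a small Python sketch. It assumes the genomes have already been reduced to equal-length aligned sequences and counts only positions where both isolates carry an unambiguous base; the function names and the handling of ambiguous calls are illustrative assumptions, not the pipeline used in the study.

```python
from itertools import combinations

UNAMBIGUOUS = set("ACGT")


def snp_distance(seq_a, seq_b):
    """Count aligned positions where two sequences carry different bases.

    Positions with a gap or ambiguity code (e.g. '-', 'N') in either
    sequence are skipped, which is an assumed convention.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for x, y in zip(seq_a.upper(), seq_b.upper())
        if x != y and x in UNAMBIGUOUS and y in UNAMBIGUOUS
    )


def pairwise_snp_distances(isolates):
    """Return {(name_a, name_b): distance} for every pair of isolates."""
    return {
        (a, b): snp_distance(isolates[a], isolates[b])
        for a, b in combinations(sorted(isolates), 2)
    }
```

With such a table, isolates from one outbreak would be expected to sit at very small distances from each other, in line with the abstract's report of nearly identical within-outbreak genomes; any numeric cut-off would have to come from the study itself.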
Genetics Home Reference: dermatofibrosarcoma protuberans
... part of a large molecule called type I collagen, which strengthens and supports many tissues in the ... the chimeric sequence formed by the fusion of collagen gene COL1A1 and the platelet derived growth factor ...
Robson, Nicole D.; Telesnitsky, Alice
2000-01-01
Retrovirus plus-strand synthesis is primed by a cleavage remnant of the polypurine tract (PPT) region of viral RNA. In this study, we tested replication properties for Moloney murine leukemia viruses with targeted mutations in the PPT and in conserved sequences upstream, as well as for pools of mutants with randomized sequences in these regions. The importance of maintaining some purine residues within the PPT was indicated both by examining the evolution of random PPT pools and from the replication properties of targeted mutants. Although many different PPT sequences could support efficient replication and one mutant that contained two differences in the core PPT was found to replicate as well as the wild type, some sequences in the core PPT clearly conferred advantages over others. Contributions of sequences upstream of the core PPT were examined with deletion mutants. A conserved T-stretch within the upstream sequence was examined in detail and found to be unimportant to helper functions. Evolution of virus pools containing randomized T-stretch sequences demonstrated marked preference for the wild-type sequence in six of its eight positions. These findings demonstrate that maintenance of the T-rich element is more important to viral replication than is maintenance of the core PPT. PMID:11044073
Tu, Bin; Masaberg, Carly; Hou, Lihua; Behm, Daniel; Brescia, Peter; Cha, Nuri; Kariyawasam, Kanthi; Lee, Jar How; Nong, Thoa; Sells, John; Tausch, Paul; Yang, Ruyan; Ng, Jennifer; Hurley, Carolyn Katovich
2017-02-01
Sanger-based DNA sequencing of exons 2+3 of HLA class I alleles from a heterozygote frequently results in two or more alternative genotypes. This study was undertaken to reduce the time and effort required to produce a single high resolution HLA genotype. Samples were typed in parallel by Sanger sequencing and oligonucleotide probe hybridization. This workflow, together with optimization of analysis software, was tested and refined during the typing of over 42,000 volunteers for an unrelated hematopoietic progenitor cell donor registry. Next generation DNA sequencing (NGS) was applied to over 1000 of these samples to identify the alleles present within the G group designations. Single genotypes at G level resolution were obtained for over 95% of the loci without additional assays. The vast majority of alleles identified (>99%) were the primary allele giving the G groups their name. Only 0.7% of the alleles identified encoded protein variants that were not detected by a focus on the antigen recognition domain (ARD)-encoding exons. Our combined method routinely provides biologically relevant typing resolution at the level of the ARD. It can be applied to both single samples or to large volume typing supporting either bone marrow or solid organ transplantation using technologies currently available in many HLA laboratories. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Candida ruelliae sp. nov., a novel yeast species isolated from flowers of Ruellia sp. (Acanthaceae).
Saluja, Puja; Prasad, Gandham S
2008-06-01
Two novel yeast strains designated as 16Q1 and 16Q3 were isolated from flowers of the Ruellia species of the Acanthaceae family. The D1/D2 domain and ITS sequences of these two strains were identical. Sequence analysis of the D1/D2 domain of large-subunit rRNA gene indicated their relationship to species of the Candida haemulonii cluster. However, they differ from C. haemulonii by 14% nucleotide sequence divergence, from Candida pseudohaemulonii by 16.1% and from C. haemulonii type II by 16.5%. These strains also differ in 18 physiological tests from the type strain of C. haemulonii, and 12 and 16 tests, respectively, from C. pseudohaemulonii and C. haemulonii type II. They also differ from C. haemulonii and other related species by more than 13% sequence divergence in the internal transcribed spacer region. In the SSU rRNA gene sequences, strain 16Q1 differs by 1.7% nucleotide divergence from C. haemulonii. Sporulation was not observed in pure or mixed cultures on several media examined. All these data support the assignment of these strains to a novel species; we have named them as Candida ruelliae sp. nov., and designate strain 16Q1(T)=MTCC 7739(T)=CBS10815(T) as type strain of the novel species.
Petzold, Markus; Prior, Karola; Moran-Gilad, Jacob; Harmsen, Dag; Lück, Christian
2017-01-01
Introduction: Whole genome sequencing (WGS) is increasingly used in Legionnaires’ disease (LD) outbreak investigations, owing to its higher resolution than sequence-based typing, the gold standard typing method for Legionella pneumophila, in the analysis of endemic strains. Recently, a gene-by-gene typing approach based on 1,521 core genes called core genome multilocus sequence typing (cgMLST) was described that enables a robust and standardised typing of L. pneumophila. Methods: We applied this cgMLST scheme to isolates obtained during the largest outbreak of LD reported so far in Germany. In this outbreak, the epidemic clone ST345 had been isolated from patients and four different environmental sources. In total 42 clinical and environmental isolates were retrospectively typed. Results: Epidemiologically unrelated ST345 isolates were clearly distinguishable from the epidemic clone. Remarkably, epidemic isolates split up into two distinct clusters, ST345-A and ST345-B, each respectively containing a mix of clinical and epidemiologically-related environmental samples. Discussion/conclusion: The outbreak was therefore likely caused by both variants of the single sequence type, which pre-existed in the environmental reservoirs. The two clusters differed by 40 alleles located in two neighbouring genomic regions of ca 42 and 26 kb. Additional analysis supported horizontal gene transfer of the two regions as responsible for the difference between the variants. Both regions comprise virulence genes and have previously been reported to be involved in recombination events. This corroborates the notion that genomic outbreak investigations should always take epidemiological information into consideration when making inferences. Overall, cgMLST proved helpful in disentangling the complex genomic epidemiology of the outbreak. PMID:29162202
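The gene-by-gene comparison behind a statement such as "the two clusters differed by 40 alleles" can be sketched in Python as follows. This is a generic illustration of counting allele differences between two cgMLST profiles, not the software used in the study; the profile representation (a dict from locus name to allele number, with None for a missing call) and the locus names in the usage note are assumptions.

```python
def allele_distance(profile_a, profile_b):
    """Count loci at which two allelic profiles carry different alleles.

    Only loci present in both profiles with a called allele are compared;
    loci with a missing call (None) in either profile are ignored, which
    is an assumed convention.
    """
    shared = set(profile_a) & set(profile_b)
    return sum(
        1
        for locus in shared
        if profile_a[locus] is not None
        and profile_b[locus] is not None
        and profile_a[locus] != profile_b[locus]
    )


def differing_loci(profile_a, profile_b):
    """Return the sorted names of the loci that differ between two profiles."""
    return sorted(
        locus
        for locus in set(profile_a) & set(profile_b)
        if profile_a[locus] is not None
        and profile_b[locus] is not None
        and profile_a[locus] != profile_b[locus]
    )
```

For example, `allele_distance({"locus1": 3, "locus2": 7}, {"locus1": 3, "locus2": 9})` counts one differing locus. Listing the differing loci, rather than only counting them, is what would allow differences to be mapped to genomic regions as the abstract describes.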
Petzold, Markus; Prior, Karola; Moran-Gilad, Jacob; Harmsen, Dag; Lück, Christian
2017-11-01
Introduction: Whole genome sequencing (WGS) is increasingly used in Legionnaires' disease (LD) outbreak investigations, owing to its higher resolution than sequence-based typing, the gold standard typing method for Legionella pneumophila, in the analysis of endemic strains. Recently, a gene-by-gene typing approach based on 1,521 core genes called core genome multilocus sequence typing (cgMLST) was described that enables a robust and standardised typing of L. pneumophila. Methods: We applied this cgMLST scheme to isolates obtained during the largest outbreak of LD reported so far in Germany. In this outbreak, the epidemic clone ST345 had been isolated from patients and four different environmental sources. In total 42 clinical and environmental isolates were retrospectively typed. Results: Epidemiologically unrelated ST345 isolates were clearly distinguishable from the epidemic clone. Remarkably, epidemic isolates split up into two distinct clusters, ST345-A and ST345-B, each respectively containing a mix of clinical and epidemiologically-related environmental samples. Discussion/conclusion: The outbreak was therefore likely caused by both variants of the single sequence type, which pre-existed in the environmental reservoirs. The two clusters differed by 40 alleles located in two neighbouring genomic regions of ca 42 and 26 kb. Additional analysis supported horizontal gene transfer of the two regions as responsible for the difference between the variants. Both regions comprise virulence genes and have previously been reported to be involved in recombination events. This corroborates the notion that genomic outbreak investigations should always take epidemiological information into consideration when making inferences. Overall, cgMLST proved helpful in disentangling the complex genomic epidemiology of the outbreak.
Mining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS) Data in Plants.
Taheri, Sima; Lee Abdullah, Thohirah; Yusop, Mohd Rafii; Hanafi, Mohamed Musa; Sahebi, Mahbod; Azizi, Parisa; Shamshiri, Redmond Ramin
2018-02-13
Microsatellites, or simple sequence repeats (SSRs), are one of the most informative and multi-purpose genetic markers exploited in plant functional genomics. However, the discovery of SSRs and development using traditional methods are laborious, time-consuming, and costly. Recently, the availability of high-throughput sequencing technologies has enabled researchers to identify a substantial number of microsatellites at less cost and effort than traditional approaches. Illumina is a noteworthy transcriptome sequencing technology that is currently used in SSR marker development. Although 454 pyrosequencing datasets can be used for SSR development, this type of sequencing is no longer supported. This review aims to present an overview of the next generation sequencing, with a focus on the efficient use of de novo transcriptome sequencing (RNA-Seq) and related tools for mining and development of microsatellites in plants.
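As a rough illustration of what mining microsatellites from sequence data involves, the Python sketch below scans a nucleotide sequence for perfect tandem repeats with motifs of 1 to 6 bases. It is not one of the tools covered by the review; the function name and the minimum repeat counts are assumed defaults chosen for the example.

```python
import re

# Assumed minimum number of repeat units per motif length (illustrative).
DEFAULT_MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(sequence, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    thresholds = DEFAULT_MIN_REPEATS if min_repeats is None else min_repeats
    seq = sequence.upper()
    hits = []
    for size, minimum in thresholds.items():
        # A motif of `size` bases followed by at least (minimum - 1) copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, minimum - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA",
            # "ATAT"); those runs are reported under the shorter motif,
            # if they meet its threshold.
            if any(
                motif == motif[:d] * (size // d)
                for d in range(1, size)
                if size % d == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // size))
    return sorted(hits)
```

In practice the flanking sequence around each hit is also needed for primer design, and compound or imperfect repeats need separate handling; both are outside this sketch.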
O'Hara, F. Patrick; Suaya, Jose A.; Ray, G. Thomas; Baxter, Roger; Brown, Megan L.; Mera, Robertino M.; Close, Nicole M.; Thomas, Elizabeth
2016-01-01
A number of molecular typing methods have been developed for characterization of Staphylococcus aureus isolates. The utility of these systems depends on the nature of the investigation for which they are used. We compared two commonly used methods of molecular typing, multilocus sequence typing (MLST) (and its clustering algorithm, Based Upon Related Sequence Type [BURST]) with the staphylococcal protein A (spa) typing (and its clustering algorithm, Based Upon Repeat Pattern [BURP]), to assess the utility of these methods for macroepidemiology and evolutionary studies of S. aureus in the United States. We typed a total of 366 clinical isolates of S. aureus by these methods and evaluated indices of diversity and concordance values. Our results show that, when combined with the BURP clustering algorithm to delineate clonal lineages, spa typing produces results that are highly comparable with those produced by MLST/BURST. Therefore, spa typing is appropriate for use in macroepidemiology and evolutionary studies and, given its lower implementation cost, this method appears to be more efficient. The findings are robust and are consistent across different settings, patient ages, and specimen sources. Our results also support a model in which the methicillin-resistant S. aureus (MRSA) population in the United States comprises two major lineages (USA300 and USA100), which each consist of closely related variants. PMID:26669861
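The abstract above does not say which indices of diversity were evaluated. One index widely used to compare the discriminatory power of typing methods is Simpson's index of diversity, and the Python sketch below computes it from a list of type assignments. The function name and the example labels in the usage note are illustrative, and the choice of this particular index is an assumption.

```python
from collections import Counter


def simpsons_index_of_diversity(type_assignments):
    """Simpson's index of diversity: D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)).

    `type_assignments` holds one type label per isolate (for example an
    MLST sequence type or a spa type). D is the probability that two
    isolates drawn at random without replacement have different types.
    """
    counts = Counter(type_assignments)
    n_total = sum(counts.values())
    if n_total < 2:
        raise ValueError("at least two isolates are required")
    same_type_pairs = sum(n * (n - 1) for n in counts.values())
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))
```

For four isolates split evenly between two types, such as `["t008", "t008", "t002", "t002"]`, the index is 1 - 4/12, or about 0.667; it reaches 1.0 when every isolate has a distinct type.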
O'Hara, F Patrick; Suaya, Jose A; Ray, G Thomas; Baxter, Roger; Brown, Megan L; Mera, Robertino M; Close, Nicole M; Thomas, Elizabeth; Amrine-Madsen, Heather
2016-01-01
A number of molecular typing methods have been developed for characterization of Staphylococcus aureus isolates. The utility of these systems depends on the nature of the investigation for which they are used. We compared two commonly used methods of molecular typing, multilocus sequence typing (MLST) (and its clustering algorithm, Based Upon Related Sequence Type [BURST]) with the staphylococcal protein A (spa) typing (and its clustering algorithm, Based Upon Repeat Pattern [BURP]), to assess the utility of these methods for macroepidemiology and evolutionary studies of S. aureus in the United States. We typed a total of 366 clinical isolates of S. aureus by these methods and evaluated indices of diversity and concordance values. Our results show that, when combined with the BURP clustering algorithm to delineate clonal lineages, spa typing produces results that are highly comparable with those produced by MLST/BURST. Therefore, spa typing is appropriate for use in macroepidemiology and evolutionary studies and, given its lower implementation cost, this method appears to be more efficient. The findings are robust and are consistent across different settings, patient ages, and specimen sources. Our results also support a model in which the methicillin-resistant S. aureus (MRSA) population in the United States comprises two major lineages (USA300 and USA100), which each consist of closely related variants.
Veldman, G M; Klootwijk, J; van Heerikhuizen, H; Planta, R J
1981-01-01
We have determined the nucleotide sequence of part of a cloned yeast ribosomal RNA operon extending from the 5.8S RNA gene downstream into the 5'-terminal region of the 26S RNA gene. We mapped the pertinent processing sites, viz. the 5' end of 26S rRNA and the 3' ends of 5.8S rRNA and its immediate precursor, 7S RNA. At the 3' end of 7S RNA we find the sequence UCGUUU which is very similar to the type I consensus sequence UCAUUA/U present at the 3' ends of 17S, 5.8S and 26S rRNA as well as 18S precursor rRNA in yeast. At the 5' end of the 26S RNA gene we find a sequence of thirteen nucleotides which is homologous to the type II sequence present at the 5' termini of both the 17S and the 5.8S RNA gene. These findings further support the suggestion put forward earlier (G.M. Veldman et al. (1980) Nucl. Acids Res. 8, 2907-2920) that both consensus sequences are involved in the recognition of precursor rRNA by the processing nuclease(s). We discuss a model for the processing of yeast rRNA in which a processing enzyme sequentially recognizes several combinations of a type I and a type II consensus sequence. We also describe the existence of a significant base complementarity between sequences in the 5'-terminal region of 26S rRNA and the 3'-terminal region of 5.8S rRNA. We suggest that base pairing between these sequences contributes to the binding between 5.8S and 26S rRNA. PMID:7312619
Mallika, V; Sivakumar, K C; Jaichand, S; Soniya, E V
2010-07-13
Type III polyketide synthases (PKS) are a family of proteins considered to have significant roles in the biosynthesis of various polyketides in plants, fungi and bacteria. As these proteins show positive effects on human health, they are the subject of increasing research. A tool that estimates the probability of a sequence being a type III polyketide synthase would reduce the time and manpower required. In this approach, we have designed and implemented PKSIIIpred, a high performance prediction server for type III PKS in which the classifier is a support vector machine (SVM). Based on the limited training dataset, the tool efficiently predicts the type III PKS superfamily of proteins with high sensitivity and specificity. PKSIIIpred is available at http://type3pks.in/prediction/. We expect that this tool may serve as a useful resource for type III PKS researchers. Work is currently in progress to further improve prediction accuracy by including more sequence features in the training dataset.
Yedavalli, Venkat R. K.; Chappey, Colombe; Ahmad, Nafees
1998-01-01
The vpr sequences from six human immunodeficiency virus type 1 (HIV-1)-infected mother-infant pairs following perinatal transmission were analyzed. We found that 153 of the 166 clones analyzed from uncultured peripheral blood mononuclear cell DNA samples contained intact vpr open reading frames, a frequency of 92.17%. There was a low degree of heterogeneity of vpr genes within mothers, within infants, and between epidemiologically linked mother-infant pairs. The distances between vpr sequences were greater in epidemiologically unlinked individuals than in epidemiologically linked mother-infant pairs. Moreover, the infants’ sequences displayed patterns similar to those seen in their mothers. The functional domains essential for Vpr activity, including virion incorporation, nuclear import, and cell cycle arrest and differentiation, were highly conserved in most of the sequences. Phylogenetic analyses of 166 mother-infant pairs and 195 other available vpr sequences from HIV databases formed distinct clusters for each mother-infant pair and for other vpr sequences and grouped the six mother-infant pairs’ sequences with subtype B sequences. A high degree of conservation of intact and functional vpr supports the notion that vpr plays an important role in HIV-1 infection and replication in mother-infant isolates that are involved in perinatal transmission. PMID:9658150
The SAMI Galaxy Survey: spatially resolving the main sequence of star formation
NASA Astrophysics Data System (ADS)
Medling, Anne M.; Cortese, Luca; Croom, Scott M.; Green, Andrew W.; Groves, Brent; Hampton, Elise; Ho, I.-Ting; Davies, Luke J. M.; Kewley, Lisa J.; Moffett, Amanda J.; Schaefer, Adam L.; Taylor, Edward; Zafar, Tayyaba; Bekki, Kenji; Bland-Hawthorn, Joss; Bloom, Jessica V.; Brough, Sarah; Bryant, Julia J.; Catinella, Barbara; Cecil, Gerald; Colless, Matthew; Couch, Warrick J.; Drinkwater, Michael J.; Driver, Simon P.; Federrath, Christoph; Foster, Caroline; Goldstein, Gregory; Goodwin, Michael; Hopkins, Andrew; Lawrence, J. S.; Leslie, Sarah K.; Lewis, Geraint F.; Lorente, Nuria P. F.; Owers, Matt S.; McDermid, Richard; Richards, Samuel N.; Sharp, Robert; Scott, Nicholas; Sweet, Sarah M.; Taranu, Dan S.; Tescari, Edoardo; Tonini, Chiara; van de Sande, Jesse; Walcher, C. Jakob; Wright, Angus
2018-04-01
We present the ~800 star formation rate maps for the Sydney-AAO Multi-object Integral field spectrograph (SAMI) Galaxy Survey based on Hα emission maps, corrected for dust attenuation via the Balmer decrement, that are included in the SAMI Public Data Release 1. We mask out spaxels contaminated by non-stellar emission using the [O III]/Hβ, [N II]/Hα, [S II]/Hα, and [O I]/Hα line ratios. Using these maps, we examine the global and resolved star-forming main sequences of SAMI galaxies as a function of morphology, environmental density, and stellar mass. Galaxies further below the star-forming main sequence are more likely to have flatter star formation profiles. Early-type galaxies split into two populations with similar stellar masses and central stellar mass surface densities. The main-sequence population has centrally concentrated star formation similar to late-type galaxies, while galaxies >3σ below the main sequence show significantly reduced star formation most strikingly in the nuclear regions. The split populations support a two-step quenching mechanism, wherein halo mass first cuts off the gas supply and remaining gas continues to form stars until the local stellar mass surface density can stabilize the reduced remaining fuel against further star formation. Across all morphologies, galaxies in denser environments show a decreased specific star formation rate from the outside in, supporting an environmental cause for quenching, such as ram-pressure stripping or galaxy interactions.
Vieira, Leila do Nascimento; Dos Anjos, Karina Goulart; Faoro, Helisson; Fraga, Hugo Pacheco de Freitas; Greco, Thiago Machado; Pedrosa, Fábio de Oliveira; de Souza, Emanuel Maltempi; Rogalski, Marcelo; de Souza, Robson Francisco; Guerra, Miguel Pedro
2016-05-01
Complete plastome sequencing is an efficient option for increasing phylogenetic resolution and supporting evolutionary studies, and may greatly facilitate the use of plastid DNA markers in plant population genetic studies. Merostachys and Guadua stand out as the most common bamboos indigenous to Brazil and those with the highest potential for utilization. Here, we sequenced the complete plastomes of the Brazilian Guadua chacoensis and Merostachys sp. to perform full plastome phylogeny and characterize the occurrence, type, and distribution of SSRs using 20 Bambuseae species. The determined plastome sequences of Merostachys sp. and G. chacoensis are 136,334 and 135,403 bp in size, respectively, with an identical gene content and typical quadripartite structure consisting of a pair of IRs separated by the LSC and SSC regions. The Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of the Paleotropical and Neotropical bamboo clades. The Neotropical bamboos segregated into three well-supported lineages, Chusqueinae, Guaduinae, and Arthrostylidiinae, with the last two forming a well-supported sister relationship. Paleotropical bamboos segregated into two well-supported lineages, Hickeliinae and Bambusinae + Melocanninae. We identified 141.8 cpSSR in Bambuseae plastomes and a lower value (38.15) for plastome coding sequences. Among them, we identified 16 polymorphic SSR loci, with the number of alleles varying from 3 to 10. These 16 polymorphic cpSSR loci in Bambuseae plastomes can be assessed for the intraspecific level of polymorphism, leading to innovative, highly sensitive phylogeographic and population genetics studies for this tribe.
Jespersen, H M; Kjaersgård, I V; Ostergaard, L; Welinder, K G
1997-01-01
Ascorbate peroxidases are haem proteins that efficiently scavenge H2O2 in the cytosol and chloroplasts of plants. Database analyses retrieved 52 expressed sequence tags coding for Arabidopsis thaliana ascorbate peroxidases. Complete sequencing of non-redundant clones revealed three novel types in addition to the two cytosol types described previously in Arabidopsis. Analysis of sequence data available for all plant ascorbate peroxidases resulted in the following classification: two types of cytosol soluble ascorbate peroxidase designated cs1 and cs2; three types of cytosol membrane-bound ascorbate peroxidase, namely cm1, bound to microbodies via a C-terminal membrane-spanning segment, and cm2 and cm3, both of unknown location; two types of chloroplast ascorbate peroxidase with N-terminal transit sequences, the stromal ascorbate peroxidase (chs), and the thylakoid-bound ascorbate peroxidase showing a C-terminal transmembrane segment and designated cht. Further comparison of the patterns of conserved residues and the crystal structure of pea ascorbate peroxidase showed that active site residues are conserved, and three peptide segments implicated in interaction with reducing substrate are similar, excepting cm2 and cm3 types. A change of Phe-175 in cytosol types to Trp-175 in chloroplast types might explain the greater ascorbate specificity of chloroplast compared with cytosol ascorbate peroxidases. Residues involved in homodimeric subunit interaction are conserved only in cs1, cs2 and cm1 types. The proximal cation (K+)-binding site observed in pea ascorbate peroxidase seems to be conserved. In addition, cm1, cm2, cm3, chs and cht ascorbate peroxidases contain Asp-43, Asn-57 and Ser-59, indicative of a distal monovalent cation site. The data support the hypothesis that present-day peroxidases evolved by an early gene duplication event. PMID:9291097
Whole Genome Sequencing for Genomics-Guided Investigations of Escherichia coli O157:H7 Outbreaks
Rusconi, Brigida; Sanjar, Fatemeh; Koenig, Sara S. K.; Mammel, Mark K.; Tarr, Phillip I.; Eppinger, Mark
2016-01-01
Multi-isolate whole genome sequencing (WGS) and typing for outbreak investigations has become a reality in the post-genomics era. We applied this technology to strains from Escherichia coli O157:H7 outbreaks. These include isolates from seven North American outbreaks, as well as multiple isolates from the same patient and from different infected individuals in the same household. Customized high-resolution bioinformatics sequence typing strategies were developed to assess the core genome and mobilome plasticity. Sequence typing was performed using an in-house single nucleotide polymorphism (SNP) discovery and validation pipeline. Discriminatory power becomes of particular importance for the investigation of isolates from outbreaks in which macrogenomic techniques such as pulsed-field gel electrophoresis or multiple locus variable number tandem repeat analysis do not differentiate closely related organisms. We also characterized differences in the phage inventory, allowing us to identify plasticity among outbreak strains that is not detectable at the core genome level. Our comprehensive analysis of the mobilome identified multiple plasmids that have not previously been associated with this lineage. Applied phylogenomics approaches provide strong molecular evidence for exceptionally little heterogeneity of strains within outbreaks and demonstrate the value of intra-cluster comparisons, rather than basing the analysis on archetypal reference strains. Next generation sequencing and whole genome typing strategies provide the technological foundation for genomic epidemiology outbreak investigation, utilizing their significantly higher sample throughput, cost efficiency, and phylogenetic relatedness accuracy. These phylogenomics approaches have major public health relevance in translating information from the sequence-based survey to support timely and informed countermeasures.
Polymorphisms identified in this work offer robust phylogenetic signals that index both short- and long-term evolution and can complement currently employed typing schemes for outbreak ex- and inclusion, diagnostics, surveillance, and forensic studies. PMID:27446025
Explicit pre-training instruction does not improve implicit perceptual-motor sequence learning
Sanchez, Daniel J.; Reber, Paul J.
2012-01-01
Memory systems theory argues for separate neural systems supporting implicit and explicit memory in the human brain. Neuropsychological studies support this dissociation, but empirical studies of cognitively healthy participants generally observe that both kinds of memory are acquired to at least some extent, even in implicit learning tasks. A key question is whether this observation reflects parallel intact memory systems or an integrated representation of memory in healthy participants. Learning of complex tasks in which both explicit instruction and practice is used depends on both kinds of memory, and how these systems interact will be an important component of the learning process. Theories that posit an integrated, or single, memory system for both types of memory predict that explicit instruction should contribute directly to strengthening task knowledge. In contrast, if the two types of memory are independent and acquired in parallel, explicit knowledge should have no direct impact and may serve in a “scaffolding” role in complex learning. Using an implicit perceptual-motor sequence learning task, the effect of explicit pre-training instruction on skill learning and performance was assessed. Explicit pre-training instruction led to robust explicit knowledge, but sequence learning did not benefit from the contribution of pre-training sequence memorization. The lack of an instruction benefit suggests that during skill learning, implicit and explicit memory operate independently. While healthy participants will generally accrue parallel implicit and explicit knowledge in complex tasks, these types of information appear to be separately represented in the human brain consistent with multiple memory systems theory. PMID:23280147
Ağladıoğlu, Sebahat Yılmaz; Aycan, Zehra; Çetinkaya, Semra; Baş, Veysel Nijat; Önder, Aşan; Peltek Kendirci, Havva Nur; Doğan, Haldun; Ceylaner, Serdar
2016-04-01
Maturity-onset diabetes of the young (MODY) is a genetically and clinically heterogeneous group of diseases and is often misdiagnosed as type 1 or type 2 diabetes. The aim of this study is to investigate both novel and proven mutations of 11 MODY genes in Turkish children by using targeted next generation sequencing. A panel of 11 MODY genes was screened in 43 children with MODY diagnosed by clinical criteria. Studies of index cases were done with MISEQ-ILLUMINA, and family screenings and confirmation studies of mutations were done by Sanger sequencing. We identified 28 (65%) point mutations among 43 patients. Eighteen patients have GCK mutations, four have HNF1A, one has HNF4A, one has HNF1B, two have NEUROD1, one has PDX1 gene variations and one patient has both HNF1A and HNF4A heterozygote mutations. This is the first study including molecular studies of 11 MODY genes in Turkish children. GCK is the most frequent type of MODY in our study population. The very high frequency of novel mutations (42%) in our study population supports that, in heterogeneous disorders like MODY, sequence analysis provides rapid, cost-effective and accurate genetic diagnosis.
Multistage morphological segmentation of bright-field and fluorescent microscopy images
NASA Astrophysics Data System (ADS)
Korzyńska, A.; Iwanowski, M.
2012-06-01
This paper describes the multistage morphological segmentation method (MSMA) for microscopic cell images. The proposed method enables the study of cell behaviour using a sequence of two types of microscopic images: bright-field images and/or fluorescent images. It is based on two types of information: the cell texture from the bright-field images and the intensity of light emission produced by fluorescent markers. The method is dedicated to the segmentation of image sequences and is based on mathematical morphology methods supported by other image processing techniques. It allows cells to be detected in an image independently of their degree of flattening and of the presence of structures which produce the texture. It makes use of synergistic information from the fluorescent light emission image as supporting information. The MSMA method has been applied to images acquired during experiments on neural stem cells as well as to artificial images. In order to validate the method, two types of errors have been considered: the error of cell area detection and the error of cell position, using artificial images as the "gold standard".
The Arabidopsis lyrata genome sequence and the basis of rapid genome size change
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, Tina T.; Pattyn, Pedro; Bakker, Erica G.
2011-04-29
In our manuscript, we present a high-quality genome sequence of the Arabidopsis thaliana relative, Arabidopsis lyrata, produced by dideoxy sequencing. We have performed the usual types of genome analysis (gene annotation, dN/dS studies etc. etc.), but this is relegated to the Supporting Information. Instead, we focus on what was a major motivation for sequencing this genome, namely to understand how A. thaliana lost half its genome in a few million years and lived to tell the tale. The rather surprising conclusion is that there is not a single genomic feature that accounts for the reduced genome, but that every aspect (centromeres, intergenic regions, transposable elements, gene family number) is affected through hundreds of thousands of cuts. This strongly suggests that overall genome size in itself is what has been under selection, a suggestion that is strongly supported by our demonstration (using population genetics data from A. thaliana) that new deletions seem to be driven to fixation.
Integrative Genomics Viewer (IGV) | Informatics Technology for Cancer Research (ITCR)
The Integrative Genomics Viewer (IGV) is a high-performance visualization tool for interactive exploration of large, integrated genomic datasets. It supports a wide variety of data types, including array-based and next-generation sequence data, and genomic annotations.
Bromilow, Sophie; Gethings, Lee A; Buckley, Mike; Bromley, Mike; Shewry, Peter R; Langridge, James I; Clare Mills, E N
2017-06-23
The unique physicochemical properties of wheat gluten enable a diverse range of food products to be manufactured. However, gluten triggers coeliac disease, a condition which is treated using a gluten-free diet. Analytical methods are required to confirm if foods are gluten-free, but current immunoassay-based methods can be unreliable; proteomic methods offer an alternative but require comprehensive and well annotated sequence databases, which are lacking for gluten. A manually curated database (GluPro V1.0) of gluten proteins, comprising 630 discrete unique full length protein sequences, has been compiled. It is representative of the different types of gliadin and glutenin components found in gluten. An in silico comparison of their coeliac toxicity was undertaken by analysing the distribution of coeliac toxic motifs. This demonstrated that whilst the α-gliadin proteins contained more toxic motifs, these were distributed across all gluten protein sub-types. Comparison of annotations observed using a discovery proteomics dataset acquired using ion mobility MS/MS showed that more reliable identifications were obtained using the GluPro V1.0 database compared to the complete reviewed Viridiplantae database. This highlights the value of a curated sequence database specifically designed to support the proteomic workflows and the development of methods to detect and quantify gluten. We have constructed the first manually curated open-source wheat gluten protein sequence database (GluPro V1.0) in a FASTA format to support the application of proteomic methods for gluten protein detection and quantification. We have also analysed the manually verified sequences to give the first comprehensive overview of the distribution of sequences able to elicit a reaction in coeliac disease, the prevalent form of gluten intolerance.
Provision of this database will improve the reliability of gluten protein identification by proteomic analysis, and aid the development of targeted mass spectrometry methods in line with Codex Alimentarius Commission requirements for foods designed to meet the needs of gluten intolerant individuals. Copyright © 2017. Published by Elsevier B.V.
Development and evaluation of a multi-locus sequence typing scheme for Mycoplasma synoviae.
Dijkman, R; Feberwee, A; Landman, W J M
2016-08-01
Reproducible molecular Mycoplasma synoviae typing techniques with sufficient discriminatory power may help to expand knowledge on its epidemiology and contribute to the improvement of control and eradication programmes of this mycoplasma species. The present study describes the development and validation of a novel multi-locus sequence typing (MLST) scheme for M. synoviae. Thirteen M. synoviae isolates originating from different poultry categories, farms and lesions, were subjected to whole genome sequencing. Their sequences were compared to that of M. synoviae reference strain MS53. A high number of single nucleotide polymorphisms (SNPs) indicating considerable genetic diversity were identified. SNPs were present in over 40 putative target genes for MLST of which five target genes were selected (nanA, uvrA, lepA, ruvB and ugpA) for the MLST scheme. This scheme was evaluated analysing 209 M. synoviae samples from different countries, categories of poultry, farms and lesions. Eleven clonal clusters and 76 different sequence types (STs) were obtained. Clustering occurred following geographical origin, supporting the hypothesis of regional population evolution. M. synoviae samples obtained from epidemiologically linked outbreaks often harboured the same ST. In contrast, multiple M. synoviae lineages were found in samples originating from swollen joints or oviducts from hens that produce eggs with eggshell apex abnormalities indicating that further research is needed to identify the genetic factors of M. synoviae that may explain its variations in tissue tropism and disease inducing potential. Furthermore, MLST proved to have a higher discriminatory power compared to variable lipoprotein and haemagglutinin A typing, which generated 50 different genotypes on the same database.
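The way an MLST scheme such as the one above turns per-locus sequences into sequence types (STs) can be sketched as follows. This is only an illustration. The five locus names come from the abstract, but the function name, the input layout, and the numbering of alleles and STs by order of first appearance are assumptions; a real scheme takes its numbers from a curated allele database.

```python
from typing import Dict

# Loci selected for the M. synoviae scheme according to the abstract.
LOCI = ("nanA", "uvrA", "lepA", "ruvB", "ugpA")


def assign_sequence_types(isolates: Dict[str, Dict[str, str]]) -> Dict[str, int]:
    """Map each isolate name to a sequence type (ST) number.

    isolates maps an isolate name to a dict of locus -> nucleotide sequence.
    Allele and ST numbers are assigned in order of first appearance
    (an illustrative convention, not the numbering of any real database).
    """
    allele_ids = {locus: {} for locus in LOCI}  # per-locus: sequence -> allele number
    profile_ids = {}                            # allele profile tuple -> ST number
    sequence_types = {}
    for name, sequences in isolates.items():
        profile = []
        for locus in LOCI:
            table = allele_ids[locus]
            seq = sequences[locus].upper()
            if seq not in table:
                table[seq] = len(table) + 1     # new allele at this locus
            profile.append(table[seq])
        key = tuple(profile)
        if key not in profile_ids:
            profile_ids[key] = len(profile_ids) + 1  # new combination -> new ST
        sequence_types[name] = profile_ids[key]
    return sequence_types
```

Two isolates share an ST only if they carry identical alleles at all five loci, so a single nucleotide difference at one locus is enough to produce a new ST.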
Verma, Pankaj; Pandey, Prashant Kumar; Gupta, Arvind Kumar; Seong, Chi Nam; Park, Seong Chan; Choe, Han Na; Baik, Keun Sik; Patole, Milind Shivaji; Shouche, Yogesh Shreepad
2012-10-01
We have carried out a polyphasic taxonomic characterization of Bacillus beijingensis DSM 19037(T) and Bacillus ginsengi DSM 19038(T), which are closely related phylogenetically to Bhargavaea cecembensis LMG 24411(T). All three strains are Gram-stain-positive, non-motile, moderately halotolerant and non-spore-forming. 16S rRNA gene sequence analyses showed that the strains constituted a coherent cluster, with sequence similarities between 99.7 and 98.7 %. The percentage similarity on the basis of amino acid sequences deduced from partial gyrB gene nucleotide sequences of these three type strains was 96.1-92.7 %. Phylogenetic trees based on the 16S rRNA gene and GyrB amino acid sequences, obtained by using three different algorithms, were consistent and showed that these three species constituted a deeply rooted cluster separated from the clades represented by the genera Bacillus, Planococcus, Planomicrobium, Sporosarcina, Lysinibacillus, Viridibacillus, Kurthia and Geobacillus, supporting their placement in the genus Bhargavaea. All three type strains have menaquinone MK-8 as the major respiratory quinone and showed similar fatty acid profiles. The main polar lipids present in the three type strains were diphosphatidylglycerol and phosphatidylglycerol, and the three strains showed peptidoglycan type A4α with L-lysine as the diagnostic diamino acid. The DNA G+C contents of Bacillus beijingensis DSM 19037(T), Bacillus ginsengi DSM 19038(T) and Bhargavaea cecembensis LMG 24411(T) were 53.1, 50.2 and 53.7 mol%, respectively. The level of DNA-DNA hybridization among the three strains was 57-39 %, indicating that they are members of different species of the genus Bhargavaea. The phenotypic data are consistent with the placement of these three species in a single genus and support their differentiation at the species level. 
On the basis of these data, we have emended the description of the genus Bhargavaea and propose the reclassification of Bacillus beijingensis and Bacillus ginsengi to the genus Bhargavaea, as Bhargavaea beijingensis comb. nov. (type strain ge10(T) = DSM 19037(T) = CGMCC 1.6762(T)) and Bhargavaea ginsengi comb. nov. (type strain ge14(T) = DSM 19038(T) = CGMCC 1.6763(T)).
Han, C; Dai, S F; Liu, D C; Pu, Z J; Wei, Y M; Zheng, Y L; Wen, D J; Zhao, L; Yan, Z H
2013-11-18
Previous genetic studies on wheat from various sources have indicated that aluminum (Al) tolerance may have originated independently in USA, Brazil, and China. Here, TaALMT1 promoter sequences of 92 landraces and cultivars from Sichuan, China, were sequenced. Five promoter types (I', II, III, IV, and V) were observed in 39 cultivars, and only three promoter types (I, II, and III) were observed in 53 landraces. Among the wheat collections worldwide, only the Chinese Spring (CS) landrace native to Sichuan, China, carried the TaALMT1 promoter type III. Besides CS, two other Sichuan-bred landraces and six cultivars with TaALMT1 promoter type III were identified in this study. In the phylogenetic tree constructed based on the TaALMT1 promoter sequences, type III formed a separate branch, which was supported by a high bootstrap value. It is likely that TaALMT1 promoter type III originated from Sichuan-bred wheat landraces of China. In addition, the landraces with promoter type I showed the lowest Al tolerance among all landraces and cultivars. Furthermore, the cultivars with promoter type IV showed better Al tolerance than landraces with promoter type II. A comparison of acid tolerance and Al tolerance between cultivars and landraces showed that the landraces had better acid tolerance than the cultivars, whereas the cultivars showed better Al tolerance than the landraces. Moreover, significant difference in Al tolerance was also observed between the cultivars raised by the National Ministry of Agriculture and by Sichuan Province. Among the landraces from different regions, those from the East showed better acid tolerance and Al tolerance than those from the South and West of Sichuan. Additional Al-tolerant and acid-tolerant wheat lines were also identified.
Boo, Ga Hun; Hughey, Jeffery R.; Miller, Kathy Ann; Boo, Sung Min
2016-10-14
DNA sequences from type specimens provide independent, objective characters that enhance the value of type specimens and permit the correct application of species names to phylogenetic clades and specimens. We provide mitochondrial genomes (mitogenomes) from archival type specimens of ten species in agar-producing red algal genera Gelidium and Pterocladiella. The genomes contain 43–44 genes, ranging in size from 24,910 to 24,970 bp with highly conserved gene synteny. Low Ka/Ks ratios of apocytochrome b and cytochrome oxidase genes support their utility as markers. Phylogenies of mitogenomes and cox1+rbcL sequences clarified classification at the genus and species levels. Three species formerly in Gelidium and Pterocladia are transferred to Pterocladiella: P. media comb. nov., P. musciformis comb. nov., and P. luxurians comb. and stat. nov. Gelidium sinicola is merged with G. coulteri because they share identical cox1 and rbcL sequences. We describe a new species, Gelidium millariana sp. nov., previously identified as G. isabelae from Australia. We demonstrate that mitogenomes from type specimens provide a new tool for typifying species in the Gelidiales and that there is an urgent need for analyzing mitogenomes from type specimens of red algae and other morphologically simple organisms for insight into their nomenclature, taxonomy and evolution. PMID:27739454
Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M
2017-01-01
Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed that the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of spliceosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
Evolution of early life inferred from protein and ribonucleic acid sequences
NASA Technical Reports Server (NTRS)
Dayhoff, M. O.; Schwartz, R. M.
1978-01-01
The chemical structures of ferredoxin, 5S ribosomal RNA, and c-type cytochrome sequences have been employed to construct a phylogenetic tree which connects all major photosynthesizing organisms: the three types of bacteria, blue-green algae, and chloroplasts. Anaerobic and aerobic bacteria, eukaryotic cytoplasmic components and mitochondria are also included in the phylogenetic tree. Anaerobic nonphotosynthesizing bacteria similar to Clostridium were the earliest organisms, arising more than 3.2 billion years ago. Bacterial photosynthesis evolved nearly 3.0 billion years ago, while oxygen-evolving photosynthesis, originating in the blue-green algal line, came into being about 2.0 billion years ago. The phylogenetic tree supports the symbiotic theory of the origin of eukaryotes.
Phonological and Semantic Cues to Learning from Word-Types
Richtsmeier, Peter
2017-01-01
Word-types represent the primary form of data for many models of phonological learning, and they often predict performance in psycholinguistic tasks. Word-types are often tacitly defined as phonologically unique words. Yet, an explicit test of this definition is lacking, and natural language patterning suggests that word meaning could also act as a cue to word-type status. This possibility was tested in a statistical phonotactic learning experiment in which phonological and semantic properties of word-types varied. During familiarization, the learning targets—word-medial consonant sequences—were instantiated either by four related word-types or by just one word-type (the experimental frequency factor). The expectation was that more word-types would lead participants to generalize the target sequences. Regarding semantic cues, related word-types were either associated with different referents or all with a single referent. Regarding phonological cues, related word-types differed from each other by one, two, or more phonemes. At test, participants rated novel wordforms for their similarity to the familiarization words. When participants heard four related word-types, they gave higher ratings to test words with the same consonant sequences, irrespective of the phonological and semantic manipulations. The results support the existing phonological definition of word-types. PMID:29187914
Porter, Teresita M; Gibson, Joel F; Shokralla, Shadi; Baird, Donald J; Golding, G Brian; Hajibabaei, Mehrdad
2014-01-01
Current methods to identify unknown insect (class Insecta) cytochrome c oxidase (COI barcode) sequences often rely on thresholds of distances that can be difficult to define, sequence similarity cut-offs, or monophyly. Some of the most commonly used metagenomic classification methods do not provide a measure of confidence for the taxonomic assignments they provide. The aim of this study was to use a naïve Bayesian classifier (Wang et al. Applied and Environmental Microbiology, 2007; 73: 5261) to automate taxonomic assignments for large batches of insect COI sequences such as data obtained from high-throughput environmental sequencing. This method provides rank-flexible taxonomic assignments with an associated bootstrap support value, and it is faster than the blast-based methods commonly used in environmental sequence surveys. We have developed and rigorously tested the performance of three different training sets using leave-one-out cross-validation, two field data sets, and targeted testing of Lepidoptera, Diptera and Mantodea sequences obtained from the Barcode of Life Data system. We found that type I error rates, incorrect taxonomic assignments with a high bootstrap support, were already relatively low but could be lowered further by ensuring that all query taxa are actually present in the reference database. Choosing bootstrap support cut-offs according to query length and summarizing taxonomic assignments to more inclusive ranks can also help to reduce error while retaining the maximum number of assignments. Additionally, we highlight gaps in the taxonomic and geographic representation of insects in public sequence databases that will require further work by taxonomists to improve the quality of assignments generated using any method.
NASA Astrophysics Data System (ADS)
Castro, E.
2018-02-01
From the perturbative expansion of the exact Green function, an exact counting formula is derived to determine the number of different types of connected Feynman diagrams. This formula coincides with the Arquès-Walsh sequence formula in the rooted map theory, supporting the topological connection between Feynman diagrams and rooted maps. A classificatory summing-terms approach is used, in connection to discrete mathematical theory.
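As a hedged illustration of the kind of counting described in the abstract above, the sketch below evaluates a standard recurrence for the integer sequence 1, 2, 10, 74, 706, ... It is an assumption, not something stated in the abstract, that this is the sequence the paper's counting formula reproduces, and the recurrence is not necessarily the form derived there. The function names are hypothetical.

```python
def odd_double_factorial(n):
    """Return (2n-1)!! = 1 * 3 * 5 * ... * (2n-1) for n >= 1."""
    result = 1
    for k in range(1, 2 * n, 2):
        result *= k
    return result


def counting_sequence(max_n):
    """Return [a(1), ..., a(max_n)] for the recurrence
    a(n) = (2n-1)!! - sum_{k=1}^{n-1} (2k-1)!! * a(n-k).

    The first term counts all pairings of 2n points; the sum removes
    those that split into an initial block followed by a remainder.
    """
    a = {}
    for n in range(1, max_n + 1):
        reducible = sum(odd_double_factorial(k) * a[n - k] for k in range(1, n))
        a[n] = odd_double_factorial(n) - reducible
    return [a[n] for n in range(1, max_n + 1)]
```

For example, `counting_sequence(5)` evaluates to `[1, 2, 10, 74, 706]`; how these terms are indexed against perturbation order is left open here.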
Resolving the Quantitative-Qualitative Dilemma: A Critical Realist Approach
ERIC Educational Resources Information Center
Scott, David
2007-01-01
The philosophical issues underpinning the quantitative-qualitative divide in educational research are examined. Three types of argument which support a resolution are considered: pragmatism, false duality and warranty through triangulation. In addition a number of proposed strategies--alignment, sequencing, translation and triangulation--are…
A Single Multilocus Sequence Typing (MLST) Scheme for Seven Pathogenic Leptospira Species
Amornchai, Premjit; Wuthiekanun, Vanaporn; Bailey, Mark S.; Holden, Matthew T. G.; Zhang, Cuicai; Jiang, Xiugao; Koizumi, Nobuo; Taylor, Kyle; Galloway, Renee; Hoffmaster, Alex R.; Craig, Scott; Smythe, Lee D.; Hartskeerl, Rudy A.; Day, Nicholas P.; Chantratita, Narisara; Feil, Edward J.; Aanensen, David M.; Spratt, Brian G.; Peacock, Sharon J.
2013-01-01
Background: The available Leptospira multilocus sequence typing (MLST) scheme supported by a MLST website is limited to L. interrogans and L. kirschneri. Our aim was to broaden the utility of this scheme to incorporate a total of seven pathogenic species. Methodology and Findings: We modified the existing scheme by replacing one of the seven MLST loci (fadD was changed to caiB), as the former gene did not appear to be present in some pathogenic species. Comparison of the original and modified schemes using data for L. interrogans and L. kirschneri demonstrated that the discriminatory power of the two schemes was not significantly different. The modified scheme was used to further characterize 325 isolates (L. alexanderi [n = 5], L. borgpetersenii [n = 34], L. interrogans [n = 222], L. kirschneri [n = 29], L. noguchii [n = 9], L. santarosai [n = 10], and L. weilii [n = 16]). Phylogenetic analysis using concatenated sequences of the 7 loci demonstrated that each species corresponded to a discrete clade, and that no strains were misclassified at the species level. Comparison between genotype and serovar was possible for 254 isolates. Of the 31 sequence types (STs) represented by at least two isolates, 18 STs included isolates assigned to two or three different serovars. Conversely, 14 serovars were identified that contained between 2 and 10 different STs. New observations were made on the global phylogeography of Leptospira spp., and the utility of MLST in making associations between human disease and specific maintenance hosts was demonstrated. Conclusion: The new MLST scheme, supported by an updated MLST website, allows the characterization and species assignment of isolates of the seven major pathogenic species associated with leptospirosis. PMID:23359622
Jakava-Viljanen, Miia; Nokireki, Tiina; Sironen, Tarja; Vapalahti, Olli; Sihvonen, Liisa; Huovilainen, Anita
2015-06-01
Among other lyssaviruses, Daubenton's and pond-bat-related European bat lyssavirus type 2 (EBLV-2) can cause human rabies. To investigate the diversity and evolutionary trends of EBLV-2, complete genome sequences of two Finnish isolates were analysed. One originated from a human case in 1985, and the other originated from a bat in 2009. The overall nucleotide and deduced amino acid sequence identity of the two Finnish isolates were high, as well as the similarity to fully sequenced EBLV-2 strains originating from the UK and the Netherlands. In phylogenetic analysis, the EBLV-2 strains formed a monophyletic group that was separate from other bat-type lyssaviruses, with significant support. EBLV-2 shared the most recent common ancestry with Bokeloh bat lyssavirus (BBLV) and Khujand virus (KHUV). EBLV-2 showed limited diversity compared to RABV and appears to be well adapted to its host bat species. The slow tempo of viral evolution was evident in the estimations of divergence times for EBLV-2: the current diversity was estimated to have built up during the last 2000 years, and EBLV-2 diverged from KHUV about 8000 years ago. In a phylogenetic tree of partial N gene sequences, the Finnish EBLV-2 strains clustered with strains from Central Europe, supporting the hypothesis that EBLV-2 circulating in Finland might have a Central European origin. The Finnish EBLV-2 strains and a Swiss strain were estimated to have diverged from other EBLV-2 strains during the last 1000 years, and the two Finnish strains appear to have evolved from a common ancestor during the last 200 years.
Domain architecture conservation in orthologs
2011-01-01
Background As orthologous proteins are expected to retain function more often than other homologs, they are often used for functional annotation transfer between species. However, ortholog identification methods do not take into account changes in domain architecture, which are likely to modify a protein's function. By domain architecture we refer to the sequential arrangement of domains along a protein sequence. To assess the level of domain architecture conservation among orthologs, we carried out a large-scale study of such events between human and 40 other species spanning the entire evolutionary range. We designed a score to measure domain architecture similarity and used it to analyze differences in domain architecture conservation between orthologs and paralogs relative to the conservation of primary sequence. We also statistically characterized the extents of different types of domain swapping events across pairs of orthologs and paralogs. Results The analysis shows that orthologs exhibit greater domain architecture conservation than paralogous homologs, even when differences in average sequence divergence are compensated for, for homologs that have diverged beyond a certain threshold. We interpret this as an indication of a stronger selective pressure on orthologs than paralogs to retain the domain architecture required for the proteins to perform a specific function. In general, orthologs as well as the closest paralogous homologs have very similar domain architectures, even at large evolutionary separation. The most common domain architecture changes observed in both ortholog and paralog pairs involved insertion/deletion of new domains, while domain shuffling and segment duplication/deletion were very infrequent. Conclusions On the whole, our results support the hypothesis that function conservation between orthologs demands higher domain architecture conservation than other types of homologs, relative to primary sequence conservation. 
This supports the notion that orthologs are functionally more similar than other types of homologs at the same evolutionary distance. PMID:21819573
Kudlai, Olena; Tkach, Vasyl V; Pulis, Eric E; Kostadinova, Aneta
2015-01-01
Euparyphium capitaneum Dietz, 1909, the type-species of the genus Euparyphium Dietz, 1909, is described on the basis of material collected from the type-host Anhinga anhinga (L.) from Pascagoula River, which drains into the northern coast of the Gulf of Mexico. Combination of light and scanning electron microscopy observations of freshly collected and properly fixed specimens in our study has allowed us to provide novel information on the morphology and topology of the reproductive systems and other morphological features of the species. A Bayesian inference analysis based on the newly-obtained partial sequence of the nuclear 28S rRNA gene for E. capitaneum and 24 previously published sequences from the superfamily Echinostomatoidea Looss, 1899 provided evidence supporting the distinct status of the genera Euparyphium and Isthmiophora Lühe, 1909.
Zhang, Huibin; Artiles, Karen L.; Fire, Andrew Z.
2015-01-01
The founding heterochronic microRNAs, lin-4 and let-7, together with their validated targets and well-characterized phenotypes in C. elegans, offer an opportunity to test functionality of microRNAs in a developmental context. In this study, we defined sequence requirements at the microRNA level for these two microRNAs, evaluating lin-4 and let-7 mutant microRNAs for their ability to support temporal development under conditions where the wild-type lin-4 and let-7 gene products are absent. For lin-4, we found a strong requirement for seed sequences, with function drastically affected by several central mutations in the seed sequence, while rescue was retained by a set of mutations peripheral to the seed. let-7 rescuing activity was retained to a surprising degree by a variety of central seed mutations, while several non-seed mutant effects support potential noncanonical contributions to let-7 function. Taken together, this work illustrates both the functional partnership between seed and non-seed sequences in mediating C. elegans temporal development and a diversity among microRNA effectors in the contributions of seed and non-seed regions to activity. PMID:26385508
NASA Technical Reports Server (NTRS)
Venkateswaran, Kasthuri; Kempf, Michael; Chen, Fei; Satomi, Masataka; Nicholson, Wayne; Kern, Roger
2003-01-01
One of the spore-formers isolated from a spacecraft-assembly facility, belonging to the genus Bacillus, is described on the basis of phenotypic characterization, 16S rDNA sequence analysis and DNA-DNA hybridization studies. It is a Gram-positive, facultatively anaerobic, rod-shaped eubacterium that produces endospores. The spores of this novel bacterial species exhibited resistance to UV, gamma-radiation, H2O2 and desiccation. The 16S rDNA sequence analysis revealed a clear affiliation between this strain and members of the low G+C Firmicutes. High 16S rDNA sequence similarity values were found with members of the genus Bacillus and this was supported by fatty acid profiles. The 16S rDNA sequence similarity between strain FO-92T and Bacillus benzoevorans DSM 5391T was very high. However, molecular characterizations employing small-subunit 16S rDNA sequences were at the limits of resolution for the differentiation of species in this genus, but DNA-DNA hybridization data support the proposal of FO-92T as Bacillus nealsonii sp. nov. (type strain is FO-92T =ATCC BAAM-519T =DSM 15077T).
Yu, Yao; Hu, Hao; Bohlender, Ryan J; Hu, Fulan; Chen, Jiun-Sheng; Holt, Carson; Fowler, Jerry; Guthery, Stephen L; Scheet, Paul; Hildebrandt, Michelle A T; Yandell, Mark; Huff, Chad D
2018-04-06
High-throughput sequencing data are increasingly being made available to the research community for secondary analyses, providing new opportunities for large-scale association studies. However, heterogeneity in target capture and sequencing technologies often introduces strong technological stratification biases that overwhelm subtle signals of association in studies of complex traits. Here, we introduce the Cross-Platform Association Toolkit, XPAT, which provides a suite of tools designed to support and conduct large-scale association studies with heterogeneous sequencing datasets. XPAT includes tools to support cross-platform aware variant calling, quality control filtering, gene-based association testing and rare variant effect size estimation. To evaluate the performance of XPAT, we conducted case-control association studies for three diseases, including 783 breast cancer cases, 272 ovarian cancer cases, 205 Crohn disease cases and 3507 shared controls (including 1722 females) using sequencing data from multiple sources. XPAT greatly reduced Type I error inflation in the case-control analyses, while replicating many previously identified disease-gene associations. We also show that association tests conducted with XPAT using cross-platform data have comparable performance to tests using matched platform data. XPAT enables new association studies that combine existing sequencing datasets to identify genetic loci associated with common diseases and other complex traits.
Orlek, Alex; Phan, Hang; Sheppard, Anna E; Doumith, Michel; Ellington, Matthew; Peto, Tim; Crook, Derrick; Walker, A Sarah; Woodford, Neil; Anjum, Muna F; Stoesser, Nicole
2017-05-01
Plasmid typing can provide insights into the epidemiology and transmission of plasmid-mediated antibiotic resistance. The principal plasmid typing schemes are replicon typing and MOB typing, which utilize variation in replication loci and relaxase proteins respectively. Previous studies investigating the proportion of plasmids assigned a type by these schemes ('typeability') have yielded conflicting results; moreover, thousands of plasmid sequences have been added to NCBI in recent years, without consistent annotation to indicate which sequences represent complete plasmids. Here, a curated dataset of complete Enterobacteriaceae plasmids from NCBI was compiled, and used to assess the typeability and concordance of in silico replicon and MOB typing schemes. Concordance was assessed at hierarchical replicon type resolutions, from replicon family-level to plasmid multilocus sequence type (pMLST)-level, where available. We found that 85% and 65% of the curated plasmids could be replicon and MOB typed, respectively. Overall, plasmid size and the number of resistance genes were significant independent predictors of replicon and MOB typing success. We found some degree of non-concordance between replicon families and MOB types, which was only partly resolved when partitioning plasmids into finer-resolution groups (replicon and pMLST types). In some cases, non-concordance was attributed to ambiguous boundaries between MOBP and MOBQ types; in other cases, backbone mosaicism was considered a more plausible explanation. β-lactamase resistance genes tended not to show fidelity to a particular plasmid type, though some previously reported associations were supported. Overall, replicon and MOB typing schemes are likely to continue playing an important role in plasmid analysis, but their performance is constrained by the diverse and dynamic nature of plasmid genomes. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
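The notion of typeability used in this abstract (the proportion of plasmids that a scheme assigns a type) can be written down in a few lines. The following is a minimal sketch under stated assumptions: the plasmid identifiers and their type assignments are invented for illustration, and an untyped plasmid is represented as None.

```python
def typeability(assignments):
    """Fraction of plasmids that received a type (None means the scheme gave no type)."""
    if not assignments:
        raise ValueError("no plasmids supplied")
    typed = sum(1 for plasmid_type in assignments.values() if plasmid_type is not None)
    return typed / len(assignments)


# Hypothetical assignments for four plasmids under each scheme.
replicon_types = {"p1": "IncF", "p2": None, "p3": "IncI1", "p4": "IncN"}
mob_types = {"p1": "MOBF", "p2": None, "p3": "MOBP", "p4": None}

replicon_typeability = typeability(replicon_types)  # 3 of 4 plasmids typed
mob_typeability = typeability(mob_types)            # 2 of 4 plasmids typed
```

Applied to the curated dataset described above, the same calculation would yield the 85% and 65% figures reported for replicon and MOB typing.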
Phylogeny of Eleusine (Poaceae: Chloridoideae) based on nuclear ITS and plastid trnT-trnF sequences.
Neves, Susana S; Swire-Clark, Ginger; Hilu, Khidir W; Baird, Wm Vance
2005-05-01
Phylogenetic relationships in the genus Eleusine (Poaceae: Chloridoideae) were investigated using nuclear ITS and plastid trnT-trnF sequences. Separate and combined data sets were analyzed using parsimony, distance, and likelihood based methods, including Bayesian. Data congruence was examined using character and topological measures. Significant data heterogeneity was detected, but there was little conflict in the topological substructure measures for triplets and quartets, and resolution and clade support increased in the combined analysis. Data incongruence may be a result of noise and insufficient information in the slower evolving trnT-trnF. Monophyly of Eleusine is strongly supported in all analyses, but basal relationships in the genus remain uncertain. There is good support for a CAIK clade (E. coracana subsp. coracana and africana, E. indica, and E. kigeziensis), with E. tristachya as its sister group. Two putative ITS homeologues (A and B loci) were identified in the allotetraploid E. coracana; the 'B' locus sequence type was not found in the remaining species. Eleusine coracana and its putative 'A' genome donor, the diploid E. indica, are confirmed close allies, but sequence data contradict the hypothesis that E. floccifolia is its second genome donor. The 'B' genome donor remains unidentified and may be extinct.
Halbedel, Sven; Prager, Rita; Fuchs, Stephan; Trost, Eva; Werner, Guido; Flieger, Antje
2018-06-01
Listeria monocytogenes causes foodborne outbreaks with high mortality. For improvement of outbreak cluster detection, the German consiliary laboratory for listeriosis implemented whole-genome sequencing (WGS) in 2015. A total of 424 human L. monocytogenes isolates collected in 2007 to 2017 were subjected to WGS and core-genome multilocus sequence typing (cgMLST). cgMLST grouped the isolates into 38 complexes, reflecting 4 known and 34 unknown disease clusters. Most of these complexes were confirmed by single nucleotide polymorphism (SNP) calling, but some were further differentiated. Interestingly, several cgMLST cluster types were further subtyped by pulsed-field gel electrophoresis, partly due to phage insertions in the accessory genome. Our results highlight the usefulness of cgMLST for routine cluster detection but also show that cgMLST complexes require validation by methods providing higher typing resolution. Twelve cgMLST clusters included recent cases, suggesting activity of the source. Therefore, the cgMLST nomenclature data presented here may support future public health actions. Copyright © 2018 American Society for Microbiology.
Epicuticular waxes and thrips resistance in onion
USDA-ARS's Scientific Manuscript database
Next-generation sequencing of normalized cDNAs from two inbred lines of onion revealed over 3000 well supported single nucleotide polymorphisms (SNPs), of which over 800 have been mapped. This SNP-based map was used to identify quantitative trait loci (QTL) controlling the amounts and types of epicu...
Flipping the Composing Process: Collaborative Drafting and Résumé Writing
ERIC Educational Resources Information Center
Anders, Abram
2016-01-01
This article argues for a flipped learning approach to business and professional communication composing processes. Flipped learning sequences can scaffold more robust engagement with prewriting activities and support opportunities for in-class collaborative and facilitated drafting exercises. These types of learning experiences offer numerous…
A Bioinformatics Classifier and Database for Heme-Copper Oxygen Reductases
Sousa, Filipa L.; Alves, Renato J.; Pereira-Leal, José B.; Teixeira, Miguel; Pereira, Manuela M.
2011-01-01
Background Heme-copper oxygen reductases (HCOs) are the last enzymatic complexes of most aerobic respiratory chains, reducing dioxygen to water and translocating up to four protons across the inner mitochondrial membrane (eukaryotes) or cytoplasmatic membrane (prokaryotes). The number of completely sequenced genomes is expanding exponentially, and concomitantly, the number and taxonomic distribution of HCO sequences. These enzymes were initially classified into three different types, but this classification has recently been challenged. Methodology We reanalyzed the classification scheme and developed a new bioinformatics classifier for the HCO and Nitric oxide reductases (NOR), which we benchmark against a manually derived gold standard sequence set. It is able to classify any given sequence of subunit I from HCO and NOR with a global recall and precision both of 99.8%. We use this tool to classify this protein family in 552 completely sequenced genomes. Conclusions We concluded that the new and broader data set supports three functional and evolutionary groups of HCOs. Homology between NORs and HCOs is shown and the closest relationship of NORs with C Type HCOs is demonstrated. We established and made available a classification web tool and an integrated Heme-Copper Oxygen reductase and NOR protein database (www.evocell.org/hco). PMID:21559461
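Recall and precision against a gold standard, as reported for this classifier, can be computed as in the sketch below. This is a minimal illustration, not the authors' benchmark: the class labels and the predicted and gold lists are invented, and the function scores one class at a time, whereas the abstract reports global figures whose averaging method is not stated.

```python
def precision_recall(predicted, gold, label):
    """Precision and recall for one class label, given parallel lists of calls."""
    pairs = list(zip(predicted, gold))
    true_pos = sum(1 for p, g in pairs if p == label and g == label)
    false_pos = sum(1 for p, g in pairs if p == label and g != label)
    false_neg = sum(1 for p, g in pairs if p != label and g == label)
    precision = true_pos / (true_pos + false_pos) if true_pos + false_pos else 0.0
    recall = true_pos / (true_pos + false_neg) if true_pos + false_neg else 0.0
    return precision, recall


# Hypothetical gold-standard labels and classifier calls for five sequences.
gold = ["A", "A", "B", "C", "NOR"]
predicted = ["A", "B", "B", "C", "NOR"]

precision_a, recall_a = precision_recall(predicted, gold, "A")
precision_b, recall_b = precision_recall(predicted, gold, "B")
```

In this toy data one type A sequence is miscalled as type B, which lowers recall for A and precision for B while leaving the other classes unaffected.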
Lindahl, Susanne; Söderlund, Robert; Frosth, Sara; Pringle, John; Båverud, Viveca; Aspán, Anna
2011-11-21
Strangles is a serious respiratory disease in horses caused by Streptococcus equi subspecies equi (S. equi). Transmission of the disease occurs by direct contact with an infected horse or contaminated equipment. Genetically, S. equi strains are highly homogenous and differentiation of strains has proven difficult. However, the S. equi M-protein SeM contains a variable N-terminal region and has been proposed as a target gene to distinguish between different strains of S. equi and determine the source of an outbreak. In this study, strains of S. equi (n=60) from 32 strangles outbreaks in Sweden during 1998-2003 and 2008-2009 were genetically characterized by sequencing the SeM protein gene (seM), and by pulsed-field gel electrophoresis (PFGE). Swedish strains belonged to 10 different seM types, of which five have not previously been described. Most were identical or highly similar to allele types from strangles outbreaks in the UK. Outbreaks in 2008/2009 sharing the same seM type were associated by geographic location and/or type of usage of the horses (racing stables). Sequencing of the seM gene generally agreed with pulsed-field gel electrophoresis profiles. Our data suggest that seM sequencing as an epidemiological tool is supported by the agreement between seM and PFGE and that sequencing of the SeM protein gene is more sensitive than PFGE in discriminating strains of S. equi. Copyright © 2011 Elsevier B.V. All rights reserved.
SeqRate: sequence-based protein folding type classification and rates prediction
2010-01-01
Background Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic nature (two-state folding or multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using sequence length, amino acid composition, contact order, contact number, and secondary structure information predicted from only protein sequence with support vector machines. Results We systematically studied the contributions of individual features to folding rate prediction. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec-1) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Its performance can be further enhanced with additional information, such as structure-based geometric contacts, as inputs. Conclusions Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html. PMID:20438647
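The two evaluation measures quoted in this abstract, the Pearson correlation coefficient and the mean absolute difference on a base-10 logarithmic scale, can be computed as in the following sketch. The rate values are invented for illustration and are not from the SeqRate benchmark; the helper names are likewise assumptions.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def mean_abs_diff(x, y):
    """Mean absolute difference between paired values."""
    return sum(abs(a - b) for a, b in zip(x, y)) / len(x)


# Hypothetical folding rates in sec^-1 (not real measurements).
experimental = [10.0, 100.0, 1000.0]
predicted = [100.0, 1000.0, 10000.0]

log_experimental = [math.log10(rate) for rate in experimental]
log_predicted = [math.log10(rate) for rate in predicted]

correlation = pearson(log_predicted, log_experimental)
abs_difference = mean_abs_diff(log_predicted, log_experimental)
```

Here every prediction is exactly tenfold too high, so the correlation is essentially 1 while the mean absolute difference is one log unit, which shows why the abstract reports both measures.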
Leaché, Adam D.; Chavez, Andreas S.; Jones, Leonard N.; Grummer, Jared A.; Gottscho, Andrew D.; Linkem, Charles W.
2015-01-01
Sequence capture and restriction site associated DNA sequencing (RADseq) are popular methods for obtaining large numbers of loci for phylogenetic analysis. These methods are typically used to collect data at different evolutionary timescales; sequence capture is primarily used for obtaining conserved loci, whereas RADseq is designed for discovering single nucleotide polymorphisms (SNPs) suitable for population genetic or phylogeographic analyses. Phylogenetic questions that span both “recent” and “deep” timescales could benefit from either type of data, but studies that directly compare the two approaches are lacking. We compared phylogenies estimated from sequence capture and double digest RADseq (ddRADseq) data for North American phrynosomatid lizards, a species-rich and diverse group containing nine genera that began diversifying approximately 55 Ma. Sequence capture resulted in 584 loci that provided a consistent and strong phylogeny using concatenation and species tree inference. However, the phylogeny estimated from the ddRADseq data was sensitive to the bioinformatics steps used for determining homology, detecting paralogs, and filtering missing data. The topological conflicts among the SNP trees were not restricted to any particular timescale, but instead were associated with short internal branches. Species tree analysis of the largest SNP assembly, which also included the most missing data, supported a topology that matched the sequence capture tree. This preferred phylogeny provides strong support for the paraphyly of the earless lizard genera Holbrookia and Cophosaurus, suggesting that the earless morphology either evolved twice or evolved once and was subsequently lost in Callisaurus. PMID:25663487
Yoshitomi, Hideaki; Sera, Nobuyuki; Gonzalez, Gabriel; Hanaoka, Nozomu; Fujimoto, Tsuguto
2017-07-01
Human mastadenoviruses (HAdVs) are highly infectious viral pathogens that survive for prolonged periods in environmental waters. We monitored the presence of HAdVs in sewage waters between April 2014 and March 2015. A total of 27 adenoviral strains were detected in 18 of the 24 wastewater samples collected (75% of sampling occasions). We identified the types of the strains as HAdV-C2 (n = 5), HAdV-A31 (5), HAdV-C1 (4), HAdV-B3 (4), HAdV-C5 (4), HAdV-B11 (2), P11H34F11 (2), and HAdV-D56 (1). The complete genome sequence of one P11H34F11 (strain T150125) was determined by next-generation sequencing and compared to other genome sequences of HAdV-B strains. The comparisons revealed evidence of a recombination event with a breakpoint in the hexon-encoding region, which showed high similarity to HAdV-B34, while half of the rest of the genome showed similarity to HAdV-B11, including the regions encoding the fiber and E3 proteins. The penton base encoding region appeared to be a recombinant product of HAdV-B14 and HAdV-B34; however, it was divergent from both, consistent with a novel type, although bootstrap support for a new clade was low. We propose T150125 (P11H34F11) is a strain of a novel genotype, HAdV-79. These results support the usefulness of environmental surveillance approaches to monitor circulating HAdVs including novel types. © 2016 Wiley Periodicals, Inc.
Henssge, Uta; Do, Thuy; Gilbert, Steven C.; Cox, Steven; Clark, Douglas; Wickström, Claes; Ligtenberg, A. J. M.; Radford, David R.; Beighton, David
2011-01-01
Actinomyces naeslundii and Actinomyces oris are members of the oral biofilm. Their identification using 16S rRNA sequencing is problematic and better achieved by comparison of metG partial sequences. A. oris is more abundant and more frequently isolated than A. naeslundii. We used a multi-locus sequence typing approach to investigate the genotypic diversity of these species and assigned A. naeslundii (n = 37) and A. oris (n = 68) isolates to 32 and 68 sequence types (ST), respectively. Neighbor-joining and ClonalFrame dendrograms derived from the concatenated partial sequences of 7 house-keeping genes identified at least 4 significant subclusters within A. oris and 3 within A. naeslundii. The strain collection we had investigated was an under-representation of the total population since at least 3 STs composed of single strains may represent discrete clusters of strains not well represented in the collection. The integrity of these sub-clusters was supported by the sequence analysis of fimP and fimA, genes coding for the type 1 and 2 fimbriae, respectively. An A. naeslundii subcluster was identified with both fimA and fimP genes and these strains were able to bind to MUC7 and statherin while all other A. naeslundii strains possessed only fimA and did not bind to statherin. An A. oris subcluster harboured a fimA gene similar to that of Actinomyces odontolyticus but no detectable fimP, and failed to bind significantly to either MUC7 or statherin. These data are evidence of extensive genotypic and phenotypic diversity within the species A. oris and A. naeslundii but the status of the subclusters identified here will require genome comparisons before their phylogenic position can be unequivocally established. PMID:21738661
Analysis of whole genome sequences of 16 strains of rubella virus from the United States, 1961-2009.
Abernathy, Emily; Chen, Min-hsin; Bera, Jayati; Shrivastava, Susmita; Kirkness, Ewen; Zheng, Qi; Bellini, William; Icenogle, Joseph
2013-01-25
Rubella virus is the causative agent of rubella, a mild rash illness, and a potent teratogenic agent when contracted by a pregnant woman. Global rubella control programs target the reduction and elimination of congenital rubella syndrome. Phylogenetic analysis of partial sequences of rubella viruses has contributed to virus surveillance efforts and played an important role in demonstrating that indigenous rubella viruses have been eliminated in the United States. Sixteen wild-type rubella viruses were chosen for whole genome sequencing. All 16 viruses were collected in the United States from 1961 to 2009 and are from 8 of the 13 known rubella genotypes. Phylogenetic analysis of 30 whole genome sequences produced a maximum likelihood tree giving high bootstrap values for all genotypes except provisional genotype 1a. Comparison of the 16 new complete sequences and 14 previously sequenced wild-type viruses found regions with clusters of variable amino acids. The 5' 250 nucleotides of the genome are more conserved than any other part of the genome. Genotype specific deletions in the untranslated region between the non-structural and structural open reading frames were observed for genotypes 2B and genotype 1G. No evidence was seen for recombination events among the 30 viruses. The analysis presented here is consistent with previous reports on the genetic characterization of rubella virus genomes. Conserved and variable regions were identified and additional evidence for genotype specific nucleotide deletions in the intergenic region was found. Phylogenetic analysis confirmed genotype groupings originally based on structural protein coding region sequences, which provides support for the WHO nomenclature for genetic characterization of wild-type rubella viruses.
Biogeography of Burkholderia pseudomallei in the Torres Strait Islands of Northern Australia
Baker, Anthony; Mayo, Mark; Owens, Leigh; Burgess, Graham; Norton, Robert; McBride, William John Hannan; Currie, Bart J.
2013-01-01
It has been hypothesized that biogeographical boundaries are a feature of Burkholderia pseudomallei ecology, and they impact the epidemiology of melioidosis on a global scale. This study examined the relatedness of B. pseudomallei sourced from islands in the Torres Strait of Northern Australia to determine if the geography of isolated island communities is a determinant of the organisms' dispersal. Environmental sampling on Badu Island in the Near Western Island cluster recovered a single clone. An additional 32 clinical isolates from the region were sourced. Isolates were characterized using multilocus sequence typing and a multiplex PCR targeting the flagellum gene cluster. Gene cluster analysis determined that 69% of the isolates from the region encoded the ancestral Burkholderia thailandensis-like flagellum and chemotaxis gene cluster, a proportion significantly lower than that reported from mainland Australia and consistent with observations of isolates from southern Papua New Guinea. A goodness-of-fit test indicated that there was geographic localization of sequence types throughout the archipelago, with the exception of Thursday Island, the economic and cultural hub of the region. Sequence types common to mainland Australia and Papua New Guinea were identified. These findings demonstrate for the first time an environmental reservoir for B. pseudomallei in the Torres Strait, and multilocus sequence typing suggests that the organism is not randomly distributed throughout this region and that seawater may provide a barrier to dispersal of the organism. Moreover, these findings support an anthropogenic dispersal hypothesis for the spread of B. pseudomallei throughout this region. PMID:23698533
Pulse sequence programming in a dynamic visual environment: SequenceTree.
Magland, Jeremy F; Li, Cheng; Langham, Michael C; Wehrli, Felix W
2016-01-01
To describe SequenceTree, an open source, integrated software environment for implementing MRI pulse sequences and, ideally, exporting them to actual MRI scanners. The software is a user-friendly alternative to vendor-supplied pulse sequence design and editing tools and is suited for programmers and nonprogrammers alike. The integrated user interface was programmed using the Qt4/C++ toolkit. As parameters and code are modified, the pulse sequence diagram is automatically updated within the user interface. Several aspects of pulse programming are handled automatically, allowing users to focus on higher-level aspects of sequence design. Sequences can be simulated using a built-in Bloch equation solver and then exported for use on a Siemens MRI scanner. Ideally, other types of scanners will be supported in the future. SequenceTree has been used for 8 years in our laboratory and elsewhere and has contributed to more than 50 peer-reviewed publications in areas such as cardiovascular imaging, solid state and nonproton NMR, MR elastography, and high-resolution structural imaging. SequenceTree is an innovative, open source, visual pulse sequence environment for MRI combining simplicity with flexibility and is ideal both for advanced users and users with limited programming experience. © 2015 Wiley Periodicals, Inc.
Parson, Walther; Strobl, Christina; Huber, Gabriela; Zimmermann, Bettina; Gomes, Sibylle M.; Souto, Luis; Fendt, Liane; Delport, Rhena; Langit, Reina; Wootton, Sharon; Lagacé, Robert; Irwin, Jodi
2013-01-01
Insights into the human mitochondrial phylogeny have been primarily achieved by sequencing full mitochondrial genomes (mtGenomes). In forensic genetics (partial) mtGenome information can be used to assign haplotypes to their phylogenetic backgrounds, which may, in turn, have characteristic geographic distributions that would offer useful information in a forensic case. In addition and perhaps even more relevant in the forensic context, haplogroup-specific patterns of mutations form the basis for quality control of mtDNA sequences. The current method for establishing (partial) mtDNA haplotypes is Sanger-type sequencing (STS), which is laborious, time-consuming, and expensive. With the emergence of Next Generation Sequencing (NGS) technologies, the body of available mtDNA data can potentially be extended much more quickly and cost-efficiently. Customized chemistries, laboratory workflows and data analysis packages could support the community and increase the utility of mtDNA analysis in forensics. We have evaluated the performance of mtGenome sequencing using the Personal Genome Machine (PGM) and compared the resulting haplotypes directly with conventional Sanger-type sequencing. A total of 64 mtGenomes (>1 million bases) were established that yielded high concordance with the corresponding STS haplotypes (<0.02% differences). About two-thirds of the differences were observed in or around homopolymeric sequence stretches. In addition, the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes. Further development of alignment software would be desirable to facilitate the application of NGS in mtDNA forensic genetics. PMID:23948325
Speidel, Wolfgang; Hausmann, Axel; Müller, Günter C; Kravchenko, Vasiliy; Mooser, Josef; Witt, Thomas J; Khallaayoune, Khalid; Prosser, Sean; Hebert, Paul D N
2015-08-11
The type of Lasiocampa decolorata (KLUG, 1830), collected in 1820, was successfully barcoded to generate a 658bp COI-fragment after 194 years. The resulting molecular data allowed the description of two closely related species from Morocco: Lasiocampa hannae SPEIDEL, MOOSER & WITT sp. n. from the Anti Atlas and Lasiocampa editae SPEIDEL, MOOSER & WITT sp. n. from the High Atlas.
Iterating between lessons on concepts and procedures can improve mathematics knowledge.
Rittle-Johnson, Bethany; Koedinger, Kenneth
2009-09-01
Knowledge of concepts and procedures seems to develop in an iterative fashion, with increases in one type of knowledge leading to increases in the other type of knowledge. This suggests that iterating between lessons on concepts and procedures may improve learning. The purpose of the current study was to evaluate the instructional benefits of an iterative lesson sequence compared to a concepts-before-procedures sequence for students learning decimal place-value concepts and arithmetic procedures. In two classroom experiments, sixth-grade students from two schools participated (N=77 and 26). Students completed six decimal lessons on an intelligent tutoring system. In the iterative condition, lessons cycled between concept and procedure lessons. In the concepts-first condition, all concept lessons were presented before introducing the procedure lessons. In both experiments, students in the iterative condition gained more knowledge of arithmetic procedures, including ability to transfer the procedures to problems with novel features. Knowledge of concepts was fairly comparable across conditions. Finally, pre-test knowledge of one type predicted gains in knowledge of the other type across experiments. An iterative sequencing of lessons seems to facilitate learning and transfer, particularly of mathematical procedures. The findings support an iterative perspective for the development of knowledge of concepts and procedures.
A Single Early Introduction of HIV-1 Subtype B into Central America Accounts for Most Current Cases
Murillo, Wendy; Veras, Nazle; Prosperi, Mattia; de Rivera, Ivette Lorenzana; Paz-Bailey, Gabriela; Morales-Miranda, Sonia; Juarez, Sandra I.; Yang, Chunfu; DeVos, Joshua; Marín, José Pablo; Mild, Mattias; Albert, Jan
2013-01-01
Human immunodeficiency virus type 1 (HIV-1) variants show considerable geographical separation across the world, but there is limited information from Central America. We provide the first detailed investigation of the genetic diversity and molecular epidemiology of HIV-1 in six Central American countries. Phylogenetic analysis was performed on 625 HIV-1 pol gene sequences collected between 2002 and 2010 in Honduras, El Salvador, Nicaragua, Costa Rica, Panama, and Belize. Published sequences from neighboring countries (n = 57) and the rest of the world (n = 740) were included as controls. Maximum likelihood methods were used to explore phylogenetic relationships. Bayesian coalescence-based methods were used to time HIV-1 introductions. Nearly all (98.9%) Central American sequences were of subtype B. Phylogenetic analysis revealed that 437 (70%) sequences clustered within five significantly supported monophyletic clades formed essentially by Central American sequences. One clade contained 386 (62%) sequences from all six countries; the other four clades were smaller and more country specific, suggesting discrete subepidemics. The existence of one large well-supported Central American clade provides evidence that a single introduction of HIV-1 subtype B in Central America accounts for most current cases. An introduction during the early phase of the HIV-1 pandemic may explain its epidemiological success. Moreover, the smaller clades suggest a subsequent regional spread related to specific transmission networks within each country. PMID:23616665
pyPaSWAS: Python-based multi-core CPU and GPU sequence alignment.
Warris, Sven; Timal, N Roshan N; Kempenaar, Marcel; Poortinga, Arne M; van de Geest, Henri; Varbanescu, Ana L; Nap, Jan-Peter
2018-01-01
Our previously published CUDA-only application PaSWAS for Smith-Waterman (SW) sequence alignment of any type of sequence on NVIDIA-based GPUs is platform-specific and is therefore adopted less widely than it could be. The OpenCL language is supported more widely and allows use on a variety of hardware platforms. Moreover, there is a need to promote the adoption of parallel computing in bioinformatics by making its use and extension simpler through more and better application of high-level languages commonly used in bioinformatics, such as Python. The novel application pyPaSWAS presents the parallel SW sequence alignment code fully packed in Python. It is a generic SW implementation running on several hardware platforms with multi-core systems and/or GPUs that provides accurate sequence alignments that can also be inspected for alignment details. Additionally, pyPaSWAS supports the affine gap penalty. Python libraries are used for automated system configuration, I/O and logging. This way, the Python environment will stimulate further extension and use of pyPaSWAS. pyPaSWAS presents an easy Python-based environment for accurate and retrievable parallel SW sequence alignments on GPUs and multi-core systems. The strategy of integrating Python with high-performance parallel compute languages to create a developer- and user-friendly environment should be considered for other computationally intensive bioinformatics algorithms.
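For readers unfamiliar with the affine gap penalty mentioned above, the following is a minimal, single-threaded sketch of Smith-Waterman local alignment scoring with affine gaps (the Gotoh recurrence). It is not the pyPaSWAS code or API: the function name, the scoring values, and the convention that a gap of length k costs gap_open + (k - 1) * gap_extend are all assumptions chosen for illustration.

```python
def smith_waterman_affine(a, b, match=2, mismatch=-1, gap_open=-3, gap_extend=-1):
    """Return the best local alignment score of a and b with affine gaps.

    Illustrative sketch only; scoring parameters are assumed values.
    A gap of length k costs gap_open + (k - 1) * gap_extend.
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    # H: best score of a local alignment ending at (i, j)
    # E: best score ending at (i, j) with a gap in `a` (b[j-1] is unpaired)
    # F: best score ending at (i, j) with a gap in `b` (a[i-1] is unpaired)
    H = [[0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Either extend an existing gap or open a new one.
            E[i][j] = max(E[i][j - 1] + gap_extend, H[i][j - 1] + gap_open)
            F[i][j] = max(F[i - 1][j] + gap_extend, H[i - 1][j] + gap_open)
            s = match if a[i - 1] == b[j - 1] else mismatch
            # The 0 term lets a local alignment restart anywhere.
            H[i][j] = max(0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


if __name__ == "__main__":
    print(smith_waterman_affine("ACGTACGT", "ACGTTTACGT"))
```

Each cell depends only on its left, upper, and upper-left neighbours, so all cells on one anti-diagonal can be computed independently; that independence is what makes the algorithm a natural fit for GPU and multi-core parallelisation.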
A novel 5-bp deletion in Clarin 1 in a family with Usher syndrome.
Akoury, Elie; El Zir, Elie; Mansour, Ahmad; Mégarbané, André; Majewski, Jacek; Slim, Rima
2011-11-01
To identify the genetic defect in a Lebanese family with two sibs diagnosed with Usher Syndrome. Exome capture and sequencing were performed on DNA from one affected member using Agilent in solution bead capture, followed by Illumina sequencing. This analysis revealed the presence of a novel homozygous 5-bp deletion, in Clarin 1 (CLRN1), a known gene responsible for Usher syndrome type III. The deletion is inherited from both parents and segregates with the disease phenotype in the family. The 5-bp deletion, c.301_305delGTCAT, p.Val101SerfsX27, is predicted to result in a frameshift and protein truncation after 27 amino acids. Sequencing all the coding regions of the CLRN1 gene in the proband did not reveal any other mutation or variant. Here we describe a novel deletion in CLRN1. Our data support previously reported intra familial variability in the clinical features of Usher syndrome type I and III.
Chloroplast Phylogenomics Indicates that Ginkgo biloba Is Sister to Cycads
Wu, Chung-Shien; Chaw, Shu-Miaw; Huang, Ya-Yi
2013-01-01
Molecular phylogenetic studies have not yet reached a consensus on the placement of Ginkgoales, which is represented by the only living species, Ginkgo biloba (common name: ginkgo). At least six discrepant placements of ginkgo have been proposed. This study aimed to use the chloroplast phylogenomic approach to examine possible factors that lead to such disagreeing placements. We found the sequence types used in the analyses as the most critical factor in the conflicting placements of ginkgo. In addition, the placement of ginkgo varied in the trees inferred from nucleotide (NU) sequences, which notably depended on breadth of taxon sampling, tree-building methods, codon positions, positions of Gnetopsida (common name: gnetophytes), and including or excluding gnetophytes in data sets. In contrast, the trees inferred from amino acid (AA) sequences congruently supported the monophyly of a ginkgo and Cycadales (common name: cycads) clade, regardless of which factors were examined. Our site-stripping analysis further revealed that the high substitution saturation of NU sequences mainly derived from the third codon positions and contributed to the variable placements of ginkgo. In summary, the factors we surveyed did not affect results inferred from analyses of AA sequences. Congruent topologies in our AA trees give more confidence in supporting the ginkgo–cycad sister-group hypothesis. PMID:23315384
Synchronized excitability in a network enables generation of internal neuronal sequences
Wang, Yingxue; Roth, Zachary; Pastalkova, Eva
2016-01-01
Hippocampal place field sequences are supported by sensory cues and network internal mechanisms. In contrast, sharp-wave (SPW) sequences, theta sequences, and episode field sequences are internally generated. The relationship of these sequences to memory is unclear. SPW sequences have been shown to support learning and have been assumed to also support episodic memory. Conversely, we demonstrate these SPW sequences were present in trained rats even after episodic memory was impaired and after other internal sequences – episode field and theta sequences – were eliminated. SPW sequences did not support memory despite continuing to ‘replay’ all task-related sequences – place field and episode field sequences. Sequence replay occurred selectively during synchronous increases of population excitability – SPWs. Similarly, theta sequences depended on the presence of repeated synchronized waves of excitability – theta oscillations. Thus, we suggest that either intermittent or rhythmic synchronized changes of excitability trigger sequential firing of neurons, which in turn supports learning and/or memory. DOI: http://dx.doi.org/10.7554/eLife.20697.001 PMID:27677848
Cryptic Diversity of Malassezia pachydermatis from Healthy and Diseased Domestic Animals.
Puig, Laura; Castellá, Gemma; Cabañes, F Javier
2016-10-01
Malassezia pachydermatis is part of the normal cutaneous microbiota of wild and domestic carnivores. However, under certain conditions this yeast can overproliferate and cause several diseases in its host, mainly otitis and dermatitis in dogs. The aim of this study was to conduct a molecular characterization of M. pachydermatis isolates from healthy and diseased domestic animals, in order to assess the molecular diversity and phylogenetic relationship within this species. The large subunit (LSU) and the internal transcribed spacer (ITS) of ribosomal RNA, chitin synthase 2 (CHS2) and β-tubulin genes from sixteen strains isolated from dogs, cats, a goat, a pig and a horse were sequenced. A different number of types of sequences were identified for each target gene, including some types described for the first time. Five sequence types were characterized for the LSU, eleven for the ITS region, nine for CHS2 and eight for β-tubulin. A multilocus analysis was performed including the four genes, and the resulting phylogenetic tree revealed fifteen genotypes. Genotypes were distributed in two well-supported clades. One clade comprised strains isolated from different domestic animals and a strongly supported cluster constituted by strains isolated from cats. The second clade included strains isolated mainly from dogs and an outlier strain isolated from a horse. No apparent association could be observed between the health status of the animal hosts and concrete strains. The multilocus phylogenetic analysis is a useful tool to assess the intraspecific variation within this species and could help understand the ecology, epidemiology and speciation process of M. pachydermatis.
ERIC Educational Resources Information Center
Song, Youngjin; Higgins, Teresa; Harding-DeKam, Jenni
2014-01-01
This article describes a series of inquiry-based lessons that provide English language learners (ELLs) with opportunities to experience science and engineering practices with conceptual understanding as well as to develop their language proficiency in elementary classrooms. The four-lesson sequence models how various types of instructional…
Personality and cognitive profiles of a general synesthetic trait.
Rouw, Romke; Scholte, H Steven
2016-07-29
The recent sharp increase in studies on synesthesia has taught us a lot about this fascinating condition. Still, while we define synesthesia as 'the mixing of senses', the great majority of synesthesia studies focus on only one synesthesia type (in particular grapheme-color synesthesia). In this study, a large group of subjects are tested on the presence or absence of different types of synesthesia. Efforts to recruit a representative sample of the Dutch population, not related to or aware of synesthesia as a research topic, helped counter a selection bias or a self-report bias in our subject group. A sharp increase in synesthesia prevalence was found, at least partially due to including many different types of synesthesia in the synesthesia 'diagnoses'. The five synesthesia types reported in the Novich et al (2011) study were obtained; Colored Sequences, Colored Music, Colored Sensations, Spatial Sequences, Non-Visual Sequelae, as well as an additional synesthesia type, Sequence-Personality. No differences were found between synesthetes and non-synesthetes in education level, handedness, age, and sex. The synesthetes showed increased intelligence as compared with matched non-synesthetes. This was a general effect rather than bound to a specific cognitive domain or to a specific (synesthesia-type to stimulus-material) relationship. The expected effect of increased "Openness" in synesthetes was obtained, as well as two unexpected effects in personality traits (increased "Neuroticism" and decreased "Conscientiousness"). We also found increased "Emotionality" (experiencing emotions) and increased "Fantasizing", but synesthetes did not differ in cognitive appraisal of emotions (identifying/analyzing/verbalizing of emotions). The personality and cognitive characteristics were found to be related to having synesthesia (in general) rather than to particular synesthesia subtypes.
This supports the existence of a general synesthetic 'trait', over the notion of relatively independent 'types' of synesthesia. In further support, exploratory analyses showed that a measurement of synesthetic strength (number of subtypes of synesthesia) correlates with stronger findings (increased "Openness", "Fantasizing", and "Emotionality", and decreased "Conscientiousness"). In conclusion, results are in line with the notion of a general synesthetic 'trait', and this synesthetic trait is associated with particular personality traits and cognitive characteristics. Copyright © 2016. Published by Elsevier Ltd.
Effects of the Ion PGM™ Hi-Q™ sequencing chemistry on sequence data quality.
Churchill, Jennifer D; King, Jonathan L; Chakraborty, Ranajit; Budowle, Bruce
2016-09-01
Massively parallel sequencing (MPS) offers substantial improvements over current forensic DNA typing methodologies such as increased resolution, scalability, and throughput. The Ion PGM™ is a promising MPS platform for analysis of forensic biological evidence. The system employs a sequencing-by-synthesis chemistry on a semiconductor chip that measures a pH change due to the release of hydrogen ions as nucleotides are incorporated into the growing DNA strands. However, implementation of MPS into forensic laboratories requires a robust chemistry. Ion Torrent's Hi-Q™ Sequencing Chemistry was evaluated to determine if it could improve on the quality of the generated sequence data in association with selected genetic marker targets. The whole mitochondrial genome and the HID-Ion STR 10-plex panel were sequenced on the Ion PGM™ system with the Ion PGM™ Sequencing 400 Kit and the Ion PGM™ Hi-Q™ Sequencing Kit. Concordance, coverage, strand balance, noise, and deletion ratios were assessed in evaluating the performance of the Ion PGM™ Hi-Q™ Sequencing Kit. The results indicate that reliable, accurate data are generated and that sequencing through homopolymeric regions can be improved with the use of Ion Torrent's Hi-Q™ Sequencing Chemistry. Overall, the quality of the generated sequencing data supports the potential for use of the Ion PGM™ in forensic genetic laboratories.
Method for identifying and quantifying nucleic acid sequence aberrations
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
1998-01-01
A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations
Lucas, J.N.; Straume, T.; Bogen, K.T.
1998-07-21
A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
Wang, Liyan; Ma, Lina; Liu, Yongan; Gao, Pengcheng; Li, Youquan; Li, Xuerui; Liu, Yongsheng
2016-10-01
Haemophilus parasuis is the etiological agent of Glässer's disease, which causes high morbidity and mortality in swine herds. Although H. parasuis strains can be classified into 15 serovars with the Kielstein-Rapp-Gabrielson serotyping scheme, a large number of isolates cannot be classified and have been designated 'nontypeable' strains. In this study, multilocus sequence typing (MLST) of H. parasuis was used to analyze 48 H. parasuis field strains isolated in China and two strains from Australia. Twenty-six new alleles and 29 new sequence types (STs) were detected, enriching the H. parasuis MLST databases. A BURST analysis indicated that H. parasuis lacks stable population structure and is highly heterogeneous, and that there is no association between STs and geographic area. When a UPGMA dendrogram was constructed, two major clades, clade A and clade B, were defined. Animal experiments, in which guinea pigs were challenged intraperitoneally with the bacterial isolates, supported the hypothesis that the H. parasuis STs in clade A are generally avirulent or weakly virulent, whereas the STs in clade B tend to be virulent. Copyright © 2016 Elsevier B.V. All rights reserved.
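Several records in this listing rely on multilocus sequence typing, so a small sketch of the bookkeeping behind it may help. In MLST each isolate is reduced to an allelic profile (one allele number per locus), and every distinct profile is given a sequence type (ST) number. The function below is an assumed, simplified illustration: the isolate names, allele numbers, and ST numbers are hypothetical and do not come from the study above, and real schemes assign new ST numbers through a curated database rather than locally.

```python
def assign_sequence_types(profiles, known=None):
    """Map each isolate's allelic profile to a sequence type (ST) number.

    profiles: dict of isolate name -> tuple of allele numbers, one per locus.
    known:    optional dict of profile tuple -> existing ST number.
    Returns (isolate -> ST, updated profile -> ST table).
    Hypothetical helper for illustration; not part of any MLST tool.
    """
    table = dict(known or {})
    next_st = max(table.values(), default=0) + 1
    assigned = {}
    for isolate, profile in profiles.items():
        key = tuple(profile)
        if key not in table:
            # An unseen combination of alleles defines a new sequence type.
            table[key] = next_st
            next_st += 1
        assigned[isolate] = table[key]
    return assigned, table


if __name__ == "__main__":
    # Hypothetical three-locus profiles (real schemes typically use about seven loci).
    isolates = {"iso1": (1, 2, 3), "iso2": (1, 2, 3), "iso3": (1, 2, 4)}
    sts, table = assign_sequence_types(isolates, known={(1, 2, 3): 7})
    print(sts)
```

Two isolates share an ST only when every locus matches, which is why a single new allele at one locus is enough to create a new ST.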
Woudstra, Cedric; Le Maréchal, Caroline; Souillard, Rozenn; Bayon-Auboyer, Marie-Hélène; Anniballi, Fabrizio; Auricchio, Bruna; De Medici, Dario; Bano, Luca; Koene, Miriam; Sansonetti, Marie-Hélène; Desoutter, Denise; Hansbauer, Eva-Maria; Dorner, Martin B.; Dorner, Brigitte G.
2015-01-01
We report the development of real-time PCR assays for genotyping Clostridium botulinum group III targeting the newly defined C. novyi sensu lato group; the nontoxic nonhemagglutinin (NTNH)-encoding gene ntnh; the botulinum neurotoxin (BoNT)-encoding genes bont/C, bont/C/D, bont/D, and bont/D/C; and the flagellin (fliC) gene. The genetic diversity of fliC among C. botulinum group III strains resulted in the definition of five major subgroups named fliC-I to fliC-V. Investigation of fliC subtypes in 560 samples, with various European origins, showed that fliC-I was predominant and found exclusively in samples contaminated by C. botulinum type C/D, fliC-II was rarely detected, no sample was recorded as fliC-III or fliC-V, and only C. botulinum type D/C samples tested positive for fliC-IV. The lack of genetic diversity of the flagellin gene of C. botulinum type C/D would support a clonal spread of type C/D strains in different geographical areas. fliC-I to fliC-III are genetically related (87% to 92% sequence identity), whereas fliC-IV from C. botulinum type D/C is more genetically distant from the other fliC types (with only 50% sequence identity). These findings suggest fliC-I to fliC-III have evolved in a common environment and support a different genetic evolution for fliC-IV. A combination of the C. novyi sensu lato, ntnh, bont, and fliC PCR assays developed in this study allowed better characterization of C. botulinum group III and showed the group to be less genetically diverse than C. botulinum groups I and II, supporting a slow genetic evolution of the strains belonging to C. botulinum group III. PMID:25636839
Keeping it together: Semantic coherence stabilizes phonological sequences in short-term memory.
Savill, Nicola; Ellis, Rachel; Brooke, Emma; Koa, Tiffany; Ferguson, Suzie; Rojas-Rodriguez, Elena; Arnold, Dominic; Smallwood, Jonathan; Jefferies, Elizabeth
2018-04-01
Our ability to hold a sequence of speech sounds in mind, in the correct configuration, supports many aspects of communication, but the contribution of conceptual information to this basic phonological capacity remains controversial. Previous research has shown modest and inconsistent benefits of meaning on phonological stability in short-term memory, but these studies were based on sets of unrelated words. Using a novel design, we examined the immediate recall of sentence-like sequences with coherent meaning, alongside both standard word lists and mixed lists containing words and nonwords. We found, and replicated, substantial effects of coherent meaning on phoneme-level accuracy: The phonemes of both words and nonwords within conceptually coherent sequences were more likely to be produced together and in the correct order. Since nonwords do not exist as items in long-term memory, the semantic enhancement of phoneme-level recall for both item types cannot be explained by a lexically based item reconstruction process employed at the point of retrieval ("redintegration"). Instead, our data show, for naturalistic input, that when meaning emerges from the combination of words, the phonological traces that support language are reinforced by a semantic-binding process that has been largely overlooked by past short-term memory research.
Bartoletti-Stella, Anna; Baiardi, Simone; Stanzani-Maserati, Michelangelo; Piras, Silvia; Caffarra, Paolo; Raggi, Alberto; Pantieri, Roberta; Baldassari, Sara; Caporali, Leonardo; Abu-Rumeileh, Samir; Linarello, Simona; Liguori, Rocco; Parchi, Piero; Capellari, Sabina
2018-06-01
Genetics is intricately involved in the etiology of neurodegenerative dementias. The incidence of monogenic dementia among all neurodegenerative forms is unknown due to the lack of systematic studies and of patient/clinician access to extensive diagnostic procedures. In this study, we conducted targeted sequencing in 246 clinically heterogeneous patients, mainly with early-onset and/or familial neurodegenerative dementia, using a custom-designed next-generation sequencing panel covering 27 genes known to harbor mutations that can cause different types of dementia, in addition to the detection of C9orf72 repeat expansions. Forty-nine patients (19.9%) carried known pathogenic or novel, likely pathogenic, variants, involving both common (presenilin 1, presenilin 2, C9orf72, and granulin) and rare (optineurin, serpin family I member 1 and protein kinase cyclic adenosine monophosphate (cAMP)-dependent type I regulatory subunit beta) dementia-associated genes. Our results support the use of extended next-generation sequencing panels as a quick, accurate, and cost-effective method for diagnosis in clinical practice. This approach could have a significant impact on the proportion of tested patients, especially among those with an early disease onset. Copyright © 2018 Elsevier Inc. All rights reserved.
Hansen, Cristina M.; Himschoot, Elizabeth; Hare, Rebekah F.; Meixell, Brandt W.; Van Hemert, Caroline R.; Hueffer, Karsten
2017-01-01
During the summers of 2013 and 2014, isolates of a novel Gram-negative coccus in the Neisseria genus were obtained from the contents of nonviable greater white-fronted goose (Anser albifrons) eggs on the Arctic Coastal Plain of Alaska. We used a polyphasic approach to determine whether these isolates represent a novel species. 16S rRNA gene sequences, 23S rRNA gene sequences, and chaperonin 60 gene sequences suggested that these Alaskan isolates are members of a distinct species that is most closely related to Neisseria canis, N. animaloris, and N. shayeganii. Analysis of the rplF gene additionally showed that our isolates are unique and most closely related to N. weaveri. Average nucleotide identity of the whole genome sequence of our type strain was between 71.5% and 74.6% compared to close relatives, further supporting designation as a novel species. Fatty acid methyl ester analysis showed a predominance of C14:0, C16:0, and C16:1ω7c fatty acids. Finally, biochemical characteristics distinguished our isolates from other Neisseria species. The name Neisseria arctica (type strain KH1503T = ATCC TSD-57T = DSM 103136T) is proposed.
Miftahussurur, Muhammad; Tuda, Josef; Suzuki, Rumiko; Kido, Yasutoshi; Kawamoto, Fumihiko; Matsuda, Miyuki; Tantular, Indah S; Pusarawati, Suhintam; Nasronudin; Harijanto, Paul N; Yamaoka, Yoshio
2014-01-01
Sulawesi in Indonesia has a unique geographical profile with assumed separation from Sundaland. Studies of Helicobacter pylori in this region are rare due to the region's rural location and lack of endoscopy equipment. Indirect methods are, therefore, the most appropriate for measuring H. pylori infection in these areas; with the disposable gastric brush test, we can obtain gastric juice as well as small gastric tissue samples for H. pylori culture. We investigated the prevalence of H. pylori infection and evaluated human migration patterns in the remote areas of North Sulawesi. We recruited a total of 251 consecutive adult volunteers and 131 elementary school children. H. pylori infection was determined by urine antibody test. A gastric brush test was used to culture H. pylori. We used next-generation and polymerase chain reaction based sequencing to determine virulence factors and multi-locus sequence typing (MLST). The overall H. pylori prevalence was only 14.3% for adults and 3.8% for children, and 13.6% and 16.7% in Minahasanese and Mongondownese participants, respectively. We isolated a single H. pylori strain, termed Manado-1. Manado-1 was East Asian type cagA (ABD type), vacA s1c-m1b, iceA1 positive/iceA2 negative, jhp0562-positive/β-(1,3) galT-negative, oipA "on", and dupA-negative. Phylogenetic analyses showed the strain to be hspMaori type, a major type observed in native Taiwanese and Maori tribes. Our data support a very low H. pylori infection prevalence in Indonesia. Identification of hspMaori type H. pylori in North Sulawesi may support the hypothesis that North Sulawesi people migrated from the north.
AMS 4.0: consensus prediction of post-translational modifications in protein sequences.
Plewczynski, Dariusz; Basu, Subhadip; Saha, Indrajit
2012-08-01
We present here the 2011 update of the AutoMotif Service (AMS 4.0) that predicts a wide selection of 88 different types of single amino acid post-translational modifications (PTM) in protein sequences. The selection of experimentally confirmed modifications is acquired from the latest UniProt and Phospho.ELM databases for training. The sequence vicinity of each modified residue is represented using amino acid physico-chemical features encoded using high quality indices (HQI) obtained by automatic clustering of known indices extracted from the AAindex database. For each type of numerical representation, the method builds an ensemble of Multi-Layer Perceptron (MLP) pattern classifiers, each optimising different objectives during the training (for example the recall, precision or area under the ROC curve (AUC)). The consensus is built using brainstorming technology, which combines multi-objective instances of machine learning algorithms, and the data fusion of different training object representations, in order to boost the overall prediction accuracy of conserved short sequence motifs. The performance of AMS 4.0 is compared with the accuracy of previous versions, which were constructed using single machine learning methods (artificial neural networks, support vector machine). Our software improves the average AUC score of the earlier version by close to 7 % as calculated on the test datasets of all 88 PTM types. Moreover, for the selected most-difficult sequence motif types it is able to improve the prediction performance by almost 32 %, when compared with previously used single machine learning methods. Summarising, the brainstorming consensus meta-learning methodology on average boosts the AUC score up to around 89 %, averaged over all 88 PTM types. Detailed results for single machine learning methods and the consensus methodology are also provided, together with the comparison to previously published methods and state-of-the-art software tools.
The source code and precompiled binaries of brainstorming tool are available at http://code.google.com/p/automotifserver/ under Apache 2.0 licensing.
Eisenberg, Tobias; Glaeser, Stefanie P; Ewers, Christa; Semmler, Torsten; Nicklas, Werner; Rau, Jörg; Mauder, Norman; Hofmann, Nicola; Imaoka, Koichi; Kimura, Masanobu; Kämpfer, Peter
2015-12-01
A pleomorphic, Gram-negative, rod-shaped, indole-, oxidase- and catalase-negative, non-spore-forming, non-motile bacterium was isolated in 1979 from the heart of a spinifex hopping mouse (Notomys alexis Thomas, 1922) with septicaemia and stored as Streptobacillus moniliformis in the strain collection of the Animal Health Laboratory, South Perth, Western Australia (AHL 370-1), as well as under CCUG 12425. On the basis of 16S rRNA gene sequence analyses, the strain was assigned to the genus Streptobacillus, with 99.4 % sequence similarity to the type strain of Streptobacillus moniliformis, 95.6 % sequence similarity to the type strain of Streptobacillus hongkongensis and 99.0 % sequence similarity to the type strain of Streptobacillus felis. The clear differentiation of strain AHL 370-1T from Streptobacillus moniliformis, Streptobacillus hongkongensis and Streptobacillus felis was also supported by rpoB, groEL and recA nucleotide and amino acid sequence analysis. Average nucleotide identity was 87.16 % between strain AHL 370-1T and Streptobacillus moniliformis DSM 12112T. Physiological data confirmed the allocation of strain AHL 370-1T to the family Leptotrichiaceae, considering the very similar profiles of enzyme activities and fatty acids compared to closely related species. Within the genus Streptobacillus, isolate AHL 370-1T could also be separated unambiguously from the type strains of Streptobacillus moniliformis, Streptobacillus hongkongensis and Streptobacillus felis by MALDI-TOF mass spectrometry. Two further strains (KWG2 and KWG24) isolated from asymptomatic black rats in Japan were highly similar to AHL 370-1T. On the basis of these data, we propose the novel species Streptobacillus notomytis sp. nov., with the type strain AHL 370-1T (=CCUG 12425T=DSM 100026T=CCM 8593T=EF 12425T).
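Percent sequence similarity figures such as those quoted above are computed from a pairwise alignment. The sketch below shows one common convention, assumed here for illustration: identical columns divided by all aligned columns, with a gap counted as a mismatch and columns that are gaps in both sequences ignored. Published values may use other conventions (for example, excluding gapped columns entirely), and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned, equal-length sequences.

    Assumed convention: matches / compared columns * 100, where a gap
    opposite a residue counts as a mismatch and gap-gap columns are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap and y == gap:
            continue  # column carries no information for this pair
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments: 7 of 8 columns identical.
    print(percent_identity("ACGTACGT", "ACGTACGA"))
```

Because identity depends on the alignment and on how gaps are treated, values from different studies are only directly comparable when the same convention was used.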
CRISPR adaptive immune systems of Archaea
Vestergaard, Gisle; Garrett, Roger A; Shah, Shiraz A
2014-01-01
CRISPR adaptive immune systems were analyzed for all available completed genomes of archaea, which included representatives of each of the main archaeal phyla. Initially, all proteins encoded within, and proximal to, CRISPR-cas loci were clustered and analyzed using a profile–profile approach. Then cas genes were assigned to gene cassettes and to functional modules for adaptation and interference. CRISPR systems were then classified primarily on the basis of their concatenated Cas protein sequences and gene synteny of the interference modules. With few exceptions, they could be assigned to the universal Type I or Type III systems. For Type I, subtypes I-A, I-B, and I-D dominate but the data support the division of subtype I-B into two subtypes, designated I-B and I-G. About 70% of the Type III systems fall into the universal subtypes III-A and III-B but the remainder, some of which are phyla-specific, diverge significantly in Cas protein sequences, and/or gene synteny, and they are classified separately. Furthermore, a few CRISPR systems that could not be assigned to Type I or Type III are categorized as variant systems. Criteria are presented for assigning newly sequenced archaeal CRISPR systems to the different subtypes. Several accessory proteins were identified that show a specific gene linkage, especially to Type III interference modules, and these may be cofunctional with the CRISPR systems. Evidence is presented for extensive exchange having occurred between adaptation and interference modules of different archaeal CRISPR systems, indicating the wide compatibility of the functionally diverse interference complexes with the relatively conserved adaptation modules. PMID:24531374
Prediction of type III secretion signals in genomes of gram-negative bacteria.
Löwer, Martin; Schneider, Gisbert
2009-06-15
Pathogenic bacteria infecting animals as well as plants use various mechanisms to transport virulence factors across their cell membranes and channel these proteins into the infected host cell. The type III secretion system represents such a mechanism. Proteins transported via this pathway ("effector proteins") have to be distinguished from all other proteins that are not exported from the bacterial cell. Although a special targeting signal at the N-terminal end of effector proteins has been proposed in the literature, its exact characteristics remain unknown. In this study, we demonstrate that the signals encoded in the sequences of type III secretion system effectors can be consistently recognized and predicted by machine learning techniques. Known protein effectors were compiled from the literature and sequence databases, and served as training data for artificial neural networks and support vector machine classifiers. Common sequence features were most pronounced in the first 30 amino acids of the effector sequences. Classification accuracy yielded a cross-validated Matthews correlation of 0.63 and allowed for genome-wide prediction of potential type III secretion system effectors in 705 proteobacterial genomes (12% predicted candidate proteins), their chromosomes (11%) and plasmids (13%), as well as 213 Firmicute genomes (7%). We present a signal prediction method together with a comprehensive survey of potential type III secretion system effectors extracted from 918 published bacterial genomes. Our study demonstrates that the analyzed signal features are common across a wide range of species, and provides a substantial basis for the identification of exported pathogenic proteins as targets for future therapeutic intervention. The prediction software is publicly accessible from our web server (www.modlab.org).
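For readers unfamiliar with the Matthews correlation reported in this abstract, the sketch below computes it from binary confusion-matrix counts. The counts are hypothetical and are not taken from the study; returning 0.0 when the denominator vanishes is a common convention that is assumed here.

```python
import math


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when any marginal total is zero (assumed convention),
    because the coefficient is undefined in that case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for an effector / non-effector classifier.
mcc = matthews_corrcoef(tp=80, tn=90, fp=10, fn=20)
print(f"MCC = {mcc:.3f}")
```

The coefficient ranges from -1 (total disagreement) through 0 (no better than chance) to +1 (perfect prediction), which is why it is often preferred over plain accuracy when the classes are unbalanced.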
V, Pavana Jyothi; S, Akila; Selvan, Malini K; Naidu, Hariprasad; Raghunathan, Shwethaa; Kota, Sathish; Sundaram, R C Raja; Rana, Samir Kumar; Raj, G Dhinakar; Srinivasan, V A; Mohana Subramanian, B
2016-12-01
Canine parvovirus (CPV) is a non-enveloped single stranded DNA virus with an icosahedral capsid. Mini-sequencing based CPV typing was developed earlier to detect and differentiate all the CPV types and FPV in a single reaction. This technique was further evaluated in the present study by performing the mini-sequencing directly from fecal samples, which avoided the tedious virus isolation steps of a cell culture system. Fecal swab samples were collected from 84 dogs with enteritis symptoms suggestive of parvoviral infection from different locations across India. Seventy-six of these samples were positive by PCR; the subsequent mini-sequencing reaction typed 74 of them as type 2a virus, and 2 samples as type 2b. Additionally, 25 of the positive samples were typed by cycle sequencing of PCR products. Direct CPV typing from fecal samples using mini-sequencing showed 100% correlation with CPV typing by cycle sequencing. Moreover, CPV typing was achieved by mini-sequencing even with faintly positive PCR amplicons, which was not possible by cycle sequencing. Therefore, the mini-sequencing technique is recommended for regular epidemiological follow up of CPV types, since the technique is a rapid, highly sensitive and high-capacity method for CPV typing. Copyright © 2016. Published by Elsevier B.V.
Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.
2015-01-01
Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673
Huser, Vojtech; Sincan, Murat; Cimino, James J
2014-01-01
Personalized medicine, the ability to tailor diagnostic and treatment decisions for individual patients, is seen as the evolution of modern medicine. We characterize here the informatics resources available today or envisioned in the near future that can support clinical interpretation of genomic test results. We assume a clinical sequencing scenario (germline whole-exome sequencing) in which a clinical specialist, such as an endocrinologist, needs to tailor patient management decisions within his or her specialty (targeted findings) but relies on a genetic counselor to interpret off-target incidental findings. We characterize the genomic input data and list various types of knowledge bases that provide genomic knowledge for generating clinical decision support. We highlight the need for patient-level databases with detailed lifelong phenotype content in addition to genotype data and provide a list of recommendations for personalized medicine knowledge bases and databases. We conclude that no single knowledge base can currently support all aspects of personalized recommendations and that consolidation of several current resources into larger, more dynamic and collaborative knowledge bases may offer a future path forward. PMID:25276091
Common Data Analysis Pipeline | Office of Cancer Clinical Proteomics Research
CPTAC supports analyses of the mass spectrometry raw data (mapping of spectra to peptide sequences and protein identification) for the public using a Common Data Analysis Pipeline (CDAP). The data types available on the public portal are described below.
Post-Fire Analysis of Solid-Sawn Heavy Timber Beams
Robert H. White; Frank E. Woeste
2013-01-01
After fire exposure, design professionals are sometimes called upon to determine if the charred heavy timbers (Figure 1) are safe for future use without additional support or repairs. In this article, the authors present a sequence of reasoned steps that will help design professionals analyze charred timbers and gain the type of information needed to...
Using a contextualized sensemaking model for interaction design: A case study of tumor contouring.
Aselmaa, Anet; van Herk, Marcel; Laprie, Anne; Nestle, Ursula; Götz, Irina; Wiedenmann, Nicole; Schimek-Jasch, Tanja; Picaud, Francois; Syrykh, Charlotte; Cagetti, Leonel V; Jolnerovski, Maria; Song, Yu; Goossens, Richard H M
2017-01-01
Sensemaking theories help designers understand the cognitive processes of a user when he/she performs a complicated task. This paper introduces a two-step approach of incorporating sensemaking support within the design of health information systems by: (1) modeling the sensemaking process of physicians while performing a task, and (2) identifying software interaction design requirements that support sensemaking based on this model. The two-step approach is presented based on a case study of the tumor contouring clinical task for radiotherapy planning. In the first step of the approach, a contextualized sensemaking model was developed to describe the sensemaking process based on the goal, the workflow and the context of the task. In the second step, based on a research software prototype, an experiment was conducted where three contouring tasks were performed by eight physicians respectively. Four types of navigation interactions and five types of interaction sequence patterns were identified by analyzing the gathered interaction log data from those twenty-four cases. Further in-depth study on each of the navigation interactions and interaction sequence patterns in relation to the contextualized sensemaking model revealed five main areas for design improvements to increase sensemaking support. Outcomes of the case study indicate that the proposed two-step approach was beneficial for gaining a deeper understanding of the sensemaking process during the task, as well as for identifying design requirements for better sensemaking support. Copyright © 2016. Published by Elsevier Inc.
Cassemiro, Klécia M S M; Burlandy, Fernanda M; da Silva, Edson E
2016-07-01
A natural type 3/type 2 intertypic capsid recombinant vaccine-related poliovirus was isolated from an acute flaccid paralytic case in Brazil. Genome sequencing revealed the uncommon location of the crossover site in the VP1 coding region (nucleotides 3251-3258 of Sabin 3 genome). The Sabin 2 donor sequence replaced the last 118 nt of VP1, resulting in the substitution of the complete antigenic site IIIa by PV2-specific amino acids. The low overall number of nucleotide substitutions in P1 region indicated that the predicted replication time of the isolate was about 8-9 weeks. Two of the principal determinants of attenuation in Sabin 3 genomes were mutated (U472C and C2493U), but the temperature-sensitive phenotype of the isolate was preserved. Our results support the theory that there exists a PV3/PV2 recombination hotspot site in the tail region of the VP1 capsid protein and that the recombination may occur soon after oral poliovirus vaccine administration.
Burkholderia cordobensis sp. nov., from agricultural soils.
Draghi, Walter O; Peeters, Charlotte; Cnockaert, Margo; Snauwaert, Cindy; Wall, Luis G; Zorreguieta, Angeles; Vandamme, Peter
2014-06-01
Two Gram-negative, rod-shaped bacteria were isolated from agricultural soils in Córdoba province in central Argentina. Their 16S rRNA gene sequences demonstrated that they belong to the genus Burkholderia, with Burkholderia zhejiangensis as most closely related formally named species; this relationship was confirmed through comparative gyrB sequence analysis. Whole-cell fatty acid analysis supported their assignment to the genus Burkholderia. Burkholderia sp. strain YI23, for which a whole-genome sequence is available, represents the same taxon, as demonstrated by its highly similar 16S rRNA (100% similarity) and gyrB (99.1-99.7%) gene sequences. The results of DNA-DNA hybridization experiments and physiological and biochemical characterization further substantiated the genotypic and phenotypic distinctiveness of the Argentinian soil isolates, for which the name Burkholderia cordobensis sp. nov. is proposed, with strain MMP81(T) ( = LMG 27620(T) = CCUG 64368(T)) as the type strain. © 2014 IUMS.
Pasion, S G; Hines, J C; Aebersold, R; Ray, D S
1992-01-01
A type II DNA topoisomerase, topoIImt, was shown previously to be associated with the kinetoplast DNA of the trypanosomatid Crithidia fasciculata. The gene encoding this kinetoplast-associated topoisomerase has been cloned by immunological screening of a Crithidia genomic expression library with monoclonal antibodies raised against the purified enzyme. The gene CfaTOP2 is a single copy gene and is expressed as a 4.8-kb polyadenylated transcript. The nucleotide sequence of CfaTOP2 has been determined and encodes a predicted polypeptide of 1239 amino acids with a molecular mass of 138,445. The identification of the cloned gene is supported by immunoblot analysis of the beta-galactosidase-CfaTOP2 fusion protein expressed in Escherichia coli and by analysis of tryptic peptide sequences derived from purified topoIImt. CfaTOP2 shares significant homology with nuclear type II DNA topoisomerases of other eukaryotes suggesting that in Crithidia both nuclear and mitochondrial forms of topoisomerase II are encoded by the same gene.
Iraola, G; Betancor, L; Calleros, L; Gadea, P; Algorta, G; Galeano, S; Muxi, P; Greif, G; Pérez, R
2015-08-01
Whole-genome characterisation in clinical microbiology enables the detection of trends in infection dynamics and disease transmission. Here, we report a case of bacteraemia due to Campylobacter fetus subsp. fetus in a rural worker under cancer treatment who was diagnosed with cellulitis; the patient was treated with antibiotics and recovered. The routine typing methods were not able to identify the microorganism causing the infection, so it was further analysed by molecular methods and whole-genome sequencing. Multi-locus sequence typing (MLST) revealed the presence of the bovine-associated ST-4 genotype. Whole-genome comparisons with other C. fetus strains revealed an inconsistent phylogenetic position based on the core genome, discordant with previous ST-4 strains. To the best of our knowledge, this is the first C. fetus subsp. fetus carrying ST-4 isolated from humans and represents a probable case of zoonotic transmission from cattle.
Prevalence of precore-defective mutant of hepatitis B virus in HBV carriers.
Niitsuma, H; Ishii, M; Saito, Y; Miura, M; Kobayashi, K; Ohori, H; Toyota, T
1995-08-01
Two hundred and seventy-three serum specimens from hepatitis B virus (HBV) carriers were examined for the presence of a characteristic one point mutation at nucleotide (nt) 1896 from the EcoRI site of the HBV genome in the precore region (the preC mutant) using restriction fragment length polymorphism (RFLP) analysis. This assay approach could detect preC mutants or wild-type sequences when either form constituted more than 10% of the total sample. Overall, 65.5% (76/116) of HBeAg-positive carriers had only the preC wild-type. All HBeAg-positive asymptomatic carriers (n = 14) had only the preC wild-type. In patients with chronic hepatitis B and in anti-HBe-positive asymptomatic carriers, increased prevalence of the preC mutant was associated with the development of anti-HBe antibodies and normalization of the serum alanine aminotransferase concentration. Furthermore, 27 (29.0%) of 93 HBeAg-negative carriers had unexpectedly preC wild-type sequences only. Direct sequencing of the HBV precore region of HBV specimens from 24 patients revealed no mutation at nt 1896, supporting the specificity of the RFLP analysis. These results suggest that RFLP analysis was accurate for the detection of the preC mutation and that the absence of serum HBeAg cannot be explained solely by the dominance of the preC mutant.
Zhang, Hua; Zhang, Tuo; Gao, Jianzhao; Ruan, Jishou; Shen, Shiyi; Kurgan, Lukasz
2012-01-01
Proteins fold through either a two-state (TS) process, with no visible intermediates, or a multi-state (MS) process, via at least one intermediate. We analyze sequence-derived factors that determine folding types by introducing a novel sequence-based folding type predictor called FOKIT. This method implements a logistic regression model with six input features which hybridize information concerning amino acid composition and predicted secondary structure and solvent accessibility. FOKIT provides predictions with average Matthews correlation coefficient (MCC) between 0.58 and 0.91 measured using out-of-sample tests on four benchmark datasets. These results are shown to be competitive with or better than results of four modern predictors. We also show that FOKIT outperforms these methods when predicting chains that share low similarity with the chains used to build the model, which is an important advantage given the limited number of annotated chains. We demonstrate that inclusion of solvent accessibility helps in discrimination of the folding kinetic types and that three of the features constitute statistically significant markers that differentiate TS and MS folders. We found that increased content of exposed Trp and buried Leu is indicative of MS folding, which implies that the exposure/burial of certain hydrophobic residues may play an important role in the formation of the folding intermediates. Our conclusions are supported by two case studies.
Gelada vocal sequences follow Menzerath's linguistic law.
Gustison, Morgan L; Semple, Stuart; Ferrer-I-Cancho, Ramon; Bergman, Thore J
2016-05-10
Identifying universal principles underpinning diverse natural systems is a key goal of the life sciences. A powerful approach in addressing this goal has been to test whether patterns consistent with linguistic laws are found in nonhuman animals. Menzerath's law is a linguistic law that states that, the larger the construct, the smaller the size of its constituents. Here, to our knowledge, we present the first evidence that Menzerath's law holds in the vocal communication of a nonhuman species. We show that, in vocal sequences of wild male geladas (Theropithecus gelada), construct size (sequence size in number of calls) is negatively correlated with constituent size (duration of calls). Call duration does not vary significantly with position in the sequence, but call sequence composition does change with sequence size and most call types are abbreviated in larger sequences. We also find that intercall intervals follow the same relationship with sequence size as do calls. Finally, we provide formal mathematical support for the idea that Menzerath's law reflects compression-the principle of minimizing the expected length of a code. Our findings suggest that a common principle underpins human and gelada vocal communication, highlighting the value of exploring the applicability of linguistic laws in vocal systems outside the realm of language.
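The negative relationship between construct size and constituent size described in this abstract can be illustrated with a rank correlation. The sketch below implements Spearman's rho in plain Python; the abstract does not say which statistic the authors used, so the choice of Spearman's rho is an assumption, and the numbers are hypothetical rather than gelada data.

```python
import math


def average_ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current group of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical data: sequence size (number of calls) and mean call duration (s).
sequence_sizes = [1, 2, 3, 4, 5, 6]
mean_call_durations = [0.92, 0.85, 0.80, 0.71, 0.69, 0.60]
rho = spearman(sequence_sizes, mean_call_durations)
print(f"Spearman rho = {rho:.2f}")
```

A rho near -1 in such data would be consistent with Menzerath's law: longer sequences are built from shorter calls. A real analysis would also need a significance test and a control for call type.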
Dan, Michael; Yair, Yael; Samosav, Alex; Gottesman, Tamar; Yossepowitch, Orit; Harari-Schwartz, Orna; Tsivian, Alexander; Schreiber, Rachel; Gophna, Uri
2015-01-01
Transrectal ultrasound-guided (TRUS) prostate biopsy is a very common procedure that is generally considered relatively safe. However, severe sepsis can occur after TRUS prostate biopsies, with Escherichia coli being the predominant causative agent. A common perception is that the bacteria that cause post-TRUS prostate biopsy infections originate in the urinary tract, but this view has not been adequately tested. Other authors believe, on the basis of indirect evidence, that the pathogens are introduced into the bloodstream by the biopsy needle after passage through the rectal mucosa. We compared E. coli isolates from male patients with bacteremic urinary tract infection (B-UTI) to isolates from patients with post prostate biopsy sepsis (PPBS), in terms of their sequence types, determined by multi-locus sequence typing (MLST), and their virulence markers. B-UTI isolates were much richer in virulence genes than were PPBS isolates, supporting the hypothesis that E. coli causing PPBS derive directly from the rectum. Sequence type 131 (ST131) strains and related strains were common (>30%) among the E. coli isolates from PPBS patients as well as from B-UTI patients, and all these strains expressed extended spectrum beta-lactamases. Our finding supports the hypothesis that E. coli causing PPBS derive directly from the rectum, bypassing the urinary tract, and therefore do not require many of the virulence capabilities necessary for an E. coli strain that must persist in the urinary tract. In light of the increasing prevalence of highly resistant E. coli strains, a new approach for prevention of PPBS is urgently required. Copyright © 2015. Published by Elsevier GmbH.
Cameron, Stephen L; Lo, Nathan; Bourguignon, Thomas; Svenson, Gavin J; Evans, Theodore A
2012-10-01
Despite their ecological significance as decomposers and their evolutionary significance as the most speciose eusocial insect group outside the Hymenoptera, termite (Blattodea: Termitoidae or Isoptera) evolutionary relationships have yet to be well resolved. Previous morphological and molecular analyses strongly conflict at the family level and are marked by poor support for backbone nodes. A mitochondrial (mt) genome phylogeny of termites was produced to test relationships between the recognised termite families, improve nodal support and test the phylogenetic utility of rare genomic changes found in the termite mt genome. Complete mt genomes were sequenced for 7 of the 9 extant termite families with additional representatives of each of the two most speciose families Rhinotermitidae (3 of 7 subfamilies) and Termitidae (3 of 8 subfamilies). The mt genome of the well supported sister-group of termites, the subsocial cockroach Cryptocercus, was also sequenced. A highly supported tree of termite relationships was produced by all analytical methods and data treatment approaches, however the relationship of the termites+Cryptocercus clade to other cockroach lineages was highly affected by the strong nucleotide compositional bias found in termites relative to other dictyopterans. The phylogeny supports previously proposed suprafamilial termite lineages, the Euisoptera and Neoisoptera, a later derived Kalotermitidae as sister group of the Neoisoptera and a monophyletic clade of dampwood (Stolotermitidae, Archotermopsidae) and harvester termites (Hodotermitidae). In contrast to previous termite phylogenetic studies, nodal supports were very high for family-level relationships within termites. Two rare genomic changes in the mt genome control region were found to be molecular synapomorphies for major clades. 
An elongated stem-loop structure defined the clade Polyphagidae + (Cryptocercus+termites), and a further series of compensatory base changes in this stem-loop is synapomorphic for the Neoisoptera. The complicated repeat structures first identified in Reticulitermes, composed of short (A-type) and long (B-type) repeats, define the clade Heterotermitinae+Termitidae, while the secondary loss of A-type repeats is synapomorphic for the non-macrotermitine Termitidae. Copyright © 2012 Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Crockett, Thomas M.; Joswig, Joseph C.; Shams, Khawaja S.; Norris, Jeffrey S.; Morris, John R.
2011-01-01
MSLICE Sequencing is a graphical tool for writing sequences and integrating them into RML files, as well as for producing SCMF files for uplink. When operated in a testbed environment, it also supports uplinking these SCMF files to the testbed via Chill. This software features a free-form textual sequence editor with syntax coloring and automatic content assistance (including command and argument completion proposals), complete with types, value ranges, units, and descriptions from the command dictionary that appear as they are typed. The sequence editor also has a "field mode" that allows tabbing between arguments and displays type/range/units/description for each argument as it is edited. Color-coded error and warning annotations on problematic tokens are included, as well as indications of problems that are not visible in the current scroll range. "Quick Fix" suggestions are made for resolving problems, and all the features afforded by modern source editors are also included, such as copy/cut/paste, undo/redo, and a sophisticated find-and-replace system optionally using regular expressions. The software offers a full XML editor for RML files, which features syntax coloring, content assistance and problem annotations as above. There is a form-based "detail view" that allows structured editing of command arguments and sequence parameters when preferred. The "project view" shows the user's "workspace" as a tree of "resources" (projects, folders, and files) that can subsequently be opened in editors by double-clicking. Files can be added, deleted, dragged-dropped/copied-pasted between folders or projects, and these operations are undoable and redoable. A "problems view" contains a tabular list of all problems in the current workspace. Double-clicking on any row in the table opens an editor for the appropriate sequence, scrolling to the specific line with the problem, and highlighting the problematic characters.
From there, one can invoke "quick fix" as described above to resolve the issue. Once resolved, saving the file causes the problem to be removed from the problem view.
Ramírez, Juan C; Torres, Carolina; Curto, María de Los A; Schijman, Alejandro G
2017-12-01
Trypanosoma cruzi has been subdivided into seven Discrete Typing Units (DTUs), TcI-TcVI and Tcbat. Two major evolutionary models have been proposed to explain the origin of hybrid lineages, but while it is widely accepted that TcV and TcVI are the result of genetic exchange between TcII and TcIII strains, the origin of TcIII and TcIV is still a matter of debate. T. cruzi satellite DNA (SatDNA) is composed of 195 bp units organized in tandem repeats. Both TcV and TcVI stocks were found to have SatDNA copies of types TcI and TcII, whereas contradictory results were observed for TcIII stocks and no TcIV sequence has been analyzed yet. Herein, we have gone deeper into this matter by analyzing 335 distinct SatDNA sequences from 19 T. cruzi stocks representative of DTUs TcI-TcVI for phylogenetic inference. The Bayesian phylogenetic tree showed that all sequences were grouped in three major clusters, which corresponded to sequences from DTUs TcI/III, TcII and TcIV, whereas TcV and TcVI stocks had two sets of sequences distributed into the TcI/III and TcII clusters. As expected, the lowest genetic distances were found between TcI and TcIII, and between TcV and TcVI sequences, whereas the highest ones were observed between TcII and TcI/III, and between TcIV sequences and those from the remaining DTUs. In addition, signature patterns associated with specific T. cruzi lineages were identified and new primers that improved SatDNA-based qPCR sensitivity were designed. Our findings support the theory that TcIII is not the result of a hybridization event between TcI and TcII, and that TcIV had an independent origin from the other DTUs, contributing to clarifying the evolutionary history of T. cruzi lineages. Moreover, this work opens the possibility of typing samples from Chagas disease patients with low parasitic loads and improving molecular diagnostic methods of T. cruzi infection based on SatDNA sequence amplification.
Qiu, Jian-Ding; Luo, San-Hua; Huang, Jian-Hua; Sun, Xing-Yu; Liang, Ru-Ping
2010-04-01
Apoptosis proteins have a central role in the development and homeostasis of an organism. These proteins are very important for understanding the mechanism of programmed cell death. As a result of genome and other sequencing projects, the gap between the number of known apoptosis protein sequences and the number of known apoptosis protein structures is widening rapidly. Because of this extremely unbalanced state, it would be worthwhile to develop a fast and reliable method to identify their subcellular locations so as to gain better insight into their biological functions. In view of this, a new method, in which the support vector machine combines with discrete wavelet transform, has been developed to predict the subcellular location of apoptosis proteins. The results obtained by the jackknife test were quite promising, and indicated that the proposed method can remarkably improve the prediction accuracy of subcellular locations, and might also become a useful high-throughput tool in characterizing other attributes of proteins, such as enzyme class, membrane protein type, and nuclear receptor subfamily according to their sequences.
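As a rough illustration of the feature-extraction idea in the abstract above, the sketch below turns a protein sequence into a numeric signal and applies a discrete wavelet transform to it. The abstract does not name the wavelet, the amino-acid encoding, or the SVM settings, so the Haar wavelet, the Kyte-Doolittle hydropathy scale, and every function name here are assumptions chosen for simplicity. The resulting feature vector would then be handed to an SVM classifier of the reader's choice.

```python
import math

# Kyte-Doolittle hydropathy values (an assumed encoding; the abstract does not specify one).
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def encode(seq):
    """Map a protein sequence to hydropathy values, skipping unknown residues."""
    return [HYDROPATHY[aa] for aa in seq.upper() if aa in HYDROPATHY]


def haar_step(signal):
    """One level of the Haar DWT: returns (approximation, detail) coefficients."""
    if len(signal) % 2:
        # Pad odd-length signals by repeating the last value.
        signal = signal + [signal[-1]]
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def wavelet_features(seq, levels=3):
    """Mean detail energy per level, followed by the mean energy of the final approximation."""
    signal = encode(seq)
    features = []
    for _ in range(levels):
        if len(signal) < 2:
            # Too short to decompose further; record zero energy for this level.
            features.append(0.0)
            continue
        signal, detail = haar_step(signal)
        features.append(sum(d * d for d in detail) / len(detail))
    features.append(sum(a * a for a in signal) / len(signal) if signal else 0.0)
    return features
```

Because the feature vector always has `levels + 1` entries regardless of sequence length, proteins of different lengths can be compared by the same classifier, which is the main reason for summarising the coefficients as energies in this sketch.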
Lefoulon, Emilie; Bourret, Jérôme; Junker, Kerstin; Guerrero, Ricardo; Cañizales, Israel; Kuzmin, Yuriy; Satoto, Tri Baskoro T.; Cardenas-Callirgos, Jorge Manuel; de Souza Lima, Sueli; Raccurt, Christian; Mutafchiev, Yasen; Gavotte, Laurent; Martin, Coralie
2015-01-01
During the past twenty years, a number of molecular analyses have been performed to determine the evolutionary relationships of Onchocercidae, a family of filarial nematodes encompassing several species of medical or veterinary importance. However, opportunities for broad taxonomic sampling have been scarce, and analyses were based mainly on 12S rDNA and coxI gene sequences. While being suitable for species differentiation, these mitochondrial genes cannot be used to infer phylogenetic hypotheses at higher taxonomic levels. In the present study, 48 species, representing seven of eight subfamilies within the Onchocercidae, were sampled and sequences of seven gene loci (nuclear and mitochondrial) analysed, resulting in the hitherto largest molecular phylogenetic investigation into this family. Although our data support the current hypothesis that the Oswaldofilariinae, Waltonellinae and Icosiellinae subfamilies separated early from the remaining onchocercids, Setariinae was recovered as a well separated clade. Dirofilaria, Loxodontofilaria and Onchocerca constituted a strongly supported clade despite belonging to different subfamilies (Onchocercinae and Dirofilariinae). Finally, the separation between Splendidofilariinae, Dirofilariinae and Onchocercinae will have to be reconsidered. PMID:26588229
Mumps virus F gene and HN gene sequencing as a molecular tool to study mumps virus transmission.
Gouma, Sigrid; Cremer, Jeroen; Parkkali, Saara; Veldhuijzen, Irene; van Binnendijk, Rob S; Koopmans, Marion P G
2016-11-01
Various mumps outbreaks have occurred in the Netherlands since 2004, particularly among persons who had received 2 doses of measles, mumps, and rubella (MMR) vaccination. Genomic typing of pathogens can be used to track outbreaks, but the established genotyping of mumps virus based on the small hydrophobic (SH) gene sequences did not provide sufficient resolution. Therefore, we expanded the sequencing to include fusion (F) gene and haemagglutinin-neuraminidase (HN) gene sequences in addition to the SH gene sequences from 109 mumps virus genotype G strains obtained between 2004 and mid 2015 in the Netherlands. When the molecular information from these 3 genes was combined, we were able to identify separate mumps virus clusters and track mumps virus transmission. The analyses suggested that multiple mumps virus introductions occurred in the Netherlands between 2004 and 2015 resulting in several mumps outbreaks throughout this period, whereas during some local outbreaks the molecular data pointed towards endemic circulation. Combined analysis of epidemiological data and sequence data collected in 2015 showed good support for the phylogenetic clustering. Copyright © 2016 Elsevier B.V. All rights reserved.
Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian
2009-11-01
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we present a genome-wide analysis of CRISPR in extremely halophilic archaea for which whole genome sequences are currently available. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were highly conserved, and that two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as a recognition site by having palindromic structures in flanking regions, and that the stem-loop secondary structure formed by repeat sequences may mediate the interaction between foreign genetic elements and CAS-encoded proteins.
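Two of the analyses named in the abstract above, GC content and the search for palindromic motifs, are simple enough to sketch directly. The code below is only an illustration under stated assumptions: the function names are hypothetical, "palindromic" is taken to mean that a DNA segment equals its own reverse complement, and the window length is an arbitrary parameter. The abstract does not describe the tools or settings the authors actually used.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_content(seq):
    """Fraction of G and C bases in a DNA sequence (0.0 for an empty sequence)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq):
    """Reverse complement of a DNA sequence containing only A, C, G and T."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_palindromic(seq):
    """True if the sequence reads the same as its reverse complement."""
    seq = seq.upper()
    return seq == reverse_complement(seq)


def find_palindromes(seq, length=6):
    """Return (position, motif) pairs for every palindromic window of the given length."""
    seq = seq.upper()
    return [
        (i, seq[i:i + length])
        for i in range(len(seq) - length + 1)
        if is_palindromic(seq[i:i + length])
    ]
```

For example, `find_palindromes("AAGAATTCAA", 6)` reports the single window `GAATTC` at position 2. Scanning the flanking regions of a repeat array with a few window lengths would be one simple way to look for motifs of the kind the abstract describes.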
Wang, Qi-Ming; Zhang, Yong-Hong; Wang, Bo; Wang, Long
2016-01-04
Two new species isolated from plant leaves belonging to Talaromyces section Talaromyces are reported, namely T. neofusisporus (ex-type AS3.15415 (T) = CBS 139516 (T)) and T. qii (ex-type AS3.15414 (T) = CBS 139515 (T)). Morphologically, T. neofusisporus is characterized by forming synnemata on CYA and YES, bearing appressed biverticillate penicilli and smooth-walled fusiform conidia about 3.5-4.5 × 2-2.5 μm; and T. qii is characterized by velutinous colony texture, yellowish green conidia, yellow mycelium and ovoid to subglobose echinulate conidia measuring 3-3.5 μm. Phylogenetically, T. neofusisporus is so distinct that no close relatives are found according to CaM, BenA and ITS1-5.8S-ITS2 as well as the combined three-gene sequences; and T. qii is related to T. thailandensis according to CaM, BenA and the combined sequence matrices, whereas ITS1-5.8S-ITS2 sequences do not support the close relationship between T. qii and T. thailandensis.
Dallman, T J; Byrne, L; Launders, N; Glen, K; Grant, K A; Jenkins, C
2015-06-01
Many serogroups of Shiga toxin-producing Escherichia coli (STEC) other than serogroup O157 (non-O157 STEC), for example STEC O26:H11, are highly pathogenic and capable of causing haemolytic uraemic syndrome. A recent increase in non-O157 STEC cases identified in England, resulting from a change in the testing paradigm, prompted a review of the current methods available for detection and typing of non-O157 STEC for surveillance and outbreak investigations. Nineteen STEC O26:H11 strains, including four from a nursery outbreak, were selected to assess typing methods. Serotyping and multilocus sequence typing were not able to discriminate between the stx-producing strains in the dataset. However, genome sequencing provided rapid and robust confirmation that isolates of STEC O26:H11 associated with a nursery outbreak were linked at the molecular level, had a common source and were distinct from the other strains analysed. Virulence gene profiling of DNA extracted from a polymerase chain reaction (PCR)-positive/culture-negative faecal specimen from a case that was epidemiologically linked to the STEC O26:H11 nursery outbreak provided evidence at the molecular level to support that link. In this study, we describe the utility of PCR and the genome sequencing approach in facilitating surveillance and enhancing the response to outbreaks of non-O157 STEC.
Meena, Seema; Kumar, Sarma R.; Venkata Rao, D. K.; Dwivedi, Varun; Shilpashree, H. B.; Rastogi, Shubhra; Shasany, Ajit K.; Nagegowda, Dinesh A.
2016-01-01
Aromatic grasses of the genus Cymbopogon (Poaceae family) represent a unique group of plants that produce monoterpene-rich essential oils of diverse composition, which have great value in the flavor, fragrance, cosmetic, and aromatherapy industries. Despite the commercial importance of these natural aromatic oils, their biosynthesis at the molecular level remains unexplored. As the first step toward understanding essential oil biosynthesis, we performed de novo transcriptome assembly and analysis of C. flexuosus (lemongrass) by employing Illumina sequencing. Mining of transcriptome data and subsequent phylogenetic analysis led to identification of terpene synthases, pyrophosphatases, alcohol dehydrogenases, aldo-keto reductases, carotenoid cleavage dioxygenases, alcohol acetyltransferases, and aldehyde dehydrogenases, which are potentially involved in essential oil biosynthesis. Comparative essential oil profiling and mRNA expression analysis in three Cymbopogon species (C. flexuosus, aldehyde type; C. martinii, alcohol type; and C. winterianus, intermediate type) with varying essential oil composition indicated the involvement of identified candidate genes in the formation of alcohols, aldehydes, and acetates. Molecular modeling and docking further supported the role of identified protein sequences in aroma formation in Cymbopogon. Also, simple sequence repeats were found in the transcriptome, with many linked to terpene pathway genes including the genes potentially involved in aroma biosynthesis. This work provides the first insights into the essential oil biosynthesis of aromatic grasses, and the identified candidate genes and markers can be a great resource for biotechnological and molecular breeding approaches to modulate the essential oil composition. PMID:27516768
Muñoz, Marina; Ríos-Chaparro, Dora Inés; Patarroyo, Manuel Alfonso; Ramírez, Juan David
2017-03-14
Multilocus sequence typing (MLST) is a highly discriminatory typing strategy; it is reproducible and scalable. There is an MLST scheme for Clostridium difficile (CD), a gram positive bacillus causing different pathologies of the gastrointestinal tract. This work was aimed at describing the frequency of the sequence types (STs) and clades (C) reported and evaluating the intra-taxa diversity in the CD MLST database (CD-MLST-db) using an MLSA approach. Analysis of 1778 available isolates showed that clade 1 (C1) was the most frequent worldwide (57.7%), followed by C2 (29.1%). Regarding STs, it was found that ST-1, belonging to C2, was the most frequent. The isolates analysed came from 17 countries, mostly from the United Kingdom (UK) (1541 STs, 87.0%). The diversity of the seven housekeeping genes in the MLST scheme, and of the alleles from the profiles (STs), was evaluated to identify CD population structure. It was found that adk and atpA are conserved genes allowing only a limited number of clusters to be discriminated; however, other genes such as dxr, glyA and particularly sodA showed high diversity indexes and grouped CD populations in many clusters, suggesting that these genes' contribution to CD typing should be revised. It was identified that CD STs reported to date have a mostly clonal population structure with foreseen events of recombination; however, one group of STs was not assigned to a clade, being highly different and containing at least nine well-supported clusters, suggesting a greater number of clades for CD. This study shows the usefulness of CD-MLST-db as a tool for studying CD distribution and population structure, identifying the need for reviewing the usefulness of sodA as a housekeeping gene within the MLST scheme and suggesting the existence of a greater number of CD clades. The study also shows the plausible exchange of genetic material between STs, contributing towards intra-taxa genetic diversity.
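The comparison of per-gene diversity described in the abstract above can be illustrated with a small calculation. The abstract does not say which diversity index was used, so the sketch below assumes Simpson's index of diversity in the unbiased form popularised for typing schemes by Hunter and Gaston, D = 1 - Σ n_i(n_i - 1) / (N(N - 1)), where n_i is the number of isolates carrying allele i and N is the total number of isolates. The function name and the allele numbers in the usage note are hypothetical.

```python
from collections import Counter


def simpson_diversity(alleles):
    """Simpson's index of diversity for a list of allele identifiers at one locus.

    Returns 0.0 when every isolate carries the same allele and 1.0 when every
    isolate carries a different one. Fewer than two isolates gives 0.0.
    """
    counts = Counter(alleles)
    n = sum(counts.values())
    if n < 2:
        return 0.0
    same_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_pairs / (n * (n - 1))
```

With made-up allele calls such as `simpson_diversity([1, 1, 1, 2])` for a conserved locus and `simpson_diversity([1, 2, 3, 4])` for a variable one, the second value is higher, which is the kind of contrast the abstract reports between adk or atpA and sodA.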
Legendre, Frédéric; Whiting, Michael F; Bordereau, Christian; Cancello, Eliana M; Evans, Theodore A; Grandcolas, Philippe
2008-08-01
A phylogenetic hypothesis of termite relationships was inferred from DNA sequence data. Seven gene fragments (12S rDNA, 16S rDNA, 18S rDNA, 28S rDNA, cytochrome oxidase I, cytochrome oxidase II and cytochrome b) were sequenced for 40 termite exemplars, representing all termite families and 14 outgroups. Termites were found to be monophyletic with Mastotermes darwiniensis (Mastotermitidae) as sister group to the remainder of the termites. In this remainder, the family Kalotermitidae was sister group to other families. The families Kalotermitidae, Hodotermitidae and Termitidae were retrieved as monophyletic whereas the Termopsidae and Rhinotermitidae appeared paraphyletic. All of these results were very stable and supported with high bootstrap and Bremer values. The evolution of worker caste and foraging behavior were discussed according to the phylogenetic hypothesis. Our analyses suggested that both true workers and pseudergates ("false workers") were the result of at least two different origins. Our data support a traditional hypothesis of foraging behavior, in which the evolutionary transition from a one-piece type to a separate life type occurred through an intermediate behavioral form.
mtDNA variation in the Yanomami: evidence for additional New World founding lineages.
Easton, R D; Merriwether, D A; Crews, D E; Ferrell, R E
1996-07-01
Native Americans have been classified into four founding haplogroups with as many as seven founding lineages based on mtDNA RFLPs and DNA sequence data. mtDNA analysis was completed for 83 Yanomami from eight villages in the Surucucu and Catrimani Plateau regions of Roraima in northwestern Brazil. Samples were typed for 15 polymorphic mtDNA sites (14 RFLP sites and 1 deletion site), and a subset was sequenced for both hypervariable regions of the mitochondrial D-loop. Substantial mitochondrial diversity was detected among the Yanomami: five of seven accepted founding haplotypes and three others were observed. Of the 83 samples, 4 (4.8%) were lineage B1, 1 (1.2%) was lineage B2, 31 (37.4%) were lineage C1, 29 (34.9%) were lineage C2, 2 (2.4%) were lineage D1, 6 (7.2%) were lineage D2, 7 (8.4%) were a haplotype we designated "X6," and 3 (3.6%) were a haplotype we designated "X7." Sequence analysis found 43 haplotypes in 50 samples. B2, X6, and X7 are previously unrecognized mitochondrial founding lineage types of Native Americans. The widespread distribution of these haplotypes in the New World and Asia provides support for declaring these lineages to be New World founding types.
mtDNA variation in the Yanomami: evidence for additional New World founding lineages.
Easton, R. D.; Merriwether, D. A.; Crews, D. E.; Ferrell, R. E.
1996-01-01
Native Americans have been classified into four founding haplogroups with as many as seven founding lineages based on mtDNA RFLPs and DNA sequence data. mtDNA analysis was completed for 83 Yanomami from eight villages in the Surucucu and Catrimani Plateau regions of Roraima in northwestern Brazil. Samples were typed for 15 polymorphic mtDNA sites (14 RFLP sites and 1 deletion site), and a subset was sequenced for both hypervariable regions of the mitochondrial D-loop. Substantial mitochondrial diversity was detected among the Yanomami: five of seven accepted founding haplotypes and three others were observed. Of the 83 samples, 4 (4.8%) were lineage B1, 1 (1.2%) was lineage B2, 31 (37.4%) were lineage C1, 29 (34.9%) were lineage C2, 2 (2.4%) were lineage D1, 6 (7.2%) were lineage D2, 7 (8.4%) were a haplotype we designated "X6," and 3 (3.6%) were a haplotype we designated "X7." Sequence analysis found 43 haplotypes in 50 samples. B2, X6, and X7 are previously unrecognized mitochondrial founding lineage types of Native Americans. The widespread distribution of these haplotypes in the New World and Asia provides support for declaring these lineages to be New World founding types. PMID:8659527
Zheng, H; Ye, C; Segura, M; Gottschalk, M; Xu, J
2008-09-01
Streptococcus suis serotype 2 sequence type 7 strains emerged in 1996 and caused a streptococcal toxic shock-like syndrome in 1998 and 2005 in China. Evidence indicated that the virulence of S. suis sequence type 7 had increased, but the mechanism was unknown. The sequence type 7 strain SC84, isolated from a patient with streptococcal toxic shock-like syndrome during the Sichuan outbreak, and the sequence type 1 strain 31533, a typical highly pathogenic strain isolated from a diseased pig, were used in comparative studies. In this study we show that the mechanisms underlying cytokine production differed between the two types of strains. The S. suis sequence type 7 strain SC84 possesses a stronger capacity to stimulate T cells, naive T cells and peripheral blood mononuclear cell proliferation than does S. suis sequence type 1 strain 31533. The T cell response to both strains was dependent upon the presence of antigen-presenting cells. Histo-incompatible antigen-presenting cells were sufficient to provide the accessory signals to naive T cells stimulated by the two strains, indicating that both sequence type 7 and 1 strains possess mitogens; however, the mitogenic effect was different. Therefore, we propose that the difference in the mitogenic effect of sequence type 7 strain SC84 compared with the sequence type 1 strain 31533 of S. suis may be associated with the clinical, epidemiological and microbiological differences, where the ST 7 strains have a larger mitogenic effect.
Zheng, H; Ye, C; Segura, M; Gottschalk, M; Xu, J
2008-01-01
Streptococcus suis serotype 2 sequence type 7 strains emerged in 1996 and caused a streptococcal toxic shock-like syndrome in 1998 and 2005 in China. Evidence indicated that the virulence of S. suis sequence type 7 had increased, but the mechanism was unknown. The sequence type 7 strain SC84, isolated from a patient with streptococcal toxic shock-like syndrome during the Sichuan outbreak, and the sequence type 1 strain 31533, a typical highly pathogenic strain isolated from a diseased pig, were used in comparative studies. In this study we show that the mechanisms underlying cytokine production differed between the two types of strains. The S. suis sequence type 7 strain SC84 possesses a stronger capacity to stimulate T cells, naive T cells and peripheral blood mononuclear cell proliferation than does S. suis sequence type 1 strain 31533. The T cell response to both strains was dependent upon the presence of antigen-presenting cells. Histo-incompatible antigen-presenting cells were sufficient to provide the accessory signals to naive T cells stimulated by the two strains, indicating that both sequence type 7 and 1 strains possess mitogens; however, the mitogenic effect was different. Therefore, we propose that the difference in the mitogenic effect of sequence type 7 strain SC84 compared with the sequence type 1 strain 31533 of S. suis may be associated with the clinical, epidemiological and microbiological differences, where the ST 7 strains have a larger mitogenic effect. PMID:18803762
Plasmon-polaritonic bands in sequential doped graphene superlattices
NASA Astrophysics Data System (ADS)
Ramos-Mendieta, Felipe; Palomino-Ovando, Martha; Hernández-López, Alejandro; Fuentecilla-Cárcamo, Iván
Doped graphene has the extraordinary quality of supporting two types of surface excitations that involve electric charges (the transverse magnetic surface plasmons) or electric currents (the transverse electric modes). We have studied numerically the collective modes that result from the coupling of surface plasmons in doped graphene multilayers. Using structured supercells with fixed dielectric background and interlayer separation, we found a series of plasmon-polaritonic bands whose structure depends on the doping sequence chosen for the graphene sheets. Periodic and quasiperiodic sequences for the graphene chemical potential have been studied. Our results show that transverse magnetic bands exist only in the low frequency regime, whereas transverse electric bands arise within specific ranges of higher frequencies. Our calculations are valid for THz frequencies, and graphene sheets with doping levels between 0.1 eV and 1.2 eV have been considered. AHL and IFC acknowledge fellowship support from CONACYT México.
Pulse Sequence Programming in a Dynamic Visual Environment: SequenceTree
Magland, Jeremy F.; Li, Cheng; Langham, Michael C.; Wehrli, Felix W.
2015-01-01
Purpose: To describe SequenceTree (ST), an open source, integrated software environment for implementing MRI pulse sequences and exporting them to actual MRI scanners. The software is a user-friendly alternative to vendor-supplied pulse sequence design and editing tools and is suited for non-programmers and programmers alike. Methods: The integrated user interface was programmed using the Qt4/C++ toolkit. As parameters and code are modified, the pulse sequence diagram is automatically updated within the user interface. Several aspects of pulse programming are handled automatically, allowing users to focus on higher-level aspects of sequence design. Sequences can be simulated using a built-in Bloch equation solver and then exported for use on a Siemens MRI scanner. Ideally other types of scanners will be supported in the future. Results: The software has been used for eight years in the authors’ laboratory and elsewhere and has been utilized in more than fifty peer-reviewed publications in areas such as cardiovascular imaging, solid state and non-proton NMR, MR elastography, and high resolution structural imaging. Conclusion: ST is an innovative, open source, visual pulse sequence environment for MRI combining simplicity with flexibility and is ideal for both advanced users and those with limited programming experience. PMID:25754837
Lima, Luciana; Espinosa-Álvarez, Oneida; Ortiz, Paola A; Trejo-Varón, Javier A; Carranza, Julio C; Pinto, C Miguel; Serrano, Myrna G; Buck, Gregory A; Camargo, Erney P; Teixeira, Marta M G
2015-11-01
Trypanosoma cruzi is a complex of phenotypically and genetically diverse isolates distributed in six discrete typing units (DTUs) designated as TcI-TcVI. Five years ago, T. cruzi isolates from Brazilian bats showing unique patterns of traditional ribosomal and spliced leader PCRs not clustering into any of the six DTUs were designated as the Tcbat genotype. In the present study, phylogenies inferred using SSU rRNA (small subunit of ribosomal RNA), gGAPDH (glycosomal glyceraldehyde 3-phosphate dehydrogenase) and Cytb (cytochrome b) genes strongly supported Tcbat as a monophyletic lineage prevalent in Brazil, Panama and Colombia. Providing strong support for Tcbat, sequences from 37 of 47 nuclear and 12 mitochondrial genes (retrieved from a draft genome of Tcbat) and reference strains of all DTUs available in databanks corroborated Tcbat as an independent DTU. Consistent with previous studies, multilocus analysis of most nuclear genes corroborated the evolution of T. cruzi from bat trypanosomes and its divergence into two main phylogenetic lineages: the basal TcII; and the lineage clustering TcIV, the clade comprising TcIII and the sister groups TcI-Tcbat. Most likely, the common ancestor of Tcbat and TcI was a bat trypanosome. However, the results of the present analysis did not support Tcbat as the ancestor of all DTUs. Despite the insights provided by reports of TcIII, TcIV and TcII in bats, including Amazonian bats harbouring TcII, further studies are necessary to understand the roles played by bats in the diversification of all DTUs. We also demonstrated that, in addition to their value as molecular markers for DTU assignment, Cytb, ITS rDNA and the spliced leader (SL) polymorphic sequences suggest spatially structured populations of Tcbat. Phylogenetic and phylogeographical analyses, multiple molecular markers specific to Tcbat, and the degrees of sequence divergence between Tcbat and the accepted DTUs strongly support the definitive classification of Tcbat as a new DTU.
Copyright © 2015 Elsevier B.V. All rights reserved.
Alirezaie, Behnam; Taqavian, Mohammad; Aghaiypour, Khosrow; Esna-Ashari, Fatemeh; Shafyi, Abbas
2011-05-01
The cell substrate has a pivotal role in live virus vaccine production. It is necessary to evaluate the effects of the cell substrate on the properties of the propagated viruses, especially in the case of genetically unstable viruses such as polioviruses, by monitoring the molecular and phenotypical characteristics of harvested viruses. To investigate the presence/absence of mutation(s), the near full-length genomic sequences of different harvests of the type 3 Sabin strain of poliovirus propagated in MRC-5 cells were determined. The sequences were compared with genomic sequences of different virus seeds, vaccines, and OPV-like isolates. Nearly complete genomic sequencing results revealed no detectable mutations throughout the genome of the RNA-plaque purified (RSO)-derived monopool of type 3 OPVs manufactured in MRC-5. Thirty-six years of experience in OPV production, trend analysis, and vaccine surveillance also suggest that: (i) different monopools of serotype 3 OPV produced in MRC-5 retained their phenotypic characteristics (temperature sensitivity and neuroattenuation), (ii) MRC-5 cells support the production of acceptable virus yields, (iii) OPV replicated in the MRC-5 cell substrate is a highly efficient and safe vaccine. These results confirm previous reports that MRC-5 is a desirable cell substrate for the production of OPV. Copyright © 2011 Wiley-Liss, Inc.
Pseudomonas aestus sp. nov., a plant growth-promoting bacterium isolated from mangrove sediments.
Vasconcellos, Rafael L F; Santos, Suikinai Nobre; Zucchi, Tiago Domingues; Silva, Fábio Sérgio Paulino; Souza, Danilo Tosta; Melo, Itamar Soares
2017-10-01
Strain CMAA 1215ᵀ, a Gram-reaction-negative, aerobic, catalase positive, polarly flagellated, motile, rod-shaped (0.5-0.8 × 1.3-1.9 µm) bacterium, was isolated from mangrove sediments, Cananéia Island, Brazil. Analysis of the 16S rRNA gene sequences showed that strain CMAA 1215ᵀ forms a distinct phyletic line within the Pseudomonas putida subclade, being closely related to P. plecoglossicida ATCC 700383ᵀ, P. monteilii NBRC 103158ᵀ, and P. taiwanensis BCRC 17751ᵀ, with sequence similarities of 98.86, 98.73, and 98.71%, respectively. Genomic comparisons of strain CMAA 1215ᵀ with its closest phylogenetic type strains using average nucleotide identity (ANI) and DNA:DNA relatedness approaches revealed values of 84.3-85.3% and 56.0-63.0%, respectively. A multilocus sequence analysis (MLSA) performed by concatenating 16S rRNA, gyrB and rpoB gene sequences showed that the novel species was related to the Pseudomonas putida subcluster and formed a new phylogenetic lineage. The phenotypic, physiological, biochemical, and genetic characteristics support the assignment of CMAA 1215ᵀ to the genus Pseudomonas, representing a novel species. The name Pseudomonas aestus sp. nov. is proposed, with CMAA 1215ᵀ (=NRRL B-653100ᵀ = CBMAI 1962ᵀ) as the type strain.
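The sequence similarity figures quoted in the abstract above are percentages of matching positions between aligned sequences. A minimal sketch of that arithmetic follows. It assumes the two sequences have already been aligned to equal length with "-" marking gaps, and that columns containing a gap are excluded from the comparison, which is one common convention but not necessarily the one the authors' software applied. The function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where either sequence has a gap ("-") are skipped, so the result
    is the share of matching bases among the gap-free columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared
```

A near-full-length 16S rRNA gene is roughly 1,500 bases long, so a similarity near 98.9% corresponds to on the order of 17 mismatched positions. Differences that small are why the abstract supplements 16S comparison with genome-level measures such as ANI and DNA:DNA relatedness.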
Molecular Typing of Legionella pneumophila Isolates in the Province of Quebec from 2005 to 2015.
Lévesque, Simon; Lalancette, Cindy; Bernard, Kathryn; Pacheco, Ana Luisa; Dion, Réjean; Longtin, Jean; Tremblay, Cécile
2016-01-01
Legionella is found in natural and man-made aquatic environments, such as cooling towers and hot water plumbing infrastructures. Legionella pneumophila serogroup 1 (Lp1) is the most common etiological agent causing waterborne disease in the United States and Canada. This study reports the molecular characterization of Lp strains during a 10 year period. We conducted sequence-based typing (SBT) analysis on a large set of Lp isolates (n = 284) to investigate the distribution of sequence types (STs) in the province of Quebec and to identify dominant clusters. From 2005 to 2015, 181 clinical Lp isolates were typed by SBT (141 sporadic cases and 40 outbreak-related cases). From the same period, 103 environmental isolates were also typed. Amongst the 108 sporadic cases of Lp1 typed, ST-62 was the most frequent (16.6%), followed by ST-213 (10.2%), ST-1 (8.3%) and ST-37 (8.3%). Amongst other serogroups (SG), ST-1327 (SG5) (27.3%) and ST-378 (SG10) (12.2%) were the most frequent. Amongst the environmental isolates, ST-1 was the most frequent ST (26.5%). The unweighted pair group method with arithmetic mean (UPGMA) dendrogram of the 108 sporadic cases of SG1 contains 4 major clusters (A to D) of related STs. Cluster B contains the majority of the strains (n = 61) and the three most frequent STs in our database (ST-62, ST-213 and ST-1). During the study period, we observed an important increase in the incidence rate in Quebec. All the community-associated outbreaks potentially or confirmed to be associated with a cooling tower were caused by Lp1 strains, in contrast to hospital-associated outbreaks, which were caused by serogroups of Lp other than SG1. The recent major Quebec City outbreak caused by ST-62, and the fact that this genotype is the most common in the province, support whole genome sequencing characterization of this particular sequence type in order to understand its evolution and associated virulence factors.
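The UPGMA clustering mentioned in the abstract above can be sketched in a few lines. The version below is an illustration only: it assumes each sequence type is represented by a tuple of allele numbers, that the distance between two types is the number of loci at which they differ, and that ties are broken by input order. The abstract does not state the distance measure or software used, and the profile names in the usage note are hypothetical.

```python
from itertools import combinations


def profile_distance(p, q):
    """Number of loci at which two allelic profiles differ."""
    return sum(a != b for a, b in zip(p, q))


def upgma(profiles):
    """Cluster allelic profiles by UPGMA and return the tree as nested tuples.

    profiles: dict mapping a name to a tuple of allele numbers.
    """
    if not profiles:
        raise ValueError("at least one profile is required")
    names = list(profiles)
    # Each cluster id maps to (subtree, number of leaves in the subtree).
    clusters = {i: (name, 1) for i, name in enumerate(names)}
    dist = {
        frozenset((i, j)): float(profile_distance(profiles[names[i]], profiles[names[j]]))
        for i, j in combinations(range(len(names)), 2)
    }
    next_id = len(names)
    while len(clusters) > 1:
        # Pick the closest pair of current clusters.
        i, j = min(combinations(sorted(clusters), 2), key=lambda pair: dist[frozenset(pair)])
        (tree_i, n_i), (tree_j, n_j) = clusters.pop(i), clusters.pop(j)
        # The distance to the merged cluster is the size-weighted mean of the old distances.
        for k in clusters:
            d_ik = dist[frozenset((i, k))]
            d_jk = dist[frozenset((j, k))]
            dist[frozenset((next_id, k))] = (n_i * d_ik + n_j * d_jk) / (n_i + n_j)
        clusters[next_id] = ((tree_i, tree_j), n_i + n_j)
        next_id += 1
    return next(iter(clusters.values()))[0]
```

Given three made-up seven-locus profiles where "ST-A" and "ST-B" differ at one locus and "ST-C" differs from both at all seven, `upgma` joins "ST-A" and "ST-B" first and attaches "ST-C" last. Groups of closely related STs in a dendrogram arise in the same way.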
Infant auditory short-term memory for non-linguistic sounds.
Ross-Sheehy, Shannon; Newman, Rochelle S
2015-04-01
This research explores auditory short-term memory (STM) capacity for non-linguistic sounds in 10-month-old infants. Infants were presented with auditory streams composed of repeating sequences of either 2 or 4 unique instruments (e.g., flute, piano, cello; 350 or 700 ms in duration) followed by a 500-ms retention interval. These instrument sequences either stayed the same for every repetition (Constant) or changed by 1 instrument per sequence (Varying). Using the head-turn preference procedure, infant listening durations were recorded for each stream type (2- or 4-instrument sequences composed of 350- or 700-ms notes). Preference for the Varying stream was taken as evidence of auditory STM because detection of the novel instrument required memory for all of the instruments in a given sequence. Results demonstrate that infants listened longer to Varying streams for 2-instrument sequences, but not 4-instrument sequences, composed of 350-ms notes (Experiment 1), although this effect did not hold when note durations were increased to 700 ms (Experiment 2). Experiment 3 replicates and extends results from Experiments 1 and 2 and provides support for a duration account of capacity limits in infant auditory STM. Copyright © 2014 Elsevier Inc. All rights reserved.
Hughes, L A; Wigley, P; Bennett, M; Chantrey, J; Williams, N
2010-10-01
Recent studies have suggested that Salmonella Typhimurium strains associated with mortality in UK garden birds are significantly different from strains that cause disease in humans and livestock and that wild bird strains may be host adapted. However, without further genomic characterization of these strains, it is not possible to determine whether they are host adapted. The aim of this study was to characterize a representative sample of Salm. Typhimurium strains detected in wild garden birds using multi-locus sequence typing (MLST) to investigate evolutionary relationships between them. Multi-locus sequence typing was performed on nine Salm. Typhimurium strains isolated from wild garden birds. Two sequence types were identified, the most common of which was ST568. Examination of the public Salmonella enterica MLST database revealed that only three other ST568 isolates had been cultured from a human in Scotland. Two further isolates of Salm. Typhimurium were determined to be ST19. Results of MLST analysis suggest that there is a predominant strain of Salm. Typhimurium circulating among garden bird populations in the United Kingdom, which is rarely detected in other species, supporting the hypothesis that this strain is host adapted. Host-pathogen evolution is often assumed to lead to pathogens becoming less virulent to avoid the death of their host; however, infection with ST568 led to high mortality rates among the wild birds examined, which were all found dead at wild bird-feeding stations. We hypothesize that by attracting unnaturally high densities of birds, wild bird-feeding stations may facilitate the transmission of ST568 between wild birds, thereby reducing the evolutionary cost to this pathogen of killing its host and resulting in a host-adapted strain with increased virulence.
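The typing step described above, in which each isolate's alleles at a fixed set of loci are looked up as a sequence type, can be sketched roughly as follows. This is only an illustration: the seven locus names follow the commonly used Salmonella enterica scheme and are an assumption here, and the allele numbers and the labels "ST-X" and "ST-Y" are invented for the example, not taken from the public database.

```python
from collections import Counter

# Assumed locus set (not named in the abstract).
LOCI = ("aroC", "dnaN", "hemD", "hisD", "purE", "sucA", "thrA")

# Hypothetical profile table: allele numbers and ST labels are made up.
PROFILE_TABLE = {
    (1, 1, 1, 1, 1, 1, 1): "ST-X",
    (1, 1, 2, 1, 1, 1, 1): "ST-Y",
}


def assign_sequence_type(alleles):
    """Map a dict of locus -> allele number to a sequence type label."""
    profile = tuple(alleles[locus] for locus in LOCI)
    return PROFILE_TABLE.get(profile, "unassigned")


def count_sequence_types(isolates):
    """Tally how many isolates fall into each sequence type."""
    return Counter(assign_sequence_type(a) for a in isolates)
```

Tallying the types this way gives the kind of "most common ST" summary reported in the abstract; a real analysis would query the curated MLST database rather than a hand-written table.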
Double photoionization of the Be isoelectronic sequence
NASA Astrophysics Data System (ADS)
Barmaki, S.; Albert, M. A.; Belliveau, J.; Laulan, S.
2018-05-01
We investigate the double photoionization (DPI) process along the Be isoelectronic sequence (Be‑Ne6+) by solving the time-dependent Schrödinger equation with a spectral method of configuration interaction type. The results that we obtain of the DPI cross sections are in a good agreement with other reported data. We also present the first results of double-to-single photoionization cross sections ratios for Be-like ions in support of possible photofragmentation experiments with x-ray free electron lasers. Finally, we probe the mutual interaction of the valence electrons at different photon energies and examine the subsequent redistribution of the excess photon energy among them.
Multilocus Sequence Analysis and rpoB Sequencing of Mycobacterium abscessus (Sensu Lato) Strains
Macheras, Edouard; Roux, Anne-Laure; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby; Bodmer, Thomas; Cambau, Emmanuelle; Gaillard, Jean-Louis; Heym, Beate
2011-01-01
Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536T, M. massiliense CIP 108297T, and M. bolletii CIP 108541T) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. 
Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the clustering of strains. We found 10/120 (8.3%) isolates for which the concatenated MLSA gene sequence and rpoB sequence were discordant (e.g., M. massiliense MLSA sequence and M. abscessus rpoB sequence), suggesting the intergroup lateral transfers of rpoB. In conclusion, our study strongly supports the recent proposal that M. abscessus, M. massiliense, and M. bolletii should constitute a single species. Our findings also indicate that there has been a horizontal transfer of rpoB sequences between these subgroups, precluding the use of rpoB sequencing alone for the accurate identification of the two proposed M. abscessus subspecies. PMID:21106786
[What genes and chromosomes say about the origin and evolution of insects and other arthropods].
Lukhtanov, V A; Kuznetsova, V G
2010-09-01
At the turn of the 21st century, the use of molecular and molecular cytogenetic methods led to revolutionary advances in the systematics of insects and other arthropods. Analysis of nuclear and mitochondrial genes, as well as investigation of structural rearrangements in the mitochondrial chromosome, convincingly supported the Pancrustacea hypothesis, according to which insects originated directly from crustaceans, whereas myriapods are not closely related to them. The presence of the specific telomeric motif TTAGG confirmed the monophyletic origin of arthropods (Arthropoda) and the assignment of tongue worms (Pentastomida) to this phylum. Several different types of telomeric sequences have been found within the class of insects. Investigation of the molecular organization of these sequences may shed light on the relationships between the orders Diptera, Siphonaptera, and Mecoptera and on the origin of such enigmatic groups as the orders Strepsiptera and Zoraptera and the suborder Coleorrhyncha.
Aspergillus asper sp. nov. and Aspergillus collinsii sp. nov., from Aspergillus section Usti.
Jurjevic, Zeljko; Peterson, Stephen W
2016-07-01
In sampling fungi from the built environment, two isolates that could not confidently be placed in described species were encountered. Phenotypic analysis suggested that they belonged in Aspergillus sect. Usti. In order to verify the sectional placement and to assure that they were undescribed rather than phenotypically aberrant isolates, DNA was isolated and sequenced at the beta-tubulin, calmodulin, internal transcribed spacer and RNA polymerase II loci and sequences compared with those from other species in the genus Aspergillus. At each locus, each new isolate was distant from existing species. Phylogenetic trees calculated from these data and GenBank data for species of the section Usti excluded the placement of these isolates in existing species, with statistical support. Because they were excluded from existing taxa, the distinct species Aspergillus asper (type strain NRRL 35910 T ) and Aspergillus collinsii (type strain NRRL 66196 T ) in sect. Usti are proposed to accommodate these strains.
A core microbiome associated with the peritoneal tumors of pseudomyxoma peritonei
2013-01-01
Background: Pseudomyxoma peritonei (PMP) is a malignancy characterized by dissemination of mucus-secreting cells throughout the peritoneum. This disease is associated with significant morbidity and mortality and, despite effective treatment options for early-stage disease, patients with PMP often relapse. Thus, there is a need for additional treatment options to reduce relapse rate and increase long-term survival. A previous study identified the presence of both typed and non-culturable bacteria associated with PMP tissue and determined that increased bacterial density was associated with more severe disease. These findings highlighted the possible role for bacteria in PMP disease. Methods: To more clearly define the bacterial communities associated with PMP disease, we employed a sequence-based analysis to profile the bacterial populations found in PMP tumor and mucin tissue in 11 patients. Sequencing data were confirmed by in situ hybridization at multiple taxonomic depths and by culturing. A pilot clinical study was initiated to determine whether the addition of antibiotic therapy affected PMP patient outcome. Main results: We determined that the types of bacteria present are highly conserved in all PMP patients; the dominant phyla are the Proteobacteria, Actinobacteria, Firmicutes and Bacteroidetes. A core set of taxon-specific sequences was found in all 11 patients; many of these sequences were classified into taxonomic groups that also contain known human pathogens. In situ hybridization directly confirmed the presence of bacteria in PMP at multiple taxonomic depths and supported our sequence-based analysis. Furthermore, culturing of PMP tissue samples allowed us to isolate 11 different bacterial strains from eight independent patients, and in vitro analysis of a subset of these isolates suggests that at least some of these strains may interact with the PMP-associated mucin MUC2. Finally, we provide evidence suggesting that targeting these bacteria with antibiotic treatment may increase the survival of PMP patients. Conclusions: Using 16S amplicon-based sequencing, direct in situ hybridization analysis and culturing methods, we have identified numerous bacterial taxa that are consistently present in all PMP patients tested. Combined with data from a pilot clinical study, these data support the hypothesis that adding antimicrobials to the standard PMP treatment could improve PMP patient survival. PMID:23844722
Andersen, Mikael R.; Salazar, Margarita P.; Schaap, Peter J.; van de Vondervoort, Peter J.I.; Culley, David; Thykaer, Jette; Frisvad, Jens C.; Nielsen, Kristian F.; Albang, Richard; Albermann, Kaj; Berka, Randy M.; Braus, Gerhard H.; Braus-Stromeyer, Susanna A.; Corrochano, Luis M.; Dai, Ziyu; van Dijck, Piet W.M.; Hofmann, Gerald; Lasure, Linda L.; Magnuson, Jon K.; Menke, Hildegard; Meijer, Martin; Meijer, Susan L.; Nielsen, Jakob B.; Nielsen, Michael L.; van Ooyen, Albert J.J.; Pel, Herman J.; Poulsen, Lars; Samson, Rob A.; Stam, Hein; Tsang, Adrian; van den Brink, Johannes M.; Atkins, Alex; Aerts, Andrea; Shapiro, Harris; Pangilinan, Jasmyn; Salamov, Asaf; Lou, Yigong; Lindquist, Erika; Lucas, Susan; Grimwood, Jane; Grigoriev, Igor V.; Kubicek, Christian P.; Martinez, Diego; van Peij, Noël N.M.E.; Roubos, Johannes A.; Nielsen, Jens; Baker, Scott E.
2011-01-01
The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compel additional exploration. We therefore undertook whole-genome sequencing of the acidogenic A. niger wild-type strain (ATCC 1015) and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence, and half the telomeric regions have been elucidated. Moreover, sequence information from ATCC 1015 was used to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 Mb of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis supported up-regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases, and protein transporters in the protein producing CBS 513.88 strain. Our results and data sets from this integrative systems biology analysis resulted in a snapshot of fungal evolution and will support further optimization of cell factories based on filamentous fungi. PMID:21543515
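The SNPs/kb figures quoted above are densities over aligned sequence. A minimal sketch of how such a per-window density could be computed is given below; it assumes two already aligned, gap-free, equal-length sequences, and the function name and window size are illustrative choices rather than the authors' pipeline.

```python
def snps_per_kb(ref, alt, window=1000):
    """Return SNP density (SNPs per 1000 bp) for each window of an alignment.

    Assumes `ref` and `alt` are aligned, gap-free and of equal length.
    The final window may be shorter than `window`; its count is scaled
    to a per-kilobase rate like the others.
    """
    if len(ref) != len(alt):
        raise ValueError("sequences must be aligned to equal length")
    rates = []
    for start in range(0, len(ref), window):
        r = ref[start:start + window]
        a = alt[start:start + window]
        mismatches = sum(x != y for x, y in zip(r, a))
        rates.append(mismatches * 1000 / len(r))
    return rates
```

The average and maximum over the returned list correspond to the kind of summary (average and maximum SNPs/kb) given in the abstract.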
Lucas, J.N.; Straume, T.; Bogen, K.T.
1998-03-24
A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
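The logic of the two immobilization steps is a sequential filter: the second probe is applied only to what the first probe captured. A toy sketch of that logic, not of the laboratory chemistry, is shown below. Each fragment is modeled as a tuple of sequence-type labels, which is purely an assumption made for illustration.

```python
def two_step_capture(fragments, first_type, second_type):
    """Return fragments carrying both sequence types.

    Step 1 keeps fragments containing `first_type` (first immobilization).
    Step 2 keeps, from that subset only, fragments that also contain
    `second_type` (second immobilization).
    """
    first_set = [f for f in fragments if first_type in f]
    second_set = [f for f in first_set if second_type in f]
    return second_set
```

A non-empty result plays the role of the detected second set of nucleic acids, whose presence indicates an aberration.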
The Gap Procedure: for the identification of phylogenetic clusters in HIV-1 sequence data.
Vrbik, Irene; Stephens, David A; Roger, Michel; Brenner, Bluma G
2015-11-04
In the context of infectious disease, sequence clustering can be used to provide important insights into the dynamics of transmission. Cluster analysis is usually performed using a phylogenetic approach whereby clusters are assigned on the basis of sufficiently small genetic distances and high bootstrap support (or posterior probabilities). The computational burden involved in this phylogenetic threshold approach is a major drawback, especially when a large number of sequences are being considered. In addition, this method requires a skilled user to specify the appropriate threshold values which may vary widely depending on the application. This paper presents the Gap Procedure, a distance-based clustering algorithm for the classification of DNA sequences sampled from individuals infected with the human immunodeficiency virus type 1 (HIV-1). Our heuristic algorithm bypasses the need for phylogenetic reconstruction, thereby supporting the quick analysis of large genetic data sets. Moreover, this fully automated procedure relies on data-driven gaps in sorted pairwise distances to infer clusters, thus no user-specified threshold values are required. The clustering results obtained by the Gap Procedure on both real and simulated data, closely agree with those found using the threshold approach, while only requiring a fraction of the time to complete the analysis. Apart from the dramatic gains in computational time, the Gap Procedure is highly effective in finding distinct groups of genetically similar sequences and obviates the need for subjective user-specified values. The clusters of genetically similar sequences returned by this procedure can be used to detect patterns in HIV-1 transmission and thereby aid in the prevention, treatment and containment of the disease.
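The idea of inferring clusters from gaps in sorted pairwise distances can be illustrated with a deliberately simplified sketch. This is not the published Gap Procedure: it assumes aligned sequences, uses a plain proportion-of-differences distance, takes the single largest gap between consecutive sorted distances as the cutoff, and then groups sequences linked by below-cutoff distances. All function names are hypothetical.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def gap_clusters(seqs):
    """Cluster a dict of name -> sequence using a data-driven cutoff."""
    names = list(seqs)
    dist = {(i, j): p_distance(seqs[i], seqs[j])
            for i, j in combinations(names, 2)}
    ordered = sorted(dist.values())
    # Size and position of each jump between consecutive sorted distances.
    gaps = [(ordered[k + 1] - ordered[k], k) for k in range(len(ordered) - 1)]
    if not gaps or max(gaps)[0] == 0:
        return [set(names)]  # no gap to exploit: a single cluster
    _, k = max(gaps)
    cutoff = (ordered[k] + ordered[k + 1]) / 2

    # Union-find: merge sequences whose distance is below the cutoff.
    parent = {n: n for n in names}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for (i, j), d in dist.items():
        if d < cutoff:
            parent[find(i)] = find(j)

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), set()).add(n)
    return list(clusters.values())
```

No threshold is passed in by the user; the cutoff comes from the distances themselves, which is the property the abstract emphasizes.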
Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A
2012-05-01
The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by a few nucleotide substitutions. PCR-based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can efficiently detect nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched those of sequencing completely. Typing could be achieved with fewer than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by the mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.
Shaukat, Shahzad; Angez, Mehar; Alam, Muhammad Masroor; Jebbink, Maarten F; Deijs, Martin; Canuti, Marta; Sharif, Salmaan; de Vries, Michel; Khurshid, Adnan; Mahmood, Tariq; van der Hoek, Lia; Zaidi, Syed Sohail Zahoor
2014-08-12
The use of sequence-independent methods combined with next generation sequencing for identification purposes in clinical samples appears promising, and exciting results have been achieved in understanding unexplained infections. One sequence-independent method, Virus Discovery based on cDNA Amplified Fragment Length Polymorphism (VIDISCA), is capable of identifying viruses that would have remained unidentified in standard diagnostics or cell cultures. VIDISCA is normally combined with next generation sequencing; however, we set up a simplified VIDISCA which can be used when next generation sequencing is not possible. Stool samples of 10 patients with unexplained acute flaccid paralysis showing cytopathic effect in rhabdomyosarcoma cells and/or mouse cells were used to test the efficiency of this method. To further characterize the viruses, VIDISCA-positive samples were amplified and sequenced with gene-specific primers. Simplified VIDISCA detected seven viruses (70%) and the proportion of eukaryotic viral sequences from each sample ranged from 8.3 to 45.8%. Human enterovirus EV-B97, EV-B100, echovirus-9 and echovirus-21, human parechovirus type-3, human astrovirus (probably a type-3/5 recombinant), and tetnovirus-1 were identified. Phylogenetic analysis based on the VP1 region demonstrated that the human enteroviruses are more divergent isolates circulating in the community. Our data support that a simplified VIDISCA protocol can efficiently identify unrecognized viruses grown in cell culture at low cost and in limited time, without the need for advanced technical expertise. Complex data interpretation is also avoided, so the method can be used as a powerful diagnostic tool in resource-limited settings. Redesigning the routine diagnostics might lead to additional detection of previously undiagnosed viruses in clinical samples of patients.
Van Ooteghem, Karen; Frank, James S.; Allard, Fran; Horak, Fay B
2011-01-01
Postural motor learning for dynamic balance tasks has been demonstrated in healthy older adults (Van Ooteghem et al. 2009). The purpose of this study was to investigate the type of knowledge (general or specific) obtained with balance training in this age group and to examine whether embedding perturbation regularities within a balance task masks specific learning. Two groups of older adults maintained balance on a constant frequency-variable amplitude oscillating platform. One group was trained using an embedded sequence (ES) protocol which contained the same 15-s sequence of variable amplitude oscillations in the middle of each trial. A second group was trained using a looped sequence (LS) protocol which contained a 15-s sequence repeated three times to form each trial. All trials were 45-s. Participants were not informed of any repetition. To examine learning, participants performed a retention test following a 24-h delay. LS participants also completed a transfer task. Specificity of learning was examined by comparing performance for repeated versus random sequences (ES) and training versus transfer sequences (LS). Performance was measured by deriving spatial and temporal measures of whole body centre of mass (COM), and trunk orientation. Both groups improved performance with practice as characterized by reduced COM displacement, improved COM-platform phase relationships, and decreased angular trunk motion. Improvements were also characterized by general rather than specific postural motor learning. These findings are similar to young adults (Van Ooteghem et al. 2008) and indicate that age does not influence the type of learning which occurs for balance control. PMID:20544184
[Standard algorithm of molecular typing of Yersinia pestis strains].
Eroshenko, G A; Odinokov, G N; Kukleva, L M; Pavlova, A I; Krasnov, Ia M; Shavina, N Iu; Guseva, N P; Vinogradova, N A; Kutyrev, V V
2012-01-01
Development of the standard algorithm of molecular typing of Yersinia pestis that ensures establishing of subspecies, biovar and focus membership of the studied isolate. Determination of the characteristic strain genotypes of plague infectious agent of main and nonmain subspecies from various natural foci of plague of the Russian Federation and the near abroad. Genotyping of 192 natural Y. pestis strains of main and nonmain subspecies was performed by using PCR methods, multilocus sequencing and multilocus analysis of variable tandem repeat number. A standard algorithm of molecular typing of plague infectious agent including several stages of Yersinia pestis differentiation by membership: in main and nonmain subspecies, various biovars of the main subspecies, specific subspecies; natural foci and geographic territories was developed. The algorithm is based on 3 typing methods--PCR, multilocus sequence typing and multilocus analysis of variable tandem repeat number using standard DNA targets--life support genes (terC, ilvN, inv, glpD, napA, rhaS and araC) and 7 loci of variable tandem repeats (ms01, ms04, ms06, ms07, ms46, ms62, ms70). The effectiveness of the developed algorithm is shown on the large number of natural Y. pestis strains. Characteristic sequence types of Y. pestis strains of various subspecies and biovars as well as MLVA7 genotypes of strains from natural foci of plague of the Russian Federation and the near abroad were established. The application of the developed algorithm will increase the effectiveness of epidemiologic monitoring of plague infectious agent, and analysis of epidemics and outbreaks of plague with establishing the source of origin of the strain and routes of introduction of the infection.
Gelada vocal sequences follow Menzerath’s linguistic law
Gustison, Morgan L.; Semple, Stuart; Ferrer-i-Cancho, Ramon; Bergman, Thore J.
2016-01-01
Identifying universal principles underpinning diverse natural systems is a key goal of the life sciences. A powerful approach in addressing this goal has been to test whether patterns consistent with linguistic laws are found in nonhuman animals. Menzerath’s law is a linguistic law that states that, the larger the construct, the smaller the size of its constituents. Here, to our knowledge, we present the first evidence that Menzerath’s law holds in the vocal communication of a nonhuman species. We show that, in vocal sequences of wild male geladas (Theropithecus gelada), construct size (sequence size in number of calls) is negatively correlated with constituent size (duration of calls). Call duration does not vary significantly with position in the sequence, but call sequence composition does change with sequence size and most call types are abbreviated in larger sequences. We also find that intercall intervals follow the same relationship with sequence size as do calls. Finally, we provide formal mathematical support for the idea that Menzerath’s law reflects compression—the principle of minimizing the expected length of a code. Our findings suggest that a common principle underpins human and gelada vocal communication, highlighting the value of exploring the applicability of linguistic laws in vocal systems outside the realm of language. PMID:27091968
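The central test, a negative relationship between construct size and constituent size, can be sketched with a plain correlation. The call durations below are invented toy values, and Pearson's r is used only for brevity; the abstract does not say which statistic the authors used, so both are assumptions.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Hypothetical vocal sequences: each inner list holds call durations (s).
sequences = [[0.9], [0.8, 0.7], [0.6, 0.5, 0.6], [0.4, 0.5, 0.4, 0.3]]

sizes = [len(seq) for seq in sequences]                  # construct size
mean_durations = [sum(seq) / len(seq) for seq in sequences]  # constituent size

r = pearson_r(sizes, mean_durations)
```

A negative `r` on real data would be consistent with Menzerath's law; a positive or near-zero value would not.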
Neugebauer, Tomasz; Bordeleau, Eric; Burrus, Vincent; Brzezinski, Ryszard
2015-01-01
Data visualization methods are necessary during the exploration and analysis activities of an increasingly data-intensive scientific process. There are few existing visualization methods for raw nucleotide sequences of a whole genome or chromosome. Software for data visualization should allow the researchers to create accessible data visualization interfaces that can be exported and shared with others on the web. Herein, novel software developed for generating DNA data visualization interfaces is described. The software converts DNA data sets into images that are further processed as multi-scale images to be accessed through a web-based interface that supports zooming, panning and sequence fragment selection. Nucleotide composition frequencies and GC skew of a selected sequence segment can be obtained through the interface. The software was used to generate DNA data visualization of human and bacterial chromosomes. Examples of visually detectable features such as short and long direct repeats, long terminal repeats, mobile genetic elements, heterochromatic segments in microbial and human chromosomes, are presented. The software and its source code are available for download and further development. The visualization interfaces generated with the software allow for the immediate identification and observation of several types of sequence patterns in genomes of various sizes and origins. The visualization interfaces generated with the software are readily accessible through a web browser. This software is a useful research and teaching tool for genetics and structural genomics.
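The two per-segment statistics mentioned above, nucleotide composition frequencies and GC skew, are simple to compute. The sketch below uses the usual definition of GC skew, (G - C) / (G + C); the function names are illustrative and are not taken from the described software.

```python
from collections import Counter


def composition(segment):
    """Return the frequency of A, C, G and T in a sequence segment.

    Characters other than A, C, G, T (for example N) are ignored.
    """
    counts = Counter(segment.upper())
    total = sum(counts[base] for base in "ACGT")
    if total == 0:
        return {base: 0.0 for base in "ACGT"}
    return {base: counts[base] / total for base in "ACGT"}


def gc_skew(segment):
    """Return (G - C) / (G + C), or 0.0 if the segment has no G or C."""
    counts = Counter(segment.upper())
    g, c = counts["G"], counts["C"]
    return (g - c) / (g + c) if g + c else 0.0
```

In a viewer of the kind described, these functions would be applied to whatever fragment the user selects.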
[Study on the genetic difference of SEO type Hantaviruses].
Zhang, X; Zhou, S; Wang, H; Hu, J; Guan, Z; Liu, H
2000-10-01
To understand the genetic types of Hantaviruses carried by rodents in Beijing and the differences between them, and to further explore the source of infection. Hantavirus RNA was isolated from the lungs of rodents captured in Beijing that were positive for Hantavirus antigens by frozen sectioning and immunofluorescence assay, then reverse-transcribed and amplified by PCR with Hantavirus-specific primers. Five of the PCR amplification products were sequenced, yielding 300 bp of M segment sequence (nt 2003-2302 according to the cDNA of the Seoul 8039 strain). Nucleotide sequence homology showed that they were sequences of SEO-type Hantavirus. Compared with SEO-type Hantavirus, the nucleotide sequence homology of these samples was more than 94%, while the amino acid sequence homology was more than 98%. Compared with HNT-type Hantavirus, the nucleotide sequence homology was less than 72% and the amino acid sequence homology less than 81%. As with other SEO-type Hantaviruses, their nucleotide sequences and deduced amino acid sequences were highly conserved. Phylogenetic tree analysis showed that the five viruses could be divided into at least 4 branches. It is quite likely that at least two subtypes of SEO virus, comprising 4 branches, were circulating in Beijing.
A High-Throughput Process for the Solid-Phase Purification of Synthetic DNA Sequences
Grajkowski, Andrzej; Cieślak, Jacek; Beaucage, Serge L.
2017-01-01
An efficient process for the purification of synthetic phosphorothioate and native DNA sequences is presented. The process is based on the use of an aminopropylated silica gel support functionalized with aminooxyalkyl functions to enable capture of DNA sequences through an oximation reaction with the keto function of a linker conjugated to the 5′-terminus of DNA sequences. Deoxyribonucleoside phosphoramidites carrying this linker, as a 5′-hydroxyl protecting group, have been synthesized for incorporation into DNA sequences during the last coupling step of a standard solid-phase synthesis protocol executed on a controlled pore glass (CPG) support. Solid-phase capture of the nucleobase- and phosphate-deprotected DNA sequences released from the CPG support is demonstrated to proceed near quantitatively. Shorter than full-length DNA sequences are first washed away from the capture support; the solid-phase purified DNA sequences are then released from this support upon reaction with tetra-n-butylammonium fluoride in dry dimethylsulfoxide (DMSO) and precipitated in tetrahydrofuran (THF). The purity of solid-phase-purified DNA sequences exceeds 98%. The simulated high-throughput and scalability features of the solid-phase purification process are demonstrated without sacrificing purity of the DNA sequences. PMID:28628204
Wang, Zheng; Zhou, Di; Wang, Hui; Jia, Zhenjun; Liu, Jing; Qian, Xiaoqin; Li, Chengtao; Hou, Yiping
2017-11-01
Massively parallel sequencing (MPS) technologies have proved capable of sequencing the majority of the key forensic STR markers. With MPS, not only repeat length but also sequence variation can be detected. Recently, Thermo Fisher Scientific designed an advanced MPS 32-plex panel, named the Precision ID GlobalFiler™ NGS STR Panel, in which the primer set has been designed specifically for MPS technologies and data analysis is supported by a new version of the HID STR Genotyper Plugin (V4.0). In this study, a series of experiments that evaluated concordance, reliability, sensitivity of detection, mixture analysis, and the ability to analyze case-type and challenged samples were conducted. In addition, 106 unrelated Han individuals were sequenced to perform genetic analyses of allelic diversity. As expected, MPS detected broader allele variations and gained higher power of discrimination and exclusion rate. MPS results were found to be concordant with current capillary electrophoresis methods, and single-source complete profiles could be obtained stably using as little as 100 pg of input DNA. Moreover, this MPS panel could be adapted to case-type samples, and partial STR genotypes of the minor contributor could be detected in mixtures of up to 19:1. These results indicate that the Precision ID GlobalFiler™ NGS STR Panel is reliable, robust and reproducible and has the potential to be used as a tool for human forensics. Copyright © 2017 Elsevier B.V. All rights reserved.
Postberg, Jan; Heyse, Katharina; Cremer, Marion; Cremer, Thomas; Lipps, Hans J
2008-01-01
Background: In this study we exploit the unique genome organization of ciliates to characterize the biological function of histone modification patterns and chromatin plasticity for the processing of specific DNA sequences during a nuclear differentiation process. Ciliates are single-cell eukaryotes containing two morphologically and functionally specialized types of nuclei, the somatic macronucleus and the germline micronucleus. In the course of sexual reproduction a new macronucleus develops from a micronuclear derivative. During this process specific DNA sequences are eliminated from the genome, while sequences that will be transcribed in the mature macronucleus are retained. Results: We show by immunofluorescence microscopy, Western analyses and chromatin immunoprecipitation (ChIP) experiments that each nuclear type establishes its specific histone modification signature. Our analyses reveal that the early macronuclear anlage adopts a permissive chromatin state immediately after the fusion of two heterochromatic germline micronuclei. As macronuclear development progresses, repressive histone modifications that specify sequences to be eliminated are introduced de novo. ChIP analyses demonstrate that permissive histone modifications are associated with sequences that will be retained in the new macronucleus. Furthermore, our data support the hypothesis that a PIWI-family protein is involved in a transnuclear cross-talk and in the RNAi-dependent control of developmental chromatin reorganization. Conclusion: Based on these data we present a comprehensive analysis of the spatial and temporal pattern of histone modifications during this nuclear differentiation process. Results obtained in this study may also be relevant for our understanding of chromatin plasticity during metazoan embryogenesis. PMID:19014664
From printed color to image appearance: tool for advertising assessment
NASA Astrophysics Data System (ADS)
Bonanomi, Cristian; Marini, Daniele; Rizzi, Alessandro
2012-07-01
We present a methodology to calculate the color appearance of advertising billboards set in indoor and outdoor environments, printed on different types of paper support and viewed under different illuminations. The aim is to simulate the visual appearance of an image printed on a specific support, observed in a certain context and illuminated with a specific source of light. Knowing in advance the visual rendering of an image in different conditions can help avoid problems related to its visualization. The proposed method applies a sequence of transformations to convert a four-channel (CMYK) image into a spectral one, taking the paper support into account; it then simulates the chosen illumination and finally computes an estimate of the appearance.
Zhang, Yong; Zhang, Fan; Zhu, Shuangli; Chen, Li; Yan, Dongmei; Wang, Dongyan; Tang, Ruiyan; Zhu, Hui; Hou, Xiaohui; An, Hongqiu; Zhang, Hong; Xu, Wenbo
2010-02-01
A type 2 vaccine-related poliovirus (strain CHN3024), differing from the Sabin 2 strain by 0.44% in the VP1 coding region was isolated from a patient with vaccine-associated paralytic poliomyelitis. Sequences downstream of nucleotide position 6735 (3D(pol) coding region) were derived from an unidentified sequence; no close match for a potential parent was found, but it could be classified into a non-polio human enteroviruses species C (HEV-C) phylogeny. The virus differed antigenically from the parental Sabin strain, having an amino acid substitution in the neutralizing antigenic site 1. The similarity between CHN3024 and Sabin 2 sequences suggests that the recombination was recent; this is supported by the estimation that the initiating OPV dose was given only 36-75 days before sampling. The patient's clinical manifestations, intratypic differentiation examination, and whole-genome sequencing showed that this recombinant exhibited characteristics of neurovirulent vaccine-derived polioviruses (VDPV), which may, thus, pose a potential threat to a polio-free world.
Xu, Haiyan; Sun, Zhihong; Liu, Wenjun; Yu, Jie; Song, Yuqin; Lv, Qiang; Zhang, Jiachao; Shao, Yuyu; Menghe, Bilige; Zhang, Heping
2014-05-01
To determine the genetic diversity and phylogenetic relationships among Lactococcus lactis isolates, 197 strains isolated from naturally homemade yogurt in 9 ethnic minority areas of 6 provinces of China were subjected to multilocus sequence typing (MLST). The MLST analysis was performed using internal fragment sequences of 12 housekeeping genes (carB, clpX, dnaA, groEL, murC, murE, pepN, pepX, pyrG, recA, rpoB, and pheS). Six (dnaA) to 8 (murC) different alleles were detected for these genes, which ranged from 33.62 (clpX) to 41.95% (recA) GC (guanine-cytosine) content. The nucleotide diversity (π) ranged from 0.00362 (murE) to 0.08439 (carB). Despite this limited allelic diversity, the allele combinations of each strain revealed 72 different sequence types, which denoted significant genotypic diversity. The dN/dS ratios (where dS is the number of synonymous substitutions per synonymous site, and dN is the number of nonsynonymous substitutions per nonsynonymous site) were lower than 1, suggesting potential negative selection for these genes. The standardized index of association of the alleles IA(S)=0.3038 supported the clonality of Lc. lactis, but the presence of network structure revealed by the split decomposition analysis of the concatenated sequence was strong evidence for intraspecies recombination. Therefore, this suggests that recombination contributed to the evolution of Lc. lactis. A minimum spanning tree analysis of the 197 isolates identified 14 clonal complexes and 23 singletons. Phylogenetic trees were constructed based on the sequence types, using the minimum evolution algorithm, and on the concatenated sequence (6,192 bp), using the unweighted pair-group method with arithmetic mean, and these trees indicated that the evolution of our Lc. lactis population was correlated with geographic origin. Taken together, our results demonstrated that MLST could provide a better understanding of Lc. lactis genome evolution, as well as useful information for future studies on global Lc. lactis structure and genetic evolution, which will lay the foundation for screening Lc. lactis as starter cultures in fermented dairy products. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
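The abstract notes that a small number of alleles per gene can still combine into many sequence types. As a minimal sketch of how sequence types follow from allele combinations (the strain names, allele numbers and three-locus profiles are invented for illustration and are not data from the study), each distinct allele profile receives its own sequence type number:

```python
def assign_sequence_types(profiles):
    """Give each strain a sequence type (ST) number.

    profiles maps a strain name to its allele numbers, one per locus,
    always in the same locus order. Strains with identical allele
    profiles share an ST; STs are numbered in order of first appearance.
    """
    st_of_profile = {}
    st_of_strain = {}
    for strain, alleles in profiles.items():
        key = tuple(alleles)
        if key not in st_of_profile:
            st_of_profile[key] = len(st_of_profile) + 1
        st_of_strain[strain] = st_of_profile[key]
    return st_of_strain


# Hypothetical allele profiles over three loci (e.g. carB, clpX, dnaA).
toy_profiles = {
    "strain_1": (1, 2, 3),
    "strain_2": (1, 2, 3),  # same profile as strain_1, so same ST
    "strain_3": (1, 2, 4),  # one allele differs, so a new ST
}
print(assign_sequence_types(toy_profiles))
```

With 12 loci, a single allele difference at any locus is enough to define a new sequence type, which is how 6 to 8 alleles per gene can yield 72 sequence types.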
Martínez-Castilla, León Patricio; Alvarez-Buylla, Elena R.
2003-01-01
Gene duplication is a substrate of evolution. However, the relative importance of positive selection versus relaxation of constraints in the functional divergence of gene copies is still under debate. Plant MADS-box genes encode transcriptional regulators key in various aspects of development and have undergone extensive duplications to form a large family. We recovered 104 MADS sequences from the Arabidopsis genome. Bayesian phylogenetic trees recover type II lineage as a monophyletic group and resolve a branching sequence of monophyletic groups within this lineage. The type I lineage is comprised of several divergent groups. However, contrasting gene structure and patterns of chromosomal distribution between type I and II sequences suggest that they had different evolutionary histories and support the placement of the root of the gene family between these two groups. Site-specific and site-branch analyses of positive Darwinian selection (PDS) suggest that different selection regimes could have affected the evolution of these lineages. We found evidence for PDS along the branch leading to flowering time genes that have a direct impact on plant fitness. Sites with high probabilities of having been under PDS were found in the MADS and K domains, suggesting that these played important roles in the acquisition of novel functions during MADS-box diversification. Detected sites are targets for further experimental analyses. We argue that adaptive changes in MADS-domain protein sequences have been important for their functional divergence, suggesting that changes within coding regions of transcriptional regulators have influenced phenotypic evolution of plants. PMID:14597714
Xiong, Y; Eickbush, T H
1988-01-01
Two types of insertion elements, R1 and R2 (previously called type I and type II), are known to interrupt the 28S ribosomal genes of several insect species. In the silkmoth, Bombyx mori, each element occupies approximately 10% of the estimated 240 ribosomal DNA units, while at most only a few copies are located outside the ribosomal DNA units. We present here the complete nucleotide sequence of an R1 insertion from B. mori (R1Bm). This 5.1-kilobase element contains two overlapping open reading frames (ORFs) which together occupy 88% of its length. ORF1 is 461 amino acids in length and exhibits characteristics of retroviral gag genes. ORF2 is 1,051 amino acids in length and contains homology to reverse transcriptase-like enzymes. The analysis of 3' and 5' ends of independent isolates from the ribosomal locus supports the suggestion that R1 is still functioning as a transposable element. The precise location of the element within the genome implies that its transposition must occur with remarkable insertion sequence specificity. Comparison of the deduced amino acid sequences from six retrotransposons, R1 and R2 of B. mori, I factor and F element of Drosophila melanogaster, L1 of Mus domesticus, and Ingi of Trypanosoma brucei, reveals a relatively high level of sequence homology in the reverse transcriptase region. Like R1, these elements lack long terminal repeats. We have therefore named this class of related elements the non-long-terminal-repeat (non-LTR) retrotransposons. PMID:2447482
Li, Yong; Xue, Han; Sang, Sheng-Qi; Lin, Cai-Li; Wang, Xi-Zhuo
2017-01-01
Two Gram-stain negative aerobic bacterial strains were isolated from the bark tissue of Populus × euramericana. The novel isolates were investigated using a polyphasic approach including 16S rRNA gene sequencing, genome sequencing, average nucleotide identity (ANI) and both phenotypic and chemotaxonomic assays. The genome core gene sequence and 16S rRNA gene phylogenies suggest that the novel isolates are different from the genera Snodgrassella and Stenoxybacter. Additionally, the ANI, G+C content, main fatty acids and phospholipid profile data supported the distinctiveness of the novel strain from genus Snodgrassella. Therefore, based on the data presented, the strains constitute a novel species of a novel genus within the family Neisseriaceae, for which the name Populibacter corticis gen. nov., sp. nov. is proposed. The type strain is 15-3-5T (= CFCC 13594T = KCTC 42251T).
de Jong, W W; Zweers, A; Versteeg, M; Dessauer, H C; Goodman, M
1985-11-01
The amino acid sequences of the eye lens protein alpha-crystallin A from many mammalian and avian species, two frog species, and a dogfish have provided detailed information about the molecular evolution of this protein and allowed some useful inferences about phylogenetic relationships among these species. We now have isolated and sequenced the alpha-crystallins of the American alligator and the common tegu lizard. The reptilian alpha A chains appear to have evolved as slowly as those of other vertebrates, i.e., at two to three amino acid replacements per 100 residues in 100 Myr. The lack of charged replacements and the general types and distribution of replacements also are similar to those in other vertebrate alpha A chains. Maximum-parsimony analyses of the total data set of 67 vertebrate alpha A sequences support the monophyletic origin of alligator, tegu, and birds and favor the grouping of crocodilians and birds as surviving sister groups in the subclass Archosauria.
Cholley, Pascal; Stojanov, Milos; Hocquet, Didier; Thouverez, Michelle; Bertrand, Xavier; Blanc, Dominique S
2015-08-01
Reliable molecular typing methods are necessary to investigate the epidemiology of bacterial pathogens. Reference methods such as multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE) are costly and time consuming. Here, we compared our newly developed double-locus sequence typing (DLST) method for Pseudomonas aeruginosa to MLST and PFGE on a collection of 281 isolates. DLST was as discriminatory as MLST and was able to recognize "high-risk" epidemic clones. Both methods were highly congruent. Not surprisingly, a higher discriminatory power was observed with PFGE. In conclusion, being a simple method (single-strand sequencing of only 2 loci), DLST is valuable as a first-line typing tool for epidemiological investigations of P. aeruginosa. Coupled to a more discriminant method like PFGE or whole genome sequencing, it might represent an efficient typing strategy to investigate or prevent outbreaks. Copyright © 2015 Elsevier Inc. All rights reserved.
Guo, Yahong; Tsuruga, Ayako; Yamaguchi, Shigeharu; Oba, Koji; Iwai, Kasumi; Sekita, Setsuko; Mizukami, Hajime
2006-06-01
The chloroplast chlB gene encoding subunit B of light-independent protochlorophyllide reductase was amplified from herbarium and crude drug specimens of Ephedra sinica, E. intermedia, E. equisetina, and E. przewalskii. Sequence comparison of the chlB gene indicated that all the E. sinica specimens have the same sequence type (Type S), distinct from those of the other species, while there are two sequence types (Type E1 and Type E2) in E. equisetina. E. intermedia and E. przewalskii shared an identical sequence type (Type IP). E. sinica was also identified by digesting the chlB fragment with Bcl I. A novel method for DNA authentication of Ephedra Herb based on the sequences of the chloroplast chlB gene and internal transcribed spacer of nuclear rRNA genes was developed and successfully applied for identification of the crude drugs obtained in the Chinese market.
Suzuki, Masahiro; Segawa, Takahiro; Mori, Hiroshi; Akiyoshi, Ayumi; Ootsuki, Ryo; Kurihara, Akira; Sakayama, Hidetoshi; Kitayama, Taiju; Abe, Tsuyoshi; Kogame, Kazuhiro; Kawai, Hiroshi; Nozaki, Hisayoshi
2016-01-01
Liagora japonica is a red algal species distributed in temperate regions of Japan. This species has not been collected from its type locality on the Pacific coast of Japan since 1927 and seems to have become extinct in this area. For molecular characterization of L. japonica, we extracted DNA from the topotype material of L. japonica collected in 1927, analyzed seven genes using Illumina next-generation sequencing, and compared these data with sequences from modern samples of similar red algae collected from the Japan Sea coast of Japan. Both morphological and molecular data from modern samples and historical specimens (including the lectotype and topotype) suggest that the specimens from the Pacific and Japan Sea coasts of Japan should be treated as a single species, and that L. japonica is phylogenetically separated from the genus Liagora. Based on the phylogenetic results and examination of reproductive structures, we propose Otohimella japonica gen. et comb. nov., characterized morphologically by diffuse carposporophytes, undivided carposporangia, and involucral filaments initiated only from the cortical cell on the supporting cell. PMID:27388436
MMACHC gene mutation in familial hypogonadism with neurological symptoms.
Shi, Changhe; Shang, Dandan; Sun, Shilei; Mao, Chengyuan; Qin, Jie; Luo, Haiyang; Shao, Mingwei; Chen, Zhengguang; Liu, Yutao; Liu, Xinjing; Song, Bo; Xu, Yuming
2015-12-15
Recent studies have convincingly documented that hypogonadism is a component of various hereditary disorders and is often recognized as an important clinical feature in combination with various neurological symptoms, yet, the causative genes in a few related families are still unknown. High-throughput sequencing has become an efficient method to identify causative genes in related complex hereditary disorders. In this study, we performed exome sequencing in a family presenting hypergonadotropic hypogonadism with neurological presentations of mental retardation, epilepsy, ataxia, and leukodystrophy. After bioinformatic analysis and Sanger sequencing validation, we identified compound heterozygous mutations: c.482G>A (p.R161Q) and c.609G>A (p.W203X) in MMACHC gene in this pedigree. MMACHC was previously confirmed to be responsible for methylmalonic aciduria (MMA) combined with homocystinuria, cblC type (cblC disease), a hereditary vitamin B12 metabolic disorder. Biochemical and gas chromatography-mass spectrometry (GC-MS) examinations in this pedigree further supported the cblC disease diagnosis. These results indicated that hypergonadotropic hypogonadism may be a novel clinical manifestation of cblC disease, but more reports on additional patients are needed to support this hypothesis. Copyright © 2015 Elsevier B.V. All rights reserved.
TagDust2: a generic method to extract reads from sequencing data.
Lassmann, Timo
2015-01-28
Arguably the most basic step in the analysis of next generation sequencing data (NGS) involves the extraction of mappable reads from the raw reads produced by sequencing instruments. The presence of barcodes, adaptors and artifacts subject to sequencing errors makes this step non-trivial. Here I present TagDust2, a generic approach utilizing a library of hidden Markov models (HMM) to accurately extract reads from a wide array of possible read architectures. TagDust2 extracts more reads of higher quality compared to other approaches. Processing of multiplexed single, paired end and libraries containing unique molecular identifiers is fully supported. Two additional post processing steps are included to exclude known contaminants and filter out low complexity sequences. Finally, TagDust2 can automatically detect the library type of sequenced data from a predefined selection. Taken together TagDust2 is a feature rich, flexible and adaptive solution to go from raw to mappable NGS reads in a single step. The ability to recognize and record the contents of raw reads will help to automate and demystify the initial, and often poorly documented, steps in NGS data analysis pipelines. TagDust2 is freely available at: http://tagdust.sourceforge.net .
Scholz, Christian F P; Jensen, Anders
2017-01-01
The protocol describes a computational method to develop a Single Locus Sequence Typing (SLST) scheme for typing bacterial species. The resulting scheme can be used to type bacterial isolates as well as bacterial species directly from complex communities using next-generation sequencing technologies.
Development of Multilocus Sequence Typing (MLST) for Mycoplasma synoviae.
El-Gazzar, Mohamed; Ghanem, Mostafa; McDonald, Kristina; Ferguson-Noel, Naola; Raviv, Ziv; Slemons, Richard D
2017-03-01
Mycoplasma synoviae (MS) is a poultry pathogen that has had an increasing incidence and economic impact over the past few years. Strain identification is necessary for outbreak investigation, infection source identification, and facilitating prevention and control as well as eradication efforts. Currently, a segment of the variable lipoprotein hemagglutinin A (vlhA) gene (420 bp) is the only target that is used for MS strain identification. A major limitation of this assay is that clonality of typed samples can only be inferred if their vlhA sequences are identical; however, if their sequences are different, the degree of relatedness is uncertain. In this study we propose a multilocus sequence typing (MLST) assay to further refine MS strain identification. After initial screening of 24 housekeeping genes as potential targets, seven genes were selected for the MLST assay. An internal segment (450-711 bp) from each of the seven genes was successfully amplified and sequenced from 58 different MS strains and field isolates (n = 30) or positive clinical samples (n = 28). The collective sequence of all seven gene segments (3960 bp total) was used for MS sequence typing. The 58 tested MS samples were typed into 30 different sequence types using the MLST assay and, coincidentally, all the samples were typed into 30 sequence types using the vlhA assay. However, the phylogenetic tree generated using the MLST data was more congruent with the epidemiologic information than was the tree generated by the vlhA assay. We suggest that the newly developed MLST assay and the vlhA assay could be used in tandem for MS typing. The MLST assay will be a valuable and more reliable tool for MS sequence typing, providing better understanding of the epidemiology of MS infection. This in turn will aid disease prevention, control, and eradication efforts.
Kurgan, Lukasz; Cios, Krzysztof; Chen, Ke
2008-05-01
Protein structure prediction methods provide accurate results when a homologous protein is predicted, while poorer predictions are obtained in the absence of homologous templates. However, some protein chains that share twilight-zone pairwise identity can form similar folds and thus determining structural similarity without the sequence similarity would be desirable for the structure prediction. The folding type of a protein or its domain is defined as the structural class. Current structural class prediction methods that predict the four structural classes defined in SCOP provide up to 63% accuracy for the datasets in which sequence identity of any pair of sequences belongs to the twilight-zone. We propose SCPRED method that improves prediction accuracy for sequences that share twilight-zone pairwise similarity with sequences used for the prediction. SCPRED uses a support vector machine classifier that takes several custom-designed features as its input to predict the structural classes. Based on extensive design that considers over 2300 index-, composition- and physicochemical properties-based features along with features based on the predicted secondary structure and content, the classifier's input includes 8 features based on information extracted from the secondary structure predicted with PSI-PRED and one feature computed from the sequence. Tests performed with datasets of 1673 protein chains, in which any pair of sequences shares twilight-zone similarity, show that SCPRED obtains 80.3% accuracy when predicting the four SCOP-defined structural classes, which is superior when compared with over a dozen recent competing methods that are based on support vector machine, logistic regression, and ensemble of classifiers predictors. The SCPRED can accurately find similar structures for sequences that share low identity with sequence used for the prediction. 
The high predictive accuracy achieved by SCPRED is attributed to the design of the features, which are capable of separating the structural classes in spite of their low dimensionality. We also demonstrate that the SCPRED's predictions can be successfully used as a post-processing filter to improve performance of modern fold classification methods.
Kurgan, Lukasz; Cios, Krzysztof; Chen, Ke
2008-01-01
Background Protein structure prediction methods provide accurate results when a homologous protein is predicted, while poorer predictions are obtained in the absence of homologous templates. However, some protein chains that share twilight-zone pairwise identity can form similar folds and thus determining structural similarity without the sequence similarity would be desirable for the structure prediction. The folding type of a protein or its domain is defined as the structural class. Current structural class prediction methods that predict the four structural classes defined in SCOP provide up to 63% accuracy for the datasets in which sequence identity of any pair of sequences belongs to the twilight-zone. We propose SCPRED method that improves prediction accuracy for sequences that share twilight-zone pairwise similarity with sequences used for the prediction. Results SCPRED uses a support vector machine classifier that takes several custom-designed features as its input to predict the structural classes. Based on extensive design that considers over 2300 index-, composition- and physicochemical properties-based features along with features based on the predicted secondary structure and content, the classifier's input includes 8 features based on information extracted from the secondary structure predicted with PSI-PRED and one feature computed from the sequence. Tests performed with datasets of 1673 protein chains, in which any pair of sequences shares twilight-zone similarity, show that SCPRED obtains 80.3% accuracy when predicting the four SCOP-defined structural classes, which is superior when compared with over a dozen recent competing methods that are based on support vector machine, logistic regression, and ensemble of classifiers predictors. Conclusion The SCPRED can accurately find similar structures for sequences that share low identity with sequence used for the prediction. 
The high predictive accuracy achieved by SCPRED is attributed to the design of the features, which are capable of separating the structural classes in spite of their low dimensionality. We also demonstrate that the SCPRED's predictions can be successfully used as a post-processing filter to improve performance of modern fold classification methods. PMID:18452616
Bouaziz, Amira; Loucif, Lotfi; Ayachi, Ammar; Guehaz, Karima; Bendjama, Esma; Rolain, Jean-Marc
2018-05-01
The emergence of carbapenemase-producing Enterobacteriaceae is of great concern to public health worldwide. The aim of this study was to screen for the presence of carbapenemase-producing Enterobacteriaceae in white stork (Ciconia ciconia) migratory bird stools, and to investigate their molecular support on β-lactamase production. In March 2015, 32 fecal samples of white stork were collected in the Commune of El Madher Wilaya de Batna, in eastern Algeria. Samples were subjected to selective isolation of carbapenem-resistant Enterobacteriaceae. Representative colonies were screened phenotypically for carbapenemase production. Carbapenemase-producing isolates were subjected to antibiotic susceptibility testing and extended-spectrum β-lactamase (ESBL) coproduction. β-Lactamase determinants were searched for by PCR and sequencing. Three carbapenemase-producing Escherichia coli were obtained. Only one strain was positive for ESBL production. The OXA-48-type carbapenemase-encoding gene was detected in all isolates. Screening for other β-lactamase-encoding genes showed that all isolates coexpress the bla TEM gene, whereas one of them additionally harbored the bla CTX-M-15 ESBL gene. Multilocus sequence typing results showed that two strains belonged to the sequence type 38. This work demonstrated for the first time that the migratory white stork can play an important role in the dissemination of OXA-48-producing E. coli as a potential reservoir and vector.
Mobberley, Jennifer M; Ortega, Maya C; Foster, Jamie S
2012-01-01
Thrombolites are unlaminated carbonate structures that form as a result of the metabolic interactions of complex microbial mat communities. Thrombolites have a long geological history; however, little is known regarding the microbes associated with modern structures. In this study, we use a barcoded 16S rRNA gene-pyrosequencing approach coupled with morphological analysis to assess the bacterial, cyanobacterial and archaeal diversity associated with actively forming thrombolites found in Highborne Cay, Bahamas. Analyses revealed four distinct microbial mat communities referred to as black, beige, pink and button mats on the surfaces of the thrombolites. At a coarse phylogenetic resolution, the domain bacterial sequence libraries from the four mats were similar, with Proteobacteria and Cyanobacteria being the most abundant. At the finer resolution of the rRNA gene sequences, significant differences in community structure were observed, with dramatically different cyanobacterial communities. Of the four mat types, the button mats contained the highest diversity of Cyanobacteria, and were dominated by two sequence clusters with high similarity to the genus Dichothrix, an organism associated with the deposition of carbonate. Archaeal diversity was low, but varied in all mat types, and the archaeal community was predominately composed of members of the Thaumarchaeota and Euryarchaeota. The morphological and genetic data support the hypothesis that the four mat types are distinctive thrombolitic mat communities. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.
Alpert, Carl-Alfred; Crutz-Le Coq, Anne-Marie; Malleret, Christine; Zagorec, Monique
2003-01-01
The complete nucleotide sequence of the 13-kb plasmid pRV500, isolated from Lactobacillus sakei RV332, was determined. Sequence analysis enabled the identification of genes coding for a putative type I restriction-modification system, two genes coding for putative recombinases of the integrase family, and a region likely involved in replication. The structural features of this region, comprising a putative ori segment containing 11- and 22-bp repeats and a repA gene coding for a putative initiator protein, indicated that pRV500 belongs to the pUCL287 subfamily of theta-type replicons. A 3.7-kb fragment encompassing this region was fused to an Escherichia coli replicon to produce the shuttle vector pRV566 and was observed to be functional in L. sakei for plasmid replication. The L. sakei replicon alone could not support replication in E. coli. Plasmid pRV500 and its derivative pRV566 were determined to be at very low copy numbers in L. sakei. pRV566 was maintained at a reasonable rate over 20 generations in several lactobacilli, such as Lactobacillus curvatus, Lactobacillus casei, and Lactobacillus plantarum, in addition to L. sakei, making it an interesting basis for developing vectors. Sequence relationships with other plasmids are described and discussed. PMID:12957947
Velasco, Harvy Mauricio; Morales, Jessica L
2017-01-01
Osteogenesis imperfecta (OI) is a hereditary disease characterized by bone fragility caused by mutations in the proteins that support the formation of the extracellular matrix in the bone. The diagnosis of OI begins with clinical suspicion, from phenotypic findings at birth, low-impact fractures during childhood or family history that may lead to it. However, the variability in the semiology of the disease does not allow establishing an early diagnosis in all cases, and unfortunately, specific clinical data provided by the literature only report 28 patients with OI type XI. This information is limited and heterogeneous, and therefore, detailed information on the natural history of this disease is not yet available. This paper reports the case of a male patient who, despite undergoing multidisciplinary management, did not have a diagnosis for a long period of time, and could only be given one with the use of whole-exome sequencing. The use of the next-generation sequencing in patients with ultrarare genetic diseases, including skeletal dysplasias, should be justified when clear clinical criteria and an improvement in the quality of life of the patients and their families are intended while reducing economic and time costs. Thus, this case report corresponds to the 29th patient affected with OI type XI, and the 18th mutation in FKBP10, causative of this pathology. PMID:29158687
Bolivar, I; Fahrni, J F; Smirnov, A; Pawlowski, J
2001-12-01
Naked lobose amoebae (gymnamoebae) are among the most abundant groups of protists present in all aquatic and terrestrial biotopes. Yet, because of a lack of informative morphological characters, the origin and evolutionary history of gymnamoebae are poorly known. The first molecular studies revealed multiple origins for the amoeboid lineages and an extraordinary diversity of amoebae species. Molecular data, however, exist only for a few species of the numerous taxa belonging to this group. Here, we present the small-subunit (SSU) rDNA sequences of four species of typical large gymnamoebae: Amoeba proteus, Amoeba leningradensis, Chaos nobile, and Chaos carolinense. Sequence analysis suggests that the four species are closely related to the species of genera Saccamoeba, Leptomyxa, Rhizamoeba, Paraflabellula, Hartmannella, and Echinamoeba. All of them form a relatively well-supported clade, which corresponds to the subclass Gymnamoebia, in agreement with morphology-based taxonomy. The other gymnamoebae cluster in small groups or branch separately. Their relationships change depending on the type of analysis and the model of nucleotide substitution. All gymnamoebae branch together in Neighbor-Joining analysis with corrections for among-site rate heterogeneity and proportion of invariable sites. This clade, however, is not statistically supported by SSU rRNA gene sequences and further analysis of protein sequence data will be necessary to test the monophyly of gymnamoebae.
Cloning and Analysis of the Alternative Oxidase Gene of Neurospora Crassa
Li, Q.; Ritzel, R. G.; McLean, LLT.; McIntosh, L.; Ko, T.; Bertrand, H.; Nargang, F. E.
1996-01-01
Mitochondria of Neurospora crassa contain a cyanide-resistant alternative respiratory pathway in addition to the cytochrome pathway. The alternative oxidase is present only when electron flow through the cytochrome chain is restricted. Both genomic and cDNA copies for the alternative oxidase gene have been isolated and analyzed. The sequence of the predicted protein is homologous to that of other species. The mRNA for the alternative oxidase is scarce in wild-type cultures grown under normal conditions, but it is abundant in cultures grown in the presence of chloramphenicol, an inhibitor of mitochondrial protein synthesis, or in mutants deficient in mitochondrial cytochromes. Thus, induction of alternative oxidase appears to be at the transcriptional level. Restriction fragment length polymorphism mapping of the isolated gene demonstrated that it is located in a position corresponding to the aod-1 locus. Sequence analysis of mutant aod-1 alleles reveals mutations affecting the coding sequence of the alternative oxidase. The level of aod-1 mRNA in an aod-2 mutant strain that had been grown in the presence of chloramphenicol was reduced several fold relative to wild-type, supporting the hypothesis that the product of aod-2 is required for optimal expression of aod-1. PMID:8770590
Reads2Type: a web application for rapid microbial taxonomy identification.
Saputra, Dhany; Rasmussen, Simon; Larsen, Mette V; Haddad, Nizar; Sperotto, Maria Maddalena; Aarestrup, Frank M; Lund, Ole; Sicheritz-Pontén, Thomas
2015-11-25
Identification of bacteria may be based on sequencing and molecular analysis of a specific locus such as 16S rRNA, or a set of loci such as in multilocus sequence typing. In the near future, healthcare institutions and routine diagnostic microbiology laboratories may need to sequence the entire genome of microbial isolates. Therefore we have developed Reads2Type, a web-based tool for taxonomy identification based on whole bacterial genome sequence data. Raw sequencing data provided by the user are mapped against a set of marker probes that are derived from currently available complete bacterial genomes. Using a dataset of 1003 whole-genome-sequenced bacteria from various sequencing platforms, Reads2Type was able to identify the species with 99.5 % accuracy and on a time scale of minutes. In comparison with other tools, Reads2Type offers the advantage of not needing to transfer sequencing files, as the entire computational analysis is done on the computer of whoever uses the web application. This also prevents data privacy issues from arising. The Reads2Type tool is available at http://www.cbs.dtu.dk/~dhany/reads2type.html.
Method for isolating chromosomal DNA in preparation for hybridization in suspension
Lucas, Joe N.
2000-01-01
A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
Ren, Yong-Cheng; Xu, Long-Long; Zhang, Lin; Hui, Feng-Li
2015-10-01
Four yeast strains were isolated from the gut of beetles collected on Baotianman Mountain and People's Park of Nanyang in Henan Province, China. These strains produced unconjugated asci with one or two ellipsoidal to elongate ascospores in a persistent ascus. Phylogenetic analysis of the D1/D2 domains of the LSU rRNA gene sequences indicated that the isolates represent two novel sexual species in the Candida/Lodderomyces clade. Candida baotianmanensis sp. nov. was located in a statistically well-supported branch together with Candida maltosa. Candida pseudoviswanathii sp. nov. formed a subclade with its closest relative Candida viswanathii supported by a strong bootstrap value. The two novel species were distinguished from their most closely related described species, Candida maltosa and Candida viswanathii, in the D1/D2 LSU rRNA gene and internal transcribed spacer (ITS) sequences and in phenotypic traits. The type strain of Candida baotianmanensis sp. nov. is NYNU 14719T ( = CBS 13915T = CICC 33052T), and the type strain of Candida pseudoviswanathii sp. nov. is NYNU 14772T ( = CBS 13916T = CICC 33053T). The MycoBank numbers for Candida baotianmanensis sp. nov. and Candida pseudoviswanathii sp. nov. are MB 812621 and MB 812622.
Milius, Robert P; Heuer, Michael; Valiga, Daniel; Doroschak, Kathryn J; Kennedy, Caleb J; Bolon, Yung-Tsi; Schneider, Joel; Pollack, Jane; Kim, Hwa Ran; Cereb, Nezih; Hollenbach, Jill A; Mack, Steven J; Maiers, Martin
2015-12-01
We present an electronic format for exchanging data for HLA and KIR genotyping with extensions for next-generation sequencing (NGS). This format addresses NGS data exchange by refining the Histoimmunogenetics Markup Language (HML) to conform to the proposed Minimum Information for Reporting Immunogenomic NGS Genotyping (MIRING) reporting guidelines (miring.immunogenomics.org). Our refinements of HML include two major additions. First, NGS is supported by new XML structures to capture additional NGS data and metadata required to produce a genotyping result, including analysis-dependent (dynamic) and method-dependent (static) components. A full genotype, consensus sequence, and the surrounding metadata are included directly, while the raw sequence reads and platform documentation are externally referenced. Second, genotype ambiguity is fully represented by integrating Genotype List Strings, which use a hierarchical set of delimiters to represent allele and genotype ambiguity in a complete and accurate fashion. HML also continues to enable the transmission of legacy methods (e.g. site-specific oligonucleotide, sequence-specific priming, and Sequence Based Typing (SBT)), adding features such as allowing multiple group-specific sequencing primers, and fully leveraging techniques that combine multiple methods to obtain a single result, such as SBT integrated with NGS. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
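The hierarchical delimiters mentioned above can be illustrated with a small parser. This is a minimal sketch, not part of HML or any published tool: the delimiter set and its precedence (`^` between loci, `|` between alternative genotypes, `+` between gene copies, `~` between phased alleles, `/` between ambiguous alleles) reflect the GL String convention as we understand it and should be checked against the specification. The function name `parse_gl_string` and the example string are hypothetical.

```python
# Assumed GL String delimiters, from outermost (lowest precedence) to innermost.
GL_DELIMITERS = ["^", "|", "+", "~", "/"]


def parse_gl_string(text, level=0):
    """Recursively split a GL String into nested lists, outermost delimiter first.

    Nesting order of the result: loci -> genotypes -> gene copies ->
    phased alleles -> ambiguous alleles (plain strings at the leaves).
    """
    if level == len(GL_DELIMITERS):
        return text  # a single allele name
    parts = text.split(GL_DELIMITERS[level])
    return [parse_gl_string(part, level + 1) for part in parts]


# Hypothetical example: one locus, one genotype, two gene copies,
# the first of which is ambiguous between two alleles.
example = "HLA-A*01:01/HLA-A*01:02+HLA-A*24:02"
tree = parse_gl_string(example)

first_copy_alleles = tree[0][0][0][0]   # ambiguous alleles of the first gene copy
second_copy_alleles = tree[0][0][1][0]  # the single allele of the second gene copy
print(first_copy_alleles)
print(second_copy_alleles)
```

Splitting on the lowest-precedence delimiter first keeps each level of ambiguity in its own list, which is one plausible way to read "hierarchical" here.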
Al-Amoudi, Soha; Essack, Magbubah; Simões, Marta F; Bougouffa, Salim; Soloviev, Irina; Archer, John A C; Lafi, Feras F; Bajic, Vladimir B
2016-09-10
Microorganisms that inhabit uncharted, unique soils, such as those in the highly saline and hot Red Sea lagoons on the Saudi Arabian coastline, represent untapped sources of potentially new bioactive compounds. In this study, a culture-dependent approach was applied to three types of sediments: mangrove mud (MN), microbial mat (MM), and barren soil (BS), collected from Rabigh harbor lagoon (RHL) and Al-Kharrar lagoon (AKL). The isolated bacteria were evaluated for their potential to produce bioactive compounds. The phylogenetic characterization of 251 bacterial isolates, based on 16S rRNA gene sequencing, supported their assignment to five different phyla: Proteobacteria, Firmicutes, Actinobacteria, Bacteroidetes, and Planctomycetes. Fifteen putative novel species were identified based on 16S rRNA gene sequence similarity of ≤98% to other strain sequences in the NCBI database. We demonstrate that 49 of the 251 isolates exhibit the potential to produce antimicrobial compounds. Additionally, at least one type of biosynthetic gene sequence, responsible for the synthesis of secondary metabolites, was recovered from 25 of the 49 isolates. Moreover, 10 of the isolates had a growth inhibition effect towards Staphylococcus aureus, Salmonella typhimurium and Pseudomonas syringae. We report the previously unknown antimicrobial activity of B. borstelensis, P. dendritiformis and M. salipaludis against all three indicator pathogens. Our study provides evidence of diverse cultured microbes associated with the Red Sea harbor/lagoon environments and of their potential to produce antimicrobial compounds.
Evidence of birth-and-death evolution of 5S rRNA gene in Channa species (Teleostei, Perciformes).
Barman, Anindya Sundar; Singh, Mamta; Singh, Rajeev Kumar; Lal, Kuldeep Kumar
2016-12-01
In higher eukaryotes, the minor rDNA family codes for 5S rRNA; it is arranged in tandem arrays and comprises a highly conserved 120 bp coding sequence with a variable non-transcribed spacer (NTS). Initially, 5S rDNA repeats were considered to have evolved by concerted evolution. However, some recent reports, including studies of teleost fishes, suggest that the evolution of 5S rDNA repeats does not fit the concerted evolution model and may instead be explained by a birth-and-death model. To study the mode of evolution of 5S rDNA repeats in perciform fishes, the nucleotide sequence and molecular organization of 5S rDNA in five species of the genus Channa were analyzed in the present study. Molecular analyses revealed several variants of 5S rDNA repeats (four types of NTS), and networks created by a neighbor-net algorithm for each sequence type (I, II, III and IV) did not show clear species-specific clustering. A stable secondary structure was predicted, and upstream and downstream conserved regulatory elements were characterized. Sequence analyses also showed the presence of two putative pseudogenes in Channa marulius. The present study supports the conclusion that 5S rDNA repeats in the genus Channa evolved under a birth-and-death process.
Molecular phylogeny of noctilucoid dinoflagellates (Noctilucales, Dinophyceae).
Gómez, Fernando; Moreira, David; López-García, Purificación
2010-07-01
The order Noctilucales or class Noctiluciphyceae encompasses three families of aberrant dinoflagellates (Noctilucaceae, Leptodiscaceae and Kofoidiniaceae) that, at least in some life stages, lack typical dinoflagellate characters such as the ribbon-like transversal flagellum or condensed chromosomes. Noctiluca scintillans, the first dinoflagellate to be described, has been intensively investigated. However, its phylogenetic position based on the small subunit ribosomal DNA (SSU rDNA) sequence is unstable and controversial. Noctiluca has been placed either as an early diverging lineage that diverged after Oxyrrhis and before the dinokaryotes (core dinoflagellates) or as a recent lineage branching from unarmoured dinoflagellates in the order Gymnodiniales. So far, the lack of other noctilucoid sequences has hampered the elucidation of their phylogenetic relationships to other dinoflagellates. Furthermore, even the monophyly of the noctilucoids remained uncertain. We have determined SSU rRNA gene sequences for Kofoidiniaceae, those of the type Spatulodinium (=Gymnodinium) pseudonoctiluca and another Spatulodinium species, as well as of two species of Kofoidinium, and the first gene sequence of Leptodiscaceae, that of Abedinium (=Leptophyllus) dasypus. These taxa were collected from their type localities, the English Channel and the NW Mediterranean Sea, respectively. Phylogenetic analyses place the Noctilucales as a monophyletic group at a basal position close to parasites of the Marine Alveolate Group I (MAGI) and the Syndiniales (MAGII), before the core of dinokaryotic dinoflagellates, although with moderate support. 2010 Elsevier GmbH. All rights reserved.
BioWarehouse: a bioinformatics database warehouse toolkit
Lee, Thomas J; Pouliot, Yannick; Wagner, Valerie; Gupta, Priyanka; Stringer-Calvert, David WJ; Tenenbaum, Jessica D; Karp, Peter D
2006-01-01
Background This article addresses the problem of interoperation of heterogeneous bioinformatics databases. Results We introduce BioWarehouse, an open source toolkit for constructing bioinformatics database warehouses using the MySQL and Oracle relational database managers. BioWarehouse integrates its component databases into a common representational framework within a single database management system, thus enabling multi-database queries using the Structured Query Language (SQL) but also facilitating a variety of database integration tasks such as comparative analysis and data mining. BioWarehouse currently supports the integration of a pathway-centric set of databases including ENZYME, KEGG, and BioCyc, and in addition the UniProt, GenBank, NCBI Taxonomy, and CMR databases, and the Gene Ontology. Loader tools, written in the C and JAVA languages, parse and load these databases into a relational database schema. The loaders also apply a degree of semantic normalization to their respective source data, decreasing semantic heterogeneity. The schema supports the following bioinformatics datatypes: chemical compounds, biochemical reactions, metabolic pathways, proteins, genes, nucleic acid sequences, features on protein and nucleic-acid sequences, organisms, organism taxonomies, and controlled vocabularies. As an application example, we applied BioWarehouse to determine the fraction of biochemically characterized enzyme activities for which no sequences exist in the public sequence databases. The answer is that no sequence exists for 36% of enzyme activities for which EC numbers have been assigned. These gaps in sequence data significantly limit the accuracy of genome annotation and metabolic pathway prediction, and are a barrier for metabolic engineering. Complex queries of this type provide examples of the value of the data warehousing approach to bioinformatics research. 
Conclusion BioWarehouse embodies significant progress on the database integration problem for bioinformatics. PMID:16556315
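The kind of cross-database SQL query described in the application example can be sketched as follows. This is an illustration only: it uses Python's built-in `sqlite3` module rather than MySQL or Oracle, and the two-table schema (`enzyme_activity`, `protein_sequence`), the accession values, and the four toy EC entries are all assumptions made for the example. They are not the BioWarehouse schema or its data.

```python
import sqlite3

# In-memory toy warehouse; table and column names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE enzyme_activity (ec_number TEXT PRIMARY KEY);
    CREATE TABLE protein_sequence (accession TEXT PRIMARY KEY, ec_number TEXT);
    """
)
conn.executemany(
    "INSERT INTO enzyme_activity VALUES (?)",
    [("1.1.1.1",), ("2.7.1.1",), ("3.5.1.5",), ("4.2.1.11",)],
)
conn.executemany(
    "INSERT INTO protein_sequence VALUES (?, ?)",
    [("TOY0001", "1.1.1.1"), ("TOY0002", "1.1.1.1"), ("TOY0003", "2.7.1.1")],
)

# Count all EC numbers, and those with no matching sequence record.
# The LEFT JOIN leaves s.ec_number NULL where no sequence exists.
total, without_sequence = conn.execute(
    """
    SELECT COUNT(*), SUM(s.ec_number IS NULL)
    FROM enzyme_activity AS e
    LEFT JOIN (SELECT DISTINCT ec_number FROM protein_sequence) AS s
           ON s.ec_number = e.ec_number
    """
).fetchone()

fraction = without_sequence / total
print(f"{without_sequence} of {total} toy EC numbers lack a sequence ({fraction:.0%})")
conn.close()
```

Once both source databases share one relational schema, a question that spans them reduces to a single join, which is the benefit the abstract attributes to warehousing.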
BioWarehouse: a bioinformatics database warehouse toolkit.
Lee, Thomas J; Pouliot, Yannick; Wagner, Valerie; Gupta, Priyanka; Stringer-Calvert, David W J; Tenenbaum, Jessica D; Karp, Peter D
2006-03-23
This article addresses the problem of interoperation of heterogeneous bioinformatics databases. We introduce BioWarehouse, an open source toolkit for constructing bioinformatics database warehouses using the MySQL and Oracle relational database managers. BioWarehouse integrates its component databases into a common representational framework within a single database management system, thus enabling multi-database queries using the Structured Query Language (SQL) but also facilitating a variety of database integration tasks such as comparative analysis and data mining. BioWarehouse currently supports the integration of a pathway-centric set of databases including ENZYME, KEGG, and BioCyc, and in addition the UniProt, GenBank, NCBI Taxonomy, and CMR databases, and the Gene Ontology. Loader tools, written in the C and JAVA languages, parse and load these databases into a relational database schema. The loaders also apply a degree of semantic normalization to their respective source data, decreasing semantic heterogeneity. The schema supports the following bioinformatics datatypes: chemical compounds, biochemical reactions, metabolic pathways, proteins, genes, nucleic acid sequences, features on protein and nucleic-acid sequences, organisms, organism taxonomies, and controlled vocabularies. As an application example, we applied BioWarehouse to determine the fraction of biochemically characterized enzyme activities for which no sequences exist in the public sequence databases. The answer is that no sequence exists for 36% of enzyme activities for which EC numbers have been assigned. These gaps in sequence data significantly limit the accuracy of genome annotation and metabolic pathway prediction, and are a barrier for metabolic engineering. Complex queries of this type provide examples of the value of the data warehousing approach to bioinformatics research. BioWarehouse embodies significant progress on the database integration problem for bioinformatics.
High-Resolution Melting Analysis for Rapid Detection of Sequence Type 131 Escherichia coli.
Harrison, Lucas B; Hanson, Nancy D
2017-06-01
Escherichia coli isolates belonging to the sequence type 131 (ST131) clonal complex have been associated with the global distribution of fluoroquinolone and β-lactam resistance. Whole-genome sequencing and multilocus sequence typing identify sequence type but are expensive when evaluating large numbers of samples. This study was designed to develop a cost-effective screening tool using high-resolution melting (HRM) analysis to differentiate ST131 from non-ST131 E. coli in large sample populations in the absence of sequence analysis. The method was optimized using DNA from 12 E. coli isolates. Singleplex PCR was performed using 10 ng of DNA, Type-it HRM buffer, and multilocus sequence typing primers and was followed by multiplex PCR. The amplicon sizes ranged from 630 to 737 bp. Melt temperature peaks were determined by performing HRM analysis at 0.1°C resolution from 50 to 95°C on a Rotor-Gene Q 5-plex HRM system. Derivative melt curves were compared between sequence types and analyzed by principal component analysis. A blinded study of 191 E. coli isolates of ST131 and unknown sequence types validated this methodology. This methodology returned 99.2% specificity (124 true negatives and 1 false positive) and 100% sensitivity (66 true positives and 0 false negatives). This HRM methodology distinguishes ST131 from non-ST131 E. coli without sequence analysis. The analysis can be accomplished in about 3 h in any laboratory with an HRM-capable instrument and principal component analysis software. Therefore, this assay is a fast and cost-effective alternative to sequencing-based ST131 identification. Copyright © 2017 Harrison and Hanson.
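The reported sensitivity and specificity follow directly from the four counts given in the abstract. The short check below recomputes them; the function and variable names are our own.

```python
def sensitivity(true_pos, false_neg):
    """Fraction of actual positives (ST131) that the assay calls positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of actual negatives (non-ST131) that the assay calls negative."""
    return true_neg / (true_neg + false_pos)


# Counts as reported for the blinded validation set.
true_pos, false_neg = 66, 0
true_neg, false_pos = 124, 1

n_isolates = true_pos + false_neg + true_neg + false_pos
sens = sensitivity(true_pos, false_neg)
spec = specificity(true_neg, false_pos)

print(f"isolates    = {n_isolates}")   # 191
print(f"sensitivity = {sens:.1%}")     # 100.0%
print(f"specificity = {spec:.1%}")     # 99.2%
```

The counts sum to the 191 isolates in the blinded study, and 124/125 rounds to the stated 99.2 %.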
PFAAT version 2.0: a tool for editing, annotating, and analyzing multiple sequence alignments.
Caffrey, Daniel R; Dana, Paul H; Mathur, Vidhya; Ocano, Marco; Hong, Eun-Jong; Wang, Yaoyu E; Somaroo, Shyamal; Caffrey, Brian E; Potluri, Shobha; Huang, Enoch S
2007-10-11
By virtue of their shared ancestry, homologous sequences are similar in their structure and function. Consequently, multiple sequence alignments are routinely used to identify trends that relate to function. This type of analysis is particularly productive when it is combined with structural and phylogenetic analysis. Here we describe the release of PFAAT version 2.0, a tool for editing, analyzing, and annotating multiple sequence alignments. Support for multiple annotations is a key component of this release as it provides a framework for most of the new functionalities. The sequence annotations are accessible from the alignment and tree, where they are typically used to label sequences or hyperlink them to related databases. Sequence annotations can be created manually or extracted automatically from UniProt entries. Once a multiple sequence alignment is populated with sequence annotations, sequences can be easily selected and sorted through a sophisticated search dialog. The selected sequences can be further analyzed using statistical methods that explicitly model relationships between the sequence annotations and residue properties. Residue annotations are accessible from the alignment viewer and are typically used to designate binding sites or properties for a particular residue. Residue annotations are also searchable, and allow one to quickly select alignment columns for further sequence analysis, e.g. computing percent identities. Other features include: novel algorithms to compute sequence conservation, mapping conservation scores to a 3D structure in Jmol, displaying secondary structure elements, and sorting sequences by residue composition. PFAAT provides a framework whereby end-users can specify knowledge for a protein family in the form of annotation. The annotations can be combined with sophisticated analysis to test hypothesis that relate to sequence, structure and function.
Radiation effects on MOS devices - dosimetry, annealing, irradiation sequence, and sources
NASA Technical Reports Server (NTRS)
Stassinopoulos, E. G.; Brucker, G. J.; Van Gunten, O.; Knudson, A. R.; Jordan, T. M.
1983-01-01
This paper reports on some investigations of dosimetry, annealing, irradiation sequences, and radioactive sources involved in the determination of radiation effects on MOS devices. Results show that agreement between the experimental and theoretical surface-to-average doses supports the use of thermo-luminescent dosimeters (manganese-activated calcium fluoride) in specifying the surface dose delivered to thin gate insulators of MOS devices. Annealing measurements indicate the existence of at least two energy levels, or activation energies, for recovery of soft oxide MOS devices after irradiation by electrons, protons, and gammas. Damage sensitivities of MOS devices were found to be independent of combinations and sequences of radiation type or energies. Comparison of various gamma sources indicated a small dependence of damage sensitivity on the Cobalt facility, but a more significant dependence in the case of the Cesium source. These results were attributed to differences in the spectral content of the several sources.
Kottmann, Renzo; Gray, Tanya; Murphy, Sean; Kagan, Leonid; Kravitz, Saul; Lombardot, Thierry; Field, Dawn; Glöckner, Frank Oliver
2008-06-01
The Genomic Contextual Data Markup Language (GCDML) is a core project of the Genomic Standards Consortium (GSC) that implements the "Minimum Information about a Genome Sequence" (MIGS) specification and its extension, the "Minimum Information about a Metagenome Sequence" (MIMS). GCDML is an XML Schema for generating MIGS/MIMS compliant reports for data entry, exchange, and storage. When mature, this sample-centric, strongly-typed schema will provide a diverse set of descriptors for describing the exact origin and processing of a biological sample, from sampling to sequencing, and subsequent analysis. Here we describe the need for such a project, outline design principles required to support the project, and make an open call for participation in defining the future content of GCDML. GCDML is freely available, and can be downloaded, along with documentation, from the GSC Web site (http://gensc.org).
Xiao, Chao-Ting; Halbur, Patrick G; Opriessnig, Tanja
2015-07-01
The oldest porcine circovirus type 2 (PCV2) sequence dates back to 1962 and is among several hundreds of publicly available PCV2 sequences. Despite this resource, few studies have investigated the global genetic diversity of PCV2. To evaluate the phylogenetic relationship of PCV2 strains, 1680 PCV2 open reading frame 2 (ORF2) sequences were compared and analysed by methods of neighbour-joining, maximum-likelihood, Bayesian inference and network analysis. Four distinct clades were consistently identified and included PCV2a, PCV2b, PCV2c and PCV2d; the p-distance between PCV2d and PCV2b was 0.055±0.008, larger than the PCV2 genotype-definition cut-off of 0.035, supporting PCV2d as an independent genotype. Among the 1680 sequences, 278-285 (16.5-17 %) were classified as PCV2a, 1007-1058 (59.9-63 %) as PCV2b, three (0.2 %) as PCV2c and 322-323 (19.2 %) as PCV2d, with the remaining 12-78 sequences (0.7-4.6 %) classified as intermediate clades or strains by the various methods. Classification of strains to genotypes differed based on the number of sequences used for the analysis, indicating that sample size is important when determining classification and assessing PCV2 trends and shifts. PCV2d was initially identified in 1999 in samples collected in Switzerland, now appears to be widespread in China and has been present in North America since 2012. During 2012-2013, 37 % of all investigated PCV2 sequences from US pigs were classified as PCV2d and overall data analysis suggests an ongoing genotype shift from PCV2b towards PCV2d. The present analyses indicate that PCV2d emerged approximately 20 years ago.
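The p-distance used for the genotype cut-off is the proportion of aligned sites at which two sequences differ. A minimal pairwise sketch follows, assuming pre-aligned sequences and skipping gap (`-`) and `N` positions. The two short sequences are made up for illustration and are not PCV2 data. The abstract's value of 0.055±0.008 is a between-genotype figure over many sequences, which this sketch does not reproduce.

```python
GENOTYPE_CUTOFF = 0.035  # PCV2 genotype-definition cut-off quoted in the abstract


def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two aligned sequences.

    Positions where either sequence has a gap ('-') or 'N' are ignored.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Hypothetical 20-nt aligned fragments differing at one site.
toy_a = "ATGACGTATCCAAGGAGGCG"
toy_b = "ATGACGTACCCAAGGAGGCG"

d = p_distance(toy_a, toy_b)
print(f"p-distance = {d:.3f}; exceeds cut-off: {d > GENOTYPE_CUTOFF}")
```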
Chae, Minho; Danko, Charles G; Kraus, W Lee
2015-07-16
Global run-on coupled with deep sequencing (GRO-seq) provides extensive information on the location and function of coding and non-coding transcripts, including primary microRNAs (miRNAs), long non-coding RNAs (lncRNAs), and enhancer RNAs (eRNAs), as well as yet undiscovered classes of transcripts. However, few computational tools tailored toward this new type of sequencing data are available, limiting the applicability of GRO-seq data for identifying novel transcription units. Here, we present groHMM, a computational tool in R, which defines the boundaries of transcription units de novo using a two state hidden-Markov model (HMM). A systematic comparison of the performance between groHMM and two existing peak-calling methods tuned to identify broad regions (SICER and HOMER) favorably supports our approach on existing GRO-seq data from MCF-7 breast cancer cells. To demonstrate the broader utility of our approach, we have used groHMM to annotate a diverse array of transcription units (i.e., primary transcripts) from four GRO-seq data sets derived from cells representing a variety of different human tissue types, including non-transformed cells (cardiomyocytes and lung fibroblasts) and transformed cells (LNCaP and MCF-7 cancer cells), as well as non-mammalian cells (from flies and worms). As an example of the utility of groHMM and its application to questions about the transcriptome, we show how groHMM can be used to analyze cell type-specific enhancers as defined by newly annotated enhancer transcripts. Our results show that groHMM can reveal new insights into cell type-specific transcription by identifying novel transcription units, and serve as a complete and useful tool for evaluating functional genomic elements in cells.
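The idea of calling transcription-unit boundaries with a two-state HMM can be sketched with a generic Viterbi decoder. This is not groHMM's implementation (groHMM is an R package, and its emission model and parameter fitting are not described in the abstract). The binarized observations (1 = reads present in a window, 0 = none) and all probabilities below are assumptions chosen for illustration.

```python
import math


def viterbi(observations, start, trans, emit):
    """Most likely state path for a discrete-emission HMM, computed in log space."""
    n_states = len(start)
    # score[s]: best log-probability of any path ending in state s so far
    score = [math.log(start[s]) + math.log(emit[s][observations[0]])
             for s in range(n_states)]
    backpointers = []
    for obs in observations[1:]:
        new_score, pointers = [], []
        for s in range(n_states):
            best_prev = max(range(n_states),
                            key=lambda p: score[p] + math.log(trans[p][s]))
            new_score.append(score[best_prev] + math.log(trans[best_prev][s])
                             + math.log(emit[s][obs]))
            pointers.append(best_prev)
        score = new_score
        backpointers.append(pointers)
    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def transcribed_runs(path):
    """Return (start, end) window indices, end exclusive, for runs of state 1."""
    runs, run_start = [], None
    for i, state in enumerate(path):
        if state == 1 and run_start is None:
            run_start = i
        elif state != 1 and run_start is not None:
            runs.append((run_start, i))
            run_start = None
    if run_start is not None:
        runs.append((run_start, len(path)))
    return runs


# Hypothetical parameters: state 0 = non-transcribed, state 1 = transcribed.
START = [0.5, 0.5]
TRANS = [[0.9, 0.1], [0.1, 0.9]]   # states tend to persist
EMIT = [[0.9, 0.1], [0.2, 0.8]]    # EMIT[state][observation]

windows = [0, 0, 0, 1, 1, 1, 1, 0, 0, 0]
path = viterbi(windows, START, TRANS, EMIT)
units = transcribed_runs(path)

# A single empty window inside a covered region is smoothed over,
# because leaving and re-entering the transcribed state costs more.
gapped_path = viterbi([1, 1, 1, 0, 1, 1, 1], START, TRANS, EMIT)
print(units, gapped_path)
```

With these toy parameters the decoder keeps one contiguous transcribed unit across the isolated empty window, which a simple threshold on per-window coverage would split in two.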
Prevalence, genetic diversity and recombination of species G enteroviruses infecting pigs in Vietnam
Van Dung, Nguyen; Anh, Pham Hong; Van Cuong, Nguyen; Hoa, Ngo Thi; Carrique-Mas, Juan; Hien, Vo Be; Campbell, James; Baker, Stephen; Farrar, Jeremy; Woolhouse, Mark E.; Bryant, Juliet E.
2014-01-01
Picornaviruses infecting pigs, described for many years as ‘porcine enteroviruses’, have recently been recognized as distinct viruses within three distinct genera (Teschovirus, Sapelovirus and Enterovirus). To better characterize the epidemiology and genetic diversity of members of the Enterovirus genus, faecal samples from pigs from four provinces in Vietnam were screened by PCR using conserved enterovirus (EV)-specific primers from the 5′ untranslated region (5′ UTR). High rates of infection were recorded in pigs on all farms, with detection frequencies of approximately 90 % in recently weaned pigs but declining to 40 % in those aged over 1 year. No differences in EV detection rates were observed between pigs with and without diarrhoea [74 % (n = 70) compared with 72 % (n = 128)]. Genetic analysis of consensus VP4/VP2 and VP1 sequences amplified from a subset of EV-infected pigs identified species G EVs in all samples. Among these, VP1 sequence comparisons identified six type 1 and seven type 6 variants, while four further VP1 sequences failed to group with any previously identified EV-G types. These have now been formally assigned as EV-G types 8–11 by the Picornavirus Study Group. Comparison of VP1, VP4/VP2, 3Dpol and 5′ UTRs of study samples and those available on public databases showed frequent, bootstrap-supported differences in their phylogenies indicative of extensive within-species recombination between genome regions. In summary, we identified extremely high frequencies of infection with EV-G in pigs in Vietnam, substantial genetic diversity and recombination within the species, and evidence for a much larger number of circulating EV-G types than currently described. PMID:24323635
Di Pierro, Michele; Cheng, Ryan R; Lieberman Aiden, Erez; Wolynes, Peter G; Onuchic, José N
2017-11-14
Inside the cell nucleus, genomes fold into organized structures that are characteristic of cell type. Here, we show that this chromatin architecture can be predicted de novo using epigenetic data derived from chromatin immunoprecipitation-sequencing (ChIP-Seq). We exploit the idea that chromosomes encode a 1D sequence of chromatin structural types. Interactions between these chromatin types determine the 3D structural ensemble of chromosomes through a process similar to phase separation. First, a neural network is used to infer the relation between the epigenetic marks present at a locus, as assayed by ChIP-Seq, and the genomic compartment in which those loci reside, as measured by DNA-DNA proximity ligation (Hi-C). Next, types inferred from this neural network are used as an input to an energy landscape model for chromatin organization [Minimal Chromatin Model (MiChroM)] to generate an ensemble of 3D chromosome conformations at a resolution of 50 kilobases (kb). After training the model, dubbed Maximum Entropy Genomic Annotation from Biomarkers Associated to Structural Ensembles (MEGABASE), on odd-numbered chromosomes, we predict the sequences of chromatin types and the subsequent 3D conformational ensembles for the even chromosomes. We validate these structural ensembles by using ChIP-Seq tracks alone to predict Hi-C maps, as well as distances measured using 3D fluorescence in situ hybridization (FISH) experiments. Both sets of experiments support the hypothesis of phase separation being the driving process behind compartmentalization. These findings strongly suggest that epigenetic marking patterns encode sufficient information to determine the global architecture of chromosomes and that de novo structure prediction for whole genomes may be increasingly possible. Copyright © 2017 the Author(s). Published by PNAS.
David, Sophia; Afshar, Baharak; Mentasti, Massimo; Ginevra, Christophe; Podglajen, Isabelle; Harris, Simon R; Chalker, Victoria J; Jarraud, Sophie; Harrison, Timothy G; Parkhill, Julian
2017-05-01
Legionnaires' disease is an important cause of hospital-acquired pneumonia and is caused by infection with the bacterium Legionella. Because current typing methods often fail to resolve the infection source in possible nosocomial cases, we aimed to determine whether whole-genome sequencing (WGS) could be used to support or refute suspected links between cases and hospitals. We focused on cases involving a major nosocomial-associated strain, L. pneumophila sequence type (ST) 1. WGS data from 229 L. pneumophila ST1 isolates were analyzed, including 99 isolates from the water systems of 17 hospitals and 42 clinical isolates from patients with confirmed or suspected hospital-acquired infections, as well as isolates obtained from or associated with community-acquired sources of Legionnaires' disease. Phylogenetic analysis demonstrated that all hospitals from which multiple isolates were obtained have been colonized by 1 or more distinct ST1 populations. However, deep sampling of 1 hospital also revealed the existence of substantial diversity and ward-specific microevolution within the population. Across all hospitals, suspected links with cases were supported with WGS, although the degree of support was dependent on the depth of environmental sampling and available contextual information. Finally, phylogeographic analysis revealed that hospitals have been seeded with L. pneumophila via both local and international spread of ST1. WGS can be used to support or refute suspected links between hospitals and Legionnaires' disease cases. However, deep hospital sampling is frequently required due to the potential coexistence of multiple populations, existence of substantial diversity, and similarity of hospital isolates to local populations. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America.
Van Ooteghem, Karen; Frank, James S; Allard, Fran; Horak, Fay B
2010-08-01
Postural motor learning for dynamic balance tasks has been demonstrated in healthy older adults (Van Ooteghem et al. in Exp Brain Res 199(2):185-193, 2009). The purpose of this study was to investigate the type of knowledge (general or specific) obtained with balance training in this age group and to examine whether embedding perturbation regularities within a balance task masks specific learning. Two groups of older adults maintained balance on a translating platform that oscillated with variable amplitude and constant frequency. One group was trained using an embedded-sequence (ES) protocol which contained the same 15-s sequence of variable amplitude oscillations in the middle of each trial. A second group was trained using a looped-sequence (LS) protocol which contained a 15-s sequence repeated three times to form each trial. All trials were 45 s. Participants were not informed of any repetition. To examine learning, participants performed a retention test following a 24-h delay. LS participants also completed a transfer task. Specificity of learning was examined by comparing performance for repeated versus random sequences (ES) and training versus transfer sequences (LS). Performance was measured by deriving spatial and temporal measures of whole body center of mass (COM) and trunk orientation. Both groups improved performance with practice as characterized by reduced COM displacement, improved COM-platform phase relationships, and decreased angular trunk motion. Furthermore, improvements reflected general rather than specific postural motor learning regardless of training protocol (ES or LS). This finding is similar to young adults (Van Ooteghem et al. in Exp Brain Res 187(4):603-611, 2008) and indicates that age does not influence the type of learning which occurs for balance control.
Hardwicke, Joseph T; Richards, Helen; Cafferky, Louise; Underwood, Imogen; ter Horst, Britt; Slator, Rona
2016-03-01
Pierre Robin sequence results from a cascade of events that occur during embryologic development and frequently presents with cleft palate. Some studies have shown speech outcomes to be worse in patients with Pierre Robin sequence after cleft palate repair. A cohort of Pierre Robin sequence patients who all required an airway intervention and nasogastric feeding in the neonatal period were identified and speech outcomes assessed at 5 years of age. A cleft- and sex-matched non-Pierre Robin sequence, cleft palate-only comparison group was also identified from the same institution and study period. A total of 24 patients with Pierre Robin sequence that required airway and nutritional support in the neonatal period were matched for age, sex, and cleft type to a group of 24 non-Pierre Robin sequence cleft patients. There was no significant difference in the incidence of oronasal fistula between the groups. Secondary surgery for velopharyngeal incompetence was significantly more (p = 0.017) in the Pierre Robin sequence group, who also had significantly greater nasality (p = 0.031) and cleft speech characteristic (p = 0.023) scores. The authors hypothesize that other factors may exist in Pierre Robin sequence that may lead to poor speech outcomes. The authors would suggest counseling parents of children with Pierre Robin sequence that have required a neonatal airway intervention, that speech development may be poorer than in other children with cleft palate, and that these children will have a significantly higher incidence of secondary speech surgery. Risk, II.
Sequence modelling and an extensible data model for genomic database
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Peter Wei-Der
1992-01-01
The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of this information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMSs do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data model that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the "Extensible Object Model", to address the need of a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implemented the query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.
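As an illustration of what "abstract and biological sequence operators" over a conceptual sequence model might look like, the following is a minimal sketch only. The class name `DnaSequence` and its three operators are hypothetical and are not taken from the thesis; they simply show a sequence type that separates generic operators (subsequence, concatenation) from a biology-specific one (reverse complement).

```python
from dataclasses import dataclass

# Translation table for complementing DNA bases (A<->T, C<->G).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


@dataclass(frozen=True)
class DnaSequence:
    """Hypothetical conceptual sequence type; not the model defined in the source."""

    bases: str

    # Abstract (domain-independent) operators.
    def subsequence(self, start: int, end: int) -> "DnaSequence":
        """Return the half-open interval [start, end) as a new sequence."""
        return DnaSequence(self.bases[start:end])

    def concat(self, other: "DnaSequence") -> "DnaSequence":
        """Return this sequence followed by another."""
        return DnaSequence(self.bases + other.bases)

    # Biological (domain-specific) operator.
    def reverse_complement(self) -> "DnaSequence":
        """Complement every base, then reverse the order."""
        return DnaSequence(self.bases.translate(_COMPLEMENT)[::-1])
```

Keeping the operators on an immutable value type means every operation returns a new sequence, which is one simple way to make such operators composable in a query language.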
NASA Astrophysics Data System (ADS)
Munyanyiwa, Hubert
1999-02-01
The Umkondo Group is a supracrustal sequence cropping out in eastern Zimbabwe in the Nyanga, Chimanimani and Chipinge Districts. In these areas the sequence has been divided into a weakly metamorphosed and deformed unit of argillaceous, arenaceous and carbonate rocks (Zimbabwe facies) in the west, and a strongly deformed and medium- to high-grade metamorphosed sequence of mainly quartzites and metapelites (Mozambique facies) in the east. The two sequences were tectonically juxtaposed during the Neoproterozoic Pan-African Mozambique Belt deformation. The Zimbabwe facies sedimentary rocks are intruded by extensive dolerite sills and minor interlayered basalt flows. The mafic rocks are sub-alkaline continental tholeiites. They have low mg numbers associated with low Cr, Cu, Ni and Co, which indicate that the parental magma underwent some differentiation processes en route to the surface. They are LREE enriched with (La/Yb)N = 5.0-7.6, high Ce/Yb (>10) and La/Nb (>0.5) values, and exhibit troughs at Nb, Sr, Ti and P on a MORB-normalised, multi-element spider diagram. These chemical characteristics, together with the large areal extent of the Umkondo dolerites and basalts, suggest that the Umkondo mafic igneous suite was once widespread and formed part of a continental flood basalt province. This is supported by the depositional environment (shallow water platform type setting) of the sedimentary sequence into which the mafic rocks were emplaced. The widespread occurrence of the Umkondo igneous event is further supported by the similarity in palæomagnetic poles of a number of mafic units in southern Africa.
Adamiak, Paul; Vanderkooi, Otto G; Kellner, James D; Schryvers, Anthony B; Bettinger, Julie A; Alcantara, Joenel
2014-06-03
Multi-locus sequence typing (MLST) is a portable, broadly applicable method for classifying bacterial isolates at an intra-species level. This methodology provides clinical and scientific investigators with a standardized means of monitoring evolution within bacterial populations. MLST uses the DNA sequences from a set of genes such that each unique combination of sequences defines an isolate's sequence type. In order to reliably determine the sequence of a typing gene, matching sequence reads for both strands of the gene must be obtained. This study assesses the ability of both the standard, and an alternative set of, Streptococcus pneumoniae MLST primers to completely sequence, in both directions, the required typing alleles. The results demonstrated that for five (aroE, recP, spi, xpt, ddl) of the seven S. pneumoniae typing alleles, the standard primers were unable to obtain the complete forward and reverse sequences. This is due to the standard primers annealing too closely to the target regions, and current sequencing technology failing to sequence the bases that are too close to the primer. The alternative primer set described here, which includes a combination of primers proposed by the CDC and several designed as part of this study, addresses this limitation by annealing to highly conserved segments further from the target region. This primer set was subsequently employed to sequence type 105 S. pneumoniae isolates collected by the Canadian Immunization Monitoring Program ACTive (IMPACT) over a period of 18 years. The inability of several of the standard S. pneumoniae MLST primers to fully sequence the required region was consistently observed and is the result of a shift in sequencing technology occurring after the original primers were designed. 
The results presented here introduce clear documentation describing this phenomenon into the literature, and provide additional guidance, through the introduction of a widely validated set of alternative primers, to research groups seeking to undertake S. pneumoniae MLST based studies.
eXframe: reusable framework for storage, analysis and visualization of genomics experiments
2011-01-01
Background Genome-wide experiments are routinely conducted to measure gene expression, DNA-protein interactions and epigenetic status. Structured metadata for these experiments is imperative for a complete understanding of experimental conditions, to enable consistent data processing and to allow retrieval, comparison, and integration of experimental results. Even though several repositories have been developed for genomics data, only a few provide annotation of samples and assays using controlled vocabularies. Moreover, many of them are tailored for a single type of technology or measurement and do not support the integration of multiple data types. Results We have developed eXframe - a reusable web-based framework for genomics experiments that provides 1) the ability to publish structured data compliant with accepted standards 2) support for multiple data types including microarrays and next generation sequencing 3) query, analysis and visualization integration tools (enabled by consistent processing of the raw data and annotation of samples) and is available as open-source software. We present two case studies where this software is currently being used to build repositories of genomics experiments - one contains data from hematopoietic stem cells and another from Parkinson's disease patients. Conclusion The web-based framework eXframe offers structured annotation of experiments as well as uniform processing and storage of molecular data from microarray and next generation sequencing platforms. The framework allows users to query and integrate information across species, technologies, measurement types and experimental conditions. Our framework is reusable and freely modifiable - other groups or institutions can deploy their own custom web-based repositories based on this software. It is interoperable with the most important data formats in this domain. We hope that other groups will not only use eXframe, but also contribute their own useful modifications. 
PMID:22103807
McDowell, Andrew; Barnard, Emma; Liu, Jared; Li, Huiying; Patrick, Sheila
2016-12-01
Recently, it has been proposed that strains of Propionibacterium acnes from the type III genetic division should be classified as P. acnes subsp. elongatum subsp. nov., with strains from the type I and II divisions collectively classified as P. acnes subsp. acnes subsp. nov. Under such a taxonomic re-appraisal, we believe that types I and II should also have their own separate rank of subspecies. In support of this, we describe a polyphasic taxonomic study based on the analysis of publicly available multilocus and whole-genome sequence datasets, alongside a systematic review of previously published phylogenetic, genomic, phenotypic and clinical data. Strains of types I and II form highly distinct clades on the basis of multilocus sequence analysis (MLSA) and whole-genome phylogenetic reconstructions. In silico or digital DNA-DNA similarity values also fall within the 70-80 % boundary recommended for bacterial subspecies. Furthermore, we see important differences in genome content, including the presence of an active CRISPR/Cas system in type II strains, but not type I, and evidence for increasing linkage equilibrium within the separate divisions. Key biochemical differences include positive test results for β-haemolytic, neuraminidase and sorbitol fermentation activities with type I strains, but not type II. We now propose that type I strains should be classified as P. acnes subsp. acnes subsp. nov., and type II as P. acnes subsp. defendens subsp. nov. The type strain of P. acnes subsp. acnes subsp. nov. is NCTC 737T (=ATCC 6919T=JCM 6425T=DSM 1897T=CCUG 1794T), while the type strain of P. acnes subsp. defendens subsp. nov. is ATCC 11828 (=JCM 6473=CCUG 6369).
Serotype IV Sequence Type 468 Group B Streptococcus Neonatal Invasive Disease, Minnesota, USA.
Teatero, Sarah; Ferrieri, Patricia; Fittipaldi, Nahuel
2016-11-01
To further understand the emergence of serotype IV group B Streptococcus (GBS) invasive disease, we used whole-genome sequencing to characterize 3 sequence type 468 strains isolated from neonates in Minnesota, USA. We found that strains of tetracycline-resistant sequence type 468 GBS have acquired virulence genes from a putative clonal complex 17 GBS donor by recombination.
Multilocus sequence typing of total-genome-sequenced bacteria.
Larsen, Mette V; Cosentino, Salvatore; Rasmussen, Simon; Friis, Carsten; Hasman, Henrik; Marvig, Rasmus Lykke; Jelsbak, Lars; Sicheritz-Pontén, Thomas; Ussery, David W; Aarestrup, Frank M; Lund, Ole
2012-04-01
Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the "gold standard" of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.
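The final step described above, deriving a sequence type from the combination of alleles identified, can be sketched as a simple table lookup. This is a minimal sketch under stated assumptions: the locus names and the tiny profile table are invented for illustration, and the BLAST-based allele ranking that the published method uses to find the best-matching alleles is not reproduced here.

```python
from typing import Dict, Optional, Tuple

# Hypothetical seven-locus scheme; real schemes define their own loci.
LOCI: Tuple[str, ...] = (
    "locus_a", "locus_b", "locus_c", "locus_d",
    "locus_e", "locus_f", "locus_g",
)

# Hypothetical profile table: allele-number combination -> sequence type.
PROFILES: Dict[Tuple[int, ...], int] = {
    (1, 1, 1, 1, 1, 1, 1): 1,
    (1, 1, 1, 1, 1, 1, 2): 2,
    (2, 5, 3, 1, 4, 1, 1): 3,
}


def sequence_type(alleles: Dict[str, int]) -> Optional[int]:
    """Return the sequence type for an allele profile, or None if unknown.

    `alleles` maps each locus name to the allele number already assigned
    to the isolate at that locus.
    """
    # Order the allele numbers by the scheme's locus order to form the profile.
    profile = tuple(alleles[locus] for locus in LOCI)
    return PROFILES.get(profile)
```

A profile missing from the table returns `None`, which in this sketch stands in for an allele combination not present in the reference data.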
Characterization of six mutations in Exon 37 of neurofibromatosis type 1 gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Upadhyaya, M.; Osborn, M.; Maynard, J.
Neurofibromatosis type 1 (NF1) is one of the most common inherited disorders, with an incidence of 1 in 3,000. We screened a total of 320 unrelated NF1 patients for mutations in exon 37 of the NF1 gene. Six independent mutations were identified, of which three are novel, and these include a recurrent nonsense mutation identified in 2 unrelated patients at codon 2281 (G2281X), a 1-bp insertion (6791 ins A) resulting in a change of TAG (tyrosine) to a TAA (stop codon), and a 3-bp deletion (6839 del TAC) which generated a frameshift. Another recurrent nonsense mutation, Y2264X, which was detected in 2 unrelated patients in this study, was also previously reported in 2 NF1 individuals. All the mutations were identified within a contiguous 49-bp sequence. Further studies are warranted to support the notion that this region of the gene contains highly mutable sequences. 17 refs., 2 figs., 1 tab.
Chalker, Victoria J; Smith, Alyson; Al-Shahib, Ali; Botchway, Stella; Macdonald, Emily; Daniel, Roger; Phillips, Sarah; Platt, Steven; Doumith, Michel; Tewolde, Rediat; Coelho, Juliana; Jolley, Keith A; Underwood, Anthony; McCarthy, Noel D
2016-06-01
Single-strain outbreaks of Streptococcus pyogenes infections are common and often go undetected. In 2013, two clusters of invasive group A Streptococcus (iGAS) infection were identified in independent but closely located care homes in Oxfordshire, United Kingdom. Investigation included visits to each home, chart review, staff survey, microbiologic sampling, and genome sequencing. S. pyogenes emm type 1.0, the most common circulating type nationally, was identified from all cases yielding GAS isolates. A tailored whole-genome reference population comprising epidemiologically relevant contemporaneous isolates and published isolates was assembled. Data were analyzed independently using whole-genome multilocus sequencing and single-nucleotide polymorphism analyses. Six isolates from staff and residents of the homes formed a single cluster that was separated from the reference population by both analytical approaches. No further cases occurred after mass chemoprophylaxis and enhanced infection control. Our findings demonstrate the ability of 2 independent analytical approaches to enable robust conclusions from nonstandardized whole-genome analysis to support public health practice.
PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities.
Troshin, Peter V; Postis, Vincent Lg; Ashworth, Denise; Baldwin, Stephen A; McPherson, Michael J; Barton, Geoffrey J
2011-03-07
Facilities that provide a service for DNA sequencing typically support large numbers of users and experiment types. The cost of services is often reduced by the use of liquid handling robots but the efficiency of such facilities is hampered because the software for such robots does not usually integrate well with the systems that run the sequencing machines. Accordingly, there is a need for software systems capable of integrating different robotic systems and managing sample information for DNA sequencing services. In this paper, we describe an extension to the Protein Information Management System (PIMS) that is designed for DNA sequencing facilities. The new version of PIMS has a user-friendly web interface and integrates all aspects of the sequencing process, including sample submission, handling and tracking, together with capture and management of the data. The PIMS sequencing extension has been in production since July 2009 at the University of Leeds DNA Sequencing Facility. It has completely replaced manual data handling and simplified the tasks of data management and user communication. Samples from 45 groups have been processed with an average throughput of 10000 samples per month. The current version of the PIMS sequencing extension works with Applied Biosystems 3130XL 96-well plate sequencer and MWG 4204 or Aviso Theonyx liquid handling robots, but is readily adaptable for use with other combinations of robots. PIMS has been extended to provide a user-friendly and integrated data management solution for DNA sequencing facilities that is accessed through a normal web browser and allows simultaneous access by multiple users as well as facility managers. The system integrates sequencing and liquid handling robots, manages the data flow, and provides remote access to the sequencing results. The software is freely available, for academic users, from http://www.pims-lims.org/.
Glass, Leslie L; Calero-Nieto, Fernando J; Jawaid, Wajid; Larraufie, Pierre; Kay, Richard G; Göttgens, Berthold; Reimann, Frank; Gribble, Fiona M
2017-10-01
To identify sub-populations of intestinal preproglucagon-expressing (PPG) cells producing Glucagon-like Peptide-1, and their associated expression profiles of sensory receptors, thereby enabling the discovery of therapeutic strategies that target these cell populations for the treatment of diabetes and obesity. We performed single cell RNA sequencing of PPG-cells purified by flow cytometry from the upper small intestine of 3 GLU-Venus mice. Cells from 2 mice were sequenced at low depth, and from the third mouse at high depth. High quality sequencing data from 234 PPG-cells were used to identify clusters by tSNE analysis. qPCR was performed to compare the longitudinal and crypt/villus locations of cluster-specific genes. Immunofluorescence and mass spectrometry were used to confirm protein expression. PPG-cells formed 3 major clusters: a group with typical characteristics of classical L-cells, including high expression of Gcg and Pyy (comprising 51% of all PPG-cells); a cell type overlapping with Gip-expressing K-cells (14%); and a unique cluster expressing Tph1 and Pzp that was predominantly located in proximal small intestine villi and co-produced 5-HT (35%). Expression of G-protein coupled receptors differed between clusters, suggesting the cell types are differentially regulated and would be differentially targetable. Our findings support the emerging concept that many enteroendocrine cell populations are highly overlapping, with individual cells producing a range of peptides previously assigned to distinct cell types. Different receptor expression profiles across the clusters highlight potential drug targets to increase gut hormone secretion for the treatment of diabetes and obesity. Copyright © 2017 The Authors. Published by Elsevier GmbH. All rights reserved.
Development of Mycoplasma synoviae (MS) core genome multilocus sequence typing (cgMLST) scheme.
Ghanem, Mostafa; El-Gazzar, Mohamed
2018-05-01
Mycoplasma synoviae (MS) is a poultry pathogen with reported increased prevalence and virulence in recent years. MS strain identification is essential for prevention, control efforts and epidemiological outbreak investigations. Multiple multilocus-based sequence typing schemes have been developed for MS, yet the resolution of these schemes could be limited for outbreak investigation. The cost of whole genome sequencing has become close to that of sequencing the seven MLST targets; however, there is no standardized method for typing MS strains based on whole genome sequences. In this paper, we propose a core genome multilocus sequence typing (cgMLST) scheme as a standardized and reproducible method for typing MS based on whole genome sequences. A diverse set of 25 MS whole genome sequences was used to identify 302 core genome genes as cgMLST targets (35.5% of the MS genome), and 44 whole genome sequences of MS isolates from six countries in four continents were typed using this scheme. cgMLST-based phylogenetic trees displayed a high degree of agreement with core genome SNP-based analysis and available epidemiological information. cgMLST allowed evaluation of two conventional MLST schemes of MS. The high discriminatory power of cgMLST allowed differentiation between samples of the same conventional MLST type. cgMLST represents a standardized, accurate, highly discriminatory, and reproducible method for differentiation between MS isolates. Like conventional MLST, it provides stable and expandable nomenclature, allowing for comparing and sharing the typing results between different laboratories worldwide. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
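One common way to compare isolates typed with a cgMLST scheme is an allelic distance: the number of shared target loci at which two isolates carry different alleles. The sketch below illustrates that idea only; it is not the procedure used in the study, and the function name, the use of `None` for an uncalled locus, and the example gene names are all assumptions.

```python
from typing import Dict, Optional

# An allele profile maps a cgMLST target name to an allele number,
# or to None when no allele could be called for that target.
Profile = Dict[str, Optional[int]]


def allelic_distance(a: Profile, b: Profile) -> int:
    """Count loci called in both profiles whose allele numbers differ."""
    distance = 0
    for locus, allele_a in a.items():
        allele_b = b.get(locus)
        # Skip loci that are missing or uncalled in either isolate.
        if allele_a is None or allele_b is None:
            continue
        if allele_a != allele_b:
            distance += 1
    return distance
```

For example, with hypothetical targets `g1` to `g4`, profiles `{"g1": 1, "g2": 2, "g3": None, "g4": 5}` and `{"g1": 1, "g2": 3, "g3": 4, "g4": 6}` differ at `g2` and `g4`, while `g3` is ignored because it is uncalled in the first isolate.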
Hölzemer, Angelique; Thobakgale, Christina F; Jimenez Cruz, Camilo A; Garcia-Beltran, Wilfredo F; Carlson, Jonathan M; van Teijlingen, Nienke H; Mann, Jaclyn K; Jaggernath, Manjeetha; Kang, Seung-gu; Körner, Christian; Chung, Amy W; Schafer, Jamie L; Evans, David T; Alter, Galit; Walker, Bruce D; Goulder, Philip J; Carrington, Mary; Hartmann, Pia; Pertel, Thomas; Zhou, Ruhong; Ndung'u, Thumbi; Altfeld, Marcus
2015-11-01
Viruses can evade immune surveillance, but the underlying mechanisms are insufficiently understood. Here, we sought to understand the mechanisms by which natural killer (NK) cells recognize HIV-1-infected cells and how this virus can evade NK-cell-mediated immune pressure. Two sequence mutations in p24 Gag associated with the presence of specific KIR/HLA combined genotypes were identified in HIV-1 clade C viruses from a large cohort of infected, untreated individuals in South Africa (n = 392), suggesting viral escape from KIR+ NK cells through sequence variations within HLA class I-presented epitopes. One sequence polymorphism at position 303 of p24 Gag (TGag303V), selected for in infected individuals with both KIR2DL3 and HLA-C*03:04, enabled significantly better binding of the inhibitory KIR2DL3 receptor to HLA-C*03:04-expressing cells presenting this variant epitope compared to the wild-type epitope (wild-type mean 18.01 ± 10.45 standard deviation [SD] and variant mean 44.67 ± 14.42 SD, p = 0.002). Furthermore, activation of primary KIR2DL3+ NK cells from healthy donors in response to HLA-C*03:04+ target cells presenting the variant epitope was significantly reduced in comparison to cells presenting the wild-type sequence (wild-type mean 0.78 ± 0.07 standard error of the mean [SEM] and variant mean 0.63 ± 0.07 SEM, p = 0.012). Structural modeling and surface plasmon resonance of KIR/peptide/HLA interactions in the context of the different viral sequence variants studied supported these results. Future studies will be needed to assess processing and antigen presentation of the investigated HIV-1 epitope in natural infection, and the consequences for viral control. These data provide novel insights into how viruses can evade NK cell immunity through the selection of mutations in HLA-presented epitopes that enhance binding to inhibitory NK cell receptors. 
Better understanding of the mechanisms by which HIV-1 evades NK-cell-mediated immune pressure and the functional validation of a structural modeling approach will facilitate the development of novel targeted immune interventions to harness the antiviral activities of NK cells.
Carro, Lorena; Nouioui, Imen; Sangal, Vartul; Meier-Kolthoff, Jan P; Trujillo, Martha E; Montero-Calasanz, Maria Del Carmen; Sahin, Nevzat; Smith, Darren Lee; Kim, Kristi E; Peluso, Paul; Deshpande, Shweta; Woyke, Tanja; Shapiro, Nicole; Kyrpides, Nikos C; Klenk, Hans-Peter; Göker, Markus; Goodfellow, Michael
2018-01-11
There is a need to clarify relationships within the actinobacterial genus Micromonospora, the type genus of the family Micromonosporaceae, given its biotechnological and ecological importance. Here, draft genomes of 40 Micromonospora type strains and two non-type strains are made available through the Genomic Encyclopedia of Bacteria and Archaea project and used to generate a phylogenomic tree which showed they could be assigned to well supported phyletic lines that were not evident in corresponding trees based on single and concatenated sequences of conserved genes. DNA G+C ratios derived from genome sequences showed that corresponding data from species descriptions were imprecise. Emended descriptions include precise base composition data and approximate genome sizes of the type strains. antiSMASH analyses of the draft genomes show that micromonosporae have a previously unrealised potential to synthesize novel specialized metabolites. Close to one thousand biosynthetic gene clusters were detected, including NRPS, PKS, terpenes and siderophores clusters that were discontinuously distributed thereby opening up the prospect of prioritising gifted strains for natural product discovery. The distribution of key stress related genes provide an insight into how micromonosporae adapt to key environmental variables. Genes associated with plant interactions highlight the potential use of micromonosporae in agriculture and biotechnology.
Virgibacillus halophilus sp. nov., spore-forming bacteria isolated from soil in Japan.
An, Sun-Young; Asahara, Mika; Goto, Keiichi; Kasai, Hiroaki; Yokota, Akira
2007-07-01
Two Gram-positive, round-spore-forming, rod-shaped, halophilic bacterial strains, 5B73C(T) and 5B133E, were isolated from field soil in Kakegawa, Shizuoka, Japan, and were characterized taxonomically using a polyphasic approach. These two strains were found to comprise strictly aerobic, motile rods that formed subterminal endospores. Phylogenetic analyses based on 16S rRNA gene sequences showed that strains 5B73C(T) and 5B133E are phylogenetically affiliated to the genus Virgibacillus, exhibiting sequence similarities of 94.1-96.4 % with respect to the type strains of Virgibacillus species. The DNA G+C contents of strains 5B73C(T) and 5B133E were 42.6 and 42.3 mol%, respectively. The cell-wall peptidoglycan type (meso-diaminopimelic acid), the major cellular fatty acids (anteiso-C(15 : 0), iso-C(15 : 0), anteiso-C(17 : 0) and iso-C(16 : 0)) and the quinone type (MK-7) of the isolates support their affiliation to the genus Virgibacillus. On the basis of their genotypic and phenotypic characteristics, the isolates represent a novel species of the genus Virgibacillus, for which the name Virgibacillus halophilus sp. nov. is proposed. The type strain is 5B73C(T) (=IAM 15308(T)=KCTC 13935(T)).
Chappell, J D; Gunn, V L; Wetzel, J D; Baer, G S; Dermody, T S
1997-03-01
The reovirus attachment protein, sigma1, determines numerous aspects of reovirus-induced disease, including viral virulence, pathways of spread, and tropism for certain types of cells in the central nervous system. The sigma1 protein projects from the virion surface and consists of two distinct morphologic domains, a virion-distal globular domain known as the head and an elongated fibrous domain, termed the tail, which is anchored into the virion capsid. To better understand structure-function relationships of sigma1 protein, we conducted experiments to identify sequences in sigma1 important for viral binding to sialic acid, a component of the receptor for type 3 reovirus. Three serotype 3 reovirus strains incapable of binding sialylated receptors were adapted to growth in murine erythroleukemia (MEL) cells, in which sialic acid is essential for reovirus infectivity. MEL-adapted (MA) mutant viruses isolated by serial passage in MEL cells acquired the capacity to bind sialic acid-containing receptors and demonstrated a dependence on sialic acid for infection of MEL cells. Analysis of reassortant viruses isolated from crosses of an MA mutant virus and a reovirus strain that does not bind sialic acid indicated that the sigma1 protein is solely responsible for efficient growth of MA mutant viruses in MEL cells. The deduced sigma1 amino acid sequences of the MA mutant viruses revealed that each strain contains a substitution within a short region of sequence in the sigma1 tail predicted to form beta-sheet. These studies identify specific sequences that determine the capacity of reovirus to bind sialylated receptors and suggest a location for a sialic acid-binding domain. Furthermore, the results support a model in which type 3 sigma1 protein contains discrete receptor binding domains, one in the head and another in the tail that binds sialic acid.
Delamuta, Jakeline Renata Marçon; Ribeiro, Renan Augusto; Ormeño-Orrillo, Ernesto; Parma, Marcia Maria; Melo, Itamar Soares; Martínez-Romero, Esperanza; Hungria, Mariangela
2015-12-01
Biological nitrogen fixation is a key process for agricultural production and environmental sustainability, but there are comparatively few studies of symbionts of tropical pasture legumes, as well as few described species of the genus Bradyrhizobium, although it is the predominant rhizobial genus in the tropics. A detailed polyphasic study was conducted with two strains of the genus Bradyrhizobium used in commercial inoculants for tropical pastures in Brazil, CNPSo 1112T, isolated from perennial soybean (Neonotonia wightii), and CNPSo 2833T, from desmodium (Desmodium heterocarpon). Based on 16S-rRNA gene phylogeny, both strains were grouped in the Bradyrhizobium elkanii superclade, but were not clearly clustered with any known species. Multilocus sequence analysis of three (glnII, gyrB and recA) and five (plus atpD and dnaK) housekeeping genes confirmed that the strains are positioned in two distinct clades. Comparison with intergenic transcribed spacer sequences of type strains of described species of the genus Bradyrhizobium showed similarity lower than 93.1 %, and differences were confirmed by BOX-PCR analysis. Nucleotide identity of three housekeeping genes with type strains of described species ranged from 88.1 to 96.2 %. Average nucleotide identity of genome sequences showed values below the threshold for distinct species of the genus Bradyrhizobium ( < 90.6 %), and the value between the two strains was also below this threshold (91.2 %). Analysis of nifH and nodC gene sequences positioned the two strains in a clade distinct from other species of the genus Bradyrhizobium. Morphophysiological, genotypic and genomic data supported the description of two novel species in the genus Bradyrhizobium, Bradyrhizobium tropiciagri sp. nov. (type strain CNPSo 1112T = SMS 303T = BR 1009T = SEMIA 6148T = LMG 28867T) and Bradyrhizobium embrapense sp. nov. (type strain CNPSo 2833T = CIAT 2372T = BR 2212T = SEMIA 6208T = U674T = LMG 2987).
Angus, Robert B.; Ribera, Ignacio; Jia, Fenglong
2017-01-01
Karyotypes are given for Boreonectes emmerichi (Falkenström, 1936) from its type locality at Kangding, China, and for B. alpestris (Dutton & Angus, 2007) from the St Gotthard and San Bernardino passes in the Swiss Alps. A phylogeny based on sequence data from a combination of mitochondrial and nuclear genes recovered western Palaearctic species of Boreonectes as monophyletic with strong support. Boreonectes emmerichi was placed as sister to the North American forms of B. griseostriatus (De Geer, 1774), although with low support. The diversity of Palaearctic species of the B. griseostriatus species group is discussed. PMID:28919958
Galbany-Casals, M; Carnicero-Campmany, P; Blanco-Moreno, J M; Smissen, R D
2012-09-01
Hybridisation is considered an important evolutionary phenomenon in Gnaphalieae, but contemporary hybridisation has been little explored within the tribe. Here, hybridisation between Helichrysum orientale and Helichrysum stoechas is studied at two different localities in the islands of Crete and Rhodes (Greece). Using three different types of molecular data (AFLP, nrDNA ITS sequences and cpDNA ndhF sequences) and morphological data, the aim is to provide simultaneous and direct comparisons between molecular and morphological variation among the parental species and the studied hybrid populations. AFLP profiles, ITS sequences and morphological data support the existence of hybrids at the two localities studied, shown as morphological and genetic intermediates between the parental species. Chloroplast DNA sequences show that both parental species can act either as pollen donor or as maternal parent. Fertility of hybrids is demonstrated by the viability of seeds produced by hybrids from both localities, and the detection of a backcross specimen to H. orientale. Although there is general congruence of morphological and molecular data, the analysis of morphology and ITS sequences can fail to detect backcross hybrids. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.
Risk stratification for therapeutic management and prognosis.
Coelho-Filho, Otavio R; Nallamshetty, Leelakrishna; Kwong, Raymond Y
2009-07-01
In coronary artery disease (CAD), cardiac magnetic resonance (CMR) imaging can integrate several types of pulse-sequence examinations (eg, myocardial perfusion, cine wall motion, T2-weighted imaging for myocardial edema, late gadolinium enhancement, and CMR angiography) that can provide anatomic, functional, and physiologic information about the heart in a single imaging session. Because of this ability to interrogate myocardial physiology using different pulse sequence techniques within a single CMR session, this technique has been recognized increasingly in many centers as the test of choice for assessing patients who present with cardiomyopathy of undetermined cause. This article first reviews the current evidence supporting the prognosticating role of CMR in assessing CAD and then discusses CMR applications and prognostication in many non-coronary cardiac conditions.
Whole-Genome Sequencing Data for Serotyping Escherichia coli-It's Time for a Change!
Jenkins, Claire
2015-08-01
The accessibility of whole-genome sequencing (WGS) presents the opportunity for national reference laboratories to provide a state-of-the-art public health surveillance service. The replacement of traditional serology-based typing of Escherichia coli by WGS is supported by user-friendly, freely available data analysis Web tools. An article in this issue of the Journal of Clinical Microbiology (K. G. Joensen, A. M. M. Tetzschner, A. Iguchi, F. M. Aarestrup, and F. Scheutz, J Clin Microbiol, 53:2410-2426, 2015, http://dx.doi.org/10.1128/JCM.00008-15) describes SerotypeFinder, an essential guide to serotyping E. coli in the 21st century. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Constructing and Modifying Sequence Statistics for relevent Using informR in 𝖱
Marcum, Christopher Steven; Butts, Carter T.
2015-01-01
The informR package greatly simplifies the analysis of complex event histories in 𝖱 by providing user friendly tools to build sufficient statistics for the relevent package. Historically, building sufficient statistics to model event sequences (of the form a→b) using the egocentric generalization of Butts’ (2008) relational event framework for modeling social action has been cumbersome. The informR package simplifies the construction of the complex list of arrays needed by the rem() model fitting for a variety of cases involving egocentric event data, multiple event types, and/or support constraints. This paper introduces these tools using examples from real data extracted from the American Time Use Survey. PMID:26185488
A catalog of aftershock sequences in Greece (1971–1997): Their spatial and temporal characteristics
NASA Astrophysics Data System (ADS)
Drakatos, George; Latoussakis, John
A complete catalog of aftershock sequences is provided for main earthquakes with ML ≥ 5.0 that occurred in the area of Greece and surrounding regions during the last twenty-seven years. The Monthly Bulletins of the Institute of Geodynamics (National Observatory of Athens) have been used as the data source. In order to get a homogeneous catalog, several selection criteria have been applied, and hence a catalog of 44 aftershock sequences is compiled. The relations between the duration of the sequence, the number of aftershocks, the magnitude of the largest aftershock and its delay time from the main shock, as well as the subsurface rupture length versus the magnitude of the main shock, are calculated. The results show that linearity exists between the subsurface rupture length and the magnitude of the main shock independent of the slip type, as well as between the magnitude of the main shock (M) and its largest aftershock (Ma). The mean difference M-Ma is almost one unit. In 40% of the analyzed sequences, the largest aftershock occurred within one day after the main shock. The fact that the aftershock sequences show the same behavior for earthquakes that occur in the same region supports the theory that the spatial and temporal characteristics are strongly related to the stress distribution of the fault area.
NASA Astrophysics Data System (ADS)
Shenkman, B. S.; Nemirovskaya, T. L.; Shapovalova, K. B.; Podlubnaya, Z. A.; Vikhliantsev, I. M.; Moukhina, A. M.; Kozlovskaya, I. B.
2007-02-01
It was recently established that support withdrawal (withdrawal of support reaction force) in microgravity provokes a sequence of functional shifts in the activity of motor units (inactivation of slow ones) and peripheral muscle apparatus which lead to the decline of postural muscle contractility and alterations in fiber characteristics. However, mechanisms involved in inactivation of the slow motor units and appropriate slow-twitch muscle fiber disuse under the supportless conditions remained unknown. We show here that artificial inactivation of muscles-antagonists (which are known to be hyperactive during unloading) counteracts some of the unloading-induced events in the rat soleus (fiber size reduction, slow-to-fast fiber-type transition and decline of titin and nebulin content). It was also demonstrated that direct activation of the muscarinic receptors of the neostriatum neurons prevented slow-to-fast fiber-type transformation in soleus of hindlimb suspended rats.
GenSeq: An updated nomenclature and ranking for genetic sequences from type and non-type sources
Chakrabarty, Prosanta; Warren, Melanie; Page, Lawrence M.; Baldwin, Carole C.
2013-01-01
An improved and expanded nomenclature for genetic sequences is introduced that corresponds with a ranking of the reliability of the taxonomic identification of the source specimens. This nomenclature is an advancement of the “Genetypes” naming system, which some have been reluctant to adopt because of the use of the “type” suffix in the terminology. In the new nomenclature, genetic sequences are labeled “genseq,” followed by a reliability ranking (e.g., 1 if the sequence is from a primary type), followed by the name of the genes from which the sequences were derived (e.g., genseq-1 16S, COI). The numbered suffix provides an indication of the likely reliability of taxonomic identification of the voucher. Included in this ranking system, in descending order of taxonomic reliability, are the following: sequences from primary types – “genseq-1,” secondary types – “genseq-2,” collection-vouchered topotypes – “genseq-3,” collection-vouchered non-types – “genseq-4,” and non-types that lack specimen vouchers but have photo vouchers – “genseq-5.” To demonstrate use of the new nomenclature, we review recently published new-species descriptions in the ichthyological literature that include DNA data and apply the GenSeq nomenclature to sequences referenced in those publications. We encourage authors to adopt the GenSeq nomenclature (note capital “G” and “S” when referring to the nomenclatural program) to provide a searchable tag (e.g., “genseq”; note lowercase “g” and “s” when referring to sequences) for genetic sequences from types and other vouchered specimens. Use of the new nomenclature and ranking system will improve integration of molecular phylogenetics and biological taxonomy and enhance the ability of researchers to assess the reliability of sequence data. We further encourage authors to update sequence information on databases such as GenBank whenever nomenclatural changes are made. PMID:24223486
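The labeling rule described in the GenSeq abstract above can be sketched in a few lines of Python. This is only an illustrative sketch: the function name genseq_label and the table GENSEQ_RANKS are hypothetical and not part of any published tool; the five ranks and the "genseq-1 16S, COI" output format are taken from the abstract itself.

```python
# Hypothetical helper illustrating the GenSeq label format from the abstract.
# Rank meanings follow the abstract's list, in descending taxonomic reliability.
GENSEQ_RANKS = {
    1: "primary type",
    2: "secondary type",
    3: "collection-vouchered topotype",
    4: "collection-vouchered non-type",
    5: "non-type with photo voucher only",
}


def genseq_label(rank, genes):
    """Build a label such as 'genseq-1 16S, COI' for a voucher rank and gene list."""
    if rank not in GENSEQ_RANKS:
        raise ValueError(f"rank must be one of {sorted(GENSEQ_RANKS)}, got {rank!r}")
    if not genes:
        raise ValueError("at least one gene name is required")
    # Lowercase 'genseq' is used for sequence tags, as the abstract notes.
    return f"genseq-{rank} " + ", ".join(genes)


if __name__ == "__main__":
    print(genseq_label(1, ["16S", "COI"]))
```

Because the tag always begins with the lowercase string "genseq", a plain text search for that prefix would find every labeled sequence regardless of rank, which is the searchability the authors describe.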
Rodas, Claudia; Klena, John D.; Nicklasson, Matilda; Iniguez, Volga; Sjöling, Åsa
2011-01-01
Background: Enterotoxigenic Escherichia coli (ETEC) is a major cause of traveller's and infantile diarrhoea in the developing world. ETEC produces two toxins, a heat-stable toxin (known as ST) and a heat-labile toxin (LT), and colonization factors that help the bacteria to attach to epithelial cells. Methodology/Principal Findings: In this study, we characterized a subset of ETEC clinical isolates recovered from Bolivian children under 5 years of age using a combination of multilocus sequence typing (MLST) analysis, virulence typing, serotyping and antimicrobial resistance test patterns in order to determine the genetic background of ETEC strains circulating in Bolivia. We found that strains expressing the heat-labile (LT) enterotoxin and colonization factor CS17 were common and belonged to several MLST sequence types but mainly to sequence type-423 and sequence type-443 (Achtman scheme). To further study the LT/CS17 strains we analysed the nucleotide sequence of the CS17 operon and compared the structure to LT/CS17 ETEC isolates from Bangladesh. Sequence analysis confirmed that all sequence type-423 strains from Bolivia had a single nucleotide polymorphism (SNPbol) in the CS17 operon that was also found in some other MLST sequence types from Bolivia but not in strains recovered from Bangladeshi children. The dominant ETEC clone in Bolivia (sequence type-423/SNPbol) was found to persist over multiple years and was associated with severe diarrhoea, but these strains were variable with respect to antimicrobial resistance patterns. Conclusion/Significance: The results showed that although the LT/CS17 phenotype is common among ETEC strains in Bolivia, multiple clones, as determined by unique MLST sequence types, populate this phenotype. Our data also appear to suggest that acquisition and loss of antimicrobial resistance in LT-expressing CS17 ETEC clones is more dynamic than acquisition or loss of virulence factors. PMID:22140423
Weterings, Veronica; Bosch, Thijs; Witteveen, Sandra; Landman, Fabian; Schouls, Leo; Kluytmans, Jan
2017-09-01
Resistance to methicillin in Staphylococcus aureus is caused primarily by the mecA gene, which is carried on a mobile genetic element, the staphylococcal cassette chromosome mec (SCCmec). Horizontal transfer of this element is supposed to be an important factor in the emergence of new clones of methicillin-resistant Staphylococcus aureus (MRSA) but has been rarely observed in real time. In 2012, an outbreak occurred involving a health care worker (HCW) and three patients, all carrying a fusidic acid-resistant MRSA strain. The husband of the HCW was screened for MRSA carriage, but only a methicillin-susceptible S. aureus (MSSA) strain, which was also resistant to fusidic acid, was detected. Multiple-locus variable-number tandem-repeat analysis (MLVA) typing showed that both the MSSA and MRSA isolates were MT4053-MC0005. This finding led to the hypothesis that the MSSA strain acquired the SCCmec and subsequently caused an outbreak. To support this hypothesis, next-generation sequencing of the MSSA and MRSA isolates was performed. This study showed that the MSSA isolate clustered closely with the outbreak isolates based on whole-genome multilocus sequence typing and single-nucleotide polymorphism (SNP) analysis, with a genetic distance of 17 genes and 44 SNPs, respectively. Remarkably, there were relatively large differences in the mobile genetic elements in strains within and between individuals. The limited genetic distance between the MSSA and MRSA isolates in combination with a clear epidemiologic link supports the hypothesis that the MSSA isolate acquired an SCCmec and that the resulting MRSA strain caused an outbreak. Copyright © 2017 American Society for Microbiology.
Keith Boggs
2000-01-01
A classification of community types, successional sequences, and landscapes is presented for the piedmont of the Copper River Delta. The classification was based on a sampling of 471 sites. A total of 75 community types, 42 successional sequences, and 6 landscapes are described. The classification of community types reflects the existing vegetation communities on the...
Brown, Steven D.; Podar, Mircea; Klingeman, Dawn M.; Johnson, Courtney M.; Yang, Zamin K.; Utturkar, Sagar M.; Land, Miriam L.; Mosher, Jennifer J.; Hurt, Richard A.; Phelps, Tommy J.; Palumbo, Anthony V.; Arkin, Adam P.; Hazen, Terry C.
2012-01-01
Pelosinus fermentans 16S rRNA gene sequences have been reported from diverse geographical sites since the recent isolation of the type strain. We present the genome sequence of the P. fermentans type strain R7 (DSM 17108) and genome sequences for two new strains with different abilities to reduce iron, chromate, and uranium. PMID:22933770
Molecular epidemiology of type 1 and 2 dengue viruses in Brazil from 1988 to 2001.
Pires Neto, R J; Lima, D M; de Paula, S O; Lima, C M; Rocco, I M; Fonseca, B A L
2005-06-01
Dengue is a mosquito-borne viral infection that in recent decades has become a major international public health concern. Epidemic dengue fever reemerged in Brazil in 1981. Since 1990 more than one dengue virus serotype has been circulating in this tropical country and increasing rates of dengue hemorrhagic fever and dengue shock syndrome have been detected every year. Some evidence supports the association between the introduction of a new serotype and/or genotype in a region and the appearance of dengue hemorrhagic fever. In order to study the evolutionary relationships and possible detection of the introduction of new dengue virus genotypes in Brazil in the last years, we analyzed partial nucleotide sequences of 52 Brazilian samples of both dengue type 1 and dengue type 2 isolated from 1988 to 2001 from highly endemic regions. A 240-nucleotide-long sequence from the envelope/nonstructural protein 1 gene junction was used for phylogenetic analysis. After comparing the nucleotide sequences originally obtained in this study to those previously studied by others, and analyzing the phylogenetic trees, we conclude that, after the initial introduction of the currently circulating dengue-1 and dengue-2 genotypes in Brazil, there has been no evidence of introduction of new genotypes since 1988. The increasing number of dengue hemorrhagic fever cases seen in Brazil in the last years is probably associated with secondary infections or with the introduction of new serotypes but not with the introduction of new genotypes.
Miragaia, M.; Thomas, J. C.; Couto, I.; Enright, M. C.; de Lencastre, H.
2007-01-01
Despite its importance as a human pathogen, information on population structure and global epidemiology of Staphylococcus epidermidis is scarce and the relative importance of the mechanisms contributing to clonal diversification is unknown. In this study, we addressed these issues by analyzing a representative collection of S. epidermidis isolates from diverse geographic and clinical origins using multilocus sequence typing (MLST). Additionally, we characterized the mobile element (SCCmec) carrying the genetic determinant of methicillin resistance. The 217 S. epidermidis isolates from our collection were split by MLST into 74 types, suggesting a high level of genetic diversity. Analysis of MLST data using the eBURST algorithm revealed the existence of nine epidemic clonal lineages that were disseminated worldwide. One single clonal lineage (clonal complex 2) comprised 74% of the isolates, whereas the remaining isolates were clustered into 8 minor clonal lineages and 13 singletons. According to our evolutionary model, SCCmec was acquired at least 56 times by S. epidermidis. Although geographic dissemination of S. epidermidis strains and the value of the index of association between the alleles, 0.2898 (P < 0.05), support the clonality of S. epidermidis species, examination of the sequence changes at MLST loci during clonal diversification showed that recombination gives rise to new alleles approximately twice as frequently as point mutations. We suggest that S. epidermidis has a population with an epidemic structure, in which nine clones have emerged upon a recombining background and evolved quickly through frequent transfer of genetic mobile elements, including SCCmec. PMID:17220222
Clostridium botulinum strains producing BoNT/F4 or BoNT/F5.
Raphael, Brian H; Bradshaw, Marite; Kalb, Suzanne R; Joseph, Lavin A; Lúquez, Carolina; Barr, John R; Johnson, Eric A; Maslanka, Susan E
2014-05-01
Botulinum neurotoxin type F (BoNT/F) may be produced by Clostridium botulinum alone or in combination with another toxin type such as BoNT/A or BoNT/B. Type F neurotoxin gene sequences have been further classified into seven toxin subtypes. Recently, the genome sequence of one strain of C. botulinum (Af84) was shown to contain three neurotoxin genes (bont/F4, bont/F5, and bont/A2). In this study, eight strains containing bont/F4 and seven strains containing bont/F5 were examined. Culture supernatants produced by these strains were incubated with BoNT/F-specific peptide substrates. Cleavage products of these peptides were subjected to mass spectral analysis, allowing detection of the BoNT/F subtypes present in the culture supernatants. PCR analysis demonstrated that a plasmid-specific marker (PL-6) was observed only among strains containing bont/F5. Among these strains, Southern hybridization revealed the presence of an approximately 242-kb plasmid harboring bont/F5. Genome sequencing of four of these strains revealed that the genomic backgrounds of strains harboring either bont/F4 or bont/F5 are diverse. None of the strains analyzed in this study were shown to produce BoNT/F4 and BoNT/F5 simultaneously, suggesting that strain Af84 is unusual. Finally, these data support a role for the mobility of a bont/F5-carrying plasmid among strains of diverse genomic backgrounds.
Enterovirus Migration Patterns between France and Tunisia.
Othman, Ines; Mirand, Audrey; Slama, Ichrak; Mastouri, Maha; Peigue-Lafeuille, Hélène; Aouni, Mahjoub; Bailly, Jean-Luc
2015-01-01
The enterovirus (EV) types echovirus (E-) 5, E-9, and E-18, and coxsackievirus (CV-) A9 are infrequently reported in human diseases and their epidemiologic features are poorly defined. Virus transmission patterns between countries have been estimated with phylogenetic data derived from the 1D/VP1 and 3CD gene sequences of a sample of 74 strains obtained in France (2000-2012) and Tunisia (2011-2013) and from the publicly available sequences. The EV types (E-5, E-9, and E-18) exhibited a lower worldwide genetic diversity (respective number of genogroups: 4, 5, and 3) in comparison to CV-A9 (n = 10). The phylogenetic trees estimated with both 1D/VP1 and 3CD sequence data showed variations in the number of co-circulating lineages over the last 20 years among the four EV types. Despite the low number of genogroups in E-18, the virus exhibited the highest number of recombinant 3CD lineages (n = 10) versus 4 (E-5) to 8 (E-9). The phylogenies provided evidence of multiple transportation events between France and Tunisia involving E-5, E-9, E-18, and CV-A9 strains. Virus spread events between France and 17 other countries in five continents had high probabilities of occurrence, as did those between Tunisia and two European countries other than France. All transportation events were supported by BF values > 10. Inferring the source of virus transmission from phylogenetic data may provide insights into the patterns of sporadic and epidemic diseases caused by EVs.
Mathlouthi, Najla; El Salabi, Allaaeddin Ali; Ben Jomàa-Jemili, Mariem; Bakour, Sofiane; Al-Bayssari, Charbel; Zorgani, Abdulaziz A; Kraiema, Abdulmajeed; Elahmer, Omar; Okdah, Liliane; Rolain, Jean-Marc; Chouchani, Chedly
2016-07-01
Acinetobacter baumannii is an opportunistic pathogen causing various nosocomial infections. The aim of this study was to characterise the molecular support of carbapenem-resistant A. baumannii clinical isolates recovered from two Libyan hospitals. Bacterial isolates were identified by matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry (MALDI-TOF/MS). Antibiotic susceptibility testing was performed using disk diffusion and Etest methods, and carbapenem resistance determinants were studied by PCR amplification and sequencing. Multilocus sequence typing (MLST) was performed for typing of the isolates. All 36 imipenem-resistant isolates tested were identified as A. baumannii. The blaOXA-23 gene was detected in 29 strains (80.6%). The metallo-β-lactamase blaNDM-1 gene was detected in eight isolates (22.2%), showing dissemination of multidrug-resistant (MDR) A. baumannii in Tripoli Medical Center and Burn and Plastic Surgery Hospital in Libya, including one isolate that co-expressed the blaOXA-23 gene. MLST revealed several sequence types (STs). Imipenem-resistant A. baumannii ST2 was the predominant clone (16/36; 44.4%). This study shows that NDM-1 and OXA-23 contribute to antibiotic resistance in Libyan hospitals and represents the first incidence of the association of these two carbapenemases in an autochthonous MDR A. baumannii isolated from patients in Libya, indicating that there is a longstanding infection control problem in these hospitals. Copyright © 2016 Elsevier B.V. and International Society of Chemotherapy. All rights reserved.
Young, C-C; Busse, H-J; Langer, S; Chu, Jiunn-Nan; Schumann, P; Arun, A B; Shen, Fo-Ting; Rekha, P D; Kämpfer, P
2010-04-01
Three Gram-positive, rod-shaped bacteria (strains CC-SBCK-209(T), CC-12309(T) and CC-5209(T)) were isolated from the stalk of the edible mushroom Agaricus blazei grown in the laboratory. 16S rRNA gene sequence analysis indicated that all three isolates clearly belonged to the genus Microbacterium. Strains CC-SBCK-209(T) and CC-12309(T) were most closely related to the type strain of Microbacterium halotolerans (95.9 and 96.1 % 16S rRNA gene sequence similarity, respectively). These two novel strains shared 97.9 % 16S rRNA gene sequence similarity. Levels of similarity to the type strains of all other recognized Microbacterium species were lower than 95.5 %. The third strain (CC-5209(T)) showed the highest 16S rRNA gene sequence similarity to the type strain of Microbacterium resistens (97.6 %); levels of similarity to the type strains of all other recognized Microbacterium species were lower than 96 %. The quinone systems of strains CC-SBCK-209(T), CC-12309(T) and CC-5209(T) consisted of MK-11/MK-12, MK-11/MK-10 and MK-13 as major compounds, respectively. All three strains contained ornithine in their peptidoglycan. The major polar lipids were diphosphatidylglycerol, phosphatidylglycerol and an unknown glycolipid. The polyamine pattern consisted of spermidine and spermine as predominant components. Fatty acid profiles (anteiso-C(15 : 0), iso-C(16 : 0) and anteiso-C(17 : 0) as major components) supported the affiliation of all three strains to the genus Microbacterium. The results of physiological and biochemical tests and DNA-DNA hybridization experiments allowed the clear phenotypic and genotypic differentiation of strains CC-SBCK-209(T) and CC-12309(T) from M. halotolerans and other closely related Microbacterium species. Strain CC-5209(T) could be differentiated clearly from M. resistens both genotypically and phenotypically. Based on these data, the novel strains are considered to represent three novel species of the genus Microbacterium. 
The names proposed for these organisms are Microbacterium agarici sp. nov. [type strain CC-SBCK-209(T) (=DSM 21798(T)=CCM 7686(T))], Microbacterium humi sp. nov. [type strain CC-12309(T) (=DSM 21799(T)=CCM 7687(T))] and Microbacterium pseudoresistens sp. nov. [type strain CC-5209(T) (=DSM 22185(T)=CCM 7688(T))].
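The 16S rRNA gene sequence similarity values quoted in this abstract (for example, 97.9 % between the two novel strains) are percentages of matching positions in a pairwise alignment. A minimal sketch in Python, assuming the two sequences are already aligned to equal length with '-' marking gaps; the function name, the toy sequences and the gap convention (columns with a gap in either sequence are skipped) are illustrative assumptions and not taken from the study:

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of identical residues over gap-free columns of a pairwise alignment.

    Assumption: both inputs come from the same alignment and therefore have
    the same length; '-' marks a gap. Columns with a gap in either sequence
    are ignored, which is only one of several conventions in use.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


# Toy, made-up fragments (not real 16S rRNA gene sequences).
print(percent_identity("ACGT", "ACGA"))    # 3 of 4 columns match
print(percent_identity("AC-GT", "ACTGA"))  # gapped column is skipped
```

Published similarity values also depend on the alignment method and on how gaps and ambiguous bases are scored, so a sketch like this would not necessarily reproduce the reported percentages.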
Butler, Nathaniel M.; Baltes, Nicholas J.; Voytas, Daniel F.; Douches, David S.
2016-01-01
Genome editing using sequence-specific nucleases (SSNs) is rapidly being developed for genetic engineering in crop species. The utilization of zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and clustered regularly interspaced short palindromic repeats/CRISPR-associated systems (CRISPR/Cas) for inducing double-strand breaks facilitates targeting of virtually any sequence for modification. Targeted mutagenesis via non-homologous end-joining (NHEJ) has been demonstrated extensively as being the preferred DNA repair pathway in plants. However, gene targeting via homologous recombination (HR) remains more elusive but could be a powerful tool for directed DNA repair. To overcome barriers associated with gene targeting, a geminivirus replicon (GVR) was used to deliver SSNs targeting the potato ACETOLACTATE SYNTHASE1 (ALS1) gene and repair templates designed to incorporate herbicide-inhibiting point mutations within the ALS1 locus. Transformed events modified with GVRs held point mutations that were capable of supporting a reduced herbicide susceptibility phenotype, while events transformed with conventional T-DNAs held no detectable mutations and were similar to wild-type. Regeneration of transformed events improved detection of point mutations that supported a stronger reduced herbicide susceptibility phenotype. These results demonstrate the use of geminiviruses for delivering genome editing reagents in plant species, and a novel approach to gene targeting in a vegetatively propagated species. PMID:27493650
Indel variant analysis of short-read sequencing data with Scalpel
Fang, Han; Bergmann, Ewa A; Arora, Kanika; Vacic, Vladimir; Zody, Michael C; Iossifov, Ivan; O’Rawe, Jason A; Wu, Yiyang; Barron, Laura T Jimenez; Rosenbaum, Julie; Ronemus, Michael; Lee, Yoon-ha; Wang, Zihua; Dikoglu, Esra; Jobanputra, Vaidehi; Lyon, Gholson J; Wigler, Michael; Schatz, Michael C; Narzisi, Giuseppe
2017-01-01
As the second most common type of variation in the human genome, insertions and deletions (indels) have been linked to many diseases, but the discovery of indels of more than a few bases in size from short-read sequencing data remains challenging. Scalpel (http://scalpel.sourceforge.net) is an open-source software for reliable indel detection based on the microassembly technique. It has been successfully used to discover mutations in novel candidate genes for autism, and it is extensively used in other large-scale studies of human diseases. This protocol gives an overview of the algorithm and describes how to use Scalpel to perform highly accurate indel calling from whole-genome and whole-exome sequencing data. We provide detailed instructions for an exemplary family-based de novo study, but we also characterize the other two supported modes of operation: single-sample and somatic analysis. Indel normalization, visualization and annotation of the mutations are also illustrated. Using a standard server, indel discovery and characterization in the exonic regions of the example sequencing data can be completed in ~5 h after read mapping. PMID:27854363
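The indel normalization step mentioned in this abstract can be illustrated with a generic left-alignment routine. The sketch below is a hedged illustration of the commonly described trim-and-extend procedure for a single variant and is not Scalpel's own implementation; the function name, the 0-based coordinate convention and the toy reference are assumptions made for the example:

```python
def left_normalize(reference: str, pos: int, ref: str, alt: str):
    """Left-align and trim one variant against a reference string.

    pos is the 0-based index in `reference` of the first base of `ref`.
    Returns (pos, ref, alt) in normalized form. Generic sketch only.
    """
    while True:
        changed = False
        # Drop a shared trailing base from both alleles.
        if ref and alt and ref[-1] == alt[-1]:
            ref, alt = ref[:-1], alt[:-1]
            changed = True
        # If an allele became empty, extend both alleles one base to the left.
        if not ref or not alt:
            if pos == 0:
                raise ValueError("cannot left-extend at the start of the reference")
            pos -= 1
            base = reference[pos]
            ref, alt = base + ref, base + alt
            changed = True
        if not changed:
            break
    # Drop shared leading bases while both alleles keep at least one base.
    while len(ref) > 1 and len(alt) > 1 and ref[0] == alt[0]:
        ref, alt = ref[1:], alt[1:]
        pos += 1
    return pos, ref, alt


# Toy reference with a CA repeat; a 2-bp deletion reported inside the repeat
# is shifted to its leftmost equivalent position.
toy_reference = "GGCACACAG"
print(left_normalize(toy_reference, 5, "ACA", "A"))
```

Normalizing in this way makes equivalent indel calls from different tools or samples comparable, since the same deletion inside a repeat can otherwise be reported at several positions.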
Horn, T; Chang, C A; Urdea, M S
1997-12-01
The divergent synthesis of bDNA structures is described. This new type of branched DNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branching network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb molecules were assembled on a solid support using parameters optimized for bDNA synthesis. The chemistry was used to synthesize bDNA comb molecules containing 15 secondary sequences. The bDNA comb molecules were elaborated by enzymatic ligation into branched amplification multimers, large bDNA molecules (a total of 1068 nt) containing an average of 36 repeated DNA oligomer sequences, each capable of hybridizing specifically to an alkaline phosphatase-labeled oligonucleotide. The bDNA comb molecules were characterized by electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The branched amplification multimers have been used as signal amplifiers in nucleic acid quantification assays for detection of viral infection. It is possible to detect as few as 50 molecules with bDNA technology. PMID:9365266
Lim, P O; Sears, B B
1992-01-01
The families within the class Mollicutes are distinguished by their morphologies, nutritional requirements, and abilities to metabolize certain compounds. Biosystematic classification of the plant-pathogenic mycoplasmalike organisms (MLOs) has been difficult because these organisms have not been cultured in vitro, and hence their nutritional requirements have not been determined nor have physiological characterizations been possible. To investigate the evolutionary relationship of the MLOs to other members of the class Mollicutes, a segment of a ribosomal protein operon was cloned and sequenced from an aster yellows-type MLO which is pathogenic for members of the genus Oenothera and from Acholeplasma laidlawii. The deduced amino acid sequence data from the rpl22 and rps3 genes indicate that the MLOs are more closely related to A. laidlawii than to animal mycoplasmas, confirming previous results from 16S rRNA sequence comparisons. This conclusion is also supported by the finding that the UGA codon is not read as a tryptophan codon in the MLO and A. laidlawii, in contrast to its usage in Mycoplasma capricolum. PMID:1556079
Lenzmeier, B A; Giebler, H A; Nyborg, J K
1998-02-01
Efficient human T-cell leukemia virus type 1 (HTLV-1) replication and viral gene expression are dependent upon the virally encoded oncoprotein Tax. To activate HTLV-1 transcription, Tax interacts with the cellular DNA binding protein cyclic AMP-responsive element binding protein (CREB) and recruits the coactivator CREB binding protein (CBP), forming a nucleoprotein complex on the three viral cyclic AMP-responsive elements (CREs) in the HTLV-1 promoter. Short stretches of dG-dC-rich (GC-rich) DNA, immediately flanking each of the viral CREs, are essential for Tax recruitment of CBP in vitro and Tax transactivation in vivo. Although the importance of the viral CRE-flanking sequences is well established, several studies have failed to identify an interaction between Tax and the DNA. The mechanistic role of the viral CRE-flanking sequences has therefore remained enigmatic. In this study, we used high resolution methidiumpropyl-EDTA iron(II) footprinting to show that Tax extended the CREB footprint into the GC-rich DNA flanking sequences of the viral CRE. The Tax-CREB footprint was enhanced but not extended by the KIX domain of CBP, suggesting that the coactivator increased the stability of the nucleoprotein complex. Conversely, the footprint pattern of CREB on a cellular CRE lacking GC-rich flanking sequences did not change in the presence of Tax or Tax plus KIX. The minor-groove DNA binding drug chromomycin A3 bound to the GC-rich flanking sequences and inhibited the association of Tax and the Tax-CBP complex without affecting CREB binding. Tax specifically cross-linked to the viral CRE in the 5'-flanking sequence, and this cross-link was blocked by chromomycin A3. Together, these data support a model where Tax interacts directly with both CREB and the minor-groove viral CRE-flanking sequences to form a high-affinity binding site for the recruitment of CBP to the HTLV-1 promoter.
Walker, M D; Park, C W; Rosen, A; Aronheim, A
1990-01-01
Cell-specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to a 5- to 10-fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. PMID:2181401
Postel, Alexander; Schmeiser, Stefanie; Zimmermann, Bernd; Becher, Paul
2016-01-01
Molecular epidemiology has become an indispensable tool in the diagnosis of diseases and in tracing the infection routes of pathogens. Due to advances in conventional sequencing and the development of high throughput technologies, the field of sequence determination is in the process of being revolutionized. Platforms for sharing sequence information and providing standardized tools for phylogenetic analyses are becoming increasingly important. The database (DB) of the European Union (EU) and World Organisation for Animal Health (OIE) Reference Laboratory for classical swine fever offers one of the world’s largest semi-public virus-specific sequence collections combined with a module for phylogenetic analysis. The classical swine fever (CSF) DB (CSF-DB) became a valuable tool for supporting diagnosis and epidemiological investigations of this highly contagious disease in pigs with high socio-economic impacts worldwide. The DB has been re-designed and now allows for the storage and analysis of traditionally used, well established genomic regions and of larger genomic regions including complete viral genomes. We present an application example for the analysis of highly similar viral sequences obtained in an endemic disease situation and introduce the new geographic “CSF Maps” tool. The concept of this standardized and easy-to-use DB with an integrated genetic typing module is suited to serve as a blueprint for similar platforms for other human or animal viruses. PMID:27827988
Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D.; Callac, Philippe
2016-01-01
The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated. PMID:27228131
Complete genome sequence of the Antarctic Halorubrum lacusprofundi type strain ACAM 34
Anderson, Iain J.; DasSarma, Priya; Lucas, Susan; ...
2016-09-10
Halorubrum lacusprofundi is an extreme halophile within the archaeal phylum Euryarchaeota. The type strain ACAM 34 was isolated from Deep Lake, Antarctica. H. lacusprofundi is of phylogenetic interest because it is distantly related to the haloarchaea that have previously been sequenced. It is also of interest because of its psychrotolerance. We report here the complete genome sequence of H. lacusprofundi type strain ACAM 34 and its annotation. This genome is part of a 2006 Joint Genome Institute Community Sequencing Program project to sequence genomes of diverse Archaea.
Micromonospora halotolerans sp. nov., isolated from the rhizosphere of a Pisum sativum plant.
Carro, Lorena; Pukall, Rüdiger; Spröer, Cathrin; Kroppenstedt, Reiner M; Trujillo, Martha E
2013-06-01
A filamentous actinomycete strain designated CR18(T) was isolated on humic acid agar from the rhizosphere of a Pisum sativum plant collected in Spain. This isolate was observed to grow optimally at 28 °C, pH 7.0 and in the presence of 5 % NaCl. Phylogenetic analyses based on the 16S rRNA gene sequence indicated a close relationship with the type strains of Micromonospora chersina and Micromonospora endolithica. A further analysis based on a concatenated DNA sequence stretch of 4,523 bp that included partial sequences of the atpD, gyrB, recA, rpoB and 16S rRNA genes clearly differentiated the new strain from recognized Micromonospora species compared. DNA-DNA hybridization studies further supported the taxonomic position of strain CR18(T) as a novel genomic species. Chemotaxonomic analyses which included whole cell sugars, polar lipids, fatty acid profiles and menaquinone composition confirmed the affiliation of the new strain to the genus Micromonospora and also highlighted differences at the species level. These studies were finally complemented with an array of physiological tests to help differentiate between the new strain and its phylogenetic neighbours. Consequently, strain CR18(T) (= CECT 7890(T) = DSM 45598(T)) is proposed as the type strain of a novel species, Micromonospora halotolerans sp. nov.
Gancberg, David; Corbisier, Philippe; Meeus, Nele; Marki-Zay, Janos; Mannhalter, Christine; Schimmel, Heinz
2008-01-01
There is a need for reference materials (RMs) in the field of genetic testing for verification of test results obtained in patients and probands. For the frequent genetic variation G20210A in the prothrombin gene, it has been shown that purified plasmids containing the gene fragment harbouring the mutation constitute good candidate RMs. Plasmid-type RMs were characterised for homogeneity, stability, sequence identity and fitness for purpose. Their certification required the use of different real-time PCR methods for genotyping and quantification of the plasmid copy number. Homogeneity, stability and fitness for the purpose of the plasmids could be demonstrated. The long-term stability (up to 24 months) of the materials was confirmed by highly sensitive and specific quantitative real-time PCR methods. New types of certified RMs (CRMs) for genetic testing of the human prothrombin gene G20210A sequence variant are available. Their fitness for purpose was demonstrated and no evidence was found that they would not work with other methods as long as these are targeting the whole or parts of the prothrombin gene fragment inserted into the plasmids. The described CRMs support the efforts of the international community in development, validation and harmonisation of tests for molecular genetic testing.
Ahlstrom, Christina A; Bonnedahl, Jonas; Woksepp, Hanna; Hernandez, Jorge; Olsen, Björn; Ramey, Andrew M
2018-05-09
Antimicrobial resistance (AMR) in bacterial pathogens threatens global health, though the spread of AMR bacteria and AMR genes between humans, animals, and the environment is still largely unknown. Here, we investigated the role of wild birds in the epidemiology of AMR Escherichia coli. Using next-generation sequencing, we characterized cephalosporin-resistant E. coli cultured from sympatric gulls and bald eagles inhabiting a landfill habitat in Alaska to identify genetic determinants conferring AMR, explore potential transmission pathways of AMR bacteria and genes at this site, and investigate how their genetic diversity compares to isolates reported in other taxa. We found genetically diverse E. coli isolates with sequence types previously associated with human infections and resistance genes of clinical importance, including blaCTX-M and blaCMY. Identical resistance profiles were observed in genetically unrelated E. coli isolates from both gulls and bald eagles. Conversely, isolates with indistinguishable core-genomes were found to have different resistance profiles. Our findings support complex epidemiological interactions including bacterial strain sharing between gulls and bald eagles and horizontal gene transfer among E. coli harboured by birds. Results suggest that landfills may serve as a source for AMR acquisition and/or maintenance, including bacterial sequence types and AMR genes relevant to human health.
Molecular dynamics study of some non-hydrogen-bonding base pair DNA strands
NASA Astrophysics Data System (ADS)
Tiwari, Rakesh K.; Ojha, Rajendra P.; Tiwari, Gargi; Pandey, Vishnudatt; Mall, Vijaysree
2018-05-01
In order to elucidate the structural activity of hydrophobic modified DNA, the DMMO2-D5SICS base pair is introduced as a constituent in different sets of 12-mer and 14-mer DNA sequences for molecular dynamics (MD) simulation in explicit water solvent. The AMBER 14 force field was employed for each duplex during the 200 ns production-dynamics simulation in an orthogonal box of water, using the Particle-Mesh-Ewald (PME) method under infinite periodic boundary conditions (PBC), to determine conformational parameters of the complex. The force-field parameters of the modified base pair were calculated with the Gaussian code using the Hartree-Fock ab initio methodology. RMSD results reveal that the conformation of the duplex is sequence dependent and that the binding energy of the complex depends on the position of the modified base pair in the nucleic acid strand. We found that non-bonding energy made a significant contribution to stabilising this type of duplex in comparison to electrostatic energy. The distortion produced within strands by this type of base pair was local and destabilised the duplex integrity near the substitution site; moreover, the binding energy of the duplex depends on the position of substitution of the hydrophobic base pair and on the DNA sequence, which strongly supports the corresponding experimental study.
The present and future of de novo whole-genome assembly.
Sohn, Jang-Il; Nam, Jin-Wu
2018-01-01
Since the advent of next-generation sequencing (NGS) technology, various de novo assembly algorithms based on the de Bruijn graph have been developed to construct chromosome-level sequences. However, numerous technical or computational challenges in de novo assembly still remain, although many bright ideas and heuristics have been suggested to tackle the challenges in both experimental and computational settings. In this review, we categorize de novo assemblers on the basis of the type of de Bruijn graphs (Hamiltonian and Eulerian) and discuss the challenges of de novo assembly for short NGS reads regarding computational complexity and assembly ambiguity. Then, we discuss how the limitations of the short reads can be overcome by using a single-molecule sequencing platform that generates long reads of up to several kilobases. In fact, the long read assembly has caused a paradigm shift in whole-genome assembly in terms of algorithms and supporting steps. We also summarize (i) hybrid assemblies using both short and long reads and (ii) overlap-based assemblies for long reads and discuss their challenges and future prospects. This review provides guidelines to determine the optimal approach for a given input data type, computational budget or genome. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
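The Eulerian de Bruijn graph formulation mentioned in this abstract can be sketched in a few lines of Python. This is a toy illustration under strong assumptions (error-free reads, each distinct k-mer used once, no reverse complements, a graph that forms a single path); the function names and the example reads are hypothetical and do not correspond to any particular assembler:

```python
from collections import defaultdict


def de_bruijn_graph(reads, k):
    """Build a de Bruijn graph: nodes are (k-1)-mers, edges are distinct k-mers."""
    kmers = {read[i:i + k] for read in reads for i in range(len(read) - k + 1)}
    graph = defaultdict(list)
    for kmer in sorted(kmers):  # sorted only to make the edge order deterministic
        graph[kmer[:-1]].append(kmer[1:])
    return dict(graph)


def eulerian_path(graph):
    """Return the node sequence of an Eulerian path (stack-based Hierholzer walk)."""
    adj = {node: list(targets) for node, targets in graph.items()}
    indegree = defaultdict(int)
    for targets in adj.values():
        for target in targets:
            indegree[target] += 1
    nodes = set(adj) | set(indegree)
    # Start at the node with one more outgoing than incoming edge, if any.
    start = next(
        (n for n in nodes if len(adj.get(n, [])) - indegree[n] == 1),
        next(iter(adj)),
    )
    stack, path = [start], []
    while stack:
        node = stack[-1]
        if adj.get(node):
            stack.append(adj[node].pop())  # follow an unused edge
        else:
            path.append(stack.pop())       # dead end: emit node
    path.reverse()
    return path


def spell(path):
    """Spell the sequence described by a path of overlapping (k-1)-mers."""
    return path[0] + "".join(node[-1] for node in path[1:])


# Two overlapping toy reads (made up for illustration).
toy_reads = ["ACGTT", "GTTCA"]
graph = de_bruijn_graph(toy_reads, k=3)
print(spell(eulerian_path(graph)))
```

Real short-read assemblers additionally have to deal with sequencing errors, repeats that create branching nodes, k-mer multiplicities and both DNA strands, which is where the computational complexity and assembly ambiguity discussed in the review arise.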
EnsMart: A Generic System for Fast and Flexible Access to Biological Data
Kasprzyk, Arek; Keefe, Damian; Smedley, Damian; London, Darin; Spooner, William; Melsopp, Craig; Hammond, Martin; Rocca-Serra, Philippe; Cox, Tony; Birney, Ewan
2004-01-01
The EnsMart system (www.ensembl.org/EnsMart) provides a generic data warehousing solution for fast and flexible querying of large biological data sets and integration with third-party data and tools. The system consists of a query-optimized database and interactive, user-friendly interfaces. EnsMart has been applied to Ensembl, where it extends its genomic browser capabilities, facilitating rapid retrieval of customized data sets. A wide variety of complex queries, on various types of annotations, for numerous species are supported. These can be applied to many research problems, ranging from SNP selection for candidate gene screening, through cross-species evolutionary comparisons, to microarray annotation. Users can group and refine biological data according to many criteria, including cross-species analyses, disease links, sequence variations, and expression patterns. Both tabulated list data and biological sequence output can be generated dynamically, in HTML, text, Microsoft Excel, and compressed formats. A wide range of sequence types, such as cDNA, peptides, coding regions, UTRs, and exons, with additional upstream and downstream regions, can be retrieved. The EnsMart database can be accessed via a public Web site, or through a Java application suite. Both implementations and the database are freely available for local installation, and can be extended or adapted to 'non-Ensembl' data sets. PMID:14707178
Joseph, Susan; Forsythe, Stephen J.
2012-01-01
Cronobacter spp. (previously known as Enterobacter sakazakii) is a bacterial pathogen affecting all age groups, with particularly severe clinical complications in neonates and infants. One recognized route of infection is the consumption of contaminated infant formula. As a recently recognized bacterial pathogen of considerable importance and regulatory control, appropriate detection and identification schemes are required. The application of multilocus sequence typing (MLST) and analysis (MLSA) of the seven alleles atpD, fusA, glnS, gltB, gyrB, infB, and ppsA (concatenated length 3036 base pairs) has led to considerable advances in our understanding of the genus. This approach is supported by both the reliability of DNA sequencing over subjective phenotyping and the establishment of an MLST database which has open access and is also curated (http://www.pubMLST.org/cronobacter). MLST has been used to describe the diversity of the newly recognized genus, was instrumental in the formal recognition of new Cronobacter species (C. universalis and C. condimenti), and revealed the high clonality of strains and the association of clonal complex 4 with neonatal meningitis cases. Clearly the MLST approach has considerable benefits over the use of non-DNA sequence based methods of analysis for newly emergent bacterial pathogens. The application of MLST and MLSA has dramatically enabled us to better understand this opportunistic bacterium which can cause irreparable damage to a newborn baby’s brain, and has contributed to improved control measures to protect neonatal health. PMID:23189075
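The seven-locus scheme described in this abstract works by turning each locus sequence into an allele number and the resulting seven-number profile into a sequence type, while MLSA uses the concatenated sequences. A minimal sketch in Python of that bookkeeping; only the locus names come from the abstract, whereas the function names, allele sequences, allele numbers and sequence type are made-up placeholders and do not reflect the contents or interface of the curated database:

```python
# Locus order as listed in the abstract.
LOCI = ("atpD", "fusA", "glnS", "gltB", "gyrB", "infB", "ppsA")


def allelic_profile(isolate, allele_db):
    """Map each locus sequence to its allele number (None if the allele is unknown)."""
    return tuple(allele_db[locus].get(isolate[locus]) for locus in LOCI)


def sequence_type(profile, st_table):
    """Look up the sequence type for a seven-number profile (None if not defined)."""
    return st_table.get(profile)


def concatenated_sequence(isolate):
    """Join the seven locus sequences in scheme order, as used for MLSA."""
    return "".join(isolate[locus] for locus in LOCI)


# Hypothetical toy data: two 4-bp "alleles" per locus and one defined profile.
toy_allele_db = {locus: {"ACGT": 1, "ACGA": 2} for locus in LOCI}
toy_st_table = {(1, 1, 1, 1, 2, 1, 1): 101}
toy_isolate = {locus: "ACGT" for locus in LOCI}
toy_isolate["gyrB"] = "ACGA"

profile = allelic_profile(toy_isolate, toy_allele_db)
print(profile, sequence_type(profile, toy_st_table))
print(len(concatenated_sequence(toy_isolate)))
```

In the real scheme the concatenated alignment is 3036 base pairs long, and new alleles or profiles are assigned numbers by the database curators rather than locally.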
Bourland, William; Rotterova, Johana; Čepička, Ivan
2017-06-01
Metopid ciliates occupy terrestrial, freshwater, and marine habitats worldwide, playing important roles as predominant consumers of bacteria, flagellates, algae, and diatoms in hypoxic environments. Metopus and Brachonella are the most species-rich metopid genera; however, most of their species have not been studied by modern methods. Here, we report the morphologic, morphometric and molecular characterization, and phylogeny of Metopus es and Brachonella contorta, both types of their respective genera, collected in a broad global sampling effort. Five strains of M. es and three strains of B. contorta were studied in detail, providing the first correlation of morphology, morphometrics, and 18S rRNA gene sequencing for both. We submitted 29 new 18S rRNA gene sequences to GenBank. Phylogenetic analyses yielded trees of similar topology. A strongly supported Metopus es clade is sister to the Brachonella contorta clade. Our analysis shows genus Metopus is not monophyletic. The monophyly of Brachonella cannot yet be determined due to lack of sequences for other species of this genus in molecular databases. Both species appear to have a global distribution. Metopus es was not found in Africa, probably reflecting low sampling effort. Strains of both species showed low 18S rRNA gene sequence divergence despite wide geographic separation. Copyright © 2016 Elsevier GmbH. All rights reserved.
Leao, Sylvia Cardoso; Tortoli, Enrico; Viana-Niero, Cristina; Ueki, Suely Yoko Mizuka; Lima, Karla Valeria Batista; Lopes, Maria Luiza; Yubero, Jesus; Menendez, Maria Carmen; Garcia, Maria Jesus
2009-01-01
An outbreak of postsurgical infections caused by rapidly growing mycobacteria has been ongoing in Brazil since 2004. The degrees of similarity of the rpoB and hsp65 sequences from the clinical isolates and the corresponding sequences from both the Mycobacterium massiliense and the M. bolletii type strains were above the accepted limit for interspecies variability, leading to conflicting identification results. Therefore, an extensive characterization of members of the M. chelonae-M. abscessus group was carried out. The M. abscessus, M. chelonae, M. immunogenum, M. massiliense, and M. bolletii type strains and a subset of clinical isolates were analyzed by biochemical tests, high-performance liquid chromatography, drug susceptibility testing, PCR-restriction enzyme analysis of hsp65 (PRA-hsp65), rpoB, and hsp65 gene sequencing and analysis of phylogenetic trees, DNA-DNA hybridization (DDH), and restriction fragment length polymorphism (RFLP) analysis of the 16S rRNA gene (RFLP-16S rRNA). The clinical isolates and the M. abscessus, M. massiliense, and M. bolletii type strains could not be separated by phenotypic tests and were grouped in the phylogenetic trees obtained. The results of DDH also confirmed the >70% relatedness of the clinical isolates and the M. abscessus, M. massiliense, and M. bolletii type strains; and indistinguishable RFLP-16S rRNA patterns were obtained. On the contrary, the separation of clinical isolates and the M. abscessus, M. massiliense, and M. bolletii type strains from M. chelonae and M. immunogenum was supported by the results of PRA-hsp65, DDH, and RFLP-16S rRNA and by the rpoB and hsp65 phylogenetic trees. Taken together, these results led to the proposition that M. abscessus, M. massiliense, and M. bolletii represent a single species, that of M. abscessus. Two subspecies are also proposed, M. abscessus subsp. abscessus and M. abscessus subsp. 
massiliense, and these two subspecies can be distinguished by two different PRA-hsp65 patterns, which differ by a single HaeIII band, and by differences in their rpoB (3.4%) and hsp65 (1.3%) sequences. PMID:19571015
Scheuner, Carmen; Tindall, Brian J.; Lu, Megan; ...
2014-12-08
Planctomyces brasiliensis Schlesner 1990 belongs to the order Planctomycetales, which differs from other bacterial taxa by several distinctive features such as internal cell compartmentalization, multiplication by forming buds directly from the spherical, ovoid or pear-shaped mother cell and a cell wall consisting of a proteinaceous layer rather than a peptidoglycan layer. The first strains of P. brasiliensis, including the type strain IFAM 1448 T, were isolated from a water sample of Lagoa Vermelha, a salt pit near Rio de Janeiro, Brazil. This is the second completed genome sequence of a type strain of the genus Planctomyces to be published and the sixth type strain genome sequence from the family Planctomycetaceae. The 6,006,602 bp long genome with its 4,811 protein-coding and 54 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project. Phylogenomic analyses indicate that the classification within the Planctomycetaceae is partially in conflict with its evolutionary history, as the positioning of Schlesneria renders the genus Planctomyces paraphyletic. A re-analysis of published fatty-acid measurements also does not support the current arrangement of the two genera. A quantitative comparison of phylogenetic and phenotypic aspects indicates that the three Planctomyces species with type strains available in public culture collections should be placed in separate genera. Thus the genera Gimesia, Planctopirus and Rubinisphaera are proposed to accommodate P. maris, P. limnophilus and P. brasiliensis, respectively. Pronounced differences between the reported G + C content of Gemmata obscuriglobus, Singulisphaera acidiphila and Zavarzinella formosa and the G + C content calculated from their genome sequences call for emendation of their species descriptions. Lastly, in addition to other features, the range of G + C values reported for the genera within the Planctomycetaceae indicates that the descriptions of the family and the order should be emended.
Fitzgerald, Jamie; Holden, Paul; Wright, Hollis; Wilmot, Beth; Hata, Abigail; Steiner, Robert D.; Basel, Don
2016-01-01
Background: Osteogenesis imperfecta (OI) type V is a dominantly inherited skeletal dysplasia characterized by fractures and progressive deformity of long bones. In addition, patients often present with radial head dislocation, hyperplastic callus, and calcification of the forearm interosseous membrane. Recently, a specific mutation in the IFITM5 gene was found to be responsible for OI type V. This mutation, a C to T transition 14 nucleotides upstream from the endogenous start codon, creates a new start methionine that appears to be preferentially used by the translational machinery. However, the mechanism by which the lengthened protein results in a dominant type of OI is unknown. Methods and Results: We report 7 ethnically diverse (African-American, Caucasian, Hispanic, and African) individuals with OI type V from 2 families and 2 sporadic cases. Exome sequencing failed to identify a causative mutation. Using Sanger sequencing, we found that all affected individuals in our cohort possess the c.−14 IFITM5 variant, further supporting the notion that OI type V is caused by a single, discrete mutation. Our patient cohort demonstrated inter- and intrafamilial phenotypic variability, including a father with classic OI type V whose daughter had a phenotype similar to OI type I. This clinical variability suggests that modifier genes influence the OI type V phenotype. We also confirm that the mutation creates an aberrant IFITM5 protein containing an additional 5 amino acids at the N-terminus. Conclusions: The variable clinical signs in these cases illustrate the significant variability of the OI type V phenotype caused by the c.−14 IFITM5 mutation. The affected individuals are more ethnically diverse than previously reported. PMID:28824928
Chromosomal 16S Ribosomal RNA Methyltransferase RmtE1 in Escherichia coli Sequence Type 448
Li, Bin; Pacey, Marissa P.
2017-01-01
We identified rmtE1, an uncommon 16S ribosomal methyltransferase gene, in an aminoglycoside- and cephalosporin-resistant Escherichia coli sequence type 448 clinical strain co-harboring blaCMY-2. Long-read sequencing revealed insertion of a 101,257-bp fragment carrying both resistance genes into the chromosome. Our findings underscore E. coli sequence type 448 as a potential high-risk multidrug-resistant clone. PMID:28418308
2012-03-01
responsible for self-organizing an appropriate network infrastructure with multi-hop connection between sensor nodes. The network is self-healing ...a self-destruct mechanism that will flood the casing with water in the event that the mine is separated from its mooring. Provided that this does...mechanically severed from its mooring cable, would then initiate its self-destruct sequence whereby the mine is flooded. Then, depending upon the type of
Liang, Di; Wang, Tianfang; Rotgans, Bronwyn A.; McManus, Donald P.; Cummins, Scott F.
2016-01-01
Biomphalaria glabrata (B. glabrata) is an air-breathing aquatic mollusc found in freshwater habitats across the Western Hemisphere. It is best known for its capacity to act as a major intermediate host for Schistosoma mansoni, the human blood fluke parasite. Ionotropic receptors (IRs), a variant family of the ionotropic glutamate receptors (iGluR), have an evolutionarily ancient function in detecting odors to initiate chemosensory signaling. In this study, we applied an array of methods towards the goal of identifying IR-like family members in B. glabrata, ultimately revealing two types, the iGluR and IR. Sequence alignment showed that three ligand-binding residues are conserved in most Biomphalaria iGluR sequences, while the IRs exhibited a variable pattern, lacking some or all known glutamate-interacting residues, supporting their distinct classification from the iGluRs. We show that B. glabrata contains 7 putative IRs, some of which are expressed within its chemosensory organs. To further investigate a role for the more ancient IR25a type in chemoreception, we tested its spatial distribution pattern within the snail cephalic tentacle by in situ hybridization. The presence of IR25a within presumptive sensory neurons supports a role for this receptor in olfactory processing, contributing to our understanding of the molecular pathways that are involved in Biomphalaria olfactory processing. PMID:27253696
Sequencing intractable DNA to close microbial genomes.
Hurt, Richard A; Brown, Steven D; Podar, Mircea; Palumbo, Anthony V; Elias, Dwayne A
2012-01-01
Advances in high-throughput DNA sequencing technologies have supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Difficult-to-sequence regions in microbial genomes are often ruled "intractable", resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC-rich secondary structures, making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation and provide a step-wise method that reduces the effort required for genome finishing.
Analysis of whole genome sequencing for the Escherichia coli O157:H7 typing phages.
Cowley, Lauren A; Beckett, Stephen J; Chase-Topping, Margo; Perry, Neil; Dallman, Tim J; Gally, David L; Jenkins, Claire
2015-04-08
Shiga toxin-producing Escherichia coli O157 can cause severe bloody diarrhea and haemolytic uraemic syndrome. Phage typing of E. coli O157 facilitates public health surveillance and outbreak investigations; certain phage types are more likely to occupy specific niches and are associated with specific age groups and disease severity. The aim of this study was to analyse the genome sequences of 16 (fourteen T4 and two T7) E. coli O157 typing phages and to determine the genes responsible for the subtle differences in phage type profiles. The typing phages were sequenced using paired-end Illumina sequencing at The Genome Analysis Centre and the Animal Health and Veterinary Laboratories Agency, and bioinformatics programs including Velvet, Brig and Easyfig were used to analyse them. A two-way Euclidean cluster analysis highlighted the associations between groups of phage types and typing phages. The analysis showed that the T7 typing phages (9 and 10) differed by only three genes and that the T4 typing phages formed three distinct groups of similar genomic sequences: Group 1 (1, 8, 11, 12 and 15, 16), Group 2 (3, 6, 7 and 13) and Group 3 (2, 4, 5 and 14). The E. coli O157 phage typing scheme exhibited a significantly modular network linked to the genetic similarity of each group, showing that these groups are specialised to infect a subset of phage types. Sequencing the typing phages has enabled us to identify the variable genes within each group and to determine how this corresponds to changes in phage type.
Existence of host-related DNA sequences in the schistosome genome.
Iwamura, Y; Irie, Y; Kominami, R; Nara, T; Yasuraoka, K
1991-06-01
DNA sequences homologous to the mouse intracisternal A particle and endogenous type C retrovirus were detected in the DNAs of Schistosoma japonicum adults and S. mansoni eggs. Furthermore, other kinds of repetitive sequences in the host genome, such as mouse type 1 Alu sequence (B1), mouse type 2 Alu sequence (B2) and the mo-2 sequence, a mouse mini-satellite, were also detected in the DNAs from adults and eggs of S. japonicum and eggs of S. mansoni. Almost all of the sequences described above were absent in the DNAs of S. mansoni adults. The DNA fingerprints of schistosomes, using the mo-2 sequence, were indistinguishable from each other and resembled those of their murine hosts. Moreover, the mo-2 sequence was hypermethylated in the DNAs of schistosomes and its amount varied among them. These facts indicate that host-related sequences are actually present in schistosomes and that the mo-2 repetitive sequence probably exists extrachromosomally.
The Processing on Different Types of English Formulaic Sequences
ERIC Educational Resources Information Center
Qian, Li
2015-01-01
Formulaic sequences are found to be processed faster than their matched novel phrases in previous studies. Given the variety of formulaic types, few studies have compared processing on different types of formulaic sequences. The present study explored the processing among idioms, speech formulae and written formulae. It has been found that in…
Sequencing artifacts in the type A influenza database and attempts to correct them
USDA-ARS's Scientific Manuscript database
Currently over 300,000 Type A influenza gene sequences representing over 50,000 strains are available in publicly available databases. However, the quality of the sequences submitted is determined by the contributor and many sequence errors are present in the databases, which can affect the result...
Odegaard, Justin I; Vincent, John J; Mortimer, Stefanie; Vowles, James V; Ulrich, Bryan C; Banks, Kimberly C; Fairclough, Stephen R; Zill, Oliver A; Sikora, Marcin; Mokhtari, Reza; Abdueva, Diana; Nagy, Rebecca J; Lee, Christine E; Kiedrowski, Lesli A; Paweletz, Cloud P; Eltoukhy, Helmy; Lanman, Richard B; Chudova, Darya I; Talasaz, AmirAli
2018-04-24
Purpose: To analytically and clinically validate a circulating cell-free tumor DNA sequencing test for comprehensive tumor genotyping and demonstrate its clinical feasibility. Experimental Design: Analytic validation was conducted according to established principles and guidelines. Blood-to-blood clinical validation comprised blinded external comparison with clinical droplet digital PCR across 222 consecutive biomarker-positive clinical samples. Blood-to-tissue clinical validation comprised comparison of digital sequencing calls to those documented in the medical record of 543 consecutive lung cancer patients. Clinical experience was reported from 10,593 consecutive clinical samples. Results: Digital sequencing technology enabled variant detection down to 0.02% to 0.04% allelic fraction/2.12 copies with ≤0.3%/2.24-2.76 copies 95% limits of detection while maintaining high specificity [prevalence-adjusted positive predictive values (PPV) >98%]. Clinical validation using orthogonal plasma- and tissue-based clinical genotyping across >750 patients demonstrated high accuracy and specificity [positive percent agreement (PPAs) and negative percent agreement (NPAs) >99% and PPVs 92%-100%]. Clinical use in 10,593 advanced adult solid tumor patients demonstrated high feasibility (>99.6% technical success rate) and clinical sensitivity (85.9%), with high potential actionability (16.7% with FDA-approved on-label treatment options; 72.0% with treatment or trial recommendations), particularly in non-small cell lung cancer, where 34.5% of patient samples comprised a directly targetable standard-of-care biomarker. Conclusions: High concordance with orthogonal clinical plasma- and tissue-based genotyping methods supports the clinical accuracy of digital sequencing across all four types of targetable genomic alterations. Digital sequencing's clinical applicability is further supported by high rates of technical success and biomarker target discovery. Clin Cancer Res; 1-11. 
©2018 AACR.
Effect of Public Deliberation on Attitudes toward Return of Secondary Results in Genomic Sequencing
Gornick, Michele C.; Scherer, Aaron M.; Sutton, Erica J.; Ryan, Kerry A.; Exe, Nicole L.; Li, Ming; Uhlmann, Wendy R.; Kim, Scott Y.H.; Roberts, J. Scott; De Vries, Raymond G.
2016-01-01
The increased use of genomic sequencing in clinical diagnostics and therapeutics makes imperative the development of guidelines and policies about how to handle secondary findings. For reasons both practical and ethical, the creation of these guidelines must take into consideration the informed opinions of the lay public. As part of a larger Clinical Sequencing Exploratory Research (CSER) consortium project, we organized a deliberative democracy (DD) session that engaged 66 participants in dialogue about the benefits and risks associated with the return of secondary findings from clinical genomic sequencing. Participants were educated about the scientific and ethical aspects of the disclosure of secondary findings by experts in medical genetics and bioethics, and then engaged in facilitated discussion of policy options for the disclosure of three types of secondary findings: 1) medically actionable results; 2) adult onset disorders found in children; and 3) carrier status. Participants’ opinions were collected via surveys administered one month before, immediately following, and one month after the DD session. Post DD session, participants were significantly more willing to support policies that do not allow access to secondary findings related to adult onset conditions in children (χ²(2, N = 62) = 13.300, p = 0.001) or carrier status (χ²(2, N = 60) = 11.375, p = 0.003). After one month, the level of support for the policy denying access to secondary findings regarding adult-onset conditions remained significantly higher than the pre-DD level, although less than immediately post-DD (χ²(1, N = 60) = 2.465, p = 0.041). Our findings suggest that education and deliberation enhance public appreciation of the scientific and ethical complexities of genome sequencing. PMID:27307100
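As a rough reading aid for the two statistics reported above with two degrees of freedom, the upper-tail probability of a chi-square variable with df = 2 has the closed form p = exp(-x/2). The sketch below applies that formula; the function name is illustrative and not taken from the study, and the abstract does not state which software or exact test the authors used.

```python
import math


def chi2_sf_df2(x: float) -> float:
    """Upper-tail p-value of a chi-square statistic with 2 degrees of freedom.

    For df = 2 the chi-square distribution is exponential with mean 2,
    so the survival function is exp(-x / 2).
    """
    if x < 0:
        raise ValueError("chi-square statistic must be non-negative")
    return math.exp(-x / 2.0)


# Statistics as quoted in the abstract (df = 2 in both cases).
for label, stat in [("adult-onset conditions in children", 13.300),
                    ("carrier status", 11.375)]:
    print(f"{label}: chi2 = {stat}, p = {chi2_sf_df2(stat):.3f}")
```

Both values round to the p-values quoted in the abstract (0.001 and 0.003).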
Rebehmed, Joseph; Quintus, Flavien; Mornon, Jean-Paul; Callebaut, Isabelle
2016-05-01
Several studies have highlighted the leading role of the sequence periodicity of polar and nonpolar amino acids (binary patterns) in the formation of regular secondary structures (RSS). However, these were based on the analysis of only a few simple cases, with no direct means to correlate binary patterns with the limits of RSS. Here, HCA-derived hydrophobic clusters (HC), which are conditioned binary patterns whose positions fit well those of RSS, were considered. All the HC types, defined by unique binary patterns, which were commonly observed in three-dimensional (3D) structures of globular domains, were analyzed. The 180 HC types with preferences for either α-helices or β-strands distinctly contain basic binary units typical of these RSS. Therefore a general trend supporting the "binary pattern preference" assumption was observed. HC for which observed RSS are in disagreement with their expected behavior (discordant HC) were also examined. They were separated into HC types with moderate preferences for RSS, having "weak" binary patterns and versatile RSS, and HC types with high preferences for RSS, having "strong" binary patterns and then displaying nonpolar amino acids at the protein surface. It was shown that in both cases, discordant HC could be distinguished from concordant ones by well-differentiated amino acid compositions. The obtained results could thus help to complement the currently available methods for the accurate prediction of secondary structures in proteins from a single amino acid sequence alone. This can be especially useful for characterizing orphan sequences and for assisting protein engineering and design. © 2016 Wiley Periodicals, Inc.
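To make the idea of a binary pattern concrete, the following is a minimal sketch, not the authors' implementation. It assumes the hydrophobic alphabet commonly used in hydrophobic cluster analysis (V, I, L, F, M, Y, W) and a separation rule of at least four non-hydrophobic residues between clusters; the special handling of proline as a cluster breaker is deliberately left out, and all function names are hypothetical.

```python
# Assumption: the strong-hydrophobic alphabet commonly used in HCA.
HYDROPHOBIC = set("VILFMYW")


def binary_pattern(seq: str) -> str:
    """Encode a protein sequence as 1 (hydrophobic) / 0 (any other residue)."""
    return "".join("1" if aa in HYDROPHOBIC else "0" for aa in seq.upper())


def hydrophobic_clusters(seq: str, min_gap: int = 4):
    """Return (start_index, pattern) for each cluster.

    A cluster starts and ends on a hydrophobic residue; two hydrophobic
    residues belong to different clusters when at least `min_gap`
    non-hydrophobic residues lie between them (simplified rule).
    """
    pattern = binary_pattern(seq)
    clusters = []
    start = None      # index of the first '1' in the current cluster
    last_one = None   # index of the most recent '1'
    for i, bit in enumerate(pattern):
        if bit != "1":
            continue
        if start is None:
            start = i
        elif i - last_one - 1 >= min_gap:
            clusters.append((start, pattern[start:last_one + 1]))
            start = i
        last_one = i
    if start is not None:
        clusters.append((start, pattern[start:last_one + 1]))
    return clusters


# Toy sequence, invented for illustration only.
print(binary_pattern("AVLKAIGSGSGFDWE"))
print(hydrophobic_clusters("AVLKAIGSGSGFDWE"))
```

Each returned pattern string is the kind of "HC type" key that could then be tallied against observed helices and strands.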
Wen, B; Rikihisa, Y; Fuerst, P A; Chaichanasiriwithaya, W
1995-04-01
Ehrlichia risticii is the causative agent of Potomac horse fever. Variations among the major antigens of different local E. risticii strains have been detected previously. To further assess genetic variability in this species or species complex, the sequences of the 16S rRNA genes of several isolates obtained from sick horses diagnosed as having Potomac horse fever were determined. The sequences of six isolates obtained from Ohio and three isolates obtained from Kentucky were amplified by PCR. Three groups of sequences were identified. The sequences of five of the Ohio isolates were identical to the sequence of the type strain of E. risticii, the Illinois strain. The sequence of one Ohio isolate, isolate 081, was unique; this sequence differed in 10 nucleotides from the sequence of the type strain (level of similarity, 99.3%). The sequences of the three Kentucky isolates were identical to each other, but differed by five bases from the sequence of the type strain (level of similarity, 99.6%). The levels of sequence similarity of isolate 081, the Kentucky isolates, and the type strain to the next most closely related Ehrlichia sp., Ehrlichia sennetsu, were 99.3, 99.2, and 99.2%, respectively. On the basis of the distinct antigenic profiles and the levels of 16S rRNA sequence divergence, isolate 081 is as divergent from the type strain of E. risticii as E. sennetsu is. Therefore, we suggest that strain 081 and the Kentucky isolates may represent two new distinct Ehrlichia species.
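The similarity levels quoted above follow from counting mismatches over the compared positions of aligned sequences. The sketch below shows that calculation for two pre-aligned sequences; skipping gap columns is an assumption, since the abstract does not say how gaps were treated, and the toy sequences are invented.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Columns where either sequence has a gap ('-') are skipped; this is
    one common convention, assumed here rather than taken from the study.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# Toy alignment: one mismatch in ten compared positions.
print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))
```

By the same arithmetic, 10 differences at a similarity of 99.3% imply on the order of 1,400 compared positions, which is in line with a near-full-length 16S rRNA gene.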
Crescenzo-Chaigne, Bernadette; Barbezange, Cyril; van der Werf, Sylvie
2008-01-01
Background: The transcription/replication of the influenza viruses involves the terminal nucleotide sequences of viral RNA, which comprise sequences at the extremities conserved among the genomic segments as well as variable 3' and 5' non-coding (NC) regions. The plasmid-based system for the in vivo reconstitution of functional ribonucleoproteins, upon expression of viral-like RNAs together with the nucleoprotein and polymerase proteins, has been widely used to analyze transcription/replication of influenza viruses. It was thus shown that the type A polymerase could transcribe and replicate type A, B, or C vRNA templates whereas neither type B nor type C polymerases were able to transcribe and replicate type A templates efficiently. Here we studied the importance of the NC regions from the seven segments of type C influenza virus for efficient transcription/replication by the type A and C polymerases. Results: The NC sequences of the seven genomic segments of the type C influenza virus C/Johannesburg/1/66 strain were found to be more variable in length than those of the type A and B viruses. The levels of transcription/replication of viral-like vRNAs harboring the NC sequences of the respective type C virus segments flanking the CAT reporter gene were comparable in the presence of either type C or type A polymerase complexes except for the NS and PB2-like vRNAs. For the NS-like vRNA, the transcription/replication level was higher after introduction of a U residue at position 6 in the 5' NC region as for all other segments. For the PB2-like vRNA the CAT expression level was particularly reduced with the type C polymerase. Analysis of mutants of the 5' NC sequence in the PB2-like vRNA, the shortest 5' NC sequence among the seven segments, showed that additional sequences within the PB2 ORF were essential for the efficiency of transcription but not replication by the type C polymerase complex.
Conclusion: In the context of a PB2-like reporter vRNA template, the sequence upstream of the polyU stretch plays a role in the transcription/replication process by the type C polymerase complex. PMID:18973655
Poomtien, Jamroonsri; Jindamorakot, Sasitorn; Limtong, Savitree; Pinphanichakarn, Pairoh; Thaniyavarn, Jiraporn
2013-01-01
Three yeast strains were isolated from industrial wastes in Thailand. Based on the phylogenetic sequence analysis of the D1/D2 region of the large subunit rRNA gene, the internal transcribed spacer (ITS1-5.8S rRNA gene-ITS2; ITS1-2) region, and their physiological characteristics, the three strains were found to represent two novel species of ascomycetous anamorphic yeasts. Strain JP52(T) represents a novel species which was named Cyberlindnera samutprakarnensis sp. nov. (type strain JP52(T); = BCC 46825(T) = JCM 17816(T) = CBS 12528(T), MycoBank no. MB800879), which was differentiated from the closely related species Cyberlindnera mengyuniae CBS 10845(T) by 2.9 % sequence divergence in the D1/D2 region and 4.4 % sequence divergence in the ITS1-2. Strains JP59(T) and JP60 were identical in their D1/D2 and ITS1-2 regions, which were closely related to those of Scheffersomyces spartinae CBS 6059(T) by 0.9 and 1.0 % sequence divergence, respectively. In addition, supportive evidence from the actin gene and the translational elongation factor gene, each showing sequence divergence of 6.5 %, confirmed their distinct status. Furthermore, JP59(T) and JP60 differed from the closely related species in some biochemical and physiological characteristics. These two strains were assigned to a single novel species which was named Candida thasaenensis sp. nov. (type JP59(T) = BCC 46828(T) = JCM 17817(T) = CBS 12529(T), MycoBank no. MB800880).
Meats, Emma; Feil, Edward J.; Stringer, Suzanna; Cody, Alison J.; Goldstein, Richard; Kroll, J. Simon; Popovic, Tanja; Spratt, Brian G.
2003-01-01
A multilocus sequence typing (MLST) scheme has been developed for the unambiguous characterization of encapsulated and noncapsulated Haemophilus influenzae isolates. The sequences of internal fragments of seven housekeeping genes were determined for 131 isolates, comprising a diverse set of 104 serotype a, b, c, d, e, and f isolates and 27 noncapsulated isolates. Many of the encapsulated isolates had previously been characterized by multilocus enzyme electrophoresis (MLEE), and the validity of the MLST scheme was established by the very similar clustering of isolates obtained by these methods. Isolates of serotypes c, d, e, and f formed monophyletic groups on a dendrogram constructed from the differences in the allelic profiles of the isolates, whereas there were highly divergent lineages of both serotype a and b isolates. Noncapsulated isolates were distinct from encapsulated isolates and, with one exception, were within two highly divergent clusters. The relationships between the major lineages of encapsulated H. influenzae inferred from MLEE data could not be discerned on a dendrogram constructed from differences in the allelic profiles, but were apparent on a tree reconstructed from the concatenated nucleotide sequences. Recombination has not therefore completely eliminated phylogenetic signal, and in support of this, for encapsulated isolates, there was significant congruence between many of the trees reconstructed from the sequences of the seven individual loci. Congruence was less apparent for noncapsulated isolates, suggesting that the impact of recombination is greater among noncapsulated than encapsulated isolates. The H. influenzae MLST scheme is available at www.mlst.net; it allows any isolate to be compared with those in the MLST database via the Internet and, for encapsulated isolates, assigns isolates to their phylogenetic lineage. PMID:12682154
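The notion of an allelic profile and of "differences in the allelic profiles" can be sketched as follows. This is an illustration only: the allele numbers and isolate names are invented, the locus names are placeholders rather than the scheme's actual genes, and the sequence-type numbering is local to this example rather than the numbering used by the MLST database.

```python
from itertools import combinations

# Placeholder locus names for a seven-locus scheme (hypothetical).
LOCI = ("locus1", "locus2", "locus3", "locus4", "locus5", "locus6", "locus7")

# Invented allelic profiles: one allele number per locus, in LOCI order.
profiles = {
    "iso1": (1, 1, 1, 1, 1, 1, 1),
    "iso2": (1, 1, 1, 2, 1, 1, 1),
    "iso3": (3, 4, 2, 2, 5, 6, 7),
}


def allelic_distance(p, q) -> int:
    """Number of loci at which two allelic profiles carry different alleles."""
    if len(p) != len(q):
        raise ValueError("profiles must cover the same loci")
    return sum(1 for a, b in zip(p, q) if a != b)


def assign_sequence_types(profiles: dict) -> dict:
    """Give each distinct profile an arbitrary local ST number (example only)."""
    st_of_profile = {}
    assigned = {}
    for name in sorted(profiles):
        prof = profiles[name]
        if prof not in st_of_profile:
            st_of_profile[prof] = len(st_of_profile) + 1
        assigned[name] = st_of_profile[prof]
    return assigned


# Pairwise distances: the input a clustering method would use for a dendrogram.
for a, b in combinations(sorted(profiles), 2):
    print(a, b, allelic_distance(profiles[a], profiles[b]))
print(assign_sequence_types(profiles))
```

Because the distance counts only whether alleles differ, not how many nucleotides differ, a single recombination event and a single point mutation weigh the same, which is one reason the abstract contrasts this dendrogram with the tree built from concatenated sequences.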
Zhang, Yiming; Jin, Quan; Wang, Shuting; Ren, Ren
2011-05-01
The mobility behavior of 1481 peptides in ion mobility spectrometry (IMS), generated by protease digestion of the Drosophila melanogaster proteome, is modeled and predicted based on two different types of characterization methods, i.e. a sequence-based approach and a structure-based approach. In this procedure, the sequence-based approach considers both the amino acid composition of a peptide and the local environment profile of each amino acid in the peptide; the structure-based approach is performed with the CODESSA protocol, which regards a peptide as a common organic compound and generates more than 200 statistically significant variables to characterize the whole structure profile of a peptide molecule. Subsequently, the nonlinear support vector machine (SVM) and Gaussian process (GP) as well as linear partial least squares (PLS) regression are employed to correlate the structural parameters of the characterizations with the IMS drift times of these peptides. The obtained quantitative structure-spectrum relationship (QSSR) models are evaluated rigorously and investigated systematically via both one-deep and two-deep cross-validations as well as the rigorous Monte Carlo cross-validation (MCCV). We also give a comprehensive comparison of the statistics arising from the different combinations of variable types with modeling methods and find that the sequence-based approach gives QSSR models with better fitting ability and predictive power but worse interpretability than the structure-based approach. In addition, because QSSR modeling with the sequence-based approach does not require preparing energy-minimized peptide structures before the modeling, it is considerably more efficient than modeling with the structure-based approach. Copyright © 2011 Elsevier Ltd. All rights reserved.
Lee, Dong-Hun
2017-01-01
To determine the genetic and epidemiological relationship of infectious bronchitis virus (IBV) isolates from commercial poultry to attenuated live IBV vaccines, we conducted a phylogenetic network analysis on the full-length S1 sequence for Arkansas (Ark), Massachusetts (Mass) and Delmarva/1639 (DMV/1639) type viruses isolated in 2015 from clinical cases by 3 different diagnostic laboratories. Phylogenetic network analysis of Ark isolates showed two predominant groups linked by 2 mutations, consistent with subpopulations found in commercial vaccines for this IBV type. In addition, a number of satellite groups surrounding the two predominant populations were observed for the Ark type virus, which is likely due to mutations associated with the tendency of this vaccine to persist in flocks. The phylogenetic network analysis of Mass-type viruses shows two groupings corresponding to different manufacturers' vaccine sequences. No satellite groups were observed for Mass-type viruses, which is consistent with no persistence of this vaccine type in the field. At the time of collection, no vaccine was being used for the DMV/1639 type viruses and phylogenetic network analysis showed a dispersed network suggesting no clear change in genetic distribution. Selection pressure analysis showed that the DMV/1639 and Mass-type strains were evolving under negative selection, whereas the Ark type viruses had evolved under positive selection. These data support the hypothesis that live attenuated vaccine usage does play a role in the genetic profile of similar IB viruses in the field, and that phylogenetic network analysis can be used to identify vaccine and vaccine-origin isolates, which is important for our understanding of the role live vaccines play in the evolutionary trajectory of those viruses. PMID:28472110
Structural and sequence features of two residue turns in beta-hairpins.
Madan, Bharat; Seo, Sung Yong; Lee, Sun-Gu
2014-09-01
Beta-turns in beta-hairpins have been implicated as important sites in protein folding. In particular, two residue β-turns, the most abundant connecting elements in beta-hairpins, have been a major target for engineering protein stability and folding. In this study, we attempted to investigate and update the structural and sequence properties of two residue turns in beta-hairpins with a large data set. For this, 3977 beta-turns were extracted from 2394 nonhomologous protein chains and analyzed. First, the distribution, dihedral angles and twists of two residue turn types were determined, and compared with previous data. The trend of turn type occurrence and most structural features of the turn types were similar to previous results, but for the first time Type II turns in beta-hairpins were identified. Second, sequence motifs for the turn types were devised based on amino acid positional potentials of two-residue turns, and their distributions were examined. From this study, we could identify code-like sequence motifs for the two residue beta-turn types. Finally, structural and sequence properties of beta-strands in the beta-hairpins were analyzed, which revealed that the beta-strands showed no specific sequence and structural patterns for turn types. The analytical results in this study are expected to be a reference in the engineering or design of beta-hairpin turn structures and sequences. © 2014 Wiley Periodicals, Inc.
Lee, Chao-Hung; Helweg-Larsen, Jannik; Tang, Xing; Jin, Shaoling; Li, Baozheng; Bartlett, Marilyn S.; Lu, Jang-Jih; Lundgren, Bettina; Lundgren, Jens D.; Olsson, Mats; Lucas, Sebastian B.; Roux, Patricia; Cargnel, Antonietta; Atzori, Chiara; Matos, Olga; Smith, James W.
1998-01-01
Pneumocystis carinii f. sp. hominis isolates from 207 clinical specimens from nine countries were typed based on nucleotide sequence variations in the internal transcribed spacer regions I and II (ITS1 and ITS2, respectively) of rRNA genes. The number of ITS1 nucleotides has been revised from the previously reported 157 bp to 161 bp. Likewise, the number of ITS2 nucleotides has been changed from 177 to 192 bp. The number of ITS1 sequence types has increased from 2 to 15, and that of ITS2 has increased from 3 to 14. The 15 ITS1 sequence types are designated types A through O, and the 14 ITS2 types are named types a through n. A total of 59 types of P. carinii f. sp. hominis were found in this study. PMID:9508304
Barcellos, Leonardo H; Palmeiro, Marina Lobato; Naconecy, Marcos M; Geremia, Tomás; Cervieri, André; Shinkai, Rosemary S
2018-05-17
To compare the effects of different screw-tightening sequences and torque applications on stresses in implant-supported fixed complete dentures supported by five abutments. Strain gauges fixed to the abutments were used to test the sequences 2-4-3-1-5; 1-2-3-4-5; 3-2-4-1-5; and 2-5-4-1-3 with direct 10-Ncm torque or progressive torque (5 + 10 Ncm). Data were analyzed using analysis of variance and standardized effect size. No effects of tightening sequence or torque application were found except for the sequence 3-2-4-1-5 and some small to moderate effect sizes. Screw-tightening sequences and torque application modes have only a marginal effect on residual stresses.
A public HTLV-1 molecular epidemiology database for sequence management and data mining.
Araujo, Thessika Hialla Almeida; Souza-Brito, Leandro Inacio; Libin, Pieter; Deforche, Koen; Edwards, Dustin; de Albuquerque-Junior, Antonio Eduardo; Vandamme, Anne-Mieke; Galvao-Castro, Bernardo; Alcantara, Luiz Carlos Junior
2012-01-01
It is estimated that 15 to 20 million people are infected with the human T-cell lymphotropic virus type 1 (HTLV-1). At present, there are more than 2,000 unique HTLV-1 isolate sequences published. A central database to aggregate sequence information from a range of epidemiological aspects including HTLV-1 infections, pathogenesis, origins, and evolutionary dynamics would be useful to scientists and physicians worldwide. Here, we describe a database that collects and annotates sequence data and can be accessed through a user-friendly search interface. The HTLV-1 Molecular Epidemiology Database website is available at http://htlv1db.bahia.fiocruz.br/. All data were obtained from publications available at GenBank or through contact with the authors. The database was developed using Apache Webserver 2.1.6 and SGBD MySQL. The webpage interfaces were developed in HTML and server-side scripting written in PHP. The HTLV-1 Molecular Epidemiology Database is hosted on the Gonçalo Moniz/FIOCRUZ Research Center server. There are currently 2,457 registered sequences with 2,024 (82.37%) of those sequences representing unique isolates. Of these sequences, 803 (39.67%) contain information about clinical status (TSP/HAM, 17.19%; ATL, 7.41%; asymptomatic, 12.89%; other diseases, 2.17%; and no information, 60.32%). Further, 7.26% of sequences contain information on patient gender while 5.23% of sequences provide the age of the patient. The HTLV-1 Molecular Epidemiology Database retrieves and stores annotated HTLV-1 proviral sequences from clinical, epidemiological, and geographical studies. The collected sequences and related information are now accessible on a publicly available and user-friendly website. This open-access database will support clinical research and vaccine development related to viral genotype.
Sanchez, Daniel J; Reber, Paul J
2012-04-01
The memory system that supports implicit perceptual-motor sequence learning relies on brain regions that operate separately from the explicit, medial temporal lobe memory system. The implicit learning system therefore likely has distinct operating characteristics and information processing constraints. To attempt to identify the limits of the implicit sequence learning mechanism, participants performed the serial interception sequence learning (SISL) task with covertly embedded repeating sequences that were much longer than most previous studies: ranging from 30 to 60 (Experiment 1) and 60 to 90 (Experiment 2) items in length. Robust sequence-specific learning was observed for sequences up to 80 items in length, extending the known capacity of implicit sequence learning. In Experiment 3, 12-item repeating sequences were embedded among increasing amounts of irrelevant nonrepeating sequences (from 20 to 80% of training trials). Despite high levels of irrelevant trials, learning occurred across conditions. A comparison of learning rates across all three experiments found a surprising degree of constancy in the rate of learning regardless of sequence length or embedded noise. Sequence learning appears to be constant with the logarithm of the number of sequence repetitions practiced during training. The consistency in learning rate across experiments and conditions implies that the mechanisms supporting implicit sequence learning are not capacity-constrained by very long sequences nor adversely affected by high rates of irrelevant sequences during training.
DOE Office of Scientific and Technical Information (OSTI.GOV)
FitzGerald, Michael
2012-06-01
Michael FitzGerald on "A rapid whole genome sequencing and analysis system supporting genomic epidemiology" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.
Typing Clostridium difficile strains based on tandem repeat sequences
2009-01-01
Background: Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results: This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion: We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124
Nakano, Tadao; Okamoto, Munehiro; Ikeda, Yatsukaho; Hasegawa, Hideo
2006-12-01
Sequences of mitochondrial cytochrome c oxidase subunit 1 (CO1) gene, nuclear internal transcribed spacer 2 (ITS2) region of ribosomal DNA (rDNA), and 5S rDNA of Enterobius vermicularis from captive chimpanzees in five zoos/institutions in Japan were analyzed and compared with those of pinworm eggs from humans in Japan. Three major types of variants appearing in both CO1 and ITS2 sequences, but showing no apparent connection, were observed among materials collected from the chimpanzees. Each one of them was also observed in pinworms in humans. Sequences of 5S rDNA were identical in the materials from chimpanzees and humans. Phylogenetic analysis of CO1 gene revealed three clusters with high bootstrap value, suggesting considerable divergence, presumably correlated with human evolution, has occurred in the human pinworms. The synonymy of E. gregorii with E. vermicularis is supported by the molecular evidence.
Rekadwad, Bhagwan N; Gonzalez, Juan M
2017-08-01
A report on 16S rRNA gene sequence re-analysis and digitalization is presented using Lysinibacillus species (one example) deposited in National Microbial Repositories in India. Lysinibacillus species 16S rRNA gene sequences were digitalized to provide quick response (QR) codes, Chaos Game Representation (CGR) and Frequency of Chaos Game Representation (FCGR). GC percentage, phylogenetic analysis, and principal component analysis (PCA) were the tools used for the differentiation and reclassification of the strains under investigation. Seven reasons supporting our statement that Lysinibacillus species deposited in National Microbial Repositories are misclassified are given in this paper. Based on these seven reasons, bacteria deposited in National Microbial Repositories, such as Lysinibacillus and many others, need reanalysis to establish their exact identity. Levels of identity with type strains of related species show differences of 2 to 8%, suggesting that reclassification is needed to correctly assign species names to the analyzed Lysinibacillus strains available in National Microbial Repositories.
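Two of the quantities named in this abstract, GC percentage and the k-mer frequencies that underlie an FCGR, are simple to compute. The following is a minimal sketch, not the authors' pipeline: the function names are hypothetical, the choice of k = 2 is an assumption, and the step that arranges the counts on the CGR grid is omitted.

```python
from collections import Counter
from itertools import product


def gc_percentage(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / acgt


def fcgr_counts(seq: str, k: int = 2) -> dict:
    """Count every k-mer over A/C/G/T; k-mers with ambiguous bases are skipped.

    An FCGR places these counts on a 2^k x 2^k grid; only the counts
    are produced here.
    """
    seq = seq.upper()
    alphabet = set("ACGT")
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= alphabet
    )
    # Report all 4^k possible k-mers so that vectors from different
    # sequences line up, for example as input to PCA.
    return {"".join(p): counts.get("".join(p), 0)
            for p in product("ACGT", repeat=k)}
```

Returning a fixed-length vector for every sequence is a deliberate choice here: it lets profiles from different strains be compared position by position.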
On-Line Detection and Segmentation of Sports Motions Using a Wearable Sensor.
Kim, Woosuk; Kim, Myunggyu
2018-03-19
In sports motion analysis, observation is a prerequisite for understanding the quality of motions. This paper introduces a novel approach to detect and segment sports motions using a wearable sensor for supporting systematic observation. The main goal is, for convenient analysis, to automatically provide motion data, which are temporally classified according to the phase definition. For explicit segmentation, a motion model is defined as a sequence of sub-motions with boundary states. A sequence classifier based on deep neural networks is designed to detect sports motions from continuous sensor inputs. The evaluation on two types of motions (soccer kicking and two-handed ball throwing) verifies that the proposed method is successful for the accurate detection and segmentation of sports motions. By developing a sports motion analysis system using the motion model and the sequence classifier, we show that the proposed method is useful for observation of sports motions by automatically providing relevant motion data for analysis.
Lee, Sejoon; Lee, Soohyun; Ouellette, Scott; Park, Woong-Yang; Lee, Eunjung A; Park, Peter J
2017-06-20
In many next-generation sequencing (NGS) studies, multiple samples or data types are profiled for each individual. An important quality control (QC) step in these studies is to ensure that datasets from the same subject are properly paired. Given the heterogeneity of data types, file types and sequencing depths in a multi-dimensional study, a robust program that provides a standardized metric for genotype comparisons would be useful. Here, we describe NGSCheckMate, a user-friendly software package for verifying sample identities from FASTQ, BAM or VCF files. This tool uses a model-based method to compare allele read fractions at known single-nucleotide polymorphisms, considering depth-dependent behavior of similarity metrics for identical and unrelated samples. Our evaluation shows that NGSCheckMate is effective for a variety of data types, including exome sequencing, whole-genome sequencing, RNA-seq, ChIP-seq, targeted sequencing and single-cell whole-genome sequencing, with a minimal requirement for sequencing depth (>0.5X). An alignment-free module can be run directly on FASTQ files for a quick initial check. We recommend using this software as a QC step in NGS studies. https://github.com/parklab/NGSCheckMate. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
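The core idea of comparing allele read fractions at known SNPs can be illustrated with a plain correlation between two samples. This is a minimal sketch under stated assumptions, not NGSCheckMate's code: the function names, the SNP identifiers and the `min_depth` parameter are hypothetical, and the depth-dependent model described in the abstract is not reproduced.

```python
from math import sqrt


def allele_fractions(ref_alt_counts, min_depth=1):
    """Map SNP id -> fraction of reads carrying the alternate allele.

    ref_alt_counts maps a SNP id to a (ref_reads, alt_reads) pair.
    SNPs covered by fewer than min_depth reads are dropped.
    """
    fractions = {}
    for snp, (ref, alt) in ref_alt_counts.items():
        depth = ref + alt
        if depth >= min_depth:
            fractions[snp] = alt / depth
    return fractions


def vaf_correlation(sample_a, sample_b):
    """Pearson correlation of allele fractions over the SNPs both samples share.

    Returns None when fewer than two SNPs are shared or when either
    sample shows no variation, because the correlation is then undefined.
    """
    shared = sorted(set(sample_a) & set(sample_b))
    if len(shared) < 2:
        return None
    xs = [sample_a[s] for s in shared]
    ys = [sample_b[s] for s in shared]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return None
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sqrt(sxx * syy)
```

Two datasets from the same individual should give a correlation close to 1, while unrelated samples should score lower; where to place the cutoff depends on sequencing depth, which is the part this sketch leaves out.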
Applying phylogenetic analysis to viral livestock diseases: moving beyond molecular typing.
Olvera, Alex; Busquets, Núria; Cortey, Marti; de Deus, Nilsa; Ganges, Llilianne; Núñez, José Ignacio; Peralta, Bibiana; Toskano, Jennifer; Dolz, Roser
2010-05-01
Changes in livestock production systems in recent years have altered the presentation of many diseases resulting in the need for more sophisticated control measures. At the same time, new molecular assays have been developed to support the diagnosis of animal viral disease. Nucleotide sequences generated by these diagnostic techniques can be used in phylogenetic analysis to infer phenotypes by sequence homology and to perform molecular epidemiology studies. In this review, some key elements of phylogenetic analysis are highlighted, such as the selection of the appropriate neutral phylogenetic marker, the proper phylogenetic method and different techniques to test the reliability of the resulting tree. Examples are given of current and future applications of phylogenetic reconstructions in viral livestock diseases. Copyright 2009 Elsevier Ltd. All rights reserved.
First genome report on novel sequence types of Neisseria meningitidis: ST12777 and ST12778.
Veeraraghavan, Balaji; Lal, Binesh; Devanga Ragupathi, Naveen Kumar; Neeravi, Iyyan Raj; Jeyaraman, Ranjith; Varghese, Rosemol; Paul, Miracle Magdalene; Baskaran, Ashtawarthani; Ranjan, Ranjini
2018-03-01
Neisseria meningitidis is an important causative agent of meningitis and/or sepsis with high morbidity and mortality. Baseline genome data on N. meningitidis, especially from developing countries such as India, are lacking. This study aimed to investigate the whole genome sequences of N. meningitidis isolates from a tertiary care centre in India. Whole-genome sequencing was performed using an Ion Torrent™ Personal Genome Machine™ (PGM) with 400-bp chemistry. Data were assembled de novo using SPAdes Genome Assembler v.5.0.0.0. Sequence annotation was performed through PATRIC, RAST and the NCBI PGAAP server. Downstream analysis of the isolates was performed using the Center for Genomic Epidemiology databases for antimicrobial resistance genes and sequence types. Virulence factors and CRISPR were analysed using the PubMLST database and CRISPRFinder, respectively. This study reports the whole genome shotgun sequences of eight N. meningitidis isolates from bloodstream infections. The genome data revealed two novel sequence types (ST12777 and ST12778), along with ST11, ST437 and ST6928. The virulence profile of the isolates matched their sequence types. All isolates were negative for plasmid-mediated resistance genes. To the best of our knowledge, this is the first report of ST11 and ST437 N. meningitidis isolates in India along with two novel sequence types (ST12777 and ST12778). These results indicate that the sequence types circulating in India are diverse and require continuous monitoring. Further studies strengthening the genome data on N. meningitidis are required to understand the prevalence, spread, exact resistance and virulence mechanisms along with serotypes. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
T3SEdb: data warehousing of virulence effectors secreted by the bacterial Type III Secretion System.
Tay, Daniel Ming Ming; Govindarajan, Kunde Ramamoorthy; Khan, Asif M; Ong, Terenze Yao Rui; Samad, Hanif M; Soh, Wei Wei; Tong, Minyan; Zhang, Fan; Tan, Tin Wee
2010-10-15
Effectors of the Type III Secretion System (T3SS) play a pivotal role in establishing and maintaining pathogenicity in the host and therefore the identification of these effectors is important in understanding virulence. However, the effectors display a high level of sequence diversity, which makes their identification a difficult process. There is a need to collate and annotate existing effector sequences in public databases to enable systematic analyses of these sequences for development of models for screening and selection of putative novel effectors from bacterial genomes that can be validated by a smaller number of key experiments. Herein, we present T3SEdb http://effectors.bic.nus.edu.sg/T3SEdb, a specialized database of annotated T3SS effector (T3SE) sequences containing 1089 records from 46 bacterial species compiled from the literature and public protein databases. Procedures have been defined for i) comprehensive annotation of experimental status of effectors, ii) submission and curation review of records by users of the database, and iii) the regular update of existing and new T3SEdb records. Keyword fielded and sequence searches (BLAST, regular expression) are supported for both experimentally verified and hypothetical T3SEs. More than 171 clusters of T3SEs were detected based on sequence identity comparisons (intra-cluster difference up to ~60%). Owing to this high level of sequence diversity of T3SEs, the T3SEdb provides a large number of experimentally known effector sequences with wide species representation for creation of effector predictors. We created a reliable effector prediction tool, integrated into the database, to demonstrate the application of the database for such endeavours. T3SEdb is the first specialised database reported for T3SS effectors, enriched with manual annotations that facilitated systematic construction of a reliable prediction model for identification of novel effectors.
The T3SEdb represents a platform for inclusion of additional annotations of metadata for future developments of sophisticated effector prediction models for screening and selection of putative novel effectors from bacterial genomes/proteomes that can be validated by a small number of key experiments.
Identification of Novel Betaherpesviruses in Iberian Bats Reveals Parallel Evolution
Vázquez-Morón, Sonia; Aznar-López, Carolina; Ibáñez, Carlos; Garin, Inazio; Aihartza, Joxerra; Casas, Inmaculada; Tenorio, Antonio; Echevarría, Juan Emilio
2016-01-01
A thorough search for bat herpesviruses was carried out in oropharyngeal samples taken from most of the bat species present in the Iberian Peninsula from the Vespertilionidae, Miniopteridae, Molossidae and Rhinolophidae families, in addition to a colony of captive fruit bats from the Pteropodidae family. By using two degenerate consensus PCR methods targeting two conserved genes, distinct and previously unrecognized bat-hosted herpesviruses were identified for most of the tested species. Altogether, a total of 42 potentially novel bat herpesviruses were partially characterized. Thirty-two of them were tentatively assigned to the Betaherpesvirinae subfamily while the remaining 10 were allocated into the Gammaherpesvirinae subfamily. Significant diversity was observed among the novel sequences when compared with type herpesvirus species of the ICTV-approved genera. The inferred phylogenetic relationships showed that most of the betaherpesvirus sequences fell into a well-supported unique monophyletic clade and support the recognition of a new betaherpesvirus genus. This clade is subdivided into three major clades, corresponding to the families of bats studied. This supports the hypothesis of a species-specific parallel evolution process between the potentially new betaherpesviruses and their bat hosts. Interestingly, two of the betaherpesvirus sequences detected in rhinolophid bats clustered together apart from the rest, closely related to viruses that belong to the Roseolovirus genus. This suggests a putative third roseolo lineage. On the contrary, no phylogenetic structure was detected among several potentially novel bat-hosted gammaherpesviruses found in the study. Remarkably, all of the possible novel bat herpesviruses described in this study are linked to a unique bat species. PMID:28036408
Wacker, Michael A.
2010-01-01
Borehole geophysical logs were obtained from selected exploratory coreholes in the vicinity of the Florida Power and Light Company Turkey Point Power Plant. The geophysical logging tools used and logging sequences performed during this project are summarized herein to include borehole logging methods, descriptions of the properties measured, types of data obtained, and calibration information.
Storage and utilization of HLA genomic data--new approaches to HLA typing.
Helmberg, W
2000-01-01
Currently available DNA-based HLA typing assays can provide detailed information about sequence motifs of a tested sample. It is still a common practice, however, for information acquired by high-resolution sequence specific oligonucleotide probe (SSOP) typing or sequence specific priming (SSP) to be presented in a low-resolution serological format. Unfortunately, this representation can lead to significant loss of useful data in many cases. An alternative to assigning allele equivalents to such DNA typing results is simply to store the observed typing pattern and utilize the information with the help of Virtual DNA Analysis (VDA). Interpretation of the stored typing patterns can then be updated based on newly defined alleles, assuming the sequence motifs detected by the typing reagents are known. Rather than updating reagent specificities in individual laboratories, such updates should be performed in a central, publicly available sequence database. By referring to this database, HLA genomic data can then be stored and transferred between laboratories without loss of information. The 13th International Histocompatibility Workshop offers an ideal opportunity to begin building this common database for the entire human MHC.
Sheikh, Faruk G; Mukhopadhyay, Sudit S; Gupta, Prabhakar
2002-02-01
The PstI family of elements are short, highly repetitive DNA sequences interspersed throughout the genome of the Bovidae. We have cloned and sequenced some members of the PstI family from cattle, goat, and buffalo. These elements are approximately 500 bp, have a copy number of 2 × 10^5 to 4 × 10^5, and comprise about 4% of the haploid genome. Studies of nucleotide sequence homology indicate that the buffalo and goat PstI repeats (type II) are similar types of short interspersed nucleotide element (SINE) sequences, but the cattle PstI repeat (type I) is considerably more divergent. Additionally, the goat PstI sequence showed significant sequence homology with bovine serine tRNA, and is therefore likely derived from serine tRNA. Interestingly, Southern hybridization suggests that both types of SINEs (I and II) are present in all the species of Bovidae. Dendrogram analysis indicates that cattle PstI SINE is similar to bovine Alu-like SINEs. Goat and buffalo SINEs formed a separate cluster, suggesting that these two types of SINEs evolved separately in the genome of the Bovidae.
Takami, Hideto; Noguchi, Hideki; Takaki, Yoshihiro; Uchiyama, Ikuo; Toyoda, Atsushi; Nishi, Shinro; Chee, Gab-Joo; Arai, Wataru; Nunoura, Takuro; Itoh, Takehiko; Hattori, Masahira; Takai, Ken
2012-01-01
A nearly complete genome sequence of Candidatus ‘Acetothermum autotrophicum’, a presently uncultivated bacterium in candidate division OP1, was revealed by metagenomic analysis of a subsurface thermophilic microbial mat community. Phylogenetic analysis based on the concatenated sequences of proteins common among 367 prokaryotes suggests that Ca. ‘A. autotrophicum’ is one of the earliest diverging bacterial lineages. It possesses a folate-dependent Wood-Ljungdahl (acetyl-CoA) pathway of CO2 fixation, is predicted to have an acetogenic lifestyle, and possesses the newly discovered archaeal-autotrophic type of bifunctional fructose 1,6-bisphosphate aldolase/phosphatase. A phylogenetic analysis of the core gene cluster of the acetyl-CoA pathway, shared by acetogens, methanogens, some sulfur- and iron-reducers and dechlorinators, supports the hypothesis that the core gene cluster of Ca. ‘A. autotrophicum’ is a particularly ancient bacterial pathway. The habitat, physiology and phylogenetic position of Ca. ‘A. autotrophicum’ support the view that the first bacterial and archaeal lineages were H2-dependent acetogens and methanogens living in hydrothermal environments. PMID:22303444
Larsson, Ellen; Orstadius, Leif
2008-10-01
Psathyrella species growing on dung or occasionally on dung in the Nordic countries were studied using morphological characters and nu-rDNA sequence data and type collections were examined when available. Fourteen species capable of growing on dung were identified. Descriptions are given of all dung-inhabiting species and to a lesser extent of the species occasionally growing on dung. Three new species are described: Psathyrella fimiseda, P. merdicola, and P. scatophila. P. stercoraria is described as a new species in order to validate the name. A key to the coprophilous species in Europe including the species described by Peck & Smith from North America is provided. The phylogenetic analyses recovered four major supported clades within Psathyrellaceae corresponding to Parasola, Coprinopsis, Lacrymaria/Spadiceae pro parte, and Psathyrella. The status of Coprinellus was ambiguous. The current morphology-based infrageneric classification of Psathyrella was not supported by the phylogenetic analyses and a coprophilous habit has apparently evolved on multiple occasions. Three new combinations are proposed: Parasola conopilus, Coprinopsis marcescibilis, and Coprinopsis pannucioides.
Zhang, Jun-Rong; Guo, Xian-Guang; Liu, Jin-Long; Zhou, Tian-He; Gong, Xiong; Chen, Da-Li; Chen, Jian-Ping
2016-10-01
Leishmaniasis caused by Leishmania is still endemic in Northwest China. It has been thought that reptiles could be a reservoir for mammalian leishmaniasis. However, data are still scarce on natural infection of lizards with Leishmania spp. in China. The present study deals with detection, identification and phylogenetic inference of Leishmania parasites at species and intraspecies levels isolated from six desert lizard species from 10 geographical locations in Northwest China using amplification and sequencing of ITS-rDNA. In total, 83 haplotypes were found among 137 ITS1 sequences obtained from up to 64.6% of all captured lizards. Representative sequences of Leishmania available in GenBank were compiled for comparison with the obtained haplotypes. Tree-based species delimitation was achieved by using Bayesian phylogenetic analyses and a maximum parsimony approach. Phylogenetic trees congruently supported the assignment of the haplotypes to three Leishmania species including L. (sauroleishmania) sp., Leishmania tropica and Leishmania donovani complex. A network approach revealed paraphyletic populations of L. (sauroleishmania) sp. and L. tropica at intraspecies level regarding geographical origin and low host specificity. Chinese L. tropica from lizards showed significant heterogeneity as the obtained haplotypes were distributed in different clusters from other countries. Common ancestry was observed between some sequences of L. tropica from lizards and other sequence types from clinical samples from other countries. This may lend support to the potential reservoir role of lizards for human leishmaniasis. Our results appear to be the first molecular evidence for natural infection of lizards in Northwest China with reptilian Leishmania and mammalian Leishmania species. Desert lizards may be considered as putative reservoir hosts for Leishmania in China.
Further studies on persistence of the Leishmania parasites in lizards and sandflies are recommended for the better understanding of their epidemiological involvement. Copyright © 2016 Elsevier B.V. All rights reserved.
Cavanagh, Jorunn Pauline; Klingenberg, Claus; Hanssen, Anne-Merethe; Fredheim, Elizabeth Aarag; Francois, Patrice; Schrenzel, Jacques; Flægstad, Trond; Sollid, Johanna Ericson
2012-06-01
The notoriously multi-resistant Staphylococcus haemolyticus is an emerging pathogen causing serious infections in immunocompromised patients. Defining the population structure is important to detect outbreaks and spread of antimicrobial resistant clones. Currently, the standard typing technique is pulsed-field gel electrophoresis (PFGE). In this study we describe novel molecular typing schemes for S. haemolyticus using multi locus sequence typing (MLST) and multi locus variable number of tandem repeats (VNTR) analysis. Seven housekeeping genes (MLST) and five VNTR loci (MLVF) were selected for the novel typing schemes. A panel of 45 human and veterinary S. haemolyticus isolates was investigated. The collection had diverse PFGE patterns (38 PFGE types) and was sampled over a 20-year period from eight countries. MLST resolved 17 sequence types (Simpson's index of diversity [SID]=0.877) and MLVF resolved 14 repeat types (SID=0.831). We found low sequence diversity. Phylogenetic analysis clustered the isolates in three (MLST) and one (MLVF) clonal complexes, respectively. Taken together, neither the MLST nor the MLVF scheme was suitable to resolve the population structure of this S. haemolyticus collection. Future MLVF and MLST schemes will benefit from addition of more variable core genome sequences identified by comparing different fully sequenced S. haemolyticus genomes. Copyright © 2012 Elsevier B.V. All rights reserved.
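The SID values quoted above measure how likely two isolates drawn at random are to belong to different types. The abstract does not say which formula was used; the sketch below assumes the unbiased form common in typing studies, SID = 1 - sum(n_j(n_j - 1)) / (N(N - 1)), where n_j is the number of isolates of type j and N is the total. The function name and the toy type labels are hypothetical, and the per-type counts behind 0.877 and 0.831 are not given in the abstract, so they are not reproduced.

```python
from collections import Counter


def simpsons_index_of_diversity(type_assignments):
    """Probability that two isolates sampled without replacement differ in type.

    type_assignments is an iterable with one type label per isolate.
    """
    counts = Counter(type_assignments)
    total = sum(counts.values())
    if total < 2:
        raise ValueError("at least two isolates are required")
    same_type_pairs = sum(n * (n - 1) for n in counts.values())
    return 1.0 - same_type_pairs / (total * (total - 1))


# Toy example: four isolates, two of which share a type.
example = simpsons_index_of_diversity(["ST1", "ST1", "ST2", "ST3"])
```

The index is 0 when every isolate has the same type and 1 when every isolate has a distinct type, so values from different typing schemes applied to the same collection can be compared directly.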
de Vries, G E; Arfman, N; Terpstra, P; Dijkhuizen, L
1992-01-01
The gene (mdh) coding for methanol dehydrogenase (MDH) of the thermotolerant methylotroph Bacillus methanolicus C1 has been cloned and sequenced. The deduced amino acid sequence of the mdh gene exhibited similarity to those of five other alcohol dehydrogenase (type III) enzymes, which are distinct from the long-chain zinc-containing (type I) or short-chain zinc-lacking (type II) enzymes. Highly efficient expression of the mdh gene in Escherichia coli was probably driven from its own promoter sequence. After purification of MDH from E. coli, the kinetic and biochemical properties of the enzyme were investigated. The physiological effect of MDH synthesis in E. coli and the role of conserved sequence patterns in type III alcohol dehydrogenases have been analyzed and are discussed. PMID:1644761
de Villiers, Ethel-Michele; Sandstrom, Robert E; zur Hausen, Harald; Buck, Charles E
2005-01-01
Background Viruses including Epstein–Barr virus (EBV), a human equivalent of murine mammary tumour virus (MMTV) and human papillomavirus (HPV) have been implicated in the aetiology of human breast cancer. We report the presence of HPV DNA sequences in areolar tissue and tumour tissue samples from female patients with breast carcinoma. The presence of virus in the areolar–nipple complex suggests to us a potential pathogenic mechanism. Methods Polymerase chain reaction (PCR) was undertaken to amplify HPV types in areolar and tumour tissue from breast cancer cases. In situ hybridisation supported the PCR findings and localised the virus in nipple, areolar and tumour tissue. Results Papillomavirus DNA was present in 25 of 29 samples of breast carcinoma and in 20 of 29 samples from the corresponding mamilla. The most prevalent type in both carcinomas and nipples was HPV 11, followed by HPV 6. Other types detected were HPV 16, 23, 27 and 57 (nipples and carcinomas), HPV 20, 21, 32, 37, 38, 66 and GA3-1 (nipples only) and HPV 3, 15, 24, 87 and DL473 (carcinomas only). Multiple types were demonstrated in seven carcinomas and ten nipple samples. Conclusions The data demonstrate the occurrence of HPV in nipple and areolar tissues in patients with breast carcinoma. The authors postulate a retrograde ductular pattern of viral spread that may have pathogenic significance. PMID:15642157
Assessing the diversity of AM fungi in arid gypsophilous plant communities.
Alguacil, M M; Roldán, A; Torres, M P
2009-10-01
In the present study, we used PCR-Single-Stranded Conformation Polymorphism (SSCP) techniques to analyse arbuscular mycorrhizal fungi (AMF) communities in four sites within a 10 km² gypsum area in Southern Spain. Four common plant species from these ecosystems were selected. The AM fungal small-subunit (SSU) rRNA genes were subjected to PCR, cloning, SSCP analysis, sequencing and phylogenetic analyses. A total of 1443 SSU rRNA sequences were analysed, for 21 AM fungal types: 19 belonged to the genus Glomus, 1 to the genus Diversispora and 1 to the genus Scutellospora. Four sequence groups were identified, which showed high similarity to sequences of known glomalean species or isolates: Glo G18 to Glomus constrictum, Glo G1 to Glomus intraradices, Glo G16 to Glomus clarum, Scut to Scutellospora dipurpurescens and Div to one new genus in the family Diversisporaceae identified recently as Otospora bareai. There were three sequence groups that received strong support in the phylogenetic analysis, and did not seem to be related to any sequences of AM fungi in culture or previously found in the database; thus, they could be novel taxa within the genus Glomus: Glo G4, Glo G2 and Glo G14. We have detected the presence of both generalist and potential specialist AMF in gypsum ecosystems. The AMF communities differed among the plants studied, suggesting some degree of preference in the interactions between these symbionts.
IC 4663: The First Unambiguous [WN] Wolf-Rayet Central Star of a Planetary Nebula
NASA Astrophysics Data System (ADS)
Miszalski, B.; Crowther, P. A.; De Marco, O.; Köppen, J.; Moffat, A. F. J.; Acker, A.; Hillwig, T. C.
2013-01-01
Several [WC]-type central stars of planetary nebulae (PNe) are known to mimic the spectroscopic appearance of massive carbon-rich or WC-type Wolf-Rayet stars. In stark contrast, no [WN]-type central stars have yet been identified as clear-cut analogues of the common nitrogen-rich or WN-type Wolf-Rayet stars. We have identified the [WN3] central star of IC 4663 to be the first unambiguous example in PNe. The low luminosity nucleus and an asymptotic giant branch (AGB) halo surrounding the main nebula prove the bona-fide PN nature of IC 4663. Model atmosphere analysis reveals the [WN3] star to have an exotic chemical composition of helium (95%), hydrogen (<2%), nitrogen (0.8%), neon (0.2%) and oxygen (0.05%) by mass. Such an extreme helium-dominated composition cannot be predicted by current evolutionary scenarios for hydrogen deficient [WC]-type central stars. Only with the discovery of IC 4663 and its unusual composition can we now connect [WN] central stars to the O(He) central stars in a second H-deficient and He-rich evolutionary sequence, [WN]→O(He), that exists in parallel to the carbon-rich [WC]→PG1159 sequence. This suggests a simpler mechanism, perhaps a binary merger, can better explain H-deficiency in PNe and potentially other H-deficient/He-rich stars. In this respect IC 4663 is the best supported case for a possible merged binary central star of a PN.
Giuffrè, Mario; Amodio, Emanuele; Bonura, Celestino; Geraci, Daniela M; Saporito, Laura; Ortolano, Rita; Corsello, Giovanni; Mammina, Caterina
2015-05-01
To describe epidemiologic features and identify risk factors for methicillin-resistant Staphylococcus aureus (MRSA) acquisition in a level III neonatal intensive care unit (NICU). A prospective, cohort study in a university-affiliated NICU with an infection control program including weekly nasal cultures of all neonates. Demographic, clinical, and microbiologic data were prospectively collected between June 2009 and June 2013. Molecular characterization of MRSA isolates was done by multilocus variable number tandem repeat fingerprinting, staphylococcal cassette chromosome mec typing, and on representative isolates by multilocus sequence typing and spa typing. Of 949 neonates, 217 (22.87%) had a culture growing MRSA, including 117 neonates testing positive at their first sampling. Of these latter infants, 96 (82.05%) were inborn and 59 (50.43%) had been transferred from the nursery. Length of stay and colonization pressure were strong independent predictors of MRSA acquisition. Among MRSA isolates, 7 sequence types were identified, with ST22-IVa, spa type t223, being the predominant strain. In an endemic area, early MRSA acquisition and high colonization pressure, likely related to an influx of colonized infants from a well-infant nursery, can support persistence of MRSA in NICUs. Surveillance, molecular tracking of strains, and reinforcement of infection control practices, involving well-infant nurseries in a comprehensive infection control program, could be helpful in containing MRSA transmission. Copyright © 2015 Association for Professionals in Infection Control and Epidemiology, Inc. Published by Elsevier Inc. All rights reserved.
Spike-Based Bayesian-Hebbian Learning of Temporal Sequences
Lindén, Henrik; Lansner, Anders
2016-01-01
Many cognitive and motor functions are enabled by the temporal representation and processing of stimuli, but it remains an open issue how neocortical microcircuits can reliably encode and replay such sequences of information. To better understand this, a modular attractor memory network is proposed in which meta-stable sequential attractor transitions are learned through changes to synaptic weights and intrinsic excitabilities via the spike-based Bayesian Confidence Propagation Neural Network (BCPNN) learning rule. We find that the formation of distributed memories, embodied by increased periods of firing in pools of excitatory neurons, together with asymmetrical associations between these distinct network states, can be acquired through plasticity. The model’s feasibility is demonstrated using simulations of adaptive exponential integrate-and-fire model neurons (AdEx). We show that the learning and speed of sequence replay depends on a confluence of biophysically relevant parameters including stimulus duration, level of background noise, ratio of synaptic currents, and strengths of short-term depression and adaptation. Moreover, sequence elements are shown to flexibly participate multiple times in the sequence, suggesting that spiking attractor networks of this type can support an efficient combinatorial code. The model provides a principled approach towards understanding how multiple interacting plasticity mechanisms can coordinate hetero-associative learning in unison. PMID:27213810
Origins of domestication and polyploidy in oca (Oxalis tuberosa : Oxalidaceae): nrDNA ITS data.
Emshwiller, E; Doyle, J
1998-07-01
As part of a study aimed at elucidating the origins of the octoploid tuber crop "oca," Oxalis tuberosa, DNA sequences of the internal transcribed spacer of nuclear ribosomal DNA (nrDNA ITS) were determined for oca and several wild Oxalis species, mostly from Bolivia. Phylogenetic analysis of these data supports a group of these species as being close relatives of oca, in agreement with morphology and cytology, but at odds with traditional infrageneric taxonomy. Variation in ITS sequences within this group is quite low (0-7 substitutions in the entire ITS region), contrasting with the highly divergent (unalignable in some cases) sequences within the genus overall. Some groups of morphologically differentiated species were found to have identical sequences, notably a group that includes oca, wild populations of Oxalis that bear small tubers, and several other clearly distinct species. The presence of a second, minor sequence type in at least some oca accessions suggests a possible contribution from a second genome donor, also from within this same species group. ITS data lack sufficient variation to elucidate the origins of oca precisely, but have identified a pool of candidate species and so can be used as a tool to screen yet unsampled species for possible progenitors.
Peng, Duo; Gu, Xi; Xue, Liang-Jiao; Leebens-Mack, James H.; Tsai, Chung-Jui
2014-01-01
Sucrose transporters (SUTs) are essential for the export and efficient movement of sucrose from source leaves to sink organs in plants. The angiosperm SUT family was previously classified into three or four distinct groups, Types I, II (subgroup IIB), and III, with dicot-specific Type I and monocot-specific Type IIB functioning in phloem loading. To shed light on the underlying drivers of SUT evolution, Bayesian phylogenetic inference was undertaken using 41 sequenced plant genomes, including seven basal lineages at key evolutionary junctures. Our analysis supports four phylogenetically and structurally distinct SUT subfamilies, originating from two ancient groups (AG1 and AG2) that diverged early during terrestrial colonization. In both AG1 and AG2, multiple intron acquisition events in the progenitor vascular plant established the gene structures of modern SUTs. Tonoplastic Type III and plasmalemmal Type II represent evolutionarily conserved descendants of AG1 and AG2, respectively. Type I and Type IIB were previously thought to evolve after the dicot-monocot split. We show, however, that divergence of Type I from Type III SUT predated basal angiosperms, likely associated with evolution of vascular cambium and phloem transport. Type I SUT was subsequently lost in monocots along with vascular cambium, and independent evolution of Type IIB coincided with modified monocot vasculature. Both Type I and Type IIB underwent lineage-specific expansion. In multiple unrelated taxa, the newly-derived SUTs exhibit biased expression in reproductive tissues, suggesting a functional link between phloem loading and reproductive fitness. Convergent evolution of Type I and Type IIB for SUT function in phloem loading and reproductive organs supports the idea that differential vascular development in dicots and monocots is a strong driver for SUT family evolution in angiosperms. PMID:25429293
Ye, Weixing; Zhu, Lei; Liu, Yingying; Crickmore, Neil; Peng, Donghai; Ruan, Lifang; Sun, Ming
2012-07-01
We have designed a high-throughput system for the identification of novel crystal protein genes (cry) from Bacillus thuringiensis strains. The system was developed with two goals: (i) to acquire the mixed plasmid-enriched genomic sequence of B. thuringiensis using next-generation sequencing biotechnology, and (ii) to identify cry genes with a computational pipeline (using BtToxin_scanner). In our pipeline method, we employed three different kinds of well-developed prediction methods, BLAST, hidden Markov model (HMM), and support vector machine (SVM), to predict the presence of Cry toxin genes. The pipeline proved to be fast (average speed, 1.02 Mb/min for proteins and open reading frames [ORFs] and 1.80 Mb/min for nucleotide sequences), sensitive (it detected 40% more protein toxin genes than a keyword extraction method using genomic sequences downloaded from GenBank), and highly specific. Twenty-one strains from our laboratory's collection were selected based on their plasmid pattern and/or crystal morphology. The plasmid-enriched genomic DNA was extracted from these strains and mixed for Illumina sequencing. The sequencing data were de novo assembled, and a total of 113 candidate cry sequences were identified using the computational pipeline. Twenty-seven candidate sequences were selected on the basis of their low level of sequence identity to known cry genes, and eight full-length genes were obtained with PCR. Finally, three new cry-type genes (primary ranks) and five cry holotypes, which were designated cry8Ac1, cry7Ha1, cry21Ca1, cry32Fa1, and cry21Da1 by the B. thuringiensis Toxin Nomenclature Committee, were identified. The system described here is both efficient and cost-effective and can greatly accelerate the discovery of novel cry genes.
TIR-NBS-LRR genes are rare in monocots: evidence from diverse monocot orders
Tarr, D Ellen K; Alexander, Helen M
2009-01-01
Background Plant resistance (R) gene products recognize pathogen effector molecules. Many R genes code for proteins containing nucleotide binding site (NBS) and C-terminal leucine-rich repeat (LRR) domains. NBS-LRR proteins can be divided into two groups, TIR-NBS-LRR and non-TIR-NBS-LRR, based on the structure of the N-terminal domain. Although both classes are clearly present in gymnosperms and eudicots, only non-TIR sequences have been found consistently in monocots. Since most studies in monocots have been limited to agriculturally important grasses, it is difficult to draw conclusions. The purpose of our study was to look for evidence of these sequences in additional monocot orders. Findings Using degenerate PCR, we amplified NBS sequences from four monocot species (C. blanda, D. marginata, S. trifasciata, and Spathiphyllum sp.), a gymnosperm (C. revoluta) and a eudicot (C. canephora). We successfully amplified TIR-NBS-LRR sequences from dicot and gymnosperm DNA, but not from monocot DNA. Using databases, we obtained NBS sequences from additional monocots, magnoliids and basal angiosperms. TIR-type sequences were not present in monocot or magnoliid sequences, but were present in the basal angiosperms. Phylogenetic analysis supported a single TIR clade and multiple non-TIR clades. Conclusion We were unable to find monocot TIR-NBS-LRR sequences by PCR amplification or database searches. In contrast to previous studies, our results represent five monocot orders (Poales, Zingiberales, Arecales, Asparagales, and Alismatales). Our results establish the presence of TIR-NBS-LRR sequences in basal angiosperms and suggest that although these sequences were present in early land plants, they have been reduced significantly in monocots and magnoliids. PMID:19785756
Kanduma, Esther G; Mwacharo, Joram M; Githaka, Naftaly W; Kinyanjui, Peter W; Njuguna, Joyce N; Kamau, Lucy M; Kariuki, Edward; Mwaura, Stephen; Skilton, Robert A; Bishop, Richard P
2016-06-22
The ixodid tick Rhipicephalus appendiculatus transmits the apicomplexan protozoan parasite Theileria parva, which causes East Coast fever (ECF), the most economically important cattle disease in eastern and southern Africa. Recent analysis of micro- and minisatellite markers showed an absence of geographical and host-associated genetic sub-structuring amongst field populations of R. appendiculatus in Kenya. To assess further the phylogenetic relationships between field and laboratory R. appendiculatus tick isolates, this study examined sequence variations at two mitochondrial genes, cytochrome c oxidase subunit I (COI) and 12S ribosomal RNA (rRNA), and the nuclear encoded ribosomal internal transcribed spacer 2 (ITS2) of the rRNA gene, respectively. The analysis of 332 COI sequences revealed 30 polymorphic sites, which defined 28 haplotypes that were separated into two distinct haplogroups (A and B). Inclusion of previously published haplotypes in our analysis revealed a high degree of phylogenetic complexity never reported before in haplogroup A. Neither haplogroup, however, showed any clustering pattern related to either the geographical sampling location, the type of tick sampled (laboratory stocks vs field populations) or the mammalian host species. This finding was supported by the results obtained from the analysis of 12S rDNA sequences. Analysis of molecular variance (AMOVA) indicated that 90.8% of the total genetic variation was explained by the two haplogroups, providing further support for their genetic divergence. These results were, however, not replicated by the nuclear transcribed ITS2 sequences, likely because of recombination between the nuclear genomes maintaining a high level of genetic sequence conservation. COI and 12S rDNA are better markers than ITS2 for studying intraspecific diversity. Based on these genes, two major genetic groups of R. appendiculatus that have gone through a demographic expansion exist in Kenya. The two groups show no phylogeographic structure or correlation with the type of host species from which the ticks were collected, nor to the evolutionary and breeding history of the species. The two lineages may have a wide geographic distribution range in eastern and southern Africa. The findings of this study may have implications for the spread and control of R. appendiculatus, and indirectly, on the transmission dynamics of ECF.
Winterton, Shaun L; Wiegmann, Brian M; Schlinger, Evert I
2007-06-01
The first formal analysis of phylogenetic relationships among small-headed flies (Acroceridae) is presented based on DNA sequence data from two ribosomal (16S and 28S) and two protein-encoding genes: carbamoylphosphate synthase (CPS) domain of CAD (i.e., rudimentary locus) and cytochrome oxidase I (COI). DNA sequences from 40 species in 22 genera of Acroceridae (representing all three subfamilies) were compared with outgroup exemplars from Nemestrinidae, Stratiomyidae, Tabanidae, and Xylophagidae. Parsimony and Bayesian simultaneous analyses of the full data set recover a well-resolved and strongly supported hypothesis of phylogenetic relationships for major lineages within the family. Molecular evidence supports the monophyly of traditionally recognised subfamilies Philopotinae and Panopinae, but Acrocerinae are polyphyletic. Panopinae, sometimes considered "primitive" based on morphology and host-use, are always placed in a more derived position in the current study. Furthermore, these data support emerging morphological evidence that the type genus Acrocera Meigen, and its sister genus Sphaerops, are atypical acrocerids, comprising a sister lineage to all other Acroceridae. Based on the phylogeny generated in the simultaneous analysis, historical divergence times were estimated using Bayesian methodology constrained with fossil data. These estimates indicate Acroceridae likely evolved during the late Triassic but did not diversify greatly until the Cretaceous.
Susanti, Dwi; Johnson, Eric F; Lapidus, Alla; Han, James; Reddy, T B K; Pilay, Manoj; Ivanova, Natalia N; Markowitz, Victor M; Woyke, Tanja; Kyrpides, Nikos C; Mukhopadhyay, Biswarup
2016-01-01
This report presents the permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, an obligate anaerobic hyperthermophilic crenarchaeon that was isolated from acidic hot springs in Hveravellir, Iceland. D. mobilis utilizes peptides as carbon and energy sources and reduces elemental sulfur to H2S. A metabolic construction derived from the draft genome identified putative pathways for peptide degradation and sulfur respiration in this archaeon. Existence of several hydrogenase genes in the genome supported previous findings that H2 is produced during the growth of D. mobilis in the absence of sulfur. Interestingly, genes encoding glucose transport and utilization systems also exist in the D. mobilis genome though this archaeon does not utilize carbohydrate for growth. The draft genome of D. mobilis provides an additional means for comparative genomic analysis of desulfurococci. In addition, our analysis of the Average Nucleotide Identity between D. mobilis and Desulfurococcus mucosus suggested that these two desulfurococci are two different strains of the same species.
XML schemas for common bioinformatic data types and their application in workflow systems
Seibel, Philipp N; Krüger, Jan; Hartmeier, Sven; Schwarzer, Knut; Löwenthal, Kai; Mersch, Henning; Dandekar, Thomas; Giegerich, Robert
2006-01-01
Background Today, there is a growing need in bioinformatics to combine available software tools into chains, thus building complex applications from existing single-task tools. To create such workflows, the tools involved have to be able to work with each other's data – therefore, a common set of well-defined data formats is needed. Unfortunately, current bioinformatic tools use a great variety of heterogeneous formats. Results Acknowledging the need for common formats, the Helmholtz Open BioInformatics Technology network (HOBIT) identified several basic data types used in bioinformatics and developed appropriate format descriptions, formally defined by XML schemas, and incorporated them in a Java library (BioDOM). These schemas currently cover sequence, sequence alignment, RNA secondary structure and RNA secondary structure alignment formats in a form that is independent of any specific program, thus enabling seamless interoperation of different tools. All XML formats are available at , the BioDOM library can be obtained at . Conclusion The HOBIT XML schemas and the BioDOM library simplify adding XML support to newly created and existing bioinformatic tools, enabling these tools to interoperate seamlessly in workflow scenarios. PMID:17087823
DOE Office of Scientific and Technical Information (OSTI.GOV)
Susanti, Dwi; Johnson, Eric F.; Lapidus, Alla
Our report presents the permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, an obligate anaerobic hyperthermophilic crenarchaeon that was isolated from acidic hot springs in Hveravellir, Iceland. D. mobilis utilizes peptides as carbon and energy sources and reduces elemental sulfur to H2S. A metabolic construction derived from the draft genome identified putative pathways for peptide degradation and sulfur respiration in this archaeon. Existence of several hydrogenase genes in the genome supported previous findings that H2 is produced during the growth of D. mobilis in the absence of sulfur. Interestingly, genes encoding glucose transport and utilization systems also exist in the D. mobilis genome though this archaeon does not utilize carbohydrate for growth. The draft genome of D. mobilis provides an additional means for comparative genomic analysis of desulfurococci. In addition, our analysis of the Average Nucleotide Identity between D. mobilis and Desulfurococcus mucosus suggested that these two desulfurococci are two different strains of the same species.
Susanti, Dwi; Johnson, Eric F.; Lapidus, Alla; ...
2016-01-13
Our report presents the permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, an obligate anaerobic hyperthermophilic crenarchaeon that was isolated from acidic hot springs in Hveravellir, Iceland. D. mobilis utilizes peptides as carbon and energy sources and reduces elemental sulfur to H2S. A metabolic construction derived from the draft genome identified putative pathways for peptide degradation and sulfur respiration in this archaeon. Existence of several hydrogenase genes in the genome supported previous findings that H2 is produced during the growth of D. mobilis in the absence of sulfur. Interestingly, genes encoding glucose transport and utilization systems also exist in the D. mobilis genome though this archaeon does not utilize carbohydrate for growth. The draft genome of D. mobilis provides an additional means for comparative genomic analysis of desulfurococci. In addition, our analysis of the Average Nucleotide Identity between D. mobilis and Desulfurococcus mucosus suggested that these two desulfurococci are two different strains of the same species.
Pattaradilokrat, Sittiporn; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Siripoon, Napaporn; Harnyuttanakorn, Pongchai
2016-10-21
An effective malaria vaccine is an urgently needed tool to fight against human malaria, the most deadly parasitic disease of humans. One promising candidate is the merozoite surface protein-3 (MSP-3) of Plasmodium falciparum. This antigenic protein, encoded by the merozoite surface protein (msp-3) gene, is polymorphic and classified according to size into the two allelic types of K1 and 3D7. A recent study revealed that both the K1 and 3D7 alleles co-circulated within P. falciparum populations in Thailand, but the extent of the sequence diversity and variation within each allelic type remains largely unknown. The msp-3 gene was sequenced from 59 P. falciparum samples collected from five endemic areas (Mae Hong Son, Kanchanaburi, Ranong, Trat and Ubon Ratchathani) in Thailand and analysed for nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity. The gene was also subject to population genetic analysis (Fst) and neutrality tests (Tajima's D, and Fu and Li's D* and F* tests) to determine any signature of selection. The sequence analyses revealed eight unique DNA haplotypes and seven amino acid sequence variants, with a haplotype and nucleotide diversity of 0.828 and 0.049, respectively. Neutrality tests indicated that the polymorphism detected in the alanine heptad repeat region of MSP-3 was maintained by positive diversifying selection, suggesting its role as a potential target of protective immune responses and supporting its role as a vaccine candidate. Comparison of MSP-3 variants among parasite populations in Thailand, India and Nigeria also inferred a close genetic relationship between P. falciparum populations in Asia. This study revealed the extent of the msp-3 gene diversity in P. falciparum in Thailand, providing the fundamental basis for the better design of future blood stage malaria vaccines against P. falciparum.
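The nucleotide diversity reported in the abstract above is conventionally defined as the average number of pairwise differences per site among aligned sequences. The sketch below illustrates that definition only. It assumes equal-length, gap-free aligned sequences and counts raw mismatches with no substitution-model correction; the abstract does not say how its value was computed. The function name and the toy sequences are hypothetical.

```python
from itertools import combinations


def nucleotide_diversity(aligned_seqs):
    """Average pairwise differences per site (uncorrected pi).

    aligned_seqs: list of equal-length strings from one alignment.
    Assumes no gaps or ambiguity codes (a simplification).
    """
    if len(aligned_seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(aligned_seqs[0])
    if length == 0 or any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must be non-empty and equal length")
    pairs = list(combinations(aligned_seqs, 2))
    # Count mismatching positions for every unordered pair.
    total_diffs = sum(
        sum(a != b for a, b in zip(s, t)) for s, t in pairs
    )
    return total_diffs / (len(pairs) * length)


# Toy alignment, not data from the study.
toy = ["AAAA", "AAAT", "AATT"]
print(round(nucleotide_diversity(toy), 3))  # 0.333
```

In the toy alignment the three pairs differ at 1, 2 and 1 positions, so the mean is 4/3 differences over 4 sites, or about 0.333 per site.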
Fukumori, F; Saint, C P
1997-01-01
A 9,233-bp HindIII fragment of the aromatic amine catabolic plasmid pTDN1, isolated from a derivative of Pseudomonas putida mt-2 (UCC22), confers the ability to degrade aniline on P. putida KT2442. The fragment encodes six open reading frames which are arranged in the same direction. Their 5' upstream region is part of the direct-repeat sequence of pTDN1. Nucleotide sequence of 1.8 kb of the repeat sequence revealed only a single base pair change compared to the known sequence of IS1071 which is involved in the transposition of the chlorobenzoate genes (C. Nakatsu, J. Ng, R. Singh, N. Straus, and C. Wyndham, Proc. Natl. Acad. Sci. USA 88:8312-8316, 1991). Four open reading frames encode proteins with considerable homology to proteins found in other aromatic-compound degradation pathways. On the basis of sequence similarity, these genes are proposed to encode the large and small subunits of aniline oxygenase (tdnA1 and tdnA2, respectively), a reductase (tdnB), and a LysR-type regulatory gene (tdnR). The putative large subunit has a conserved [2Fe-2S]R Rieske-type ligand center. Two genes, tdnQ and tdnT, which may be involved in amino group transfer, are localized upstream of the putative oxygenase genes. The tdnQ gene product shares about 30% similarity with glutamine synthetases; however, a pUC-based plasmid carrying tdnQ did not support the growth of an Escherichia coli glnA strain in the absence of glutamine. TdnT possesses domains that are conserved among amidotransferases. The tdnQ, tdnA1, tdnA2, tdnB, and tdnR genes are essential for the conversion of aniline to catechol. PMID:8990291
Raven, Kathy E; Reuter, Sandra; Reynolds, Rosy; Brodrick, Hayley J; Russell, Julie E; Török, M Estée; Parkhill, Julian; Peacock, Sharon J
2016-10-01
Vancomycin-resistant Enterococcus faecium (VREfm) is an important cause of healthcare-associated infections worldwide. We undertook whole-genome sequencing (WGS) of 495 E. faecium bloodstream isolates from 2001-2011 in the United Kingdom and Ireland (UK&I) and 11 E. faecium isolates from a reference collection. Comparison between WGS and multilocus sequence typing (MLST) identified major discrepancies for 17% of isolates, with multiple instances of the same sequence type (ST) being located in genetically distant positions in the WGS tree. This confirms that WGS is superior to MLST for evolutionary analyses and is more accurate than current typing methods used during outbreak investigations. E. faecium has been categorized as belonging to three clades (Clades A1, hospital-associated; A2, animal-associated; and B, community-associated). Phylogenetic analysis of our isolates replicated the distinction between Clade A (97% of isolates) and Clade B but did not support the subdivision of Clade A into Clade A1 and A2. Phylogeographic analyses revealed that Clade A had been introduced multiple times into each hospital referral network or country, indicating frequent movement of E. faecium between regions that rarely share hospital patients. Numerous genetic clusters contained highly related vanA-positive and -negative E. faecium, which implies that control of vancomycin-resistant enterococci (VRE) in hospitals also requires consideration of vancomycin-susceptible E. faecium. Our findings reveal the evolution and dissemination of hospital-associated E. faecium in the UK&I and provide evidence for WGS as an instrument for infection control. © 2016 Raven et al.; Published by Cold Spring Harbor Laboratory Press.
Raven, Kathy E.; Reuter, Sandra; Reynolds, Rosy; Brodrick, Hayley J.; Russell, Julie E.; Török, M. Estée; Parkhill, Julian; Peacock, Sharon J.
2016-01-01
Vancomycin-resistant Enterococcus faecium (VREfm) is an important cause of healthcare-associated infections worldwide. We undertook whole-genome sequencing (WGS) of 495 E. faecium bloodstream isolates from 2001–2011 in the United Kingdom and Ireland (UK&I) and 11 E. faecium isolates from a reference collection. Comparison between WGS and multilocus sequence typing (MLST) identified major discrepancies for 17% of isolates, with multiple instances of the same sequence type (ST) being located in genetically distant positions in the WGS tree. This confirms that WGS is superior to MLST for evolutionary analyses and is more accurate than current typing methods used during outbreak investigations. E. faecium has been categorized as belonging to three clades (Clades A1, hospital-associated; A2, animal-associated; and B, community-associated). Phylogenetic analysis of our isolates replicated the distinction between Clade A (97% of isolates) and Clade B but did not support the subdivision of Clade A into Clade A1 and A2. Phylogeographic analyses revealed that Clade A had been introduced multiple times into each hospital referral network or country, indicating frequent movement of E. faecium between regions that rarely share hospital patients. Numerous genetic clusters contained highly related vanA-positive and -negative E. faecium, which implies that control of vancomycin-resistant enterococci (VRE) in hospitals also requires consideration of vancomycin-susceptible E. faecium. Our findings reveal the evolution and dissemination of hospital-associated E. faecium in the UK&I and provide evidence for WGS as an instrument for infection control. PMID:27527616
Enterovirus Migration Patterns between France and Tunisia
Othman, Ines; Mirand, Audrey; Slama, Ichrak; Mastouri, Maha; Peigue-Lafeuille, Hélène; Aouni, Mahjoub; Bailly, Jean-Luc
2015-01-01
The enterovirus (EV) types echovirus (E-) 5, E-9, and E-18, and coxsackievirus (CV-) A9 are infrequently reported in human diseases and their epidemiologic features are poorly defined. Virus transmission patterns between countries have been estimated with phylogenetic data derived from the 1D/VP1 and 3CD gene sequences of a sample of 74 strains obtained in France (2000–2012) and Tunisia (2011–2013) and from the publicly available sequences. The EV types (E-5, E-9, and E-18) exhibited a lower worldwide genetic diversity (respective number of genogroups: 4, 5, and 3) in comparison to CV-A9 (n = 10). The phylogenetic trees estimated with both 1D/VP1 and 3CD sequence data showed variations in the number of co-circulating lineages over the last 20 years among the four EV types. Despite the low number of genogroups in E-18, the virus exhibited the highest number of recombinant 3CD lineages (n = 10) versus 4 (E-5) to 8 (E-9). The phylogenies provided evidence of multiple transportation events between France and Tunisia involving E-5, E-9, E-18, and CV-A9 strains. Virus spread events between France and 17 other countries in five continents had high probabilities of occurrence, as did those between Tunisia and two European countries other than France. All transportation events were supported by BF values > 10. Inferring the source of virus transmission from phylogenetic data may provide insights into the patterns of sporadic and epidemic diseases caused by EVs. PMID:26709514
Archaebacterial phylogeny: perspectives on the urkingdoms
NASA Technical Reports Server (NTRS)
Woese, C. R.; Olsen, G. J.
1986-01-01
Comparisons of complete 16S ribosomal RNA sequences have been used to confirm, refine and extend earlier concepts of archaebacterial phylogeny. The archaebacteria fall naturally into two major branches or divisions, I--the sulfur-dependent thermophilic archaebacteria, and II--the methanogenic archaebacteria and their relatives. Division I comprises a relatively closely related and phenotypically homogeneous collection of thermophilic sulfur-dependent species--encompassing the genera Sulfolobus, Thermoproteus, Pyrodictium and Desulfurococcus. The organisms of Division II, however, form a less compact grouping phylogenetically, and are also more diverse in phenotype. All three of the (major) methanogen groups are found in Division II, as are the extreme halophiles and two types of thermoacidophiles, Thermoplasma acidophilum and Thermococcus celer. This last species branches sufficiently deeply in the Division II line that it might be considered to represent a separate, third Division. However, both the extreme halophiles and Tp. acidophilum branch within the cluster of methanogens. The extreme halophiles are specifically related to the Methanomicrobiales, to the exclusion of both the Methanococcales and the Methanobacteriales. Tp. acidophilum is peripherally related to the halophile-Methanomicrobiales group. By 16S rRNA sequence measure the archaebacteria constitute a phylogenetically coherent grouping (clade), which excludes both the eubacteria and the eukaryotes--a conclusion that is supported by other sequence evidence as well. Alternative proposals for archaebacterial phylogeny, not based upon sequence evidence, are discussed and evaluated. In particular, proposals to rename (reclassify) various subgroups of the archaebacteria as new kingdoms are found wanting, for both their lack of proper experimental support and the taxonomic confusion they introduce.
Whole exome or genome sequencing: nurses need to prepare families for the possibilities.
Prows, Cynthia A; Tran, Grace; Blosser, Beverly
2014-12-01
A discussion of whole exome sequencing and the type of possible results patients and families should be aware of before samples are obtained. To find the genetic cause of a rare disorder, whole exome sequencing analyses all known and suspected human genes from a single sample. Over 20,000 detected DNA variants in each individual exome must be considered as possibly causing disease or disregarded as not relevant to the person's disease. In the process, unexpected gene variants associated with known diseases unrelated to the primary purpose of the test may be incidentally discovered. Because family members' DNA samples are often needed, gene variants associated with known genetic diseases or predispositions for diseases can also be discovered in their samples. Discussion paper. PubMed 2009-2013, list of references in retrieved articles, Google Scholar. Nurses need a general understanding of the scope of potential genomic information that may be revealed with whole exome sequencing to provide support and guidance to individuals and families during their decision-making process, while waiting for results and after disclosure. Nurse scientists who want to use whole exome sequencing in their study design and methods must decide early in study development if they will return primary whole exome sequencing research results and if they will give research participants choices about learning incidental research results. It is critical that nurses translate their knowledge about whole exome sequencing into their patient education and patient advocacy roles and relevant programmes of research. © 2014 John Wiley & Sons Ltd.
den Bakker, Henk C; Warchocki, Steven; Wright, Emily M; Allred, Adam F; Ahlstrom, Christina; Manuel, Clyde S; Stasiewicz, Matthew J; Burrell, Angela; Roof, Sherry; Strawn, Laura K; Fortes, Esther; Nightingale, Kendra K; Kephart, Daniel; Wiedmann, Martin
2014-06-01
Sampling of agricultural and natural environments in two US states (Colorado and Florida) yielded 18 Listeria-like isolates that could not be assigned to previously described species using traditional methods. Using whole-genome sequencing and traditional phenotypic methods, we identified five novel species, each with a genome-wide average BLAST nucleotide identity (ANIb) of less than 85% to currently described species. Phylogenetic analysis based on 16S rRNA gene sequences and amino acid sequences of 31 conserved loci showed the existence of four well-supported clades within the genus Listeria; (i) a clade representing Listeria monocytogenes, L. marthii, L. innocua, L. welshimeri, L. seeligeri and L. ivanovii, which we refer to as Listeria sensu stricto, (ii) a clade consisting of Listeria fleischmannii and two newly described species, Listeria aquatica sp. nov. (type strain FSL S10-1188(T) = DSM 26686(T) = LMG 28120(T) = BEI NR-42633(T)) and Listeria floridensis sp. nov. (type strain FSL S10-1187(T) = DSM 26687(T) = LMG 28121(T) = BEI NR-42632(T)), (iii) a clade consisting of Listeria rocourtiae, L. weihenstephanensis and three novel species, Listeria cornellensis sp. nov. (type strain TTU A1-0210(T) = FSL F6-0969(T) = DSM 26689(T) = LMG 28123(T) = BEI NR-42630(T)), Listeria grandensis sp. nov. (type strain TTU A1-0212(T) = FSL F6-0971(T) = DSM 26688(T) = LMG 28122(T) = BEI NR-42631(T)) and Listeria riparia sp. nov. (type strain FSL S10-1204(T) = DSM 26685(T) = LMG 28119(T) = BEI NR-42634(T)) and (iv) a clade containing Listeria grayi. Genomic and phenotypic data suggest that the novel species are non-pathogenic. © 2014 IUMS.
Williams, E. J.; Lewis, J.; John, T.; Hoe, J. C.; Yu, L.; Dongol, S.; Kelly, D. F.; Griffiths, D. T.; Shah, A.; Limbu, B.; Pradhan, R.; Mawas, F.; Shrestha, S.; Thorson, S.; Werno, A. M.; Murdoch, D. R.; Adhikari, N.; Pollard, A. J.
2011-01-01
Haemophilus influenzae type b (Hib) is a major cause of invasive bacterial infection in children that can be prevented by a vaccine, but there is still uncertainty about its relative importance in Asia. This study investigated the age-specific prevalence of Hib carriage and its molecular epidemiology in carriage and disease in Nepal. Oropharyngeal swabs were collected from children in Kathmandu, Nepal, from 3 different settings: a hospital outpatient department (OPD), schools, and children's homes. Hib was isolated using Hib antiserum agar plates, and serotyping was performed with latex agglutination. Hib isolates from children with invasive disease were obtained during active microbiological surveillance at Patan Hospital, Kathmandu, Nepal. Genotyping of disease and carriage isolates was undertaken using multilocus sequence typing (MLST). Swabs were taken from 2,195 children, including 1,311 children at an OPD, 647 children attending schools, and 237 children in homes. Overall, Hib was identified in 5.0% (110/2,195; 95% confidence interval [95% CI], 3.9% to 6.4%). MLST was performed on 108 Hib isolates from children carrying Hib isolates and 15 isolates from children with invasive disease. Thirty-one sequence types (STs) were identified, and 20 of these were novel STs. The most common ST isolates were sequence type 6 (ST6) and the novel ST722. There was marked heterogeneity among the STs from children with disease and children carrying Hib. STs identified from invasive infections were those commonly identified in carriage. This study provides evidence of Hib carriage among children in urban Nepal with genetically diverse strains prior to introduction of universal vaccination. The Hib carriage rate in Nepal was similar to the rates observed in other populations with documented high disease rates prior to vaccination, supporting implementation of Hib vaccine in Nepal in 2009. PMID:21270225
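The carriage figures quoted in the abstract above can be re-derived with a few lines of arithmetic. The sketch below is only an illustration: it sums the three sampling settings, recomputes the overall prevalence, and computes a Wilson score interval. The choice of the Wilson method is an assumption made here; the abstract does not say how its confidence interval was obtained, so this simple binomial interval should not be expected to reproduce the published bounds exactly.

```python
import math


def wilson_interval(successes, n, z=1.959964):
    """Wilson score interval for a binomial proportion (z defaults to ~95%)."""
    p = successes / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return centre - half_width, centre + half_width


# Counts taken from the abstract; the dictionary keys are illustrative labels.
children_by_setting = {"OPD": 1311, "schools": 647, "homes": 237}
total_children = sum(children_by_setting.values())
hib_carriers = 110

prevalence = hib_carriers / total_children
low, high = wilson_interval(hib_carriers, total_children)

print(f"children swabbed: {total_children}")
print(f"Hib carriage: {prevalence:.1%}")
print(f"Wilson 95% interval (assumed method): {low:.1%} to {high:.1%}")
```

A cluster-adjusted or exact interval would generally be wider than this one, which is one plausible reason for any difference from the reported range.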
Krishnamurthi, S; Ruckmani, A; Pukall, R; Chakrabarti, T
2010-11-01
The taxonomic status of three Bacillus species, Bacillus insolitus, B. psychrodurans and B. psychrotolerans was reexamined using a polyphasic approach. In our analysis, these three Bacillus species formed a cluster separate from other members of Bacillus rRNA group 2 [5] and from Bacillus sensu stricto. These three species shared high 16S rRNA gene sequence similarities between them (97.8-99.7%) and showed closest sequence similarity (95.3-96.3%) to Paenisporosarcina quisquiliarum gen. nov., sp. nov. [18]. Sequence similarities with other related genera ranged between 90.9% and 94.5%. Phylogenetic coherence of the three species was supported by phenotypic characteristics, such as growth at low temperatures, negative oxidation and assimilation of many carbohydrates, MK8 as the major isoprenoid quinone and broadly similar polar lipid profiles. All three species had a similar peptidoglycan type of the variation A4β and similar genomic G+C contents (35.7-36.6 mol% [1]). Genomic relatedness among them was shown to be less than 70% and justified their separate species status [1]. These three species could be differentiated from each other and from related taxa on the basis of phenotypic, including chemotaxonomic, characteristics and ribotype patterns. On the basis of our analysis, we propose a new genus Psychrobacillus gen. nov. and to transfer B. insolitus, B. psychrodurans and B. psychrotolerans to the new genus as Psychrobacillus insolitus comb. nov. (type species of the genus; type strain W16B(T)=DSM 5(T)), P. psychrodurans comb. nov. (type strain 68E3(T)=DSM 11713(T)) and P. psychrotolerans comb. nov. (type strain 3H1(T)=DSM 11706(T)). Copyright © 2010 Elsevier GmbH. All rights reserved.
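Sequence similarity percentages such as those quoted above are typically computed from pairwise alignments. The following is a minimal sketch of one common convention, counting identical positions over aligned columns where neither sequence has a gap. The function name, the gap-handling rule and the toy sequences are assumptions for illustration; the abstract does not state how its similarity values were calculated.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for base_a, base_b in zip(aligned_a.upper(), aligned_b.upper()):
        if base_a == gap or base_b == gap:
            continue  # skip gapped columns (one of several possible conventions)
        compared += 1
        if base_a == base_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical toy alignment, not real 16S rRNA gene data.
print(percent_identity("ACGTACGTAC", "ACGTACGTAT"))
```

Other conventions (counting gaps as mismatches, or dividing by the shorter sequence length) give slightly different values, so reported similarities are only comparable when the same rule is used.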
Tau, Nomsa; Smouse, Shannon L.; Mtshali, Phillip S.; Mnyameni, Florah; Khumalo, Zamantungwa T. H.; Ismail, Arshad; Govender, Nevashan; Thomas, Juno
2018-01-01
We report whole-genome sequences for 10 Listeria monocytogenes sequence type 6 isolates associated with a large listeriosis outbreak in South Africa, which occurred over the period of 2017 to 2018. The possibility of listeriosis spreading beyond South Africa’s borders as a result of exported contaminated food products prompted us to make the genome sequences publicly available. PMID:29930052
PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities
2011-01-01
Background: Facilities that provide a service for DNA sequencing typically support large numbers of users and experiment types. The cost of services is often reduced by the use of liquid handling robots but the efficiency of such facilities is hampered because the software for such robots does not usually integrate well with the systems that run the sequencing machines. Accordingly, there is a need for software systems capable of integrating different robotic systems and managing sample information for DNA sequencing services. In this paper, we describe an extension to the Protein Information Management System (PIMS) that is designed for DNA sequencing facilities. The new version of PIMS has a user-friendly web interface and integrates all aspects of the sequencing process, including sample submission, handling and tracking, together with capture and management of the data. Results: The PIMS sequencing extension has been in production since July 2009 at the University of Leeds DNA Sequencing Facility. It has completely replaced manual data handling and simplified the tasks of data management and user communication. Samples from 45 groups have been processed with an average throughput of 10000 samples per month. The current version of the PIMS sequencing extension works with Applied Biosystems 3130XL 96-well plate sequencer and MWG 4204 or Aviso Theonyx liquid handling robots, but is readily adaptable for use with other combinations of robots. Conclusions: PIMS has been extended to provide a user-friendly and integrated data management solution for DNA sequencing facilities that is accessed through a normal web browser and allows simultaneous access by multiple users as well as facility managers. The system integrates sequencing and liquid handling robots, manages the data flow, and provides remote access to the sequencing results. The software is freely available, for academic users, from http://www.pims-lims.org/. PMID:21385349
Old, M O; Logan, L H; Maldonado, Y A
1997-11-01
Sabin type 3 polio vaccine virus is the most common cause of poliovaccine associated paralytic poliomyelitis. Vaccine associated paralytic poliomyelitis cases have been associated with Sabin type 3 revertants containing a single U to C substitution at bp 472 of Sabin type 3. A rapid method of identification of Sabin type 3 bp 472 mutants is described. An enterovirus group-specific probe for use in a chemiluminescent dot blot hybridization assay was developed to identify enterovirus positive viral lysates. A reverse transcription-polymerase chain reaction (RT-PCR) assay producing a 319 bp PCR product containing the Sabin type 3 bp 472 mutation site was then employed to identify Sabin type 3 isolates. Chemiluminescent nucleic acid cycle sequencing of the purified 319 bp PCR product was then employed to identify nucleic acid sequences at bp 472. The enterovirus group probe hybridization procedure and isolation of the Sabin type 3 PCR product were highly sensitive and specific; nucleic acid cycle sequencing corresponded to the known sequence of stock Sabin type 3 isolates. These methods will be used to identify the Sabin type 3 reversion rate from sequential stool samples of infants obtained after the first and second doses of oral poliovirus vaccine.
Developing eThread pipeline using SAGA-pilot abstraction for large-scale structural bioinformatics.
Ragothaman, Anjani; Boddu, Sairam Chowdary; Kim, Nayong; Feinstein, Wei; Brylinski, Michal; Jha, Shantenu; Kim, Joohyun
2014-01-01
While most computational annotation approaches are sequence-based, threading methods are becoming increasingly attractive because of predicted structural information that could uncover the underlying function. However, threading tools are generally compute-intensive and the number of protein sequences from even small genomes such as prokaryotes is large, typically containing many thousands, prohibiting their application as a genome-wide structural systems biology tool. To leverage its utility, we have developed a pipeline for eThread, a meta-threading protein structure modeling tool, that can use computational resources efficiently and effectively. We employ a pilot-based approach that supports seamless data and task-level parallelism and manages large variation in workload and computational requirements. Our scalable pipeline is deployed on Amazon EC2 and can efficiently select resources based upon task requirements. We present runtime analysis to characterize computational complexity of eThread and EC2 infrastructure. Based on results, we suggest a pathway to an optimized solution with respect to metrics such as time-to-solution or cost-to-solution. Our eThread pipeline can scale to support a large number of sequences and is expected to be a viable solution for genome-scale structural bioinformatics and structure-based annotation, and is particularly amenable to small genomes such as prokaryotes. The developed pipeline is easily extensible to other types of distributed cyberinfrastructure.
Developing eThread Pipeline Using SAGA-Pilot Abstraction for Large-Scale Structural Bioinformatics
Ragothaman, Anjani; Feinstein, Wei; Jha, Shantenu; Kim, Joohyun
2014-01-01
While most computational annotation approaches are sequence-based, threading methods are becoming increasingly attractive because of predicted structural information that could uncover the underlying function. However, threading tools are generally compute-intensive and the number of protein sequences from even small genomes such as prokaryotes is large, typically containing many thousands, prohibiting their application as a genome-wide structural systems biology tool. To leverage its utility, we have developed a pipeline for eThread, a meta-threading protein structure modeling tool, that can use computational resources efficiently and effectively. We employ a pilot-based approach that supports seamless data and task-level parallelism and manages large variation in workload and computational requirements. Our scalable pipeline is deployed on Amazon EC2 and can efficiently select resources based upon task requirements. We present runtime analysis to characterize computational complexity of eThread and EC2 infrastructure. Based on results, we suggest a pathway to an optimized solution with respect to metrics such as time-to-solution or cost-to-solution. Our eThread pipeline can scale to support a large number of sequences and is expected to be a viable solution for genome-scale structural bioinformatics and structure-based annotation, and is particularly amenable to small genomes such as prokaryotes. The developed pipeline is easily extensible to other types of distributed cyberinfrastructure. PMID:24995285
Current whole-body MRI applications in the neurofibromatoses
Fayad, Laura M.; Khan, Muhammad Shayan; Bredella, Miriam A.; Harris, Gordon J.; Evans, D. Gareth; Farschtschi, Said; Jacobs, Michael A.; Chhabra, Avneesh; Salamon, Johannes M.; Wenzel, Ralph; Mautner, Victor F.; Dombi, Eva; Cai, Wenli; Plotkin, Scott R.; Blakeley, Jaishri O.
2016-01-01
Objectives: The Response Evaluation in Neurofibromatosis and Schwannomatosis (REiNS) International Collaboration Whole-Body MRI (WB-MRI) Working Group reviewed the existing literature on WB-MRI, an emerging technology for assessing disease in patients with neurofibromatosis type 1 (NF1), neurofibromatosis type 2 (NF2), and schwannomatosis (SWN), to recommend optimal image acquisition and analysis methods to enable WB-MRI as an endpoint in NF clinical trials. Methods: A systematic process was used to review all published data about WB-MRI in NF syndromes to assess diagnostic accuracy, feasibility and reproducibility, and data about specific techniques for assessment of tumor burden, characterization of neoplasms, and response to therapy. Results: WB-MRI at 1.5T or 3.0T is feasible for image acquisition. Short tau inversion recovery (STIR) sequence is used in all investigations to date, suggesting consensus about the utility of this sequence for detection of WB tumor burden in people with NF. There are insufficient data to support a consensus statement about the optimal imaging planes (axial vs coronal) or 2D vs 3D approaches. Functional imaging, although used in some NF studies, has not been systematically applied or evaluated. There are no comparative studies between regional vs WB-MRI or evaluations of WB-MRI reproducibility. Conclusions: WB-MRI is feasible for identifying tumors using both 1.5T and 3.0T systems. The STIR sequence is a core sequence. Additional investigation is needed to define the optimal approach for volumetric analysis, the reproducibility of WB-MRI in NF, and the diagnostic performance of WB-MRI vs regional MRI. PMID:27527647
Amino-terminal sequence of glycoprotein D of herpes simplex virus types 1 and 2
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eisenberg, R.J.; Long, D.; Hogue-Angeletti, R.
1984-01-01
Glycoprotein D (gD) of herpes simplex virus is a structural component of the virion envelope which stimulates production of high titers of herpes simplex virus type-common neutralizing antibody. The authors carried out automated N-terminal amino acid sequencing studies on radiolabeled preparations of gD-1 (gD of herpes simplex virus type 1) and gD-2 (gD of herpes simplex virus type 2). Although some differences were noted, particularly in the methionine and alanine profiles for gD-1 and gD-2, the amino acid sequence of a number of the first 30 residues of the amino terminus of gD-1 and gD-2 appears to be quite similar. For both proteins, the first residue is a lysine. When we compared our sequence data for gD-1 with those predicted by nucleic acid sequencing, the two sequences could be aligned (with one exception) starting at residue 26 (lysine) of the predicted sequence. Thus, the first 25 amino acids of the predicted sequence are absent from the polypeptides isolated from infected cells.
A comparative study of retrotransposons in the centromeric regions of A and B chromosomes of maize.
Theuri, J; Phelps-Durr, T; Mathews, S; Birchler, J
2005-01-01
Bacterial Artificial Chromosomes (BACs) derived from the B chromosome, based on homology with the B specific sequence, were subcloned and sequenced. Analysis of DNA sequence data indicated the presence of 23 common retroelements, as well as novel sequences of B chromosome origin. Generally, where the same retrotransposon type was observed in both A and B chromosomes, there were more copies per unit of sequence in the B centromeric region (the major site of B repeat) than in the A centromere, except for Huck-1. Based on previous estimates of the age of the major burst of transposition into the maize genome, the oldest retrotransposons (Ji-6 and Tekay, approximately 5.0 and 5.2 million years ago, respectively) were found in the B centromere region only, while the next two oldest (Huck-1 and Opie-1) were found in both the A and B sequences. Phylogenetic analysis of Opie retroelements from both A and B centromeres indicated that some of the B Opie centromeric sequences share a more recent common ancestor with A Opie retroelements than they do with other B Opie centromeric sequences. These results imply that the supernumerary maize B chromosome has coexisted with the A chromosomes during that period of transposition. They also support the hypothesis that the B chromosome had its origins from A chromosome elements, or that alternative origins, such as being donated to the maize genome in a wide species cross, preceded six million years ago, because the spectrum of retrotransposons in the two chromosomes is quite similar.
Microbe-ID: an open source toolbox for microbial genotyping and species identification.
Tabima, Javier F; Everhart, Sydney E; Larsen, Meredith M; Weisberg, Alexandra J; Kamvar, Zhian N; Tancos, Matthew A; Smart, Christine D; Chang, Jeff H; Grünwald, Niklaus J
2016-01-01
Development of tools to identify species, genotypes, or novel strains of invasive organisms is critical for monitoring emergence and implementing rapid response measures. Molecular markers, although critical to identifying species or genotypes, require bioinformatic tools for analysis. However, user-friendly analytical tools for fast identification are not readily available. To address this need, we created a web-based set of applications called Microbe-ID that allow for customizing a toolbox for rapid species identification and strain genotyping using any genetic markers of choice. Two components of Microbe-ID, named Sequence-ID and Genotype-ID, implement species and genotype identification, respectively. Sequence-ID allows identification of species by using BLAST to query sequences for any locus of interest against a custom reference sequence database. Genotype-ID allows placement of an unknown multilocus marker in either a minimum spanning network or dendrogram with bootstrap support from a user-created reference database. Microbe-ID can be used for identification of any organism based on nucleotide sequences or any molecular marker type and several examples are provided. We created a public website for demonstration purposes called Microbe-ID (microbe-id.org) and provided a working implementation for the genus Phytophthora (phytophthora-id.org). In Phytophthora-ID, the Sequence-ID application allows identification based on ITS or cox spacer sequences. Genotype-ID groups individuals into clonal lineages based on simple sequence repeat (SSR) markers for the two invasive plant pathogen species P. infestans and P. ramorum. All code is open source and available on github and CRAN. Instructions for installation and use are provided at https://github.com/grunwaldlab/Microbe-ID.
Using R in Taverna: RShell v1.2
Wassink, Ingo; Rauwerda, Han; Neerincx, Pieter BT; Vet, Paul E van der; Breit, Timo M; Leunissen, Jack AM; Nijholt, Anton
2009-01-01
Background: R is the statistical language commonly used by many life scientists in (omics) data analysis. At the same time, these complex analyses benefit from a workflow approach, such as used by the open source workflow management system Taverna. However, Taverna had limited support for R, because it supported just a few data types and only a single output. Also, there was no support for graphical output and persistent sessions. Altogether this made using R in Taverna impractical. Findings: We have developed an R plugin for Taverna: RShell, which provides R functionality within workflows designed in Taverna. In order to fully support the R language, our RShell plugin directly uses the R interpreter. The RShell plugin consists of a Taverna processor for R scripts and an RShell Session Manager that communicates with the R server. We made the RShell processor highly configurable allowing the user to define multiple inputs and outputs. Also, various data types are supported, such as strings, numeric data and images. To limit data transport between multiple RShell processors, the RShell plugin also supports persistent sessions. Here, we will describe the architecture of RShell and the new features that are introduced in version 1.2, i.e.: i) Support for R up to and including R version 2.9; ii) Support for persistent sessions to limit data transfer; iii) Support for vector graphics output through PDF; iv) Syntax highlighting of the R code; v) Improved usability through fewer port types. Our new RShell processor is backwards compatible with workflows that use older versions of the RShell processor. We demonstrate the value of the RShell processor by a use-case workflow that maps oligonucleotide probes designed with DNA sequence information from Vega onto the Ensembl genome assembly. Conclusion: Our RShell plugin enables Taverna users to employ R scripts within their workflows in a highly configurable way. PMID:19607662
Oxley, Andrew P A; Argo, Jeffrey A; McKay, David B
2005-11-01
The gastric fluid of six bottlenose dolphins and the faeces of four polar bears from the same oceanarium were examined for the presence of Helicobacter. As detected by PCR, all dolphins and 8/12 samples collected from polar bears were positive for Helicobacter. Novel sequence types were identified in samples collected from these animals of which several were unique to either the dolphins or the polar bears. At least one sequence type was, however, detected in both animal taxa. In addition, a sequence type from a dolphin shared a 98.2-100% identity to sequences from other Helicobacter species from harp seals, sea otters and sea lions. This study reports on the occurrence of novel Helicobacter sequence types in polar bears and dolphins and demonstrates the broad-host range of some species within these animals.
Park, Hae-Jeong; Chun, Ji-Won; Park, Bumhee; Park, Haeil; Kim, Joong Il; Lee, Jong Doo; Kim, Jae-Jin
2011-05-01
Although blind people heavily depend on working memory to manage daily life without visual information, it is not clear yet whether their working memory processing involves functional reorganization of the memory-related cortical network. To explore functional reorganization of the cortical network that supports various types of working memory processes in the early blind, we investigated activation differences between 2-back tasks and 0-back tasks using fMRI in 10 congenitally blind subjects and 10 sighted subjects. We used three types of stimulus sequences: words for a verbal task, pitches for a non-verbal task, and sound locations for a spatial task. When compared to the sighted, the blind showed additional activations in the occipital lobe for all types of stimulus sequences for working memory and more significant deactivation in the posterior cingulate cortex of the default mode network. The blind had increased effective connectivity from the default mode network to the left parieto-frontal network and from the occipital cortex to the right parieto-frontal network during the 2-back tasks than the 0-back tasks. These findings suggest not only cortical plasticity of the occipital cortex but also reorganization of the cortical network for the executive control of working memory.
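The n-back paradigm referred to in the abstract above has a simple rule that can be sketched in code. In a 2-back task a stimulus counts as a target when it matches the stimulus presented two positions earlier. For the 0-back control, the sketch assumes the common convention of matching against a single pre-specified target; the abstract does not describe the exact rule used in the study, and the function name and pitch labels below are hypothetical.

```python
def n_back_targets(stimuli, n, zero_back_target=None):
    """Return one boolean per stimulus marking whether it is an n-back target."""
    if n < 0:
        raise ValueError("n must be non-negative")
    if n == 0:
        # Assumed 0-back convention: respond whenever the fixed target appears.
        return [stimulus == zero_back_target for stimulus in stimuli]
    # For n >= 1: a target matches the stimulus shown n positions earlier.
    return [i >= n and stimuli[i] == stimuli[i - n] for i in range(len(stimuli))]


# Hypothetical pitch sequence for a non-verbal task.
pitches = ["C", "E", "C", "G", "C", "G"]
print(n_back_targets(pitches, 2))
print(n_back_targets(pitches, 0, zero_back_target="G"))
```

The 2-back condition requires continuously updating the last two items in memory, whereas the 0-back condition does not, which is why the contrast between them is used to isolate working memory load.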
Knight, Cameron G; Munday, John S; Rosa, Brielle V; Kiupel, Matti
2011-12-01
A 9-year-old gelding presented with approximately 100 papillomas that covered about 75% of the distal penis. Biopsy was performed, and histology showed evidence of viral cytopathic change and koilocytosis. Polymerase chain reaction using DNA extracted from biopsied tissue amplified equine papillomavirus type 2 (EcPV-2) DNA sequences. Sixteen months later, the horse was re-examined and the appearance of the papillomas was unchanged. Equine papillomavirus type 2 DNA sequences were again amplified from both biopsied tissue and swabs of the penis. Papillomavirus was localized to the lesions by immunohistochemistry and in situ hybridization. An examination 2 years after the initial presentation revealed no detectable change in the appearance of the penis. The large number of papillomas and their failure to regress over an extended period support a clinical classification of papillomatosis. To the authors' knowledge, this is the first report of papillomatosis of the equine penis. This novel clinical manifestation suggests that persistent EcPV-2 infection is possible in horses. As there is evidence that EcPV-2 may promote development of equine penile squamous cell carcinoma, understanding the natural history of EcPV-2 infections may be important in preventing equine penile neoplasia. © 2011 The Authors. Veterinary Dermatology. © 2011 ESVD and ACVD.
Whiteduck-Léveillée, Kerri; Whiteduck-Léveillée, Jenni; Cloutier, Michel; Tambong, James T; Xu, Renlin; Topp, Edward; Arts, Michael T; Chao, Jerry; Adam, Zaky; Lévesque, C André; Lapen, David R; Villemur, Richard; Khan, Izhar U H
2016-03-01
A study on the taxonomic classification of Arcobacter species was performed on the cultures isolated from various fecal sources where an Arcobacter strain AF1078(T) from human waste septic tank near Ottawa, Ontario, Canada was characterized using a polyphasic approach. Genetic investigations including 16S rRNA, atpA, cpn60, gyrA, gyrB and rpoB gene sequences of strain AF1078(T) are unique in comparison with other arcobacters. Phylogenetic analysis based on the 16S rRNA gene sequence revealed that the strain is most closely related to Arcobacter lanthieri and Arcobacter cibarius. Analyses of atpA, cpn60, gyrA, gyrB and rpoB gene sequences suggested that strain AF1078(T) formed a phylogenetic lineage independent of other species in the genus. Whole-genome sequence, DNA-DNA hybridization, fatty acid profile and phenotypic analysis further supported the conclusion that strain AF1078(T) represents a novel Arcobacter species, for which the name Arcobacter faecis sp. nov. is proposed, with type strain AF1078(T) (=LMG 28519(T); CCUG 66484(T)). Crown Copyright © 2015. Published by Elsevier GmbH. All rights reserved.
Effect of Noise on DNA Sequencing via Transverse Electronic Transport
Krems, Matt; Zwolak, Michael; Pershin, Yuriy V.; Di Ventra, Massimiliano
2009-01-01
Previous theoretical studies have shown that measuring the transverse current across DNA strands while they translocate through a nanopore or channel may provide a statistically distinguishable signature of the DNA bases, and may thus allow for rapid DNA sequencing. However, fluctuations of the environment, such as ionic and DNA motion, introduce important scattering processes that may affect the viability of this approach to sequencing. To understand this issue, we have analyzed a simple model that captures the role of this complex environment in electronic dephasing and its ability to remove charge carriers from current-carrying states. We find that these effects do not strongly influence the current distributions due to the off-resonant nature of tunneling through the nucleotides, a result we expect to be a common feature of transport in molecular junctions. In particular, only large scattering strengths, as compared to the energetic gap between the molecular states and the Fermi level, significantly alter the form of the current distributions. Since this gap itself is quite large, the current distributions remain protected from this type of noise, further supporting the possibility of using transverse electronic transport measurements for DNA sequencing. PMID:19804730
Allam, Mushal; Tau, Nomsa; Smouse, Shannon L; Mtshali, Phillip S; Mnyameni, Florah; Khumalo, Zamantungwa T H; Ismail, Arshad; Govender, Nevashan; Thomas, Juno; Smith, Anthony M
2018-06-21
We report whole-genome sequences for 10 Listeria monocytogenes sequence type 6 isolates associated with a large listeriosis outbreak in South Africa, which occurred over the period of 2017 to 2018. The possibility of listeriosis spreading beyond South Africa's borders as a result of exported contaminated food products prompted us to make the genome sequences publicly available. Copyright © 2018 Allam et al.
DraGnET: Software for storing, managing and analyzing annotated draft genome sequence data
2010-01-01
Background New "next generation" DNA sequencing technologies offer individual researchers the ability to rapidly generate large amounts of genome sequence data at dramatically reduced costs. As a result, a need has arisen for new software tools for storage, management and analysis of genome sequence data. Although bioinformatic tools are available for the analysis and management of genome sequences, limitations still remain. For example, restrictions on the submission of data and use of these tools may be imposed, thereby making them unsuitable for sequencing projects that need to remain in-house or proprietary during their initial stages. Furthermore, the availability and use of next generation sequencing in industrial, governmental and academic environments requires biologists to have access to computational support for the curation and analysis of the data generated; however, this type of support is not always immediately available. Results To address these limitations, we have developed DraGnET (Draft Genome Evaluation Tool). DraGnET is an open source web application which allows researchers with no experience in programming and database management to set up their own in-house projects for storing, retrieving, organizing and managing annotated draft and complete genome sequence data. The software provides a web interface for the use of BLAST, allowing users to perform preliminary comparative analysis among multiple genomes. We demonstrate the utility of DraGnET for performing comparative genomics on closely related bacterial strains. Furthermore, DraGnET can be further developed to incorporate additional tools for more sophisticated analyses. Conclusions DraGnET is designed for use either by individual researchers or as a collaborative tool available through Internet (or Intranet) deployment. 
For genome projects that require genome sequencing data to initially remain proprietary, DraGnET provides the means for researchers to keep their data in-house for analysis using local programs or until it is made publicly available, at which point it may be uploaded to additional analysis software applications. The DraGnET home page is available at http://www.dragnet.cvm.iastate.edu and includes example files for examining the functionalities, a link for downloading the DraGnET setup package and a link to the DraGnET source code hosted with full documentation on SourceForge. PMID:20175920
McCutchen-Maloney, Sandra L.
2002-01-01
DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dipsticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such as TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding, which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera, which gives a decrease in signal.
Kim, Jeong-Yu; Jeong, Jo-Eun; Rhee, Je-Keun; Cho, Hyun; Chun, Ji-Won; Kim, Tae-Min; Choi, Sam-Wook; Choi, Jung-Seok; Kim, Dai-Jin
2016-01-01
Background and aims Internet gaming disorder (IGD) has gained recognition as a potential new diagnosis in the fifth revision of the Diagnostic and Statistical Manual of Mental Disorders, but genetic evidence supporting this disorder remains scarce. Methods In this study, targeted exome sequencing was conducted in 30 IGD patients and 30 control subjects with a focus on genes linked to various neurotransmitters associated with substance and non-substance addictions, depression, and attention deficit hyperactivity disorder. Results rs2229910 of neurotrophic tyrosine kinase receptor, type 3 (NTRK3) was the only single nucleotide polymorphism (SNP) that exhibited a significantly different minor allele frequency in IGD subjects compared to controls (p = .01932), suggesting that this SNP has a protective effect against IGD (odds ratio = 0.1541). The presence of this potentially protective allele was also associated with less time spent on Internet gaming and lower scores on the Young’s Internet Addiction Test and Korean Internet Addiction Proneness Scale for Adults. Conclusions The results of this first targeted exome sequencing study of IGD subjects indicate that rs2229910 of NTRK3 is a genetic variant that is significantly related to IGD. These findings may have significant implications for future research investigating the genetics of IGD and other behavioral addictions. PMID:27826991
Takeda, Ryuta; Petrov, Anton I.; Leontis, Neocles B.; Ding, Biao
2011-01-01
Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5′-CGA-3′...5′-GAC-3′ flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes. PMID:21258006
NASA Astrophysics Data System (ADS)
Daneshian, Jahanbakhsh; Ramezani Dana, Leila; Sadler, Peter
2017-01-01
Benthic foraminifera species commonly outnumber planktic species in the type area of the Lower Miocene Qom Formation, in north central Iran, where it records the Tethyan link between the eastern Mediterranean and Indo-Pacific provinces. Because measured sections preserve very different sequences of first and last occurrences of these species, no single section provides a completely suitable baseline for correlation. To resolve this problem, we combined bioevents from three stratigraphic sections into a single composite sequence by constrained optimization (CONOP). The composite section arranges the first and last appearance events (FAD and LAD) of 242 foraminifera in an optimal order that minimizes the implied diachronism between sections. The composite stratigraphic ranges of the planktic foraminifera support a practical biozonation which reveals substantial local changes of accumulation rate during Aquitanian to Burdigalian times. Traditional biozone boundaries emerge little changed but an order of magnitude more correlations can be interpolated. The top of the section at Dobaradar is younger than previously thought and younger than sections at Dochah and Tigheh Reza-Abad. The latter two sections probably extend older into the Aquitanian than the Dobaradar section, but likely include a hiatus near the base of the Burdigalian. The bounding contacts with the Upper Red and Lower Red Formations are shown to be diachronous.
Wassif, Christopher A; Cross, Joanna L; Iben, James; Sanchez-Pulido, Luis; Cougnoux, Antony; Platt, Frances M; Ory, Daniel S; Ponting, Chris P; Bailey-Wilson, Joan E; Biesecker, Leslie G; Porter, Forbes D
2016-01-01
Niemann-Pick disease type C (NPC) is a recessive, neurodegenerative, lysosomal storage disease caused by mutations in either NPC1 or NPC2. The diagnosis is difficult and frequently delayed. Ascertainment is likely incomplete because of both these factors and because the full phenotypic spectrum may not have been fully delineated. Given the recent development of a blood-based diagnostic test and the development of potential therapies, understanding the incidence of NPC and defining at-risk patient populations are important. We evaluated data from four large, massively parallel exome sequencing data sets. Variant sequences were identified and classified as pathogenic or nonpathogenic based on a combination of literature review and bioinformatic analysis. This methodology provided an unbiased approach to determining the allele frequency. Our data suggest an incidence rate for NPC1 and NPC2 of 1/92,104 and 1/2,858,998, respectively. Evaluation of common NPC1 variants, however, suggests that there may be a late-onset NPC1 phenotype with a markedly higher incidence, on the order of 1/19,000-1/36,000. We determined a combined incidence of classical NPC of 1/89,229, or 1.12 affected patients per 100,000 conceptions, but predict incomplete ascertainment of a late-onset phenotype of NPC1. This finding strongly supports the need for increased screening of potential patients.
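The combined figure in the abstract above can be checked with simple arithmetic. The sketch below assumes the NPC1 and NPC2 incidences are simply additive (an assumption made here for illustration; the abstract does not spell out the calculation), and the variable names are hypothetical.

```python
# Incidence figures as reported in the abstract (one affected per N conceptions).
npc1_one_in = 92_104
npc2_one_in = 2_858_998

# Assumption: NPC1 and NPC2 disease are distinct outcomes, so their rates add.
combined_rate = 1 / npc1_one_in + 1 / npc2_one_in
combined_one_in = 1 / combined_rate
per_100k = combined_rate * 100_000

print(f"combined incidence: 1/{combined_one_in:,.0f}")
print(f"affected per 100,000 conceptions: {per_100k:.2f}")
```

Under that assumption the result agrees with the reported combined incidence of 1/89,229, or 1.12 per 100,000 conceptions.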
Standards for Clinical Grade Genomic Databases.
Yohe, Sophia L; Carter, Alexis B; Pfeifer, John D; Crawford, James M; Cushman-Vokoun, Allison; Caughron, Samuel; Leonard, Debra G B
2015-11-01
Next-generation sequencing performed in a clinical environment must meet clinical standards, which requires reproducibility of all aspects of the testing. Clinical-grade genomic databases (CGGDs) are required to classify a variant and to assist in the professional interpretation of clinical next-generation sequencing. Applying quality laboratory standards to the reference databases used for sequence-variant interpretation presents a new challenge for validation and curation. The objective was to define CGGDs and the categories of information they contain, and to frame recommendations for the structure and use of these databases in clinical patient care. Members of the College of American Pathologists Personalized Health Care Committee reviewed the literature and existing state of genomic databases and developed a framework for guiding CGGD development in the future. Clinical-grade genomic databases may provide different types of information. This work group defined 3 layers of information in CGGDs: clinical genomic variant repositories, genomic medical data repositories, and genomic medicine evidence databases. The layers are differentiated by the types of genomic and medical information contained and the utility in assisting with clinical interpretation of genomic variants. Clinical-grade genomic databases must meet specific standards regarding submission, curation, and retrieval of data, as well as the maintenance of privacy and security. These organizing principles for CGGDs should serve as a foundation for future development of specific standards that support the use of such databases for patient care.
NASA Astrophysics Data System (ADS)
Alguacil, M. M.; Torrecillas, E.; Roldán, A.; Díaz, G.; Torres, P.
2012-04-01
The composition of arbuscular mycorrhizal fungi (AMF) communities regulates plant interactions and determines the structure of plant communities. In this study we analysed the diversity of AMF in the roots of two perennial gypsophyte plant species, Herniaria fruticosa and Senecio auricula, and an annual herbaceous species, Bromus rubens, growing in a gypsum soil from a semiarid area. The objective was to determine whether perennial and annual host plants support different AMF communities in their roots and whether there are AMF species that might be indicators of specific functional plant roles in these ecosystems. The roots were analysed by nested PCR, cloning, sequencing of the ribosomal DNA small subunit region and phylogenetic analysis. Twenty AMF sequence types, belonging to the Glomus group A, Glomus group B, Diversisporaceae, Acaulosporaceae, Archaeosporaceae and Paraglomeraceae, were identified. Both gypsophyte perennial species had differing compositions of the AMF community and higher diversity when compared with the annual species, showing preferential selection by specific AMF sequence types. B. rubens did not show host specificity, sharing the full composition of its AMF community with both perennial plant species. Seasonal variations in the competitiveness of AM fungi could explain the observed differences in AMF community composition, but this is still a working hypothesis that requires the analysis of further data obtained from a higher number of both annual and perennial plant species in order to be fully tested.
Kowalsky, Caitlin A; Whitehead, Timothy A
2016-12-01
The comprehensive sequence determinants of binding affinity for type I cohesin toward dockerin from Clostridium thermocellum and Clostridium cellulolyticum were evaluated using deep mutational scanning coupled to yeast surface display. We measured the relative binding affinity to dockerin for 2970 and 2778 single point mutants of C. thermocellum and C. cellulolyticum, respectively, representing over 96% of all possible single point mutants. The interface ΔΔG for each variant was reconstructed from sequencing counts and compared with three independent experimental methods. This reconstruction results in a narrow dynamic range of -0.8 to 0.5 kcal/mol. The computational software packages FoldX and Rosetta were used to predict mutations that disrupt binding by more than 0.4 kcal/mol. The area under the receiver operating characteristic curve was 0.82 for FoldX and 0.77 for Rosetta, showing reasonable agreement between predictions and experimental results. Destabilizing mutations to core and rim positions were predicted with higher accuracy than support positions. This benchmark dataset may be useful for developing new computational tools for predicting the mutational effect on binding affinities for protein-protein interactions. Experimental considerations to improve precision and range of the reconstruction method are discussed. Proteins 2016; 84:1914-1928. © 2016 Wiley Periodicals, Inc.
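The area-under-the-curve figures quoted for FoldX and Rosetta can be read as the probability that a randomly chosen disruptive mutation receives a higher predicted score than a randomly chosen non-disruptive one. The sketch below illustrates that pairwise definition; `roc_auc` is a hypothetical helper and the scores and labels are made-up values, not data from the study.

```python
def roc_auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical predicted ddG values (kcal/mol) and whether the experiment
# classified each mutation as disrupting binding.
predicted = [1.9, 0.2, 0.8, 0.1, 1.2, 0.5]
disrupts = [True, False, True, False, False, True]

auc = roc_auc(predicted, disrupts)
print(f"AUC = {auc:.2f}")
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which the reported 0.82 and 0.77 sit.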
Walk this way: approaching bodies can influence the processing of faces.
Pilz, Karin S; Vuong, Quoc C; Bülthoff, Heinrich H; Thornton, Ian M
2011-01-01
A highly familiar type of movement occurs whenever a person walks towards you. In the present study, we investigated whether this type of motion has an effect on face processing. We took a range of different 3D head models and placed them on a single, identical 3D body model. The resulting figures were animated to approach the observer. In a first series of experiments, we used a sequential matching task to investigate how the motion of an approaching person affects immediate responses to faces. We compared observers' responses following approach sequences to their performance with figures walking backwards (receding motion) or remaining still. Observers were significantly faster in responding to a target face that followed an approach sequence, compared to both receding and static primes. In a second series of experiments, we investigated long-term effects of motion using a delayed visual search paradigm. After studying moving or static avatars, observers searched for target faces in static arrays of varying set sizes. Again, observers were faster at responding to faces that had been learned in the context of an approach sequence. Together these results suggest that the context of a moving body influences face processing, and support the hypothesis that our visual system has mechanisms that aid the encoding of behaviourally-relevant and familiar dynamic events. Copyright © 2010 Elsevier B.V. All rights reserved.
Holmes, Anne; Allison, Lesley; Ward, Melissa; Dallman, Timothy J; Clark, Richard; Fawkes, Angie; Murphy, Lee; Hanson, Mary
2015-11-01
Detailed laboratory characterization of Escherichia coli O157 is essential to inform epidemiological investigations. This study assessed the utility of whole-genome sequencing (WGS) for outbreak detection and epidemiological surveillance of E. coli O157, and the data were used to identify discernible associations between genotypes and clinical outcomes. One hundred five E. coli O157 strains isolated over a 5-year period from human fecal samples in Lothian, Scotland, were sequenced with the Ion Torrent Personal Genome Machine. A total of 8,721 variable sites in the core genome were identified among the 105 isolates; 47% of the single nucleotide polymorphisms (SNPs) were attributable to six "atypical" E. coli O157 strains and included recombinant regions. Phylogenetic analyses showed that WGS correlated well with the epidemiological data. Epidemiological links existed between cases whose isolates differed by three or fewer SNPs. WGS also correlated well with multilocus variable-number tandem repeat analysis (MLVA) typing data, with only three discordant results observed, all among isolates from cases not known to be epidemiologically related. WGS produced a better-supported, higher-resolution phylogeny than MLVA, confirming that the method is more suitable for epidemiological surveillance of E. coli O157. A combination of in silico analyses (VirulenceFinder, ResFinder, and local BLAST searches) were used to determine stx subtypes, multilocus sequence types (15 loci), and the presence of virulence and acquired antimicrobial resistance genes. There was a high level of correlation between the WGS data and our routine typing methods, although some discordant results were observed, mostly related to the limitation of short sequence read assembly. The data were used to identify sublineages and clades of E. coli O157, and when they were correlated with the clinical outcome data, they showed that one clade, Ic3, was significantly associated with severe disease. 
Together, the results show that WGS data can provide higher resolution of the relationships between E. coli O157 isolates than that provided by MLVA. The method has the potential to streamline the laboratory workflow and provide detailed information for the clinical management of patients and public health interventions. Copyright © 2015, Holmes et al. PMID:26354815
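The observation that epidemiologically linked cases differed by three or fewer SNPs suggests a simple pairwise screen. The sketch below is only an illustration of that idea, assuming equal-length, pre-aligned core-genome sequences; the function names, the handling of `N` as missing data, and the toy isolates are all hypothetical and do not reproduce the study's pipeline.

```python
from itertools import combinations


def snp_distance(a, b):
    """Count differing positions between two aligned sequences, ignoring N."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(a, b) if x != y and x != "N" and y != "N")


def candidate_links(isolates, max_snps=3):
    """Return (id1, id2, distance) for isolate pairs within the SNP threshold."""
    links = []
    for (id1, seq1), (id2, seq2) in combinations(sorted(isolates.items()), 2):
        d = snp_distance(seq1, seq2)
        if d <= max_snps:
            links.append((id1, id2, d))
    return links


# Toy variable-site alignments (hypothetical isolates).
isolates = {
    "iso1": "ACGTACGTAC",
    "iso2": "ACGTACGTAT",
    "iso3": "TTGAACCTGC",
}

links = candidate_links(isolates)
print(links)
```

A pair flagged this way is only a candidate link; the abstract's conclusion rests on combining such distances with epidemiological data.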
Urmersbach, Sara; Alter, Thomas; Koralage, Madura Sanjeevani Gonsal; Sperling, Lisa; Gerdts, Gunnar; Messelhäusser, Ute; Huehn, Stephan
2014-03-08
Vibrio parahaemolyticus is frequently isolated from environmental and seafood samples and associated with gastroenteritis outbreaks in American, European, Asian and African countries. To distinguish between different lineages of V. parahaemolyticus various genotyping techniques have been used, including multilocus sequence typing (MLST). Even though some studies have already applied MLST analysis to characterize V. parahaemolyticus strain sets, these studies have been restricted to specific geographical areas (e.g. U.S. coast, Thailand and Peru), have focused exclusively on pandemic or non-pandemic pathogenic isolates or have been based on a limited strain number. To generate a global picture of V. parahaemolyticus genotype distribution, a collection of 130 environmental and seafood related V. parahaemolyticus isolates of different geographical origins (Sri Lanka, Ecuador, North Sea and Baltic Sea as well as German retail) was subjected to MLST analysis after modification of gyrB and recA PCRs. The V. parahaemolyticus population was composed of 82 unique Sequence Types (STs), of which 68 (82.9%) were new to the pubMLST database. After translating the in-frame nucleotide sequences into amino acid sequences, less diversity was detectable: a total of 31 different peptide Sequence Types (pSTs) with 19 (61.3%) new pSTs were generated from the analyzed isolates. Most STs did not show a global dissemination, but some were supra-regionally distributed and clusters of STs were dependent on geographical origin. At the peptide level no general clustering of strains from specific geographical regions was observed; the most common pSTs were found on all continents (Asia, South America and Europe) and rare pSTs were restricted to distinct countries or even geographical regions. One lineage of pSTs associated only with strains from the North Sea and Baltic Sea was identified. Our study reveals a high genetic diversity in the analyzed V. parahaemolyticus strain set as well as for geographical strain subsets, with a high proportion of newly discovered alleles and STs. Differences between the subsets were identified. Our data support the postulated population structure of V. parahaemolyticus which follows the 'epidemic' model of clonal expansion. Application of peptide based AA-MLST allowed the identification of reliable relationships between strains.
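The drop from 82 STs to 31 pSTs follows from synonymous codons: distinct nucleotide alleles can translate to the same peptide. The sketch below shows that collapse for a single locus using the standard genetic code; the allele sequences and the `translate` helper are hypothetical, and a real ST is a profile across several loci rather than one sequence.

```python
# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq):
    """Translate an in-frame nucleotide sequence whose length is a multiple of 3."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence is not in frame")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Hypothetical nucleotide alleles at one locus.
alleles = ["GCTGAA", "GCCGAG", "GCTGAT"]
peptides = [translate(a) for a in alleles]

nucleotide_types = len(set(alleles))
peptide_types = len(set(peptides))
print(nucleotide_types, "nucleotide alleles ->", peptide_types, "peptide alleles")
```

The first two alleles differ only at third codon positions and encode the same peptide, so they fall into one peptide-level type.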
Hall, Susan M.; Mihalasky, Mark J.; Tureck, Kathleen; Hammarstrom, Jane M.; Hannon, Mark
2017-01-01
The coincidence of a number of geologic and climatic factors combined to create conditions favorable for the development of mineable concentrations of uranium hosted by Eocene through Pliocene sandstones in the Texas Coastal Plain. Here 254 uranium occurrences, including 169 deposits, 73 prospects, 6 showings and 4 anomalies, have been identified. About 80 million pounds of U3O8 have been produced and about 60 million pounds of identified producible U3O8 remain in place. The development of economic roll-type uranium deposits requires a source, large-scale transport of uranium in groundwater, and deposition in reducing zones within a sedimentary sequence. The weight of the evidence supports a source from thick sequences of volcanic ash and volcaniclastic sediment derived mostly from the Trans-Pecos volcanic field and Sierra Madre Occidental that lie west of the region. The thickest accumulations of source material were deposited and preserved south and west of the San Marcos arch in the Catahoula Formation. By the early Oligocene, a formerly uniformly subtropical climate along the Gulf Coast transitioned to a zoned climate in which the southwestern portion of Texas Coastal Plain was dry, and the eastern portion humid. The more arid climate in the southwestern area supported weathering of volcanic ash source rocks during pedogenesis and early diagenesis, concentration of uranium in groundwater and movement through host sediments. During the middle Tertiary Era, abundant clastic sediments were deposited in thick sequences by bed-load dominated fluvial systems in long-lived channel complexes that provided transmissive conduits favoring transport of uranium-rich groundwater. Groundwater transported uranium through permeable sandstones that were hydrologically connected with source rocks, commonly across formation boundaries driven by isostatic loading and eustatic sea level changes. 
Uranium roll fronts formed as a result of the interaction of uranium-rich groundwater with either (1) organic-rich debris adjacent to large long-lived fluvial channels and barrier–bar sequences or (2) extrinsic reductants entrained in formation water or discrete gas that migrated into host units via faults and along the flanks of salt domes and shale diapirs. The southwestern portion of the region, the Rio Grande embayment, contains all the necessary factors required for roll-type uranium deposits. However, the eastern portion of the region, the Houston embayment, is challenged by a humid environment and a lack of source rock and transmissive units, which may combine to preclude the deposition of economic deposits. A grade and tonnage model for the Texas Coastal Plain shows that the Texas deposits represent a lower tonnage subset of roll-type deposits that occur around the world, and required aggregation of production centers into deposits based on geologic interpretation for the purpose of conducting a quantitative mineral resource assessment.
Pöggeler, S; Risch, S; Kück, U; Osiewacz, H D
1997-10-01
Homokaryons from the homothallic ascomycete Sordaria macrospora are able to enter the sexual pathway and to form fertile fruiting bodies. To analyze the molecular basis of homothallism and to elucidate the role of mating-products during fruiting body development, we cloned and sequenced the entire S. macrospora mating-type locus. Comparison of the Sordaria mating-type locus with mating-type idiomorphs from the heterothallic ascomycetes Neurospora crassa and Podospora anserina revealed that sequences from both idiomorphs (A/a and mat-/mat+, respectively) are contiguous in S. macrospora. DNA sequencing of the S. macrospora mating-type region allowed the identification of four open reading frames (ORFs), which were termed Smt-a1, SmtA-1, SmtA-2 and SmtA-3. While Smt-a1, SmtA-1, and SmtA-2 show strong sequence similarities with the corresponding N. crassa mating-type ORFs, SmtA-3 has a chimeric character. It comprises sequences that are similar to the A and a mating-type idiomorphs from N. crassa. To determine functionality of the S. macrospora mating-type genes, we show that all ORFs are transcriptionally expressed. Furthermore, we transformed the S. macrospora mating-type genes into mat- and mat+ strains of the closely related heterothallic fungus P. anserina. The transformation experiments show that mating-type genes from S. macrospora induce fruiting body formation in P. anserina.
Cech, Gábor; Molnár, Kálmán; Székely, Csaba
2017-06-01
Molnár et al. (2015) reported two types of echinostomatid metacercariae in the lateral line organ of Hungarian fish species. Type 1 metacercariae possessed 27 collar spines and 16 uniform and three larger dorsal spines, whereas Type 2 metacercariae bore 27 collar spines and 19 equal-sized dorsal spines. In the recent work, molecular studies carried out on the ITS region and partial 28S rDNA sequences of two types of echinostomatid metacercariae and the sequences of adult stages of the species of Petasiger Dietz, 1909 collected from cormorants (Phalacrocorax carbo L.) showed that some of the Type 2 metacercariae corresponded to Petasiger exaeretus Dietz, 1909, whereas other morphologically similar metacercariae were identified as Petasiger phalacrocoracis (Yamaguti, 1939). The sequences of the Type 1 metacercariae with three larger dorsal spines could not be identified with any of the known sequences from echinostomatid trematodes.
Evolution of nuclear rDNA ITS sequences in the Cladophora albida/sericea clade (Chlorophyta).
Bakker, F T; Olsen, J L; Stam, W T
1995-06-01
Ribosomal DNA ITS sequences were compared among 13 different species and biogeographic isolates from the monophyletic "albida/sericea clade" in the green algal genus Cladophora. Six distinct ITS sequence types were found, characterized by multiple insertions and deletions and high levels of nucleotide substitution. Conserved domains within the ITS regions indicate the presence of ITS secondary structure. Low transition/transversion ratios among the six types and nearly symmetrical tree-length frequency distributions indicate some saturation and low phylogenetic signal. Although branching order among five of the six ITS sequence types could not be resolved, estimates of ITS sequence divergence as compared with 18S divergence in a subset of the taxa suggest that the origin of the different ITS types is probably in the mid-Miocene (12 Ma ago) but that biogeographic isolates within a single ITS type (including both Pacific and Atlantic representatives) have probably dispersed on a time scale of thousands rather than millions of years.
Saranathan, Vinodkumar; Hamilton, Deborah; Powell, George V N; Kroodsma, Donald E; Prum, Richard O
2007-09-01
Vocal learning is thought to have evolved in three clades of birds (parrots, hummingbirds, and oscine passerines), and three clades of mammals (whales, bats, and primates). Behavioural data indicate that, unlike other suboscine passerines, the three-wattled bellbird Procnias tricarunculata (Cotingidae) is capable of vocal learning. Procnias tricarunculata shows conspicuous vocal ontogeny, striking geographical variation in song, and rapid temporal change in song within a population. Deprivation studies of vocal development in P. tricarunculata are impractical. Here, we report evidence from mitochondrial DNA sequences and nuclear microsatellite loci that genetic variation within and among the four allopatric breeding populations of P. tricarunculata is not congruent with variation in vocal behaviour. Sequences of the mitochondrial DNA control region document extensive haplotype sharing among localities and song types, and no phylogenetic resolution of geographical populations or behavioural groups. The vocally differentiated, allopatric breeding populations of P. tricarunculata are only weakly genetically differentiated populations, and are not distinct taxa. Mitochondrial DNA and microsatellite variation show small (2.9% and 13.5%, respectively) but significant correlation with geographical distance, but no significant residual variation by song type. Estimates of the strength of selection that would be needed to maintain the observed geographical pattern in vocal differentiation if songs were genetically based are unreasonably high, further discrediting the hypothesis of a genetic origin of vocal variation. These data support a fourth, phylogenetically independent origin of avian vocal learning in Procnias. Geographical variations in P. tricarunculata vocal behaviour are likely culturally evolved dialects.
Hybrid Modeling for Testing Intelligent Software for Lunar-Mars Closed Life Support
NASA Technical Reports Server (NTRS)
Malin, Jane T.; Nicholson, Leonard S. (Technical Monitor)
1999-01-01
Intelligent software is being developed for closed life support systems with biological components, for human exploration of the Moon and Mars. The intelligent software functions include planning/scheduling, reactive discrete control and sequencing, management of continuous control, and fault detection, diagnosis, and management of failures and errors. Four types of modeling information have been essential to system modeling and simulation to develop and test the software and to provide operational model-based what-if analyses: discrete component operational and failure modes; continuous dynamic performance within component modes, modeled qualitatively or quantitatively; configuration of flows and power among components in the system; and operations activities and scenarios. CONFIG, a multi-purpose discrete event simulation tool that integrates all four types of models for use throughout the engineering and operations life cycle, has been used to model components and systems involved in the production and transfer of oxygen and carbon dioxide in a plant-growth chamber and between that chamber and a habitation chamber with physicochemical systems for gas processing.
Advances in DNA sequencing technologies for high resolution HLA typing.
Cereb, Nezih; Kim, Hwa Ran; Ryu, Jaejun; Yang, Soo Young
2015-12-01
This communication describes our experience in large-scale G group-level high resolution HLA typing using three different DNA sequencing platforms - ABI 3730 xl, Illumina MiSeq and PacBio RS II. Recent advances in DNA sequencing technologies, so-called next generation sequencing (NGS), have brought breakthroughs in deciphering the genetic information in all living species at a large scale and at an affordable level. The NGS DNA indexing system allows sequencing multiple genes for large number of individuals in a single run. Our laboratory has adopted and used these technologies for HLA molecular testing services. We found that each sequencing technology has its own strengths and weaknesses, and their sequencing performances complement each other. HLA genes are highly complex and genotyping them is quite challenging. Using these three sequencing platforms, we were able to meet all requirements for G group-level high resolution and high volume HLA typing. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Rapid Multi-Locus Sequence Typing Using Microfluidic Biochips
2010-05-12
Sequence Types. The evolutionary history of all the B. cereus MLST concatenated Sequence Types (545 taxa, 2,394 nucleotide positions) was inferred using...the Neighbor-Joining method [28]. The bootstrap consensus tree inferred from 100 replicates was taken to represent the evolutionary history of the... Chlamydia (manuscript in preparation) and performed pilot studies on Staphylococcus aureus and Streptococcus pneumoniae (Data S4 and Text S2). Another potential
A Gauge-generalized Solution for Non-Keplerian Motion in the Frenet-Serret Frame
NASA Astrophysics Data System (ADS)
Garber, Darren D.
2009-05-01
The customary modeling of perturbed planetary and spacecraft motion as a continuous sequence of unperturbed two-body orbits (instantaneous ellipses) is conveniently assigned a physical interpretation through the Keplerian and Delaunay elements and complemented mathematically by the Lagrange-type equations which describe the evolution of these variables. If however the actual motion is very non-Keplerian (i.e. the perturbed orbit varies greatly from a two-body orbit), then its modeling by a sequence of conics is not necessarily optimal in terms of its mathematical description and its resulting physical interpretation. Since, in principle a curve of any type can be represented as a sequence of points from a family of curves of any other type (Efroimsky 2005), alternate non-conic curves can be utilized to better describe the perturbed non-Keplerian motion of the body both mathematically and with a physically relevant interpretation. Non-Keplerian motion exists in both celestial mechanics and astrodynamics as evident by the complex interactions within star clusters and also as the result of a spacecraft accelerating via ion propulsion, solar sails and electro-dynamic tethers. For these cases, the sequence of simple orbits to describe the motion is not based on conics, but instead a family of spirals. The selection of spirals as the underlying simple motion is supported by the fact that it is unnecessary to describe the motion in terms of instantaneous orbits tangent to the actual trajectory (Efroimsky 2002, Newman & Efroimsky 2003) and at times there is an advantage to deviate from osculation, in order to greatly simplify the resulting mathematics via gauge freedom (Efroimsky & Goldreich 2003, Slabinski 2003, Gurfil 2004). 
From these two principles, (1) spirals as instantaneous orbits, and (2) controlled deviation from osculation, new planetary equations are derived for new non-osculating elements in the Frenet-Serret frame with the gauge function as a measure of non-osculation.
Kim, Byoung-Jun; Kim, Ga-Na; Kim, Bo-Ram; Jeon, Che Ok; Jeong, Joseph; Lee, Seon Ho; Lim, Ji-Hun; Lee, Seung-Heon; Kim, Chang Ki; Kook, Yoon-Hoh; Kim, Bum-Joon
2017-10-01
Three rapidly growing mycobacterial strains, QIA-37 T , QIA-40 and QIA-41, were isolated from the lymph nodes of three separate Korean native cattle, Hanwoo (Bos taurus coreanae). These strains were previously shown to be phylogenetically distinct but closely related to Mycobacterium chelonae ATCC 35752 T by taxonomic approaches targeting three genes (16S rRNA, hsp65 and rpoB) and were further characterized using a polyphasic approach in this study. The 16S rRNA gene sequences of all three strains showed 99.7 % sequence similarity with that of the M. chelonae type strain. A multilocus sequence typing analysis targeting 10 housekeeping genes, including hsp65 and rpoB, revealed a phylogenetic cluster of these strains with M. chelonae. DNA-DNA hybridization values of 78.2 % between QIA-37 T and M. chelonae indicated that it belongs to M. chelonae but is a novel subspecies distinct from M. chelonae. Phylogenetic analysis based on whole-genome sequences revealed a 95.44±0.06 % average nucleotide identity (ANI) value with M. chelonae, slightly higher than the 95.0 % ANI criterion for determining a novel species. In addition, distinct phenotypic characteristics such as positive growth at 37 °C, at which temperature M. chelonae does not grow, further support the taxonomic status of these strains as representatives of a novel subspecies of M. chelonae. Therefore, we propose an emended description of Mycobacterium chelonae, and descriptions of M. chelonae subsp. chelonae subsp. nov. and M. chelonae subsp. bovis subsp. nov. are presented; strains ATCC 35752 T (=CCUG 47445 T =CIP 104535 T =DSM 43804 T =JCM 6388 T =NCTC 946 T ) and QIA-37 T (=KCTC 39630 T =JCM 30986 T ) are the type strains of the two novel subspecies.
Hosseinkhani, Farideh; Tammes Buirs, Matthias; Jabalameli, Fereshteh; Emaneini, Mohammad; van Leeuwen, Willem B
2018-06-06
Staphylococcus haemolyticus has emerged as a highly antimicrobial-resistant healthcare-associated pathogen, in particular for patients admitted to neonatal intensive care. The objective of this study was to study the nature of SCCmec types among MDR-SH strains isolated from paediatric patients. S. haemolyticus strains (n=60) were isolated from paediatric patients. Antibiotic resistance patterns were established using the disk agar diffusion and micro-broth dilution methods. SCCmec typing was performed using whole-genome sequencing (WGS) and an additional PCR analysis. All S. haemolyticus isolates demonstrated multidrug resistance. Using WGS, various novel mec types and combinations of SCCmec types were found, including a new composite island [SCCmec type V (Vd)+SCC cad/ars/cop] comprising 30 % of the strains. SCCmec type V was identified in 23 % of the isolates. A combination of the mecA gene enclosed by two copies of IS431 and absence of the mecRI and ccr genes was identified in 11 strains. In total, mecA regulatory genes were absent in all SH isolates used in this study. A high diversity of SCCmec elements with the prevalence of a new composite island was determined among MRSH strains. The structure of the composite island represented by MDR-SH strains in this study, in combination with the presence of a restriction-modification system type III, is described for the first time in this study. The presence of an 8 bp direct repeat (DR) and the sequences flanking the DR may support the integration of the mecA gene complex as a composite transposon (IS431-mecA-IS431) independently from recombinase genes.
Kavousi, Niloofar; Eng, Wilhelm Wei Han; Lee, Yin Peng; Tan, Lian Huat; Thuraisingham, Ravindran; Yule, Catherine M; Gan, Han Ming
2016-03-03
We report here the first high-quality draft genome sequence of Pasteurella multocida sequence type 128, which was isolated from the infected finger bone of an adult female who was bitten by a domestic dog. The draft genome will be a valuable addition to the scarce genomic resources available for P. multocida. Copyright © 2016 Kavousi et al.
Learning of goal-relevant and -irrelevant complex visual sequences in human V1.
Rosenthal, Clive R; Mallik, Indira; Caballero-Gaudes, Cesar; Sereno, Martin I; Soto, David
2018-06-12
Learning and memory are supported by a network involving the medial temporal lobe and linked neocortical regions. Emerging evidence indicates that primary visual cortex (i.e., V1) may contribute to recognition memory, but this has been tested only with a single visuospatial sequence as the target memorandum. The present study used functional magnetic resonance imaging to investigate whether human V1 can support the learning of multiple, concurrent complex visual sequences involving discontinuous (second-order) associations. Two peripheral, goal-irrelevant but structured sequences of orientated gratings appeared simultaneously in fixed locations of the right and left visual fields alongside a central, goal-relevant sequence that was in the focus of spatial attention. Pseudorandom sequences were introduced at multiple intervals during the presentation of the three structured visual sequences to provide an online measure of sequence-specific knowledge at each retinotopic location. We found that a network involving the precuneus and V1 was involved in learning the structured sequence presented at central fixation, whereas right V1 was modulated by repeated exposure to the concurrent structured sequence presented in the left visual field. The same result was not found in left V1. These results indicate for the first time that human V1 can support the learning of multiple concurrent sequences involving complex discontinuous inter-item associations, even peripheral sequences that are goal-irrelevant. Copyright © 2018. Published by Elsevier Inc.
Automated sequence analysis and editing software for HIV drug resistance testing.
Struck, Daniel; Wallis, Carole L; Denisov, Gennady; Lambert, Christine; Servais, Jean-Yves; Viana, Raquel V; Letsoalo, Esrom; Bronze, Michelle; Aitken, Sue C; Schuurman, Rob; Stevens, Wendy; Schmit, Jean Claude; Rinke de Wit, Tobias; Perez Bercoff, Danielle
2012-05-01
Access to antiretroviral treatment in resource-limited settings is inevitably paralleled by the emergence of HIV drug resistance. Monitoring treatment efficacy and HIV drug resistance testing are therefore of increasing importance in resource-limited settings. Yet low-cost technologies and procedures suited to the particular context and constraints of such settings are still lacking. The ART-A (Affordable Resistance Testing for Africa) consortium brought together public and private partners to address this issue. The objective was to develop automated sequence analysis and editing software to support high-throughput automated sequencing. The ART-A Software was designed to automatically process and edit ABI chromatograms or FASTA files from HIV-1 isolates. The ART-A Software performs the basecalling, assigns quality values, aligns query sequences against a set reference, infers a consensus sequence, identifies the HIV type and subtype, translates the nucleotide sequence to amino acids and reports insertions/deletions, premature stop codons, ambiguities and mixed calls. The results can be automatically exported to Excel to identify mutations. Automated analysis was compared to manual analysis using a panel of 1624 PR-RT sequences generated in 3 different laboratories. Discrepancies between manual and automated sequence analysis were 0.69% at the nucleotide level and 0.57% at the amino acid level (668,047 AA analyzed), and discordances at major resistance mutations were recorded in 62 cases (4.83% of differences, 0.04% of all AA) for PR and 171 cases (6.18% of differences, 0.03% of all AA) for RT. The ART-A Software is a time-sparing tool for pre-analyzing HIV and viral quasispecies sequences in high-throughput laboratories and highlighting positions requiring attention. Copyright © 2012 Elsevier B.V. All rights reserved.
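The comparison of manual and automated analysis described above reduces to a per-position mismatch rate between two versions of the same sequence. The following is a minimal sketch of that calculation, assuming the two sequences are already aligned and of equal length; the function name and the example sequences are hypothetical and are not taken from the ART-A Software.

```python
def discrepancy_rate(manual, automated):
    """Return the fraction of positions at which two aligned sequences differ.

    Both inputs are assumed to be pre-aligned strings of equal length
    (nucleotides or amino acids); this is an illustrative helper only.
    """
    if len(manual) != len(automated):
        raise ValueError("sequences must be aligned to the same length")
    if not manual:
        raise ValueError("sequences must not be empty")
    mismatches = sum(1 for a, b in zip(manual, automated) if a != b)
    return mismatches / len(manual)


# Hypothetical six-base example: one position differs (C vs T at index 4).
manual_call = "ATGCCA"
automated_call = "ATGCTA"
rate = discrepancy_rate(manual_call, automated_call)
print(f"{rate:.2%} of positions differ")
```

A real pipeline would also need a policy for ambiguity codes and mixed calls, which this sketch treats as ordinary mismatches.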
Mucosal and Cutaneous Human Papillomaviruses Detected in Raw Sewages
La Rosa, Giuseppina; Fratini, Marta; Accardi, Luisa; D'Oro, Graziana; Della Libera, Simonetta; Muscillo, Michele; Di Bonito, Paola
2013-01-01
Epitheliotropic viruses can find their way into sewage. The aim of the present study was to investigate the occurrence, distribution, and genetic diversity of Human Papillomaviruses (HPVs) in urban wastewaters. Sewage samples were collected from treatment plants distributed throughout Italy. The DNA extracted from these samples was analyzed by PCR using five PV-specific sets of primers targeting the L1 (GP5/GP6, MY09/MY11, FAP59/64, SKF/SKR) and E1 regions (PM-A/PM-B), according to the protocols previously validated for the detection of mucosal and cutaneous HPV genotypes. PCR products underwent sequencing analysis and the sequences were aligned to reference genomes from the Papillomavirus Episteme database. Phylogenetic analysis was then performed to assess the genetic relationships among the different sequences and between the sequences of the samples and those of the prototype strains. A broad spectrum of sequences related to mucosal and cutaneous HPV types was detected in 81% of the sewage samples analyzed. Surprisingly, sequences related to the anogenital HPV6 and 11 were detected in 19% of the samples, and sequences related to the “high risk” oncogenic HPV16 were identified in two samples. Sequences related to HPV9, HPV20, HPV25, HPV76, HPV80, HPV104, HPV110, HPV111, HPV120 and HPV145 beta Papillomaviruses were detected in 76% of the samples. In addition, similarity searches and phylogenetic analysis of some sequences suggest that they could belong to putative new genotypes of the beta genus. In this study, for the first time, the presence of HPV viruses strongly related to human cancer is reported in sewage samples. Our data increases the knowledge of HPV genomic diversity and suggests that virological analysis of urban sewage can provide key information useful in supporting epidemiological studies. PMID:23341898
Chander, Vishal; Chakravarti, Soumendu; Gupta, Vikas; Nandi, Sukdeb; Singh, Mithilesh; Badasara, Surendra Kumar; Sharma, Chhavi; Mittal, Mitesh; Dandapat, S; Gupta, V K
2016-12-01
Canine parvovirus-2 antigenic variants (CPV-2a, CPV-2b and CPV-2c), ubiquitously distributed worldwide in canine populations, cause severe fatal gastroenteritis. Antigenic typing of CPV-2 remains a prime focus of research groups worldwide in understanding the disease epidemiology and virus evolution. The present study was thus envisioned to provide a simple, sequencing-independent, rapid, robust, specific, user-friendly technique for detecting and typing presently circulating CPV-2 antigenic variants. An ARMS-PCR strategy was employed using specific primers for CPV-2a, CPV-2b and CPV-2c to differentiate these antigenic types. ARMS-PCR was initially optimized with reference positive controls in two steps: the first reaction was used to differentiate CPV-2a from CPV-2b/CPV-2c, and the second reaction was carried out with CPV-2c-specific primers to confirm the presence of CPV-2c. Initial validation of the ARMS-PCR was carried out with 24 sequenced samples and the results were matched with the sequencing results. The ARMS-PCR technique was further used to screen and type 90 suspected clinical samples. Fifteen randomly selected suspected clinical samples that were typed with this technique were sequenced. The results of ARMS-PCR and the sequencing matched exactly with each other. The developed technique has the potential to become a sequencing-independent method for simultaneous detection and typing of CPV-2 antigenic variants in veterinary disease diagnostic laboratories globally. Copyright © 2016 Elsevier B.V. All rights reserved.
Epstein-Barr Virus Sequence Variation—Biology and Disease
Tzellos, Stelios; Farrell, Paul J.
2012-01-01
Some key questions in Epstein-Barr virus (EBV) biology center on whether naturally occurring sequence differences in the virus affect infection or EBV associated diseases. Understanding the pattern of EBV sequence variation is also important for possible development of EBV vaccines. At present EBV isolates worldwide can be grouped into Type 1 and Type 2, a classification based on the EBNA2 gene sequence. Type 1 EBV is the most prevalent worldwide but Type 2 is common in parts of Africa. Type 1 transforms human B cells into lymphoblastoid cell lines much more efficiently than Type 2 EBV. Molecular mechanisms that may account for this difference in cell transformation are now becoming clearer. Advances in sequencing technology will greatly increase the amount of whole EBV genome data for EBV isolated from different parts of the world. Study of regional variation of EBV strains independent of the Type 1/Type 2 classification and systematic investigation of the relationship between viral strains, infection and disease will become possible. The recent discovery that specific mutation of the EBV EBNA3B gene may be linked to development of diffuse large B cell lymphoma illustrates the importance that mutations in the virus genome may have in infection and human disease. PMID:25436768
Johnson, Timothy J.; Liu, Cindy M.; Sokurenko, Evgeni; Kisiela, Dagmara I.; Paul, Sandip; Andersen, Paal; Johnson, James R.; Price, Lance B.
2016-01-01
We report here the complete genome sequence, including five plasmid sequences, of Escherichia coli sequence type 131 (ST131) strain JJ1887. The strain was isolated in 2007 in the United States from a patient with recurrent cystitis, whose caregiver sister died from urosepsis caused by a nearly identical strain. PMID:27174264
Optimization of Multilocus Sequence Analysis for Identification of Species in the Genus Vibrio
Gabriel, Michael W.; Matsui, George Y.; Friedman, Robert
2014-01-01
Multilocus sequence analysis (MLSA) is an important method for identification of taxa that are not well differentiated by 16S rRNA gene sequences alone. In this procedure, concatenated sequences of selected genes are constructed and then analyzed. The effects that the number and the order of genes used in MLSA have on reconstruction of phylogenetic relationships were examined. The recA, rpoA, gapA, 16S rRNA gene, gyrB, and ftsZ sequences from 56 species of the genus Vibrio were used to construct molecular phylogenies, and these were evaluated individually and using various gene combinations. Phylogenies from two-gene sequences employing recA and rpoA in both possible gene orders were different. The addition of the gapA gene sequence, producing all six possible concatenated sequences, reduced the differences in phylogenies to degrees of statistical (bootstrap) support for some nodes. The overall statistical support for the phylogenetic tree, assayed on the basis of a reliability score (calculated from the number of nodes having bootstrap values of ≥80 divided by the total number of nodes) increased with increasing numbers of genes used, up to a maximum of four. No further improvement was observed from addition of the fifth gene sequence (ftsZ), and addition of the sixth gene (gyrB) resulted in lower proportions of strongly supported nodes. Reductions in the numbers of strongly supported nodes were also observed when maximum parsimony was employed for tree construction. Use of a small number of gene sequences in MLSA resulted in accurate identification of Vibrio species. PMID:24951781
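The reliability score defined in the abstract above (nodes with bootstrap values of ≥80 divided by the total number of nodes) can be sketched in a few lines. This is an illustration under stated assumptions: the function name is hypothetical, the bootstrap values are invented for the example, and the input is assumed to be a flat list of per-node support values already extracted from a tree.

```python
def reliability_score(bootstrap_values, threshold=80):
    """Fraction of nodes whose bootstrap support is at least `threshold`.

    `bootstrap_values` is assumed to be a list of per-node bootstrap
    percentages; extracting them from a tree file is outside this sketch.
    """
    if not bootstrap_values:
        raise ValueError("at least one node is required")
    strong = sum(1 for value in bootstrap_values if value >= threshold)
    return strong / len(bootstrap_values)


# Invented supports for a five-node tree: three nodes reach the threshold.
example_supports = [100, 95, 80, 62, 41]
print(reliability_score(example_supports))  # 3 / 5 -> 0.6
```

Comparing this score across trees built from different gene combinations mirrors the comparison reported in the abstract, although the scores are only comparable when the trees cover the same taxa.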
Naz, Iffat; Hodgson, Douglas; Smith, Ann; Marchesi, Julian; Ahmed, Safia; Avignone-Rossa, Claudio
2016-01-01
This study investigates the microbial community composition in the biofilms grown on two different support media in fixed biofilm reactors for aerobic wastewater treatment, using next generation sequencing (NGS) technology. The chemical composition of the new type of support medium (TDR) was found to be quite different from the conventionally used support medium (stone). The analysis of 16S rRNA gene fragments recovered from the laboratory scale biofilm system show that biofilm support media and temperature conditions influence bacterial community structure and composition. Greater bacterial diversity was observed under each condition, primarily due to the large number of sequences available and sustenance of rare species. There were 6 phyla found, with the highest relative abundance shown by the phylum Proteobacteria (52.71%) followed by Bacteroidetes (33.33%), Actinobacteria (4.65%), Firmicutes, Verrucomicrobia (3.1%) and Chloroflex (>1%). The dataset showed 17 genera of bacterial populations to be commonly shared under all conditions, suggesting the presence of a core microbial community in the biofilms for wastewater treatment. However, some genera in the biofilms on TDR were observed in high proportions, which may be attributed to its chemical composition, explaining the improved level of wastewater treatment. The findings show that the structure of microbial communities in biofilm systems for wastewater treatment is affected by the properties of support matrix. PMID:28018581
Lentzea soli sp. nov., an actinomycete isolated from soil.
Li, Dongmei; Zheng, Weiwei; Zhao, Junwei; Han, Liyuan; Zhao, Xueli; Jiang, Hao; Wang, Xiangjing; Xiang, Wensheng
2018-05-01
A novel actinobacterium, designated strain NEAU-LZC 7 T , was isolated from soil collected from Mount Song and characterized using a polyphasic approach. Phylogenetic analysis based on 16S rRNA gene sequence indicated that strain NEAU-LZC 7 T belonged to the genus Lentzea, with highest sequence similarity to Lentzea violacea JCM 10975 T (98.1 %). Morphological and chemotaxonomic characteristics of the strain also supported its assignment to the genus Lentzea. However, DNA-DNA relatedness, physiological and biochemical data showed that strain NEAU-LZC 7 T could be distinguished from its closest relative. Therefore, strain NEAU-LZC 7 T represents a novel species of the genus Lentzea, for which the name Lentzea soli sp. nov. is proposed, with NEAU-LZC 7 T (=CCTCC AA 2017027 T =JCM 32384 T ) as the type strain.
Cancian, Laila; Bosshard, Rachel; Lucchesi, Walter; Karstegl, Claudio Elgueta; Farrell, Paul J.
2011-01-01
Type 1 Epstein-Barr virus (EBV) strains immortalize B lymphocytes in vitro much more efficiently than type 2 EBV, a difference previously mapped to the EBNA-2 locus. Here we demonstrate that the greater transforming activity of type 1 EBV correlates with a stronger and more rapid induction of the viral oncogene LMP-1 and the cell gene CXCR7 (which are both required for proliferation of EBV-LCLs) during infection of primary B cells with recombinant viruses. Surprisingly, although the major sequence differences between type 1 and type 2 EBNA-2 lie in N-terminal parts of the protein, the superior ability of type 1 EBNA-2 to induce proliferation of EBV-infected lymphoblasts is mostly determined by the C-terminus of EBNA-2. Substitution of the C-terminus of type 1 EBNA-2 into the type 2 protein is sufficient to confer a type 1 growth phenotype and type 1 expression levels of LMP-1 and CXCR7 in an EREB2.5 cell growth assay. Within this region, the RG, CR7 and TAD domains are the minimum type 1 sequences required. Sequencing the C-terminus of EBNA-2 from additional EBV isolates showed high sequence identity within type 1 isolates or within type 2 isolates, indicating that the functional differences mapped are typical of EBV type sequences. The results indicate that the C-terminus of EBNA-2 accounts for the greater ability of type 1 EBV to promote B cell proliferation, through mechanisms that include higher induction of genes (LMP-1 and CXCR7) required for proliferation and survival of EBV-LCLs. PMID:21857817
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons
Pagano, Johanna F.B.; Ensink, Wim A.; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P.; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J.; Dekker, Rob J.
2017-01-01
5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. PMID:28003516
Tedim, Ana P.; Lanza, Val F.; Manrique, Marina; Pareja, Eduardo; Ruiz-Garbajosa, Patricia; Cantón, Rafael; Baquero, Fernando; Tobes, Raquel
2017-01-01
ABSTRACT The emergence of nosocomial infections by multidrug-resistant sequence type 117 (ST117) Enterococcus faecium has been reported in several European countries. ST117 has been detected in Spanish hospitals as one of the main causes of bloodstream infections. We analyzed genome variations of ST117 strains isolated in Madrid and describe the first ST117 closed genome sequences. PMID:28360174
Sequencing artifacts in the type A influenza databases and attempts to correct them.
Suarez, David L; Chester, Nikki; Hatfield, Jason
2014-07-01
There are over 276 000 influenza gene sequences in public databases, with the quality of the sequences determined by the contributor. As part of a high school class project, influenza sequences with possible errors were identified in the public databases based on the size of the gene being longer than expected, with the hypothesis that these sequences would have an error. Students contacted sequence submitters, alerting them to the possible sequence issue(s), and requested that the suspect sequence(s) be corrected as appropriate. Type A influenza viruses were screened, and gene segments longer than the accepted size were identified for further analysis. Attention was placed on sequences with additional nucleotides upstream or downstream of the highly conserved non-coding ends of the viral segments. A total of 1081 sequences were identified that met this criterion. Three types of errors were commonly observed: non-influenza primer sequence was not removed from the sequence; PCR product was cloned and plasmid sequence was included in the sequence; and Taq polymerase added an adenine at the end of the PCR product. Internal insertions of nucleotide sequence were also commonly observed, but in many cases it was unclear if the sequence was correct or actually contained an error. A total of 215 sequences, or 22.8% of the suspect sequences, were corrected in the public databases in the first year of the student project. Unfortunately 138 additional sequences with possible errors were added to the databases in the second year. Additional awareness of the need for data integrity of sequences submitted to public databases is needed to fully reap the benefits of these large data sets. © 2014 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.
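The length-based screen described above can be sketched as a simple filter that flags any record longer than the accepted size of its segment. This is a hedged illustration, not the procedure used in the study: the function name, the record layout and the expected length in the example are assumptions made for the sketch, and the sequences are synthetic.

```python
def flag_overlong(records, expected_length):
    """Return (record_id, segment, extra_bases) for records that are too long.

    `records` is assumed to be an iterable of (record_id, segment, sequence)
    tuples and `expected_length` a dict mapping segment name to the accepted
    full length; both layouts are hypothetical.
    """
    flagged = []
    for record_id, segment, sequence in records:
        extra = len(sequence) - expected_length[segment]
        if extra > 0:
            flagged.append((record_id, segment, extra))
    return flagged


# Illustrative accepted length (an assumption, not a value from the abstract).
expected = {"NS": 890}
records = [
    ("rec1", "NS", "A" * 890),  # exactly the accepted length
    ("rec2", "NS", "A" * 891),  # one extra base, e.g. a terminal adenine
]
print(flag_overlong(records, expected))  # [('rec2', 'NS', 1)]
```

A length check only nominates candidates; as the abstract notes, deciding whether the extra bases are primer, plasmid or a genuine insertion still requires inspecting the sequence ends.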
Pushing the Limits of Imagination: Mental Practice for Learning Sequences
ERIC Educational Resources Information Center
Wohldmann, Erica L.; Healy, Alice F.; Bourne, Lyle E., Jr.
2007-01-01
In 2 experiments, the efficacy of motor imagery for learning to type number sequences was examined. Adults practiced typing 4-digit numbers. Then, during subsequent training, they either typed in the same or a different location, imagined typing, merely looked at each number, or performed an irrelevant task. Repetition priming (faster responses…
Analysis of Ribosome Inactivating Protein (RIP): A Bioinformatics Approach
NASA Astrophysics Data System (ADS)
Jothi, G. Edward Gnana; Majilla, G. Sahaya Jose; Subhashini, D.; Deivasigamani, B.
2012-10-01
In spite of the medical advances in recent years, the world is in need of different sources to encounter certain health issues. Ribosome Inactivating Proteins (RIPs) were found to be one among them. In order to get easy access about RIPs, there is a need to analyse RIPs towards constructing a database on RIPs. Also, multiple sequence alignment was done towards screening for homologues of significant RIPs from rare sources against RIPs from easily available sources in terms of similarity. Protein sequences were retrieved from SWISS-PROT and are further analysed using pairwise and multiple sequence alignment. Analysis shows that 151 RIPs have been characterized to date. Amongst them, there are 87 type I, 37 type II, 1 type III and 25 unknown RIPs. The sequence length information of various RIPs about the availability of full or partial sequence was also found. The multiple sequence alignment of 37 type I RIP using the online server Multalin indicates the presence of 20 conserved residues. Pairwise alignment and multiple sequence alignment of certain selected RIPs in two groups namely Group I and Group II were carried out and the consensus level was found to be 98%, 98% and 90% respectively.
Estep, Anne L; Tidyman, William E; Teitell, Michael A; Cotter, Philip D; Rauen, Katherine A
2006-01-01
Costello syndrome (CS) is a complex developmental disorder involving characteristic craniofacial features, failure to thrive, developmental delay, cardiac and skeletal anomalies, and a predisposition to develop neoplasia. Based on similarities with other cancer syndromes, we previously hypothesized that CS is likely due to activation of signal transduction through the Ras/MAPK pathway [Tartaglia et al., 2003]. In this study, the HRAS coding region was sequenced for mutations in a large, well-characterized cohort of 36 CS patients. Heterogeneous missense point mutations predicting an amino acid substitution were identified in 33/36 (92%) patients. The majority (91%) had a 34G --> A transition in codon 12. Less frequent mutations included 35G --> C (codon 12) and 37G --> T (codon 13). Parental samples did not have an HRAS mutation supporting the hypothesis of de novo heterogeneous mutations. There is phenotypic variability among patients with a 34G --> A transition. The most consistent features included characteristic facies and skin, failure to thrive, developmental delay, musculoskeletal abnormalities, visual impairment, cardiac abnormalities, and generalized hyperpigmentation. The two patients with 35G --> C had cardiac arrhythmias whereas one patient with a 37G --> T transversion had an enlarged aortic root. Of the patients with a clinical diagnosis of CS, neoplasia was the most consistent phenotypic feature for predicating an HRAS mutation. To gain an understanding of the relationship between constitutional HRAS mutations and malignancy, HRAS was sequenced in an advanced biphasic rhabdomyosarcoma/fibrosarcoma from an individual with a 34G --> A mutation. Loss of the wild-type HRAS allele was observed, suggesting tumorigenesis in CS patients is accompanied by additional somatic changes affecting HRAS. Finally, due to phenotypic overlap between CS and cardio-facio-cutaneous (CFC) syndromes, the HRAS coding region was sequenced in a well-characterized CFC cohort. 
No mutations were found, which supports a distinct genetic etiology between CS and CFC syndromes. (c) 2005 Wiley-Liss, Inc.
NASA Astrophysics Data System (ADS)
Esteban, Pere; Beck, Christoph; Philipp, Andreas
2010-05-01
Using data associated with accidents or damages caused by snow avalanches over the eastern Pyrenees (Andorra and Catalonia) several atmospheric circulation type catalogues have been obtained. For this purpose, different circulation type classification methods based on Principal Component Analysis (T-mode and S-mode using the extreme scores) and on optimization procedures (Improved K-means and SANDRA) were applied. Considering the characteristics of the phenomena studied, not only single day circulation patterns were taken into account but also sequences of circulation types of varying length. Thus different classifications with different numbers of types and for different sequence lengths were obtained using the different classification methods. Simple between type variability, within type variability, and outlier detection procedures have been applied for selecting the best result concerning snow avalanches type classifications. Furthermore, days without occurrence of the hazards were also related to the avalanche centroids using pattern-correlations, facilitating the calculation of the anomalies between hazardous and non-hazardous days, and also frequencies of occurrence of hazardous events for each circulation type. Finally, the catalogues statistically considered the best results are evaluated using the avalanche forecaster expert knowledge. Consistent explanation of snow avalanches occurrence by means of circulation sequences is obtained, but always considering results from classifications with different sequence length. This work has been developed in the framework of the COST Action 733 (Harmonisation and Applications of Weather Type Classifications for European regions).
Structure and function of neonatal social communication in a genetic mouse model of autism.
Takahashi, T; Okabe, S; Broin, P Ó; Nishi, A; Ye, K; Beckert, M V; Izumi, T; Machida, A; Kang, G; Abe, S; Pena, J L; Golden, A; Kikusui, T; Hiroi, N
2016-09-01
A critical step toward understanding autism spectrum disorder (ASD) is to identify both genetic and environmental risk factors. A number of rare copy number variants (CNVs) have emerged as robust genetic risk factors for ASD, but not all CNV carriers exhibit ASD and the severity of ASD symptoms varies among CNV carriers. Although evidence exists that various environmental factors modulate symptomatic severity, the precise mechanisms by which these factors determine the ultimate severity of ASD are still poorly understood. Here, using a mouse heterozygous for Tbx1 (a gene encoded in 22q11.2 CNV), we demonstrate that a genetically triggered neonatal phenotype in vocalization generates a negative environmental loop in pup-mother social communication. Wild-type pups used individually diverse sequences of simple and complicated call types, but heterozygous pups used individually invariable call sequences with less complicated call types. When played back, representative wild-type call sequences elicited maternal approach, but heterozygous call sequences were ineffective. When the representative wild-type call sequences were randomized, they were ineffective in eliciting vigorous maternal approach behavior. These data demonstrate that an ASD risk gene alters the neonatal call sequence of its carriers and this pup phenotype in turn diminishes maternal care through atypical social communication. Thus, an ASD risk gene induces, through atypical neonatal call sequences, less than optimal maternal care as a negative neonatal environmental factor.
Structure and function of neonatal social communication in a genetic mouse model of autism
Takahashi, Tomohisa; Okabe, Shota; Ó Broin, Pilib; Nishi, Akira; Ye, Kenny; Beckert, Michael V.; Izumi, Takeshi; Machida, Akihiro; Kang, Gina; Abe, Seiji; Pena, Jose L.; Golden, Aaron; Kikusui, Takefumi; Hiroi, Noboru
2015-01-01
A critical step toward understanding autism spectrum disorder (ASD) is to identify both genetic and environmental risk factors. A number of rare copy number variants (CNVs) have emerged as robust genetic risk factors for ASD, but not all CNV carriers exhibit ASD and the severity of ASD symptoms varies among CNV carriers. Although evidence exists that various environmental factors modulate symptomatic severity, the precise mechanisms by which these factors determine the ultimate severity of ASD are still poorly understood. Here, using a mouse heterozygous for Tbx1 (a gene encoded in 22q11.2 CNV), we demonstrate that a genetically-triggered neonatal phenotype in vocalization generates a negative environmental loop in pup-mother social communication. Wild-type pups used individually diverse sequences of simple and complicated call types, but heterozygous pups used individually invariable call sequences with less complicated call types. When played back, representative wild-type call sequences elicited maternal approach, but heterozygous call sequences were ineffective. When the representative wild-type call sequences were randomized, they were ineffective in eliciting vigorous maternal approach behavior. These data demonstrate that an ASD risk gene alters the neonatal call sequence of its carriers and this pup phenotype in turn diminishes maternal care through atypical social communication. Thus, an ASD risk gene induces, through atypical neonatal call sequences, less than optimal maternal care as a negative neonatal environmental factor. PMID:26666205
O'Leary, Nuala A; Wright, Mathew W; Brister, J Rodney; Ciufo, Stacy; Haddad, Diana; McVeigh, Rich; Rajput, Bhanu; Robbertse, Barbara; Smith-White, Brian; Ako-Adjei, Danso; Astashyn, Alexander; Badretdin, Azat; Bao, Yiming; Blinkova, Olga; Brover, Vyacheslav; Chetvernin, Vyacheslav; Choi, Jinna; Cox, Eric; Ermolaeva, Olga; Farrell, Catherine M; Goldfarb, Tamara; Gupta, Tripti; Haft, Daniel; Hatcher, Eneida; Hlavina, Wratko; Joardar, Vinita S; Kodali, Vamsi K; Li, Wenjun; Maglott, Donna; Masterson, Patrick; McGarvey, Kelly M; Murphy, Michael R; O'Neill, Kathleen; Pujar, Shashikant; Rangwala, Sanjida H; Rausch, Daniel; Riddick, Lillian D; Schoch, Conrad; Shkeda, Andrei; Storz, Susan S; Sun, Hanzhen; Thibaud-Nissen, Francoise; Tolstoy, Igor; Tully, Raymond E; Vatsan, Anjana R; Wallin, Craig; Webb, David; Wu, Wendy; Landrum, Melissa J; Kimchi, Avi; Tatusova, Tatiana; DiCuccio, Michael; Kitts, Paul; Murphy, Terence D; Pruitt, Kim D
2016-01-04
The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Blake, Jonathon; Riddell, Andrew; Theiss, Susanne; Gonzalez, Alexis Perez; Haase, Bettina; Jauch, Anna; Janssen, Johannes W. G.; Ibberson, David; Pavlinic, Dinko; Moog, Ute; Benes, Vladimir; Runz, Heiko
2014-01-01
Balanced chromosome abnormalities (BCAs) occur at a high frequency in healthy and diseased individuals, but cost-efficient strategies to identify BCAs and evaluate whether they contribute to a phenotype have not yet become widespread. Here we apply genome-wide mate-pair library sequencing to characterize structural variation in a patient with unclear neurodevelopmental disease (NDD) and complex de novo BCAs at the karyotype level. Nucleotide-level characterization of the clinically described BCA breakpoints revealed disruption of at least three NDD candidate genes (LINC00299, NUP205, PSMD14) that gave rise to abnormal mRNAs and could be assumed as disease-causing. However, unbiased genome-wide analysis of the sequencing data for cryptic structural variation was key to reveal an additional submicroscopic inversion that truncates the schizophrenia- and bipolar disorder-associated brain transcription factor ZNF804A as an equally likely NDD-driving gene. Deep sequencing of fluorescent-sorted wild-type and derivative chromosomes confirmed the clinically undetected BCA. Moreover, deep sequencing further validated a high accuracy of mate-pair library sequencing to detect structural variants larger than 10 kB, proposing that this approach is powerful for clinical-grade genome-wide structural variant detection. Our study supports previous evidence for a role of ZNF804A in NDD and highlights the need for a more comprehensive assessment of structural variation in karyotypically abnormal individuals and patients with neurocognitive disease to avoid diagnostic deception. PMID:24625750
Molecular phylogeography of the Andean alpine plant, Gunnera magellanica
NASA Astrophysics Data System (ADS)
Shimizu, M.; Fujii, N.; Ito, M.; Asakawa, T.; Nishida, H.; Suyama, C.; Ueda, K.
2015-12-01
To clarify the evolutionary history of Gunnera magellanica (Gunneraceae), an alpine plant of the Andes mountains, we performed molecular phylogeographic analyses based on the sequences of an internal transcribed spacer (ITS) of nuclear ribosomal DNA and four non-coding regions (trnH-psbA, trnL-trnF, atpB-rbcL, rpl16 intron) of chloroplast DNA. We investigated 3, 4, 4 and 11 populations in Ecuador, Bolivia, Argentina, and Chile, respectively, and detected six ITS genotypes (Types A-F) in G. magellanica. Five genotypes (Types A-E) were observed in the northern Andes population (Ecuador and Bolivia); only one ITS genotype (Type F) was observed in the southern Andes population (Chile and Argentina). Phylogenetic analyses showed that the ITS genotypes of the northern and southern Andes populations form different clades with high bootstrap probability. Furthermore, network analysis, analysis of molecular variance, and spatial analysis of molecular variance showed that there were two major clusters (the northern and southern Andes populations) in this species. Furthermore, in chloroplast DNA analysis, three major clades (northern Andes, Chillan, and southern Andes) were inferred from phylogenetic analyses using four non-coding regions, a finding that was supported by the above three types of analysis. The Chillan clade is the northernmost population in the southern Andes populations. With the exception of the Chillan clade (Chillan population), results of nuclear DNA and chloroplast DNA analyses were consistent. Both markers showed that the northern and southern Andes populations of G. magellanica were genetically different from each other. This type of clear phylogeographical structure was supported by PERMUT analysis according to Pons & Petit (1995, 1996). Moreover, based on our preliminary estimation that is based on the ITS sequences, the northern and southern Andes clades diverged ~0.63-3 million years ago, during a period of upheaval in the Andes.
This suggests that the populations of G. magellanica that were distributed along the Andes have been divided into the two local populations of the northern and southern Andes during the uplift of the Andes.
Peirano, Gisele; van der Bij, Akke K; Gregson, Daniel B; Pitout, Johann D D
2012-02-01
A study was designed to assess the importance of sequence types among extended-spectrum β-lactamase (ESBL)-producing Escherichia coli isolates causing bacteremia over an 11-year period (2000 to 2010) in a centralized Canadian region. A total of 197 patients with incident infections were identified; the majority presented with community-onset urosepsis, with a significant increase in the prevalence of ESBL-producing E. coli during the later part of the study. The majority of E. coli isolates produced either CTX-M-15 or CTX-M-14. We identified 7 different major sequence types among 91% of isolates (i.e., the ST10 clonal complex, ST38, ST131, ST315, ST393, ST405, and ST648) and provided insight into their clinical and molecular characteristics. ST38 was the most antimicrobial-susceptible sequence type and predominated during 2000 to 2004 but disappeared after 2008. ST131 was the most antimicrobial-resistant sequence type, and the influx of a single pulsotype of this sequence type was responsible for the significant increase of ESBL-producing E. coli strains since 2007. During 2010, 49/63 (78%) of the ESBL-producing E. coli isolates belonged to ST131, and this sequence type had established itself as a major drug-resistant pathogen in Calgary, Alberta, Canada, posing an important new public health threat within our region. We urgently need well-designed epidemiological and molecular studies to understand the dynamics of transmission, risk factors, and reservoirs for E. coli ST131. This will provide insight into the emergence and spread of this multiresistant sequence type.
Biophysical and structural considerations for protein sequence evolution
2011-01-01
Background Protein sequence evolution is constrained by the biophysics of folding and function, causing interdependence between interacting sites in the sequence. However, current site-independent models of sequence evolutions do not take this into account. Recent attempts to integrate the influence of structure and biophysics into phylogenetic models via statistical/informational approaches have not resulted in expected improvements in model performance. This suggests that further innovations are needed for progress in this field. Results Here we develop a coarse-grained physics-based model of protein folding and binding function, and compare it to a popular informational model. We find that both models violate the assumption of the native sequence being close to a thermodynamic optimum, causing directional selection away from the native state. Sampling and simulation show that the physics-based model is more specific for fold-defining interactions that vary less among residue type. The informational model diffuses further in sequence space with fewer barriers and tends to provide less support for an invariant sites model, although amino acid substitutions are generally conservative. Both approaches produce sequences with natural features like dN/dS < 1 and gamma-distributed rates across sites. Conclusions Simple coarse-grained models of protein folding can describe some natural features of evolving proteins but are currently not accurate enough to use in evolutionary inference. This is partly due to improper packing of the hydrophobic core. We suggest possible improvements on the representation of structure, folding energy, and binding function, as regards both native and non-native conformations, and describe a large number of possible applications for such a model. PMID:22171550
DOE Office of Scientific and Technical Information (OSTI.GOV)
Korfhagen, T.R.; Glasser, S.W.; Wert, S.E.
1990-08-01
Pulmonary surfactant is produced in late gestation by developing type II epithelial cells lining the alveolar epithelium of the lung. Lack of surfactant at birth is associated with respiratory distress syndrome in premature infants. Surfactant protein C (SP-C) is a highly hydrophobic peptide isolated from pulmonary tissue that enhances the biophysical activity of surfactant phospholipids. Like surfactant phospholipid, SP-C is produced by epithelial cells in the distal respiratory epithelium, and its expression increases during the latter part of gestation. A chimeric gene containing 3.6 kilobases of the promoter and 5′-flanking sequences of the human SP-C gene was used to express diphtheria toxin A. The SP-C-diphtheria toxin A fusion gene was injected into fertilized mouse eggs to produce transgenic mice. Affected mice developed respiratory failure in the immediate postnatal period. Morphologic analysis of lungs from affected pups showed variable but severe cellular injury confined to pulmonary tissues. Ultrastructural changes consistent with cell death and injury were prominent in the distal respiratory epithelium. Proximal components of the tracheobronchial tree were not severely affected. Transgenic animals were of normal size at birth, and structural abnormalities were not detected in nonpulmonary tissues. Lung-specific diphtheria toxin A expression controlled by the human SP-C gene injured type II epithelial cells and caused extensive necrosis of the distal respiratory epithelium. The absence of type I epithelial cells in the most severely affected transgenic animals supports the concept that developing type II cells serve as precursors to type I epithelial cells.
AN IMPORTED CASE OF ACUTE MELIOIDOSIS CAUSED BY ST881 BURKHOLDERIA PSEUDOMALLEI.
Zong, Zhiyong; Wang, Xiaohui; Deng, Yiyun
2016-03-01
A previously healthy Chinese male working in Malaysia returned to China with high fever. A blood culture showed Burkholderia pseudomallei strain WCBP1. This isolate was sequenced, showing sequence type ST881, which appears to be present in Malaysia. WCBP1 had unusual susceptibility to aminoglycosides and harbored the Yersinia-like fimbrial gene cluster for virulence. The patient's condition deteriorated rapidly but he recovered after receiving meropenem and intensive care support. Melioidosis is a potential problem among Chinese immigrant workers, with strains new to China being identified.
AgdbNet – antigen sequence database software for bacterial typing
Jolley, Keith A; Maiden, Martin CJ
2006-01-01
Background Bacterial typing schemes based on the sequences of genes encoding surface antigens require databases that provide a uniform, curated, and widely accepted nomenclature of the variants identified. Due to the differences in typing schemes, imposed by the diversity of genes targeted, creating these databases has typically required the writing of one-off code to link the database to a web interface. Here we describe agdbNet, widely applicable web database software that facilitates simultaneous BLAST querying of multiple loci using either nucleotide or peptide sequences. Results Databases are described by XML files that are parsed by a Perl CGI script. Each database can have any number of loci, which may be defined by nucleotide and/or peptide sequences. The software is currently in use on at least five public databases for the typing of Neisseria meningitidis, Campylobacter jejuni and Streptococcus equi and can be set up to query internal isolate tables or suitably-configured external isolate databases, such as those used for multilocus sequence typing. The style of the resulting website can be fully configured by modifying stylesheets and through the use of customised header and footer files that surround the output of the script. Conclusion The software provides a rapid means of setting up customised Internet antigen sequence databases. The flexible configuration options enable typing schemes with differing requirements to be accommodated. PMID:16790057
RNA Editome in Rhesus Macaque Shaped by Purifying Selection
Yang, Xin-Zhuang; Tan, Bertrand Chin-Ming; Fang, Huaying; Liu, Chu-Jun; Shi, Mingming; Ye, Zhi-Qiang; Zhang, Yong E.; Deng, Minghua; Zhang, Xiuqin; Li, Chuan-Yun
2014-01-01
Understanding of the RNA editing process has been broadened considerably by the next generation sequencing technology; however, several issues regarding this regulatory step remain unresolved – the strategies to accurately delineate the editome, the mechanism by which its profile is maintained, and its evolutionary and functional relevance. Here we report an accurate and quantitative profile of the RNA editome for rhesus macaque, a close relative of human. By combining genome and transcriptome sequencing of multiple tissues from the same animal, we identified 31,250 editing sites, of which 99.8% are A-to-G transitions. We verified 96.6% of editing sites in coding regions and 97.5% of randomly selected sites in non-coding regions, as well as the corresponding levels of editing by multiple independent means, demonstrating the feasibility of our experimental paradigm. Several lines of evidence supported the notion that the adenosine deamination is associated with the macaque editome – A-to-G editing sites were flanked by sequences with the attributes of ADAR substrates, and both the sequence context and the expression profile of ADARs are relevant factors in determining the quantitative variance of RNA editing across different sites and tissue types. In support of the functional relevance of some of these editing sites, substitution valley of decreased divergence was detected around the editing site, suggesting the evolutionary constraint in maintaining some of these editing substrates with their double-stranded structure. These findings thus complement the “continuous probing” model that postulates tinkering-based origination of a small proportion of functional editing sites. In conclusion, the macaque editome reported here highlights RNA editing as a widespread functional regulation in primate evolution, and provides an informative framework for further understanding RNA editing in human. PMID:24722121
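As a rough illustration of how a figure such as "99.8% are A-to-G transitions" can be computed once genomic and transcript base calls have been paired per site, the sketch below tallies the DNA-to-RNA mismatch spectrum. It is only a toy under stated assumptions: the input format (a list of `(dna_base, rna_base)` pairs), the function names, and the example data are all hypothetical, and strand is ignored (on the antisense strand the same edit would appear as T-to-C).

```python
from collections import Counter


def mismatch_spectrum(sites):
    """Count each DNA>RNA substitution type among (dna_base, rna_base) pairs,
    skipping sites where the two bases agree."""
    return Counter(f"{dna}>{rna}" for dna, rna in sites if dna != rna)


def a_to_g_fraction(sites):
    """Fraction of mismatching sites that are A-to-G; 0.0 if none mismatch."""
    spectrum = mismatch_spectrum(sites)
    total = sum(spectrum.values())
    return spectrum["A>G"] / total if total else 0.0


# Hypothetical toy input: three A>G sites, one C>T site, one matching site.
toy_sites = [("A", "G")] * 3 + [("C", "T")] + [("A", "A")]
print(mismatch_spectrum(toy_sites), a_to_g_fraction(toy_sites))
```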
Ma, Zhaoxu; Zhao, Shanshan; Cao, Tingting; Liu, Chongxi; Huang, Ying; Gao, Yuhang; Yan, Kai; Xiang, Wensheng; Wang, Xiangjing
2016-12-01
A novel actinobacterium, designated strain NEAU-QY3T, was isolated from the leaves of Sonchus oleraceus L. and examined using a polyphasic taxonomic approach. The organism formed single spores with smooth surface on substrate mycelia. Phylogenetic analysis based on the 16S rRNA gene sequence indicated that the strain had a close association with the genus Verrucosispora and shared the highest sequence similarity with Verrucosispora qiuiae RtIII47T (99.17 %), an association that was supported by a bootstrap value of 94 % in the neighbour-joining tree and also recovered with the maximum-likelihood algorithm. The strain also showed high 16S rRNA gene sequence similarities to Xiangella phaseoli NEAU-J5T (98.78 %), Jishengella endophytica 202201T (98.51 %), Micromonospora eburnea LK2-10T (98.28 %), Verrucosispora lutea YIM 013T (98.23 %) and Salinispora pacifica CNR-114T (98.23 %). Furthermore, phylogenetic analysis based on the gyrB gene sequences supported the conclusion that strain NEAU-QY3T should be assigned to the genus Verrucosispora. However, the DNA-DNA hybridization relatedness values between strain NEAU-QY3T and V. qiuiae RtIII47T and V. lutea YIM 013T were below 70 %. With reference to phenotypic characteristics, phylogenetic data and DNA-DNA hybridization results, strain NEAU-QY3T was readily distinguished from its most closely related strains and classified as a new species, for which the name Verrucosispora sonchi sp. nov. is proposed. The type strain is NEAU-QY3T (=CGMCC 4.7312T=DSM 101530T).
Couto, Natacha; Chlebowicz, Monika A; Raangs, Erwin C; Friedrich, Alex W; Rossen, John W
2018-04-05
The emergence of nosocomial infections by multidrug-resistant Staphylococcus haemolyticus isolates has been reported in several European countries. Here, we report the first two complete genome sequences of S. haemolyticus sequence type 25 (ST25) isolates 83131A and 83131B. Both isolates were isolated from the same clinical sample and were first identified through shotgun metagenomics. Copyright © 2018 Couto et al.
Yuan, Xiaoli; Morano, Lisa; Bromley, Robin; Spring-Pearson, Senanu; Stouthamer, Richard; Nunney, Leonard
2010-06-01
Using a modified multilocus sequence typing (MLST) scheme for the bacterial plant pathogen Xylella fastidiosa based on the same seven housekeeping genes employed in a previously published MLST, we studied the genetic diversity of two subspecies, X. fastidiosa subsp. fastidiosa and X. fastidiosa subsp. sandyi, which cause Pierce's disease and oleander leaf scorch, respectively. Typing of 85 U.S. isolates (plus one from northern Mexico) of X. fastidiosa subsp. fastidiosa from 15 different plant hosts and 21 isolates of X. fastidiosa subsp. sandyi from 4 different hosts in California and Texas supported their subspecific status. Analysis using the MLST genes plus one cell-surface gene showed no significant genetic differentiation based on geography or host plant within either subspecies. Two cases of homologous recombination (with X. fastidiosa subsp. multiplex, the third U.S. subspecies) were detected in X. fastidiosa subsp. fastidiosa. Excluding recombination, MLST site polymorphism in X. fastidiosa subsp. fastidiosa (0.048%) and X. fastidiosa subsp. sandyi (0.000%) was substantially lower than in X. fastidiosa subsp. multiplex (0.240%), consistent with the hypothesis that X. fastidiosa subspp. fastidiosa and sandyi were introduced into the United States (probably just prior to 1880 and 1980, respectively). Using whole-genome analysis, we showed that MLST is more effective at genetic discrimination at the specific and subspecific level than other typing methods applied to X. fastidiosa. Moreover, MLST is the only technique effective in detecting recombination.
Cheng, Lucy; Marinelli, Laura J; Grosset, Noël; Fitz-Gibbon, Sorel T; Bowman, Charles A; Dang, Brian Q; Russell, Daniel A; Jacobs-Sera, Deborah; Shi, Baochen; Pellegrini, Matteo; Miller, Jeff F; Gautier, Michel; Hatfull, Graham F; Modlin, Robert L
2018-03-01
A remarkable exception to the large genetic diversity often observed for bacteriophages infecting a specific bacterial host was found for the Cutibacterium acnes (formerly Propionibacterium acnes) phages, which are highly homogeneous. Phages infecting the related species, which is also a member of the Propionibacteriaceae family, Propionibacterium freudenreichii, a bacterium used in production of Swiss-type cheeses, have also been described and are common contaminants of the cheese manufacturing process. However, little is known about their genetic composition and diversity. We obtained seven independently isolated bacteriophages that infect P. freudenreichii from Swiss-type cheese samples, and determined their complete genome sequences. These data revealed that all seven phage isolates are of similar genomic length and GC% content, but their genomes are highly diverse, including genes encoding the capsid, tape measure, and tail proteins. In contrast to C. acnes phages, all P. freudenreichii phage genomes encode a putative integrase protein, suggesting they are capable of lysogenic growth. This is supported by the finding of related prophages in some P. freudenreichii strains. The seven phages could further be distinguished as belonging to two distinct genomic types, or 'clusters', based on nucleotide sequences, and host range analyses conducted on a collection of P. freudenreichii strains show a higher degree of host specificity than is observed for the C. acnes phages. Overall, our data demonstrate P. freudenreichii bacteriophages are distinct from C. acnes phages, as evidenced by their higher genetic diversity, potential for lysogenic growth, and more restricted host ranges. This suggests substantial differences in the evolution of these related species from the Propionibacteriaceae family and their phages, which is potentially related to their distinct environmental niches.
Sathkumara, Harindra D; Merritt, Adam J; Corea, Enoka M; Krishnananthasivam, Shivankari; Natesan, Mohan; Inglis, Timothy J J; De Silva, Aruna Dharshan
2018-02-01
Melioidosis, a potentially fatal tropical infection, is said to be underdiagnosed in low-income countries. An increase in melioidosis cases in Sri Lanka allowed us to analyze the relationship among clinical outcome, bacteriology, epidemiology, and geography in the first 108 laboratory-confirmed cases of melioidosis from a nationwide surveillance program. The additional 76 cases of laboratory-confirmed melioidosis confirmed further associations between Burkholderia pseudomallei multilocus sequence typing (MLST) and infection phenotype; ST1137/unifocal bacteremic infection (χ² = 3.86, P < 0.05), ST1136/multifocal infection without bacteremia (χ² = 15.8, P < 0.001), and ST1132/unifocal nonbacteremic infection (χ² = 6.34, P = 0.02). ST1137 infections were predominantly seen in the Western Province, whereas ST1132, 1135, and 1136 infections predominated in the Northwestern Province. Early participating centers in the surveillance program had a lower melioidosis-associated mortality than later participants (χ² = 3.99, P < 0.05). The eBURST (based upon related sequence types) algorithm, an MLST clustering method that infers founding genotypes and patterns of descent for related isolates and clonal complexes in an unrooted tree, showed uneven distribution of sequence types (STs). There was spatial clustering of the commonest STs (ST1132, 1136, and 1137) in the Western, Northwestern, and Central provinces. The recent increase in melioidosis in Sri Lanka uncovered by laboratory-enhanced surveillance is likely to be the result of a combination of improved laboratory detection, increased clinician awareness, recruitment of clinical centers, and small outbreaks. Further development of the surveillance program into a national genotyping-supported melioidosis registry will improve melioidosis diagnosis, treatment, and prevention where underdiagnosis and mortality rates remain high.
Systematic analysis and evolution of 5S ribosomal DNA in metazoans.
Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M
2013-11-01
Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12,766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades.
Systematic analysis and evolution of 5S ribosomal DNA in metazoans
Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M
2013-01-01
Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12 766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades. PMID:23838690
Equilibrium figures inside the dark-matter ring and the shapes of elliptical galaxies
NASA Astrophysics Data System (ADS)
Kondratyev, B. P.; Trubitsyna, N. G.; Kireeva, E. N.
We solve the general problem of the theory of equilibrium figures and analyze two classes of liquid rotating gravitating figures residing inside a gravitating ring or torus. These figures form families of sequences of generalized oblate spheroids and triaxial ellipsoids, which at the lower limit of the tidal parameter α = 0 have the form of the Maclaurin spheroids and the Jacobi ellipsoids. In intermediate cases 0 < α ≤ α_max each new sequence of axisymmetric equilibrium figures has two non-rotating boundary spheroids. At the upper limit α_max/(πGρ) = 0.1867 the sequence degenerates into a single non-rotating spheroid with the eccentricity e_cr ≈ 0.96 corresponding to the flattening limit of elliptical galaxies (E7). We also perform a detailed study of the sequences of generalized triaxial ellipsoids and find bifurcation points of triaxial ellipsoids in the sequences of generalized spheroids. We use this method to explain the shapes of E-galaxies. According to observations, very slowly rotating oblate E-type galaxies are known that have the shapes, which, because of instability, cannot be supported by velocity dispersion anisotropy exclusively. The hypothesis of a massive dark-matter outer ring requires no extreme anisotropy of pressure; it not only explains the shape of these elliptical galaxies, but also sheds new light on the riddle of the ellipticity limit (E7) of elliptical galaxies.
Predicting the host of influenza viruses based on the word vector.
Xu, Beibei; Tan, Zhiying; Li, Kenli; Jiang, Taijiao; Peng, Yousong
2017-01-01
Newly emerging influenza viruses continue to threaten public health. A rapid determination of the host range of newly discovered influenza viruses would assist in early assessment of their risk. Here, we attempted to predict the host of influenza viruses using the Support Vector Machine (SVM) classifier based on the word vector, a new representation and feature extraction method for biological sequences. The results show that the length of the word within the word vector, the sequence type (DNA or protein) and the species from which the sequences were derived for generating the word vector all influence the performance of models in predicting the host of influenza viruses. In nearly all cases, the models built on the surface proteins hemagglutinin (HA) and neuraminidase (NA) (or their genes) produced better results than internal influenza proteins (or their genes). The best performance was achieved when the model was built on the HA gene based on word vectors (words of three-letters long) generated from DNA sequences of the influenza virus. This results in accuracies of 99.7% for avian, 96.9% for human and 90.6% for swine influenza viruses. Compared to the method of sequence homology best-hit searches using the Basic Local Alignment Search Tool (BLAST), the word vector-based models still need further improvements in predicting the host of influenza A viruses.
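The word-based representation described above starts by cutting each sequence into short words. The following is only a minimal sketch of that tokenization step for three-letter words; it does not reproduce the word-vector training or the SVM classifier. The function name `kmer_words` and the choice of overlapping words are assumptions for illustration, since the abstract does not state how words were extracted.

```python
def kmer_words(sequence: str, k: int = 3, overlapping: bool = True) -> list[str]:
    """Split a nucleotide or protein sequence into words of length k.

    Assumption: overlapping words (step of 1) by default; the abstract
    does not say whether the words overlap.
    """
    seq = sequence.upper()
    step = 1 if overlapping else k
    # Stop at the last position where a full k-letter word still fits.
    return [seq[i:i + k] for i in range(0, len(seq) - k + 1, step)]


if __name__ == "__main__":
    # Hypothetical toy fragment, not a real influenza HA sequence.
    print(kmer_words("ATGAAGGCA"))
    print(kmer_words("ATGAAGGCA", overlapping=False))
```

The resulting word lists could then be passed to any embedding method, and the embeddings to a classifier; those stages are left out here because their settings are not given in the abstract.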
Janecek, S
1995-12-11
A short conserved sequence equivalent to the fifth conserved sequence region of alpha-amylases (173_LPDLD, Aspergillus oryzae alpha-amylase) comprising the calcium-ligand aspartate, Asp-175, was identified in the amino acid sequences of several members of the family of (alpha/beta)8-barrel glycosyl hydrolases. Despite the fact that the aspartate is not invariantly conserved, the stretch can be easily recognised in all sequences to be positioned 26-28 amino acid residues in front of the well-known catalytic aspartate (Asp-206, A. oryzae alpha-amylase) located in the beta 4-strand of the barrel. The identification of this region revealed remarkable similarities between some alpha-amylases (those from Bacillus megaterium, Bacillus subtilis and Dictyoglomus thermophilum) on the one hand and several different enzyme specificities (such as oligo-1,6-glucosidase, amylomaltase and neopullulanase, respectively) on the other hand. The most interesting example was offered by B. subtilis alpha-amylase and potato amylomaltase with the regions LYDWN and LYDWK, respectively. These observations support the idea that all members of the family of glycosyl hydrolases adopting the structure of the alpha-amylase-type (alpha/beta)8-barrel are mutually closely related and the strict evolutionary borders separating the individual enzyme specificities can be hardly defined.
Infants' statistical learning: 2- and 5-month-olds' segmentation of continuous visual sequences.
Slone, Lauren Krogh; Johnson, Scott P
2015-05-01
Past research suggests that infants have powerful statistical learning abilities; however, studies of infants' visual statistical learning offer differing accounts of the developmental trajectory of and constraints on this learning. To elucidate this issue, the current study tested the hypothesis that young infants' segmentation of visual sequences depends on redundant statistical cues to segmentation. A sample of 20 2-month-olds and 20 5-month-olds observed a continuous sequence of looming shapes in which unit boundaries were defined by both transitional probability and co-occurrence frequency. Following habituation, only 5-month-olds showed evidence of statistically segmenting the sequence, looking longer to a statistically improbable shape pair than to a probable pair. These results reaffirm the power of statistical learning in infants as young as 5 months but also suggest considerable development of statistical segmentation ability between 2 and 5 months of age. Moreover, the results do not support the idea that infants' ability to segment visual sequences based on transitional probabilities and/or co-occurrence frequencies is functional at the onset of visual experience, as has been suggested previously. Rather, this type of statistical segmentation appears to be constrained by the developmental state of the learner. Factors contributing to the development of statistical segmentation ability during early infancy, including memory and attention, are discussed. Copyright © 2015 Elsevier Inc. All rights reserved.
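The transitional-probability cue mentioned above can be illustrated with a short calculation. This is a sketch under the usual definition, TP(B | A) = count(A followed by B) / count(A), applied to a made-up symbol stream; the function name and the toy sequence are assumptions and are not the study's stimuli.

```python
from collections import Counter


def transitional_probabilities(sequence):
    """Return {(a, b): P(b follows a)} for every adjacent pair in the sequence."""
    pair_counts = Counter(zip(sequence, sequence[1:]))
    # Count each item only where it has a successor (all but the last element).
    first_counts = Counter(sequence[:-1])
    return {(a, b): n / first_counts[a] for (a, b), n in pair_counts.items()}


if __name__ == "__main__":
    # Hypothetical stream built from two units, "AB" and "CD".
    stream = "ABCDABABCD"
    for pair, p in sorted(transitional_probabilities(stream).items()):
        print(pair, round(p, 2))
```

In this toy stream, transitions inside a unit (A to B, C to D) have probability 1.0, while transitions across a unit boundary (B to C, B to A) are lower, which is the kind of contrast a learner could use to find unit boundaries.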
Yin, Li; Yao, Jiqiang; Gardner, Brent P; Chang, Kaifen; Yu, Fahong; Goodenow, Maureen M
2012-01-01
Next Generation sequencing (NGS) applied to human papilloma viruses (HPV) can provide sensitive methods to investigate the molecular epidemiology of multiple type HPV infection. Currently a genotyping system with a comprehensive collection of updated HPV reference sequences and a capacity to handle NGS data sets is lacking. HPV-QUEST was developed as an automated and rapid HPV genotyping system. The web-based HPV-QUEST subtyping algorithm was developed using HTML, PHP, Perl scripting language, and MYSQL as the database backend. HPV-QUEST includes a database of annotated HPV reference sequences with updated nomenclature covering 5 genuses, 14 species and 150 mucosal and cutaneous types to genotype blasted query sequences. HPV-QUEST processes up to 10 megabases of sequences within 1 to 2 minutes. Results are reported in html, text and excel formats and display e-value, blast score, and local and coverage identities; provide genus, species, type, infection site and risk for the best matched reference HPV sequence; and produce results ready for additional analyses.
Providing Epistemic Support For Assessments Through Mobile-Supported Sharing Activities
Raclaw, Joshua; Robles, Jessica S.; DiDomenico, Stephen M.
2017-01-01
This paper examines how participants in face-to-face conversation employ mobile phones as a resource for social action. We focus on what we call mobile-supported sharing activities, in which participants use a mobile phone to share text or images with others by voicing text aloud from their mobile or providing others with visual access to the device’s display screen. Drawing from naturalistic video recordings, we focus on how mobile-supported sharing activities invite assessments by providing access to an object that is not locally accessible to the participants. Such practices make relevant co-participants’ assessment of these objects and allow for different forms of co-participation across sequence types. We additionally examine how the organization of assessments during these sharing activities displays sensitivity to preference structure. The analysis illustrates the relevance of embodiment, local objects, and new communicative technologies to the production of action in co-present interaction. Data are in American English. PMID:28936031
Microbe-ID: an open source toolbox for microbial genotyping and species identification
Tabima, Javier F.; Everhart, Sydney E.; Larsen, Meredith M.; Weisberg, Alexandra J.; Kamvar, Zhian N.; Tancos, Matthew A.; Smart, Christine D.; Chang, Jeff H.
2016-01-01
Development of tools to identify species, genotypes, or novel strains of invasive organisms is critical for monitoring emergence and implementing rapid response measures. Molecular markers, although critical to identifying species or genotypes, require bioinformatic tools for analysis. However, user-friendly analytical tools for fast identification are not readily available. To address this need, we created a web-based set of applications called Microbe-ID that allow for customizing a toolbox for rapid species identification and strain genotyping using any genetic markers of choice. Two components of Microbe-ID, named Sequence-ID and Genotype-ID, implement species and genotype identification, respectively. Sequence-ID allows identification of species by using BLAST to query sequences for any locus of interest against a custom reference sequence database. Genotype-ID allows placement of an unknown multilocus marker in either a minimum spanning network or dendrogram with bootstrap support from a user-created reference database. Microbe-ID can be used for identification of any organism based on nucleotide sequences or any molecular marker type and several examples are provided. We created a public website for demonstration purposes called Microbe-ID (microbe-id.org) and provided a working implementation for the genus Phytophthora (phytophthora-id.org). In Phytophthora-ID, the Sequence-ID application allows identification based on ITS or cox spacer sequences. Genotype-ID groups individuals into clonal lineages based on simple sequence repeat (SSR) markers for the two invasive plant pathogen species P. infestans and P. ramorum. All code is open source and available on github and CRAN. Instructions for installation and use are provided at https://github.com/grunwaldlab/Microbe-ID. PMID:27602267
CRISPR/Cas9 in Genome Editing and Beyond.
Wang, Haifeng; La Russa, Marie; Qi, Lei S
2016-06-02
The Cas9 protein (CRISPR-associated protein 9), derived from type II CRISPR (clustered regularly interspaced short palindromic repeats) bacterial immune systems, is emerging as a powerful tool for engineering the genome in diverse organisms. As an RNA-guided DNA endonuclease, Cas9 can be easily programmed to target new sites by altering its guide RNA sequence, and its development as a tool has made sequence-specific gene editing several magnitudes easier. The nuclease-deactivated form of Cas9 further provides a versatile RNA-guided DNA-targeting platform for regulating and imaging the genome, as well as for rewriting the epigenetic status, all in a sequence-specific manner. With all of these advances, we have just begun to explore the possible applications of Cas9 in biomedical research and therapeutics. In this review, we describe the current models of Cas9 function and the structural and biochemical studies that support it. We focus on the applications of Cas9 for genome editing, regulation, and imaging, discuss other possible applications and some technical considerations, and highlight the many advantages that CRISPR/Cas9 technology offers.
Amyloid-like aggregation of provasopressin in diabetes insipidus and secretory granule sorting.
Beuret, Nicole; Hasler, Franziska; Prescianotto-Baschong, Cristina; Birk, Julia; Rutishauser, Jonas; Spiess, Martin
2017-01-26
Aggregation of peptide hormone precursors in the trans-Golgi network is an essential process in the biogenesis of secretory granules in endocrine cells. It has recently been proposed that this aggregation corresponds to the formation of functional amyloids. Our previous finding that dominant mutations in provasopressin, which cause cell degeneration and diabetes insipidus, prevent native folding and produce fibrillar aggregates in the endoplasmic reticulum (ER) might thus reflect mislocalized amyloid formation by sequences that evolved to mediate granule sorting. Here we identified two sequences responsible for fibrillar aggregation of mutant precursors in the ER: the N-terminal vasopressin nonapeptide and the C-terminal glycopeptide. To test their role in granule sorting, the glycopeptide was deleted and/or vasopressin mutated to inactivate ER aggregation while still permitting precursor folding and ER exit. These mutations strongly reduced sorting into granules and regulated secretion in endocrine AtT20 cells. The same sequences - vasopressin and the glycopeptide - mediate physiological aggregation of the wild-type hormone precursor into secretory granules and the pathological fibrillar aggregation of disease mutants in the ER. These findings support the amyloid hypothesis for secretory granule biogenesis.
NASA Astrophysics Data System (ADS)
Shuster, W.; Schifman, L. A.; Herrmann, D.
2017-12-01
Green infrastructure represents a broad set of site- to landscape-scale practices that can be flexibly implemented to increase sewershed retention capacity, and can thereby improve on the management of water quantity and quality. Although much green infrastructure presents as formal engineered designs, urbanized landscapes with highly-interspersed pervious surfaces (e.g., right-of-way, parks, lawns, vacant land) may offer ecosystem services as passive, infiltrative green infrastructure. Yet, infiltration and drainage processes are regulated by soil surface conditions, and then the layering of subsoil horizons, respectively. Drawing on a unique urban soil taxonomic and hydrologic dataset collected in 12 cities (each city representing a major soil order), we determined how urbanization processes altered the sequence of soil horizons (compared to pre-urbanized reference soil pedons) and modeled the hydrologic implications of these shifts in layering with an unsaturated zone code (HYDRUS2D). We found that the different layering sequences in urbanized soils render different types and extents of supporting (plant-available soil water), provisioning (productive vegetation), and regulating (runoff mitigation) ecosystem services.
Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M
2017-04-01
5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Smith, Gretchen N. L.; Conway, Christopher M.; Bauernschmidt, Althea; Pisoni, David B.
2015-01-01
Recent research suggests that language acquisition may rely on domain-general learning abilities, such as structured sequence processing, which is the ability to extract, encode, and represent structured patterns in a temporal sequence. If structured sequence processing supports language, then it may be possible to improve language function by enhancing this foundational learning ability. The goal of the present study was to use a novel computerized training task as a means to better understand the relationship between structured sequence processing and language function. Participants first were assessed on pre-training tasks to provide baseline behavioral measures of structured sequence processing and language abilities. Participants were then quasi-randomly assigned to either a treatment group involving adaptive structured visuospatial sequence training, a treatment group involving adaptive non-structured visuospatial sequence training, or a control group. Following four days of sequence training, all participants were assessed with the same pre-training measures. Overall comparison of the post-training means revealed no group differences. However, in order to examine the potential relations between sequence training, structured sequence processing, and language ability, we used a mediation analysis that showed two competing effects. In the indirect effect, adaptive sequence training with structural regularities had a positive impact on structured sequence processing performance, which in turn had a positive impact on language processing. This finding not only identifies a potential novel intervention to treat language impairments but also may be the first demonstration that structured sequence processing can be improved and that this, in turn, has an impact on language processing. However, in the direct effect, adaptive sequence training with structural regularities had a direct negative impact on language processing. 
This unexpected finding suggests that adaptive training with structural regularities might potentially interfere with language processing. Taken together, these findings underscore the importance of pursuing designs that promote a better understanding of the mechanisms underlying training-related changes, so that regimens can be developed that help reduce these types of negative effects while simultaneously maximizing the benefits to outcome measures of interest. PMID:25946222
Smith, Gretchen N L; Conway, Christopher M; Bauernschmidt, Althea; Pisoni, David B
2015-01-01
Recent research suggests that language acquisition may rely on domain-general learning abilities, such as structured sequence processing, which is the ability to extract, encode, and represent structured patterns in a temporal sequence. If structured sequence processing supports language, then it may be possible to improve language function by enhancing this foundational learning ability. The goal of the present study was to use a novel computerized training task as a means to better understand the relationship between structured sequence processing and language function. Participants first were assessed on pre-training tasks to provide baseline behavioral measures of structured sequence processing and language abilities. Participants were then quasi-randomly assigned to either a treatment group involving adaptive structured visuospatial sequence training, a treatment group involving adaptive non-structured visuospatial sequence training, or a control group. Following four days of sequence training, all participants were assessed with the same pre-training measures. Overall comparison of the post-training means revealed no group differences. However, in order to examine the potential relations between sequence training, structured sequence processing, and language ability, we used a mediation analysis that showed two competing effects. In the indirect effect, adaptive sequence training with structural regularities had a positive impact on structured sequence processing performance, which in turn had a positive impact on language processing. This finding not only identifies a potential novel intervention to treat language impairments but also may be the first demonstration that structured sequence processing can be improved and that this, in turn, has an impact on language processing. However, in the direct effect, adaptive sequence training with structural regularities had a direct negative impact on language processing. 
This unexpected finding suggests that adaptive training with structural regularities might potentially interfere with language processing. Taken together, these findings underscore the importance of pursuing designs that promote a better understanding of the mechanisms underlying training-related changes, so that regimens can be developed that help reduce these types of negative effects while simultaneously maximizing the benefits to outcome measures of interest.
Okada, Kazuma; Moriya, Shigeki; Haji, Takashi; Abe, Kazuyuki
2013-06-01
Using 11 consensus primer pairs designed from S-linked F-box genes of apple and Japanese pear, 10 new F-box genes (MdFBX21 to 30) were isolated from the apple cultivar 'Spartan' (S9S10). MdFBX21 to 23 and MdFBX24 to 30 were completely linked to the S9-RNase and S10-RNase, respectively, and showed pollen-specific expression and S-haplotype-specific polymorphisms. Therefore, these 10 F-box genes are good candidates for the pollen determinant of self-incompatibility in apple. Phylogenetic analysis and comparison of deduced amino acid sequences of MdFBX21 to 30 with those of 25 S-linked F-box genes previously isolated from apple showed that a deduced amino acid identity of greater than 88.0% can be used as the tentative criterion to classify F-box genes into one type. Using this criterion, 31 of 35 F-box genes of apple were classified into 11 types (SFBB1-11). All types included F-box genes derived from S3- and S9-haplotypes, and seven types included F-box genes derived from S3-, S9-, and S10-haplotypes. Moreover, comparison of nucleotide sequences of S-RNases and multiple F-box genes among S3-, S9-, and S10-haplotypes suggested that F-box genes within each type showed high nucleotide identity regardless of the identity of the S-RNase. The large number of F-box genes as candidates for the pollen determinant and the high degree of conservation within each type are consistent with the collaborative non-self-recognition model reported for Petunia. These findings support the existence of a collaborative non-self-recognition system in apple.
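The 88.0% identity criterion above can be made concrete with a small calculation. This is a sketch only: it assumes the two deduced amino acid sequences are already aligned to equal length and that identity is computed over columns without gaps, neither of which is specified in the abstract. The function names and the peptide strings are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identity between two pre-aligned sequences, ignoring gap columns."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    matches = sum(x == y for x, y in compared)
    return 100.0 * matches / len(compared)


def same_type(a: str, b: str, threshold: float = 88.0) -> bool:
    """Apply the tentative criterion: identity strictly greater than the threshold."""
    return percent_identity(a, b) > threshold


if __name__ == "__main__":
    # Hypothetical aligned peptides, not real MdFBX sequences.
    print(percent_identity("MKTAYIAKQR", "MKTAYIAKQK"))
    print(same_type("MKTAYIAKQR", "MKSAYLAKQK"))
```

A full classification would apply this comparison across all gene pairs and group them; the grouping rule is not described in the abstract, so it is not sketched here.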
Genotyping of Indian antigenic, vaccine, and field Brucella spp. using multilocus sequence typing.
Shome, Rajeswari; Krithiga, Natesan; Shankaranarayana, Padmashree B; Jegadesan, Sankarasubramanian; Udayakumar S, Vishnu; Shome, Bibek Ranjan; Saikia, Girin Kumar; Sharma, Narendra Kumar; Chauhan, Harshad; Chandel, Bharat Singh; Jeyaprakash, Rajendhran; Rahman, Habibur
2016-03-31
Brucellosis is one of the most important zoonotic diseases that affects multiple livestock species and causes great economic losses. The highly conserved genomes of Brucella, with > 90% homology among species, make it important to study the genetic diversity circulating in the country. A total of 26 Brucella spp. (4 reference strains and 22 field isolates) and 1 B. melitensis draft genome sequence from India (B. melitensis Bm IND1) were included for sequence typing. The field isolates were identified by biochemical tests and confirmed by both conventional and quantitative polymerase chain reaction (qPCR) targeting the bcsp31 Brucella genus-specific marker. Brucella speciation and biotyping was done by Bruce ladder, probe qPCR, and AMOS PCRs, respectively, and genotyping was done by multilocus sequence typing (MLST). The MLST typing of 27 Brucella spp. revealed five distinct sequence types (STs); the B. abortus S99 reference strain and 21 B. abortus field isolates belonged to ST1. On the other hand, the vaccine strain B. abortus S19 was genotyped as ST5. Similarly, B. melitensis 16M reference strain and one B. melitensis field isolate were grouped into ST7. Another B. melitensis field isolate belonged to ST8 (draft genome sequence from India), and only B. suis 1330 reference strain was found to be ST14. The sequences revealed genetic similarity of the Indian strains to the global reference and field strains. The study highlights the usefulness of MLST for typing of field isolates and validation of reference strains used for diagnosis and vaccination against brucellosis.
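In MLST generally, each isolate is reduced to a profile of allele numbers, one per locus, and the profile is looked up in a scheme table to obtain the sequence type. The sketch below shows only that lookup idea. The locus names, allele numbers, and ST labels are placeholders invented for illustration; they are not the Brucella scheme or data from this study.

```python
from typing import Dict, Optional, Sequence, Tuple


def assign_sequence_type(
    profile: Dict[str, int],
    scheme: Dict[Tuple[int, ...], str],
    loci: Sequence[str],
) -> Optional[str]:
    """Look up the ST for an allele profile; return None if the profile is not in the scheme."""
    key = tuple(profile[locus] for locus in loci)
    return scheme.get(key)


# Placeholder scheme: three hypothetical loci and two hypothetical sequence types.
LOCI = ("locusA", "locusB", "locusC")
SCHEME = {(1, 1, 1): "ST-x", (1, 2, 1): "ST-y"}

if __name__ == "__main__":
    isolate = {"locusA": 1, "locusB": 2, "locusC": 1}
    print(assign_sequence_type(isolate, SCHEME, LOCI))
```

A profile that matches no entry returns `None`, which in practice would correspond to a candidate novel sequence type needing curation.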
Reimer, Aleisha; Verghese, Bindhu; Lok, Mei; Ziegler, Jennifer; Farber, Jeffrey; Pagotto, Franco; Graham, Morag; Nadon, Celine A.
2012-01-01
Human listeriosis outbreaks in Canada have been predominantly caused by serotype 1/2a isolates with highly similar pulsed-field gel electrophoresis (PFGE) patterns. Multilocus sequence typing (MLST) and multi-virulence-locus sequence typing (MVLST) each identified a diverse population of Listeria monocytogenes isolates, and within that, both methods had congruent subtypes that substantiated a predominant clone (clonal complex 8; virulence type 59; proposed epidemic clone 5 [ECV]) that has been causing human illness across Canada for more than 2 decades. PMID:22337989
Current whole-body MRI applications in the neurofibromatoses: NF1, NF2, and schwannomatosis.
Ahlawat, Shivani; Fayad, Laura M; Khan, Muhammad Shayan; Bredella, Miriam A; Harris, Gordon J; Evans, D Gareth; Farschtschi, Said; Jacobs, Michael A; Chhabra, Avneesh; Salamon, Johannes M; Wenzel, Ralph; Mautner, Victor F; Dombi, Eva; Cai, Wenli; Plotkin, Scott R; Blakeley, Jaishri O
2016-08-16
The Response Evaluation in Neurofibromatosis and Schwannomatosis (REiNS) International Collaboration Whole-Body MRI (WB-MRI) Working Group reviewed the existing literature on WB-MRI, an emerging technology for assessing disease in patients with neurofibromatosis type 1 (NF1), neurofibromatosis type 2 (NF2), and schwannomatosis (SWN), to recommend optimal image acquisition and analysis methods to enable WB-MRI as an endpoint in NF clinical trials. A systematic process was used to review all published data about WB-MRI in NF syndromes to assess diagnostic accuracy, feasibility and reproducibility, and data about specific techniques for assessment of tumor burden, characterization of neoplasms, and response to therapy. WB-MRI at 1.5T or 3.0T is feasible for image acquisition. Short tau inversion recovery (STIR) sequence is used in all investigations to date, suggesting consensus about the utility of this sequence for detection of WB tumor burden in people with NF. There are insufficient data to support a consensus statement about the optimal imaging planes (axial vs coronal) or 2D vs 3D approaches. Functional imaging, although used in some NF studies, has not been systematically applied or evaluated. There are no comparative studies between regional vs WB-MRI or evaluations of WB-MRI reproducibility. WB-MRI is feasible for identifying tumors using both 1.5T and 3.0T systems. The STIR sequence is a core sequence. Additional investigation is needed to define the optimal approach for volumetric analysis, the reproducibility of WB-MRI in NF, and the diagnostic performance of WB-MRI vs regional MRI. © 2016 American Academy of Neurology.
Tsuchida, Sayaka; Kitahara, Maki; Nguema, Pierre Philippe Mbehang; Norimitsu, Saeko; Fujita, Shiho; Yamagiwa, Juichi; Ngomanda, Alfred; Ohkuma, Moriya; Ushida, Kazunari
2014-12-01
Four strains of Gram-staining-positive, anaerobic rods were isolated from the faeces of western lowland gorillas (Gorilla gorilla gorilla). Three strains, KZ01(T), KZ02 and KZ03, were isolated at the Kyoto City Zoo, Japan, and one strain, GG02, was isolated in the Moukalaba-Doudou National Park, Gabon. These strains were investigated taxonomically. These strains belonged to the Lactobacillus reuteri phylogenetic group according to phylogenetic analysis based on 16S rRNA gene sequences and specific phenotypic characteristics. Phylogenetic analysis of their 16S rRNA gene sequences revealed that strains KZ01(T), KZ02, KZ03 and GG02 formed a single monophyletic cluster and had a distinct line of descent. Based on sequence similarity of the 16S rRNA gene, Lactobacillus fermentum JCM 1173(T) (96.6 %) was the closest neighbour to these novel strains, although it was clear that these strains belonged to a different species. Partial pheS sequences also supported these relationships. DNA-DNA relatedness between strain KZ01(T) and L. fermentum JCM 1173(T) was less than 22 % and the DNA G+C content of strain KZ01(T) was 50.7 mol%. The cell-wall peptidoglycan type was A4β (l-Orn-d-Asp) and the major fatty acids were C16 : 0, C18 : 1ω9c and C19 : 1 cyclo 9,10. Therefore, based on phylogenetic, phenotypic and physiological evidence, these strains represent a novel species of the genus Lactobacillus, for which the name Lactobacillus gorillae sp. nov. is proposed. The type strain is KZ01(T) ( = JCM 19575(T) = DSM 28356(T)). © 2014 IUMS.
Ancestral European roots of Helicobacter pylori in India
Devi, S Manjulata; Ahmed, Irshad; Francalacci, Paolo; Hussain, M Abid; Akhter, Yusuf; Alvi, Ayesha; Sechi, Leonardo A; Mégraud, Francis; Ahmed, Niyaz
2007-01-01
Background The human gastric pathogen Helicobacter pylori is co-evolved with its host and therefore, origins and expansion of multiple populations and sub populations of H. pylori mirror ancient human migrations. Ancestral origins of H. pylori in the vast Indian subcontinent are debatable. It is not clear how different waves of human migrations in South Asia shaped the population structure of H. pylori. We tried to address these issues through mapping genetic origins of present day H. pylori in India and their genomic comparison with hundreds of isolates from different geographic regions. Results We attempted to dissect genetic identity of strains by multilocus sequence typing (MLST) of the 7 housekeeping genes (atpA, efp, ureI, ppa, mutY, trpC, yphC) and phylogeographic analysis of haplotypes using MEGA and NETWORK software while incorporating DNA sequences and genotyping data of whole cag pathogenicity-islands (cagPAI). The distribution of cagPAI genes within these strains was analyzed by using PCR and the geographic type of cagA phosphorylation motif EPIYA was determined by gene sequencing. All the isolates analyzed revealed European ancestry and belonged to H. pylori sub-population, hpEurope. The cagPAI harbored by Indian strains revealed European features upon PCR based analysis and whole PAI sequencing. Conclusion These observations suggest that H. pylori strains in India share ancestral origins with their European counterparts. Further, non-existence of other sub-populations such as hpAfrica and hpEastAsia, at least in our collection of isolates, suggest that the hpEurope strains enjoyed a special fitness advantage in Indian stomachs to out-compete any endogenous strains. These results also might support hypotheses related to gene flow in India through Indo-Aryans and arrival of Neolithic practices and languages from the Fertile Crescent. PMID:17584914
Sekizuka, Tsuyoshi; Yamashita, Akifumi; Murase, Yoshiro; Iwamoto, Tomotada; Mitarai, Satoshi; Kato, Seiya; Kuroda, Makoto
2015-01-01
Whole-genome sequencing (WGS) with next-generation DNA sequencing (NGS) is an increasingly accessible and affordable method for genotyping hundreds of Mycobacterium tuberculosis (Mtb) isolates, leading to more effective epidemiological studies involving single nucleotide variations (SNVs) in core genomic sequences based on molecular evolution. We developed an all-in-one web-based tool for genotyping Mtb, referred to as the Total Genotyping Solution for TB (TGS-TB), to facilitate multiple genotyping platforms using NGS for spoligotyping and the detection of phylogenies with core genomic SNVs, IS6110 insertion sites, and 43 customized loci for variable number tandem repeat (VNTR) through a user-friendly, simple click interface. This methodology is implemented with a KvarQ script to predict MTBC lineages/sublineages and potential antimicrobial resistance. Seven Mtb isolates (JP01 to JP07) in this study showing the same VNTR profile were accurately discriminated through median-joining network analysis using SNVs unique to those isolates. An additional IS6110 insertion was detected in one of those isolates as supportive genetic information in addition to core genomic SNVs. The results of in silico analyses using TGS-TB are consistent with those obtained using conventional molecular genotyping methods, suggesting that NGS short reads could provide multiple genotypes to discriminate multiple strains of Mtb, although longer NGS reads (≥300-mer) will be required for full genotyping on the TGS-TB web site. Most available short reads (~100-mer) can be utilized to discriminate the isolates based on the core genome phylogeny. TGS-TB provides a more accurate and discriminative strain typing for clinical and epidemiological investigations; NGS strain typing offers a total genotyping solution for Mtb outbreak and surveillance. TGS-TB web site: https://gph.niid.go.jp/tgs-tb/. PMID:26565975
Nguyen, David; Valenzuela, Nicole; Takemura, Ping; Bolon, Yung-Tsi; Springer, Brianna; Saito, Katsuyuki; Zheng, Ying; Hague, Tim; Pasztor, Agnes; Horvath, Gyorgy; Rigo, Krisztina; Reed, Elaine F.; Zhang, Qiuheng
2016-01-01
Background Unambiguous HLA typing is important in hematopoietic stem cell transplantation (HSCT), HLA disease association studies, and solid organ transplantation. However, current molecular typing methods only interrogate the antigen recognition site (ARS) of HLA genes, resulting in many cis-trans ambiguities that require additional typing methods to resolve. Here we report high-resolution HLA typing of 10,063 National Marrow Donor Program (NMDP) registry donors using long-range PCR by next generation sequencing (NGS) approach on buccal swab DNA. Methods Multiplex long-range PCR primers amplified the full-length of HLA class I genes (A, B, C) from promoter to 3’ UTR. Class II genes (DRB1, DQB1) were amplified from exon 2 through part of exon 4. PCR amplicons were pooled and sheared using Covaris fragmentation. Library preparation was performed using the Illumina TruSeq Nano kit on the Beckman FX automated platform. Each sample was tagged with a unique barcode, followed by 2×250 bp paired-end sequencing on the Illumina MiSeq. HLA typing was assigned using Omixon Twin software that combines two independent computational algorithms to ensure high confidence in allele calling. Consensus sequence and typing results were reported in Histoimmunogenetics Markup Language (HML) format. All homozygous alleles were confirmed by Luminex SSO typing and exon novelties were confirmed by Sanger sequencing. Results Using this automated workflow, over 10,063 NMDP registry donors were successfully typed under high-resolution by NGS. Despite known challenges of nucleic acid degradation and low DNA concentration commonly associated with buccal-based specimens, 97.8% of samples were successfully amplified using long-range PCR. Among these, 98.2% were successfully reported by NGS, with an accuracy rate of 99.84% in an independent blind Quality Control audit performed by the NMDP. In this study, NGS-HLA typing identified 23 null alleles (0.023%), 92 rare alleles (0.091%) and 42 exon novelties (0.042%). Conclusion Long-range, unambiguous HLA genotyping is achievable on clinical buccal swab-extracted DNA. Importantly, full-length gene sequencing and the ability to curate full sequence data will permit future interrogation of the impact of introns, expanded exons, and other gene regulatory sequences on clinical outcomes in transplantation. PMID:27798706
Yin, Yuxin; Lan, James H; Nguyen, David; Valenzuela, Nicole; Takemura, Ping; Bolon, Yung-Tsi; Springer, Brianna; Saito, Katsuyuki; Zheng, Ying; Hague, Tim; Pasztor, Agnes; Horvath, Gyorgy; Rigo, Krisztina; Reed, Elaine F; Zhang, Qiuheng
2016-01-01
Unambiguous HLA typing is important in hematopoietic stem cell transplantation (HSCT), HLA disease association studies, and solid organ transplantation. However, current molecular typing methods only interrogate the antigen recognition site (ARS) of HLA genes, resulting in many cis-trans ambiguities that require additional typing methods to resolve. Here we report high-resolution HLA typing of 10,063 National Marrow Donor Program (NMDP) registry donors using long-range PCR by next generation sequencing (NGS) approach on buccal swab DNA. Multiplex long-range PCR primers amplified the full-length of HLA class I genes (A, B, C) from promoter to 3' UTR. Class II genes (DRB1, DQB1) were amplified from exon 2 through part of exon 4. PCR amplicons were pooled and sheared using Covaris fragmentation. Library preparation was performed using the Illumina TruSeq Nano kit on the Beckman FX automated platform. Each sample was tagged with a unique barcode, followed by 2×250 bp paired-end sequencing on the Illumina MiSeq. HLA typing was assigned using Omixon Twin software that combines two independent computational algorithms to ensure high confidence in allele calling. Consensus sequence and typing results were reported in Histoimmunogenetics Markup Language (HML) format. All homozygous alleles were confirmed by Luminex SSO typing and exon novelties were confirmed by Sanger sequencing. Using this automated workflow, over 10,063 NMDP registry donors were successfully typed under high-resolution by NGS. Despite known challenges of nucleic acid degradation and low DNA concentration commonly associated with buccal-based specimens, 97.8% of samples were successfully amplified using long-range PCR. Among these, 98.2% were successfully reported by NGS, with an accuracy rate of 99.84% in an independent blind Quality Control audit performed by the NMDP. In this study, NGS-HLA typing identified 23 null alleles (0.023%), 92 rare alleles (0.091%) and 42 exon novelties (0.042%). Long-range, unambiguous HLA genotyping is achievable on clinical buccal swab-extracted DNA. Importantly, full-length gene sequencing and the ability to curate full sequence data will permit future interrogation of the impact of introns, expanded exons, and other gene regulatory sequences on clinical outcomes in transplantation.
Literature classification for semi-automated updating of biological knowledgebases
2013-01-01
Background As the output of biological assays increases in resolution and volume, the body of specialized biological data, such as functional annotations of gene and protein sequences, enables extraction of higher-level knowledge needed for practical application in bioinformatics. Whereas common types of biological data, such as sequence data, are extensively stored in biological databases, functional annotations, such as immunological epitopes, are found primarily in semi-structured formats or free text embedded in primary scientific literature. Results We defined and applied a machine learning approach for literature classification to support updating of TANTIGEN, a knowledgebase of tumor T-cell antigens. Abstracts from PubMed were downloaded and classified as either "relevant" or "irrelevant" for database update. Training and five-fold cross-validation of a k-NN classifier on 310 abstracts yielded classification accuracy of 0.95, thus showing significant value in support of data extraction from the literature. Conclusion We here propose a conceptual framework for semi-automated extraction of epitope data embedded in scientific literature using principles from text mining and machine learning. The addition of such data will aid in the transition of biological databases to knowledgebases. PMID:24564403
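The evaluation described in this abstract (a k-NN classifier scored by five-fold cross-validation) can be illustrated with a minimal sketch. The abstract does not state the features, distance measure, or value of k that were used, so everything below is an assumption chosen for illustration: bag-of-words features, cosine similarity, k = 3, folds assigned by index, and a hypothetical ten-item toy data set that is not taken from the study.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lower-cased whitespace tokens with their counts (assumed feature set)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[word] * b[word] for word in a if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_predict(train, query_text, k=3):
    """Majority label among the k training texts most similar to the query."""
    query = bag_of_words(query_text)
    ranked = sorted(
        train,
        key=lambda item: cosine_similarity(bag_of_words(item[0]), query),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def cross_validated_accuracy(data, folds=5, k=3):
    """Hold out every `folds`-th item in turn and score the predictions."""
    correct = 0
    for fold in range(folds):
        held_out = [item for i, item in enumerate(data) if i % folds == fold]
        train = [item for i, item in enumerate(data) if i % folds != fold]
        correct += sum(
            knn_predict(train, text, k) == label for text, label in held_out
        )
    return correct / len(data)


# Hypothetical toy "abstracts"; the labels mirror the two classes in the text.
toy_data = [
    ("tumor antigen epitope t-cell", "relevant"),
    ("soil rock geology survey", "irrelevant"),
    ("tumor antigen peptide", "relevant"),
    ("rock mineral geology", "irrelevant"),
    ("epitope tumor t-cell response", "relevant"),
    ("geology soil erosion", "irrelevant"),
    ("antigen epitope mapping tumor", "relevant"),
    ("mineral rock soil sample", "irrelevant"),
    ("tumor t-cell antigen recognition", "relevant"),
    ("soil geology mineral map", "irrelevant"),
]

accuracy = cross_validated_accuracy(toy_data, folds=5, k=3)
print(accuracy)
```

Because the two toy classes share no vocabulary, the sketch separates them perfectly; real abstracts would overlap and would likely need better tokenization, term weighting, and shuffled or stratified folds.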
Arthropod phylogeny based on eight molecular loci and morphology
NASA Technical Reports Server (NTRS)
Giribet, G.; Edgecombe, G. D.; Wheeler, W. C.
2001-01-01
The interrelationships of major clades within the Arthropoda remain one of the most contentious issues in systematics, which has traditionally been the domain of morphologists. A growing body of DNA sequences and other types of molecular data has revitalized study of arthropod phylogeny and has inspired new considerations of character evolution. Novel hypotheses such as a crustacean-hexapod affinity were based on analyses of single or few genes and limited taxon sampling, but have received recent support from mitochondrial gene order, and eye and brain ultrastructure and neurogenesis. Here we assess relationships within Arthropoda based on a synthesis of all well sampled molecular loci together with a comprehensive data set of morphological, developmental, ultrastructural and gene-order characters. The molecular data include sequences of three nuclear ribosomal genes, three nuclear protein-coding genes, and two mitochondrial genes (one protein coding, one ribosomal). We devised new optimization procedures and constructed a parallel computer cluster with 256 central processing units to analyse molecular data on a scale not previously possible. The optimal 'total evidence' cladogram supports the crustacean-hexapod clade, recognizes pycnogonids as sister to other euarthropods, and indicates monophyly of Myriapoda and Mandibulata.
Deep Sequencing to Identify the Causes of Viral Encephalitis
Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.
2014-01-01
Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691
Demczuk, W; Sidhu, S; Unemo, M; Whiley, D M; Allen, V G; Dillon, J R; Cole, M; Seah, C; Trembizki, E; Trees, D L; Kersh, E N; Abrams, A J; de Vries, H J C; van Dam, A P; Medina, I; Bharat, A; Mulvey, M R; Van Domselaar, G; Martin, I
2017-05-01
A curated Web-based user-friendly sequence typing tool based on antimicrobial resistance determinants in Neisseria gonorrhoeae was developed and is publicly accessible (https://ngstar.canada.ca). The N. gonorrhoeae Sequence Typing for Antimicrobial Resistance (NG-STAR) molecular typing scheme uses the DNA sequences of 7 genes (penA, mtrR, porB, ponA, gyrA, parC, and 23S rRNA) associated with resistance to β-lactam antimicrobials, macrolides, or fluoroquinolones. NG-STAR uses the entire penA sequence, combining the historical nomenclature for penA types I to XXXVIII with novel nucleotide sequence designations; the full mtrR sequence and a portion of its promoter region; portions of ponA, porB, gyrA, and parC; and 23S rRNA sequences. NG-STAR grouped 768 isolates into 139 sequence types (STs) (n = 660) consisting of 29 clonal complexes (CCs) having a maximum of a single-locus variation, and 76 NG-STAR STs (n = 109) were identified as unrelated singletons. NG-STAR had a high Simpson's diversity index value of 96.5% (95% confidence interval [CI] = 0.959 to 0.969). The most common STs were NG-STAR ST-90 (n = 100; 13.0%), ST-42 and ST-91 (n = 45; 5.9%), ST-64 (n = 44; 5.72%), and ST-139 (n = 42; 5.5%). Decreased susceptibility to azithromycin was associated with NG-STAR ST-58, ST-61, ST-64, ST-79, ST-91, and ST-139 (n = 156; 92.3%); decreased susceptibility to cephalosporins was associated with NG-STAR ST-90, ST-91, and ST-97 (n = 162; 94.2%); and ciprofloxacin resistance was associated with NG-STAR ST-26, ST-90, ST-91, ST-97, ST-150, and ST-158 (n = 196; 98.0%). All isolates of NG-STAR ST-42, ST-43, ST-63, ST-81, and ST-160 (n = 106) were susceptible to all four antimicrobials. The standardization of nomenclature associated with antimicrobial resistance determinants through an internationally available database will facilitate the monitoring of the global dissemination of antimicrobial-resistant N. gonorrhoeae strains. © Crown copyright 2017.
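The grouping of sequence types into clonal complexes "having a maximum of a single-locus variation" can be sketched as follows. This is an illustrative reading, not the NG-STAR implementation: it assumes each ST is a tuple of seven allele numbers and that STs are chained together (single linkage) whenever two profiles differ at no more than one locus. The profile names and allele numbers are hypothetical.

```python
def locus_differences(profile_a, profile_b):
    """Number of loci at which two allele profiles differ."""
    return sum(1 for a, b in zip(profile_a, profile_b) if a != b)


def group_by_single_locus_variants(profiles):
    """Group STs linked by chains of single-locus variants.

    profiles maps an ST name to a tuple of allele numbers.
    Returns a list of sets of ST names, ordered by their smallest member.
    """
    names = sorted(profiles)
    parent = {name: name for name in names}  # union-find forest

    def find(name):
        # Walk to the root, shortening the path along the way.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for i, first in enumerate(names):
        for second in names[i + 1:]:
            if locus_differences(profiles[first], profiles[second]) <= 1:
                parent[find(first)] = find(second)

    groups = {}
    for name in names:
        groups.setdefault(find(name), set()).add(name)
    return sorted(groups.values(), key=lambda group: sorted(group))


# Hypothetical seven-locus profiles (not real NG-STAR allele numbers).
toy_profiles = {
    "ST-A": (1, 1, 1, 1, 1, 1, 1),
    "ST-B": (1, 1, 1, 1, 1, 1, 2),  # one locus away from ST-A
    "ST-C": (1, 1, 1, 1, 1, 2, 2),  # one locus away from ST-B
    "ST-D": (5, 5, 5, 5, 5, 5, 5),  # unrelated singleton
}

complexes = group_by_single_locus_variants(toy_profiles)
print(complexes)
```

Here ST-A and ST-C differ at two loci but still fall into one group because both are single-locus variants of ST-B; a stricter rule (for example, requiring every member to be within one locus of a founder) would give different groups.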
Gharsa, H; Slama, K Ben; Gómez-Sanz, E; Gómez, P; Klibi, N; Zarazaga, M; Boudabous, A; Torres, C
2015-07-01
Staphylococcus intermedius group (SIG) bacteria can colonise the nares of some animals but are also emerging pathogens in humans and animals. To analyse SIG nasal carriage in healthy donkeys destined for food consumption in Tunisia and to characterise recovered isolates. Nasal swabs from 100 healthy donkeys were tested for SIG recovery, and isolates were identified by biochemical and molecular methods. Antimicrobial susceptibility of isolates was tested and detection of antimicrobial resistance and virulence genes was performed. Isolates were typed at the clonal level by multilocus sequence typing and SmaI pulsed-field gel electrophoresis. Staphylococcus delphini and Staphylococcus pseudintermedius (included in SIG) were obtained in 19% and 2% of the tested samples, respectively, and one isolate per sample was characterised. All isolates were meticillin susceptible and mecA negative. Most S. delphini and S. pseudintermedius isolates showed susceptibility to all antimicrobials tested, with the exception of 2 isolates resistant to tetracycline (tet(M) gene) or fusidic acid. The following toxin genes were identified (percentage of isolates): lukS-I (100%), lukF-I (9.5%), siet (100%), se-int (90%), seccanine (19%) and expA (9.5%). Thirteen different pulsed-field gel electrophoresis profiles were identified among the 21 SIG isolates. Additionally, the following 9 different sequence types (STs) were detected by multilocus sequence typing, 6 of them new: ST219 (6 isolates), ST12 (5 isolates), ST220 (3 isolates), ST13, ST50, ST193, ST196, ST218 and ST221 (one isolate each). Staphylococcus delphini and S. pseudintermedius are common nasal colonisers of donkeys, generally susceptible to the antimicrobials tested; nevertheless, these SIG isolates contain virulence genes, including the recently described exfoliative gene (expA) and several enterotoxin genes, with potential implications for public health. This is the first description of S. delphini in Tunisia. 
© 2014 EVJ Ltd.
Occurrence and distribution of Giardia species in wild rodents in Germany.
Helmy, Yosra A; Spierling, Nastasja G; Schmidt, Sabrina; Rosenfeld, Ulrike M; Reil, Daniela; Imholt, Christian; Jacob, Jens; Ulrich, Rainer G; Aebischer, Toni; Klotz, Christian
2018-03-27
Giardiasis is an important gastrointestinal parasitic disease in humans and other mammals caused by the protozoan Giardia duodenalis. This species complex is represented by genetically distinct groups (assemblages A-H) with varying zoonotic potential and host preferences. Wild rodents can harbor potentially zoonotic assemblages A and B, and the rodent-specific assemblage G. Other Giardia spp. found in these animals are Giardia muris and Giardia microti. For the latter, only limited information on genetic typing is available. It has been speculated that wild rodents might represent an important reservoir for parasites causing human giardiasis. The aim of this study was to investigate the occurrence and distribution of Giardia spp. and assemblage types in wild rodents from different study sites in Germany. Screening of 577 wild rodents of the genera Apodemus, Microtus and Myodes, sampled at eleven study sites in Germany, revealed a high overall Giardia prevalence. Giardia species determination at the SSU rDNA gene locus revealed that Apodemus mice, depending on species, were predominantly infected with one of two distinct G. muris sequence types. Giardia microti was the predominant parasite species found in voles of the genera Microtus and Myodes. Only a few animals were positive for potentially zoonotic G. duodenalis. Subtyping at the beta-giardin (bg) and glutamate dehydrogenase (gdh) genes strongly supported the existence of different phylogenetic subgroups of G. microti that are preferentially harbored by distinct host species. The present study highlights the preference of G. muris for Apodemus, and G. microti for Microtus and Myodes hosts and argues for a very low prevalence of zoonotic G. duodenalis assemblages in wild rodents in Germany. It also provides evidence that G. muris and G. microti subdivide into several phylogenetically distinguishable subgroups, each of which appears to be preferentially harbored by species of a particular rodent host genus. 
Finally, the study expands the database of sequences relevant for sequence typing of G. muris and G. microti isolates which will greatly help future analyses of these parasites' population structure.
Xie, Cheng-Hui; Yokota, Akira
2005-11-01
The aim of this study was to clarify the taxonomic position of the nitrogen-fixing and hydrogen-oxidizing bacteria Alcaligenes latus strains IAM 12599T, IAM 12664 and IAM 12665 and Pseudomonas saccharophila IAM 14368T. It was found that the type strain of Alcaligenes latus, IAM 12599T, showed 99.9 and 96.1 % 16S rRNA gene sequence similarity to strains IAM 12665 and IAM 12664, respectively. A comparison using DNA-DNA hybridization suggested that strains IAM 12599T and IAM 12665 belong to a single species (89.7 %) and that strain IAM 12664 (35.1 %) forms a separate species. The phenotypic characteristics also support the conclusion that these bacteria should be identified as two species of a new genus: Azohydromonas lata gen. nov., comb. nov. (type strain IAM 12599T=DSM 1122T=LMG 3321T=ATCC 29712T; reference strain IAM 12665=DSM 1123=LMG 3325=ATCC 29714) and Azohydromonas australica sp. nov. (type strain IAM 12664T=DSM 1124T=LMG 3324T=ATCC 29713T). Pseudomonas saccharophila IAM 14368T was found to be closely related to the phototrophic bacterium Roseateles depolymerans, with 96.8 % 16S rRNA gene sequence similarity, but the two bacteria are quite different with respect to their metabolism and some significant phenotypic characteristics, suggesting that they cannot be included in a single genus. Further studies on their nifH gene sequences, G+C content of the DNA and cellular fatty acid composition confirm that Pseudomonas saccharophila should be reclassified: the name Pelomonas saccharophila gen. nov., comb. nov. is proposed, with the type strain IAM 14368T (=LMG 2256T=ATCC 15946T).
NASA Technical Reports Server (NTRS)
Sugavanam, E. B.; Vidyadharan, K. T.
1988-01-01
Presented here are the results of detailed investigations encompassing extensive structural mapping in the charnockite-high grade gneiss terrain of North Arcot district and the type area in Pallavaram in Tamil Nadu, supported by petrography, mineral chemistry, and major, minor and REE distribution patterns in various lithounits. This has helped in understanding the evolutionary history of the southern peninsular shield. A possible tectonic model is also suggested. The results of these studies are compared with similar rock types from parts of Andhra Pradesh, Kerala, Sri Lanka, Lapland and Nigeria, which has brought out a well-defined correlation in geochemical characteristics. The area investigated has an interbanded sequence of a thick pile of charnockite and a supracrustal succession of shelf-type sediments, layered igneous complex, and basic and ultrabasic rocks involved in complex structural, tectonic, igneous and metamorphic events.
Malur, Achut G.; Gupta, Neera K.; De, Bishnu P.; Banerjee, Amiya K.
2002-01-01
The large protein (L) of the human parainfluenza virus type 3 (HPIV3) is the functional RNA-dependent RNA polymerase, which possesses highly conserved residues QGDNQ located within motif C of domain III comprising the putative polymerase active site. We have characterized the role of the QGDNQ residues as well as the residues flanking this region in the polymerase activity of the L protein by site-directed mutagenesis and examining the polymerase activity of the wild-type and mutant L proteins by an in vivo minigenome replication assay and an in vitro mRNA transcription assay. All mutations in the QGDNQ residues abolished transcription while mutations in the flanking residues gave rise to variable polymerase activities. These observations support the contention that the QGDNQ sequence is absolutely required for the polymerase activity of the HPIV3 RNA-dependent RNA polymerase. PMID:12064576
Compartment-specific control of signaling from a DNA-sensing immune receptor.
Engel, Alex; Barton, Gregory M
2010-11-30
Many cell signaling events are spatially organized, enabling control of specificity, amplitude, and duration. Toll-like receptor 9 (TLR9) binds to nucleic acid sequences present in bacteria or DNA viruses and initiates a signaling pathway that culminates in the transcriptional induction of genes important for host defense, such as those encoding proinflammatory cytokines and type I interferon. A specialized membrane trafficking pathway has been described that is required for a specific branch of TLR9 signaling: the production of type I interferon. Cells deficient for the clathrin adaptor complex AP-3 failed to traffic TLR9 to a specific endosomal compartment and were unable to produce type I interferon despite normal increases in the abundance of interleukin-12p40, a proinflammatory cytokine. These findings support a model in which the targets of TLR9 engagement are controlled by the compartment in which TLR9 is activated.
Episodic-like memory trace in awake replay of hippocampal place cell activity sequences.
Takahashi, Susumu
2015-10-20
Episodic memory retrieval of events at a specific place and time is effective for future planning. Sequential reactivation of the hippocampal place cells along familiar paths while the animal pauses is well suited to such a memory retrieval process. It is, however, unknown whether this awake replay represents events occurring along the path. Using a subtask switching protocol in which the animal experienced three subtasks as 'what' information in a maze, I here show that the replay represents a trial type, consisting of path and subtask, in terms of neuronal firing timings and rates. The actual trial type to be rewarded could only be reliably predicted from replays that occurred at the decision point. This trial-type representation implies that not only 'where and when' but also 'what' information is contained in the replay. This result supports the view that awake replay is an episodic-like memory retrieval process.
Li, Zhirong; Liu, Xiaolei; Zhao, Jianhong; Xu, Kaiyue; Tian, Tiantian; Yang, Jing; Qiang, Cuixin; Shi, Dongyan; Wei, Honglian; Sun, Suju; Cui, Qingqing; Li, Ruxin; Niu, Yanan; Huang, Bixing
2018-04-01
Clostridium difficile is the causative pathogen for antibiotic-related nosocomial diarrhea. For epidemiological study and identification of virulent clones, a new binary typing method was developed for C. difficile in this study. The usefulness of this newly developed, optimized 10-loci binary typing method was compared with that of two widely used methods, ribotyping and multilocus sequence typing (MLST), in 189 C. difficile samples. Binary typing, ribotyping and MLST typed the samples into 53 binary types (BTs), 26 ribotypes (RTs), and 33 MLST sequence types (STs), respectively. The typing ability of the binary method was better than that of either ribotyping or MLST, with Simpson Index (SI) values of 0.937, 0.892 and 0.859, respectively. The ease of testing, portability and cost-effectiveness of the new binary typing would make it a useful typing alternative for outbreak investigations within healthcare facilities and epidemiological research. Copyright © 2018 Elsevier B.V. All rights reserved.
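The Simpson Index values used here to compare typing methods can be reproduced in principle from the list of types assigned to each isolate. The abstract does not say which form of the index was computed; the sketch below assumes the Hunter-Gaston form that is commonly used for typing schemes, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates of type j and N is the total number of isolates. The example labels are hypothetical and are not data from the study.

```python
from collections import Counter


def simpson_diversity(type_labels):
    """Simpson's index of diversity (Hunter-Gaston form, an assumption).

    Returns the probability that two isolates drawn at random without
    replacement belong to different types.
    """
    counts = Counter(type_labels)
    total = sum(counts.values())
    if total < 2:
        raise ValueError("at least two isolates are needed")
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (total * (total - 1))


# Hypothetical typing result for six isolates.
toy_types = ["BT1", "BT1", "BT2", "BT3", "BT3", "BT3"]
diversity = simpson_diversity(toy_types)
print(round(diversity, 3))
```

A method that splits the same isolates into more, evenly sized types gives a value closer to 1, which is the sense in which the abstract ranks binary typing above ribotyping and MLST.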
NASA Technical Reports Server (NTRS)
Venkateswaran, K.; Dollhopf, M. E.; Aller, R.; Stackebrandt, E.; Nealson, K. H.
1998-01-01
A new bacterial species belonging to the genus Shewanella is described on the basis of phenotypic characterization and sequence analysis of its 16S rRNA-encoding and gyrase B (gyrB) genes. This organism, isolated from shallow-water marine sediments derived from the Amazon River delta, is a Gram-negative, motile, polarly flagellated, facultatively anaerobic, rod-shaped eubacterium and has a G+C content of 51.7 mol%. Strain SB2BT is exceptionally active in the anaerobic reduction of iron, manganese and sulfur compounds. SB2BT grows optimally at 35 °C, with 1-3% NaCl and over a pH range of 7-8. Analysis of the 16S rDNA sequence revealed a clear affiliation between strain SB2BT and members of the gamma subclass of the class Proteobacteria. High similarity values were found with certain members of the genus Shewanella, especially with Shewanella putrefaciens, and this was supported by cellular fatty acid profiles and phenotypic characterization. DNA-DNA hybridization between strain SB2BT and its phylogenetically closest relatives revealed low similarity values (24.6-42.7%) which indicated species status for strain SB2BT. That SB2BT represents a distinct bacterial species within the genus Shewanella is also supported by gyrB sequence analysis. Considering the source of the isolate, the name Shewanella amazonensis sp. nov. is proposed and strain SB2BT (= ATCC 700329T) is designated as the type strain.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A.
Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.
Halpern, Malka; Fridman, Svetlana; Aizenberg-Gershtein, Yana; Izhaki, Ido
2013-01-01
Pseudomonas flectens Johnson 1956, a plant-pathogenic bacterium on the pods of the French bean, is no longer considered to be a member of the genus Pseudomonas sensu stricto. A polyphasic approach that included examination of phenotypic properties and phylogenetic analyses based on 16S rRNA, rpoB and atpD gene sequences supported the transfer of Pseudomonas flectens Johnson 1956 to a new genus in the family Enterobacteriaceae as Phaseolibacter flectens gen. nov., comb. nov. Two strains of Phaseolibacter flectens were studied (ATCC 12775(T) and LMG 2186); the strains shared 99.8 % sequence similarity in their 16S rRNA genes and the housekeeping gene sequences were identical. Strains of Phaseolibacter flectens shared 96.6 % or less 16S rRNA gene sequence similarity with members of different genera in the family Enterobacteriaceae and only 84.7 % sequence similarity with Pseudomonas aeruginosa LMG 1242(T), demonstrating that they are not related to the genus Pseudomonas. As Phaseolibacter flectens formed an independent phyletic lineage in all of the phylogenetic analyses, it could not be affiliated to any of the recognized genera within the family Enterobacteriaceae and therefore was assigned to a new genus. Cells were Gram-negative, straight rods, motile by means of one or two polar flagella, fermentative, facultative anaerobes, oxidase-negative and catalase-positive. Growth occurred in the presence of 0-60 % sucrose. The DNA G+C content of the type strain was 44.3 mol%. On the basis of phenotypic properties and phylogenetic distinctiveness, Pseudomonas flectens Johnson 1956 is transferred to the novel genus Phaseolibacter gen. nov. as Phaseolibacter flectens gen. nov., comb. nov. The type strain of Phaseolibacter flectens is ATCC 12775(T) = CFBP 3281(T) = ICMP 745(T) = LMG 2187(T) = NCPPB 539(T).
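Percent sequence similarity figures such as the 99.8 %, 96.6 % and 84.7 % values above are derived from pairwise alignments. The sketch below shows one simple way to compute percent identity from two already-aligned sequences. It assumes that columns containing a gap in either sequence are skipped, which is only one of several conventions and is not stated in the abstract; the function name and example strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: alignment columns with a gap in either sequence are
    excluded from both the numerator and the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical toy alignment: one mismatch in ten ungapped columns.
print(percent_identity("ACGTACGTAC", "ACGTACGTAT"))
```

Because gap handling changes the denominator, values computed this way are only comparable with published figures when the same convention and alignment are used.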
Boité, Mariana C.; Mauricio, Isabel L.; Miles, Michael A.; Cupolillo, Elisa
2012-01-01
The Leishmania genus comprises up to 35 species, some with status still under discussion. The multilocus sequence typing (MLST)—extensively used for bacteria—has been proposed for pathogenic trypanosomatids. For Leishmania, however, a detailed analysis and revision on the taxonomy is still required. We have partially sequenced four housekeeping genes—glucose-6-phosphate dehydrogenase (G6PD), 6-phosphogluconate dehydrogenase (6PGD), mannose phosphate isomerase (MPI) and isocitrate dehydrogenase (ICD)—from 96 Leishmania (Viannia) strains and assessed their discriminatory typing capacity. The fragments had different degrees of diversity, and are thus suitable to be used in combination for intra- and inter-specific inferences. Species-specific single nucleotide polymorphisms were detected, but not for all species; ambiguous sites indicating heterozygosis were observed, as well as the putative homozygous donor. A large number of haplotypes were detected for each marker; for 6PGD a possible ancestral allele for L. (Viannia) was found. Maximum parsimony-based haplotype networks were built. Strains of different species, as identified by multilocus enzyme electrophoresis (MLEE), formed separated clusters in each network, with exceptions. NeighborNet of concatenated sequences confirmed species-specific clusters, suggesting recombination occurring in L. braziliensis and L. guyanensis. Phylogenetic analysis indicates L. lainsoni and L. naiffi as the most divergent species and does not support L. shawi as a distinct species, placing it in the L. guyanensis cluster. BURST analysis resulted in six clonal complexes (CC), corresponding to distinct species. The L. braziliensis strains evaluated correspond to one widely geographically distributed CC and another restricted to one endemic area. 
This study demonstrates the value of systematic multilocus sequence analysis (MLSA) for determining intra- and inter-species relationships and presents an approach to validate the species status of some entities. Furthermore, it contributes to the phylogeny of L. (Viannia) and might be helpful for epidemiological and population genetics analysis based on haplotype/diplotype determinations and inferences. PMID:23133690
Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A; Awosika, Joy; Briska, Adam; Ptashkin, Ryan N; Wagner, Trevor; Rajanna, Chythanya; Tsang, Hsinyi; Johnson, Shannon L; Mokashi, Vishwesh P; Chain, Patrick S G; Sozhamannan, Shanmuga
2015-01-01
Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.
Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A.; ...
2015-03-20
Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.
G-CNV: A GPU-Based Tool for Preparing Data to Detect CNVs with Read-Depth Methods.
Manconi, Andrea; Manca, Emanuele; Moscatelli, Marco; Gnocchi, Matteo; Orro, Alessandro; Armano, Giuliano; Milanesi, Luciano
2015-01-01
Copy number variations (CNVs) are the most prevalent types of structural variations (SVs) in the human genome and are involved in a wide range of common human diseases. Different computational methods have been devised to detect this type of SVs and to study how they are implicated in human diseases. Recently, computational methods based on high-throughput sequencing (HTS) are increasingly used. The majority of these methods focus on mapping short-read sequences generated from a donor against a reference genome to detect signatures distinctive of CNVs. In particular, read-depth based methods detect CNVs by analyzing genomic regions with significantly different read-depth from the other ones. The pipeline analysis of these methods consists of four main stages: (i) data preparation, (ii) data normalization, (iii) CNV regions identification, and (iv) copy number estimation. However, available tools do not support most of the operations required at the first two stages of this pipeline. Typically, they start the analysis by building the read-depth signal from pre-processed alignments. Therefore, third-party tools must be used to perform most of the preliminary operations required to build the read-depth signal. These data-intensive operations can be efficiently parallelized on graphics processing units (GPUs). In this article, we present G-CNV, a GPU-based tool devised to perform the common operations required at the first two stages of the analysis pipeline. G-CNV is able to filter low-quality read sequences, to mask low-quality nucleotides, to remove adapter sequences, to remove duplicated read sequences, to map the short-reads, to resolve multiple mapping ambiguities, to build the read-depth signal, and to normalize it. G-CNV can be efficiently used as a third-party tool able to prepare data for the subsequent read-depth signal generation and analysis. Moreover, it can also be integrated in CNV detection tools to generate read-depth signals.
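The last two preparation steps named in this abstract, building the read-depth signal and normalizing it, can be illustrated with a small CPU-only sketch. This is not G-CNV's implementation or interface: the function names, the fixed-size non-overlapping windows, and the median-based normalization are all assumptions chosen to keep the example short.

```python
from statistics import median


def read_depth_signal(read_starts, genome_length, window):
    """Count mapped read start positions in fixed-size, non-overlapping windows.

    read_starts holds 0-based start coordinates of mapped reads; positions
    outside [0, genome_length) are ignored.
    """
    n_windows = (genome_length + window - 1) // window  # ceiling division
    depth = [0] * n_windows
    for pos in read_starts:
        if 0 <= pos < genome_length:
            depth[pos // window] += 1
    return depth


def normalize_by_median(depth):
    """Scale the signal so that the median window has a value of 1.0."""
    m = median(depth)
    if m == 0:
        raise ValueError("median depth is zero; cannot normalize")
    return [d / m for d in depth]


# Hypothetical toy data: the middle window holds twice as many reads.
starts = [5, 12, 18, 101, 105, 110, 115, 120, 125, 205, 215, 290]
signal = read_depth_signal(starts, genome_length=300, window=100)
ratios = normalize_by_median(signal)
print(signal, ratios)
```

Windows whose normalized value departs clearly from 1.0 are the kind of regions a read-depth method would then examine as candidate copy number changes; real pipelines also correct for factors such as GC content and mappability, which this sketch omits.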
Palma, Paolo; Zangari, Paola; Alteri, Claudia; Tchidjou, Hyppolite K; Manno, Emma Concetta; Liuzzi, Giuseppina; Perno, Carlo Federico; Rossi, Paolo; Bertoli, Ada; Bernardi, Stefania
2016-12-09
HIV genetic diversity poses major challenges for the control of viral infection by the immune system and for the identification of an effective immunotherapeutic strategy. With the present case report we underline how HIV evolution can be effectively halted by early antiretroviral treatment (eART). Few cases have supported this evidence because of the difficulty of performing amplification and sequencing analysis in patients with long-term viral suppression. Here, we report a case of limited HIV-1 viral evolution over time in a successfully early-treated child. A perinatally HIV-1 infected infant was treated within 7 weeks of age with zidovudine, lamivudine, nevirapine and lopinavir/ritonavir. At antiretroviral treatment (ART) initiation, HIV-1 viral load (VL) and CD4 percentage were >500,000 copies/ml and 35%, respectively. A plasma genotypic resistance test showed a wild-type virus. The child reached VL undetectability after 33 weeks of combination antiretroviral therapy (cART) and has since maintained a stable VL <40 copies/ml. After 116 weeks on ART we were able to perform an amplification and sequencing assay on the plasma virus. At this time VL was <40 copies/ml and CD4 percentage was 40%. Again the genotypic resistance test revealed a wild-type virus. The phylogenetic analysis performed on the HIV-1 pol sequences of the mother and the child revealed that the sequences clustered with C subtype reference strains and formed a monophyletic cluster distinct from the other C sequences included in the analysis (bootstrap value >90%). No major evolutionary divergence was detected. eART limits viral evolution, avoiding the emergence of new viral variants. This result may have important implications for host immune control and may support the challenging search for new personalized immunotherapeutic approaches to achieve a prolonged viral remission.
USDA-ARS's Scientific Manuscript database
Background: Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. Results: We describe the sequencing and assembly of...
Genome Sequence of the Yeast Clavispora lusitaniae Type Strain CBS 6936.
Durrens, Pascal; Klopp, Christophe; Biteau, Nicolas; Fitton-Ouhabi, Valérie; Dementhon, Karine; Accoceberry, Isabelle; Sherman, David J; Noël, Thierry
2017-08-03
Clavispora lusitaniae, an environmental saprophytic yeast belonging to the CTG clade of Candida, can behave occasionally as an opportunistic pathogen in humans. We report here the genome sequence of the type strain CBS 6936. Comparison with sequences of strain ATCC 42720 indicates conservation of chromosomal structure but significant nucleotide divergence. Copyright © 2017 Durrens et al.
Genome Sequence of the Yeast Clavispora lusitaniae Type Strain CBS 6936
Klopp, Christophe; Biteau, Nicolas; Fitton-Ouhabi, Valérie; Dementhon, Karine; Accoceberry, Isabelle; Sherman, David J.; Noël, Thierry
2017-01-01
ABSTRACT Clavispora lusitaniae, an environmental saprophytic yeast belonging to the CTG clade of Candida, can behave occasionally as an opportunistic pathogen in humans. We report here the genome sequence of the type strain CBS 6936. Comparison with sequences of strain ATCC 42720 indicates conservation of chromosomal structure but significant nucleotide divergence. PMID:28774979
Dojka, Michael A.; Hugenholtz, Philip; Haack, Sheridan K.; Pace, Norman R.
1998-01-01
A culture-independent molecular phylogenetic approach was used to survey constituents of microbial communities associated with an aquifer contaminated with hydrocarbons (mainly jet fuel) and chlorinated solvents undergoing intrinsic bioremediation. Samples were obtained from three redox zones: methanogenic, methanogenic-sulfate reducing, and iron or sulfate reducing. Small-subunit rRNA genes were amplified directly from aquifer material DNA by PCR with universally conserved or Bacteria- or Archaea-specific primers and were cloned. A total of 812 clones were screened by restriction fragment length polymorphisms (RFLP), approximately 50% of which were unique. All RFLP types that occurred more than once in the libraries, as well as many of the unique types, were sequenced. A total of 104 (94 bacterial and 10 archaeal) sequence types were determined. Of the 94 bacterial sequence types, 10 have no phylogenetic association with known taxonomic divisions and are phylogenetically grouped in six novel division level groups (candidate divisions WS1 to WS6); 21 belong to four recently described candidate divisions with no cultivated representatives (OP5, OP8, OP10, and OP11); and 63 are phylogenetically associated with 10 well-recognized divisions. The physiology of two particularly abundant sequence types obtained from the methanogenic zone could be inferred from their phylogenetic association with groups of microorganisms with a consistent phenotype. One of these sequence types is associated with the genus Syntrophus; Syntrophus spp. produce energy from the anaerobic oxidation of organic acids, with the production of acetate and hydrogen. The organism represented by the other sequence type is closely related to Methanosaeta spp., which are known to be capable of energy generation only through aceticlastic methanogenesis. 
We hypothesize, therefore, that the terminal step of hydrocarbon degradation in the methanogenic zone of the aquifer is aceticlastic methanogenesis and that the microorganisms represented by these two sequence types occur in syntrophic association. PMID:9758812
Dojka, M.A.; Hugenholtz, P.; Haack, S.K.; Pace, N.R.
1998-01-01
A culture-independent molecular phylogenetic approach was used to survey constituents of microbial communities associated with an aquifer contaminated with hydrocarbons (mainly jet fuel) and chlorinated solvents undergoing intrinsic bioremediation. Samples were obtained from three redox zones: methanogenic, methanogenic-sulfate reducing, and iron or sulfate reducing. Small-subunit rRNA genes were amplified directly from aquifer material DNA by PCR with universally conserved or Bacteria- or Archaea-specific primers and were cloned. A total of 812 clones were screened by restriction fragment length polymorphisms (RFLP), approximately 50% of which were unique. All RFLP types that occurred more than once in the libraries, as well as many of the unique types, were sequenced. A total of 104 (94 bacterial and 10 archaeal) sequence types were determined. Of the 94 bacterial sequence types, 10 have no phylogenetic association with known taxonomic divisions and are phylogenetically grouped in six novel division level groups (candidate divisions WS1 to WS6); 21 belong to four recently described candidate divisions with no cultivated representatives (OP5, OP8, OP10, and OP11); and 63 are phylogenetically associated with 10 well-recognized divisions. The physiology of two particularly abundant sequence types obtained from the methanogenic zone could be inferred from their phylogenetic association with groups of microorganisms with a consistent phenotype. One of these sequence types is associated with the genus Syntrophus; Syntrophus spp. produce energy from the anaerobic oxidation of organic acids, with the production of acetate and hydrogen. The organism represented by the other sequence type is closely related to Methanosaeta spp., which are known to be capable of energy generation only through aceticlastic methanogenesis. 
We hypothesize, therefore, that the terminal step of hydrocarbon degradation in the methanogenic zone of the aquifer is aceticlastic methanogenesis and that the microorganisms represented by these two sequence types occur in syntrophic association.
2014-01-01
Background Wheat glutenin polymers are made up of two main subunit types, the high- (HMW-GS) and low- (LMW-GS) molecular weight subunits. These latter are represented by heterogeneous proteins. The most common, based on the first amino acid of the mature sequence, are known as LMW-m and LMW-s types. The mature sequences differ as a consequence of three extra amino acids (MET-) at the N-terminus of LMW-m types. The nucleotide sequences of their encoding genes are, however, nearly identical, so that the relationship between gene and protein sequences is difficult to ascertain. It has been hypothesized that the presence of an asparagine residue in position 23 of the complete coding sequence for the LMW-s type might account for the observed three-residue shortened sequence, as a consequence of cleavage at the asparagine by an asparaginyl endopeptidase. Results We performed site-directed mutagenesis of a LMW-s gene to replace asparagine at position 23 with threonine and thus convert it to a candidate LMW-m type gene. Similarly, a candidate LMW-m type gene was mutated at position 23 to replace threonine with asparagine. Next, we produced transgenic durum wheat (cultivar Svevo) lines by introducing the mutated versions of the LMW-m and LMW-s genes, along with the wild type counterpart of the LMW-m gene. Proteomic comparisons between the transgenic and null segregant plants enabled identification of transgenic proteins by mass spectrometry analyses and Edman N-terminal sequencing. Conclusions Our results show that the formation of LMW-s type relies on the presence of an asparagine residue close to the N-terminus generated by signal peptide cleavage, and that LMW-GS can be quantitatively processed most likely by vacuolar asparaginyl endoproteases, suggesting that those accumulated in the vacuole are not sequestered into stable aggregates that would hinder the action of proteolytic enzymes. 
Rather, whatever is the mechanism of glutenin polymer transport to the vacuole, the proteins remain available for proteolytic processing, and can be converted to the mature form by the removal of a short N-terminal sequence. PMID:24629124
Madan, Bharat; Sokalingam, Sriram; Raghunathan, Govindan; Lee, Sun-Gu
2014-10-01
Both Type I' and Type II' β-turns have the same sense of the β-turn twist that is compatible with the β-sheet twist. They occur predominantly in two residue β-hairpins, but the occurrence of Type I' β-turns is two times higher than Type II' β-turns. This suggests that Type I' β-turns may be more stable than Type II' β-turns, and Type I' β-turn sequence and structure can be more favorable for protein folding than Type II' β-turns. Here, we redesigned the native Type II' β-turn in GFP to Type I' β-turn, and investigated its effect on protein folding and stability. The Type I' β-turns were designed based on the statistical analysis of residues in natural Type I' β-turns. The substitution of the native "GD" sequence of i+1 and i+2 residues with Type I' preferred "(N/D)G" sequence motif increased the folding rate by 50% and slightly improved the thermodynamic stability. Despite the enhancement of in vitro refolding kinetics and stability of the redesigned mutants, they showed poor soluble expression level compared to wild type. To overcome this problem, i and i + 3 residues of the designed Type I' β-turn were further engineered. The mutation of Thr to Lys at i + 3 could restore the in vivo soluble expression of the Type I' mutant. This study indicates that Type II' β-turns in natural β-hairpins can be further optimized by converting the sequence to Type I'. © 2014 Wiley Periodicals, Inc.
XML schemas for common bioinformatic data types and their application in workflow systems.
Seibel, Philipp N; Krüger, Jan; Hartmeier, Sven; Schwarzer, Knut; Löwenthal, Kai; Mersch, Henning; Dandekar, Thomas; Giegerich, Robert
2006-11-06
Today, there is a growing need in bioinformatics to combine available software tools into chains, thus building complex applications from existing single-task tools. To create such workflows, the tools involved have to be able to work with each other's data--therefore, a common set of well-defined data formats is needed. Unfortunately, current bioinformatic tools use a great variety of heterogeneous formats. Acknowledging the need for common formats, the Helmholtz Open BioInformatics Technology network (HOBIT) identified several basic data types used in bioinformatics and developed appropriate format descriptions, formally defined by XML schemas, and incorporated them in a Java library (BioDOM). These schemas currently cover sequence, sequence alignment, RNA secondary structure and RNA secondary structure alignment formats in a form that is independent of any specific program, thus enabling seamless interoperation of different tools. All XML formats are available at http://bioschemas.sourceforge.net, the BioDOM library can be obtained at http://biodom.sourceforge.net. The HOBIT XML schemas and the BioDOM library simplify adding XML support to newly created and existing bioinformatic tools, enabling these tools to interoperate seamlessly in workflow scenarios.
Dutta, Debasree; Gachhui, Ratan
2007-02-01
A few members of the family Acetobacteraceae are cellulose-producers, while only six members fix nitrogen. Bacterial strain RG3T, isolated from Kombucha tea, displays both of these characteristics. A high bootstrap value in the 16S rRNA gene sequence-based phylogenetic analysis supported the position of this strain within the genus Gluconacetobacter, with Gluconacetobacter hansenii LMG 1527T as its nearest neighbour (99.1 % sequence similarity). It could utilize ethanol, fructose, arabinose, glycerol, sorbitol and mannitol, but not galactose or xylose, as sole sources of carbon. Single amino acids such as L-alanine, L-cysteine and L-threonine served as carbon and nitrogen sources for growth of strain RG3T. Strain RG3T produced cellulose in both nitrogen-free broth and enriched medium. The ubiquinone present was Q-10 and the DNA base composition was 55.8 mol% G+C. It exhibited low values of 5.2-27.77 % DNA-DNA relatedness to the type strains of related gluconacetobacters, which placed it within a separate taxon, for which the name Gluconacetobacter kombuchae sp. nov. is proposed, with the type strain RG3T (=LMG 23726T=MTCC 6913T).
Inns, Thomas; Jombart, Thibaut; Ashton, Philip; Loman, Nicolas; Chatt, Carol; Messelhaeusser, Ute; Rabsch, Wolfgang; Simon, Sandra; Nikisins, Sergejs; Bernard, Helen; le Hello, Simon; Jourdan da-Silva, Nathalie; Kornschober, Christian; Mossong, Joel; Hawkey, Peter; de Pinna, Elizabeth; Grant, Kathie; Cleary, Paul
2016-01-01
Outbreaks of Salmonella Enteritidis have long been associated with contaminated poultry and eggs. In the summer of 2014 a large multi-national outbreak of Salmonella Enteritidis phage type 14b occurred with over 350 cases reported in the United Kingdom, Germany, Austria, France and Luxembourg. Egg supply network investigation and microbiological sampling identified the source to be a Bavarian egg producer. As part of the international investigation into the outbreak, over 400 isolates were sequenced including isolates from cases, implicated UK premises and eggs from the suspected source producer. We were able to show a clear statistical correlation between the topology of the UK egg distribution network and the phylogenetic network of outbreak isolates. This correlation can most plausibly be explained by different parts of the egg distribution network being supplied by eggs solely from independent premises of the Bavarian egg producer (Company X). Microbiological sampling from the source premises, traceback information and information on the interventions carried out at the egg production premises all supported this conclusion. The level of insight into the outbreak epidemiology provided by whole-genome sequencing (WGS) would not have been possible using traditional microbial typing methods. PMID:28348865
Neisseria oralis sp. nov., isolated from healthy gingival plaque and clinical samples
Passaretti, Teresa V.; Jose, Reashma; Cole, Jocelyn; Coorevits, An; Carpenter, Andrea N.; Jose, Sherly; Van Landschoot, Anita; Izard, Jacques; Kohlerschmidt, Donna J.; Vandamme, Peter; Dewhirst, Floyd E.; Fisher, Mark A.; Musser, Kimberlee A.
2013-01-01
A polyphasic analysis was undertaken of seven independent isolates of Gram-negative cocci collected from pathological clinical samples from New York, Louisiana, Florida and Illinois and healthy subgingival plaque from a patient in Virginia, USA. The 16S rRNA gene sequence similarity among these isolates was 99.7–100 %, and the closest species with a validly published name was Neisseria lactamica (96.9 % similarity to the type strain). DNA–DNA hybridization confirmed that these isolates are of the same species and are distinct from their nearest phylogenetic neighbour, N. lactamica. Phylogenetic analysis of 16S and 23S rRNA gene sequences indicated that the novel species belongs in the genus Neisseria. The predominant cellular fatty acids were C16 : 0, summed feature 3 (C16 : 1ω7c and/or iso-C15 : 0 2-OH) and C18 : 1ω7c. The cellular fatty acid profile, together with other phenotypic characters, further supports the inclusion of the novel species in the genus Neisseria. The name Neisseria oralis sp. nov. (type strain 6332T = DSM 25276T = LMG 26725T) is proposed. PMID:22798652
Costimulatory receptors in jawed vertebrates: Conserved CD28, odd CTLA4 and multiple BTLAs
Bernard, D.; Hansen, J.D.; Du Pasquier, L.; Lefranc, M.-P.; Benmansour, A.; Boudinot, P.
2007-01-01
The CD28 family of costimulatory receptors comprises molecules with a single V-type extracellular Ig domain, a transmembrane region and an intracytoplasmic region with signaling motifs. CD28 and cytotoxic T lymphocyte antigen-4 (CTLA4) homologs have been recently identified in rainbow trout. Other sequences similar to mammalian CD28 family members have now been identified using teleost, Xenopus and chicken databases. CD28 and CTLA4 homologs were found in all vertebrate classes, whereas inducible costimulatory signal (ICOS) was restricted to tetrapods, and programmed cell death-1 (PD1) was limited to mammals and chicken. Multiple B and T Lymphocyte Attenuator (BTLA) sequences were found in teleosts, but not in Xenopus or in avian genomes. The intron/exon structure of btlas was different from that of cd28 and other members of the family. The Ig domain encoded in all the btla genes has features of the C-type structure, which suggests that BTLA does not belong to the CD28 family. The genomic localization of these genes in vertebrate genomes supports the split between the BTLA and CD28 families. © 2006 Elsevier Ltd. All rights reserved.
Eye movements reflect and shape strategies in fraction comparison.
Ischebeck, Anja; Weilharter, Marina; Körner, Christof
2016-01-01
The comparison of fractions is a difficult task that can often be facilitated by separately comparing components (numerators and denominators) of the fractions--that is, by applying so-called component-based strategies. The usefulness of such strategies depends on the type of fraction pair to be compared. We investigated the temporal organization and the flexibility of strategy deployment in fraction comparison by evaluating sequences of eye movements in 20 young adults. We found that component-based strategies could account for the response times and the overall number of fixations observed for the different fraction pairs. The analysis of eye movement sequences showed that the initial eye movements in a trial were characterized by stereotypical scanning patterns indicative of an exploratory phase that served to establish the kind of fraction pair presented. Eye movements that followed this phase adapted to the particular type of fraction pair and indicated the deployment of specific comparison strategies. These results demonstrate that participants employ eye movements systematically to support strategy use in fraction comparison. Participants showed a remarkable flexibility to adapt to the most efficient strategy on a trial-by-trial basis. Our results confirm the value of eye movement measurements in the exploration of strategic adaptation in complex tasks.
Dallman, Tim; Inns, Thomas; Jombart, Thibaut; Ashton, Philip; Loman, Nicolas; Chatt, Carol; Messelhaeusser, Ute; Rabsch, Wolfgang; Simon, Sandra; Nikisins, Sergejs; Bernard, Helen; le Hello, Simon; Jourdan da-Silva, Nathalie; Kornschober, Christian; Mossong, Joel; Hawkey, Peter; de Pinna, Elizabeth; Grant, Kathie; Cleary, Paul
2016-08-01
Outbreaks of Salmonella Enteritidis have long been associated with contaminated poultry and eggs. In the summer of 2014 a large multi-national outbreak of Salmonella Enteritidis phage type 14b occurred with over 350 cases reported in the United Kingdom, Germany, Austria, France and Luxembourg. Egg supply network investigation and microbiological sampling identified the source to be a Bavarian egg producer. As part of the international investigation into the outbreak, over 400 isolates were sequenced including isolates from cases, implicated UK premises and eggs from the suspected source producer. We were able to show a clear statistical correlation between the topology of the UK egg distribution network and the phylogenetic network of outbreak isolates. This correlation can most plausibly be explained by different parts of the egg distribution network being supplied by eggs solely from independent premises of the Bavarian egg producer (Company X). Microbiological sampling from the source premises, traceback information and information on the interventions carried out at the egg production premises all supported this conclusion. The level of insight into the outbreak epidemiology provided by whole-genome sequencing (WGS) would not have been possible using traditional microbial typing methods.
Gharout-Sait, Alima; Touati, Abdelaziz; Guillard, Thomas; Brasme, Lucien; de Champs, Christophe
2015-01-01
In this study, 922 consecutive non-duplicate clinical isolates of Enterobacteriaceae obtained from hospitalized and non-hospitalized patients at Bejaia, Algeria were analyzed for AmpC-type β-lactamases production. The ampC genes and their genetic environment were characterized using polymerase chain reaction (PCR) and sequencing. Plasmid incompatibility groups were determined by using PCR-based replicon typing. Phylogenetic grouping and multilocus sequence typing were determined for molecular typing of the plasmid-mediated AmpC (pAmpC) isolates. Of the isolates, 15 (1.6%) were identified as AmpC producers including 14 CMY-4-producing isolates and one DHA-1-producing Klebsiella pneumoniae. All AmpC-producing isolates co-expressed the broad-spectrum TEM-1 β-lactamase and three of them co-produced CTX-M and/or SHV-12 ESBL. Phylogenetic grouping and virulence genotyping of the E. coli isolates revealed that most of them belonged to groups D and B1. Multilocus sequence typing analysis of K. pneumoniae isolates identified four different sequence types (STs) with two new sequences: ST1617 and ST1618. Plasmid replicon typing indicates that blaCMY-4 gene was located on broad host range A/C plasmid, while LVPK replicon was associated with blaDHA-1. All isolates carrying blaCMY-4 displayed the transposon-like structures ISEcp1/ΔISEcp1-blaCMY-blc-sugE. Our study showed that CMY-4 was the main pAmpC in the Enterobacteriaceae isolates in Algeria. Copyright © 2015 Elsevier Editora Ltda. All rights reserved.
Xu, Jingting; Hu, Hong; Dai, Yang
The identification of enhancers is a challenging task. Various types of epigenetic information including histone modification have been utilized in the construction of enhancer prediction models based on a diverse panel of machine learning schemes. However, DNA methylation profiles generated from whole genome bisulfite sequencing (WGBS) have not been fully explored for their potential in enhancer prediction despite the fact that low methylated regions (LMRs) have been implicated as distal active regulatory regions. In this work, we propose a prediction framework, LMethyR-SVM, using LMRs identified from cell-type-specific WGBS DNA methylation profiles and a weighted support vector machine learning framework. In LMethyR-SVM, the set of cell-type-specific LMRs is further divided into three sets: reliable positive, likely positive and likely negative, according to their resemblance to a small set of experimentally validated enhancers in the VISTA database based on an estimated non-parametric density distribution. Then, the prediction model is obtained by solving a weighted support vector machine. We demonstrate the performance of LMethyR-SVM by using the WGBS DNA methylation profiles derived from the human embryonic stem cell type (H1) and the fetal lung fibroblast cell type (IMR90). The predicted enhancers are highly conserved with a reasonable validation rate based on a set of commonly used positive markers including transcription factors, p300 binding and DNase-I hypersensitive sites. In addition, we show evidence that a large fraction of the LMethyR-SVM predicted enhancers are not predicted by ChromHMM in the H1 cell type and that they are more enriched for the FANTOM5 enhancers. Our work suggests that low methylated regions detected from the WGBS data are useful as complementary resources to histone modification marks in developing models for the prediction of cell-type-specific enhancers.
Spontaneous Generation of Infectious Prion Disease in Transgenic Mice
Castilla, Joaquín; Pintado, Belén; Gutiérrez-Adan, Alfonso; Andréoletti, Olivier; Aguilar-Calvo, Patricia; Arroba, Ana-Isabel; Parra-Arrondo, Beatriz; Ferrer, Isidro; Manzanares, Jorge; Espinosa, Juan-Carlos
2013-01-01
We generated transgenic mice expressing bovine cellular prion protein (PrPC) with a leucine substitution at codon 113 (113L). This protein is homologous to human protein with mutation 102L, and its genetic link with Gerstmann–Sträussler–Scheinker syndrome has been established. This mutation in bovine PrPC causes a fully penetrant, lethal, spongiform encephalopathy. This genetic disease was transmitted by intracerebral inoculation of brain homogenate from ill mice expressing mutant bovine PrP to mice expressing wild-type bovine PrP, which indicated de novo generation of infectious prions. Our findings demonstrate that a single amino acid change in the PrPC sequence can induce spontaneous generation of an infectious prion disease that differs from all others identified in hosts expressing the same PrPC sequence. These observations support the view that a variety of infectious prion strains might spontaneously emerge in hosts displaying random genetic PrPC mutations. PMID:24274622
Casein expression in cytotoxic T lymphocytes.
Grusby, M J; Mitchell, S C; Nabavi, N; Glimcher, L H
1990-01-01
A cDNA that expresses an mRNA restricted to cytotoxic T lymphocytes (CTL) and mammary tissue has been isolated and characterized. The deduced amino acid sequence from this cDNA shows extensive homology with the previously reported amino acid sequence for rat alpha-casein. Indeed, the presence of a six-residue-repeated motif that is specific for rodent alpha-caseins strongly supports the identification of this cDNA as mouse alpha-casein. Northern (RNA) blot analysis of many hematopoietic cell types revealed that this gene is restricted to CTL, being expressed in four of six CTL lines examined. Furthermore, CTL that express this gene were also found to express other members of the casein gene family, such as beta- and kappa-casein. These results suggest that caseins may be important in CTL function, and their potential role in CTL-mediated lysis is discussed. PMID:2395885
GfaPy: a flexible and extensible software library for handling sequence graphs in Python.
Gonnella, Giorgio; Kurtz, Stefan
2017-10-01
GFA 1 and GFA 2 are recently defined formats for representing sequence graphs, such as assembly, variation or splicing graphs. The formats are adopted by several software tools. Here, we present GfaPy, a software package for creating, parsing and editing GFA graphs using the programming language Python. GfaPy supports GFA 1 and GFA 2, using the same interface and allows for interconversion between both formats. The software package provides a simple interface for custom record types, which is an important new feature of GFA 2 (compared to GFA 1). This enables new applications of the format. GfaPy is available open source at https://github.com/ggonnella/gfapy and installable via pip. Contact: gonnella@zbh.uni-hamburg.de. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
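To make concrete what a GFA 1 sequence graph looks like, the following Python sketch reads the segment (S) and link (L) lines of a small GFA 1 text. It deliberately does not use GfaPy and does not reproduce its interface: `parse_gfa1` and `EXAMPLE` are hypothetical names, and only the tab-separated S and L record layouts of GFA 1 are assumed. Optional tags, other record types and validation are ignored, so the GfaPy package itself is the appropriate tool for real work.

```python
def parse_gfa1(text):
    """Collect segments and links from GFA 1 text (minimal, no validation).

    Returns (segments, links):
      segments maps segment name -> sequence string
      links is a list of (from, from_orient, to, to_orient, overlap) tuples
    """
    segments = {}
    links = []
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        if fields[0] == "S":
            # S <name> <sequence> [optional tags...]
            segments[fields[1]] = fields[2]
        elif fields[0] == "L":
            # L <from> <from_orient> <to> <to_orient> <overlap CIGAR> [optional tags...]
            links.append(tuple(fields[1:6]))
        # Other record types (H, C, P, ...) are ignored in this sketch.
    return segments, links


# A two-segment graph with one link from segment 1 (+) to segment 2 (-).
EXAMPLE = "H\tVN:Z:1.0\nS\t1\tACGT\nS\t2\tGTCA\nL\t1\t+\t2\t-\t2M\n"
segments, links = parse_gfa1(EXAMPLE)
```

Returning plain dictionaries and tuples keeps the sketch short; a full library needs typed records so that GFA 1 and GFA 2 lines can be edited and converted consistently.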
Marciniak, R A; Garcia-Blanco, M A; Sharp, P A
1990-01-01
Human immunodeficiency virus type 1 RNAs contain a sequence, the trans-activation-response (TAR) element, which is required for tat protein-mediated trans-activation of viral gene expression. We have identified a nuclear protein from extracts of HeLa cells that binds to the TAR element RNA in a sequence-specific manner. The binding of this 68-kDa polypeptide was detected by UV cross-linking proteins to TAR element RNA transcribed in vitro. Competition experiments were performed by using a partially purified preparation of the protein to quantify the relative binding affinities of TAR element RNA mutants. The binding affinity of the TAR mutants paralleled the reported ability of those mutants to support tat trans-activation in vivo. We propose that this cellular protein moderates TAR activity in vivo. PMID:2333305
Integration, warehousing, and analysis strategies of Omics data.
Gedela, Srinubabu
2011-01-01
"-Omics" is a current suffix for numerous types of large-scale biological data generation procedures, which naturally demand the development of novel algorithms for data storage and analysis. With next generation genome sequencing burgeoning, it is pivotal to decipher a coding site on the genome, a gene's function, and information on transcripts next to the pure availability of sequence information. To explore a genome and downstream molecular processes, we need umpteen results at the various levels of cellular organization by utilizing different experimental designs, data analysis strategies and methodologies. Here comes the need for controlled vocabularies and data integration to annotate, store, and update the flow of experimental data. This chapter explores key methodologies to merge Omics data by semantic data carriers, discusses controlled vocabularies as eXtensible Markup Languages (XML), and provides practical guidance, databases, and software links supporting the integration of Omics data.
A Data Type for Efficient Representation of Other Data Types
NASA Technical Reports Server (NTRS)
James, Mark
2008-01-01
A self-organizing, monomorphic data type denoted a sequence has been conceived to address certain concerns that arise in programming parallel computers. A sequence in the present sense can be regarded abstractly as a vector, set, bag, queue, or other construct. Heretofore, in programming a parallel computer, it has been necessary for the programmer to state explicitly, at the outset, what parts of the program and the underlying data structures must be represented in parallel form. Not only is this requirement not optimal from the perspective of implementation; it entails an additional requirement that the programmer have intimate understanding of the underlying parallel structure. The present sequence data type overcomes both the implementation and parallel structure obstacles. In so doing, the sequence data type provides unified means by which the programmer can represent a data structure for natural and automatic decomposition to a parallel computing architecture. Sequences exhibit the behavioral and structural characteristics of vectors, but the underlying representations are automatically synthesized from combinations of programmers' advice and execution use metrics. Sequences can vary bidirectionally between sparseness and density, making them excellent choices for many kinds of algorithms. The novelty and benefit of this behavior lies in the fact that it can relieve programmers of the details of implementations. The creation of a sequence enables decoupling of a conceptual representation from an implementation. The underlying representation of a sequence is a hybrid of representations composed of vectors, linked lists, connected blocks, and hash tables. The internal structure of a sequence can automatically change from time to time on the basis of how it is being used. Those portions of a sequence where elements have not been added or removed can be as efficient as vectors.
As elements are inserted and removed in a given portion, different methods are used to provide an access and memory strategy that is optimized for that portion and the use to which it is put.
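The idea of a container whose internal representation changes with use can be sketched in Python as follows. This is only an illustration under strong simplifying assumptions, not the data type described in the report: the class name `AdaptiveSequence`, the fill-ratio thresholds and the choice of just two whole-container representations (a list and a dict) are all hypothetical, whereas the report describes a per-portion hybrid of vectors, linked lists, connected blocks and hash tables.

```python
class AdaptiveSequence:
    """Fixed-length, vector-like container that switches between a sparse
    (dict) and a dense (list) representation based on how full it is."""

    def __init__(self, length, default=0, dense_threshold=0.5):
        self._length = length
        self._default = default
        self._threshold = dense_threshold  # hypothetical tuning parameter
        self._sparse = {}   # index -> value, used while mostly empty
        self._dense = None  # list of values, used while mostly full

    @property
    def representation(self):
        return "dense" if self._dense is not None else "sparse"

    def __len__(self):
        return self._length

    def _check(self, i):
        if not 0 <= i < self._length:
            raise IndexError(i)

    def __getitem__(self, i):
        self._check(i)
        if self._dense is not None:
            return self._dense[i]
        return self._sparse.get(i, self._default)

    def __setitem__(self, i, value):
        self._check(i)
        if self._dense is not None:
            self._dense[i] = value
        elif value == self._default:
            self._sparse.pop(i, None)  # default values are not stored sparsely
        else:
            self._sparse[i] = value
        self._rebalance()

    def _fill(self):
        # Fraction of non-default elements; an O(n) scan when dense, kept for brevity.
        if self._dense is not None:
            return sum(v != self._default for v in self._dense) / self._length
        return len(self._sparse) / self._length

    def _rebalance(self):
        fill = self._fill()
        if self._dense is None and fill > self._threshold:
            # Mostly full: switch to a contiguous list.
            self._dense = [self._sparse.get(i, self._default) for i in range(self._length)]
            self._sparse = {}
        elif self._dense is not None and fill < self._threshold / 2:
            # Mostly empty again: switch back to a dict.
            self._sparse = {i: v for i, v in enumerate(self._dense) if v != self._default}
            self._dense = None
```

The two different thresholds (switch to dense above the threshold, back to sparse below half of it) are a simple hysteresis so that the container does not flip representation on every update near the boundary. Callers only use indexing, so the representation stays an implementation detail, which is the decoupling the abstract describes.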
Novel nitrogen-fixing Acetobacter nitrogenifigens sp. nov., isolated from Kombucha tea.
Dutta, Debasree; Gachhui, Ratan
2006-08-01
The four nitrogen-fixing bacteria so far described in the family Acetobacteraceae belong to the genera Gluconacetobacter and Acetobacter. Nitrogen-fixing bacterial strain RG1(T) was isolated from Kombucha tea and, based on the phylogenetic analysis of the 16S rRNA gene sequence, which is supported by a high bootstrap value, was found to belong to the genus Acetobacter. Strain RG1(T) differed from Acetobacter aceti, the nearest member with a 16S rRNA gene sequence similarity of 98.2 %, and type strains of other Acetobacter species with regard to several characteristics of growth features in culture media, growth in nitrogen-free medium, production of gamma-pyrone from glucose and dihydroxyacetone from glycerol. Strain RG1(T) utilized maltose, glycerol, sorbitol, fructose, galactose, arabinose and ethanol, but not methanol as a carbon source. These results, along with electrophoretic mobility patterns of nine metabolic enzymes, suggest that strain RG1(T) represents a novel nitrogen-fixing species. The ubiquinone present was Q-9 and DNA G+C content was 64.1 mol%. Strain RG1(T) exhibited a low value of 2-24 % DNA-DNA relatedness to the type strains of related acetobacters, which placed it as a separate taxon. On the basis of these data, the name Acetobacter nitrogenifigens sp. nov. is proposed, with the type strain RG1(T) (=MTCC 6912(T)=LMG 23498(T)).
Blocked recombinase polymerase amplification for mutation analysis of PIK3CA gene.
Martorell, Sara; Palanca, Sarai; Maquieira, Ángel; Tortajada-Genaro, Luis A
2018-03-01
A blocked recombinase polymerase amplification (blocked-RPA) approach has been developed for the enrichment of mutated templates in heterogeneous specimens such as tumor tissues. This isothermal amplification technique opens alternative solutions for meeting the technological demand of physician office laboratories. Herein, the detection of mutations in the PIK3CA gene, such as p.E545K and p.H1047L, is presented. The main element was an oligonucleotide (functionalized with dideoxycytidine at the 3'-end) that matched the wild-type sequence in the target locus. The amplification was performed at 37 °C for 40 min. The results demonstrated that the competition between the upstream primer and the blocker reduced the percentage of amplified wild-type allele, making the mutation present easier to detect. For mutation discrimination, a fast hybridization assay was performed in microarray format on a plastic chip with colorimetric detection. This approach enabled the reliable discrimination of specific mutations against a background of up to 95% wild-type DNA. The applicability of the method, based on the combination of blocked-RPA and low-cost chip hybridization, was successfully proven for the genotyping of various cancer cell lines as well as tumor tissues. The assignments agreed with those provided by next-generation sequencing. Therefore, these investigations would support a personalized approach to patient care based on the molecular signature of human cancers. Copyright © 2017 Elsevier Inc. All rights reserved.
Structure-Specific Ribonucleases for MS-Based Elucidation of Higher-Order RNA Structure
NASA Astrophysics Data System (ADS)
Scalabrin, Matteo; Siu, Yik; Asare-Okai, Papa Nii; Fabris, Daniele
2014-07-01
Supported by high-throughput sequencing technologies, structure-specific nucleases are experiencing a renaissance as biochemical probes for genome-wide mapping of nucleic acid structure. This report explores the benefits and pitfalls of the application of Mung bean (Mb) and V1 nuclease, which attack specifically single- and double-stranded regions of nucleic acids, as possible structural probes to be employed in combination with MS detection. Both enzymes were found capable of operating in ammonium-based solutions that are preferred for high-resolution analysis by direct infusion electrospray ionization (ESI). Sequence analysis by tandem mass spectrometry (MS/MS) was performed to confirm mapping assignments and to resolve possible ambiguities arising from the concomitant formation of isobaric products with identical base composition and different sequences. The observed products grouped together into ladder-type series that facilitated their assignment to unique regions of the substrate, but revealed also a certain level of uncertainty in identifying the boundaries between paired and unpaired regions. Various experimental factors that are known to stabilize nucleic acid structure, such as higher ionic strength, presence of Mg(II), etc., increased the accuracy of cleavage information, but did not completely eliminate deviations from expected results. These observations suggest extreme caution in interpreting the results afforded by these types of reagents. Regardless of the analytical platform of choice, the results highlighted the need to repeat probing experiments under the most diverse possible conditions to recognize potential artifacts and to increase the level of confidence in the observed structural information.
Gcebe, Nomakorinte; Rutten, Victor P M G; van Pittius, Nicolaas Gey; Naicker, Brendon; Michel, Anita L
2018-05-01
Some species of non-tuberculous mycobacteria (NTM) have been reported to be opportunistic pathogens of animals and humans. Recently there has been an upsurge in the number of cases of NTM infections, such that some NTM species are now recognized as pathogens of humans and animals. From a veterinary point of view, the major significance of NTM is the cross-reactive immune response they elicit against Mycobacterium bovis antigens, leading to misdiagnosis of bovine tuberculosis. Four NTM isolates were detected from a bovine nasal swab, soil and water, during an NTM survey in South Africa. These were all found using 16S rRNA gene sequence analysis to be closely related to Mycobacterium moriokaense. The isolates were further characterised by sequence analysis of the partial fragments of hsp65, rpoB and sodA. The genome of the type strain was also elucidated. Gene (16S rRNA, hsp65, rpoB and sodA) and protein sequence data analysis of 6 kDa early secretory antigenic target (ESAT 6) and 10 kDa culture filtrate protein (CFP-10) revealed that these isolates belong to a unique Mycobacterium species. Differences in phenotypic and biochemical traits between the isolates and closely related species further supported that these isolates belong to a novel Mycobacterium species. We propose the name Mycobacterium komaniense sp. nov. for this new species. The type strain is GPK 1020T (=CIP 110823T=ATCC BAA-2758).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Heringer, E.; Kerkwijk, M. H. van; Sim, S. A.
2017-09-01
Type Ia supernovae (SNe Ia) are generally agreed to arise from thermonuclear explosions of carbon–oxygen white dwarfs. The actual path to explosion, however, remains elusive, with numerous plausible parent systems and explosion mechanisms suggested. Observationally, SNe Ia have multiple subclasses, distinguished by their light curves and spectra. This raises the question of whether these indicate that multiple mechanisms occur in nature or that explosions have a large but continuous range of physical properties. We revisit the idea that normal and 91bg-like SNe can be understood as part of a spectral sequence in which changes in temperature dominate. Specifically, we find that a single ejecta structure is sufficient to provide reasonable fits of both the normal SN Ia SN 2011fe and the 91bg-like SN 2005bl, provided that the luminosity and thus temperature of the ejecta are adjusted appropriately. This suggests that the outer layers of the ejecta are similar, thus providing some support for a common explosion mechanism. Our spectral sequence also helps to shed light on the conditions under which carbon can be detected in premaximum SN Ia spectra: we find that emission from iron can “fill in” the carbon trough in cool SNe Ia. This may indicate that the outer layers of the ejecta of events in which carbon is detected are relatively metal-poor compared to events in which carbon is not detected.